Abey_Resume_Template
TECHNICAL SKILLS
Languages: Python, SQL, PowerShell, Shell Scripting, C, C++
Databases: SQL Server, PostgreSQL, MySQL, SQLite
Frameworks: Spark, Pandas, Pytest, Unittest, Scikit-learn, Git, Docker, Scrapy
Technologies: PySpark, GitHub, Azure Databricks, AWS Lambda, Glue, S3, ECR, EC2, Postman
EXPERIENCE
Circulant Solutions July 2022 – Present
Associate Data Engineer
• Improved SQL query generation by 90% by integrating the ChatGPT API with the database using Python.
• Built a bioinformatics NGS workflow, ensuring validated ingestion of big data into S3 and its smooth
processing through AWS EFS for report generation and output archival, saving Databricks cost by 15%.
• Developed a streamlined pipeline that integrates shell triggers for SFTP, Databricks flow monitoring,
DynamoDB metadata validation, and view creation in Redshift, reducing error handling time by 80%.
• Automated a daily Power BI dashboard refresh flow using PySpark, reducing manual effort by 98%.
• Reduced claims data ingestion time by 75% using PySpark on Databricks to read from Azure Blob Storage.
• Automated backend folder-permission management with PowerShell, reducing manual time by 98%.
• Slashed data load time by 60% with an ML pipeline using Docker, S3, Lambda and ECR. Boosted
patient selection accuracy by 90% with an ML model’s feature selection, model training and prediction.
• Automated a basic ETL process, reducing manual data loading by 75% and ensuring efficient clinical data ingestion.
• Proposed a POC to automate user-provisioning CRUD tasks for fast creation, update, and deletion of users.
Cognizant April 2022 – June 2022
Programmer Analyst Trainee
• Played a key role in architecting a simple ETL pipeline, harnessing the capabilities of Python and Spark for
efficient data extraction, transformation, and loading. Actively involved in optimizing pipeline performance
and ensuring seamless integration with existing systems.
Avvashya CCI Logistics Ltd August 2021 – February 2022
Solutions Design Graduate Engineer
• Optimized cost data solutions by tracking productivity and managing procurement data.
PROJECTS
Circulant Solutions July 2022 – Present
Associate Data Engineer
• Built a bioinformatics NGS workflow involving validated ingestion of big data to S3 and its smooth processing
through AWS EFS for report generation and output archival, which helped save Databricks cost by 15%.
ACHIEVEMENTS
• Received a Spot Award for a development project featuring rigorous unit testing.
• Achieved Client Appreciation for ETL automation by reducing the manual verification of data inconsistencies.
EDUCATION
Indian Institute of Technology Madras January 2024 – Present
Diploma in Data Science
Mumbai University, Dwarkadas J. Sanghvi College of Engineering 2017 – 2021
Bachelor of Engineering in Mechanical Engineering CGPA - 9.22/10
CERTIFICATIONS
• Lakehouse Fundamentals - Databricks
• Fundamentals of Generative AI - Databricks
• Machine Learning with Apache Spark - Coursera