
About Me
With a wealth of experience, I bring expertise as a Data Engineer proficient in crafting scalable data infrastructure, excelling in ETL processes, and optimizing data pipelines. Leveraging a dynamic skill set that includes Python and Apache Spark, I ensure the meticulous extraction, transformation, and loading of extensive datasets. My commitment as a quick learner aligns with the goal of delivering top-tier data solutions, facilitating informed decision-making within the fast-paced dynamics of professional environments.
Work Experience
Data Enginner at
Edgematics
Solutions

July, 2023 - Present
As a Data Engineer within the company, my role revolves around the
comprehensive management of
data. This entails the meticulous cleaning and transformation of raw data into curated formats
leveraging PySpark, facilitating the creation of Dashboards by Data Analysts and
enabling Data Scientists to conduct predictive modeling with increased accuracy. My responsibilities
contribute to the seamless flow of high-quality data, optimizing the analytical processes and
supporting data-driven decision-making across the organization.
Associate Software Engineer at
AI Hawks Software Solutions
pvt Ltd.

December, 2022 - June, 2023
This position necessitated versatility, compelling me to engage with a
diverse range of
technologies. Consequently, I developed proficient skills in Python, utilizing
Django as a primary framework. Additionally, I gained experience in
JavaScript, particularly within the context of the NodeJS framework.
This multifaceted exposure has honed my ability to navigate and contribute effectively to projects
involving varied technological stacks.
Data Science intern at
Corizo

Sept, 2022 - Nov, 2022
- Gained foundational knowledge in Python, SQL, Statistics, and Machine Learning algorithms.
- Developed hands-on experience with Machine Learning, completing a project on Flight Price Prediction using classical machine learning techniques and boosting methods.
- Conducted Sentiment Analysis using natural language processing (NLP) tools like NLTK, Spacy, and Scikit-Learn.
Professional Skills
Languages
Python
PySpark
SQL
JavaScript
DataBase
MySQL
MongoDB
Snowflake
Amazon S3
Experties
Machine Learning
Deep Learning
PyTorch
Tensorflow/Keras
Framework Knowledge
Hadoop
DataBricks
Apache Spark
Django & ReactJS
Projects
Real-time Squat Counter
Ai Hawks
Developed a Machine Learning model for Squat counting using MediaPipe and OpenCV
libraries, with a user-friendly
interface created in Tkinter. Leveraging MediaPipe's pose estimation models in real-time video streams.
OpenCV is employed for video processing tasks,
such as frame extraction and preprocessing. The application provides accurate feedback on squat counts,
enhancing workout performance monitoring and analysis.
Real-time Squat Counter
Ai Hawks
Developed a Machine Learning model for Squat counting using MediaPipe and OpenCV
libraries, with a user-friendly
interface created in Tkinter. Leveraging MediaPipe's pose estimation models in real-time video streams.
OpenCV is employed for video processing tasks,
such as frame extraction and preprocessing. The application provides accurate feedback on squat counts,
enhancing workout performance monitoring and analysis.
Automated Dashboard
Ai Hawks
Developed an automated software utilizing Google API, Selenium, and Openpyxl to scrape values
from Google Sheets across various branch folders. The software then updates the main report sheet
located in a different folder. Additionally, created a user-friendly GUI using Tkinter for conveniently
updating the sheet code on a yearly basis.
Social Sentiment Analysis
Ai Hawks
Developed an advanced sentiment analysis system tailored for social media content,
employing cutting-edge natural language processing (NLP) techniques.
Leveraging machine learning models and sophisticated NLP algorithms,
the system accurately interprets and analyzes the sentiment expressed in social media posts.
Audio Transcription Model with OpenAI and Streamlit
Ai Hawks
Designed and implemented a transcription model utilizing OpenAI's Whisper technology for efficient and
accurate audio transcription. Integrated with Streamlit as the frontend interface, the system offers a
user-friendly platform for audio file input and real-time transcription output. The model ensures high-quality transcription
results across various audio/video formats and languages, enhancing productivity and accessibility in transcription tasks.
Capstone Project 1
Imarticus learning
Completed a diploma capstone project on flight price prediction.
Conducted meticulous data cleaning and in-depth exploratory analysis, employing various ML algorithms to
build accurate predictive models. Selected the best-performing model through rigorous evaluation, refining skills in
data preprocessing, ML modeling, and evaluation. The best-performing model was selected based on training and
test data performance.
Notebook
Capstone Project 2
Imarticus learning
Undertook a deep learning capstone project during my diploma to gain practical experience with Neural networks.
Utilized PyTorch to develop a model for breast cancer detection, focusing on data preprocessing, network design, training, and evaluation. This project aimed to provide hands-on experience and enhance understanding of deep learning principles, particularly in medical data analysis and tumor classification.
Notebook
Certificates
The module covers SQL programming for data querying and
Python for data
science tasks, including Visualization. It introduces Statistics, Machine learning
concepts , and
explores data visualization using Tableau and Power BI, emphasizing effective communication of
insights to stakeholders.
Gain an immersive understanding of the practices and processes employed by a junior or associate
data analyst in their daily responsibilities. Learn to proficiently clean and organize data for
analysis, perform calculations using spreadsheets, SQL, and R programming, and master the art of
visualizing and presenting data findings through dashboards, presentations, and common visualization
platforms.
Completed "Apache Spark 3 Essentials" course, mastering core
architecture, distributed data processing, and Spark SQL for structured querying. Hands-on
experience in optimizing Spark applications and utilizing advanced features like machine learning
and graph processing libraries. Enhances proficiency in big data analytics.
Completed a "Web Development" course on Internshala, acquiring skills in
HTML, CSS, Bootstrap, and ReactJS for frontend development. Additionally, gained proficiency in PHP
for backend development. This training enhances my capability in full-stack web development.
Education
Masters in
Data Science
from
CHRIST (Deemed to be University)

2020 - 2022
- Gained deep expertise in Mathematical Foundations, Probability and Statistics, Data Science Principles, and Research Methodology.
- Hands-on programming in Python and R for data analysis, along with exposure to Database Technologies.
- Developed proficiency in Machine Learning, Regression Analysis, Multivariate Analysis, and Deep Learning.
- Studied specialized topics such as Neural Networks, Time Series Analysis, and Natural Language Processing (NLP).
- Completed an Industry Project, applying theoretical concepts to real-world data challenges.
M.B.S.Collage of Engineering & Technology
affiliation with University of Jammu
2016 - 2020
During my mechanical engineering studies, I not only focused on core
engineering subjects but also explored programming languages like C and Java. This exposure sparked my
interest in coding, giving me a basic understanding of software development. Studying these
programming languages not only expanded my skills but also allowed me to explore the connection
between mechanical engineering and computer science. This experience led to a strong interest in
coding and its various applications, encouraging me to approach problem-solving with a
multidisciplinary mindset.
Senior High Secondary Kendriya vidyalaya
Bantalab
2014 - 2016
Completed Senior High Secondary with a focus on both Medical and
Non-Medical subjects from the
Central Board of Secondary Education (CBSE).
Contact
Address
East Sapphire Flat no 302 E/F, The Nest, Sector 45, Goutam Buddha
Nagar, Noida, 201031
Phone
+91-9149869687
ambardarviresh18@gmail.com