About
I am currently pursuing a graduate degree in Data Science at George Washington University(expected graduation - May 2025). I take pleasure in collaborating with like-minded individuals who share similar interests, as it allows for the development of both personal and professional skills within the peer. I am looking for a dynamic role that challenges me to utilize my Software Engineering skills, providing chances for professional growth, exciting experiences, and personal development.
- Programming languages: R, Python, and C.
- Database: SQL, MongoDB, and Neo4j (Graph Database).
- Machine Learning Algorithms: Linear/Logistic/Lasso/Ridge Regression, Decision Trees, Naive Bayes, KNN, Random Forest, Stacking, SVM, XGBM, Bagging Methods, Cascading Classifiers.
- Data Mining: PCA, t-SNE, Recommendation Systems & Matrix Factorization, and Clustering - K Means, Hierarchical, DBSCAN.
- Time series analysis/Forecasting: AR, ARMA, ARIMA, and SARIMA.
- Deep Learning: Artificial Neural Networks, Convolutional Neural Networks, Multilayer Perceptron, Long Short Term Memory(LSTM), Recurrent Neural Network(RNN) and Generative adversarial networks(GAN).
- Product Development: Agile Methodology, Product Life Cycle, JIRA for Ticketing, Git, GitHub.
- Others: Tableau, Flask, AWS EC-2, Streamlit and Heroku.
Experience
- Processed and cleaned 28,000 articles to analyze customer behaviour, utilizing Databricks for data ingestion, HTML parsing and data cleaning.
- Developed a dynamic labelling system using LLMs, iteratively generating new labels for articles based on previous batches, resulting in a 90% reduction in manual labelling efforts.
- Conducted linguistic validation on 500 randomly selected articles to ensure accurate labelling and continuously improved labelling accuracy by refining LLM prompts.
- Automated customer query resolution by fine-tuning a T5 model on 34,000 customer queries and resolutions, applying Parameter Efficient Fine Tuning(PEFT) to optimize model performance.
- Optimized a query resolution model, leading to a 60% reduction in the customer care team's workload by improving response accuracy and efficiency.
- Engaged in ongoing work to generate personalized marketing messages for 10 million customers, leveraging customer attributes to deliver highly targeted content and enhance marketing effectiveness.
- Worked on Advanced Driving Assistance Systems and developed products like Emergency Brake Assist, and Rear Pre-Crash Predict. Major products: Volkswagen ID Buzz and Mercedes Benz Sprinter Van.
- Developed algorithm using C. Implemented automation using Python scripting.
- Provided problem-solving solutions to customer-reported problems in the simulation environment.
- Delivered better performance with just 2 false positives per 10,000 kilometers, optimizing key performance indicators.
- Skills learnt: Development in C, Python for scripting, Git, GTest for Testing, QAC for Quality, JIRA for ticketing, Product Development, Agile Methodologies, ASPICE.
- Collaborated with a dynamic team to conduct in-depth data analysis utilizing Python and Tableau, providing valuable insights into client's sales data. Analyzed user behavior, temporal trends, and distinctions between free and paid users.
- Formulated data-driven recommendations and compelling narratives and communicated to our client
- Skills learnt:Python, Data Analysis, ML Modelling, Flask.
- Worked on validation and verification process standards in avionics hardware.
- Collaborating with different teams and Reviewing standards of all the Validation and verification processes.
- Skills learnt: DO-254 and DAL-C Certification of Hardware.
Projects
User-friendly software product designed to democratize machine learning.
- Tools: Python, Flask, HTML and CSS.
- Leading the development of EzFlow.ai, a user-friendly platform designed to empower users with no coding experience to learn and implement machine learning projects.
- By automating data preprocessing, model training, and result visualization, EzFlow.ai provides users with predictions and comprehensive summary reports.
Aims to identify duplicate questions using natural language processing
To predict the total ride duration of taxi trips in New York city
Employs advanced NLP and ML to analyze Amazon reviews
- Tools: Python, Flask, AWS-for deployment.
- Developed a model to predict if a text review of a product given by user is positive or negative
- Performed extensive text cleaning and featurizing text data, achieved an AUC score of 0.90 using SGD Classifier.
- Deployed using Flask on AWS EC-2 virtual machine.
Model to Generate music using LSTM neural networks.
- Tools: Python, Keras, Tensorflow
- Implements an Artificial Music Generator using LSTM (Long Short-Term Memory) networks, a type of recurrent neural network (RNN).
- The system generates music character by character based on a given input dataset.
- LSTM model is trained on a corpus of music data and then sampled from the trained model to generate new music compositions.
A recommendation system for Recommending Similar Universities
Research Publications
Facial Feature Extraction and Emotional Analysis Using ML
Performance Comparison of Prediction Algorithms for Forecasting of Wind Power Generation
- Research compares ARIMA, SARIMAX, and ARMA algorithms for Wind power generation forecasting.
- ARIMA identified as the most accurate with the lowest MSE (523.01).
- Accurate forecasting minimizes errors, enhances power grid reliability.
- Click on the link to view the IEEE paper for detailed information.
Skills
Languages and Databases
Libraries
Frameworks
Other
Education
Washington DC, USA
Degree: Master of Science in Data Science
CGPA: 4.0/4.0
- Data Warehousing
- Introduction to Data Science
- Introduction to Data Mining
- Machine Learning
- Data Visualization
- Algorithms for Data Science
Relevant Courseworks:
Bangalore, India
Degree: Bachelor of Engineering in Electrical and Electronics Engineering
GPA: 7.68/10
- Programming in C and Data Structures
- Engineering Mathematics
- Engineering Physics
- Object Oriented Programming Using C++
- Python Application Programming
Relevant Courseworks:
Blogs
- 1. Logistic Regression’s Journey with Imbalanced Data
- - Read on TowardsAI: Logistic Regression’s Journey with Imbalanced Data
- 2. Unleashing the Power of Skrub: Revolutionizing Table Preparation for Machine Learning
- - Explore on Stackademic: Unleashing the Power of Skrub: Revolutionizing Table Preparation for Machine Learning
- 3. CarMaker by IPG Automotive: An innovative tool for the development and validation of vehicles
- - Explore on Medium: CarMaker by IPG Automotive: An innovative tool for the development and validation of vehicles
- 4. Unraveling the World of ADAS: Enhancing Automotive Safety and Comfort
- - Explore on Medium: Unraveling the World of ADAS: Enhancing Automotive Safety and Comfort
- 5. Clustering unveiled: The Intersection of Data Mining, Unsupervised Learning, and Machine Learning
- - Explore on Medium: Clustering unveiled: The Intersection of Data Mining, Unsupervised Learning, and Machine Learning
- You can visit my medium account here:
- - Checkout MEDIUM