I'm a passionate Data Engineer and AI/ML Engineer specializing in building scalable data pipelines, implementing machine learning solutions, and developing Gen-AI applications. With expertise in cloud platforms and modern data technologies, I transform complex data challenges into innovative solutions.
- π Currently working on: Advanced Data Engineering projects, LLM applications, and MLOps pipelines
- π± Learning: Advanced LLM fine-tuning, Real-time streaming architectures, and Cloud-native data solutions
- π― Collaboration: Open to collaborating on open-source data engineering, AI/ML, and Gen-AI projects
- π¬ Ask me about: Python, SQL, Apache Spark, Kafka, dbt, MLOps, LangChain, RAG systems, Cloud platforms
- π Education: B.Tech in Electronics & Telecommunication Engineering from PCCOE, Pune
- π« Reach me:
-
Gen-AI-Course π€
- Comprehensive course on Generative AI with LLMs and practical implementations
-
ML π
- Machine Learning projects and experiments with Jupyter notebooks
-
LLM π§
- Large Language Model exploration and applications
-
- Data analytics project focused on lending data insights
-
taxi_rides_ny_dbt π
- dbt project for NYC taxi ride data transformation and analysis
-
agri_data_pipeline πΎ
- Agricultural data pipeline for ETL processes (5 stars β)
-
de-zc-2025 π
- Data Engineering Zoomcamp exercises and implementations
-
spark-practice β‘
- Apache Spark practice projects for distributed data processing
-
search-engine π
- Building search engine with AI capabilities
-
projects π
- Collection of various interesting projects
- 35+ repositories
- 7 followers
- 150+ following
- π Achievements: Quickdraw, Pull Shark x2, YOLO
- π» Active in Data Engineering and AI/ML communities
- ποΈ Building scalable data pipelines with Apache Spark, Kafka, and Airflow
- π€ Developing LLM applications using LangChain, RAG systems, and vector databases
- π Implementing real-time analytics solutions with streaming technologies
- βοΈ Architecting cloud-native data platforms on GCP, AWS, and Azure
- π¬ Exploring MLOps practices for production ML deployment
- π Learning advanced distributed systems and data mesh architectures
- Data Engineering: ETL/ELT pipelines, Data warehousing, Batch & Stream processing
- Machine Learning: Supervised/Unsupervised learning, Deep Learning, Model deployment
- Gen-AI: RAG systems, LLM fine-tuning, Prompt engineering, Vector embeddings
- Cloud Platforms: GCP (BigQuery, Dataflow), AWS (S3, Redshift, EMR), Azure
- Orchestration: Apache Airflow, Prefect, Workflow automation
- Data Modeling: Dimensional modeling, dbt, Data quality frameworks
- π B.Tech in Electronics & Telecommunication Engineering - PCCOE PUNE
- π GitHub Achievements: Quickdraw, Pull Shark x2, YOLO
- π Completed: Data Engineering Zoomcamp, Gen-AI Course
- π‘ Active Contributor: Open-source data engineering projects



