Skip to content
View abhayra12's full-sized avatar

Block or report abhayra12

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
abhayra12/README.md

Hi πŸ‘‹, I'm Abhay Ahirkar

A passionate developer from India | Data Engineer | AI/ML Enthusiast

Profile views

πŸ‘¨β€πŸ’» About Me

I'm a passionate Data Engineer and AI/ML Engineer specializing in building scalable data pipelines, implementing machine learning solutions, and developing Gen-AI applications. With expertise in cloud platforms and modern data technologies, I transform complex data challenges into innovative solutions.

  • πŸ”­ Currently working on: Advanced Data Engineering projects, LLM applications, and MLOps pipelines
  • 🌱 Learning: Advanced LLM fine-tuning, Real-time streaming architectures, and Cloud-native data solutions
  • πŸ‘― Collaboration: Open to collaborating on open-source data engineering, AI/ML, and Gen-AI projects
  • πŸ’¬ Ask me about: Python, SQL, Apache Spark, Kafka, dbt, MLOps, LangChain, RAG systems, Cloud platforms
  • πŸŽ“ Education: B.Tech in Electronics & Telecommunication Engineering from PCCOE, Pune
  • πŸ“« Reach me: Linkedin Badge Email Badge

πŸ› οΈ Tech Stack

Programming Languages

Python Bash

Big Data & Data Engineering

Spark Kafka Hadoop Hive Airflow dbt

Databases

PostgreSQL MySQL MongoDB SQLite

Machine Learning & AI

PyTorch TensorFlow Scikit-Learn Pandas NumPy Matplotlib Seaborn

Cloud & DevOps

GCP AWS Azure Docker Kubernetes Git

Gen-AI & LLM Tools

OpenAI LangChain LlamaIndex

Development Tools

Jupyter VS Code Linux Postman

πŸš€ Featured Projects (Last 1 Year)

Data Engineering & Analytics

  • Gen-AI-Course πŸ€–

    • Comprehensive course on Generative AI with LLMs and practical implementations
  • ML πŸ“š

    • Machine Learning projects and experiments with Jupyter notebooks
  • LLM 🧠

    • Large Language Model exploration and applications
  • lending_data_analytics πŸ’°

    • Data analytics project focused on lending data insights
  • taxi_rides_ny_dbt πŸš•

    • dbt project for NYC taxi ride data transformation and analysis
  • agri_data_pipeline 🌾

    • Agricultural data pipeline for ETL processes (5 stars ⭐)
  • de-zc-2025 πŸ“ˆ

    • Data Engineering Zoomcamp exercises and implementations
  • spark-practice ⚑

    • Apache Spark practice projects for distributed data processing

Search & AI Exploration

  • search-engine πŸ”

    • Building search engine with AI capabilities
  • projects πŸ“

    • Collection of various interesting projects

πŸ”₯ My GitHub Stats

GitHub Stats Top Languages
GitHub Streak

πŸ“Š Quick Stats

  • 35+ repositories
  • 7 followers
  • 150+ following
  • πŸ† Achievements: Quickdraw, Pull Shark x2, YOLO
  • πŸ’» Active in Data Engineering and AI/ML communities

🎯 What I'm Working On

  • πŸ—οΈ Building scalable data pipelines with Apache Spark, Kafka, and Airflow
  • πŸ€– Developing LLM applications using LangChain, RAG systems, and vector databases
  • πŸ“Š Implementing real-time analytics solutions with streaming technologies
  • ☁️ Architecting cloud-native data platforms on GCP, AWS, and Azure
  • πŸ”¬ Exploring MLOps practices for production ML deployment
  • πŸ“š Learning advanced distributed systems and data mesh architectures

πŸ’Ό Core Competencies

  • Data Engineering: ETL/ELT pipelines, Data warehousing, Batch & Stream processing
  • Machine Learning: Supervised/Unsupervised learning, Deep Learning, Model deployment
  • Gen-AI: RAG systems, LLM fine-tuning, Prompt engineering, Vector embeddings
  • Cloud Platforms: GCP (BigQuery, Dataflow), AWS (S3, Redshift, EMR), Azure
  • Orchestration: Apache Airflow, Prefect, Workflow automation
  • Data Modeling: Dimensional modeling, dbt, Data quality frameworks

πŸ† Certifications & Achievements

  • πŸŽ“ B.Tech in Electronics & Telecommunication Engineering - PCCOE PUNE
  • πŸ… GitHub Achievements: Quickdraw, Pull Shark x2, YOLO
  • πŸ“š Completed: Data Engineering Zoomcamp, Gen-AI Course
  • πŸ’‘ Active Contributor: Open-source data engineering projects

πŸ“ˆ GitHub Activity


πŸ’‘ "Data is the new oil, and I'm here to refine it!"

Thanks for visiting! Let's connect and build something amazing together! πŸš€

LinkedIn YouTube Email

Popular repositories Loading

  1. agri_data_pipeline agri_data_pipeline Public

    Shell 5 2

  2. kp kp Public

    2

  3. GitHub-Profile-Builder GitHub-Profile-Builder Public

    Forked from Ntshangase/GitHub-Profile-Builder

    A repository to help you build your best GitHub Profile. If you find this repository helpful please ⭐

    1

  4. taxi_rides_ny_dbt taxi_rides_ny_dbt Public

    Dockerfile 1

  5. lending_data_analytics lending_data_analytics Public

    Python 1

  6. image-to-stl image-to-stl Public

    Python