A data science project focused on exploring, visualizing, and modeling an Indian movies dataset. This project applies Exploratory Data Analysis (EDA), classification, and regression techniques to uncover patterns and predict movie success metrics.
--
├── data/ # Dataset files (CSV)
├── eda/ # EDA notebooks and visualizations
├── models/ # Classification & regression models
├── utils/ # Utility functions
├── outputs/ # Graphs, plots, and predictions
├── requirements.txt # Python dependencies
└── main.ipynb # Main notebook (EDA + Modeling)
- ✅ Data cleaning and preprocessing
- 📊 Exploratory Data Analysis (EDA) with Matplotlib & Seaborn
- 🤖 Machine Learning models:
- Classification (Decision Trees, Random Forests, etc.)
- Regression (Linear Regression, Random Forest Regressor, etc.)
- 📈 Evaluation metrics: Accuracy, MAE, RMSE, R²
- 🔍 Insightful visualizations and trend analysis
The dataset contains information about Indian movies such as:
- Title, Genre, Language
- IMDb rating
- Number of votes
- Release date
- Budget and box office performance
📁 Source of dataset (add link if public)
git clone https://github.com/adityaranjan08/Analysis-of-Movie-Dataset-Using-Exploratory-Data-Analysis-Classification-and-Regression.git
cd Analysis-of-Movie-Dataset-Using-Exploratory-Data-Analysis-Classification-and-Regression
pip install -r requirements.txt
Use Jupyter Notebook or Jupyter Lab:
jupyter notebook main.ipynb
Some interesting findings:
- Certain genres and languages are more likely to succeed.
- IMDb ratings are strongly correlated with vote counts.
- Regression models can reasonably predict movie popularity.
Detailed results and charts are available in the
outputs/
folder.
- Python (Pandas, NumPy, Scikit-learn)
- Seaborn & Matplotlib for Visualization
- Jupyter Notebook
- Git & GitHub
Pull requests are welcome! For major changes, please open an issue first to discuss.
This project is licensed under the MIT License - see the LICENSE file for details.
Aditya Ranjan
📧 Email
🌐 GitHub
⭐️ If you found this repo helpful, feel free to give it a star!