Description
Describe the bug
The current review scraping process in the IMDb ratings system exhibits certain inaccuracies, particularly in distinguishing between valid and fake user actions. Addressing this issue is crucial for maintaining the integrity of the review data. Additionally, IMDb’s movie recommendation system requires enhancement to provide more precise and personalized recommendations based on user preferences. I goal for this project to improve the recommendation system using various machine learning techniques and Python programming, while also implementing a robust classification mechanism to differentiate between genuine and fraudulent reviews. The final results will be exported in CSV format for further analysis
To Reproduce
Steps to reproduce the behavior:
Scraping Reviews:
1.Using Python libraries like BeautifulSoup to load IMDb pages.
2. Find movie links on IMDb.
3. Extract user reviews from the movie pages.
4. Store the scraped data in a suitable format (e.g., CSV).
- Enhance Movie Recommendation System:
- Preprocess datasets using Pandas.
- Explore content-based and collaborative filtering techniques.
- Calculate similarity metrics (e.g., cosine similarity) to recommend similar movies.
Expected behavior
I have expected to fix the bugs and make the review scraping system precision and accuray rate to be more high.
Desktop (please complete the following information):
- OS: Microsoft Windows 11
- Browser: chrome, edge and brave
- Version : 23H2
Smartphone (please complete the following information):
- Device: Realme 9 5g [Android]
- OS: realme UI 4.0
- Browser: chrome, google
- Version: RMX3388_11
Additional context
we can also add features like: advertising the better movie option for the user with bad experience. also, to conduct some interactive sessions to seek more attention and empower the IMDb promotions. the sessions can be like: movie quizs, riddles, funny facts and myth brusters etc.. just to have more interaction with users and make users participation higher.
PLEASE PULL UP THE REQUEST FOR THE PROJECT. OPEN TO ANY SUGGESTION OR IDEAS.
I am a Contributor in GSSoc'24##
Metadata
Metadata
Assignees
Type
Projects
Status