Cyber Bullying Detection

Overview

This project is a machine learning-based web application for detecting cyberbullying in tweets. It uses natural language processing (NLP) and supervised learning to classify text as bullying or not. The app is built with Streamlit for an interactive user interface.

Features

Input text/tweet and get instant cyberbullying prediction
Uses a trained machine learning model (e.g., Decision Tree, Random Forest, or similar)
Text preprocessing with NLTK (stopwords, tokenization)
TF-IDF vectorization for feature extraction
Model accuracy and evaluation metrics

Dataset

cyberbullying_tweets.csv: Contains labeled tweets for training and testing
Classes: Bullying, Not Bullying (binary classification)

Machine Learning Pipeline

Data Preprocessing
- Remove stopwords
- Tokenize and clean text
- Convert text to lowercase
Feature Extraction
- TF-IDF Vectorizer transforms text into numerical features
- Vectorizer is saved as tfidf_vectorizer.pkl
Model Training
- Model (e.g., DecisionTreeClassifier) is trained on the vectorized data
- Model is saved as bullying_model.pkl
- Model accuracy is evaluated and reported
Prediction
- User input is preprocessed and vectorized
- Model predicts if the input is bullying or not

Streamlit App

Main file: app.py
Loads the trained model and vectorizer
Provides a simple UI for text input and displays prediction
Can be run locally or deployed online

How to Run Locally

Install dependencies:
```
pip install -r requirements.txt
```
Start the app:
```
streamlit run app.py
```
The app will open in your browser.

How to Deploy Online

Push your project to GitHub
Go to Streamlit Community Cloud
Link your GitHub repo and select app.py as the main file
Deploy and share your app

Files

app.py: Streamlit web app
bullying_model.pkl: Trained ML model
tfidf_vectorizer.pkl: TF-IDF vectorizer
cyberbullying_tweets.csv: Dataset
bullying-classification-accuracy-80.ipynb: Training notebook
requirements.txt: Python dependencies

Requirements

Python 3.8+
scikit-learn
nltk
streamlit

License

MIT License

Author

[Rishav Shah]

Feel free to modify this README to add more details about your model, dataset, or deployment process.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Cyber Bullying Detection

Overview

Features

Dataset

Machine Learning Pipeline

Streamlit App

How to Run Locally

How to Deploy Online

Files

Requirements

License

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.devcontainer		.devcontainer
.gitattributes		.gitattributes
README.md		README.md
app.py		app.py
bullying-classification-accuracy-80.ipynb		bullying-classification-accuracy-80.ipynb
bullying_model.pkl		bullying_model.pkl
cyberbullying_tweets.csv		cyberbullying_tweets.csv
requirements.txt		requirements.txt
tfidf_vectorizer.pkl		tfidf_vectorizer.pkl

rishavafk/Cyber_Bullying-Detection

Folders and files

Latest commit

History

Repository files navigation

Cyber Bullying Detection

Overview

Features

Dataset

Machine Learning Pipeline

Streamlit App

How to Run Locally

How to Deploy Online

Files

Requirements

License

Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages