Skip to content

Data-Wrangling-and-Visualisation/JobHack

Repository files navigation

JobHack - Job Market Data Wrangling & Analysis

GitHub
GitHub last commit
GitHub repo size

JobHack is a powerful toolkit for scraping, analyzing, and visualizing job market data from platforms like LinkedIn, Headhunter, and Indeed. Designed for job seekers, recruiters, and data enthusiasts.

🔥 Key Features

🕷️ Web Scrapers

  • LinkedIn Scraper (scrapy spider working fully)
  • Headhunter Integration (2,000+ jobs collected)
  • Coming Soon: Indeed & Glassdoor support

🧠 AI-Powered Augmentation

  • LLaMA 3.2 (1B) via Ollama (for future...)
  • Extracts:
    • Salary ranges
    • Technical requirements
    • Education levels
    • Company details

📊 Analysis Tools

  • Salary distribution visualizations
  • Skills demand heatmaps
  • Remote work trends analysis

🛠️ Tech Stack

# Core Technologies
- Scrapy, BeautifulSoup  # Web scraping
- Pandas, NumPy         # Data wrangling
- Matplotlib, Plotly    # Visualization
- Ollama (LLaMA 3.2)    # NLP augmentation
- Jupyter Notebooks     # Analysis

🚀 Quick Start

Clone the repo:

git clone https://github.com/Data-Wrangling-and-Visualisation/JobHack.git
cd JobHack

Run the webservice:

cd my_flask_app
python app.py

Then you can open your browser and check different visuals

Alt text Alt text Alt text

Alt text

Alt text

Alt text

About

Understanding job market

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •