Solving Audio Captchas

Solving Audio Captchas using Machine Learning

Authors: Sampriti Panda, Duy Nguyen

Requirements

We have provided around 50 train and 10 test cases per category, but you need to generate around 1000 train data to replicate our results.
To generate data using our scripts, please cd into the training_data/ directory and run: ./gen_data.sh.
You can also download pre-generated training data from: https://drive.google.com/file/d/19ypbdOiafc3Ocr9ltHIFjJI9uQXlEuJR/view?usp=sharing
poc.py contains our original algorithm, which gives around 70% accuracy on digits and 50% on letters.
poc2.py contains our improved algorithm, which gives around 95% accuracy.
To run either of these implementations, modify the DIR_TRAIN and DIR_TEST directories to the necessary locations, and run python poc.py.

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
data		data
training_data		training_data
.gitignore		.gitignore
.gitmodules		.gitmodules
FeatureExtraction.py		FeatureExtraction.py
LICENSE		LICENSE
MLAlgo.py		MLAlgo.py
README.md		README.md
collectTrainData.py		collectTrainData.py
collectTrainData2.py		collectTrainData2.py
getPotentialSpeakLocation.py		getPotentialSpeakLocation.py
getPotentialSpeakLocation2.py		getPotentialSpeakLocation2.py
poc.py		poc.py
poc2.py		poc2.py
rasta.py		rasta.py
results_poc.txt		results_poc.txt
results_poc2.txt		results_poc2.txt