Solving Audio Captchas using Machine Learning
Authors: Sampriti Panda, Duy Nguyen
- python >= 3.6
- numpy
- scipy
- matplotlib
- python_speech_features:
pip install python_speech_features - sklearn
- keras (Not-needed, you can comment out the imports)
pip install numba==0.48.0- librosa, spectrum:
pip install librosa spectrum
- We have provided around 50 train and 10 test cases per category, but you need to generate around 1000 train data to replicate our results.
- To generate data using our scripts, please cd into the
training_data/directory and run:./gen_data.sh. - You can also download pre-generated training data from: https://drive.google.com/file/d/19ypbdOiafc3Ocr9ltHIFjJI9uQXlEuJR/view?usp=sharing
poc.pycontains our original algorithm, which gives around 70% accuracy on digits and 50% on letters.poc2.pycontains our improved algorithm, which gives around 95% accuracy.- To run either of these implementations, modify the
DIR_TRAINandDIR_TESTdirectories to the necessary locations, and runpython poc.py.