Skip to content

sampritipanda/audio_captcha_solver

Repository files navigation

Solving Audio Captchas

Solving Audio Captchas using Machine Learning

Authors: Sampriti Panda, Duy Nguyen

Requirements

  • python >= 3.6
  • numpy
  • scipy
  • matplotlib
  • python_speech_features: pip install python_speech_features
  • sklearn
  • keras (Not-needed, you can comment out the imports)
  • pip install numba==0.48.0
  • librosa, spectrum: pip install librosa spectrum

Generating Training Data

  • We have provided around 50 train and 10 test cases per category, but you need to generate around 1000 train data to replicate our results.
  • To generate data using our scripts, please cd into the training_data/ directory and run: ./gen_data.sh.
  • You can also download pre-generated training data from: https://drive.google.com/file/d/19ypbdOiafc3Ocr9ltHIFjJI9uQXlEuJR/view?usp=sharing
  • poc.py contains our original algorithm, which gives around 70% accuracy on digits and 50% on letters.
  • poc2.py contains our improved algorithm, which gives around 95% accuracy.
  • To run either of these implementations, modify the DIR_TRAIN and DIR_TEST directories to the necessary locations, and run python poc.py.

About

Solving Audio Captchas using Machine Learning

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •