Project Origin

This project grew out of the author's frustration while listening to English podcasts: unfamiliar words kept coming up, and looking each one up individually was cumbersome. The goal was therefore to automate the generation of a vocabulary list of difficult words.

Difficult English Word Viewer

The Difficult English Word Viewer allows users to upload audio/text and automatically generates a vocabulary list of challenging words, providing a convenient way to review them.

Main Features

Supports uploading audio, text, and word list files in various formats (txt, md, rtf, mp3, wav, ogg, flac, json)
Automatically generates a vocabulary list of difficult words
View word definitions and example sentences
Generate new example sentences for words
Export word data in JSON format
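
This README does not describe how a word is judged "difficult". As a rough illustration only, one common approach is to drop words that appear in a list of frequent words and keep the rest. The sketch below assumes that approach; `COMMON_WORDS`, `extract_difficult_words`, and the `min_length` threshold are hypothetical names chosen for this example, not part of the project's code.

```python
import re
from collections import Counter

# Hypothetical stand-in for a real word-frequency list. The project's actual
# criterion for "difficult" is not stated in this README.
COMMON_WORDS = {
    "the", "a", "an", "and", "of", "to", "in", "is", "it", "was",
    "that", "this", "with", "for", "on", "he", "she", "they", "we",
}


def extract_difficult_words(text, common_words=COMMON_WORDS, min_length=4):
    """Return (word, count) pairs for uncommon words, most frequent first."""
    # Lowercase the text and keep only alphabetic runs as tokens.
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(
        token
        for token in tokens
        if len(token) >= min_length and token not in common_words
    )
    # Sort by descending count, then alphabetically for a stable order.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


sample = "The ubiquitous speaker was ubiquitous and perspicacious."
print(extract_difficult_words(sample))
# [('ubiquitous', 2), ('perspicacious', 1), ('speaker', 1)]
```

A real frequency list would be far larger than the handful of words shown here, and the threshold for "common" would likely need tuning per learner.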

Installation Steps

Clone the repository:

git clone https://github.com/adot08/audio-to-word-list-generator.git
cd audio-to-word-list-generator

Create and activate a virtual environment:

python -m venv venv
# On Windows, use:
venv\Scripts\activate
# On macOS and Linux, use:
source venv/bin/activate

Install required packages:
```
pip install -r requirements.txt
```

Usage Instructions

Start the Flask application:
```
python app.py
```
Visit http://localhost:5000 in your web browser
Use the "Choose File" button to upload an audio, text, or word list file
Interact with the loaded words using the provided buttons and features

Notes

For larger audio files, processing can take longer, mainly because of file segmentation and automatic speech recognition (ASR). Generating the AI explanations afterwards also takes time, which grows with the number of difficult words, so please be patient.
Processed files can be exported and imported directly for future use.
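
To give a feel for why long recordings take longer, the sketch below plans how an audio file might be cut into fixed-length segments before each one is sent to ASR. It is an illustration under stated assumptions: the 60-second segment length, the optional overlap, and the name `plan_segments` are hypothetical, and the project's real segmentation logic may differ.

```python
def plan_segments(duration_s, segment_s=60.0, overlap_s=0.0):
    """Return (start, end) times in seconds that cover the whole recording.

    All parameters are hypothetical defaults chosen for illustration.
    """
    if segment_s <= 0:
        raise ValueError("segment_s must be positive")
    if not 0 <= overlap_s < segment_s:
        raise ValueError("overlap_s must be in [0, segment_s)")

    duration_s = float(duration_s)
    step = segment_s - overlap_s  # how far each new segment advances
    segments = []
    start = 0.0
    while start < duration_s:
        end = min(start + segment_s, duration_s)
        segments.append((start, end))
        if end >= duration_s:
            break  # the last segment reached the end of the recording
        start += step
    return segments


# A 150-second recording split into 60-second pieces yields three segments.
print(plan_segments(150))
# [(0.0, 60.0), (60.0, 120.0), (120.0, 150.0)]
```

Under this scheme the number of segments, and so the number of ASR requests, grows roughly linearly with the recording length.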

Configuration

All optional settings live in config.yaml, where you can change the ASR service and the LLM service the project uses. In this project I chose SiliconFlow, whose comprehensive set of services makes extension and model switching convenient.
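
The layout of config.yaml is not shown in this README, so the following is only a sketch of how optional settings could be layered over defaults. The keys (`asr`, `llm`, `provider`, `model`), the placeholder model names, and the function `merge_config` are assumptions made for illustration. In practice the `overrides` dictionary would come from parsing config.yaml, for example with PyYAML's `yaml.safe_load` if that library is used.

```python
import copy

# Hypothetical defaults. The real keys and values in config.yaml may differ.
DEFAULTS = {
    "asr": {"provider": "siliconflow", "model": "example-asr-model"},
    "llm": {"provider": "siliconflow", "model": "example-llm-model"},
}


def merge_config(defaults, overrides):
    """Return a new dict where values in overrides replace those in defaults.

    Nested dictionaries are merged key by key; neither input is modified.
    """
    merged = copy.deepcopy(defaults)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = merge_config(merged[key], value)
        else:
            merged[key] = copy.deepcopy(value)
    return merged


# Switching only the LLM model keeps every other default in place.
config = merge_config(DEFAULTS, {"llm": {"model": "another-llm-model"}})
print(config["llm"])
# {'provider': 'siliconflow', 'model': 'another-llm-model'}
```

Merging this way means a user only needs to list the settings they want to change, which suits a file where every entry is optional.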
