A local-first script to process ebooks into slideshows or movies using several types of generative AI.
conda create --name book2movie python=3.11
pip install -r requirements.txtpython main.py data/AStudyInScarlet.epub 4 17python main.py --listen 0.0.0.0 --port 8188- For best results, prepend all models in Kokoro-FastAPI with
male,female, and at least onemachine. - Leave
af_heart.ptas is (it's hardcoded). - This improves voice mapping so male characters are more likely to use male voices, etc. Current voice types are "Masculine", "Feminine", "Machine", and "Unknown".
ollama pull gemma3:27b-it-qat
ollama pull mistral-small3.2:24b- Gemma3: Best for structured outputting a list of characters.
- Mistral-small3.2: Slightly better for mapping characters to quotes.
- Model names are in the config.
