Live: rudil24.github.io/pdf-audio-reader
A static web app that reads PDF files aloud using the browser's built-in Web Speech API. No backend, no API keys, no install — runs entirely in the browser and deploys to GitHub Pages.
- Open any local PDF and have it read aloud page by page
- Pause / Resume playback at any time
- Skip to the next page manually
- Voice selector — choose from your OS's available voices
- Page counter synced to audio
- Zero cost, zero data sent anywhere
- Open the app in Chrome, Firefox, or Safari (desktop)
- Click Open PDF and select a
.pdffile from your computer - The app renders the first page and begins reading automatically
- Use Pause / Resume and Next Page to control playback
- Switch voices from the dropdown at any time
| Browser | Status |
|---|---|
| Chrome (desktop) | Full support |
| Firefox (desktop) | Full support |
| Safari (desktop) | Full support |
| Mobile browsers | Partial — iOS Safari has known quirks |
- Push this repo to GitHub
- Go to Settings → Pages
- Set source to
mainbranch, root folder/ - Your app will be live at
https://<username>.github.io/<repo-name>/
No build step required.
- Chrome has a ~60-second TTS timeout bug — mitigated with a keepalive timer
- Text extraction quality depends on the PDF's encoding (scanned/image PDFs will have no extractable text)
- Voice availability depends on the user's OS and browser
- PRD.md — Product Requirements Document
- TASK_LIST.md — Development task list
- LOCAL_LOG.md — Project log