Skip to content

Audiobook alignment for North American Indigenous languages

License

Notifications You must be signed in to change notification settings

finguist/ReadAlong-Studio

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

readlongs

Audiobook alignment for North American Indigenous languages

end product

The concept is a web application with a series of stages of processing, which ultimately leads to a time-aligned audiobook - i.e. a package of:

  • SMIL file describing time alignments
  • TEI file describing text
  • Audio file (WAV or MP3)

Which can be loaded using the read-along JavaScript component.

Optionally a book can be generated as a standalone HTML page or as an ePub file.

  1. (optional) Pre-segment inputs, consisting of:
    • Single audio file
    • Text with page markings (assume paragraph breaks = pages)
  2. Input pages: each page consists of
    • Image file
    • Audio file
    • Text
  3. Run alignment
  4. View output and download components

roadmap

  • MVP app:
    • Single page (image, audio, text)
    • Select language (crl or atj for now)
    • Run alignment and launch read-along app with output

running the web app

  1. pip install -e .
  2. python
  3. >>> from readalongs.app import app
  4. app.run()

generating an ePub

  1. pip install -e .
  2. readalongs_align --output-xhtml XMLFILE WAVFILE OUTPUTNAME
  3. readalongs_create_epub OUTPUTNAME.smil OUTPUTNAME.epub

About

Audiobook alignment for North American Indigenous languages

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • JavaScript 72.0%
  • Python 17.5%
  • CSS 8.1%
  • HTML 2.4%