Skip to content

fm39hz/hako-crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

hako-crawler

A Python-based tool to download light novels from Hako (docln.net, ln.hako.vn) and convert them into EPUB format for offline reading.

Features

  • Interactive Downloader: Guides you through selecting a novel and choosing which volumes to download.
  • EPUB Generation: Converts the downloaded content into well-formatted EPUB files.
  • Flexible Build Options:
    • Merged EPUB: Combine all selected volumes into a single, comprehensive EPUB file.
    • Separate EPUBs: Create an individual EPUB file for each volume.
  • Smart Caching: Saves downloaded chapter data locally. It intelligently re-downloads only if the cached data is missing or incomplete, saving time and bandwidth.
  • Image and Footnote Handling: Downloads and embeds images within the chapters and correctly processes footnotes for a clean reading experience.
  • Resilient Networking: Automatically retries downloads and cycles through available domains to handle network errors and site changes gracefully.

Usage

  1. Install dependencies:

    pip install -r requirements.txt
  2. Run the script:

    python hako_crawler.py

    Or provide a URL directly:

    python hako_crawler.py "docln.sbs url"
  3. Follow the on-screen prompts to select an action (Download, Build, or Full Process) and choose the desired volumes.

The script will create a saved_data directory to store the downloaded content and a result directory for the final EPUB files.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages