Skip to content

NIH-CARD/longread-report-parser

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Long Read HTML Report Parsing

To clone from GitHub:

# Download this repo
git clone https://github.com/NIH-CARD/longread-report-parser.git
cd longread-report-parser

# Generate list of files using `find`
find /data/CARDPB/data/PPMI/SEQ_REPORTS/example_reports/ -type f -name '*.html' > examplereports.txt

# Execute with file list of html reports (one per line):
python3 extract.py --filelist examplereports.txt > output.tsv

# Alternatively, execute on all html files within a directory
# (does not descend into subdirectories)
python3 extract.py --html_dir /data/CARDPB/data/PPMI/SEQ_REPORTS/example_reports/ > output.tsv

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%