-
-
Notifications
You must be signed in to change notification settings - Fork 267
Issues: adbar/trafilatura
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Documentation: on precision
documentation
Docs in need of update or extension
#766
opened Dec 10, 2024 by
DesBw
CLI: better control of output file names
enhancement
New feature or request
#754
opened Nov 30, 2024 by
DesBw
Support for sidemap parsing from text instead of urls
feedback
Feedback from users requested
#751
opened Nov 27, 2024 by
NiClassic
Performance bottleneck in Further information is requested
prune_unwanted_nodes
causing 200ms per call
question
#750
opened Nov 23, 2024 by
thsunkid
Review input type for New feature or request
is_probably_readerable()
function
enhancement
#749
opened Nov 22, 2024 by
adbar
Documentation about settings could use examples
documentation
Docs in need of update or extension
#746
opened Nov 15, 2024 by
georgedorn
Review HTML element list and conversion
enhancement
New feature or request
#720
opened Oct 15, 2024 by
adbar
2 tasks
Docs: add page explaining how to run tests
documentation
Docs in need of update or extension
#698
opened Sep 9, 2024 by
adbar
Downloads: add support to switch between proxies
enhancement
New feature or request
#697
opened Sep 9, 2024 by
adbar
Empty Results When Using Spider Function with Category URL
question
Further information is requested
#696
opened Sep 9, 2024 by
felipehertzer
feat(cli/lib): Add tqdm based progress bar as an option
enhancement
New feature or request
#663
opened Jul 30, 2024 by
chitralverma
Investigate spacing in element tails
question
Further information is requested
#661
opened Jul 26, 2024 by
adbar
Faulty extraction for very short documents
enhancement
New feature or request
#660
opened Jul 26, 2024 by
Psynbiotik
Missing h1 heading if <header> outside of <article>
question
Further information is requested
#642
opened Jul 11, 2024 by
chrisgoddard
some extraction duplicated in xml
question
Further information is requested
#634
opened Jun 27, 2024 by
fortyfourforty
Account for empty cells in table extraction (xml)
enhancement
New feature or request
#633
opened Jun 27, 2024 by
fortyfourforty
Image/Video caption and credits removal
documentation
Docs in need of update or extension
question
Further information is requested
#616
opened Jun 6, 2024 by
hamsarajan
It's set include_images=True, but there is no picture
bug
Something isn't working
#610
opened May 31, 2024 by
dark2star
New port of readability.js?
question
Further information is requested
#604
opened May 23, 2024 by
zirkelc
Add option to provide XPaths for content extraction
enhancement
New feature or request
#596
opened May 16, 2024 by
klvbdmh
utils.decode_file()
: add switch for full detection or GZip only
enhancement
#595
opened May 15, 2024 by
adbar
Extracting content from an URl is getting none
question
Further information is requested
#586
opened May 5, 2024 by
Fabiha15
Wrong links position in text from telegram post
question
Further information is requested
#585
opened May 4, 2024 by
RedHotUnicorn
Removing related links at end of article/sidebar on news websites?
bug
Something isn't working
#584
opened May 3, 2024 by
rahulbot
Previous Next
ProTip!
Follow long discussions with comments:>50.