trafilatura-1.6.3
Extraction:
- preserve space in certain elements with @idoshamun (#429)
- optional list of XPath expressions to prune by @HeLehm (#414)
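To illustrate what pruning by a list of XPath expressions means, here is a minimal standard-library sketch. It is not trafilatura's implementation: the `prune` helper and the sample markup are hypothetical, and `xml.etree.ElementTree` only supports a limited XPath subset. In trafilatura itself the list is, as far as I know, passed to the extraction functions through a `prune_xpath` argument; check the documentation for your version.

```python
import xml.etree.ElementTree as ET


def prune(tree, xpaths):
    """Remove every element matched by any of the given XPath expressions.

    Hypothetical helper for illustration only.
    """
    # ElementTree has no parent pointers, so build a child -> parent map first.
    parents = {child: parent for parent in tree.iter() for child in parent}
    for expression in xpaths:
        # findall returns a list, so removing elements while looping is safe.
        for element in tree.findall(expression):
            parent = parents.get(element)
            if parent is not None:
                parent.remove(element)
    return tree


doc = ET.fromstring(
    "<html><body><nav>Menu</nav><p>Main text.</p>"
    "<div class='ad'>Buy now</div></body></html>"
)
pruned = prune(doc, [".//nav", ".//div[@class='ad']"])
remaining = [el.tag for el in pruned.iter()]
print(remaining)
```

Only the `html`, `body` and `p` elements are left after pruning, so unwanted sections never reach the text extraction step.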
Metadata:
- more precise date extraction (see htmldate)
- new htmldate extensive search parameter in config (#434)
- changes in URLs: normalization, trackers removed (see courlan)
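As a sketch of how a boolean search switch of this kind can live in an INI-style config, the snippet below uses only the standard `configparser` module. The option name `EXTENSIVE_DATE_SEARCH` and the `[DEFAULT]` section are assumptions about the settings file layout, not confirmed by these notes; verify them against the settings file shipped with your version.

```python
import configparser

config = configparser.ConfigParser()
config.read_string("""
[DEFAULT]
# Assumed option name, shown for illustration only.
EXTENSIVE_DATE_SEARCH = on
""")

# getboolean accepts on/off, yes/no, true/false and 1/0.
extensive = config.getboolean("DEFAULT", "EXTENSIVE_DATE_SEARCH")
print(extensive)
```

Switching the value to `off` would make `getboolean` return `False`, which a caller could use to skip the slower, more opportunistic date heuristics.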
Navigation:
Documentation: