Skip to content

Charset Normalizer

Compare
Choose a tag to compare
@Ousret Ousret released this 28 Sep 19:16
· 474 commits to master since this release
6ea66b2

Changes :

  • Feature : Added has_submatch, percent_chaos and percent_coherence properties on single match object.
  • Improvement : best() method of CharsetNormalizerMatches has been rewritten for better readability.
  • Feature : Added explain boolean positional parameter to print out what actually happen when searching for a match.
  • Improvement : Detection has been globally improved.
  • Feature : You can exclude some encoding when searching for a match with parameter cp_exclusion. List of str. for from_bytes from_path and from_fp.
  • Feature : You can limit the search to some encoding when looking for a match with parameter cp_isolation. List of str. for from_bytes from_path and from_fp.
  • Feature : import charset_normalizer is enough to provide additional help when you encounter UnicodeDecodeError exception.