cpdetector 1.0.6 (Default branch) |
|
|
cpdetector is a small yet clever framework for
codepage detection that integrates different
strategies. It may be used as a library for third
party software that accesses textual data over
network. It also includes a best-practice
implementation in form of a command line tool that
allows sorting and transforming large collections
of documents based on their codepage. Available
strategies include: jchardet (exclusion, frequency
analysis, and guessing), detection of the HTML
charset property, and detection of the XML
encoding declaration.
License: Mozilla Public License (MPL)
Changes:
The proguard shrinker is now used, so the
cpdetector jar is now more than ten times smaller.
System.out is no longer used for logging in
JChardetFacade. All packages were renamed with the
prefix "info.monitorenter".
|