cpdetector

Project release information and project resources. Note that this information comes from the project's Freecode.com page, and the downloads themselves may not be hosted on SourceForge.JP.

Project Release Information

2011-12-05 03:19
This release fixes a crash in command line mode when an invalid declared charset (the "" charset) was found.
The command line tool (CodepageProcessor) no longer returns 0 when an error occurs. A bug that broke the ability to reset input streams after detection was fixed.
2011-11-17 05:44
This major bugfix release fixes two issues in command line batch mode. The switch to skip moving undetected documents works again. Undetected documents are no longer transcoded (attempting this caused exceptional program flow).
2010-06-27 21:59
This version is a stability release and fixes the byte order mark detection and incompatibility with OpenJDK. It also requires Java 1.5 now.
2008-06-18 06:19
The release structure has been changed: cpdetector.jar no longer contains 3rd party library files. Public functions that were missing are included again. The proguard shrinker has been updated from version 3.8 to 4.2.
2008-06-15 18:22
The proguard shrinker is now used, so the cpdetector jar is now more than ten times smaller. System.out is no longer used for logging in JChardetFacade. All packages were renamed with the prefix "info.monitorenter".

Project Resources

http://freecode.com/urls/883c03257eb733a0ef6e20229a807200
http://freecode.com/urls/cb946a5567c6a7ae46b1dc27260b8653
http://freecode.com/urls/12840a7a6a845af564fb5d1a0c541ecf
http://freecode.com/urls/6ce1befe58be81cbe633f13f78c87f05
http://freecode.com/urls/e1d95b2516fbe83b7cb40092e854eae2
http://freecode.com/projects/cpdetector

Project Description

cpdetector is a small yet clever framework for
codepage detection that integrates different
strategies. It may be used as a library for third
party software that accesses textual data over
a network. It also includes a best-practice
implementation in the form of a command line tool that
allows sorting and transforming large collections
of documents based on their codepage. Available
strategies include: jchardet (exclusion, frequency
analysis, and guessing), detection of the HTML
charset property, and detection of the XML
encoding declaration.
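
The idea of integrating several detection strategies can be illustrated with a minimal, standalone sketch. It is not the cpdetector API: the function names (`detect_bom`, `detect_xml_declaration`, `detect_html_meta`, `detect_codepage`) are hypothetical, the language is Python rather than Java, and the jchardet frequency analysis is left out. The sketch only assumes that strategies are tried in order and that the first one to report a charset wins.

```python
import codecs
import re
from typing import Callable, Optional, Sequence

# Byte order marks; UTF-32 comes before UTF-16 because the UTF-32LE mark
# starts with the same two bytes as the UTF-16LE mark.
_BOMS = [
    (codecs.BOM_UTF32_LE, "UTF-32LE"),
    (codecs.BOM_UTF32_BE, "UTF-32BE"),
    (codecs.BOM_UTF8, "UTF-8"),
    (codecs.BOM_UTF16_LE, "UTF-16LE"),
    (codecs.BOM_UTF16_BE, "UTF-16BE"),
]

# Simplified patterns for the XML encoding declaration and the HTML charset
# property; real documents may need a proper parser.
_XML_DECL = re.compile(
    rb"""<\?xml[^>]*?encoding\s*=\s*["']([A-Za-z0-9._:-]+)["']"""
)
_HTML_META = re.compile(
    rb"""<meta[^>]*?charset\s*=\s*["']?([A-Za-z0-9._:-]+)""", re.IGNORECASE
)

Strategy = Callable[[bytes], Optional[str]]


def detect_bom(data: bytes) -> Optional[str]:
    """Return the charset implied by a leading byte order mark, if any."""
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name
    return None


def detect_xml_declaration(data: bytes) -> Optional[str]:
    """Return the charset named in an XML encoding declaration, if any."""
    match = _XML_DECL.search(data[:1024])
    return match.group(1).decode("ascii").upper() if match else None


def detect_html_meta(data: bytes) -> Optional[str]:
    """Return the charset named in an HTML meta element, if any."""
    match = _HTML_META.search(data[:4096])
    return match.group(1).decode("ascii").upper() if match else None


# Assumed ordering: cheapest and most reliable signal first.
DEFAULT_STRATEGIES: Sequence[Strategy] = (
    detect_bom,
    detect_xml_declaration,
    detect_html_meta,
)


def detect_codepage(
    data: bytes, strategies: Sequence[Strategy] = DEFAULT_STRATEGIES
) -> Optional[str]:
    """Try each strategy in order and return the first charset found."""
    for strategy in strategies:
        result = strategy(data)
        if result is not None:
            return result
    return None  # undetected; the caller decides what to do


if __name__ == "__main__":
    sample = b'<?xml version="1.0" encoding="ISO-8859-1"?><doc/>'
    print(detect_codepage(sample))
```

Returning `None` for undetected input leaves the decision to the caller, which mirrors the release notes above where undetected documents are neither moved nor transcoded.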

(This Description is auto-translated)