Re: [Corpora-List] determining the correct character encoding

From: Alexander Schutz (goalscoringsuperstarhero@gmail.com)
Date: Wed Oct 12 2005 - 12:27:23 MET DST


    Dear List,

    here is a short summary of the contributions regarding my Java
    charset-detection trouble:

    Peter Adolphs suggested having a look at
    http://glaforge.free.fr/wiki/index.php?wiki=GuessEncoding

    David Evans proposed using jchardet, the Java port of the
    Mozilla charset detection, to be found at
    http://jchardet.sourceforge.net/index.html#4
    I found it to be more customizable than the first one.
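
    For readers who only need a rough first check before reaching for
    one of the libraries above, here is a minimal sketch using nothing
    but the JDK. It is not how jchardet or GuessEncoding work: it simply
    tries to decode the bytes strictly under each candidate charset and
    keeps the ones that raise no error. The class name CharsetGuess and
    its method names are made up for illustration.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of jchardet or GuessEncoding.
public class CharsetGuess {

    /** Returns true if the bytes decode under the charset without any error. */
    static boolean decodesCleanly(byte[] data, Charset cs) {
        // REPORT makes the decoder throw instead of silently substituting U+FFFD.
        CharsetDecoder decoder = cs.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(data));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    /** Keeps the candidates, in the given order, that decode the data cleanly. */
    static List<Charset> plausibleCharsets(byte[] data, List<Charset> candidates) {
        List<Charset> result = new ArrayList<>();
        for (Charset cs : candidates) {
            if (decodesCleanly(data, cs)) {
                result.add(cs);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // "Saarbrücken" encoded as ISO-8859-1; the escape avoids source-encoding issues.
        byte[] sample = "Saarbr\u00fccken".getBytes(StandardCharsets.ISO_8859_1);
        List<Charset> candidates = Arrays.asList(
                StandardCharsets.US_ASCII,
                StandardCharsets.UTF_8,
                StandardCharsets.ISO_8859_1);
        System.out.println(plausibleCharsets(sample, candidates));
    }
}
```

    Note that this can only rule charsets out. ISO-8859-1 accepts every
    byte sequence, so it always survives the check, which is why the
    statistical detectors mentioned above are still needed to tell
    single-byte encodings apart.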

    Thank you very much for contributing; it has already been of
    great help :-)

    Alex

    --
    Alexander Schutz
    Student of Computational Linguistics
    University of Saarland, Germany
    


