Re: [Corpora-List] DTD for HTML documents?

From: Peter Adolphs (peter.adolphs@student.hu-berlin.de)
Date: Fri Jun 13 2003 - 13:32:16 MET DST


    wassim souayah wrote:
    > I'm attempting to convert HTML documents to XML.
    >
    > Could someone help me find a DTD for HTML documents
    > (if one exists)?

    Why do you need a DTD to convert HTML to XML?

    You could use HTML Tidy to convert your HTML files to XHTML (which is an
    XML format). If you want to process those files further, you could use XSLT.

    See
    http://www.w3.org/People/Raggett/tidy/
    http://www.w3.org/TR/xslt
    http://www.w3.org/MarkUp/ (XHTML and HTML)
    http://xml.apache.org/xalan-j/index.html (an XSLT processor)
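
As a rough illustration of the route suggested above, here is a minimal Python sketch. It assumes the `tidy` command-line tool is installed and on the PATH; the helper names `tidy_to_xhtml` and `paragraph_texts` and the sample document are made up for this example. The first function shells out to Tidy to produce XHTML, and the second shows that the result can then be read by an ordinary XML parser without any HTML-specific DTD handling.

```python
import subprocess
import xml.etree.ElementTree as ET

# Namespace that XHTML elements live in.
XHTML_NS = "http://www.w3.org/1999/xhtml"


def tidy_to_xhtml(html_path, xhtml_path):
    """Convert an HTML file to XHTML with the tidy command-line tool.

    Assumes a `tidy` executable is available on the PATH.
    """
    # -asxhtml: write XHTML; -numeric: use numeric instead of named
    # entities, so a plain XML parser does not need the XHTML DTD;
    # -quiet: suppress non-essential output; -output: target file.
    result = subprocess.run(
        ["tidy", "-quiet", "-asxhtml", "-numeric",
         "-output", xhtml_path, html_path],
        check=False,
    )
    # Tidy exits with 0 (no problems), 1 (warnings) or 2 (errors).
    if result.returncode > 1:
        raise RuntimeError(f"tidy reported errors for {html_path}")


def paragraph_texts(xhtml_source):
    """Return the text of every <p> element in an XHTML string."""
    root = ET.fromstring(xhtml_source)
    return [
        "".join(p.itertext()).strip()
        for p in root.iter(f"{{{XHTML_NS}}}p")
    ]


# Hypothetical sample, standing in for Tidy's output.
SAMPLE = """<html xmlns="http://www.w3.org/1999/xhtml">
  <head><title>Sample</title></head>
  <body>
    <p>First <b>bold</b> paragraph.</p>
    <p>Second paragraph.</p>
  </body>
</html>"""

if __name__ == "__main__":
    print(paragraph_texts(SAMPLE))
```

Once the files are well-formed XHTML, the same documents could equally be handed to an XSLT processor such as Xalan instead of being walked by hand as above.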

    Best regards,
    Peter Adolphs.


