realbasic-nug
[Top] [All Lists]

Re: XHTML Parser

To: REALbasic NUG <realbasic-nug at lists dot realsoftware dot com>
Subject: Re: XHTML Parser
From: dda <headspin at gmail dot com>
Date: Sun, 31 Jul 2005 18:58:42 +0200
Delivered-to: realbasic-nug at lists dot realsoftware dot com
Domainkey-signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:reply-to:to:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=SW9TstWhE0s15UGpNKf685m6XCYPovm+YJyjyuKi17o5j8/PeFSCHlqrGqpxvm9KeIhDGs0458j2PFysG9XFfmW754SzgK9xWEwPG2Q7j6eVyyf/BbuXxUDyQ98YU504u32IBYE43Px2hoZK1v7KXbRJ6umFN2Cl0a84UdrVenU=
References: <54F30332-30BA-4F96-9D6A-E363D16FD345 at thezaz dot com>
Hmmm, xmllint [included in libxml2] parses xhtml. I did some
experimenting a few days ago with tidy and xmllint [tidy to output
some "safe" xhtml], and xmllint's mini shell provided what I needed –
which was little.

OTOH, I haven't played with libxml2 in a while, so I dunno how hard
declaring into it would be. But in any case, I'd advise you to tidy
your xhtml first to be sure it doesn't make the parser bork.

HTH


-- 
dda 
http://sungnyemun.org/
RBDeveloper Columnist, "Beyond the Limits"
http://rbdeveloper.com

Liste Française Solutions RB
http://www.solutionsrb.com/


On 7/31/05, Thom McGrath <lists at thezaz dot com> wrote:
> I need to parse XHTML. Thought that the built-in XML parser would do
> the trick, but it fails. Is there something out there?
> 
> --
> Thom McGrath
_______________________________________________
Unsubscribe or switch delivery mode:
<http://www.realsoftware.com/support/listmanager/>

Search the archives of this list here:
<http://support.realsoftware.com/listarchives/lists.html>
<Prev in Thread] Current Thread [Next in Thread>