Wednesday, April 20, 2005

Java theory and practice: Screen-scraping with XQuery

Java theory and practice: Screen-scraping with XQuery: "XQuery makes light work of HTML extraction and transformation

Level: Introductory

Brian Goetz (brian@quiotix.com)
Principal Consultant, Quiotix
22 Mar 2005

Column iconXQuery is a W3C standard for extracting information from XML documents, currently spanning 14 working drafts. While the majority of interest in XQuery is centered around querying large bases of semi-structured document data, XQuery can be surprisingly effective for some much more mundane uses as well. In this month's Java theory and practice, columnist Brian Goetz shows you how XQuery can be used effectively as an HTML screen-scraping engine."

No comments: