Java theory and practice: Screen-scraping with XQuery: "XQuery makes light work of HTML extraction and transformation
Level: Introductory
Brian Goetz (brian@quiotix.com)
Principal Consultant, Quiotix
22 Mar 2005
Column iconXQuery is a W3C standard for extracting information from XML documents, currently spanning 14 working drafts. While the majority of interest in XQuery is centered around querying large bases of semi-structured document data, XQuery can be surprisingly effective for some much more mundane uses as well. In this month's Java theory and practice, columnist Brian Goetz shows you how XQuery can be used effectively as an HTML screen-scraping engine."
No comments:
Post a Comment