Package org.apache.any23.plugin.htmlscraper

The HTMLScraperExtractor is a special extractor to scrape textual content from a generic HTML pages.

See:
          Description

Class Summary
HTMLScraperExtractor Implementation of content extractor for performing HTML scraping.
HTMLScraperPlugin Implementation of ExtractorPlugin based on the BoilerPipe Library.
 

Package org.apache.any23.plugin.htmlscraper Description

The HTMLScraperExtractor is a special extractor to scrape textual content from a generic HTML pages.



Copyright © 2010-2012 The Apache Software Foundation. All Rights Reserved.