org.apache.nutch.parse
Class ParseFilters

java.lang.Object
  extended by org.apache.nutch.parse.ParseFilters

public class ParseFilters
extends Object

Creates and caches ParseFilter implementing plugins.


Field Summary
static String HTMLPARSEFILTER_ORDER
           
 
Constructor Summary
ParseFilters(org.apache.hadoop.conf.Configuration conf)
           
 
Method Summary
 Parse filter(String url, WebPage page, Parse parse, HTMLMetaTags metaTags, DocumentFragment doc)
          Run all defined filters.
 Collection<WebPage.Field> getFields()
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

HTMLPARSEFILTER_ORDER

public static final String HTMLPARSEFILTER_ORDER
See Also:
Constant Field Values
Constructor Detail

ParseFilters

public ParseFilters(org.apache.hadoop.conf.Configuration conf)
Method Detail

filter

public Parse filter(String url,
                    WebPage page,
                    Parse parse,
                    HTMLMetaTags metaTags,
                    DocumentFragment doc)
Run all defined filters.


getFields

public Collection<WebPage.Field> getFields()


Copyright © 2013 The Apache Software Foundation