|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.apache.nutch.microformats.reltag.RelTagIndexingFilter
public class RelTagIndexingFilter
An IndexingFilter that adds tag
field(s) to the document.
| Field Summary |
|---|
| Fields inherited from interface org.apache.nutch.indexer.IndexingFilter |
|---|
X_POINT_ID |
| Constructor Summary | |
|---|---|
RelTagIndexingFilter()
|
|
| Method Summary | |
|---|---|
NutchDocument |
filter(NutchDocument doc,
String url,
WebPage page)
The RelTagIndexingFilter filter object. |
Configuration |
getConf()
Get the Configuration object |
Collection<WebPage.Field> |
getFields()
Gets all the fields for a given WebPage
Many datastores need to setup the mapreduce job by specifying the fields
needed. |
void |
setConf(Configuration conf)
Set the Configuration object |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public RelTagIndexingFilter()
| Method Detail |
|---|
public Collection<WebPage.Field> getFields()
WebPage
Many datastores need to setup the mapreduce job by specifying the fields
needed. All extensions that work on WebPage are able to specify what fields
they need.
getFields in interface FieldPluggablepublic void setConf(Configuration conf)
Configuration object
setConf in interface Configurablepublic Configuration getConf()
Configuration object
getConf in interface Configurable
public NutchDocument filter(NutchDocument doc,
String url,
WebPage page)
throws IndexingException
RelTagIndexingFilter filter object.
filter in interface IndexingFilterdoc - The NutchDocument objecturl - URL to be filtered for rel-tag'spage - WebPage object relative to the URL
IndexingException
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||