Package org.apache.nutch.parse.headings
Parse filter to extract headings (h1, h2, etc.) from DOM parse tree.
-
Class Summary Class Description HeadingsParseFilter HtmlParseFilter to retrieve h1 and h2 values from the DOM.
Class | Description |
---|---|
HeadingsParseFilter |
HtmlParseFilter to retrieve h1 and h2 values from the DOM.
|