Uses of Interface
org.apache.nutch.plugin.Pluggable

Packages that use Pluggable
org.apache.nutch.analysis Tokenizer for documents and query parser. 
org.apache.nutch.analysis.lang Text document language identifier. 
org.apache.nutch.clustering   
org.apache.nutch.clustering.carrot2   
org.apache.nutch.indexer Maintain Lucene full-text indexes. 
org.apache.nutch.indexer.basic A basic indexing plugin. 
org.apache.nutch.indexer.field   
org.apache.nutch.indexer.more A more indexing plugin. 
org.apache.nutch.microformats.reltag A microformats Rel-Tag Parser/Indexer/Querier plugin. 
org.apache.nutch.net   
org.apache.nutch.ontology   
org.apache.nutch.ontology.jena   
org.apache.nutch.parse   
org.apache.nutch.parse.ext   
org.apache.nutch.parse.html An HTML document parsing plugin. 
org.apache.nutch.parse.js   
org.apache.nutch.parse.ms Common API for Microsoft © documents parsing. 
org.apache.nutch.parse.msexcel A Microsoft © Excel document parsing plugin. 
org.apache.nutch.parse.mspowerpoint A Microsoft © PowerPoint document parsing plugin. 
org.apache.nutch.parse.msword A Microsoft © Word document parsing plugin. 
org.apache.nutch.parse.oo   
org.apache.nutch.parse.pdf A PDF parsing plugin. 
org.apache.nutch.parse.rss   
org.apache.nutch.parse.swf   
org.apache.nutch.parse.text A plain text parsing plugin. 
org.apache.nutch.parse.zip   
org.apache.nutch.protocol   
org.apache.nutch.protocol.file Protocol plugin which supports retrieving local file resources. 
org.apache.nutch.protocol.ftp Protocol plugin which supports retrieving documents via the ftp protocol. 
org.apache.nutch.protocol.http Protocol plugin which supports retrieving documents via the http protocol. 
org.apache.nutch.protocol.http.api Common API used by HTTP plugins (http, httpclient
org.apache.nutch.scoring   
org.apache.nutch.scoring.opic   
org.apache.nutch.searcher Search API 
org.apache.nutch.searcher.basic   
org.apache.nutch.searcher.more A more query plugin. 
org.apache.nutch.searcher.response   
org.apache.nutch.searcher.site   
org.apache.nutch.searcher.url   
org.apache.nutch.summary.basic A basic summarizer implementation. 
org.apache.nutch.summary.lucene A Lucene Highlighter based summarizer implementation. 
org.apache.nutch.urlfilter.api   
org.apache.nutch.urlfilter.automaton A URL filter plugin based on dk.brics.automaton Finite-State Automata for Java™
org.apache.nutch.urlfilter.prefix A URL filter plugin. 
org.apache.nutch.urlfilter.regex A URL filter plugin. 
org.creativecommons.nutch Sample plugins that parse and index Creative Commons metadata. 
 

Uses of Pluggable in org.apache.nutch.analysis
 

Classes in org.apache.nutch.analysis that implement Pluggable
 class NutchAnalyzer
          Extension point for analysis.
 class NutchDocumentAnalyzer
          The analyzer used for Nutch documents.
 

Uses of Pluggable in org.apache.nutch.analysis.lang
 

Classes in org.apache.nutch.analysis.lang that implement Pluggable
 class HTMLLanguageParser
          Adds metadata identifying the language of the document, if found. We could also run statistical analysis here, but we'd miss all other formats.
 class LanguageIndexingFilter
          An IndexingFilter that adds a lang (language) field to the document.
 class LanguageQueryFilter
          Handles "lang:" query clauses, causing them to search the "lang" field indexed by LanguageIdentifier.
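The "field:" query clauses mentioned here (and in the other query filters on this page) can be pictured with a small, self-contained sketch. It only shows the general idea of separating fielded clauses such as lang:en from plain terms; the class name FieldClauseSketch and its fields are hypothetical, and nothing below is taken from the Nutch query parser or the QueryFilter API.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical helper: splits a query into "field:value" clauses and plain terms. */
class FieldClauseSketch {
    /** Field name -> values, in the order they appeared in the query. */
    final Map<String, List<String>> fielded = new LinkedHashMap<>();
    /** Terms that carry no field prefix. */
    final List<String> plainTerms = new ArrayList<>();

    FieldClauseSketch(String query) {
        for (String token : query.trim().split("\\s+")) {
            if (token.isEmpty()) {
                continue; // an empty query yields one empty token
            }
            int colon = token.indexOf(':');
            // Treat the token as fielded only if text exists on both sides of the colon.
            // Limitation of this sketch: a bare URL such as http://x would also match.
            if (colon > 0 && colon < token.length() - 1) {
                String field = token.substring(0, colon);
                String value = token.substring(colon + 1);
                fielded.computeIfAbsent(field, k -> new ArrayList<>()).add(value);
            } else {
                plainTerms.add(token);
            }
        }
    }

    public static void main(String[] args) {
        FieldClauseSketch q = new FieldClauseSketch("lang:en apache nutch");
        System.out.println("fielded: " + q.fielded);
        System.out.println("plain:   " + q.plainTerms);
    }
}
```

A filter responsible for one field would then look only at its own key (for example "lang") and translate those values into a search against the same-named index field.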
 

Uses of Pluggable in org.apache.nutch.clustering
 

Subinterfaces of Pluggable in org.apache.nutch.clustering
 interface OnlineClusterer
          An extension point interface for online search results clustering algorithms.
 

Uses of Pluggable in org.apache.nutch.clustering.carrot2
 

Classes in org.apache.nutch.clustering.carrot2 that implement Pluggable
 class Clusterer
          This plugin provides an implementation of OnlineClusterer extension using clustering components of the Carrot2 project (http://www.carrot2.org).
 

Uses of Pluggable in org.apache.nutch.indexer
 

Subinterfaces of Pluggable in org.apache.nutch.indexer
 interface IndexingFilter
          Extension point for indexing.
 

Uses of Pluggable in org.apache.nutch.indexer.basic
 

Classes in org.apache.nutch.indexer.basic that implement Pluggable
 class BasicIndexingFilter
          Adds basic searchable fields to a document.
 

Uses of Pluggable in org.apache.nutch.indexer.field
 

Subinterfaces of Pluggable in org.apache.nutch.indexer.field
 interface FieldFilter
          Filter to manipulate FieldWritable objects for a given url during indexing.
 

Uses of Pluggable in org.apache.nutch.indexer.more
 

Classes in org.apache.nutch.indexer.more that implement Pluggable
 class MoreIndexingFilter
          Add (or reset) a few metaData properties as respective fields (if they are available), so that they can be displayed by more.jsp (called by search.jsp).
 

Uses of Pluggable in org.apache.nutch.microformats.reltag
 

Classes in org.apache.nutch.microformats.reltag that implement Pluggable
 class RelTagIndexingFilter
          An IndexingFilter that adds tag field(s) to the document.
 class RelTagParser
          Adds microformat rel-tags of document if found.
 class RelTagQueryFilter
          Handles "tag:" query clauses.
 

Uses of Pluggable in org.apache.nutch.net
 

Subinterfaces of Pluggable in org.apache.nutch.net
 interface URLFilter
          Interface used to limit which URLs enter Nutch.
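As a rough illustration of what limiting which URLs enter a crawl can look like, the sketch below models a filter as a function that returns the URL to accept it or null to reject it, and chains several such filters. The names UrlFilterSketch and UrlFilterChain and the two sample rules are hypothetical; consult the URLFilter Javadoc itself for the actual method signature.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical stand-in for a URL filter: return the URL to keep it, or null to drop it. */
interface UrlFilterSketch {
    String filter(String url);
}

/** Hypothetical chain that applies several filters in order. */
class UrlFilterChain {
    private final List<UrlFilterSketch> filters;

    UrlFilterChain(UrlFilterSketch... filters) {
        this.filters = Arrays.asList(filters);
    }

    /** Passes the URL through each filter in turn; the first null result rejects it. */
    String filter(String url) {
        String current = url;
        for (UrlFilterSketch f : filters) {
            current = f.filter(current);
            if (current == null) {
                return null;
            }
        }
        return current;
    }

    public static void main(String[] args) {
        // Sample rules, chosen only for illustration.
        UrlFilterSketch httpOnly =
            u -> (u.startsWith("http://") || u.startsWith("https://")) ? u : null;
        UrlFilterSketch noQueryString = u -> u.contains("?") ? null : u;

        UrlFilterChain chain = new UrlFilterChain(httpOnly, noQueryString);
        System.out.println(chain.filter("http://example.com/index.html"));
        System.out.println(chain.filter("ftp://example.com/file"));
    }
}
```

Returning the URL rather than a boolean leaves room for a filter to hand back a modified URL, although this sketch does not use that.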
 

Uses of Pluggable in org.apache.nutch.ontology
 

Subinterfaces of Pluggable in org.apache.nutch.ontology
 interface Ontology
           
 

Uses of Pluggable in org.apache.nutch.ontology.jena
 

Classes in org.apache.nutch.ontology.jena that implement Pluggable
 class OntologyImpl
          This class wraps a model built from a list of ontologies; it uses HP's Jena.
 

Uses of Pluggable in org.apache.nutch.parse
 

Subinterfaces of Pluggable in org.apache.nutch.parse
 interface HtmlParseFilter
          Extension point for DOM-based HTML parsers.
 interface Parser
          A parser for content generated by a Protocol implementation.
 

Uses of Pluggable in org.apache.nutch.parse.ext
 

Classes in org.apache.nutch.parse.ext that implement Pluggable
 class ExtParser
          A wrapper that invokes an external command to do the actual parsing.
 

Uses of Pluggable in org.apache.nutch.parse.html
 

Classes in org.apache.nutch.parse.html that implement Pluggable
 class HtmlParser
           
 

Uses of Pluggable in org.apache.nutch.parse.js
 

Classes in org.apache.nutch.parse.js that implement Pluggable
 class JSParseFilter
          This class is a heuristic link extractor for JavaScript files and code snippets.
 

Uses of Pluggable in org.apache.nutch.parse.ms
 

Classes in org.apache.nutch.parse.ms that implement Pluggable
 class MSBaseParser
          A generic Microsoft document parser.
 

Uses of Pluggable in org.apache.nutch.parse.msexcel
 

Classes in org.apache.nutch.parse.msexcel that implement Pluggable
 class MSExcelParser
          An Excel document parser.
 

Uses of Pluggable in org.apache.nutch.parse.mspowerpoint
 

Classes in org.apache.nutch.parse.mspowerpoint that implement Pluggable
 class MSPowerPointParser
          Nutch parser for MS PowerPoint slides (mime type: application/vnd.ms-powerpoint).
 

Uses of Pluggable in org.apache.nutch.parse.msword
 

Classes in org.apache.nutch.parse.msword that implement Pluggable
 class MSWordParser
          Parser for mime type application/msword.
 

Uses of Pluggable in org.apache.nutch.parse.oo
 

Classes in org.apache.nutch.parse.oo that implement Pluggable
 class OOParser
          Parser for OpenOffice and OpenDocument formats.
 

Uses of Pluggable in org.apache.nutch.parse.pdf
 

Classes in org.apache.nutch.parse.pdf that implement Pluggable
 class PdfParser
          Parser for mime type application/pdf.
 

Uses of Pluggable in org.apache.nutch.parse.rss
 

Classes in org.apache.nutch.parse.rss that implement Pluggable
 class RSSParser
           
 

Uses of Pluggable in org.apache.nutch.parse.swf
 

Classes in org.apache.nutch.parse.swf that implement Pluggable
 class SWFParser
          Parser for Flash SWF files.
 

Uses of Pluggable in org.apache.nutch.parse.text
 

Classes in org.apache.nutch.parse.text that implement Pluggable
 class TextParser
           
 

Uses of Pluggable in org.apache.nutch.parse.zip
 

Classes in org.apache.nutch.parse.zip that implement Pluggable
 class ZipParser
          ZipParser class based on MSPowerPointParser class by Stephan Strittmatter.
 

Uses of Pluggable in org.apache.nutch.protocol
 

Subinterfaces of Pluggable in org.apache.nutch.protocol
 interface Protocol
          A retriever of url content.
 

Uses of Pluggable in org.apache.nutch.protocol.file
 

Classes in org.apache.nutch.protocol.file that implement Pluggable
 class File
          File.java deals with file: scheme.
 

Uses of Pluggable in org.apache.nutch.protocol.ftp
 

Classes in org.apache.nutch.protocol.ftp that implement Pluggable
 class Ftp
          Ftp.java deals with ftp: scheme.
 

Uses of Pluggable in org.apache.nutch.protocol.http
 

Classes in org.apache.nutch.protocol.http that implement Pluggable
 class Http
           
 

Uses of Pluggable in org.apache.nutch.protocol.http.api
 

Classes in org.apache.nutch.protocol.http.api that implement Pluggable
 class HttpBase
           
 

Uses of Pluggable in org.apache.nutch.scoring
 

Subinterfaces of Pluggable in org.apache.nutch.scoring
 interface ScoringFilter
          A contract defining behavior of scoring plugins.
 

Classes in org.apache.nutch.scoring that implement Pluggable
 class ScoringFilters
          Creates and caches ScoringFilter implementing plugins.
 

Uses of Pluggable in org.apache.nutch.scoring.opic
 

Classes in org.apache.nutch.scoring.opic that implement Pluggable
 class OPICScoringFilter
          This plugin implements a variant of an Online Page Importance Computation (OPIC) score, described in this paper: Abiteboul, Serge and Preda, Mihai and Cobena, Gregory (2003), Adaptive On-Line Page Importance Computation.
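For readers unfamiliar with OPIC, the following is a minimal sketch of the basic cash-and-history idea from the cited paper, not of the variant this plugin implements. Each page holds some "cash"; visiting a page records that cash in its history and splits it equally among the page's outlinks. The class OpicSketch, its fields, and the closed three-page graph are assumptions made for illustration, and the sketch assumes every page has at least one outlink and every link target is itself in the graph.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch of basic OPIC bookkeeping on a small, closed link graph. */
class OpicSketch {
    final Map<String, Double> cash = new HashMap<>();
    final Map<String, Double> history = new HashMap<>();
    private final Map<String, List<String>> outlinks;

    /** Starts every page with an equal share of a total cash of 1.0. */
    OpicSketch(Map<String, List<String>> outlinks) {
        this.outlinks = outlinks;
        double initial = 1.0 / outlinks.size();
        for (String page : outlinks.keySet()) {
            cash.put(page, initial);
            history.put(page, 0.0);
        }
    }

    /** Records the page's cash in its history, then distributes it equally to its outlinks. */
    void visit(String page) {
        List<String> targets = outlinks.get(page);
        if (targets == null || targets.isEmpty()) {
            throw new IllegalStateException("sketch assumes every page has outlinks: " + page);
        }
        double amount = cash.get(page);
        history.merge(page, amount, Double::sum);
        cash.put(page, 0.0);
        double share = amount / targets.size();
        for (String target : targets) {
            cash.merge(target, share, Double::sum);
        }
    }

    /** Importance estimate: this page's history plus cash, relative to the totals. */
    double estimate(String page) {
        double total = 0.0;
        for (String p : outlinks.keySet()) {
            total += history.get(p) + cash.get(p);
        }
        return (history.get(page) + cash.get(page)) / total;
    }

    public static void main(String[] args) {
        Map<String, List<String>> graph = new HashMap<>();
        graph.put("a", List.of("b", "c"));
        graph.put("b", List.of("c"));
        graph.put("c", List.of("a"));

        OpicSketch opic = new OpicSketch(graph);
        for (int round = 0; round < 50; round++) {
            for (String page : new String[] {"a", "b", "c"}) {
                opic.visit(page);
            }
        }
        for (String page : new String[] {"a", "b", "c"}) {
            System.out.println(page + " " + opic.estimate(page));
        }
    }
}
```

Because visiting only moves cash around, the total cash in the graph stays constant, and pages that receive more links accumulate more history over time.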
 

Uses of Pluggable in org.apache.nutch.searcher
 

Subinterfaces of Pluggable in org.apache.nutch.searcher
 interface QueryFilter
          Extension point for query translation.
 interface Summarizer
          Extension point for summarizer.
 

Classes in org.apache.nutch.searcher that implement Pluggable
 class FieldQueryFilter
          Translate query fields to search the same-named field, as indexed by an IndexingFilter.
 class RawFieldQueryFilter
          Translate raw query fields to search the same-named field, as indexed by an IndexingFilter.
 

Uses of Pluggable in org.apache.nutch.searcher.basic
 

Classes in org.apache.nutch.searcher.basic that implement Pluggable
 class BasicQueryFilter
          The default query filter.
 

Uses of Pluggable in org.apache.nutch.searcher.more
 

Classes in org.apache.nutch.searcher.more that implement Pluggable
 class DateQueryFilter
          Handles "date:" query clauses, causing them to search the field "date" indexed by MoreIndexingFilter.
 class TypeQueryFilter
          Handles "type:" query clauses, causing them to search the field indexed by MoreIndexingFilter.
 

Uses of Pluggable in org.apache.nutch.searcher.response
 

Subinterfaces of Pluggable in org.apache.nutch.searcher.response
 interface ResponseWriter
          Nutch extension point which allows writing search results in many different output formats.
 

Uses of Pluggable in org.apache.nutch.searcher.site
 

Classes in org.apache.nutch.searcher.site that implement Pluggable
 class SiteQueryFilter
          Handles "site:" query clauses, causing them to search the field indexed by SiteIndexingFilter.
 

Uses of Pluggable in org.apache.nutch.searcher.url
 

Classes in org.apache.nutch.searcher.url that implement Pluggable
 class URLQueryFilter
          Handles "url:" query clauses, causing them to search the field indexed by BasicIndexingFilter.
 

Uses of Pluggable in org.apache.nutch.summary.basic
 

Classes in org.apache.nutch.summary.basic that implement Pluggable
 class BasicSummarizer
          Implements hit summarization.
 

Uses of Pluggable in org.apache.nutch.summary.lucene
 

Classes in org.apache.nutch.summary.lucene that implement Pluggable
 class LuceneSummarizer
          Implements hit summarization.
 

Uses of Pluggable in org.apache.nutch.urlfilter.api
 

Classes in org.apache.nutch.urlfilter.api that implement Pluggable
 class RegexURLFilterBase
          Generic URL filter based on regular expressions.
 

Uses of Pluggable in org.apache.nutch.urlfilter.automaton
 

Classes in org.apache.nutch.urlfilter.automaton that implement Pluggable
 class AutomatonURLFilter
          RegexURLFilterBase implementation based on the dk.brics.automaton Finite-State Automata for Java™.
 

Uses of Pluggable in org.apache.nutch.urlfilter.prefix
 

Classes in org.apache.nutch.urlfilter.prefix that implement Pluggable
 class PrefixURLFilter
          Filters URLs based on a file of URL prefixes.
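Prefix-based filtering can be sketched in a few lines. The example below is an illustration only: it takes the contents of a prefix file as a string, assumes one prefix per line with blank lines and lines starting with # ignored, and uses a plain linear scan. The class name PrefixFilterSketch, the file format details, and the scan strategy are assumptions, not a description of how PrefixURLFilter is implemented.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch: accept a URL only if it starts with one of the listed prefixes. */
class PrefixFilterSketch {
    private final List<String> prefixes = new ArrayList<>();

    /** Parses prefix-file contents; assumed format is one prefix per line, # for comments. */
    PrefixFilterSketch(String fileContents) {
        for (String raw : fileContents.split("\\R")) {
            String line = raw.trim();
            if (line.isEmpty() || line.startsWith("#")) {
                continue;
            }
            prefixes.add(line);
        }
    }

    /** Returns the URL if any prefix matches, or null to reject it. */
    String filter(String url) {
        for (String prefix : prefixes) {
            if (url.startsWith(prefix)) {
                return url;
            }
        }
        return null;
    }

    public static void main(String[] args) {
        String rules = "# allowed sites\nhttp://example.com/\nhttp://example.org/docs/\n";
        PrefixFilterSketch filter = new PrefixFilterSketch(rules);
        System.out.println(filter.filter("http://example.com/index.html"));
        System.out.println(filter.filter("http://example.net/"));
    }
}
```

With many prefixes, a trie or sorted structure would avoid scanning the whole list for every URL; the linear scan is kept here for readability.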
 

Uses of Pluggable in org.apache.nutch.urlfilter.regex
 

Classes in org.apache.nutch.urlfilter.regex that implement Pluggable
 class RegexURLFilter
          Filters URLs based on a file of regular expressions using the Java Regex implementation.
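To make the idea of a regular-expression rule file concrete, here is a small sketch built on java.util.regex. It assumes a rule format in which each line starts with + (accept) or - (reject) followed by a pattern, the first matching rule decides, and a URL that matches no rule is rejected. That format, the class name RegexFilterSketch, and the sample rules are assumptions for illustration; check the plugin's own configuration documentation for the actual syntax.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

/** Hypothetical sketch: first matching +/- regex rule decides whether a URL is kept. */
class RegexFilterSketch {
    private static final class Rule {
        final boolean accept;
        final Pattern pattern;

        Rule(boolean accept, Pattern pattern) {
            this.accept = accept;
            this.pattern = pattern;
        }
    }

    private final List<Rule> rules = new ArrayList<>();

    /** Parses rule-file contents; assumed format is "+regex" or "-regex" per line, # for comments. */
    RegexFilterSketch(String fileContents) {
        for (String raw : fileContents.split("\\R")) {
            String line = raw.trim();
            if (line.isEmpty() || line.startsWith("#")) {
                continue;
            }
            char sign = line.charAt(0);
            if (sign != '+' && sign != '-') {
                throw new IllegalArgumentException("Rule must start with + or -: " + line);
            }
            rules.add(new Rule(sign == '+', Pattern.compile(line.substring(1))));
        }
    }

    /** Returns the URL if the first matching rule is an accept rule, otherwise null. */
    String filter(String url) {
        for (Rule rule : rules) {
            // find() matches anywhere in the URL; anchor with ^ and $ inside the pattern.
            if (rule.pattern.matcher(url).find()) {
                return rule.accept ? url : null;
            }
        }
        return null; // no rule matched
    }

    public static void main(String[] args) {
        String rules = "# skip images, then allow one site\n"
            + "-\\.(gif|jpg|png)$\n"
            + "+^http://example\\.com/\n";
        RegexFilterSketch filter = new RegexFilterSketch(rules);
        System.out.println(filter.filter("http://example.com/page.html"));
        System.out.println(filter.filter("http://example.com/logo.gif"));
    }
}
```

Rule order matters in this scheme: the image rule must come before the site rule, or image URLs on the allowed site would be accepted first.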
 

Uses of Pluggable in org.creativecommons.nutch
 

Classes in org.creativecommons.nutch that implement Pluggable
 class CCIndexingFilter
          Adds basic searchable fields to a document.
 class CCParseFilter
          Adds metadata identifying the Creative Commons license used, if any.
 class CCQueryFilter
          Handles "cc:" query clauses, causing them to search the "cc" field indexed by CCIndexingFilter.
 



Copyright © 2006 The Apache Software Foundation