Uses of Interface

Packages that use Pluggable
org.apache.nutch.analysis.lang Text document language identifier. 
org.apache.nutch.collection Subcollection is a subset of an index. 
org.apache.nutch.indexer Maintain Lucene full-text indexes. 
org.apache.nutch.indexer.anchor An indexing plugin for inbound anchor text. 
org.apache.nutch.indexer.basic A basic indexing plugin. 
org.apache.nutch.indexer.more A more indexing plugin. 
org.apache.nutch.indexer.tld Top Level Domain Indexing plugin. 
org.apache.nutch.microformats.reltag A microformats Rel-Tag Parser/Indexer/Querier plugin.   
org.apache.nutch.parse.html An HTML document parsing plugin. 
org.apache.nutch.parse.js A parser plugin and content filter to extract all (possible) links from JavaScript files and code snippets. 
org.apache.nutch.plugin The Nutch Plugin System. 
org.apache.nutch.protocol.file Protocol plugin which supports retrieving local file resources. 
org.apache.nutch.protocol.ftp Protocol plugin which supports retrieving documents via the ftp protocol. 
org.apache.nutch.protocol.http Protocol plugin which supports retrieving documents via the http protocol. 
org.apache.nutch.protocol.http.api Common API used by HTTP plugins (http, httpclient
org.apache.nutch.protocol.sftp Protocol plugin which supports retrieving documents via the sftp protocol. 
org.apache.nutch.scoring.tld Top Level Domain Scoring plugin. 
org.apache.nutch.urlfilter.automaton A url filter plugin based on dk.brics.automaton Finite-State Automata for JavaTM
org.apache.nutch.urlfilter.domain A url filter plugin that filters by domain. 
org.apache.nutch.urlfilter.prefix A url filter plugin. 
org.apache.nutch.urlfilter.regex A url filter plugin. 
org.apache.nutch.urlfilter.validator A url filter plugin that validates given urls. 
org.creativecommons.nutch Sample plugins that parse and index Creative Commons medadata. 

Uses of Pluggable in org.apache.nutch.analysis.lang

Classes in org.apache.nutch.analysis.lang that implement Pluggable
 class HTMLLanguageParser
          Adds metadata identifying language of document if found We could also run statistical analysis here but we'd miss all other formats
 class LanguageIndexingFilter
          An IndexingFilter that adds a lang (language) field to the document.

Uses of Pluggable in org.apache.nutch.collection

Classes in org.apache.nutch.collection that implement Pluggable
 class Subcollection
          SubCollection represents a subset of index, you can define url patterns that will indicate that particular page (url) is part of SubCollection.

Uses of Pluggable in org.apache.nutch.indexer

Subinterfaces of Pluggable in org.apache.nutch.indexer
 interface IndexingFilter
          Extension point for indexing.

Uses of Pluggable in org.apache.nutch.indexer.anchor

Classes in org.apache.nutch.indexer.anchor that implement Pluggable
 class AnchorIndexingFilter
          Indexing filter that offers an option to either index all inbound anchor text for a document or deduplicate anchors.

Uses of Pluggable in org.apache.nutch.indexer.basic

Classes in org.apache.nutch.indexer.basic that implement Pluggable
 class BasicIndexingFilter
          Adds basic searchable fields to a document.

Uses of Pluggable in org.apache.nutch.indexer.feed

Classes in org.apache.nutch.indexer.feed that implement Pluggable
 class FeedIndexingFilter

Uses of Pluggable in org.apache.nutch.indexer.more

Classes in org.apache.nutch.indexer.more that implement Pluggable
 class MoreIndexingFilter
          Add (or reset) a few metaData properties as respective fields (if they are available), so that they can be accurately used within the search index.

Uses of Pluggable in org.apache.nutch.indexer.subcollection

Classes in org.apache.nutch.indexer.subcollection that implement Pluggable
 class SubcollectionIndexingFilter

Uses of Pluggable in org.apache.nutch.indexer.tld

Classes in org.apache.nutch.indexer.tld that implement Pluggable
 class TLDIndexingFilter
          Adds the Top level domain extensions to the index

Uses of Pluggable in org.apache.nutch.microformats.reltag

Classes in org.apache.nutch.microformats.reltag that implement Pluggable
 class RelTagIndexingFilter
          An IndexingFilter that adds tag field(s) to the document.
 class RelTagParser
          Adds microformat rel-tags of document if found.

Uses of Pluggable in

Subinterfaces of Pluggable in
 interface URLFilter
          Interface used to limit which URLs enter Nutch.

Uses of Pluggable in org.apache.nutch.parse

Subinterfaces of Pluggable in org.apache.nutch.parse
 interface ParseFilter
          Extension point for DOM-based parsers.
 interface Parser
          A parser for content generated by a Protocol implementation.

Uses of Pluggable in org.apache.nutch.parse.ext

Classes in org.apache.nutch.parse.ext that implement Pluggable
 class ExtParser
          A wrapper that invokes external command to do real parsing job.

Uses of Pluggable in org.apache.nutch.parse.feed

Classes in org.apache.nutch.parse.feed that implement Pluggable
 class FeedParser

Uses of Pluggable in org.apache.nutch.parse.html

Classes in org.apache.nutch.parse.html that implement Pluggable
 class HtmlParser

Uses of Pluggable in org.apache.nutch.parse.js

Classes in org.apache.nutch.parse.js that implement Pluggable
 class JSParseFilter
          This class is a heuristic link extractor for JavaScript files and code snippets.

Uses of Pluggable in org.apache.nutch.parse.swf

Classes in org.apache.nutch.parse.swf that implement Pluggable
 class SWFParser
          Parser for Flash SWF files.

Uses of Pluggable in org.apache.nutch.parse.tika

Classes in org.apache.nutch.parse.tika that implement Pluggable
 class TikaParser
          Wrapper for Tika parsers.

Uses of Pluggable in

Classes in that implement Pluggable
 class ZipParser
          ZipParser class based on MSPowerPointParser class by Stephan Strittmatter.

Uses of Pluggable in org.apache.nutch.plugin

Subinterfaces of Pluggable in org.apache.nutch.plugin
 interface FieldPluggable

Uses of Pluggable in org.apache.nutch.protocol

Subinterfaces of Pluggable in org.apache.nutch.protocol
 interface Protocol
          A retriever of url content.

Uses of Pluggable in org.apache.nutch.protocol.file

Classes in org.apache.nutch.protocol.file that implement Pluggable
 class File
          This class is a protocol plugin used for file: scheme.

Uses of Pluggable in org.apache.nutch.protocol.ftp

Classes in org.apache.nutch.protocol.ftp that implement Pluggable
 class Ftp
          This class is a protocol plugin used for ftp: scheme.

Uses of Pluggable in org.apache.nutch.protocol.http

Classes in org.apache.nutch.protocol.http that implement Pluggable
 class Http

Uses of Pluggable in org.apache.nutch.protocol.http.api

Classes in org.apache.nutch.protocol.http.api that implement Pluggable
 class HttpBase

Uses of Pluggable in org.apache.nutch.protocol.sftp

Classes in org.apache.nutch.protocol.sftp that implement Pluggable
 class Sftp
          This class uses the Jsch package to fetch content using the Sftp protocol.

Uses of Pluggable in org.apache.nutch.scoring

Subinterfaces of Pluggable in org.apache.nutch.scoring
 interface ScoringFilter
          A contract defining behavior of scoring plugins.

Classes in org.apache.nutch.scoring that implement Pluggable
 class ScoringFilters
          Creates and caches ScoringFilter implementing plugins.

Uses of Pluggable in

Classes in that implement Pluggable
 class LinkAnalysisScoringFilter

Uses of Pluggable in org.apache.nutch.scoring.opic

Classes in org.apache.nutch.scoring.opic that implement Pluggable
 class OPICScoringFilter
          This plugin implements a variant of an Online Page Importance Computation (OPIC) score, described in this paper: Abiteboul, Serge and Preda, Mihai and Cobena, Gregory (2003), Adaptive On-Line Page Importance Computation .

Uses of Pluggable in org.apache.nutch.scoring.tld

Classes in org.apache.nutch.scoring.tld that implement Pluggable
 class TLDScoringFilter
          Scoring filter to boost tlds.

Uses of Pluggable in org.apache.nutch.urlfilter.api

Classes in org.apache.nutch.urlfilter.api that implement Pluggable
 class RegexURLFilterBase
          Generic URL filter based on regular expressions.

Uses of Pluggable in org.apache.nutch.urlfilter.automaton

Classes in org.apache.nutch.urlfilter.automaton that implement Pluggable
 class AutomatonURLFilter
          RegexURLFilterBase implementation based on the dk.brics.automaton Finite-State Automata for JavaTM.

Uses of Pluggable in org.apache.nutch.urlfilter.domain

Classes in org.apache.nutch.urlfilter.domain that implement Pluggable
 class DomainURLFilter
          Filters URLs based on a file containing domain suffixes, domain names, and hostnames.

Uses of Pluggable in org.apache.nutch.urlfilter.prefix

Classes in org.apache.nutch.urlfilter.prefix that implement Pluggable
 class PrefixURLFilter
          Filters URLs based on a file of URL prefixes.

Uses of Pluggable in org.apache.nutch.urlfilter.regex

Classes in org.apache.nutch.urlfilter.regex that implement Pluggable
 class RegexURLFilter
          Filters URLs based on a file of regular expressions using the Java Regex implementation.

Uses of Pluggable in org.apache.nutch.urlfilter.suffix

Classes in org.apache.nutch.urlfilter.suffix that implement Pluggable
 class SuffixURLFilter
          Filters URLs based on a file of URL suffixes.

Uses of Pluggable in org.apache.nutch.urlfilter.validator

Classes in org.apache.nutch.urlfilter.validator that implement Pluggable
 class UrlValidator
          Validates URLs.

Uses of Pluggable in org.creativecommons.nutch

Classes in org.creativecommons.nutch that implement Pluggable
 class CCIndexingFilter
          Adds basic searchable fields to a document.
 class CCParseFilter
          Adds metadata identifying the Creative Commons license used, if any.

Copyright © 2013 The Apache Software Foundation