Package org.apache.nutch.net
Web-related interfaces: URL
filters
and normalizers
.-
Interface Summary Interface Description URLExemptionFilter Interface used to allow exemptions to external domain resources by overridingdb.ignore.external.links
.URLFilter Interface used to limit which URLs enter Nutch.URLNormalizer Interface used to convert URLs to normal form and optionally perform substitutions -
Class Summary Class Description URLExemptionFilters Creates and cachesURLExemptionFilter
implementing plugins.URLFilterChecker Checks one given filter or all filters.URLFilters Creates and caches plugins implementingURLFilter
.URLNormalizerChecker Checks one given normalizer or all normalizers.URLNormalizers This class uses a "chained filter" pattern to run defined normalizers. -
Exception Summary Exception Description URLFilterException