Package org.apache.nutch.net.urlnormalizer.regex
URL normalizer with configurable rules based on regular expressions
(
Pattern
).-
Class Summary Class Description RegexURLNormalizer Allows users to do regex substitutions on all/any URLs that are encountered, which is useful for stripping session IDs from URLs.