|
Class Summary |
| CrawlDBScanner |
Dumps all the entries matching a regular expression on their URL. |
| DmozParser |
Utility that converts DMOZ RDF into a flat file of URLs to be injected. |
| FreeGenerator |
This tool generates fetchlists (segments to be fetched) from plain text
files containing one URL per line. |
| FreeGenerator.FG |
|
| PruneIndexTool |
This tool prunes existing Nutch indexes of unwanted content. |
| PruneIndexTool.PrintFieldsChecker |
This checker's main function is just to print out
selected field values from each document, just before
they are deleted. |
| PruneIndexTool.StoreUrlsChecker |
This checker's main function is just to store
the URLs of each document to be deleted in a text file. |
| ResolveUrls |
A simple tool that will spin up multiple threads to resolve urls to ip
addresses. |
| SearchLoadTester |
A simple tool to perform load testing on configured search servers. |