Package org.apache.nutch.metrics
Class NutchMetrics
- java.lang.Object
-
- org.apache.nutch.metrics.NutchMetrics
-
public final class NutchMetrics extends Object
Centralized constants for Hadoop metrics counter groups and names.Follows Prometheus naming conventions:
- Counter groups use the
nutch_prefix namespace - Counter names use snake_case
- Accumulating counters use
_totalsuffix - Units are included in counter names where applicable (e.g.,
_bytes)
- Since:
- 1.22
- Counter groups use the
-
-
Field Summary
Fields Modifier and Type Field Description static StringCLEANING_DELETED_DOCUMENTS_TOTALDocuments deleted during cleaning.static StringCRAWLDB_GONE_RECORDS_REMOVED_TOTALGone (404) records removed during CrawlDb operations.static StringCRAWLDB_ORPHAN_RECORDS_REMOVED_TOTALOrphan records removed during CrawlDb operations.static StringCRAWLDB_URLS_FILTERED_TOTALURLs filtered during CrawlDb operations.static StringDEDUP_DOCUMENTS_MARKED_DUPLICATE_TOTALDocuments marked as duplicate.static StringDOMAIN_STATS_EMPTY_RESULT_TOTALEmpty results in domain statistics.static StringDOMAIN_STATS_FETCHED_TOTALFetched URLs in domain statistics.static StringDOMAIN_STATS_NOT_FETCHED_TOTALNot fetched URLs in domain statistics.static StringERROR_INDEXING_TOTALIndexing filter errors.static StringERROR_NETWORK_TOTALNetwork-related errors.static StringERROR_OTHER_TOTALOther uncategorized errors.static StringERROR_PARSING_TOTALParsing errors.static StringERROR_PROTOCOL_TOTALProtocol errors.static StringERROR_SCORING_TOTALScoring filter errors.static StringERROR_TIMEOUT_TOTALTimeout errors.static StringERROR_TOTALTotal errors across all categories.static StringERROR_URL_TOTALURL-related errors.static StringFETCHER_ABOVE_EXCEPTION_THRESHOLD_TOTALURLs dropped due to exception threshold in queue.static StringFETCHER_BYTES_DOWNLOADED_TOTALTotal bytes downloaded by fetcher.static StringFETCHER_FILTERED_TOTALURLs filtered during fetching.static StringFETCHER_HIT_BY_THROUGHPUT_THRESHOLD_TOTALURLs hit by throughput threshold.static StringFETCHER_HIT_BY_TIMELIMIT_TOTALURLs hit by time limit.static StringFETCHER_HIT_BY_TIMEOUT_TOTALURLs hit by timeout.static StringFETCHER_HUNG_THREADS_TOTALThreads that hung during fetching.static StringFETCHER_LATENCYPrefix for fetch latency metrics.static StringFETCHER_OUTLINKS_DETECTED_TOTALOutlinks detected during parsing.static StringFETCHER_OUTLINKS_FOLLOWING_TOTALOutlinks being followed.static StringFETCHER_REDIRECT_COUNT_EXCEEDED_TOTALRedirects that exceeded maximum redirect count.static StringFETCHER_REDIRECT_DEDUPLICATED_TOTALRedirects deduplicated (already seen).static StringFETCHER_REDIRECT_NOT_CREATED_TOTALFetchItems not created for redirects.static StringFETCHER_ROBOTS_DEFER_VISITS_DROPPED_TOTALURLs dropped due to robots.txt deferred visits.static StringFETCHER_ROBOTS_DENIED_MAXCRAWLDELAY_TOTALURLs denied due to crawl delay exceeding maximum.static StringFETCHER_ROBOTS_DENIED_TOTALURLs denied by robots.txt.static StringGENERATOR_EXPR_REJECTED_TOTALURLs rejected by JEXL expression.static StringGENERATOR_HOSTS_AFFECTED_PER_HOST_OVERFLOW_TOTALHosts affected by per-host overflow.static StringGENERATOR_INTERVAL_REJECTED_TOTALURLs rejected due to fetch interval exceeding threshold.static StringGENERATOR_SCHEDULE_REJECTED_TOTALURLs rejected by fetch schedule.static StringGENERATOR_SCORE_TOO_LOW_TOTALURLs rejected due to score below threshold.static StringGENERATOR_STATUS_REJECTED_TOTALURLs rejected due to status restriction.static StringGENERATOR_URL_FILTERS_REJECTED_TOTALURLs rejected by URL filters.static StringGENERATOR_URLS_SKIPPED_PER_HOST_OVERFLOW_TOTALURLs skipped due to per-host overflow.static StringGENERATOR_WAIT_FOR_UPDATE_TOTALURLs waiting for CrawlDb update.static StringGROUP_CLEANINGCounter group for cleaning job operations.static StringGROUP_CRAWLDBCounter group for CrawlDb operations.static StringGROUP_CRAWLDB_FILTERCounter group for CrawlDb filter operations.static StringGROUP_DEDUPCounter group for deduplication operations.static StringGROUP_DOMAIN_STATSCounter group for domain statistics operations.static StringGROUP_FETCHERCounter group for fetcher operations.static StringGROUP_FETCHER_OUTLINKSCounter group for fetcher outlink processing.static StringGROUP_GENERATORCounter group for generator operations.static StringGROUP_HOSTDBCounter group for HostDb operations.static StringGROUP_INDEXERCounter group for indexer operations.static StringGROUP_INJECTORCounter group for injector operations.static StringGROUP_PARSERCounter group for parser operations.static StringGROUP_SITEMAPCounter group for sitemap processing operations.static StringGROUP_WARC_EXPORTERCounter group for WARC export operations.static StringGROUP_WEBGRAPHCounter group for WebGraph operations.static StringHOSTDB_CHECKED_HOSTS_TOTALHosts checked.static StringHOSTDB_EXISTING_KNOWN_HOST_TOTALExisting known hosts.static StringHOSTDB_EXISTING_UNKNOWN_HOST_TOTALExisting unknown hosts.static StringHOSTDB_FILTERED_RECORDS_TOTALRecords filtered in HostDb.static StringHOSTDB_NEW_KNOWN_HOST_TOTALNew known hosts discovered.static StringHOSTDB_NEW_UNKNOWN_HOST_TOTALNew unknown hosts.static StringHOSTDB_PURGED_UNKNOWN_HOST_TOTALPurged unknown hosts.static StringHOSTDB_REDISCOVERED_HOST_TOTALRediscovered hosts.static StringHOSTDB_SKIPPED_NOT_ELIGIBLE_TOTALHosts skipped (not eligible).static StringHOSTDB_TOTAL_HOSTS_TOTALTotal hosts processed.static StringHOSTDB_URL_LIMIT_NOT_REACHED_TOTALHosts where URL limit was not reached.static StringINDEXER_DELETED_BY_INDEXING_FILTER_TOTALDocuments deleted by indexing filter.static StringINDEXER_DELETED_DUPLICATES_TOTALDocuments deleted as duplicates.static StringINDEXER_DELETED_GONE_TOTALDocuments deleted because they are gone.static StringINDEXER_DELETED_REDIRECTS_TOTALDocuments deleted due to redirects.static StringINDEXER_DELETED_ROBOTS_NOINDEX_TOTALDocuments deleted due to robots noindex.static StringINDEXER_INDEXED_TOTALDocuments indexed (added or updated).static StringINDEXER_LATENCYPrefix for indexer latency metrics.static StringINDEXER_SKIPPED_BY_INDEXING_FILTER_TOTALDocuments skipped by indexing filter.static StringINDEXER_SKIPPED_NOT_MODIFIED_TOTALDocuments skipped (not modified).static StringINJECTOR_URLS_FILTERED_TOTALURLs filtered during injection.static StringINJECTOR_URLS_INJECTED_TOTALURLs injected.static StringINJECTOR_URLS_INJECTED_UNIQUE_TOTALUnique URLs injected.static StringINJECTOR_URLS_MERGED_TOTALURLs merged with existing CrawlDb entries.static StringINJECTOR_URLS_PURGED_404_TOTALURLs purged due to 404 status.static StringINJECTOR_URLS_PURGED_FILTER_TOTALURLs purged by filter.static StringPARSER_LATENCYPrefix for parse latency metrics.static StringSITEMAP_EXISTING_ENTRIES_TOTALExisting sitemap entries.static StringSITEMAP_FAILED_FETCHES_TOTALFailed sitemap fetches.static StringSITEMAP_FILTERED_FROM_HOSTNAME_TOTALSitemaps filtered from hostname.static StringSITEMAP_FILTERED_RECORDS_TOTALFiltered records in sitemap processing.static StringSITEMAP_FROM_HOSTNAME_TOTALSitemaps discovered from hostname.static StringSITEMAP_NEW_ENTRIES_TOTALNew sitemap entries.static StringSITEMAP_SEEDS_TOTALSeeds extracted from sitemaps.static StringWARC_MISSING_CONTENT_TOTALMissing content in WARC export.static StringWARC_MISSING_METADATA_TOTALMissing metadata in WARC export.static StringWARC_OMITTED_EMPTY_RESPONSE_TOTALOmitted empty responses in WARC export.static StringWARC_RECORDS_GENERATED_TOTALWARC records generated.static StringWEBGRAPH_ADDED_LINKS_TOTALLinks added to WebGraph.static StringWEBGRAPH_REMOVED_LINKS_TOTALLinks removed from WebGraph.
-
-
-
Field Detail
-
GROUP_FETCHER
public static final String GROUP_FETCHER
Counter group for fetcher operations.- See Also:
- Constant Field Values
-
GROUP_FETCHER_OUTLINKS
public static final String GROUP_FETCHER_OUTLINKS
Counter group for fetcher outlink processing.- See Also:
- Constant Field Values
-
GROUP_GENERATOR
public static final String GROUP_GENERATOR
Counter group for generator operations.- See Also:
- Constant Field Values
-
GROUP_INDEXER
public static final String GROUP_INDEXER
Counter group for indexer operations.- See Also:
- Constant Field Values
-
GROUP_CRAWLDB
public static final String GROUP_CRAWLDB
Counter group for CrawlDb operations.- See Also:
- Constant Field Values
-
GROUP_CRAWLDB_FILTER
public static final String GROUP_CRAWLDB_FILTER
Counter group for CrawlDb filter operations.- See Also:
- Constant Field Values
-
GROUP_INJECTOR
public static final String GROUP_INJECTOR
Counter group for injector operations.- See Also:
- Constant Field Values
-
GROUP_HOSTDB
public static final String GROUP_HOSTDB
Counter group for HostDb operations.- See Also:
- Constant Field Values
-
GROUP_PARSER
public static final String GROUP_PARSER
Counter group for parser operations.- See Also:
- Constant Field Values
-
GROUP_DEDUP
public static final String GROUP_DEDUP
Counter group for deduplication operations.- See Also:
- Constant Field Values
-
GROUP_CLEANING
public static final String GROUP_CLEANING
Counter group for cleaning job operations.- See Also:
- Constant Field Values
-
GROUP_WEBGRAPH
public static final String GROUP_WEBGRAPH
Counter group for WebGraph operations.- See Also:
- Constant Field Values
-
GROUP_SITEMAP
public static final String GROUP_SITEMAP
Counter group for sitemap processing operations.- See Also:
- Constant Field Values
-
GROUP_WARC_EXPORTER
public static final String GROUP_WARC_EXPORTER
Counter group for WARC export operations.- See Also:
- Constant Field Values
-
GROUP_DOMAIN_STATS
public static final String GROUP_DOMAIN_STATS
Counter group for domain statistics operations.- See Also:
- Constant Field Values
-
FETCHER_BYTES_DOWNLOADED_TOTAL
public static final String FETCHER_BYTES_DOWNLOADED_TOTAL
Total bytes downloaded by fetcher.- See Also:
- Constant Field Values
-
FETCHER_ROBOTS_DENIED_TOTAL
public static final String FETCHER_ROBOTS_DENIED_TOTAL
URLs denied by robots.txt.- See Also:
- Constant Field Values
-
FETCHER_ROBOTS_DENIED_MAXCRAWLDELAY_TOTAL
public static final String FETCHER_ROBOTS_DENIED_MAXCRAWLDELAY_TOTAL
URLs denied due to crawl delay exceeding maximum.- See Also:
- Constant Field Values
-
FETCHER_ROBOTS_DEFER_VISITS_DROPPED_TOTAL
public static final String FETCHER_ROBOTS_DEFER_VISITS_DROPPED_TOTAL
URLs dropped due to robots.txt deferred visits.- See Also:
- Constant Field Values
-
FETCHER_REDIRECT_COUNT_EXCEEDED_TOTAL
public static final String FETCHER_REDIRECT_COUNT_EXCEEDED_TOTAL
Redirects that exceeded maximum redirect count.- See Also:
- Constant Field Values
-
FETCHER_REDIRECT_DEDUPLICATED_TOTAL
public static final String FETCHER_REDIRECT_DEDUPLICATED_TOTAL
Redirects deduplicated (already seen).- See Also:
- Constant Field Values
-
FETCHER_REDIRECT_NOT_CREATED_TOTAL
public static final String FETCHER_REDIRECT_NOT_CREATED_TOTAL
FetchItems not created for redirects.- See Also:
- Constant Field Values
-
FETCHER_HIT_BY_TIMELIMIT_TOTAL
public static final String FETCHER_HIT_BY_TIMELIMIT_TOTAL
URLs hit by time limit.- See Also:
- Constant Field Values
-
FETCHER_HIT_BY_TIMEOUT_TOTAL
public static final String FETCHER_HIT_BY_TIMEOUT_TOTAL
URLs hit by timeout.- See Also:
- Constant Field Values
-
FETCHER_HIT_BY_THROUGHPUT_THRESHOLD_TOTAL
public static final String FETCHER_HIT_BY_THROUGHPUT_THRESHOLD_TOTAL
URLs hit by throughput threshold.- See Also:
- Constant Field Values
-
FETCHER_HUNG_THREADS_TOTAL
public static final String FETCHER_HUNG_THREADS_TOTAL
Threads that hung during fetching.- See Also:
- Constant Field Values
-
FETCHER_FILTERED_TOTAL
public static final String FETCHER_FILTERED_TOTAL
URLs filtered during fetching.- See Also:
- Constant Field Values
-
FETCHER_ABOVE_EXCEPTION_THRESHOLD_TOTAL
public static final String FETCHER_ABOVE_EXCEPTION_THRESHOLD_TOTAL
URLs dropped due to exception threshold in queue.- See Also:
- Constant Field Values
-
FETCHER_OUTLINKS_DETECTED_TOTAL
public static final String FETCHER_OUTLINKS_DETECTED_TOTAL
Outlinks detected during parsing.- See Also:
- Constant Field Values
-
FETCHER_OUTLINKS_FOLLOWING_TOTAL
public static final String FETCHER_OUTLINKS_FOLLOWING_TOTAL
Outlinks being followed.- See Also:
- Constant Field Values
-
GENERATOR_URL_FILTERS_REJECTED_TOTAL
public static final String GENERATOR_URL_FILTERS_REJECTED_TOTAL
URLs rejected by URL filters.- See Also:
- Constant Field Values
-
GENERATOR_SCHEDULE_REJECTED_TOTAL
public static final String GENERATOR_SCHEDULE_REJECTED_TOTAL
URLs rejected by fetch schedule.- See Also:
- Constant Field Values
-
GENERATOR_WAIT_FOR_UPDATE_TOTAL
public static final String GENERATOR_WAIT_FOR_UPDATE_TOTAL
URLs waiting for CrawlDb update.- See Also:
- Constant Field Values
-
GENERATOR_EXPR_REJECTED_TOTAL
public static final String GENERATOR_EXPR_REJECTED_TOTAL
URLs rejected by JEXL expression.- See Also:
- Constant Field Values
-
GENERATOR_STATUS_REJECTED_TOTAL
public static final String GENERATOR_STATUS_REJECTED_TOTAL
URLs rejected due to status restriction.- See Also:
- Constant Field Values
-
GENERATOR_SCORE_TOO_LOW_TOTAL
public static final String GENERATOR_SCORE_TOO_LOW_TOTAL
URLs rejected due to score below threshold.- See Also:
- Constant Field Values
-
GENERATOR_INTERVAL_REJECTED_TOTAL
public static final String GENERATOR_INTERVAL_REJECTED_TOTAL
URLs rejected due to fetch interval exceeding threshold.- See Also:
- Constant Field Values
-
GENERATOR_URLS_SKIPPED_PER_HOST_OVERFLOW_TOTAL
public static final String GENERATOR_URLS_SKIPPED_PER_HOST_OVERFLOW_TOTAL
URLs skipped due to per-host overflow.- See Also:
- Constant Field Values
-
GENERATOR_HOSTS_AFFECTED_PER_HOST_OVERFLOW_TOTAL
public static final String GENERATOR_HOSTS_AFFECTED_PER_HOST_OVERFLOW_TOTAL
Hosts affected by per-host overflow.- See Also:
- Constant Field Values
-
INDEXER_DELETED_ROBOTS_NOINDEX_TOTAL
public static final String INDEXER_DELETED_ROBOTS_NOINDEX_TOTAL
Documents deleted due to robots noindex.- See Also:
- Constant Field Values
-
INDEXER_DELETED_GONE_TOTAL
public static final String INDEXER_DELETED_GONE_TOTAL
Documents deleted because they are gone.- See Also:
- Constant Field Values
-
INDEXER_DELETED_REDIRECTS_TOTAL
public static final String INDEXER_DELETED_REDIRECTS_TOTAL
Documents deleted due to redirects.- See Also:
- Constant Field Values
-
INDEXER_DELETED_DUPLICATES_TOTAL
public static final String INDEXER_DELETED_DUPLICATES_TOTAL
Documents deleted as duplicates.- See Also:
- Constant Field Values
-
INDEXER_DELETED_BY_INDEXING_FILTER_TOTAL
public static final String INDEXER_DELETED_BY_INDEXING_FILTER_TOTAL
Documents deleted by indexing filter.- See Also:
- Constant Field Values
-
INDEXER_SKIPPED_NOT_MODIFIED_TOTAL
public static final String INDEXER_SKIPPED_NOT_MODIFIED_TOTAL
Documents skipped (not modified).- See Also:
- Constant Field Values
-
INDEXER_SKIPPED_BY_INDEXING_FILTER_TOTAL
public static final String INDEXER_SKIPPED_BY_INDEXING_FILTER_TOTAL
Documents skipped by indexing filter.- See Also:
- Constant Field Values
-
INDEXER_INDEXED_TOTAL
public static final String INDEXER_INDEXED_TOTAL
Documents indexed (added or updated).- See Also:
- Constant Field Values
-
CRAWLDB_URLS_FILTERED_TOTAL
public static final String CRAWLDB_URLS_FILTERED_TOTAL
URLs filtered during CrawlDb operations.- See Also:
- Constant Field Values
-
CRAWLDB_GONE_RECORDS_REMOVED_TOTAL
public static final String CRAWLDB_GONE_RECORDS_REMOVED_TOTAL
Gone (404) records removed during CrawlDb operations.- See Also:
- Constant Field Values
-
CRAWLDB_ORPHAN_RECORDS_REMOVED_TOTAL
public static final String CRAWLDB_ORPHAN_RECORDS_REMOVED_TOTAL
Orphan records removed during CrawlDb operations.- See Also:
- Constant Field Values
-
INJECTOR_URLS_FILTERED_TOTAL
public static final String INJECTOR_URLS_FILTERED_TOTAL
URLs filtered during injection.- See Also:
- Constant Field Values
-
INJECTOR_URLS_INJECTED_TOTAL
public static final String INJECTOR_URLS_INJECTED_TOTAL
URLs injected.- See Also:
- Constant Field Values
-
INJECTOR_URLS_INJECTED_UNIQUE_TOTAL
public static final String INJECTOR_URLS_INJECTED_UNIQUE_TOTAL
Unique URLs injected.- See Also:
- Constant Field Values
-
INJECTOR_URLS_MERGED_TOTAL
public static final String INJECTOR_URLS_MERGED_TOTAL
URLs merged with existing CrawlDb entries.- See Also:
- Constant Field Values
-
INJECTOR_URLS_PURGED_404_TOTAL
public static final String INJECTOR_URLS_PURGED_404_TOTAL
URLs purged due to 404 status.- See Also:
- Constant Field Values
-
INJECTOR_URLS_PURGED_FILTER_TOTAL
public static final String INJECTOR_URLS_PURGED_FILTER_TOTAL
URLs purged by filter.- See Also:
- Constant Field Values
-
HOSTDB_FILTERED_RECORDS_TOTAL
public static final String HOSTDB_FILTERED_RECORDS_TOTAL
Records filtered in HostDb.- See Also:
- Constant Field Values
-
HOSTDB_TOTAL_HOSTS_TOTAL
public static final String HOSTDB_TOTAL_HOSTS_TOTAL
Total hosts processed.- See Also:
- Constant Field Values
-
HOSTDB_SKIPPED_NOT_ELIGIBLE_TOTAL
public static final String HOSTDB_SKIPPED_NOT_ELIGIBLE_TOTAL
Hosts skipped (not eligible).- See Also:
- Constant Field Values
-
HOSTDB_URL_LIMIT_NOT_REACHED_TOTAL
public static final String HOSTDB_URL_LIMIT_NOT_REACHED_TOTAL
Hosts where URL limit was not reached.- See Also:
- Constant Field Values
-
HOSTDB_NEW_KNOWN_HOST_TOTAL
public static final String HOSTDB_NEW_KNOWN_HOST_TOTAL
New known hosts discovered.- See Also:
- Constant Field Values
-
HOSTDB_REDISCOVERED_HOST_TOTAL
public static final String HOSTDB_REDISCOVERED_HOST_TOTAL
Rediscovered hosts.- See Also:
- Constant Field Values
-
HOSTDB_EXISTING_KNOWN_HOST_TOTAL
public static final String HOSTDB_EXISTING_KNOWN_HOST_TOTAL
Existing known hosts.- See Also:
- Constant Field Values
-
HOSTDB_NEW_UNKNOWN_HOST_TOTAL
public static final String HOSTDB_NEW_UNKNOWN_HOST_TOTAL
New unknown hosts.- See Also:
- Constant Field Values
-
HOSTDB_EXISTING_UNKNOWN_HOST_TOTAL
public static final String HOSTDB_EXISTING_UNKNOWN_HOST_TOTAL
Existing unknown hosts.- See Also:
- Constant Field Values
-
HOSTDB_PURGED_UNKNOWN_HOST_TOTAL
public static final String HOSTDB_PURGED_UNKNOWN_HOST_TOTAL
Purged unknown hosts.- See Also:
- Constant Field Values
-
HOSTDB_CHECKED_HOSTS_TOTAL
public static final String HOSTDB_CHECKED_HOSTS_TOTAL
Hosts checked.- See Also:
- Constant Field Values
-
DEDUP_DOCUMENTS_MARKED_DUPLICATE_TOTAL
public static final String DEDUP_DOCUMENTS_MARKED_DUPLICATE_TOTAL
Documents marked as duplicate.- See Also:
- Constant Field Values
-
CLEANING_DELETED_DOCUMENTS_TOTAL
public static final String CLEANING_DELETED_DOCUMENTS_TOTAL
Documents deleted during cleaning.- See Also:
- Constant Field Values
-
WEBGRAPH_ADDED_LINKS_TOTAL
public static final String WEBGRAPH_ADDED_LINKS_TOTAL
Links added to WebGraph.- See Also:
- Constant Field Values
-
WEBGRAPH_REMOVED_LINKS_TOTAL
public static final String WEBGRAPH_REMOVED_LINKS_TOTAL
Links removed from WebGraph.- See Also:
- Constant Field Values
-
SITEMAP_FILTERED_RECORDS_TOTAL
public static final String SITEMAP_FILTERED_RECORDS_TOTAL
Filtered records in sitemap processing.- See Also:
- Constant Field Values
-
SITEMAP_SEEDS_TOTAL
public static final String SITEMAP_SEEDS_TOTAL
Seeds extracted from sitemaps.- See Also:
- Constant Field Values
-
SITEMAP_FROM_HOSTNAME_TOTAL
public static final String SITEMAP_FROM_HOSTNAME_TOTAL
Sitemaps discovered from hostname.- See Also:
- Constant Field Values
-
SITEMAP_FILTERED_FROM_HOSTNAME_TOTAL
public static final String SITEMAP_FILTERED_FROM_HOSTNAME_TOTAL
Sitemaps filtered from hostname.- See Also:
- Constant Field Values
-
SITEMAP_FAILED_FETCHES_TOTAL
public static final String SITEMAP_FAILED_FETCHES_TOTAL
Failed sitemap fetches.- See Also:
- Constant Field Values
-
SITEMAP_EXISTING_ENTRIES_TOTAL
public static final String SITEMAP_EXISTING_ENTRIES_TOTAL
Existing sitemap entries.- See Also:
- Constant Field Values
-
SITEMAP_NEW_ENTRIES_TOTAL
public static final String SITEMAP_NEW_ENTRIES_TOTAL
New sitemap entries.- See Also:
- Constant Field Values
-
WARC_MISSING_CONTENT_TOTAL
public static final String WARC_MISSING_CONTENT_TOTAL
Missing content in WARC export.- See Also:
- Constant Field Values
-
WARC_MISSING_METADATA_TOTAL
public static final String WARC_MISSING_METADATA_TOTAL
Missing metadata in WARC export.- See Also:
- Constant Field Values
-
WARC_OMITTED_EMPTY_RESPONSE_TOTAL
public static final String WARC_OMITTED_EMPTY_RESPONSE_TOTAL
Omitted empty responses in WARC export.- See Also:
- Constant Field Values
-
WARC_RECORDS_GENERATED_TOTAL
public static final String WARC_RECORDS_GENERATED_TOTAL
WARC records generated.- See Also:
- Constant Field Values
-
DOMAIN_STATS_FETCHED_TOTAL
public static final String DOMAIN_STATS_FETCHED_TOTAL
Fetched URLs in domain statistics.- See Also:
- Constant Field Values
-
DOMAIN_STATS_NOT_FETCHED_TOTAL
public static final String DOMAIN_STATS_NOT_FETCHED_TOTAL
Not fetched URLs in domain statistics.- See Also:
- Constant Field Values
-
DOMAIN_STATS_EMPTY_RESULT_TOTAL
public static final String DOMAIN_STATS_EMPTY_RESULT_TOTAL
Empty results in domain statistics.- See Also:
- Constant Field Values
-
FETCHER_LATENCY
public static final String FETCHER_LATENCY
Prefix for fetch latency metrics. Used withLatencyTrackerto emit fetch timing counters.- See Also:
- Constant Field Values
-
PARSER_LATENCY
public static final String PARSER_LATENCY
Prefix for parse latency metrics. Used withLatencyTrackerto emit parse timing counters.- See Also:
- Constant Field Values
-
INDEXER_LATENCY
public static final String INDEXER_LATENCY
Prefix for indexer latency metrics. Used withLatencyTrackerto emit indexing timing counters.- See Also:
- Constant Field Values
-
ERROR_TOTAL
public static final String ERROR_TOTAL
Total errors across all categories. This is incremented alongside any category-specific error counter.- See Also:
- Constant Field Values
-
ERROR_NETWORK_TOTAL
public static final String ERROR_NETWORK_TOTAL
Network-related errors. Includes: IOException, SocketException, ConnectException, UnknownHostException- See Also:
- Constant Field Values
-
ERROR_PROTOCOL_TOTAL
public static final String ERROR_PROTOCOL_TOTAL
Protocol errors. Includes: ProtocolException, ProtocolNotFound- See Also:
- Constant Field Values
-
ERROR_PARSING_TOTAL
public static final String ERROR_PARSING_TOTAL
Parsing errors. Includes: ParseException, ParserNotFound- See Also:
- Constant Field Values
-
ERROR_URL_TOTAL
public static final String ERROR_URL_TOTAL
URL-related errors. Includes: MalformedURLException, URLFilterException- See Also:
- Constant Field Values
-
ERROR_SCORING_TOTAL
public static final String ERROR_SCORING_TOTAL
Scoring filter errors. Includes: ScoringFilterException- See Also:
- Constant Field Values
-
ERROR_INDEXING_TOTAL
public static final String ERROR_INDEXING_TOTAL
Indexing filter errors. Includes: IndexingException- See Also:
- Constant Field Values
-
ERROR_TIMEOUT_TOTAL
public static final String ERROR_TIMEOUT_TOTAL
Timeout errors. Includes: SocketTimeoutException, connection timeouts- See Also:
- Constant Field Values
-
ERROR_OTHER_TOTAL
public static final String ERROR_OTHER_TOTAL
Other uncategorized errors. Used as fallback for exceptions not matching any specific category.- See Also:
- Constant Field Values
-
-