Fingerprinter - RocketRide Documentation

Fingerprinter

What it does

Generates a deterministic fingerprint (a hash) of each document's content as it passes through the pipeline. Because the hash is computed from the document's text (raw or normalized) and not from its metadata, identical content always produces the same fingerprint. Use it for deduplication, content tracking, and verifying a document's identity before indexing.
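
To make the idea concrete, here is a minimal Python sketch of content fingerprinting and fingerprint-based deduplication. It is an illustration only, not the node's implementation: this page does not state which hash algorithm or normalization the Fingerprinter uses, so SHA-256, the NFC and whitespace normalization, and the helper names `fingerprint` and `deduplicate` are all assumptions.

```python
import hashlib
import unicodedata


def fingerprint(text: str, normalize: bool = True) -> str:
    """Return a hex digest of the text (hypothetical helper, SHA-256 assumed)."""
    if normalize:
        # Assumed normalization: Unicode NFC, then collapse runs of whitespace.
        text = unicodedata.normalize("NFC", text)
        text = " ".join(text.split())
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def deduplicate(documents):
    """Keep the first document seen for each content fingerprint."""
    seen = set()
    unique = []
    for doc in documents:
        # Only the text is hashed; metadata such as "source" is ignored.
        digest = fingerprint(doc["text"])
        if digest not in seen:
            seen.add(digest)
            unique.append(doc)
    return unique


docs = [
    {"text": "Hello  world", "source": "a.txt"},
    {"text": "Hello world", "source": "b.txt"},  # same content after normalization
    {"text": "Goodbye", "source": "c.txt"},
]
unique_docs = deduplicate(docs)
```

In this sketch the first two documents differ only in whitespace and in their `source` metadata, so they share a fingerprint and the second one is dropped. Passing `normalize=False` hashes the raw text instead, in which case they would be treated as distinct.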

Lanes: data → data

Configuration

None.