Key Concepts
Overview
Explore the key concepts of the document processing pipeline.
How It Works
A deeper dive on the end-to-end pipeline
Metadata
The fields extracted, modified and written
Matter Pages
The page sets containing key information
Token Costs
How we estimate the cost of each run
Duplicate Detection
Document identification and change tracking through CRC32 hashing.
Annotation Handling
Organization of document markups, highlights, and reader notes.