Skip to main content
Explore the key concepts of the document processing pipeline.

How It Works

A deeper dive on the end-to-end pipeline

Metadata

The fields extracted, modified and written

Matter Pages

The page sets containing key information

Token Costs

How we estimate the cost of each run

Duplicate Detection

Document identification and change tracking through CRC32 hashing.

Annotation Handling

Organization of document markups, highlights, and reader notes.