Our multi-tiered approach combines the strengths of each AI system:

Open AI

for filename + metadata generation

+

Meta Llama + Ollama

for holistic understanding of complex layouts

Tesseract

open-source OCR for text extraction