Latency

Latency Overview

Cardinal is optimized for speed and scale. Each page typically takes around 15 seconds to process with our base model, regardless of size or complexity. Because all pages are processed in parallel with no rate limits, total latency depends primarily on the slowest page rather than the total number of pages.

Typical Latency

Document Size	Expected Latency
1 page	~15 sec
10 pages	~15 sec
100 pages	~15 sec
1000 pages	~15 sec

Notes

Pages run fully in parallel, so overall latency does not scale linearly with page count.
Base model processing takes approximately 15 seconds per page for standard OCR.
Additional features can increase processing time:
- Barcode detection may add several seconds per page
- Post-processing and enrichment typically add ~10–20 seconds on top of base latency
- Advanced passes (dense tables, handwriting, annotations) may add ~10–25 seconds per page
- Schema alignment and bounding boxes can add additional processing time
There are no rate limits — you can process as many pages as you want simultaneously.
Skew correction and other advanced passes may add a few seconds of processing time per page.

These are approximate numbers. For production workloads, we recommend uploading a sample batch to benchmark your own latency.

Introduction

Building Blocks

Accessories

Eval

Common Questions

Recipes

Security

On-Premise VPC Deployment

Uptime

Changelog

Latency Overview

Typical Latency

Notes

Introduction

Building Blocks

Accessories

Eval

Common Questions

Recipes

Security

On-Premise VPC Deployment

Uptime

Changelog

​Latency Overview

​Typical Latency

​Notes

Latency Overview

Typical Latency

Notes