Just built an end-to-end document automation workflow using Sarvam Vision + n8n.
The goal was simple: take messy, low-quality scanned documents and turn them into structured, machine-readable data, fully automated.
What it does:
• Upload document
• Run OCR using Sarvam Vision
• Extract structured data
• Use AI to pull key fields
• Automatically update a sheet
No manual cleanup.
No copy-paste.
Everything runs automatically.
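For anyone who thinks in code rather than nodes, here is a rough sketch of the same flow in plain Python. It is not the actual n8n workflow: the block shape, the function names (run_pipeline, fake_ocr, fake_pull_fields) and the sample data are all made up for illustration, and the OCR and sheet steps are stand-ins rather than real Sarvam Vision or spreadsheet calls.

```python
from typing import Callable

# Hypothetical block shape: {"type": ..., "text": ...}.
# This is an assumption, not Sarvam Vision's actual response schema.
Block = dict[str, str]


def run_pipeline(
    document: bytes,
    run_ocr: Callable[[bytes], list[Block]],
    pull_fields: Callable[[list[Block]], dict[str, str]],
    append_row: Callable[[dict[str, str]], None],
) -> dict[str, str]:
    """Upload -> OCR -> structured blocks -> key fields -> sheet row."""
    blocks = run_ocr(document)    # OCR step (stand-in for the real API call)
    fields = pull_fields(blocks)  # field extraction (stand-in for the AI step)
    append_row(fields)            # sheet update (stand-in for the sheet node)
    return fields


def fake_ocr(document: bytes) -> list[Block]:
    # Stand-in OCR: returns fixed, made-up blocks regardless of input.
    return [
        {"type": "heading", "text": "Claim Form"},
        {"type": "paragraph", "text": "Name: Asha Rao"},
        {"type": "paragraph", "text": "Policy No: P-1042"},
    ]


def fake_pull_fields(blocks: list[Block]) -> dict[str, str]:
    # Stand-in extraction: treat "Key: Value" lines as fields.
    fields: dict[str, str] = {}
    for block in blocks:
        key, sep, value = block["text"].partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


# An in-memory list plays the role of the sheet.
sheet: list[dict[str, str]] = []
result = run_pipeline(b"scan bytes", fake_ocr, fake_pull_fields, sheet.append)
print(result)  # {'Name': 'Asha Rao', 'Policy No': 'P-1042'}
```

Passing each stage in as a function keeps the stages swappable, which is roughly what separate n8n nodes give you.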
The interesting part?
Sarvam Vision doesn’t just return raw OCR text; it returns structured layout blocks. That makes downstream automation much more reliable, since later steps can work from separate blocks instead of parsing one flat string.
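To show why block-level output helps, here is a minimal sketch. The block format (a "type" plus "text" or "rows") is an assumption for illustration and may not match the real Sarvam Vision response; fields_from_tables is a hypothetical helper, and the sample data is invented.

```python
# Hypothetical layout blocks; the real response format may differ.
blocks = [
    {"type": "heading", "text": "Patient Intake"},
    {"type": "table", "rows": [["Name", "Asha Rao"], ["DOB", "1990-04-12"]]},
    {"type": "paragraph", "text": "Note: DOB on file must match ID."},
]


def fields_from_tables(blocks):
    """Read two-column table rows as key/value pairs, ignoring other blocks."""
    fields = {}
    for block in blocks:
        if block.get("type") != "table":
            continue  # headings and paragraphs are skipped entirely
        for row in block.get("rows", []):
            if len(row) == 2:
                fields[row[0].strip()] = row[1].strip()
    return fields


fields = fields_from_tables(blocks)
print(fields)  # {'Name': 'Asha Rao', 'DOB': '1990-04-12'}
```

With flat OCR text, a naive "split on the colon" rule would also pick up the "Note:" sentence as a field. With typed blocks, that paragraph never reaches the field logic.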
This kind of setup can be used for:
• Healthcare forms
• KYC processing
• Insurance claims
• Any document-heavy workflow
If anyone here is building with n8n + OCR + LLMs, happy to share the workflow.