SCAN
Project 01

DocScan

Serverless document intelligence pipeline. Upload any PDF or image — AWS Textract extracts raw text, detected tables, and key-value form pairs in real time. 100% serverless on AWS.

AWS Textract Lambda S3 API Gateway Python CloudFormation
Status
● Live
Cost
~$0.01/doc
Runtime
Python 3.12
Async PDF
Supported
← All Projects
Architecture

How It Works

Browser
API Gateway
Lambda
S3 (PDFs only)
AWS Textract
JSON Response

Images sent directly to Textract · PDFs staged in S3 with 1-day lifecycle expiry · IAM least-privilege policy · ~$2/month at demo traffic

Try It Live

Upload a real document — it hits the live AWS API

📄
Drop your document here
PNG · JPG · TIFF · PDF  ·  max 5 MB
Sending to AWS Textract…
Words
Pages
Tables
Fields
Confidence