Client spending $132 k / year on AWS (Textract, Bedrock, Lambda, Dynamo, S3, etc.) to chew through 1 000 engineering PDFs a day. OCR alone = 61 % of the bill.
API shape, S3-compatible buckets, Postgres instead of Dynamo, JWT instead of Cognito. Users barely notice.
| Period | Old Bill | New Bill | Savings |
|---|---|---|---|
| Year 1 | $132 k | $7.7 k | 94 % |
| Year 2+ | $132 k | $2.8 k | 98 % |
FastAPI + Celery, PostgreSQL + PGVector, MinIO, PaddleOCR-VL, vLLM, Nginx, Docker. One 650 W box, $71 / mo power.
Spare GPU & PSU on shelf, nightly encrypted backups to external drive, feature-flagged fallback to AWS in 5 min.
Send 50 ugly PDFs, pick a cloud GPU, validate in 2 weeks. No hardware risk.
Download complete migration plan (Markdown) - Includes detailed technical specifications, code samples, deployment instructions, and operational procedures.
Document Version: 1.0 | Last Updated: 2026-01-07
Prepared By: William Welsh | hello@wwel.sh | https://wwel.sh