anreo Back
作品Mar 2026 – PresentCase study

Data Warehouse

A data warehouse for rebar delivery orders. Ingests delivery documents from steel suppliers and scanned/WhatsApp uploads, reads each PDF into structured data with AI, deduplicates and reconciles against purchase orders, and serves it through a web app — orchestrated with Apache Airflow over AWS Lambda.

PeriodMar 2026 – Present
StackApache Airflow · AWS Lambda · Python · .NET
Apache AirflowAWS LambdaPython.NETPostgreSQLReactTypeScriptAG GridEventBridge

A data warehouse for rebar delivery orders (DOs). It pulls delivery documents from steel suppliers and scanned / WhatsApp uploads, uses AI to read each PDF into structured data, deduplicates and reconciles deliveries against purchase orders, and serves everything through a web app — orchestrated with Apache Airflow over AWS Lambda.

Overview

Construction sites receive rebar against delivery orders from many suppliers, almost always as PDFs. This warehouse centralises every DO — from supplier portals, scanned packages and a WhatsApp intake flow — into one queryable source of truth, with AI extraction replacing manual data entry.

Role

Full-stack + data engineering — .NET back end, React front end, and the Airflow pipelines (scanned-DO and per-supplier sync) wired to the AWS Lambda functions.

Highlights

Outcome

Replaced manual rebar-DO entry with an automated, multi-source pipeline — supplier syncs and scanned / WhatsApp uploads land as clean, reconciled records in a single warehouse.

Technologies

Data & orchestration: Apache Airflow (Python), AWS Lambda, EventBridge, SQS / SNS · AI: PDF → structured extraction with vision LLMs · Backend: .NET, EF Core, PostgreSQL · Frontend: React, TypeScript, AG Grid