A self-validating quant pipeline with live job tracking and a Next.js control plane.
Most projects stop at results — this one validates them in real time, streams every stage over WebSocket, and ships signals, backtest, and versioned runs in one local research workstation.
Next.jsFastAPIPySparkDuckDBWebSocketQA
View repository (coming soon)
01 — The Problem
Four gaps most financial projects ignore.
Most projects stop at "I got results." These four gaps matter in production.
01
Scalability
Pandas breaks at scale — PySpark handles multi-ticker workloads.
02
Correctness
Rolling metrics and drawdown need careful window functions per ticker.
03
Validation
Formal QA scores Spark output against a Pandas/SciPy benchmark.
04
Observability
WebSocket streams every pipeline stage — no black-box runs.
02 — How It Works
Four layers, seven pipeline stages.
Layer 01UI Layer
Next.js dashboard — run pipeline, jobs, QA, signals, backtest, reports
↓
Layer 02API Layer
FastAPI — POST /run-pipeline, WebSocket /ws/{job_id}, /api/v1/*