AI Architecture
How the pieces fit together: where retrieval ends and the model begins, what each call costs, how it fails, and how a person checks its work. Designed around the constraints finance imposes, not a generic chatbot template.
- Retrieval, agents and tool use, scoped to the job
- Model choice backed by an eval set, not a vendor demo
- Cost, latency and failure-mode analysis per call