AI Engineer

Desmond Choy

Shipping full-stack AI systems end-to-end. Multi-agent orchestration, cost-optimized LLM pipelines, and real-time WebSocket architectures. Six production applications across education, music, computer vision, and behavioral science.

6 production apps shipped

Published Work

Production applications and research across AI engineering, computer vision, and education technology.

Buttery Smooth Jamming

Built a real-time multi-agent orchestration runtime where four persistent AI musicians improvise together while the server enforces deterministic routing, state continuity, and final pattern composition.

  • Architected a dual-mode Codex and Strudel runtime that powers both assistant interactions and live multi-agent jam sessions.
  • Implemented server-owned orchestration for four persistent AI agents with deterministic @mention routing, isolated sessions, and structured composition safeguards.
  • Added real-time streaming, multimodal control, and validation gates to keep the system responsive, reliable, and safe under live conditions.
Next.jsTypeScriptCodexStrudelMCPWebSockets

Learning Odyssey

Engineered a multimodal AI application with validated streaming, cost-optimized model routing, and persistent user state so every adventure stays coherent, resumable, and visually consistent from first choice to final summary.

  • Validate-then-stream architecture pre-checks AI output before real-time delivery, preventing broken chapter flows and malformed choices.
  • Cost-optimized multi-model pipeline routes narrative generation, summaries, formatting, and image prompt synthesis to the right model for quality and efficiency.
  • Persistent adventure state plus character and agency tracking keeps progress, story logic, and visual continuity intact across chapters, reconnects, and resumed sessions.
FastAPISupabaseGeminiRailwayDockerReactWebSockets

NANA is a full-stack AI learning system that transforms dense PDFs into personalized study workspaces, combining adaptive notes, document overviews, and inline AI explanations in a product built for real use.

  • Two-phase PDF pipeline with large-file preprocessing and splitting for cost-efficient note generation.
  • Personalized document overviews and study notes adapted to learner background and goals.
  • Inline AI commands, resumable sessions, and exportable notes built for real study workflows.
PythonFastAPITypeScriptReactViteGemini 3 Flash

Moodsic

End-to-end affective computing pipeline that turns noisy real-world video into music recommendations through dual-pathway ML, variance-weighted fusion, and full-stack delivery.

  • CLIP scene analysis + EmoNet facial cues fused through uncertainty-aware dual-pathway ML
    • Variance-weighted fusion with temporal smoothing improved VEATIC MAE by ~19% vs scene-only baselines
    • Delivered a React + Flask demo with synced playback, emotion visualizations, and DEAM music matching
PyTorchReactFlaskOpenCVCLIP ViT-B/32EmoNetMediaPipe

Twinkl

Values-alignment journal analyzing whether daily behavior reflects stated priorities across ten Schwartz dimensions.

  • VIF: ordinal MLP heads with MC Dropout uncertainty estimation
  • Synthetic data generation: 204 personas, 1,651 journal entries
  • Automated LLM judge labeling pipeline via Claude Code subagents
PyTorchShinyPolarsnomic-embed-textOpenAI APIThree.js

SSL ViT Models

Research on whether fine-tuning shifts self-supervised attention toward semantic features experts consider diagnostic.

  • 6 ViT architectures evaluated across frozen vs. fine-tuned conditions
  • Delta-IoU significance testing framework
  • Interactive React dashboard for attention map exploration
PyTorchFastAPIReactVision Transformers

About

Finance to Data Science to AI Engineering

AI engineer and CFA Charterholder shipping full-stack production AI applications end-to-end—from LLM orchestration and API design to real-time user-facing systems.

Career arc runs from traditional finance (fixed income, portfolio management) through data science to AI engineering, with domain expertise in financial services, insurance, and reinsurance. Currently pursuing a Master of Technology in AI Systems at the National University of Singapore while working as Manager, Data Science at Pacific Life Re.

Skills

AI & Machine Learning

LLM OrchestrationMulti-Agent SystemsRAGAgentic WorkflowsContext EngineeringCost OptimizationStructured Output ExtractionValidate-then-StreamSimulation TestingSelf-Supervised LearningVision TransformersFine-tuning (LoRA)PyTorchNLP

Full-Stack Development

FastAPINext.jsReactTypeScriptWebSocketsPostgreSQLSupabaseTailwind CSSMCP ServersJWT Auth

Languages

PythonTypeScriptSQLR

Domain Expertise

Financial ServicesInsurance & ReinsuranceRegulatory CompliancePortfolio Analytics

Experience

2022–present

Manager, Data Science / Pacific Life Re

Internal AI solutions engineer: deployed AI medical research engine, RAG chatbot, ML underwriting solution

2021

Data Scientist / Circles.Life

Full ML lifecycle for churn/anomaly detection; customer segmentation via clustering

2017–2020

Data Scientist / Fixed Income Analyst / UOB Asset Management

NLP text mining saving dozens to hundreds of man-hours; analytics dashboards

2015–2017

Fixed Income Analyst / Great Eastern Life

Zero credit defaults during tenure; promoted 2017

2011–2014

Assistant Portfolio Manager / Woori Absolute Partners

Promoted to front-office; firm won 4 regional awards (2012–2014)

Education

2025–2026

Master of Technology in AI Systems(Expected)

National University of Singapore

2014

CFA Charterholder

CFA Institute

2009

BSc Management (First Class Honours)

University of London

Get in Touch

Singapore (UTC+08:00)