Desmond Choy

Shipping full-stack AI systems end-to-end. Multi-agent orchestration, cost-optimized LLM pipelines, and real-time WebSocket architectures. Six production applications across education, music, computer vision, and behavioral science.

6 production apps shipped

Published Work

Production applications and research across AI engineering, computer vision, and education technology.

Buttery Smooth Jamming

Built a real-time multi-agent orchestration runtime where four persistent AI musicians improvise together while the server enforces deterministic routing, state continuity, and final pattern composition.

  • Architected a dual-mode Codex and Strudel runtime that powers both assistant interactions and live multi-agent jam sessions.
  • Implemented server-owned orchestration for four persistent AI agents with deterministic @mention routing, isolated sessions, and structured composition safeguards.
  • Added real-time streaming, multimodal control, and validation gates to keep the system responsive, reliable, and safe under live conditions.
Next.jsTypeScriptCodexStrudelMCPWebSockets

Learning Odyssey

Engineered a multimodal AI application with validated streaming, cost-optimized model routing, and persistent user state so every adventure stays coherent, resumable, and visually consistent from first choice to final summary.

  • Validate-then-stream architecture pre-checks AI output before real-time delivery, preventing broken chapter flows and malformed choices.
  • Cost-optimized multi-model pipeline routes narrative generation, summaries, formatting, and image prompt synthesis to the right model for quality and efficiency.
  • Persistent adventure state plus character and agency tracking keeps progress, story logic, and visual continuity intact across chapters, reconnects, and resumed sessions.
FastAPISupabaseGeminiRailwayDockerReactWebSockets

NANA is a full-stack AI learning system that transforms dense PDFs into personalized study workspaces, combining adaptive notes, document overviews, and inline AI explanations in a product built for real use.

  • Two-phase PDF pipeline with large-file preprocessing and splitting for cost-efficient note generation.
  • Personalized document overviews and study notes adapted to learner background and goals.
  • Inline AI commands, resumable sessions, and exportable notes built for real study workflows.
PythonFastAPITypeScriptReactViteGemini 3 Flash

Moodsic

End-to-end affective computing pipeline that turns noisy real-world video into music recommendations through dual-pathway ML, variance-weighted fusion, and full-stack delivery.

  • CLIP scene analysis + EmoNet facial cues fused through uncertainty-aware dual-pathway ML
    • Variance-weighted fusion with temporal smoothing improved VEATIC MAE by ~19% vs scene-only baselines
    • Delivered a React + Flask demo with synced playback, emotion visualizations, and DEAM music matching
PyTorchReactFlaskOpenCVCLIP ViT-B/32EmoNetMediaPipe

Twinkl

Values-alignment journal analyzing whether daily behavior reflects stated priorities across ten Schwartz dimensions.

  • VIF: ordinal MLP heads with MC Dropout uncertainty estimation
  • Synthetic data generation: 204 personas, 1,651 journal entries
  • Automated LLM judge labeling pipeline via Claude Code subagents
PyTorchShinyPolarsnomic-embed-textOpenAI APIThree.js

SSL ViT Models

Research on whether fine-tuning shifts self-supervised attention toward semantic features experts consider diagnostic.

  • 6 ViT architectures evaluated across frozen vs. fine-tuned conditions
  • Delta-IoU significance testing framework
  • Interactive React dashboard for attention map exploration
PyTorchFastAPIReactVision Transformers

About

Finance to Data Science to AI Engineering

AI engineer and CFA Charterholder shipping full-stack production AI applications end-to-end—from LLM orchestration and API design to real-time user-facing systems.

Career arc runs from traditional finance (fixed income, portfolio management) through data science to AI engineering, with domain expertise in financial services, insurance, and reinsurance. Currently pursuing a Master of Technology in Artificial Intelligence Systems at the National University of Singapore while working as Manager, Data Science at Pacific Life Re.

Skills

AI & Machine Learning

LLM OrchestrationMulti-Agent SystemsRAGAgentic WorkflowsContext EngineeringCost OptimizationStructured Output ExtractionValidate-then-StreamSimulation TestingSelf-Supervised LearningVision TransformersFine-tuning (LoRA)PyTorchNLP

Full-Stack Development

FastAPINext.jsReactTypeScriptWebSocketsPostgreSQLSupabaseTailwind CSSMCP ServersJWT Auth

Languages

PythonTypeScriptSQLR

Domain Expertise

Financial ServicesInsurance & ReinsuranceRegulatory CompliancePortfolio Analytics

Experience

2022–present

Manager, Data Science / Pacific Life Re

Shipped three production AI systems: medical research engine, RAG chatbot, and underwriting model for CVD risk factors

2021

Data Scientist / Circles.Life

Built churn, anomaly-detection, and customer-segmentation models across the full ML lifecycle

2017–2020

Data Scientist / Fixed Income Analyst / UOB Asset Management

Built NLP text-mining pipelines and research dashboards for investment idea generation

2015–2017

Fixed Income Analyst / Great Eastern Life

Credit research coverage across investment-grade corporate bonds and loans

2011–2014

Assistant Portfolio Manager / Woori Absolute Partners

Promoted to front-office; firm won 4 regional awards (2012–2014)

Education

2025–2026

Master of Technology in Artificial Intelligence Systems(Expected)

National University of Singapore

CEO's Honour List, AY2025/2026 Semester 1

2014

CFA Charterholder

CFA Institute

2009

BSc Management (First Class Honours)

University of London

Get in Touch

Singapore (UTC+08:00)