Desmond
Kao

Software Engineer & AI BuilderData Scientist & QuantClassical Pianist & CreatorNYU Computer Science '27

Building AI systems that solve tangible business problems across NYC's financial and tech landscape — from hedge fund data platforms to LLM-powered research tools. I turn messy data into production-ready solutions.

Desmond Kao
Python
React
TypeScript
PyTorch
TensorFlow
AWS
Docker
FastAPI
PostgreSQL
Snowflake
JavaScript
Azure
Scikit-learn
Kubernetes
Node.js
Pandas
Swift
Firebase
OpenAI
Claude
Python
React
TypeScript
PyTorch
TensorFlow
AWS
Docker
FastAPI
PostgreSQL
Snowflake
JavaScript
Azure
Scikit-learn
Kubernetes
Node.js
Pandas
Swift
Firebase
OpenAI
Claude
Background

About Me

Originally from San Francisco, I'm a Computer Science & Data Science student at New York University building AI systems that solve tangible business problems across NYC's financial and tech landscape.

I've engineered full-stack platforms and ML pipelines for NYC hedge funds, built automated pricing models for real estate companies, and developed NLP tools for academic research centers including NYU's Carter Journalism Institute and Yale.

Beyond engineering, I'm a classical pianist with 16 years of experience and explore creativity through Muay Thai and music production.

Languages
PythonJavaScriptTypeScriptSQLJavaC++C#RMATLAB
Frameworks
ReactNode.jsFastAPIFlaskExpressPySparkStreamlit
AI & ML
PyTorchTensorFlowScikit-learnHuggingFaceLangGraphRAGNLPDeep LearningTime Series
Cloud & Infra
AWSAzureSnowflakeFirebaseDockerKubernetes
Work

Experience

Software Engineer Intern
Apple
Summer 2026
  • Incoming Software Engineer Intern
Software & AI Engineering Intern
AlphaQuest (Systematic Hedge Fund)
June 2025 – Dec 2025
  • Built AI chatbot in Python/FastAPI with Snowflake RAG + OpenAI API, significantly reducing research time for investment staff
  • Developed AI-driven commentary pipeline automating daily report generation, synthesizing news, research, and portfolio data
  • Created full-stack investor relations platform in React/Python automating chart generation, analysis reports, and ad hoc queries
PythonFastAPIReactSnowflakeOpenAI APIRAG
Software & Data Engineering Intern
Catenary Alternatives Asset Management (Quantitative Hedge Fund)
Dec 2024 – May 2025
  • Developed firmwide Flask/SQL research API enabling teams to query live data with dramatically improved retrieval speeds
  • Built autonomous LLM scraping pipeline in Flask/Azure with daily data collection, significantly reducing research time
  • Created AI Excel assistant in Python using Perplexity + Tavily APIs, automating large-scale data entry tasks
PythonFlaskSQLAzureNLP
ML & Data Engineering Intern
Neue Urban
Nov 2024 – Feb 2025
  • Developed ML valuation API in Python, significantly improving property pricing accuracy and reducing appraisal costs
  • Automated end-to-end valuation pipeline from data ingestion to prediction, dramatically reducing review time
  • Integrated live market/listing APIs to enrich model features, improving valuation precision across portfolios
PythonScikit-learnFastAPIMachine Learning
AI Research Engineering Intern
Arthur L. Carter Journalism Institute
Nov 2024 – Mar 2025
  • Built LLM/NLP pipeline analyzing large-scale social media posts for sentiment trends using HuggingFace
  • Developed large-scale document scraper with LLM-based summarization, significantly reducing publication review time
  • Created automated bias detection system evaluating news articles for neutrality and language framing
PythonNLPHuggingFaceWeb Scraping
ML Engineering Intern
Yale University (HP Funded Research)
June 2024 – July 2024
  • Built lightweight LLM with TensorFlow/Keras using pruning and quantization for low-power on-device inference
  • Benchmarked model latency and power consumption across CPU/GPU setups for sustainable local deployment
  • Led ML/AI sustainability workshop for industry professionals on efficient model design techniques
PythonTensorFlowKerasMachine Learning
Selected Work

Projects

01
Compass – AI Financial Dashboard

Equities-focused AI dashboard integrating Grok API for factor report generation. Statistical analysis engine using SciPy for quant metrics and portfolio analytics.

PythonFastAPIGrok APIReactSciPyRAG
Proprietary
02
Hedge Fund Data Intelligence Platform

Firmwide AI chatbot connecting directly to Snowflake for flexible querying across company data. Natural language access to research reports and market data.

PythonFastAPIRAGSnowflakePostgreSQL
Proprietary
03
Iris – Investor Relations Platform

Full-stack IR tool for NYC hedge fund connected to Snowflake. Automated data validation pipelines and client-ready report generation.

PythonReactSnowflakeFastAPIPostgreSQL
Proprietary
04
Cornell Hackathon – AI Email Generator

AI-powered email generator using Pinecone vector DB and OpenAI API. Semantic search across templates with real-time Streamlit dashboard.

PythonPineconeOpenAI APIStreamlitRAG
05
Point72 Hackathon – Traffic Forecasting

Hybrid LSTM–ARIMA model for urban traffic forecasting. Interactive React dashboard visualizing congestion trends as retail/transport indicators.

PythonTensorFlowARIMAReact
06
HP Research @ Yale – Sustainable LLM

Custom lightweight LLM built from scratch with TensorFlow/Keras. Sustainable AI via model pruning, quantization, and efficient training strategies.

PythonTensorFlowKerasStreamlit
07
ML Stock Price Predictor

ML models predicting stock movements after short seller report releases. NLP feature engineering pipeline with logistic regression and deep Q-learning.

PythonScikit-learnDeep LearningNLP
Proprietary
08
2Fish – AI Psychoanalysis App

Published iOS app for AI-driven dream interpretation and journaling. Firebase encrypted backend with SwiftUI/TCA architecture.

SwiftJavaScriptExpressFirebaseReact
09
Intra IQ – Enterprise RAG Platform

Enterprise document intelligence using Claude API and RAG for natural language querying across company knowledge bases.

PythonClaude APIRAGFastAPIReact
10
DataLens – Data Quality Dashboard

ETL monitoring pipeline detecting anomalies across millions of records daily with PowerBI dashboard and LLM-generated summaries.

PythonSQLPySparkPowerBIAzure
Proprietary
11
NLP Web Scraping API

NLP API extracting structured data from unstructured sources using Gemini and Claude APIs. Containerized with Docker for scalable deployment.

PythonGemini APIClaude APINLPFastAPIDocker
Proprietary
12
J&J Hackathon – AI Talent Dashboard

Talent acquisition dashboard using AI agents and blockchain for credential verification. Intelligent matching system for streamlined hiring.

PythonAI AgentsBlockchainReactNLP
Study

Education

New York University
B.A. in Computer Science and Data Science
Sep 2023 – May 2027
GPA: 3.75
Coursework: Data Structures, Algorithms, Database Management, Linear Algebra, Discrete Mathematics, Principles of Data Science, Causal Inference, AI Ethics
Member of BUGS (Open Source Club @ NYU)