Available · Full-time ML, 2026

Machine learning engineer, working on
production AI that
actually ships.

§ Précis

I work the unsexy parts of LLM products — retrieval, evals, observability, cost routing — and ship them to production with measurable wins. Currently at AriesView; M.S. Machine Learning at Stevens, class of '26.

Resume ↓
Currently AriesView · Boston, MA
Studying M.S. ML, Stevens '26
Based New York, NY
Fig. 011 : 1
Portrait of Rishi Chhabra

Most teams know how to call an API. The hard parts are retrieval that doesn't lie, evals that don't drift, and bills that don't grow exponentially.

That's where I work. Production RAG with hybrid + graph retrieval, agentic systems wired up via MCP, semantic routers that cut cloud spend by orders of magnitude, and CI/CD gates that block hallucinations before they ship.

Selected work.

04 of 04 shown

Enterprise MCP Server for Agentic LLMs

Open-source Model Context Protocol server. Exposes enterprise APIs to LLM agents with secure tool execution: OAuth2, token-bucket rate limiting, 500 req/min sustained. One-command Docker deployment dropped agent integration setup time by 70%.

Year2025
RoleAuthor
StackPython · FastAPI · MCP · Claude API · OAuth2 · Docker

Cost-Aware Semantic Gateway for LLM Routing

Real-time prompt complexity classifier (<15ms) routing simple queries to local Llama 3.2 (vLLM) and complex tasks to GPT-4o. FinOps dashboard tracks per-query token cost and routing decisions. Cut cloud inference spend by 65%.

Year2025
RoleAuthor
StackLangGraph · vLLM · Llama 3.2 · GPT-4o · FastAPI

Automated LLM Evaluation & Observability Pipeline

LLM-as-a-judge CI/CD gate using DeepEval — scores Faithfulness, Answer Relevance, Contextual Precision. Auto-blocks updates scoring below 95% on anti-hallucination benchmarks. LangSmith tracing across multi-agent chains; debugging time down 40%.

Year2025
RoleAuthor
StackDeepEval · LangSmith · GitHub Actions · OpenAI API

LLM-Powered Hedge Fund Analyst Agent

Autonomous multi-agent equity analyst ingesting SEC 10-K/10-Q filings, real-time prices, and earnings transcripts. LangGraph orchestration with sub-agents for DCF valuation, FinBERT sentiment, and moat assessment. Neo4j knowledge graph for cross-company comparative analysis. Generates cited Buy/Hold/Sell memos with confidence scores in under 30 seconds — cutting manual research time by 80%.

Year2025
RoleAuthor
StackLangGraph · Claude API · SEC EDGAR API · yfinance · FinBERT · Neo4j · FastAPI · Docker

Experience.

09 / 2025
Present
AriesView
Boston, MA
AI/ML Engineer Intern
Architected production RAG: Weaviate Hybrid Search + Neo4j graph retrieval. +40% accuracy, −30% hallucination.
Designed semantic chunking + Cohere re-ranking; CI/CD on Kubernetes; −30% latency via Redis caching.
02 / 2024
08 / 2024
Incuwise
Delhi, India
Software Development Engineer I
Re-architected monolithic backend → serverless AWS (Lambda, API Gateway). +60% scalability, 99.9% uptime, 10k+ users.
Shipped 4 production apps. −15% API p95 response time, +25% engagement.

Stack.

Languages
Python SQL C++ JavaScript
ML & GenAI
PyTorch TensorFlow LangChain LangGraph CrewAI RAG / GraphRAG LoRA / QLoRA vLLM Hugging Face
LLMs & Agents
GPT-4o Claude API Llama 3.2 DeepSeek R1 Groq Ollama MCP
Cloud & MLOps
AWS SageMaker AWS Bedrock Lambda EC2 GCP Vertex AI Azure Databricks Docker Kubernetes Terraform MLflow
Data & Vector DBs
Weaviate Pinecone Neo4j ChromaDB MongoDB PySpark Snowflake BigQuery

§ Education

Stevens Institute of Technology
M.S. Machine Learning · Class of 2026
Hoboken, NJ
Central University of Haryana
B.Tech. Computer Science · 2024

§ Certifications

AWS Certified ML Engineer2025
Databricks Gen AI Engineer2026
Databricks Spark Developer2025
AWS Certified Developer2025