I build production-grade ML systems — end-to-end, from data pipelines to deployed products. Currently building PaperLens, a semantic search engine over arXiv papers.
Semantic search engine over arXiv research papers — search by meaning, not keywords. Built end-to-end: arXiv ingestion pipeline → sentence-transformers embeddings (384-dim) → FAISS IVFFlat vector index → FastAPI backend with LRU caching → Groq LLaMA-3.3-70b RAG pipeline for structured AI summaries → Streamlit dashboard.
Production RAG system over 70MB+ enterprise ticket logs. Full pipeline: PDF parsing → chunking → SentenceTransformer embeddings → ChromaDB vector store → Groq Llama-4 Scout for contextual QA. Exposed via FastAPI with Streamlit dashboard.
Medical RAG pipeline with structured retrieval, chunking, and dense embeddings for domain-specific health recommendations. FastAPI microservices deployed on GCP with Supabase session logging for full traceability.
CNN achieving 95% accuracy on MRI brain tumor classification across 4 classes. Preprocessing pipeline with augmentation and normalization, containerised clinical inference interface with Docker.
Master of Computer Applications — University of Madras, 2023–2025. GPA: 7.3/10
Open to full-time ML Engineer roles — Chennai, remote, or relocation. Feel free to reach out.