2,979 QPS. 0.31ms. 55× cache speedup. From medical RAG to semantic search over 42K papers — I own the full stack: embeddings, vector indexes, FastAPI backends, and containerised MLOps on GCP. Biotech background meets production-grade ML.
Searches 42,549 arXiv papers by meaning, not keywords — and explains why each result matters. FAISS IVFFlat vector index + LRU cache hits 2,979 QPS at 0.31ms p50; Groq LLaMA-3.3-70B generates research summaries in ~1.2s.
University-funded ML pipeline for Multiple Sclerosis genetic risk prediction — a problem where most tools are statistical, not predictive. SNP-derived polygenic features into Random Forest + ensemble models at 87.6% accuracy; SHAP interpretability dashboard used actively by the research team. Containerised on GCP with DVC versioning.
RAG pipeline over 70MB+ of enterprise ticket logs — surfaces answers keyword search can't find. PDF → chunking → SentenceTransformer embeddings → ChromaDB → Groq Llama-4 Scout delivers contextual Q&A; REST API via FastAPI, ops dashboard via Streamlit.
Production medical RAG system shipped at NASO Technologies. Dense embeddings + structured retrieval for health domain Q&A; FastAPI microservices on GCP with Supabase session logging for full conversation auditability — every answer traceable to its source chunks.
95% accuracy across 4 MRI tumor classes (glioma, meningioma, pituitary, no tumor) — built for portable clinical inference. Transfer learning + augmentation pipeline, Dockerised for single-command deployment in any environment.
Open to ML Engineer, GenAI developer and Data Scientist roles — Chennai, Hyderabad, remote, or global relocation. Response within 24 hours, guaranteed.