Neural Network of Me

I am an M.Tech student in Data Science and Artificial Intelligence at IIT Madras, driven by a passion for turning raw data into actionable intelligence. My journey ranges from high-altitude schooling in Kargil to completing my B.Tech in CSE at NIT Silchar.

Today, my work focuses on Machine Learning, Deep Learning, and Generative AI—specifically LLMs, RAG, and Agentic AI systems. I specialize in building end-to-end Python workflows, from data preprocessing and feature engineering to deployment-oriented pipelines. Ultimately, my goal is to bridge AI research and ML engineering, ensuring models don't just work in Jupyter notebooks, but thrive in production.

  • AI/ML: PyTorch, TensorFlow, Hugging Face, Scikit-Learn
  • MLOps/Tools: Docker, Streamlit, FAISS, LangGraph, DVC, Git
  • Data: SQL, MongoDB, Pandas, NumPy, OpenCV
  • Languages: Python, C++

Research & Lab

MLOps DVC Transfer Learning

Cattle Muzzle Biometrics

A fully reproducible biometric identification pipeline using DVC. Built a 4-stage automated workflow (Prepare → Transform → Train → Evaluate) achieving 0.92 Macro F1 using MobileNetV2 with TTA.

> dvc repro
> dvc metrics show (F1: 0.9211)
Source & Pipeline
GenAILangGraphRAG

Agentic RAG Chatbot

Developed an agentic Retrieval-Augmented Generation (RAG) system for multi-turn, document-grounded question answering. Orchestrated intelligent agents using LangGraph and integrated FAISS vector search.

View GitHub
Data Analytics Seaborn

Movie Success Visualization

Explored factor-driven success in TMDB metadata. Managed data cleaning by treating 0-values as NaN and used Log-transformations to uncover hidden trends in right-skewed revenue data.

Validated: Budget ↔ Revenue linkage.
Debunked: Budget ↔ Rating correlation.
View Analysis
Healthcare AI Scikit-learn

Heart Disease Prediction

Implemented a Logistic Regression classifier to predict heart disease likelihood. Leveraged health markers like cholesterol and BP to enable early medical intervention.

Achieved 85.12% Test Accuracy.
View Repository
NumPy Scratch W&B Logging

NumPy MLP from Scratch

Configurable MLP engine built using pure NumPy. Features backpropagation, Xavier init, and 6 optimizers (Adam, Nadam, etc.). Tracked experiments via Weights & Biases.

MNIST Acc: 97.8% | Fashion-MNIST F1: 0.88
View GitHub
Metric Learning ResNet-MLP

Metric Learning Prediction

Deep ResNet-MLP pipeline for prompt-response fitness scoring. Features KL Histogram Loss for matching score distributions and SWA/EMA for training stability.

High-Dim Feature Vector (3073D)
Explore Pipeline
Ensemble Learning Stacking

Bike Demand Prediction

Ensemble regression suite benchmarking Bagging, Boosting, and Stacking. Captures temporal patterns via Sine/Cosine cyclic feature encoding.

RMSE: 42.18 (Extended Stacking) vs baseline (100.45).
View Notebook

Initiate Connection

yaqoobnits25@gmail.com

+91 9395739252

Download Resume