Data Science
Predictive modeling, forecasting, segmentation, churn analysis, evaluation, dashboards, and insight communication.
I'm Neehanth Reddy, a Data Scientist and AI/ML builder with 2+ years of applied experience across predictive modeling, forecasting, machine learning pipelines, biomedical AI research, and production-style AI applications.
I position my work around business-facing data science: understanding the problem, building measurable models, validating results, and packaging outputs so stakeholders or systems can actually use them.
Predictive modeling, forecasting, segmentation, churn analysis, evaluation, dashboards, and insight communication.
Reproducible training workflows, model APIs, Dockerized delivery, CI/CD, DVC, and AWS deployment patterns.
RAG pipelines, retrieval benchmarking, structured LLM workflows, agentic systems, and tool-aware application design.
Built segmentation, association-rule mining, and SARIMA forecasting on 10K+ transactions to support marketing and supply-chain decisions.
validation accuracy · 0.81 F1
Created a churn prediction pipeline for 7,000+ customer records with feature engineering, class balancing, and model comparison.
Built an interactive Power BI dashboard on 7,043 telecom customers to quantify churn rate, revenue exposure, and high-risk segments.
Built reproducible EEG workflows for sleep spindle detection and motor imagery classification with leakage-aware evaluation.
Developed a VGG16 transfer-learning classifier for four poultry disease classes and packaged it as a deployment-ready inference API.
Built a RAG-based research assistant over arXiv papers with retrieval-backend benchmarking and end-to-end quality evaluation.
LangGraph · Pydantic · LangSmith
Built a LangGraph-based workflow that evaluates resume-job fit through structured extraction, gap analysis, deterministic score calibration, and recommendation generation.
Built a FastAPI inventory system with PostgreSQL, MongoDB audit logs, JWT auth, and a conversational agent for natural-language operations.
Built Dockerized single-node and multi-node Spark environments and benchmarked PySpark ETL performance on NYC Taxi data.
Building reproducible biomedical ML workflows for EEG signal processing, sleep spindle detection, and motor imagery classification.
Supported graduate-level coursework in applied statistics, machine learning, and biostatistical methods through hands-on coding templates and mentoring.
Worked on supervised ML modeling, data preprocessing, workflow automation, and early-stage REST integration for internal classification use cases.
University of Memphis · Jan 2024 — Dec 2025
GPA: 3.93 / 4.00JNTU Hyderabad · Jul 2020 — May 2023
Foundation in signals, DSP, AI, DBMS, and embedded systems.