AI/ML interested Soul. I build intelligent systems where uncertainty is not avoided but constrained. BSc in IT specializing in Data Science (UG)
Full-stack air quality analytics platform built with FastAPI, React, and MySQL. Aggregates multi-source PM2.5/PM10 data, performs multi-city comparison and time-series forecasting (SARIMAX), and integrates an LLM-based planning agent with tiered access, secure APIs, and PDF reporting.
Machine learning system for predicting genetic disorders using genomic, clinical, and demographic data. Implements robust preprocessing, feature selection, and multi-model classification (RF, XGBoost, LightGBM, CatBoost) with cross-validation to support early, data-driven genetic risk assessment.
Advanced SQL analytics project extending prior EDA work. Includes change-over-time, cumulative trends, performance benchmarking, segmentation, part-to-whole analysis, and customer/product analytical reporting using window functions and real-world data warehouse logic.
Building a modern data warehouse with Microsoft SQL Server, including ETL processes with Bronze Layer, Silver Layer and the Gold Layer, data modeling and as well as analytics.
SQL-based EDA project exploring a retail data warehouse. Includes database setup and analysis scripts for data exploration, dimension profiling, date range discovery, measures analysis, magnitude breakdowns and ranking using aggregates and window functions.
End-to-end Azure data engineering pipeline ingesting real-time earthquake data from the USGS API. Implements a Bronze–Silver–Gold lakehouse using Azure Data Factory, Databricks, ADLS Gen2, and Synapse Analytics, with both manual execution and fully automated daily-triggered workflows.