Projects

Ensemble Rainfall Prediction Model

UBC (Mar 2025)
Designed an ensemble machine learning model for daily rainfall prediction using Australian climate datasets.
Deployed a Flask REST API on AWS EC2, using S3 and EMR Spark for scalable data handling.


Ads Analytics Dashboard

UBC (Feb 2025)
Developed an interactive web dashboard (Dash + Flask) to analyze Google Play Store app data for advertisers.
Enabled filtering by installs, categories, and ratings, with visual comparisons using Altair.
Integrated caching, production server deployment (Gunicorn), and live interactivity.
GitHub Repository


Datpro – Data Profiling Package

UBC (Jan 2025)
Built a modular Python package for automated data cleaning, anomaly detection, and EDA visualization.
Supported plots (boxplots, heatmaps, histograms) using Altair; included anomaly detection via IQR and percentile-based methods.
Integrated CI/CD using GitHub Actions, pytest for testing, and ReadTheDocs for deploying the package documentation.
GitHub Repository


Age Group Predictor

UBC (Dec 2024)
Created a containerized ML pipeline (Docker + Makefile) to classify age groups from NHANES survey data.
Designed modular architecture aligned with MLOps principles, enabling reproducible training and deployment.
GitHub Repository