Built a real time pipeline using COVID Python library, World Happiness Report, and vaccination datasets. Applied Random Forest, Decision Tree, Naive Bayes, and Ensemble Learning to boost classification accuracy to 92.5%.
View on GitHubDeveloped a NoSQL database with MongoDB on Docker, used Databricks for big data processing, performed data preprocessing with PySpark and Pandas, achieving model accuracy of 80% with Gradient Boosting.
View on GitHubAnalyzed Target's sales decline using surveys and Tableau. Provided actionable insights and strategic recommendations for digital transformation and omnichannel growth.
View on GitHubPreprocessed large scale climate datasets, engineered custom MCP servers to integrate SQLite with LLMs for real time AI driven insights.
View on GitHub