A passionate data analyst, aspiring actuary, and curious AI enthusiast.
I love turning messy datasets into clear narratives, designing risk‑aware models, and experimenting with agentic AI systems that help organizations make smarter, faster decisions.
Recently, I’ve been exploring deep research agents, financial and insurance analytics, and ways to bring production‑ready AI into real products and workflows.
Welcome to my corner of the internet — where analytics, risk, and creative AI experiments meet.
A mixed-methods computational text analysis project examining 337 English-language civil society alternative reports submitted to the UN Committee on the Rights of Persons with Disabilities (CRPD). Reports were collected via Selenium web automation from the OHCHR Treaty Body Database.
Applied term frequency analysis, TF-IDF, bigram analysis, GloVe word embeddings, Named Entity Recognition (spaCy), lexicon-based sentiment analysis (Bing), and LLM-assisted qualitative coding (NotebookLM) to identify policy priorities, regional variation, and intersectional vulnerability patterns across civil society reporting. All analysis conducted in R and Python within a Quarto literate programming environment.
Analyzed the global correlation between Corruption (CPI) and Human Development (HDI). Built a publication-ready, Economist-styled scatter plot using R to highlight governance trends and outliers.
Built a multi-step Deep Research Agent using Pydantic AI and GPT‑5‑mini that orchestrates DuckDuckGo web searches to generate structured, evidence-based reports on stock tickers and complex general topics.
Built regression-based cost models in Python to quantify how age, BMI, number of children, smoking status, and region influence annual medical insurance charges and risk tiers.
Applied multiple regression in R to analyze how genre, viewing hours, global release, and season impact Netflix movie ratings across diverse content.
Built AI-powered platform using Agentic AI with n8n to extract data from relevant sources and deliver personalized CTE & dual-enrollment recommendations for DC students.
Applied Chi-square tests & regression in R to explore statistical relationships between Netflix movie ratings and worldwide availability patterns.
Applied simple linear regression in R to analyze how global availability impacts Netflix movie ratings, revealing statistical patterns in worldwide content performance.
Analyzed 1M+ Spotify tracks in R exploring how danceability, energy, genre, duration, and temporal trends influence song popularity using correlation and regression models.
Exploring the frontiers of data science, actuarial innovation, and AI.
Deep research agents automating financial analysis workflows at scale.
Read moreTF-IDF, NER, and lexicon-assisted qualitative coding of 500+ CRPD reports.
Read moreBridging classical actuarial science with machine learning for risk prediction.
Read more