Gabrielle Epelle

Data Analyst: Python, SQL,Tableau View My Work

About Me

Hi

I’m Gabrielle, a dedicated professional excited to launch my career in data analytics and data science. With a strong foundation in mathematics and economics, combined with hands-on experience in coding, statistics, and data-driven problem solving, I’m passionate about transforming raw data into actionable insights that drive smarter business decisions. Outside of work, I enjoy reading, writing, and doing digital artwork. I am currently seeking opportunities in data analytics and data science.

Check out some of my work in the projects section.

Skills

My Projects

HIV in America: Diagnosis Rate Trends by State and Demographic

This dashboard explores U.S. HIV diagnosis rate trends from 2008 to 2023 using CDC AtlasPlus data. The analysis examines geographic distribution, demographic patterns, and national trends over 15 years. Use the Year slider and Age Group filter to explore how diagnosis rates vary across states and demographics. Data was cleaned and analyzed in BigQuery (SQL) prior to visualization. Full analysis available on GitHub.

Medicare Provider Anomaly Detection

This project analyzes 9.6 million Medicare billing records across 1.17 million providers to identify anomalous billing patterns that may indicate fraud, waste, or abuse. Providers are scored using z-scores across three metrics and cross-referenced against CMS's list of revoked Medicare providers. Full analysis available on GitHub. Tools: SQL · Amazon Athena · AWS S3 Data Sources: CMS Medicare Physician & Other Practitioners (data.cms.gov), CMS Revoked Medicare Providers and Suppliers (data.cms.gov)

Maternal Mortality Rate (per 100, 000 births)

An interactive Tableau dashboard analyzing maternal mortality trends over time, segmented by age and race/ethnicity.

Healthcare Data Breaches

A tableau dashboard showing recent data breaches in the healthcare sector broken down by state, type, and location.

Predicting Diabetes

Participated in Kaggle's Playground Series - Season 5, Episode 12 competition to develop a binary classification model to predict diabetes onset using a dataset which included features such as cholesterol, BMI, age, triglycerides, and blood pressure.

Predicting Student Test Scores

Participated in the Kaggle Playground Series (Season 6, Episode 1) competition to predict student exam scores using a synthetic dataset containing demographic, academic, and lifestyle features (such as study hours, sleep quality, attendance, parental involvement, and more). Conducted exploratory data analysis, performed feature engineering, handled categorical variables, and built regression models to minimize Root Mean Squared Error (RMSE)

Medicare Telehealth Utilization Analysis

Analysis of Medicare telehealth utilization patterns across the United States using CMS (Centers for Medicare & Medicaid Services) data from 2020-2024. This project explores how telehealth usage varies by geography, demographics, and patient characteristics following the COVID-19 pandemic expansion of telehealth services.

Data Cleaning and Analysis done with Python/Pandas.

Say Hello!