samwich analytics
Featured Work

Portfolio

A collection of research, analytics, and data engineering projects spanning survey design, statistical modeling, forecasting, and interactive dashboards.

Featured Projects

Growth & Prosperity Summit Survey

Utah Valley Chamber of Commerce  ·  July – November 2024

For this Chamber-commissioned project, I worked with statistics faculty to run a comprehensive public sentiment survey on Utah County priorities. My role spanned the full data lifecycle: designing survey logic in Qualtrics, performing statistical analysis in R, and creating visualizations in Canva. The project culminated in a formal presentation, where I was selected to co-present the findings to an audience of executives and state officials.

  • Co-presented to business executives and elected officials, including the Governor of Utah and a U.S. Senator.
  • Translated complex statistical results into a clear, actionable narrative for non-technical stakeholders.
Qualtrics  ·  R  ·  Data Visualization  ·  Canva

Growth & Prosperity Conference Survey (MaxDiff)

Utah Valley Chamber of Commerce  ·  November 2025

Commissioned for a return engagement, I collaborated with Dr. David Benson and the UVU Statistics Lab to analyze community priorities. I designed and built the MaxDiff (Maximum Difference Scaling) survey instrument in Sawtooth Software to quantify the trade-offs residents make between priorities. After data collection, I used R to run cluster analysis and regression modeling to segment the population.

  • Identified distinct community priority segments through cluster analysis, enabling targeted policy recommendations.
  • Co-presented results to summit attendees; a comprehensive report and infographic are being finalized for publication.
Sawtooth Software (MaxDiff)  ·  R  ·  Cluster Analysis  ·  Regression Modeling
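As a rough illustration of how MaxDiff responses become priority scores, the sketch below uses simple best-minus-worst counting. It is not the estimation run in Sawtooth Software for this project, and the item names and responses are invented for the example.

```python
from collections import Counter


def maxdiff_count_scores(tasks):
    """Best-minus-worst count score per item, scaled by how often it was shown.

    tasks: iterable of (shown_items, best_item, worst_item) tuples.
    Returns {item: (best_count - worst_count) / times_shown}.
    """
    shown, best, worst = Counter(), Counter(), Counter()
    for items, best_item, worst_item in tasks:
        shown.update(items)
        best[best_item] += 1
        worst[worst_item] += 1
    return {item: (best[item] - worst[item]) / shown[item] for item in shown}


# Hypothetical responses: each task shows four priorities and records the
# one picked as most important and the one picked as least important.
example_tasks = [
    (("housing", "transit", "water", "parks"), "housing", "parks"),
    (("housing", "transit", "schools", "parks"), "housing", "transit"),
    (("water", "transit", "schools", "parks"), "water", "parks"),
]

scores = maxdiff_count_scores(example_tasks)
for item, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(f"{item:8s} {score:+.2f}")
```

Scores range from -1 (always picked as least important) to +1 (always picked as most important); per-respondent scores of this kind are one possible input to a cluster analysis.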

CRO A/B Test & Dashboard

&Collar (D2C Men's Dress Shirts)  ·  2026

Built a full statistical analysis pipeline and interactive Power BI dashboard around a simulated 30-day A/B test for a D2C apparel brand. The core analytical question: when a site change lifts conversion rate (CVR) but lowers average order value (AOV), is it actually worth deploying? The project shows how judging a test on a single metric can be misleading.

  • Variant converts 14% more visitors, but each buyer spends 17% less, so the AOV drop outweighs the CVR gain and revenue per visitor falls by about 5%.
  • Deploying the Variant as-is would risk reducing annual revenue by ~$100K at current traffic levels.
  • Breakeven AOV is $67.57: lifting the Variant's AOV by at least $3.87 through upsells or bundles would bring its revenue level with Control while keeping its conversion gain.
Python  ·  Power BI  ·  Two-Proportion Z-Test  ·  Bootstrap Confidence Intervals
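The trade-off between conversion rate and order value comes down to revenue per visitor (CVR × AOV). The sketch below shows that arithmetic and the resulting breakeven AOV. The control CVR and AOV are placeholder values, not figures from the project; only the 14% and 17% changes are taken from the bullets above.

```python
def revenue_per_visitor(cvr, aov):
    """Expected revenue per visitor: conversion rate times average order value."""
    return cvr * aov


def breakeven_aov(control_cvr, control_aov, variant_cvr):
    """AOV the variant needs so its revenue per visitor matches control."""
    return revenue_per_visitor(control_cvr, control_aov) / variant_cvr


# Placeholder control metrics (assumed for illustration only).
control_cvr, control_aov = 0.030, 80.00

# Relative changes quoted in the project summary.
variant_cvr = control_cvr * 1.14   # 14% more visitors convert
variant_aov = control_aov * 0.83   # each buyer spends 17% less

control_rpv = revenue_per_visitor(control_cvr, control_aov)
variant_rpv = revenue_per_visitor(variant_cvr, variant_aov)
target_aov = breakeven_aov(control_cvr, control_aov, variant_cvr)

print(f"Control RPV:   ${control_rpv:.3f}")
print(f"Variant RPV:   ${variant_rpv:.3f}")
print(f"RPV change:    {variant_rpv / control_rpv - 1:+.1%}")
print(f"Breakeven AOV: ${target_aov:.2f} (lift of ${target_aov - variant_aov:.2f})")
```

Because the relative change in revenue per visitor is 1.14 × 0.83 − 1, it does not depend on the placeholder control values; only the dollar amounts do.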

Multi-Channel Inventory Forecasting

Nomatic (Premium Travel Gear)  ·  2026

Built a demand forecasting and inventory planning pipeline addressing a challenge multi-channel brands face: D2C sales are smooth and seasonal, but B2B wholesale orders are erratic — most days have zero volume, then a retailer suddenly orders 1,500 units. A single forecast model can't handle both channels, so this analysis uses different strategies for each and combines them into a unified inventory recommendation with explicit risk trade-offs.

  • D2C forecast achieves 13.3% MAPE; B2B orders cluster at quarter-ends (z = 4.44, p < 0.00001).
  • B2B spike sizes follow a log-normal distribution (KS test p = 0.52), enabling probabilistic modeling.
  • Combined 6-month inventory recommendation: ~42,900 units (base) to ~48,500 units (safety stock).
Python  ·  R  ·  Facebook Prophet  ·  Monte Carlo Simulation  ·  Power BI
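A minimal sketch of how lumpy B2B demand can be simulated: on each day an order arrives with some probability, and its size is drawn from a log-normal distribution. Every parameter here (order probability, `mu`, `sigma`, horizon) is an assumption for illustration, and the flat daily probability ignores the quarter-end clustering found in the analysis.

```python
import random
import statistics


def simulate_b2b_demand(days, order_prob, mu, sigma, n_sims=2000, seed=0):
    """Simulate total B2B units over a horizon, one total per simulation run."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    totals = []
    for _ in range(n_sims):
        total = 0.0
        for _ in range(days):
            # Most days have no order; occasionally a large one arrives.
            if rng.random() < order_prob:
                total += rng.lognormvariate(mu, sigma)
        totals.append(total)
    return totals


def percentile(values, q):
    """Simple empirical percentile (q between 0 and 1)."""
    ordered = sorted(values)
    idx = min(len(ordered) - 1, int(q * len(ordered)))
    return ordered[idx]


# Hypothetical parameters, not fitted to the project's data.
totals = simulate_b2b_demand(days=180, order_prob=0.05, mu=7.0, sigma=0.5)

base_case = statistics.median(totals)   # planning level
safety_case = percentile(totals, 0.95)  # covers 95% of simulated outcomes

print(f"Base (median) demand: {base_case:,.0f} units")
print(f"95th percentile:      {safety_case:,.0f} units")
```

The gap between the median and the upper percentile is one way to express the kind of base versus safety-stock trade-off described above.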

Revenue Growth Management Analysis

Swire Coca-Cola  ·  2026

Applied the full RGM analytical toolkit to 3 years of beverage distribution data (7.8M transactions, 11 product lines, 158 weeks). The analysis covers five pillars: revenue decomposition into volume/price/mix effects, promotional ROI measurement, XGBoost-driven volume driver identification, Monte Carlo pricing simulation across 7 scenarios, and multi-method demand forecasting.

  • 2023–2024 revenue declined 0.9%, driven by volume loss despite stable pricing.
  • Only 1 of 9 products (Core Power) showed positive promotional ROI.
  • Best forecasting method: Prophet (avg MAPE 16.6%); XGBoost volume driver model achieves R² = 0.61.
Python  ·  XGBoost  ·  SARIMA  ·  Prophet  ·  Monte Carlo Simulation  ·  Power BI
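For readers unfamiliar with revenue decomposition, here is a sketch of one common price/volume/mix convention. Several conventions exist and the one used in the project is not specified, so treat this as an assumption; the product names and figures are invented.

```python
def pvm_decomposition(prior, current):
    """Split a revenue change into volume, price, and mix effects.

    prior/current: {product: (units, unit_price)} with identical keys.
    Convention used here (one of several in practice):
      volume = change in total units x prior average price
      price  = change in unit price x current units, per product
      mix    = change in unit share x prior price x current total units
    """
    q0_total = sum(units for units, _ in prior.values())
    q1_total = sum(units for units, _ in current.values())
    r0 = sum(units * price for units, price in prior.values())
    avg_price0 = r0 / q0_total

    volume = (q1_total - q0_total) * avg_price0
    price = sum((current[k][1] - prior[k][1]) * current[k][0] for k in prior)
    mix = sum(
        (current[k][0] / q1_total - prior[k][0] / q0_total) * prior[k][1] * q1_total
        for k in prior
    )
    return {"volume": volume, "price": price, "mix": mix}


# Hypothetical two-product example.
prior = {"cola": (1000, 2.00), "energy": (200, 3.50)}
current = {"cola": (950, 2.00), "energy": (220, 3.60)}

effects = pvm_decomposition(prior, current)
revenue_change = (
    sum(u * p for u, p in current.values()) - sum(u * p for u, p in prior.values())
)

for name, value in effects.items():
    print(f"{name:7s} {value:+9.2f}")
print(f"{'total':7s} {revenue_change:+9.2f}")
```

Under this convention the three effects always sum to the total revenue change, which makes a useful sanity check on any decomposition.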

Menu Performance Analytics

Crumbl Cookies  ·  May 2026

Engineered a multi-source data pipeline integrating web-scraped menu history from CrumblCookieFlavors.com, a flavor catalog, and Reddit sentiment analysis to analyze Crumbl's flavor rotation patterns, category distribution, and popularity signals. Built a predictive model for flavor return likelihood using machine learning.

  • Identified rotation patterns and category distribution trends across Crumbl's flavor history.
  • Random Forest model predicts flavor return likelihood based on rotation history and popularity signals.
Python  ·  scikit-learn (Random Forest)  ·  BeautifulSoup  ·  Reddit API  ·  Matplotlib

Wasatch Cup Healthcare Case Competition

1st Place Winner  ·  October 2025

Collaborated with an interdisciplinary team to secure 1st Place in the 2025 Wasatch Cup, the region's premier healthcare case competition hosted by the BYU Healthcare Leadership Association. Our winning solution proposed a mobile health clinic to expand access to care for underserved populations in Southern Utah. I led the data validation phase, modeling financial outcomes to show that the clinic model was both scalable and financially viable.

  • Built time-series forecasting models to project patient demand and clinic utilization over a 3-year horizon.
  • Successfully differentiated our strategy from top competing teams across the state, winning the grand prize.
Python  ·  ARIMA  ·  Prophet  ·  Financial Modeling

MLB Payroll Efficiency & Containerized Data Warehouse

Personal Project  ·  February 2026

Engineered a fully reproducible, containerized PostgreSQL data warehouse using Docker to analyze the historical Lahman Baseball Database (1871–2025). Designed a strict Third Normal Form (3NF) relational schema with composite primary keys and foreign key constraints. Built an automated ETL pipeline that creates the schema and ingests the raw CSV data automatically on container startup.

  • Developed a post-Moneyball 'Cost Per Win' metric showing that the Oakland Athletics achieved near-parity in wins at less than 1/3 of the Yankees' payroll.
PostgreSQL 16  ·  Docker Compose  ·  SQL (CTEs, Window Functions, 3NF Design)
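The sketch below illustrates the shape of a 'Cost Per Win' query using a CTE over tables with composite keys. It runs on an in-memory SQLite database as a stand-in for the PostgreSQL warehouse; the table names, columns, teams, and figures are all hypothetical and do not come from the Lahman data or the project's schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Hypothetical simplified schema with composite primary keys.
    CREATE TABLE team_season (
        team TEXT, year INTEGER, wins INTEGER,
        PRIMARY KEY (team, year)
    );
    CREATE TABLE salary (
        team TEXT, year INTEGER, player TEXT, amount REAL,
        PRIMARY KEY (team, year, player),
        FOREIGN KEY (team, year) REFERENCES team_season (team, year)
    );
    -- Made-up rows for illustration only.
    INSERT INTO team_season VALUES ('Team A', 2000, 100), ('Team B', 2000, 100);
    INSERT INTO salary VALUES
        ('Team A', 2000, 'p1', 20000000), ('Team A', 2000, 'p2', 20000000),
        ('Team B', 2000, 'p3', 70000000), ('Team B', 2000, 'p4', 50000000);
    """
)

# The CTE aggregates payroll per team-season, then divides by wins.
query = """
WITH payroll AS (
    SELECT team, year, SUM(amount) AS total_payroll
    FROM salary
    GROUP BY team, year
)
SELECT t.team, t.year, p.total_payroll / t.wins AS cost_per_win
FROM team_season AS t
JOIN payroll AS p ON p.team = t.team AND p.year = t.year
ORDER BY cost_per_win
"""

rows = conn.execute(query).fetchall()
for team, year, cost_per_win in rows:
    print(f"{team} {year}: ${cost_per_win:,.0f} per win")
```

In this toy data both teams win the same number of games, so the lower cost per win reflects payroll alone, which is the comparison the metric is meant to surface.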

Additional Projects

Big 12 Healthcare Case Competition
Best Presentation Award: structural equation modeling and forecasting for healthcare case competition.
SEM  ·  Forecasting  ·  Data Visualization

Baseball Game Theory Engine
Nash equilibrium analysis of optimal first-pitch swing strategy using 2024 MLB Statcast data.
R  ·  Quarto  ·  Statcast  ·  Lahman DB

Program Learning Outcomes NLP Analysis
NLP-driven audit of curriculum alignment across degree programs, deployed as interactive dashboards.
Python  ·  R  ·  NLP  ·  Power BI

Learning Outcome Hierarchy Analysis
Maps hierarchical relationships between degree programs and courses to evaluate curriculum alignment.
NLP  ·  Power BI  ·  Semantic Analysis

Student Engagement Research Paper
Assessed student engagement using the validated Utrecht Student Engagement Scale; contributed to formal manuscript.
Qualtrics  ·  R  ·  UWES Scale

Student Engagement Dashboard Migration
Migrated proprietary analysis logic from Tableau to Power BI with complex DAX queries.
Tableau → Power BI  ·  Advanced DAX

Online Program Information Dashboard
Multi-page Power BI dashboard for university online program performance metrics.
Python  ·  Power BI  ·  Data Wrangling

Survey Software Comparison Experiment
Comparative usability study commissioned by Sawtooth Software, scaling to a 300-student cohort.
Sawtooth  ·  Qualtrics  ·  SurveyMonkey  ·  R

Midway Bond Report
Public sentiment analysis for a municipal bond proposal; bond successfully passed after survey-guided messaging.
Qualtrics  ·  R  ·  Canva

Multi-Agent LLM for Faculty
Custom agentic architecture enabling faculty to query institutional datasets through natural language.
Python  ·  LangGraph  ·  Ollama

AutoEDA R Package
R package generating self-contained interactive HTML exploratory data analysis reports with light/dark theming.
R (Plotly, DT, Quarto)  ·  HTML/CSS

Stock Market Prediction & Analysis
Ensemble forecasting models and structural equation modeling for equity price prediction.
Python (yfinance)  ·  XGBoost  ·  Prophet  ·  SEM

Prisoner's Dilemma Behavioral Experiment
Six-variant experimental design analyzing human-computer interaction persistence; currently on third iteration.
Custom Web Interface  ·  R  ·  Excel

Call of Duty Geospatial Analysis
Parsed Warzone Caldera map telemetry files to identify optimal landing zones by survival time.
Python  ·  ETL  ·  Heat Mapping