Portfolio
All of my projects, including coursework, tagged by the tools, methods, and domains involved. Click any keyword, or search, to see all the work that uses it.
M.S. thesis: LLM multi-agent simulations of five real research teams (~24K utterances) with a four-layer fidelity framework and a 55-metric NLP evaluation suite. Under review at EMNLP 2026.
Samsung Electronics AI research: layer-wise merge strategies and a scalable safetensors pipeline that lift language-benchmark scores while preserving multimodal performance.
Reproduced and verified the SLIViT vision-transformer experiments from the paper and open-source code, analyzing model structure and performance across settings.
CLIP embeddings with k-NN retrieval and Qwen2-VL classification on CIFAR-10, with retrieval-quality checks against a random baseline.
Analysis and modeling of U.S. House members' stock trades: data collection, missingness analysis, hypothesis testing, and a Random Forest party-prediction model (99% accuracy) with a fairness permutation test.
Predicting used-car prices from vehicle attributes with feature engineering, exploratory analysis, and regression model selection.
Comparing preprocessing techniques and classifiers (logistic regression, LDA/QDA, SVM, random forest) on 4,601 emails. Built in R.
Geospatial analysis of traffic-crash and census data to recommend hospital locations for high-risk areas, with the recommendations compared against actual hospital locations.
Implemented gradient boosting (XGBoost-style) from scratch with gradient- and Hessian-based optimization, matching the library's performance under the same hyperparameters.
Case study joining 2023 Census demographics with LODES employment data to analyze where people live versus where they work, with data collected from public APIs and statistical modeling.
Samsung Biologics internship: market and competitor analysis to support strategy. Automated workflows cut recurring manual effort by over 90%, and findings were communicated to stakeholders.
Suntek Systems: built Tableau dashboards from customer data that cut reporting time by over 50% and delivered data-driven insights to align the product with customer needs.