Samantha Pease

Machine Learning Research and Engineering · Ph.D. in Mathematics

GitHub LinkedIn Google Scholar Email Resume

I’m Samantha Pease, an NYC-based machine learning researcher and engineer focused on safety and reliability. I earned my Ph.D. in Mathematics at Rutgers University–Newark, where my background was in pure math, specifically the Langlands program. This site is a home for my research, projects, teaching experience, and project write-ups.

Research

My research is broadly motivated by the goal of ensuring that society's growing dependence on AI goes well. I am particularly interested in understanding how these systems work internally—mechanistically and behaviorally—as a foundation for more effective alignment, monitoring, and control. My recent work studies unlearning methods through finetuning as a mechanism for reducing the WMDP-benchmark risks of open-source models. Read the full write-up: Free Recovery: Does LLM Unlearning Actually Remove Knowledge?

Mathematics

Previously, my research focused on mathematics, particularly the local side of the Langlands program and modular forms. My Ph.D. advisor was Chen Wan. My dissertation investigates the local Gan-Gross-Prasad conjecture for tempered representations of GGP triples associated with general spin groups, extending earlier work by Waldspurger for special orthogonal groups and Beuzart-Plessis for unitary groups. Working in both the p-adic and Archimedean settings, I follow the structural framework developed for the unitary case, exploiting shared features of the root systems and Levi subgroup decompositions of general spin and special orthogonal groups. The central result is a multiplicity-one theorem for GGP triples, established via a local trace formula that connects the multiplicity m(π) to spectral and geometric expansions of a certain distribution.

In my undergraduate thesis I studied Siegel Eisenstein series. Ichino has shown that certain values of some symmetric square L-functions can be expressed as the inner product of a Saito-Kurokawa lift. It is also known that there is a pullback formula which expresses Saito-Kurokawa lifts in terms of genus one cusp forms. We can then take advantage of the finite dimensionality of the vector space of cusp forms to calculate our desired value with simple linear algebra. This was done in Sage Math.

Papers

Presentations

Computing Values of Symmetric Square L-Functions using Ichino’s Pullback Formula

Projects

BARSElo: Player Rating System for Recreational Dodgeball

I built a machine learning system to estimate individual player skill in recreational dodgeball despite heavy team effects. Using ~790 historical games, I developed a Bradley-Terry model with Gaussian margin-of-victory extensions and Bayesian uncertainty for sparse-data players. The approach separates individual skill from team composition through batch optimization, validates predictions using both chronological and "new-team" evaluation modes, and achieves 56.8% accuracy on unseen team compositions. The interactive explorer features player skill trajectories, searchable databases, team comparisons, and model visualizations.

Trans Advice Agent

I built a retrieval-augmented QA system to surface community-sourced information about transgender healthcare. Users enter free-text questions and get concise, document-grounded summaries with links to the forum posts and resources that informed the answer; each response also includes a short confidence note. This is for information discovery, not medical advice.

Implementation highlights: semantic embeddings (all-MiniLM-L6-v2) stored in a FAISS index (IVFPQ) for memory-efficient nearest-neighbor retrieval, aggregation and summarization with Claude Haiku, orchestration and monitoring via LangChain/LangSmith, and a FastAPI backend deployed on Render. The index only contains public/community-contributed documents and the design emphasizes provenance and privacy. Future work focuses on broader curation, community evaluation, and bias/quality checks.

Instagram Network Analysis

This project explores the structure of an individual's Instagram social graph by identifying and analyzing mutual connections among followed users. I primarily looked at the data for my own Instagram network but also examined my sister's and roommate's networks for a comparison point. Future work would include comparing more diverse networks. The pipeline integrates web scraping and network science to uncover community structures, central users, and hidden patterns in online social behavior, and graph neural networks to predict potential future connections.

Cat Identification with Neural Network From Scratch

I built a deep neural network from the ground up to explore how architecture, activation functions, learning rates, and other parameters affect performance. Starting with a small cat image dataset and later pivoting to a binary classification task using CIFAR-10 (Cat vs. Not Cat), I encountered firsthand the limitations of deep neural networks for image classification without convolutional layers. Despite experimenting with network depth and size, I found minimal accuracy gains—highlighting the importance of architecture over brute-force tuning. All training was done on CPU, further emphasizing hardware constraints in model development.

Training results from the from-scratch neural network experiments

Topology as a Tool to Differentiate Canopy Architecture in North American Forests

With Eva Arroyo and Nikita Zemlevskiy

In this project, we used topological data analysis (TDA) to classify forest types based on LiDAR-derived canopy height models from four ecologically distinct U.S. forests. We extracted persistence diagrams—summaries of geometric features at multiple scales—using both 1D and 2D sublevel set filtrations, then transformed these into feature vectors for classification using support vector machines (SVM). Our results showed that 1D persistence, which captures broader structural trends in canopy architecture, outperformed 0D persistence in distinguishing between forest types. This work demonstrates how persistent homology can uncover meaningful ecological differences in complex spatial datasets.

Summary of persistence-based forest classification results

Remote Wind Turbine Monitoring System

With Martin Cala

In the summer of 2017, I volunteered with WindAid, an international NGO based in Trujillo, Peru. I began by working in the engineering workshop, helping to construct a wind turbine (pictured below) for a rural home with no access to electricity. Later, another volunteer and I initiated R&D on a prototype monitoring system designed to remotely report windspeed, power generation, and battery capacity for installed turbines. The goal was to provide engineers with diagnostic feedback, as turbines were often located in remote areas and maintained by users with limited technical knowledge. We successfully developed a functional prototype and handed it off to WindAid’s permanent engineering team for continued development.

Wind turbine constructed with WindAid in Trujillo, Peru

Teaching Experience

Past Teaching (Rutgers)

Academic Year 2025–2026: Tutor in the Rutgers Tutoring Center
Academic Year 2024–2025: TA for large lecture of PreCalculus MATH 114
Academic Year 2023–2024: TA for large lecture of College Algebra MATH 109
Summer 2023: Applied Calculus MATH 119
Spring 2023: TA for large lecture of College Algebra MATH 109
Fall 2022: TA for large lecture of Applied Calculus MATH 119
Spring 2022: Calculus 1 MATH 135
Fall 2021: TA for large lecture of PreCalculus MATH 114
Summer 2021: Applied Calculus MATH 119
Academic Year 2020–2021: Tutor in the Rutgers Tutoring Center

Past Teaching (Duke)

Fall 2017 through Spring 2020: Tutor in the Math Help room for Linear algebra (applied and proof based) and Multivariable Calculus

Interests

In my free time I am very active in various sports leagues and my local queer community. I currently play dodgeball and kickball.

Playing dodgeball in a recreational league

With teammates at a recreational sports league game

I also love the outdoors and try to get out as often as I can! My most recent trips have been to Sedona, Arizona and Banff in Canada.