UC Berkeley · Data Science & Computer Science

Janelle Sampson

Machine Learning  ·  Software Engineering

Scroll

About

I work at the intersection of full-stack software engineering and data science. I design the models, build the data pipelines, and develop the user interfaces that bring them to production. Lately, that’s meant engineering machine learning systems, crafting data visualizations, and architecting clean code built to last.

Education

University of California, Berkeley Expected December 2027
Bachelor of Arts - Data Science
MinorComputer Science
Santa Monica College June 2023
Associate of Science - Computer Science
HonorsMagna Cum Laude / High Honors · Phi Theta Kappa

Curriculum

Relevant Coursework

CS 61AStructure & Interpretation of Programs
CS 61BData Structures
CS 61CMachine Structures
CS 170Algorithms
CS 186Database Systems
CS 189Introduction to Machine Learning
DATA 8Foundations of Data Science
DATA 100Principles & Techniques of Data Science
DATA 140Probability for Data Science
MATH 54Linear Algebra & Differential Equations

A line through my work

Projects

June 2026

Interactive Portfolio

This site. A from-scratch, single-page portfolio in HTML, CSS, and JavaScript: a scroll-driven full-page section deck with a custom galactic theme and hyperspace transitions, and CSS-animated UI, built with no frameworks.

JavaScriptHTMLCSS
May 2026

Build Your Own World (BYOW)

An Animal Crossing themed 2D island-world engine in Java built from-scratch — seeded procedural generation for reproducible worlds, BFS pathfinding across a 4,000-tile grid, and a full game loop with undo/redo, save/load, music, and a live HUD.

JavaPsuedorandom GenerationBFSFull-Stack
View on GitHub →
April 2026

Housing Price Prediction & Fairness Analysis

An end-to-end scikit-learn regression pipeline over 200K+ Cook County records reaching 0.61 log-RMSE, with from-scratch k-fold CV and a custom MAPE loss that exposed bias standard metrics hid.

Pythonscikit-learnPandasNumPyFeature Engineering
View on GitHub →
April 2026

WordNet Semantic Search Engine

A full-stack semantic search engine traversing an 82K-relationship WordNet graph via BFS, indexed into HashMaps/TreeSets at startup for O(1) lookups and NGram-ranked multi-word queries.

JavaGraphsFull-StackHTMLJavaScript
View on GitHub →
March 2026

Logistic Regression Spam Classifier

A logistic-regression spam classifier in scikit-learn with 60 engineered features that lifted accuracy from 76% to 94%, tuned via 5-fold GridSearchCV and judged on precision, recall, and false-positive rate.

Pythonscikit-learnPandasRegressionModel Training
View on GitHub →
March 2026

Percolation

A union-find percolation simulation on an N×N grid, built test-first and paired with Monte Carlo trials to estimate the percolation threshold.

JavaUnion-FindMonte Carlo
View on GitHub →
November 2025

Climate Change

A Python analysis of historical temperature and precipitation data, building 99% bootstrap confidence intervals to surface warming trends and running permutation tests to assess whether drought years differ in rainfall.

PythonNumPyBootstrap SamplingHypothesis Testing
View on GitHub →
November 2024

ChatsApp

A real-time client–server chat app in JavaFX over TCP sockets, with a dedicated reader thread marshaling updates via Platform.runLater and a styled, auto-scrolling bubble interface.

JavaJavaFXSockets
View on GitHub →

Toolkit

Skills

Languages
C++CJavaPythonSQLJavaScriptHTMLCSS
Tools
PyCharmGitGitHubIntelliJ IDEAJavaFXVisual StudioEclipse
Libraries
PandasNumPyMatplotlibSeabornSciPyScikit-Learn

Contact

Let's build.