Hi, my name is
Tom Gause
I'm an AI nerd with a passion for exploring the boundaries and ethics of technology. Currently, I'm building computer vision and large language applications. Leveraging data and machine learning algorithms, I strive to solve real-world problems and make a lasting impact. Welcome to my corner of the internet where I share my projects and insights.
01. 
About Me
I'm a Data Scientist at Leverege, focused on developing a diverse range of machine learning models and natural language applications for customers like Discount Tire, Schnucks, and PetSmart. As a Middlebury College graduate with a double major in Computer Science and Mathematics, I've honed my skills in full-stack engineering at ASCC and machine learning research at Sekeh Lab. Passionate about the power of effective communication, I've previously served as a public speaking coach. My goal is to blend technical expertise with strong communication skills to drive innovation and sustainability.

In my free time, I enjoy running marathons, reading, and pursuing my dreams of bluegrass superstardom.

Technologies I've recently worked with include...
  • Python
  • Tensorflow
  • GCP / Vertex / BigQuery
  • R
  • PyTorch
  • Node
02. 
Work Experience
Leverege
Data Scientist
September 2022 - Present
Lead data scientist developing computer vision applications for cloud and edge.
• Engineered SOTA auto-labeling and ModelOps tools, accelerating model development flow and reducing personnel costs.
• Enhanced model efficacy through advanced techniques like Genetic Algorithms, Hyperparameter Optimization, and adaptive model architecture adjustments, including graded learning rates and selective head freezing.
• Architected, trained, and validated custom vision models from scratch.
• Monitored and maintained production models against degradation and drift.
• Built critical monitoring dashboards, saving thousands a month in cloud costs and exposing breaking software bugs.
• Constructed and deployed a Retrieval Augmented Generation tool making company documentation more discoverable and reducing personnel hours.
• Represented data team in customer on-sites and conclusive presentations leading to pilots, jumpstarts, and product rollouts.
Advanced Structures and Composites Center
Full Stack Developer
September 2020 - September 2021
Full stack dev for secure internal website and software dev for ML lab. Responsibilities included building webpages and data GUIs, SQL database migration and management, and constructing and executing unit tests on key landing pages. See public website here.
Sekeh ML Lab
Research Assistant
March 2021 - September 2021
Developed a robust multimodal deep-fusion autonomous vehicle model for object detection in adverse weather conditions. See project details here.
03. 
Projects
CNN/LSTM Seasonal Forecasting
Data consultant with community partner ISciences to improve NOAA seasonal weather predictions with statistical and novel ML models & algorithms. An experimental LSTM model trained on 200GB of historical measurements outperformed regression, reducing seasonal temperature prediction bias by 10%.
R  Tensorflow  Slurm  iTerm  VSCode
Atmospheric Remote Sensing Downscaler
Developer of static downscaler, grounding air quality predictions from a remote sensing chemical transport model and monitoring stations data yielding spatial predictions of air pollutant levels at a localized scale.
SQL  PHP  JS  Python  Docker
Zeitgeist Analysis
Co-lead of quantitative analysis of annual student-run cultural Zeitgeist survey. Usedmulti-phase testing strategy to avoid P-hacking on permutation tests over many features. Survey respondents weredetermined to be non-representative of the student population with overwhelming confidence.
R Latex
Essay Doer Bot
Built an LLM application utilizing RAG, OCR, and iterative prompt engineering to write compelling plagiarism-free humanities essays with sentimentally relevant quotations pulled from source material. Used a multi-stageHITL approach to emulate the human creative process with artificial intelligence (reflection → thesis → outline → written sections → revise → compile final product) and minimize token costs. Deployed to Kubernetes for public webapp.
Python  LLMs
Other Noteworthy Projects
Adverse Driving Model
Developed a deep multimodal fusion model entropy model in PyTorch to improve vehicle object detection and classification in adverse weather conditions. Trained and tuned model on compute cluster
PyTorch  Python  HPC
Image Captioning Model
Built generative TensorFlow model for creating captions for images from MSCOCO dataset. Team improved initial template architecture with addition of InceptionV3 (classification) and GloVe Embeddings (word vectors)
TensorFlow  Jupyter
Personal Website V2
Built a second iteration of a personal website from scratch in Webflow heavily inspired by Brittany Chang's personal site. Focused on minimalist design and ease of use
CSS  JS  Webflow  Design
UFS2 Filesystem Image Parser
Implemented fs-find and fs-cat functions on raw image data for UFS2 partitions in the FreeBSD operating system. Learned granular mechanisms for data compression and pipelines
FreeBSD  UFS2  C
Personal Website V1
Built and hosted a  personal websites to display resume and project portfolio and develop webdev and design skillsets. Since then, I think my skills have improved significantly!
CSS  JS  Design
Dissipated Eight Webmaster
Managed Dissipated Eight's (Middlebury's oldest male accapella group) wordpress site.
Wordpress
Course Projects
Course projects from Fall 2018 to Spring 2022. Includes Pacman in Python and Tetris in Java
Python Java C
Caching Simulations
Simulated hit/miss rate of popular caching methods in Arch Linux.
Arch C
04. 
Art
Click image for more info.