Jake Ogle | ML Systems Engineer

Engineering + Research

Building production-grade ML platforms, from distributed inference to data pipelines, with an emphasis on performance, reliability, and clear operational visibility.

Focus areas

ML systems engineering
Distributed inference on HPC/GPU clusters
Large-scale data and annotation pipelines
Performance-critical GPU workloads
Medical imaging ML infrastructure

Currently

Machine Learning Engineer @ Michigan Medicine in the Machine Learning in Neurosurgery Lab

Building scalable data and experimentation platforms on Slurm-managed HPC environments, including GPU-backed inference services, evaluation pipelines, and medical imaging workflows with traceable artifacts.

Research engineering

I build research-grade systems with production discipline: reproducibility, scalable pipelines, and infrastructure that survives real-world constraints.

Operationalize prototypes into repeatable pipelines (data, inference, evaluation).
Design failure-tolerant workflows for Slurm-managed HPC and GPU workloads.
Prioritize traceability: schemas, versioning, and artifact lineage.

Toolkit

Languages

Python, C++, Rust, Bash

Systems

Slurm, containers, GPU compute

Data

Pipelines, lineage, medical imaging ETL

Ops

Observability, reproducibility, reliability

Education

Academic background and relevant study.

University of Texas at Austin

MS Computer Science

Focus: ML systems · In progress (started 2026)

Oregon State University

Post-Baccalaureate Studies in Computer Science

University of Michigan - Ann Arbor

BS Chemistry

Experience

Roles spanning ML systems, backend data platforms, DevOps automation, and clinical research engineering.

Machine Learning Engineer

Michigan Medicine – Machine Learning in Neurosurgery (MLiNS) Lab · Dec 2025 – Present

Develop and evaluate computer vision and large language models on large-scale biomedical datasets, including medical imaging and clinical text such as radiology reports. Lead the design, curation, and long-term stewardship of multimodal datasets spanning imaging, pathology, and report-level data. Build preprocessing pipelines for clinical imaging and text normalization, and maintain SQL-backed databases integrating heterogeneous clinical sources. Perform large-scale data processing and model training on Slurm-managed HPC clusters, with an emphasis on data quality assurance, versioning, and reproducible experimentation. Contribute to dissemination of research through peer-reviewed publications, conference proceedings, and academic presentations.

Software Engineer – Backend Data Systems

Magna International · Jan 2023 – Nov 2025

Designed and deployed production data and ML systems across real-time computer vision, LLM-backed retrieval, and telemetry pipelines. Built OpenCV-based vision pipelines for automated manufacturing monitoring; developed RAG systems using embeddings, vector databases, and LLM orchestration to reduce data access latency from minutes to seconds. Implemented centralized Python REST APIs integrating SQL Server and Azure SQL, and engineered scalable ingestion and validation pipelines with Apache Airflow. Contributed to hybrid cloud/on-prem architecture using Azure data and ML services, with a focus on observability, reliability, and long-term platform scalability.

Software Engineering Intern – DevOps & Automation

Bosch North America · May 2021 – Dec 2022

Built automated build, test, and deployment pipelines using Python, Jenkins, and Groovy to support reproducible software delivery. Developed internal tooling for code signing and automated data collection, improving validation and traceability of software artifacts. Contributed to MISRA-compliant embedded C codebases and collaborated cross-functionally to ensure data integrity in production systems. Actively supported internal Python education initiatives to expand data processing capabilities across engineering teams.

Software Product Development Intern – Vehicle Systems

Ford Motor Company · May 2022 – July 2022

Performed large-scale analytics on vehicle diagnostic datasets using SQL and cloud-based data platforms, identifying reliability and performance patterns in networked vehicle systems. Supported feature engineering and data preprocessing for ML workflows, enabling improved predictive diagnostics and system-level insight.

Clinical Data Manager / Research Engineer

Michigan Medicine – Bielas & Martin Labs · June 2019 – May 2021

Engineered and managed clinical genomics data pipelines supporting rare disease research, including whole genome and whole exome sequencing (WGS/WES) projects under the Gabriella Miller Kids First initiative. Built and maintained research databases and patient tracking systems to support multi-institutional collaboration. Applied Python, R, and SQL for genomic data cleaning, normalization, and analysis, enabling reproducible downstream research workflows. Worked directly with sequencing data, metadata, and clinical cohorts, bridging clinical research requirements with scalable, computation-driven data management.

Overview