Senior AI Inference Engineer - Model Optimization & Deployment
Foster Cityonsitesenior
Posted 1mo ago · via Lever
About this role
The Perception team is pioneering the development of a multi-modality foundation model to drive the next generation of autonomous system intelligence.
As a Model Optimization & Deployment Engineer, you will focus on bringing highly efficient, production-ready large-scale models to our on-vehicle stack. We are looking for experts with hands-on experience in compressing, accelerating, and deploying complex models (LLMs, VLMs, or FMs) for power- and thermal-constrained vehicle SOCs. You will optimize the ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.
What we'd score you on
reqspace match rubricFive dimensions, recruiter-grade. Upload your resume and we'll generate a written explanation of where you fit and where the gaps are.
1
Skills match
We compare your skills against the role requirements.
2
Level fit
This role is senior-level. We check your trajectory against it.
3
Domain experience
Your work in the role's domain matters more than your years total. We weight recent and direct experience.
4
Recency
A skill you used last quarter weighs more than one from five years ago. We grade on recency, not lifetime.
5
Location fit
This role is based in Foster City. We weight your proximity and willingness to relocate.
Score yourself on this role.
Free · no card · written explanation included
More at Zoox
- View →Software Engineer, ML Performance OptimizationFoster City, CA
- View →Robot Program DirectorFoster City, CA
- View →Payment Operations InternFoster City, CA
- View →Recruiting Coordination LeadFoster City, CA
- View →Senior Operations Manager, Communications & MarketingFoster City, CA
- View →Senior Manager, Go-To-Market StrategyLas Vegas, NV
