AI Researcher — Inference Optimization
via Ashby
About this role
ROLE OVERVIEW
We are seeking an AI Researcher with deep experience in inference optimization to design, evaluate, and deploy high-performance inference systems for large-scale machine learning models. You will work at the intersection of model architecture, systems engineering, and hardware-aware optimization, improving latency, throughput, and cost efficiency across real-world production environments.
KEY RESPONSIBILITIES
- Research and develop techniques to optimize inference performance for large neural networks.
- Improve latency, throughput, memory efficiency, and cost per inference.
- Design and evaluate model-level optimizations (quantization, pruning, KV-cache optimization, architecture-aware simplifications).…
What we'd score you on
reqspace match rubric
Five dimensions, recruiter-grade. Upload your resume and we'll generate a written explanation of where you fit and where the gaps are.
1. Skills match
For this role: python, pytorch, teams
2. Level fit
We check your title trajectory against the seniority signal of the role.
3. Domain experience
Your work in the role's domain matters more than your years total. We weight recent and direct experience.
4. Recency
A skill you used last quarter weighs more than one from five years ago. We grade on recency, not lifetime.
5. Location fit
This role is based in a specific location. We weight your proximity and willingness to relocate.
Skills in this role
Pulled from the job description. These are the keywords we'll weight when scoring your fit.
python, pytorch, teams
