AI Researcher — Inference Optimization

via Ashby

About this role

ROLE OVERVIEW We are seeking an AI Researcher with deep experience in inference optimization to design, evaluate, and deploy high-performance inference systems for large-scale machine learning models. You will work at the intersection of model architecture, systems engineering, and hardware-aware optimization, improving latency, throughput, and cost efficiency across real-world production environments. KEY RESPONSIBILITIES - Research and develop techniques to optimize inference performance for large neural networks. - Improve latency, throughput, memory efficiency, and cost per inference. - Design and evaluate model-level optimizations (quantization, pruning, KV-cache optimization, architecture-aware simplifications).…

Read the full description on Featherlessai's site →

What we'd score you on

reqspace match rubric

Five dimensions, recruiter-grade. Upload your resume and we'll generate a written explanation of where you fit and where the gaps are.

1

Skills match

For this role: python, pytorch, teams

2

Level fit

We check your title trajectory against the seniority signal of the role.

3

Domain experience

Your work in the role's domain matters more than your years total. We weight recent and direct experience.

4

Recency

A skill you used last quarter weighs more than one from five years ago. We grade on recency, not lifetime.

5

Location fit

This role is based in a specific location. We weight your proximity and willingness to relocate.

Score yourself on this role.
Free · no card · written explanation included
See if I'm a fit →

Skills in this role

Pulled from the job description. These are the keywords we'll weight when scoring your fit.

pythonpytorchteams

More at Featherlessai

See all open jobs at Featherlessai