Member of Technical Staff, Inference (Paris, London)
remote
About this role
WHAT YOU’LL DO
- Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics
- Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and efficient resource utilization
- Implement efficient low-level code (CUDA, Triton, custom kernels) and integrate it seamlessly into high-level frameworks
- Optimize workloads for both throughput (batching, scheduling, quantization) and latency (caching, memory management, graph compilation)
- Develop monitoring and debugging tools to guarantee reliability, determinism, and rapid diagnosis of regressions across both stacks
WHAT YOU’LL BRING…
Skills in this role
- Python
- Go
More at Genesis
- Member of Technical Staff, ML Compiler and Systems (Paris, London)
- Member of Technical Staff, Data (Bay Area, Remote)
- Member of Technical Staff, Robot Learning (Paris, London)
- Member of Technical Staff, Foundations (Paris, London)
- Member of Technical Staff, Mechanical Engineer (Bay Area)
- Member of Technical Staff, Rendering (Bay Area, Remote)
