Member of Technical Staff, Inference (Paris, London)

remote

via Ashby

See if I'm a fit →Tailor my resume for this role →Apply on Ashby ↗

About this role

WHAT YOU’LL DO - Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics - Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and efficient resource utilization - Implement efficient low-level code (CUDA, Triton, custom kernels) and integrate it seamlessly into high-level frameworks - Optimize workloads for both throughput (batching, scheduling, quantization) and latency (caching, memory management, graph compilation) - Develop monitoring and debugging tools to guarantee reliability, determinism, and rapid diagnosis of regressions across both stacks WHAT YOU’LL BRING…

Read the full description on Genesis's site →

What we'd score you on

reqspace match rubric

Five dimensions, recruiter-grade. Upload your resume and we'll generate a written explanation of where you fit and where the gaps are.

Skills match

For this role: python, go

Level fit

We check your title trajectory against the seniority signal of the role.

Domain experience

Your work in the role's domain matters more than your years total. We weight recent and direct experience.

Recency

A skill you used last quarter weighs more than one from five years ago. We grade on recency, not lifetime.

Location fit

This role is remote-eligible — we factor in your stated location and time-zone overlap.

Score yourself on this role.

Free · no card · written explanation included

See if I'm a fit →

Skills in this role

Pulled from the job description. These are the keywords we'll weight when scoring your fit.

pythongo

More at Genesis

See all open jobs at Genesis →