Applied Research Scientist
mid
via Ashby
About this role
About Us
At Sully.ai http://Sully.ai, We’re Building the Most Impactful Healthcare Company on Earth
We believe that access to a great doctor is a basic human right. Today, that’s not a reality. Delays, misdiagnoses, administrative chaos, and burnout plague the system.
Our Mission
One Human, One Doctor.
We enable our customers to staff 30% of their workforce with AI by creating a shared agent architecture for scale and efficiency. All powered by our own patented, world-class models and deployed in real-world care.
KEY RESPONSIBILITIES
- Build and scale automated evaluation pipelines (LLM-as-judge + human review) with clinical-grade benchmarks.
HARD REQUIREMENTS
- Proven experience designing agentic processes and LLM evaluation/benchmarking frameworks.…
What we'd score you on
reqspace match rubricFive dimensions, recruiter-grade. Upload your resume and we'll generate a written explanation of where you fit and where the gaps are.
1
Skills match
For this role: python, tensorflow, llamaindex
2
Level fit
This role is mid-level. We check your trajectory against it.
3
Domain experience
Your work in the role's domain matters more than your years total. We weight recent and direct experience.
4
Recency
A skill you used last quarter weighs more than one from five years ago. We grade on recency, not lifetime.
5
Location fit
This role is based in a specific location. We weight your proximity and willingness to relocate.
Score yourself on this role.
Free · no card · written explanation included
Skills in this role
Pulled from the job description. These are the keywords we'll weight when scoring your fit.
pythontensorflowllamaindex
