Companies talentpluto RL Environment Software Engineer

About the role

talentpluto · Onsite

Location: San Francisco, CA

Work Model: Hybrid

Industry: Applied AI / AI research data

Compensation: $180K-$220K base, ~$400K+ OTE (uncapped profit share)

About the Company

Our partner is a fast-growing applied AI research lab that builds high-quality reinforcement-learning environments and agents sold to the world's leading AI labs. In under two years they have scaled to a nine-figure revenue run rate and grown their team severalfold in a matter of months, backed by leading venture investors. Quality is their core differentiator, and they are rapidly expanding into new domains.

The Opportunity

As an RL Environment Software Engineer, you will sit at the intersection of research engineering and traditional software engineering, building the environments that simulate real-world workflows and the agents that automate them. This is forward-looking work, you will help research and predict what high-quality environments the frontier will need next, then build them from the ground up.

You will join a brand-new RL team being assembled with exceptional talent, with a clear path to grow alongside it as the function scales into industry pods.

Responsibilities

  • Design and build high-quality RL environments that simulate real working environments end to end.
  • Develop agents for the tasks within those environments and iterate until they are efficient and production-ready.
  • Partner with the research team to scope which environments to build and why, staying ahead of future demand rather than only meeting present needs.
  • Own the backend and infrastructure layers that make environments reliable and scalable.
  • Help set engineering standards for a zero-to-one team as the RL function grows.

Requirements

  • Strong machine-learning engineers who code heavily and build systems from scratch, with strong intuition for reinforcement learning.
  • Proficiency across a modern stack, Node.js and Python on the backend and React/TypeScript on the frontend, with strong Kubernetes and Docker skills.
  • Comfort operating in a fast-paced startup environment with high ownership and long hours.
  • A track record of meaningful tenure and impact at previous companies.
  • Reinforcement-learning experience or an RL research background is a strong plus, though not required.
  • Bachelor's degree in computer science or a related technical field, or equivalent practical experience.
Ready to apply to talentpluto?
Apply to talentpluto

Similar jobs

Cynch AI
Senior Full Stack Software Engineer
Cynch AI
⚡ Apply early San Francisco, California, Uni... Onsite $190,000–$240,000
● New 👁 Seen ✓ Applied 37m ago
Checkr
Staff Software Engineer, Integrations
Checkr
⚡ Apply early Denver, Colorado, United State... Onsite $224,000–$264,000
● New 👁 Seen ✓ Applied 3h ago
Checkr
Staff Software Engineer, Screenings & Verifications
Checkr
⚡ Apply early Denver, Colorado, United State... Onsite $224,000–$264,000
● New 👁 Seen ✓ Applied 3h ago
Postman
Senior Software Engineer, Marketing Engineering
Postman
⚡ Apply early San Francisco, California, Uni... Onsite $180,000–$220,000
● New 👁 Seen ✓ Applied 3h ago
Verana Health
Senior Software Engineer
Verana Health
⚡ Apply early San Francisco, California, Uni... Onsite $149,600–$224,400
● New 👁 Seen ✓ Applied 3h ago
Redwood Materials
Senior Software Engineer - Site Controller, Energy Storage
Redwood Materials
⚡ Apply early San Francisco, California, Uni... Onsite $180,000–$237,500
● New 👁 Seen ✓ Applied 3h ago
Redwood Materials
Software Engineer - ML/Computer Vision (Battery Sorting)
Redwood Materials
⚡ Apply early McCarran, NV; San Francisco, C... Onsite $152,500–$287,500
● New 👁 Seen ✓ Applied 4h ago
NexHealth
Senior Software Engineer, Security
NexHealth
⚡ Apply early San Francisco, California, Uni... Onsite $165,000–$230,000
● New 👁 Seen ✓ Applied 4h ago
NexHealth
Senior Staff Software Engineer
NexHealth
⚡ Apply early San Francisco, California, Uni... Onsite $206,000–$268,000
● New 👁 Seen ✓ Applied 4h ago

Sign up for suggestions tailored to the jobs you open and the searches you save.

Apply now
🤖

Whoa — hold up

JobsRadar was built for real people having a rough time in their job search — not for automated requests. You're clicking way too fast and you're now temporarily blocked.

Come back later. If you're genuinely job hunting, we've got your back — just act like a human.

Catch your next role the second it’s posted.

Create a free account and we’ll watch the boards for you — the instant a job matches your search, it lands in your inbox or Telegram. No digging, no refreshing.

Create free account

Free forever · takes 30 seconds · already have one?

Get the worldwide-remote edge.

Join our Telegram channel for the stuff that helps you land the role — salary benchmarks, the weekly market pulse, and new-feature drops. No spam, just signal.

Join the channel — it's free