About this Machine Learning Engineer - Reinforcement Learning role at pony.ai

pony.ai · Fremont, California, United States

Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous mobility technologies and services across a rapidly expanding footprint of sites around the world. Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony.ai is an industry leader in the commercialization of autonomous driving and is committed to developing the safest autonomous driving capabilities on a global scale. Pony.ai’s leading position has been recognized, with CNBC ranking Pony.ai #10 on its Disruptor list of the 50 most innovative and disruptive tech companies of 2022. In June 2023, Pony.ai was recognized on the XPRIZE and Bessemer Venture Partners inaugural “XB100” 2023 list of the world’s top 100 private deep tech companies, ranking #12 globally. As of August 2023, Pony.ai had accumulated nearly 21 million miles of autonomous driving globally. Pony.ai went public on NASDAQ in November 2024.

Responsibilities

  • Build scalable systems for training and fine-tuning large generative models that produce realistic, informative driving behaviors for evaluation and scenario coverage.
  • Implement and iterate on RL-style methods: algorithms, reward / preference objectives, and training setups suited to high-fidelity, insightful behaviors in simulation-aligned workflows (closed-loop evaluation mindset).
  • Ship deep learning solutions (including LLM / VLM where appropriate) that improve human-led triaging, automate high-volume workflows, and support nuanced analysis of self-driving behavior to surface critical anomalies.
  • Own production-oriented ML for fleet-scale assessment: training, optimization, monitoring, and iteration of models used to judge performance across large real-world exposure.
  • Design and evolve data + evaluation systems inspired by RL from human preferences (RLHF) and related paradigms—turning preference/judgment signals into repeatable, scalable training and evaluation loops.
  • Partner broadly with teams such as Prediction, Planning, Research, and platform/engineering leads to land cross-cutting improvements with clear metrics.

Requirements

  • M.S. or Ph.D. in Computer Science, Machine Learning, AI, or a related field—or equivalent practical experience.
  • Hands-on experience building and applying ML in production-grade settings, with a strong RL component (policy learning, preference/feedback optimization, or offline/online RL pipelines).
  • Depth in deep learning, sequence modeling, and generative models.
  • Demonstrated impact via strong publications or a clear history of shipping impactful ML systems end-to-end.
  • Experience with large-scale distributed training and large-scale data processing.
  • Ability to lead ambiguous technical work from problem framing through reliable delivery.


Preferred

  • Background in autonomous vehicles, robotics, or complex simulation environments.
  • Strong grasp of modern RL and post-training techniques for LLMs, dLLMs, VLAs, and video generation.
  • Hands-on integration of simulation platforms with ML training and evaluation workflows.
  • Fluency in Python and frameworks such as PyTorch.
  • Experience defining and operating metrics for complex, safety-critical AI systems.
  • Technical leadership: influencing stakeholders, aligning teams, and raising the bar for evaluation rigor.
  • Excellent communication—simple explanations of complex trade-offs.


Compensation and Benefits

Base Salary Range: $150,000 - $250,000 Annually

Compensation may vary outside of this range depending on many factors, including the candidate’s qualifications, skills, competencies, experience, and location. Base pay is one part of the Total Compensation and this role may be eligible for bonuses/incentives and restricted stock units.

We also provide the following benefits to eligible employees:

  • Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (Traditional and Roth 401k)
  • Life Insurance (Basic, Voluntary & AD&D)
  • Paid Time Off (Vacation & Public Holidays)
  • Family Leave (Maternity, Paternity)
  • Short Term & Long Term Disability
  • Free Food & Snacks


How this ML Engineer salary compares

This role pays $200,000/yr, in line with the typical range for ML Engineer roles.

Comparison scale: $142,400 · median $205,000 · $300,000

Typical range $173,125–$250,000/yr, from 737 comparable ML Engineer listings on JobsRadar (pay annualized to USD).

About pony.ai

PONY.AI

Our mission is to revolutionize the future of transportation by building the safest and most reliable technology for autonomous vehicles. Armed with the latest breakthroughs in artificial intelligence, we aim to deliver our technology at a global scale. We believe our work has the potential to transform lives and industries for the better.

CULTURE

When it comes to our technology, quality and reliability are hallmark attributes; we don’t believe in taking shortcuts. Our emphasis on craftsmanship enables us to deliver an autonomous driving solution that is highly sophisticated and best-in-class.

When it comes to our people, teamwork, robust mentorship, and collaboration are several key pillars of our culture. We ensure every member of our team receives the support they need while tackling some of the biggest tech challenges that exist today. Here, our employees grow with the company. We truly believe that growing a successful company means growing a successful team.

A GLOBAL PERSPECTIVE

We are deeply passionate about reaching a global audience, starting with our two home countries: China and the United States. With offices and development teams in Silicon Valley, Beijing, and Guangzhou, we are well on our way towards achieving that goal.
