Senior ML Infrastructure Engineer

About the role

Orion Innovation · Onsite

Orion Innovation is a premier, award-winning, global business and technology services firm. Orion delivers game-changing business transformation and product development rooted in digital strategy, experience design, and engineering, with a unique combination of agility, scale, and maturity. We work with a wide range of clients across many industries including financial services, professional services, telecommunications and media, consumer products, automotive, industrial automation, professional sports and entertainment, life sciences, ecommerce, and education.

Project Overview:

We're building a large-scale document intelligence platform that processes text files up to 5 TB in size, extracts insights using BERT-class NLP models, and surfaces answers to analysts via a low-latency query interface. The platform runs on Azure Kubernetes Service (AKS) with dedicated GPU node pools, uses KEDA for event-driven autoscaling, and integrates with Azure Data Lake Storage Gen2 and Azure OpenAI.

This is a hands-on role at the intersection of platform engineering and applied ML. It requires someone who is equally comfortable debugging a CUDA out-of-memory error and designing a Kubernetes autoscaling policy. As the Senior ML Infrastructure Engineer, you will own the end-to-end infrastructure layer, from GPU cluster configuration and CUDA runtime management to Kubernetes job orchestration and model serving.

 

Skill / Technology (Level):

  • Kubernetes / AKS (Expert): Multi-node-pool design, taint/toleration, autoscaler
  • GPU node pools (NC/ND series) (Senior): Device plugin, driver compat, resource limits
  • KEDA (Senior): Scaled Job, queue triggers, cooldown tuning
  • CUDA / cuDNN (Mid–Senior): Runtime config via PyTorch; raw kernel dev not required
  • PyTorch (GPU inference) (Senior): Batching, FP16, memory management, profiling
  • Hugging Face Transformers (Senior): BERT/DistilBERT/BGE loading, pipeline API, tokenization
  • Python (production) (Senior): Async workers, Azure SDK, queue consumers
  • Azure infrastructure (Senior): VNet, private endpoints, Key Vault, ADLS, AD
  • Docker / Helm (Senior): Multi-stage builds, Helm chart authoring
  • IaC (Terraform / Bicep) (Preferred): willingness to learn is acceptable

Orion is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, citizenship status, disability status, genetic information, protected veteran status, or any other characteristic protected by law.

Candidate Privacy Policy

Orion Systems Integrators, LLC and its subsidiaries and its affiliates (collectively, “Orion,” “we” or “us”) are committed to protecting your privacy. This Candidate Privacy Policy (orioninc.com) (“Notice”) explains:

  • What information we collect during our application and recruitment process and why we collect it;
  • How we handle that information; and
  • How to access and update that information.

Your use of Orion services is governed by any applicable terms in this notice and our general Privacy Policy.

 
