About the role
THE POSITION
Our roster has an opening with your name on it
As a Staff Infrastructure Engineer, you'll own the design, operation, and maturity of our Kubernetes platforms and service mesh ecosystem. We run 100+ Kubernetes clusters across AWS Cloud and Outposts. Kong Mesh sits at the heart of how our services talk to each other — securing traffic, managing policies, handling retries and timeouts. You'll bring expertise on Kubernetes and mesh and help us operate these platforms with confidence and discipline.
This role sits in a key spot. You'll work directly with the Infrastructure Engineering Director to set standards for how we run Kubernetes and manage service-to-service communication. You'll mentor engineers on what's actually happening inside the cluster and the mesh, not just how to use them. You'll lead incident response when things break, build automation to reduce toil, and help teams adopt patterns that work reliably at our scale.
You're a hands-on builder who understands distributed systems at depth. You care about how things work underneath, why failures happen, and how to prevent them. You write code and build tooling. You jump into incidents. You mentor others. By doing this work well, you'll directly improve reliability, performance, and how confident our teams feel running services on our infrastructure.
In addition to the specific responsibilities outlined above, employees may be required to perform other such duties as assigned by the Company. This ensures operational flexibility and allows the Company to meet evolving business needs.
THE GAME PLAN
Everyone on our team has a part to play
-
- Design and operate our Kubernetes platform across EKS clusters. Set standards for cluster configuration, workload isolation, resource management, and cost optimization. Build the runbooks and automation that make operations predictable.
- Own the Kong Mesh ecosystem. Mature it from a deployment into a production-grade platform with clear patterns for service-to-service security, traffic management, and observability.
- Define and implement service-to-service communication patterns. Work with teams on zero-trust networking, certificate lifecycle, mTLS policies, and how to debug when things go wrong.
- Build infrastructure-as-code, automation, and tooling that reduce toil and let teams operate reliably.
- Define what good looks like for our Infrastructure platform. Set SLOs, monitor reliability, and hold the line on quality.
- Lead incident response when Kubernetes or mesh issues impact services. Understand root cause, fix it, and make sure it doesn't happen the same way twice.
- Mentor engineers on Kubernetes internals, mesh design, and distributed systems thinking. Raise the bar on technical understanding across the team.
- Evaluate infrastructure tools and platforms. Understand what to build, what to buy, and what to integrate.
- Work with platform and product teams to understand what they need from Kubernetes and mesh. Feed that into our roadmap and direction.
- Contribute to broader infrastructure initiatives on observability, cost optimization, and resilience. Own technical scope on assigned work.
A Sneak Peek Into Our Tech Stack
AWS , Kubernetes (EKS, 100+ clusters), Kong Mesh, Terraform, Helm, Datadog.
THE STATS
What we're looking for in our next teammate
-
- 7+ years working with platform infrastructure, SRE, cloud infrastructure, or related work. You've built and operated large systems and shipped real things.
- Deep hands-on experience with Kubernetes. You understand cluster architecture, multi-cluster operations, scheduling, networking, security, and how to debug when things break.
- Strong expertise with modern service mesh platforms. Experience with Kong Mesh, Istio, Linkerd, or Envoy-based systems. Comfortable with mTLS, traffic policies, and zero-trust networking.
- Working knowledge of AWS. You understand VPCs, networking, security, and how to operate in hybrid environments including Outposts.
- You understand distributed systems. You know the tradeoffs involved in service-to-service communication at scale. You think about failure modes.
- You've defined and tracked SLOs/SLIs for infrastructure services. You use metrics that matter to users and the business, not just technical metrics.
- You code in at least one modern language. You build tools and automation. You're comfortable both writing code and operating systems.
- You've driven operational improvements through automation. You see toil and build solutions that scale. You prevent the same failure from happening twice.
- You mentor and influence other engineers. You raise the technical bar. You help people understand why things work the way they do.
- You communicate well. You can explain technical constraints to non-technical people. You influence technical direction.
- You care about doing things right. You own problems end-to-end. You push for continuous improvement.
Bonus
- You've used Envoy in production.
- You've operated AWS Outposts or hybrid cloud infrastructure.
- You hold CNCF Kubernetes certifications (CKA, CKS, CKAD).
- You've worked in regulated industries where network segmentation, auditability, strict controls, and uptime matter.
- You're familiar with Datadog or other observability platforms. You've used eBPF-based networking tools like Cilium.
Don’t check all the boxes? That’s okay! We encourage you to still apply if you feel like you possess an adjacent skill set and are interested in learning more about this position.
ABOUT FANDUEL
FanDuel Group is the premier mobile gaming company in the United States and Canada. FanDuel Group consists of a portfolio of leading brands across mobile wagering including: America’s #1 Sportsbook, FanDuel Sportsbook; its leading iGaming platform, FanDuel Casino; the industry’s unquestioned leader in horse racing and advance-deposit wagering, FanDuel Racing; and its daily fantasy sports product.
In addition, FanDuel Group operates FanDuel TV, its broadly distributed linear cable television network and FanDuel TV+, its leading direct-to-consumer OTT platform. FanDuel Group has a presence across all 50 states, Canada, and Puerto Rico.
The company is based in New York with US offices in Los Angeles, Atlanta, and Jersey City, as well as global offices in Canada and Scotland. The company’s affiliates have offices worldwide, including in Ireland, Portugal, Romania, and Australia.
FanDuel Group is a subsidiary of Flutter Entertainment, the world's largest sports betting and gaming operator with a portfolio of globally recognized brands and traded on the New York Stock Exchange (NYSE: FLUT).
PLAYER BENEFITS
We treat our team right
We offer amazing benefits above and beyond the basics. We have an array of health plans to choose from (some as low as $0 per paycheck) that include programs for fertility and family planning, mental health support, and fitness benefits. We offer generous paid time off (PTO & sick leave), annual bonus and long-term incentive opportunities (based on performance), 401k with up to a 5% match, commuter benefits, pet insurance, and more - check out all our benefits here: FanDuel Total Rewards. *Benefits differ across location, role, and level.
FanDuel is an equal opportunities employer and we believe, as one of our principles states, “We are One Team!”. As such, we are committed to equal employment opportunity regardless of race, color, ethnicity, ancestry, religion, creed, sex, national origin, sexual orientation, age, citizenship status, marital status, disability, gender identity, gender expression, veteran status, or any other characteristic protected by state, local or federal law. We believe FanDuel is strongest and best able to compete if all employees feel valued, respected, and included.
FanDuel is committed to providing reasonable accommodations for qualified individuals with disabilities. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please email Benefits@fanduel.com.
The applicable salary range for this position is $159,000 - $208,950 USD, which is dependent on a variety of factors including relevant experience, location, business needs and market demand. This role may offer the following benefits: medical, vision, and dental insurance; life insurance; disability insurance; a 401(k) matching program; among other employee benefits. This role may also be eligible for short-term or long-term incentive compensation, including, but not limited to, cash bonuses and stock program participation. This role includes paid personal time off and 14 paid company holidays. FanDuel offers paid sick time in accordance with all applicable state and federal laws.
It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.
#LI-Hybrid