Pick a job to read the details
Tap any role on the left — its description and apply link will open here.
About the Company
About the role
We are looking for a Senior Software Engineer to join our growing engineering team. In this role, you will design and build distributed backend systems, lead critical API design decisions, and drive the deployment and operationalization of machine learning models in production. You will work at the intersection of platform infrastructure and application development, making a direct impact on product quality and engineering standards.
What you will do
What we are looking for
Nice to have
For India-based candidates, we offer a competitive base salary along with equity options, providing an opportunity to share in the success and growth of Armada.
You're a Great Fit if You're
Equal Opportunity Statement
At Armada, we are committed to fostering a work environment where everyone is given equal opportunities to thrive. As an equal opportunity employer, we strictly prohibit discrimination or harassment based on race, color, gender, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other characteristic protected by law. This policy applies to all employment decisions, including hiring, promotions, and compensation. Our hiring is guided by qualifications, merit, and the business needs at the time.
Unsolicited Resumes and Candidates
Armada does not accept unsolicited resumes or candidate submissions from external agencies or recruiters. All candidates must apply directly through our careers page. Any resumes submitted by agencies without a prior signed agreement will be considered unsolicited and Armada will not be obligated to pay any fees.
Ready to apply?
Apply to Armada
About the Company
We are seeking a highly experienced Lead Software Engineer / Lead AI Platform Engineer to architect and lead the development of our GPU-as-a-Service (GPUaaS) platform. In this role, you will define the core abstractions that transform complex GPU fabrics, storage systems, and networking into a seamless, self-service experience for researchers and engineers.
You will operate at the intersection of distributed systems, Kubernetes internals, and GPU infrastructure, setting the technical direction of the platform while mentoring engineers and driving cross-functional collaboration. This role is ideal for leaders who enjoy hands-on architecture, deep technical ownership, and building infrastructure at massive scale.
Lead the design of a globally scalable AI control plane for GPU, storage, and network orchestration.
Define architectural patterns for custom Kubernetes operators managing complex AI training and inference workloads.
Own the long-term scalability, extensibility, and evolution of the GPUaaS platform.
Architect hard isolation strategies across kernel, hypervisor, and hardware layers (IOMMU, SR-IOV, device isolation).
Design secure multi-tenant execution models aligned with zero-trust networking principles.
Ensure strong isolation without compromising performance in a shared environment.
Drive integration strategies for VAST, Weka, and DDN storage platforms.
Collaborate with hardware and networking vendors to optimize RDMA, GPUDirect, and RoCE v2 traffic patterns.
Design and evolve VXLAN and BGP-EVPN–based networking architectures.
Design, develop, and maintain custom Kubernetes operators for GPU, storage, and infrastructure automation.
Implement CRDs, reconciliation logic, and lifecycle management for AI workloads.
Guide implementation patterns while remaining hands-on with critical platform components.
Define platform SLOs, capacity planning models, and GPU availability targets.
Establish benchmarking standards including MLPerf and custom training/inference stress tests.
Lead post-incident reviews, root-cause analysis, and performance optimization initiatives.
Set engineering standards through design reviews, architecture documentation, and technical RFCs.
Mentor and grow L3/L4 engineers into strong platform owners.
Influence and collaborate across infrastructure, security, and product teams.
10–15 years of experience in software, platform, or infrastructure engineering roles.
Demonstrated expertise designing and operating production-grade Kubernetes operators using Go (Kubebuilder / Operator SDK).
Deep understanding of Kubernetes internals, including etcd performance, API machinery, CRDs, controllers, and scheduling.
Proven experience building secure, multi-tenant platforms with strong isolation and zero-trust networking.
Strong hands-on knowledge of high-performance storage and networking, including POSIX semantics, CSI drivers, and InfiniBand / RoCE v2.
Experience designing infrastructure automation workflows using tools such as Ansible, Terraform, or equivalent.
Hands-on experience with observability and monitoring tools such as Prometheus, OpenTelemetry (OTEL), Grafana, Splunk, or similar.
Strong proficiency in Go and Python.
Excellent leadership, communication, and cross-functional collaboration skills.
Experience with AI serving frameworks such as vLLM, Ray Serve, Triton Inference Server, or similar.
Familiarity with virtualization and lower-layer systems including VMware vSphere, OpenStack, KVM, or bare-metal provisioning.
Experience with GPU infrastructure, including NVIDIA DGX/HGX systems, GPU Operator, DCGM, Nsight, or performance profiling tools.
Exposure to distributed training systems such as PyTorch DDP, DeepSpeed, or large-scale training frameworks.
For India-based candidates, we offer a competitive base salary along with equity options, providing an opportunity to share in the success and growth of Armada.
You're a Great Fit if You're
Equal Opportunity Statement
At Armada, we are committed to fostering a work environment where everyone is given equal opportunities to thrive. As an equal opportunity employer, we strictly prohibit discrimination or harassment based on race, color, gender, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other characteristic protected by law. This policy applies to all employment decisions, including hiring, promotions, and compensation. Our hiring is guided by qualifications, merit, and the business needs at the time.
Unsolicited Resumes and Candidates
Armada does not accept unsolicited resumes or candidate submissions from external agencies or recruiters. All candidates must apply directly through our careers page. Any resumes submitted by agencies without a prior signed agreement will be considered unsolicited and Armada will not be obligated to pay any fees.
Ready to apply?
Apply to Armada
About the Company
About the role
We are seeking an experienced Product Manager to lead our GPU-as-a-Service product development initiatives. The ideal candidate will have 5+ years of experience, with a strong background in developing infrastructure provisioning/orchestration, virtualisation, and AI applications.
Location: Bangalore, India
What You'll Do (Key Responsibilities)
Required Qualifications
Preferred Qualifications
You're a Great Fit if You're
Equal Opportunity Statement
At Armada, we are committed to fostering a work environment where everyone is given equal opportunities to thrive. As an equal opportunity employer, we strictly prohibit discrimination or harassment based on race, color, gender, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other characteristic protected by law. This policy applies to all employment decisions, including hiring, promotions, and compensation. Our hiring is guided by qualifications, merit, and the business needs at the time.
Unsolicited Resumes and Candidates
Armada does not accept unsolicited resumes or candidate submissions from external agencies or recruiters. All candidates must apply directly through our careers page. Any resumes submitted by agencies without a prior signed agreement will be considered unsolicited and Armada will not be obligated to pay any fees.
Ready to apply?
Apply to Armada
Cookies & analytics
This site uses cookies from third-party services to deliver its features and to analyze traffic.