Companies Specter Site Reliability Engineer

About the role

Specter · Onsite

Company Background
Specter's mission is to help automate the physical world.

Today, we build video sensors with state-of-the-art AI agents that answer any question, anywhere in their environments. Our systems can automatically detect and reason about any physical activity captured on camera, from security incidents (e.g. perimeter intrusion, theft, LPR), to safety monitoring (e.g. PPE detection, injured people), to operational efficiency (e.g. material tracking, congestion monitoring). We offer both long range wireless (1km range) and wired sensor variants to suit any deployment.

Our co-founders Xerxes and Philip are passionate about empowering our partners in the fast approaching world of physical AI and robotics. We are a small, fast growing team who hail from Anduril, Tesla, Uber, and the U.S. Special Forces.


The Role
We're hiring a Site Reliability Engineer to own the operational health of our connected sensor platform — spanning a live fleet of edge hardware deployed at customer sites and the cloud infrastructure behind it.

This is a high-ownership role at the intersection of ops and platform engineering. You'll drive reliability across our sensor fleet — triaging issues in the field, building the systems that prevent them from recurring, and owning the observability that keeps us ahead of problems as we scale.

You set your own priorities across all three:
Responsibilities:
Reactive — Triage & Recovery

  • Debug and triage issues across a live fleet of diverse Linux-based sensor nodes and edge appliances deployed at customer sites.

  • SSH into field hardware to diagnose, patch, and recover systems — often with limited remote access and incomplete information.

  • Own site bring-ups end to end; be the person who gets things back online.

Systems Builder — Close the Loop

  • Build and maintain fleet management systems: OTA update pipelines, device health tracking, remote diagnostics, and lifecycle tooling.

  • Identify repeat fires and eliminate them — build tooling, pre-deployment checks, and root cause processes that prevent recurrence.

  • Automate toil relentlessly: if you're doing something twice, you should be scripting it.

  • Collaborate with embedded systems, and platform teams to define reliability and deployment requirements.

Observability Owner — Fleet Visibility

  • Design and implement observability (logging, metrics, alerting) across edge devices and cloud infrastructure (AWS).

  • Surface and close telemetry gaps; build fleet-wide visibility that enables data-driven reliability decisions.

  • Develop runbooks, incident response procedures, and participate in on-call rotations.

Qualifications:

  • Strong Linux systems administration — comfortable working over SSH in production, not just dev environments.

  • Experience with edge or on-prem hardware alongside cloud infrastructure.

  • Solid networking fundamentals: DNS, firewalls, VPNs, subnets, secure remote access.

  • Scripting or programming in Python, Go, or Bash for operational tooling.

  • Familiarity with containerization (Docker, Kubernetes a plus).

  • Embedded systems experience — reading firmware logs, understanding hardware-software boundaries, and reasoning about what's happening below the OS is a meaningful edge in this role.

  • Deeper cloud experience (AWS infrastructure, IAM, networking, observability tooling) is a strong plus for owning the cloud side of the fleet.

  • Rust or C experience — we have firmware in both; being able to read and reason about low-level code accelerates triage significantly.

Ready to apply to Specter?
Apply to Specter

Similar jobs

AI
Senior Site Reliability Engineer - Hiring Sprint
Airbyte
⚡ Apply early San Francisco Hybrid $196,000–$255,000
● New 👁 Seen ✓ Applied 2h ago
Okta
Staff Site Reliability Engineer - Observability
Okta
⚡ Apply early Bellevue, Washington; Chicago,... Onsite $194,000–$267,000
● New 👁 Seen ✓ Applied 6h ago
Okta
Staff Site Reliability Engineer - Observability GCP
Okta
⚡ Apply early Bellevue, Washington; Chicago,... Onsite $194,000–$267,000
● New 👁 Seen ✓ Applied 6h ago
Baseten
Manager, Cloud Platform & Site Reliability
Baseten
⚡ Apply early San Francisco Hybrid $165,000–$330,000
● New 👁 Seen ✓ Applied 9h ago
Okta
Manager, Site Reliability Engineering
Okta
⚡ Apply early San Francisco, California Onsite $204,000–$306,000
● New 👁 Seen ✓ Applied 16h ago
BT
Technology, DevOps/Site Reliability Engineer
BTIG
⚡ Apply early New York, San Francisco Onsite $160,000–$200,000
● New 👁 Seen ✓ Applied 1d ago
Anthropic
Staff Software Engineer, AI Reliability
Anthropic
⚡ Apply early San Francisco, CA | New York C... Onsite $325,000–$485,000
● New 👁 Seen ✓ Applied 1d ago
AM
Hardware Reliability Engineer
Amperesand
⚡ Apply early Reno, Nevada, United States; S... Onsite $115,000–$160,000
● New 👁 Seen ✓ Applied 1d ago
MongoDB
Site Reliability Engineer (Senior or Staff), Infrastructure Security
MongoDB
⚡ Apply early Austin; New York City; San Fra... Onsite $127,000–$249,000
● New 👁 Seen ✓ Applied 1d ago

Sign up for suggestions tailored to the jobs you open and the searches you save.

Apply now
🤖

Whoa — hold up

JobsRadar was built for real people having a rough time in their job search — not for automated requests. You're clicking way too fast and you're now temporarily blocked.

Come back later. If you're genuinely job hunting, we've got your back — just act like a human.

Catch your next role the second it’s posted.

Create a free account and we’ll watch the boards for you — the instant a job matches your search, it lands in your inbox or Telegram. No digging, no refreshing.

Create free account

Free forever · takes 30 seconds · already have one?

Get the worldwide-remote edge.

Join our Telegram channel for the stuff that helps you land the role — salary benchmarks, the weekly market pulse, and new-feature drops. No spam, just signal.

Join the channel — it's free