Talent.com
Infrastructure Reliability Engineer

Infrastructure Reliability Engineer

AndurilCosta Mesa, California, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century’s most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril’s family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and control center. As the world enters an era of strategic competition, Anduril is committed to bringing cutting-edge autonomy, AI, computer vision, sensor fusion, and networking technology to the military in months, not years.

ABOUT THE TEAM

The Infrastructure Reliability team owns the self-hosted systems that power our engineering organization — including GitHub Enterprise, CircleCI, Artifactory, and others. These services are mission-critical and must be secure, scalable, and highly available.

ABOUT THE JOB

This team is responsible for ensuring developer tools are continuously patched and upgraded, have reliable and tested backups, and can scale to support a rapidly growing company. This is a hybrid role that blends DevOps, SRE, and software engineering — ideal for engineers who enjoy end-to-end ownership and solving complex infrastructure challenges through automation and thoughtful system design.

WHAT YOU'LL DO

  • Own the lifecycle of core self-hosted developer tools (e.g., GitHub Enterprise, CircleCI, Artifactory)
  • Design and implement automated systems for patching, backups (with validation), and upgrades
  • Scale infrastructure to support a fast-growing engineering org
  • Use Infrastructure-as-Code (Terraform, Pulumi, etc.) to manage environments
  • Operate and troubleshoot systems using Docker, Kubernetes, and cloud platforms (AWS, GCP, Azure)
  • Define and maintain SLOs for service availability, reliability, and performance
  • Lead and participate in incident response and root cause analysis
  • Collaborate with platform, security, and software teams to drive operational excellence

REQUIRED QUALIFICATIONS

  • Experience operating production systems using Docker and Kubernetes
  • Proficiency with at least one cloud platform (AWS, GCP, or Azure)
  • Experience managing infrastructure with Infrastructure-as-Code tools (e.g., Terraform)
  • Strong problem-solving skills with a focus on automation
  • Scripting or software development experience (e.g., Python, Go, Bash)
  • Familiarity with CI / CD pipelines and developer tooling
  • Ability to own systems end-to-end, from design to incident resolution
  • Eligible to obtain and maintain an active U.S. Secret security clearance
  • PREFERRED QUALIFICATIONS

  • Prior experience with GitHub Enterprise Server, Artifactory, or CircleCI
  • Experience maintaining highly available, scalable internal tools
  • Exposure to security best practices, compliance requirements, or auditing
  • Experience supporting large engineering teams in a fast-paced environment
  • Background in SRE or hybrid SWE / DevOps roles
  • US Salary Range$124,000—$186,000 USD

    The salary range for this role is an estimate based on a wide range of compensation factors, inclusive of base salary only. Actual salary offer may vary based on (but not limited to) work experience, education and / or training, critical skills, and / or business considerations. Highly competitive equity grants are included in the majority of full time offers; and are considered part of Anduril's total compensation package. Additionally, Anduril offers top-tier benefits for full-time employees, including :

    Platinum Healthcare Benefits : For U.S. roles, we offer comprehensive medical, dental, and vision plans at little to no cost to you. For UK roles, Private Medical Insurance (PMI) : Anduril will cover the full cost of the insurance premium for an employee and dependents.

    For AUS roles, Private health plan through Bupa : Coverage is fully subsidized by Anduril.

  • Basic Life / AD&D and long-term disability insurance 100% covered by Anduril, plus the option to purchase additional life insurance for you and your dependents.
  • Extremely generous company holiday calendar including a holiday hiatus in December, and highly competitive PTO plans.
  • 16 weeks of paid Caregiver & Wellness Leave to care for a family member, bond with your baby, or tend to your own medical condition.
  • Family Planning & Parenting Support : Fertility (eg, IVF, preservation), adoption, and gestational carrier coverage with additional benefits and resources to provide support from planning to parenting.
  • Mental Health Resources : We provide free mental health resources 24 / 7 including therapy, life coaching, and more. Additional work-life services, such as free legal and financial support, available to you as well.
  • A professional development stipend is available to all Andurilians.
  • Company-funded commuter benefits available based on your region.
  • Relocation assistance (depending on role eligibility).
  • 401(k) retirement savings plan - both a traditional and Roth 401(k). (US roles only)
  • The recruiter assigned to this role can share more information about the specific compensation and benefit details associated with this role during the hiring process.

    Anduril is an equal-opportunity employer committed to creating a diverse and inclusive workplace. The Anduril team is made up of incredibly talented and unique individuals, who together are disrupting industry norms by creating new paths towards the future of defense technology. All qualified applicants will be treated with respect and receive equal consideration for employment without regard to race, color, creed, religion, sex, gender identity, sexual orientation, national origin, disability, uniform service, Veteran status, age, or any other protected characteristic per federal, state, or local law, including those with a criminal history, in a manner consistent with the requirements of applicable state and local laws, including the CA Fair Chance Initiative for Hiring Ordinance. We actively encourage members of recognized minorities, women, Veterans, and those with disabilities to apply, and we work to create a welcoming and supportive environment for all applicants throughout the interview process. If you are someone passionate about working on problems that have a real-world impact, we'd love to hear from you!

    To view Anduril's candidate data privacy policy, please visit .

    serp_jobs.job_alerts.create_a_job

    Reliability Engineer • Costa Mesa, California, United States

    Job_description.internal_linking.related_jobs
    • serp_jobs.job_card.promoted
    Sr. Cloud Infrastructure Engineer

    Sr. Cloud Infrastructure Engineer

    Serve RoboticsLos Angeles, CA, US
    serp_jobs.job_card.full_time
    At Serve Robotics, we’re reimagining how things move in cities.Our personable sidewalk robot is our vision for the future. It’s designed to take deliveries away from congested streets, m...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Systems Engineer I / II Pipeline

    Systems Engineer I / II Pipeline

    Rocket Lab CorporationLong Beach, CA, US
    serp_jobs.job_card.permanent
    Rocket Lab is an end-to-end space company delivering responsive launch services, complete spacecraft design and manufacturing, payloads, satellite components, and more – all with the goal of ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    IT Infrastructure Engineer II

    IT Infrastructure Engineer II

    South Central Family Health CentLos Angeles, CA, US
    serp_jobs.job_card.full_time
    We seek a dynamic and experienced.We want to hear from you if you thrive in a fast-paced, caring, and compassionate environment!. The Mission of South-Central Family Health Center is to improve the ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Regional Reliability Engineer

    Regional Reliability Engineer

    Catalyst Recruiting Inc.Los Angeles, CA, US
    serp_jobs.job_card.full_time
    A large company is looking for a reliability engineer to lead reliability and asset engineering and management efforts and projects at multiple sites for a chemical / materials company.Mechanical or ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Sr. DevOps / Infrastructure Engineer

    Sr. DevOps / Infrastructure Engineer

    Serve RoboticsLos Angeles, CA, US
    serp_jobs.job_card.full_time
    At Serve Robotics, we’re reimagining how things move in cities.Our personable sidewalk robot is our vision for the future. It’s designed to take deliveries away from congested streets, m...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Infrastructure, DevOps & Reliability Engineer (Multiple Roles, Remote & On-Site)

    Infrastructure, DevOps & Reliability Engineer (Multiple Roles, Remote & On-Site)

    MLabsLos Angeles, CA, US
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    We’re recruiting Infrastructure, DevOps, and Reliability Engineers for high-growth startups including.AirGarage, Dyno Therapeutics, Codex Health, and Banquet Health.These roles focus on scali...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tentek, Inc.Glendale, CA, US
    serp_jobs.job_card.full_time
    Must report onsite in Glendale 3 days per week, typically Tuesday-Thursday.There will be 3 rounds of interviews for this position. Linux system admin and Windows but willing to consider only Linux b...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    Cloud Reliability Engineer

    Cloud Reliability Engineer

    ViantIrvine, California, United States, 92602
    serp_jobs.job_card.full_time
    The Cloud Reliability Engineer will be responsible for writing and integrating various open source and closed sources tools. The ideal candidate will possess a deep understanding of systems engineer...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Site Reliability Engineer

    Site Reliability Engineer

    TP-Link Systems Inc.Irvine, CA, US
    serp_jobs.job_card.full_time
    At the forefront of the future of connected living, TP-Link's Systems Inc.R&D Center in Irvine, Southern California's innovation hub, spearheads research and development of next-generat...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    TP-Link Systems Inc.Irvine, CA, US
    serp_jobs.job_card.full_time
    serp_jobs.filters_job_card.quick_apply
    At the forefront of the future of connected living, TP-Link's Systems Inc.R&D Center in Irvine, Southern California's innovation hub, spearheads research and development of next-generation netw...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    Network Reliability Engineer - Remote (1772)

    Network Reliability Engineer - Remote (1772)

    CoreSiteLos Angeles, CA, US
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    serp_jobs.filters_job_card.quick_apply
    At CoreSite, we empower a more connected future through high-performance data centers and interconnection solutions.Recognized as a trusted partner in digital transformation, our strategically loca...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Cloud Service Reliability Engineer

    Cloud Service Reliability Engineer

    ForhyreLos Angeles, CA, US
    serp_jobs.job_card.full_time
    We are looking for someone that is generalist at heart, one who is curious, appreciates complexity, knows or wants to learn when to step back and when to dive deep. We call this role a Cloud Service...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    1.61 ML Infrastructure Engineer — ML Platform, Tooling & Systems

    1.61 ML Infrastructure Engineer — ML Platform, Tooling & Systems

    Field AIIrvine, California, United States, 92602
    serp_jobs.job_card.full_time
    Field AI is transforming how robots interact with the real world.We are building risk-aware, reliable, and field-ready AI systems that address the most complex challenges in robotics, unlocking the...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Infrastructure Engineer

    Infrastructure Engineer

    TP-Link Systems Inc.Irvine, CA, US
    serp_jobs.job_card.full_time
    Headquartered in the United States,.Consistently ranked as the world's top provider of Wi-Fi devices, TP-Link is dedicated to delivering innovative solutions that improve people’s lives b...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Infrastructure Engineer

    Senior Infrastructure Engineer

    Recruitment RoomLos Angeles, CA, US
    serp_jobs.job_card.full_time
    Senior Infrastructure Engineer – Blockchain Infrastructure.serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Ground Systems Engineer II

    Ground Systems Engineer II

    COMTECH TELECOMMUNICATIONSCypress, CA, US
    serp_jobs.job_card.full_time
    Comtech Telecommunications Corp.Our unique culture of innovation and employee empowerment unleashes a relentless passion for customer success. With multiple facilities located in technology corridor...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Infrastructure Engineer

    Infrastructure Engineer

    Approach VentureEl Segundo, CA, US
    serp_jobs.job_card.full_time
    Infrastructure Engineer – Help Build the Backbone of Next-Generation Systems!.El Segundo, CA | Hybrid (M / TH in-person). Join a venture-backed startup redefining how complex machines are tested...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Systems Engineer

    Systems Engineer

    Strategic EmploymentIrvine, CA, US
    serp_jobs.job_card.full_time
    We are working with a SaaS-based company in Irvine that's expanding their internal IT team.They're looking to hire a Systems Engineer who can support both internal infrastructure and cloud-...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    K2 SpaceLos Angeles, CA, US
    serp_jobs.job_card.permanent
    K2 Space is building large, high-powered spacecraft for the next generation of space development.Backed by Lightspeed Venture Partners, Altimeter Capital, and many others ($200M raised to date), we...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    Database Reliability Engineer

    Database Reliability Engineer

    ViantIrvine, California, United States, 92602
    serp_jobs.job_card.full_time
    We are looking for a skilled and motivated.Database Administrator (DBA) to join our growing team.In this role, you will support the design, implementation, and day-to-day operations of our database...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30