Talent.com
Senior Site Reliability Engineer, Colorado Springs
Onebrief • Oahu, HI, Hawaii, United States
Posted: 30 days ago
Job type: Full-time

Job description

About Onebrief

Onebrief is collaboration and AI-powered workflow software designed specifically for military staffs. By transforming this work, Onebrief makes the staff as a whole superhuman - meaning faster, smarter, and more efficient.

We take ownership, seek excellence, and play to win with the seriousness and camaraderie of an Olympic team. Onebrief operates as an all-remote company, though many of our employees work alongside our customers at military commands around the world.

Founded in 2019 by a group of experienced planners, Onebrief today has a team that spans veterans from all forces and global organizations, as well as technologists from leading-edge software companies. We’ve raised $123M+ from top-tier investors, including Battery Ventures, General Catalyst, Insight Partners, and Human Capital, and Onebrief is now valued at $1.1B. With this continued growth, Onebrief is able to make an impact where it matters most.

Security Clearance, Location, and Onsite Notice:

This role requires regularly working on-site at customer locations in Colorado Springs, Colorado.

If you are not currently within commuting distance, you must be willing to relocate (note that Onebrief will provide relocation assistance).

Active Top Secret Clearance required; SCI eligibility is a plus.

About The Role

We are hiring a Site Reliability Engineer to join our Infrastructure & Security team. You’ll report to our Director of Infrastructure and work closely with fellow SREs, security, and customer success.

You will be the first line of support for our mission-critical deployments and responsible for ensuring best-in-class service quality and issue resolution. You will work in both on-premises DoD environments and AWS cloud environments. Your lessons from the field will shape how our team works, from policy to implementation.

In addition to working on-site with the customer, you will contribute directly to solutions that increase the stability, performance, and security of our deployments and improve the overall experience of deploying and managing Onebrief on premises.

About You

You are a force multiplier who views reliability as the most critical feature of any application and/or platform and believes that "reliability beats novelty." You see infrastructure and operability as a product to be automated, documented, and continuously improved, always leaving systems easier to operate than you found them.

You are equally comfortable leading a post-incident review, designing SLOs in a system design session, or diving into a kubectl shell to triage a complex production issue. You don't just fix problems; you translate constraints and failure modes into clear, automated guardrails and scalable, resilient architecture. For you, robust monitoring, actionable alerting, and insightful runbooks are core parts of the engineering process, not afterthoughts.

You mentor others, fostering a culture of blameless postmortems and proactive reliability. You collaborate naturally with application and platform teams, helping them move quickly but safely by building the tools, processes, and observability that make "fast recovery" a reality.

What You'll Do

You'll own the reliability, scalability, and security of the production application and/or platform. You will do this by:

Building a World-Class Observability Platform: Design, implement, and manage our monitoring, logging, and alerting stack (e.g., Prometheus, Loki, Alloy, and Grafana). You won't just track metrics; you'll create the actionable insights and automated alerting that allow teams to identify and resolve issues before they impact users.

Defining and Upholding Reliability: Define, measure, and own alerting that feeds into our Service Level Objectives (SLOs) and increases trust internally and externally. You will be the organization's expert on what it means for our systems to be reliable and how to measure it.

Leading Incident Response: Act as the incident responder and potentially incident commander during critical incidents. You will lead blameless post-mortems / After Action Reviews (AARs) that identify true root causes and drive automated, long-term solutions to prevent recurrence.

Automating for Scale and Security: Partner with platform engineers to design, build, and manage secure, resilient Kubernetes clusters and cloud / on-prem environments using Infrastructure-as-Code (Terraform, Ansible). You will embed security and compliance controls (RMF, STIGs) directly into this automation.

Eliminating Toil and Scaling the Team: Proactively identify and eliminate operational toil by building automation. You will act as a force multiplier by advising other teams on best practices in air-gapped environments and production readiness.

What We Look For

3 years of experience in Site Reliability Engineering or a related field, with firsthand experience managing mission-critical systems within DoD air-gapped environments.

An active Top Secret security clearance. U.S. citizenship required.

Experience automating software delivery, deployment, and providing documentation and self-service tools for engineering teams and customers.

A strong understanding of Linux, containerization and orchestration, and virtual machines.

Experience with centralized logging, metrics, and observability using tools such as Prometheus, Loki, Grafana, ELK stack, or Datadog.

Networking fundamentals: core protocols and secure configurations.

A deep understanding of incident response processes, with experience conducting thorough root cause analyses and driving continuous improvement.

Clear, concise writing; strong documentation habits and async communication.

Core skills and technologies: VMware, Kubernetes, Docker, Helm, Ansible, Terraform, Linux, AWS, DoD compliance, and monitoring and observability tools.

Bonus points (nice to have)

Experience with compliance frameworks (RMF, STIGs / SRGs, ICD 503).

Security‑minded design for air-gapped environments.

Active Security+ or another DoD 8570.01-approved security credential, or the ability to obtain a valid credential within 3 months of employment.
