Software Engineer, Site Reliability Engineer (SRE)Harvey • San Francisco, California, United States

Software Engineer, Site Reliability Engineer (SRE)

Harvey • San Francisco, California, United States

[job_card.30_days_ago]

[job_preview.job_type]

[job_card.full_time]

[job_card.job_description]

Why Harvey

Harvey is a secure AI platform for legal and professional services that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning-adept LLMs that have been customized and developed by our expert team of lawyers, engineers and research scientists. We’ve found product market fit and are scaling our team very quickly. Some reasons to join Harvey are :

Exceptional product market fit : We have partnered with the largest law firms and professional service providers in the world, including Paul Weiss , A&O Shearman , Ashurst , O'Melveny & Myers, PwC , KKR, and many others.

Strategic investors : Raised over $500 million from strategic investors including Sequoia, Google Ventures, Kleiner Perkins, and OpenAI.

World-class team : Harvey is hiring the best talent from DeepMind, Google Brain, Stripe, FAIR, Tesla Autopilot, Glean, Superhuman, Figma, and more.

Partnerships : Our engineers and researchers work directly with OpenAI to build the future of generative AI and redefine professional services.

Performance : 4x ARR in 2024.

Competitive compensation.

Role Overview

As a Software Engineer on the Site Reliability team at Harvey, you will ensure the reliability, scalability, and performance of our legal AI platform. You’ll join a high-leverage team that sits at the intersection of infrastructure and product, owning the systems that keep our platform fast, secure, and always on. From scaling across 50+ regions to automating mission-critical operations, your work will ensure that Harvey remains resilient as we grow. If you’re passionate about building robust systems and reducing complexity through automation, we’d love to work with you.

This role is based in San Francisco, CA. We use an in-person work model and offer relocation assistance to new employees.

What You’ll Do

Design, implement, and manage monitoring, alerting, and infrastructure resources (compute, storage, networking) across 50+ global regions

Lead incident management processes, including postmortems, root cause analyses, and driving actionable improvements

Automate operational tasks and workflows, building tools and processes for capacity planning, graceful rollouts, and safe data access to maintain high reliability and reduce manual intervention

Develop and enforce best practices for security, compliance, and infrastructure reliability while collaborating cross-functionally to integrate these principles throughout the software lifecycle

Optimize infrastructure costs through strategic capacity planning and build-versus-buy decisions while maintaining system performance, reliability, and functionality.

What You Have

3+ years of experience in Site Reliability Engineering or similar roles supporting production environments

Expertise in infrastructure as code(IaC) tools (Pulumi, Terraform, CloudFormation, etc.).

Deep familiarity with observability tools (Datadog, Sentry, etc.) and incident response practices (PagerDuty, IncidentIO, etc.)

Proficiency with cloud infrastructure platforms (Azure, GCP, AWS, etc.)

Strong programming skills (Python, Bash, Go, or similar languages)

Proven track record of diagnosing complex system problems and implementing durable solutions

Solid understanding of CI / CD, Kubernetes, containerization, networking, and cloud security principles

Excellent problem-solving skills, meticulous attention to detail, and a commitment to operational excellence

Compensation Range

$175,000 - $250,000 USD

Please find our CA applicant privacy notice here .

Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity / expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.

We are in the early innings of a generational company. Joining early at a hypergrowth startup has proven to lead to exponential growth in responsibility, access, and ability. Apply here today!

[job_alerts.create_a_job]

Site Reliability Engineer Sre • San Francisco, California, United States

[internal_linking.related_jobs]

Senior Technology Site Reliability Engineer

Cooley LLP • San Francisco, CA, United States

[job_card.full_time]

Senior Technology Site Reliability Engineer.Cooley is seeking a Senior Site Reliability Engineer to join the.Infrastructure & Development Operations. The Senior Technology Site Reliability Engineer(...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Systems Reliability Engineer (SRE) - Edge

Cloudflare • San Francisco, CA, United States

[job_card.full_time]

Systems Reliability Engineer (SRE) - Edge.At Cloudflare, we are on a mission to help build a better Internet.Today the company runs one of the world’s largest networks that powers millions of websi...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Systems Reliability Engineer (SRE), Edge

Cloudflare, Inc. • San Francisco, CA, United States

[job_card.full_time]

At Cloudflare, we are on a mission to help build a better Internet.Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties for cust...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Site Reliability Engineer

Together AI • San Francisco, CA, United States

[job_card.full_time]

As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a soft...[show_more]

[last_updated.last_updated_30] • [promoted]

Software Engineer, Site Reliability (SRE)

Sierra • San Francisco, CA, United States

[job_card.full_time]

At Sierra, we’re creating a platform to help businesses build better, more human customer experiences with AI.We are primarily an in-person company based in San Francisco, with growing offices in A...[show_more]

[last_updated.last_updated_30] • [promoted]

Software Engineer, Site Reliability (SRE)

Sierra Business Solution • San Francisco, CA, United States

[job_card.full_time]

Software Engineer, Site Reliability (SRE).Software Engineer, Site Reliability (SRE).We are an in‑person company based in San Francisco with growing offices in Atlanta, New York, and London, buildin...[show_more]

[last_updated.last_updated_30] • [promoted]

Senior Site Reliability Engineer

Alembic Technologies • San Francisco, CA, United States

[job_card.full_time]

Senior Site Reliability Engineer.This range is provided by Alembic Technologies.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.We’re looking fo...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Site Reliability Engineer

Canonical • San Francisco, CA, United States

[job_card.full_time]

Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiat...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer (SRE)

Air Apps • San Francisco, CA, United States

[job_card.full_time]

Site Reliability Engineer (SRE).Site Reliability Engineer (SRE).Get AI-powered advice on this job and more exclusive features. At Air Apps, we believe in thinking bigger—and moving faster.We’re a fa...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer I

Prosper • San Francisco, CA, United States

[job_card.full_time]

As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...[show_more]

[last_updated.last_updated_30] • [promoted]

Software Engineer (Site Reliability Engineer)

Anyscale • San Francisco, CA, United States

[job_card.full_time]

Software Engineer (Site Reliability Engineer).Software Engineer (Site Reliability Engineer).At Anyscale, we're on a mission to democratize distributed computing and make it accessible to software d...[show_more]

[last_updated.last_updated_30] • [promoted]

Sr. Site Reliability Engineer

Apple Inc. • San Francisco, CA, United States

[job_card.full_time]

San Francisco Bay Area, California, United States Software and Services.Apple is where individual imaginations gather together, committing to the values that lead to great work.Every new product we...[show_more]

[last_updated.last_updated_1_day] • [promoted]

Site Reliability Engineer - Scale & Observability

gamma.app • San Francisco, CA, United States

[job_card.full_time]

A dynamic tech firm located in San Francisco is seeking a Site Reliability Engineer to enhance operational health across their production systems. This high-impact role demands expertise in AWS and ...[show_more]

[last_updated.last_updated_variable_hours] • [promoted] • [new]

Site Reliability Engineer

Speak • San Francisco, CA, United States

[job_card.full_time]

Our mission is to reinvent the way people learn, starting with language.Learning a language can change a life by opening doors to new cultures, careers, and communities. Two billion people around th...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Site Reliability Engineer

Flexton, Inc. • San Francisco, CA, United States

[job_card.full_time]

Skill : You have excellent written and verbal communication skills.You have experience managing large websites or services within the context of a large scale web environment.You are able to execute...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer II

Hinge Health • San Francisco, CA, United States

[job_card.full_time]

From scaling Kubernetes clusters to improving observability with Datadog, we build the tooling and automation that empower product teams to ship with confidence. Collaborate with engineering teams t...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer (SRE)

Baseten • San Francisco, CA, United States

[job_card.full_time]

Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and Zed. By uniting applied AI research, flexible inf...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer

Cypress HCM • San Mateo, CA, United States

[job_card.full_time]

As a Site Reliability Engineer (Contractor), you will be a hands-on contributor, focused on supporting and improving the reliability of our AWS cloud infrastructure. You will apply core SRE principl...[show_more]

[last_updated.last_updated_variable_days] • [promoted]