Staff Site Reliability Engineer

Grindr • Palo Alto, CA, United States

This range is provided by Grindr. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

Base pay range

$152,400.00/yr - $254,000.00/yr

This is a hybrid role based in our Chicago or Palo Alto office, and you will be required to be in the office on Tuesdays and Thursdays.

What’s so interesting about this role?

The Site Reliability Engineering (SRE) team at Grindr is responsible for ensuring our systems are stable, performant, and scalable as we continue to grow globally. This role reports directly to the Director of Technical Operations and plays a critical part in keeping our infrastructure running reliably while supporting both backend and operations teams. By driving improvements in automation, incident response, and performance optimization, this position ensures Grindr can deliver a safe, reliable, and seamless experience to millions of users worldwide. The team’s work directly impacts uptime, efficiency, and overall system resilience, supporting Grindr’s broader roadmap of building a secure and high‑performing platform for the LGBTQ+ community.

What’s the job?

  • Monitoring and Alerting: Set up and maintain monitoring systems to track the health and performance of applications and infrastructure. Create and manage alerting mechanisms to detect and respond to issues quickly.
  • Incident Response: Handle incidents and outages, working to resolve them swiftly and minimize downtime. Perform root cause analysis to prevent future occurrences and improve system resilience.
  • Automation: Develop tools and scripts to automate repetitive tasks, such as deployments, monitoring, and scaling, to increase efficiency and reduce human error.
  • Performance Optimization: Analyze system performance and identify bottlenecks or areas for improvement. Work with development teams to optimize code and infrastructure for better performance and resource utilization.
  • Capacity Planning: Plan for future growth by analyzing current usage trends and forecasting resource needs. Ensure that systems can handle increased load without compromising performance or reliability.
  • Service Level Objectives (SLOs) and Service Level Agreements (SLAs): Define and measure SLOs and SLAs to set expectations for system reliability and performance. Track these metrics and work to maintain or exceed the defined standards.
  • Incident Management and Postmortems: After incidents, conduct postmortems to document what went wrong, what was done to fix it, and how to prevent similar incidents in the future. This process supports continuous improvement and learning from failures.
  • Collaboration with Development Teams: Work closely with software developers to integrate reliability and performance into the development process. Provide guidance on best practices and assist with designing resilient systems.
  • Security and Compliance: Ensure that systems are secure and compliant with relevant regulations and standards. Implement security measures, monitor for vulnerabilities, and respond to security incidents.
  • Continuous Improvement: Continuously look for ways to improve system reliability, performance, and efficiency. Stay current with industry trends and advancements in order to adopt best practices and technologies.
  • Participate in an on‑call rotation.

What We’ll Love About You

  • 5+ years of experience in site reliability, including incident response, incident management, automation, and performance optimization
  • 5+ years of experience in cloud platforms (AWS preferred)
  • 4+ years of experience working with DevOps technologies such as Docker, Kubernetes, Helm, and Terraform
  • 4+ years developing and maintaining CI/CD pipelines
  • 4+ years of experience using a scripting language such as Python or Bash
  • Experience coding in Kotlin or another JVM language is a plus

We’ll Really Swoon if You Have

Technical Expertise:

  • Proficient in at least one programming language (e.g., Python, Go, Java).
  • Strong knowledge of Linux/Unix systems.
  • Experience with cloud platforms (e.g., AWS, GCP, Azure).
  • Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
  • Understanding of networking concepts and protocols.

Reliability Engineering:

  • Experience with monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK stack).
  • Ability to implement and manage CI/CD pipelines.
  • Knowledge of infrastructure as code (e.g., Terraform, Ansible).
  • Proficiency in automated testing and deployment practices.
  • Understanding of SRE principles and practices, including SLAs, SLOs, and SLIs.

Security:

  • Knowledge of security best practices and compliance standards.
  • Experience with vulnerability assessment and mitigation.

Operational Excellence:

  • Proven track record of maintaining high availability and performance in production environments.
  • Experience with incident management and post‑mortem analysis.
  • Ability to optimize system performance and resource utilization.

What You’ll Love About Us

  • Mission and Impact: Grindr is building the global gayborhood in your pocket. Your role will impact the lives of millions of LGBTQ+ people around the world. Through our success, we are making a world where the lives of our community are free, equal, and just.
  • Family Insurance: Insurance premium coverage for health, dental, and vision for you and partial coverage for your dependents.
  • Retirement Savings: Generous 401(k) plan with 6% match and immediate vesting in the U.S.
  • Compensation: Industry‑competitive compensation and eligibility for company bonus and equity programs.
  • Queer‑Inclusive Benefits: Industry‑leading gender‑affirming offerings with up to 90% cost coverage, access to Included Health, monthly stipends for HRT, and more.
  • Additional Benefits: Flexible vacation policy, monthly stipends for cell phone, internet, wellness, food, and commuting, breakfast/lunch provided onsite, and yearly travel & leisure stipend.

About Grindr

Grindr is building the global gayborhood in your pocket. With more than 13.5 million monthly active users, Grindr has become a fundamental part of the LGBTQ+ community and is charting a path to make the world more free, equal, and just. Since 2015, Grindr for Equality has advanced safety, health, and human rights for millions of Grindr users and the global LGBTQ+ community in partnership with more than 100 community organizations in every region of the world.

Our next evolution is underway as a public company that continues to grow and build meaningful experiences for our users. From social issues to product innovations, we’re setting audacious goals for our community and the business, and leveraging the latest tech stacks and a culture of engineering excellence to make it happen. At the heart of our work in this new chapter is a shared set of operating principles centered around cultivating curiosity, thinking big, setting and expediting our ambitious goals, and growing through iteration, all while keeping our users #1.

Grindr is headquartered in West Hollywood, California, with offices in the Bay Area, Chicago, and New York. With a track record of strong financial performance and plans for continued headcount growth, we’re building a team of talented, passionate, and open‑minded people who want to disrupt the dating app space, innovate products, and advance LGBTQ+ culture. Come be a part of this exciting journey with us.

Grindr is an equal‑opportunity employer.

To learn more about how we handle the personal data of applicants, visit our Employee and Candidate Privacy Policy.

Seniority level: Mid‑Senior level
Employment type: Full‑time
Job function: Engineering and Information Technology
Industries: Software Development