Talent.com
Senior Site Reliability Engineer
Senior Site Reliability EngineerGreenLite • New York, New York, United States
Senior Site Reliability Engineer

Senior Site Reliability Engineer

GreenLite • New York, New York, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Our Company

Founded in 2022, GreenLite is revolutionizing development in America by streamlining the collaboration between developers, builders, and local regulatory authorities. GreenLite’s software powers its Private Plan Review offering, serving many of the nation’s largest public retailers, developers, and production home builders. By leveraging GreenLite’s technology, its customers save months on each project, significantly accelerating their timelines and staying within budget.

GreenLite is founded by experts in technology, development, and within the AEC (Architecture, Engineering, and Construction) industry, and backed by leading venture capital firms. GreenLite is at the forefront of the privatization of construction permitting and plan review, reshaping a multi-hundred billion dollar industry.

GreenLite has raised nearly $40M from the country’s leading venture capital investors, including Craft Ventures, who led GreenLite’s $28.5M Series A. We’re well capitalized to achieve our mission of revolutionizing the plan review and construction permitting process across the country.

The role and why it matters

Reliability is a product for our customers : city officials rely on us to be available whenever a submission deadline looms, and builders stake millions of dollars on predictable turnaround. As our first dedicated SRE you will establish the patterns, tooling and culture that keep our systems fast, observable and resilient while we 10x traffic over the next 18 months.

Our operating principles— Winning Mentality, Speed & Urgency, Disagree & Commit, Ownership & Integrity, Customer Centricity —are not wall art; they guide hiring, architecture and on‑call decisions.

What you’ll do

Design & harden production infrastructure AWS ECS / Fargate via AWS Copilot (migrating to Terraform), RDS / Postgres, S3, EventBridge, Bedrock.

Lead reliability engineering : SLO / SLA definition, error‑budget policies, capacity planning and load testing ahead of major launches.

Own CI / CD : advance our GitHub Actions pipeline, introduce progressive delivery and automated rollbacks to steadily maintain & improve deployment frequency and lead time for changes.

Instrument & Observe : deploy metrics, tracing and logging (Datadog) and drive an on‑call culture focused on MTTR and learning reviews, not blame.

Security & compliance : partner with the engineers to automate patching, secrets management & rotation, least‑privilege IAM and SOC 2 controls.

Coach & collaborate : mentor engineers on SRE best practices, work closely with ML and product squads, and influence architecture decisions through strong opinions loosely held.

Continuously improve : identify systemic bottlenecks, build tooling that eliminates toil and scale our platform without scaling pager fatigue.

What you’ll bring

Must‑have :

6+ yrs building and operating production systems in AWS, GCP or Azure (AWS preferred).

Demonstrated ownership of SLOs, incident response and post‑incident analysis.

Expert in IaC (Terraform, CDK, Pulumi) and container orchestration (ECS, EKS or K8s).

Proficient with at least one modern language (Python, Rust, Go) and strong bash skills.

Deep familiarity with observability stacks (Datadog, Grafana, Prometheus, OTEL).

Track record of raising the bar for security, compliance and cost optimisation.

Nice-to-have :

Experience with infrastructure for ML workflows (model training, feature stores).

Prior work in construction‑tech, gov‑tech or other regulated domains.

Certification : AWS Solutions Architect or DevOps Pro.

Experience introducing chaos engineering or game‑days.

Public track record (blog posts, OSS) advancing the SRE discipline.

Leadership in defining hiring / on‑call processes at a high‑growth startup.

In your first 180 days you will

30 days – Stand up staging / production dashboards, own the on‑call rotation and deliver a gap‑analysis of our reliability posture. Take ownership of our migration into AWS Control Tower, and contribute to architecture for hosting our production applications, including AI engineering.

60 days – Roll out error‑budget policies, automated canary deploys and service‑level telemetry across all micro‑services. Complete migration off of AWS CoPilot. Plan migration from RDS Postgres to Aurora Postgres, including metrics. Establish production infrastructure for AI engineering.

90 days – Reduce p95 latency by ≥20 %, cut mean time‑to‑recovery (MTTR) to

180 days – Mentor two mid‑level engineers into effective first responders and established infrastructure for ML products. Train team on disaster recovery plan, and do a dry run of restoration from backups.

What success looks like

99.95 % customer‑visible uptime with clearly defined SLAs.

Engineering velocity accelerates because infrastructure just works and developers ship confidently.

Post‑incident reviews focus on learning; recurring classes of incidents drop each quarter.

Stakeholders (product, customer success, city reviewers) describe reliability as a core differentiator.

Our team thrives on collaboration, so we’re in the office 4 days per week. In the summer, from Memorial Day to Labor Day, we switch to a 3-day in-office schedule to give everyone extra flexibility.

Our hiring process

Intro with Talent Partner

values & architecture deep‑dive with Head of Engineering

On-site : Practical systems‑design exercise (real scenarios we face)

On-site : On‑call simulation & retrospective with two engineers

On-site : Cross functional Panel interview

Final exec conversation and offer discussion

Thrive With GreenLite

Competitive Compensation - Generous base salary & access to our Employee Equity Program, so you can grow with us.

Performance-Based Annual Bonuses - Rewards for high-impact results and contributions that move the needle.

Premium Health Coverage - Comprehensive medical, dental, and vision insurance for full-time team members : 100% of premiums covered under our HDHP plan & 98% coverage for employees and their spouses.

401(k) Retirement Plan - Helping you invest in your future with smart saving options.

Parental Leave - Generous parental leave for all parents to support your growing family.

Wellness Support - Monthly Wellness Stipend and full access to Wellhub, Talkspace, & Teladoc for your physical and mental well-being.

Weekly Team Lunches - Enjoy catered lunches every week in our NYC office. Great food, better company.

Company-Wide Team All Hands - Held twice a year, fostering transparency, alignment, and inspiration.

Team-Building Events - Regular opportunities to connect, collaborate, and celebrate as a team.

Unlimited PTO - Flexible time off so you can recharge, travel, or take care of life as needed.

Hybrid Work Environment – Our team thrives on collaboration, so we’re in the office 4 days per week. In the summer, from Memorial Day to Labor Day, we switch to a 3-day in-office schedule to give everyone extra flexibility.

Equal Opportunity Statement

GreenLite values people from all walks of life and professional backgrounds. We understand not everyone will meet all the above qualifications on day one. That's okay. If you’re passionate about the construction industry or solving the housing crisis in America, and want the opportunity to grow in your career, we encourage you to apply.

GreenLite is an equal employment opportunity employer, committed to an inclusive workplace where we do not discriminate on the basis of race, sex, gender, national origin, religion, sexual orientation, gender identity, marital or familial status, age, ancestry, disability, genetic information, or any other characteristic protected by applicable laws. We believe in diversity and encourage any qualified individual to apply.

serp_jobs.job_alerts.create_a_job

Senior Site Reliability Engineer • New York, New York, United States

Job_description.internal_linking.related_jobs
Site Reliability Engineer

Site Reliability Engineer

S&P Global • New York, New York, United States
serp_jobs.job_card.full_time
This job is with S&P Global, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly.About the Rol...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Senior Software Engineer, Enterprise GenAI

Senior Software Engineer, Enterprise GenAI

Scale AI, Inc. • New York, NY, United States
serp_jobs.job_card.full_time
Scale GP (Scale Generative AI Platform) is an enterprise-grade Generative AI platform that provides APIs for knowledge retrieval, inference, evaluation, and more. We are looking for a strong enginee...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Reliability Software Engineer

Reliability Software Engineer

Point72 • New York, New York, United States
serp_jobs.job_card.full_time
A Career with Point72’s Technology Team.As Point72 reimagines the future of investing, our Technology group is constantly improving our company’s IT infrastructure, positioning us at the forefront ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Site Reliability Engineer (Genetec) (Englewood Cliffs)

Site Reliability Engineer (Genetec) (Englewood Cliffs)

STAND 8 Technology Consulting • Englewood Cliffs, NJ, United States
serp_jobs.job_card.full_time
STAND 8 provides end to end IT solutions to enterprise partners across the United States and with offices in Los Angeles, New York, New Jersey, Atlanta, and more including internationally in Mexico...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Site Reliability Engineer Charlotte, NC / Chandler, AZ / , NJ

Site Reliability Engineer Charlotte, NC / Chandler, AZ / , NJ

Career Mentors, LLC • Jersey City, NJ, US
serp_jobs.job_card.full_time
Pay Rate : upto $75 pr hr on W2.Jersey City, NJ - Near by candidates.Previously functioned in an SRE role within a large production environment, with a focus on automation testing experience.Hands-o...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30
Fulltime role Sr. Site Reliability Engineer

Fulltime role Sr. Site Reliability Engineer

Smart Bot Systems • Jersey City, NJ, United States
serp_jobs.job_card.full_time
serp_jobs.filters_job_card.quick_apply
Job Title : Sr.Site Reliability Engineer Location : Jersey City, NJ Hybrid <...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days
Senior Software Engineer

Senior Software Engineer

Camber • New York, New York, United States
serp_jobs.job_card.full_time
Camber builds software to improve the quality and accessibility of healthcare.We streamline and replace manual work so clinicians can focus on what they do best : providing great care.For more detai...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior Technology Site Reliability Engineer

Senior Technology Site Reliability Engineer

Cooley LLP • New York, NY, United States
serp_jobs.job_card.full_time
Senior Technology Site Reliability Engineer.Cooley is seeking a Senior Site Reliability Engineer to join the.Infrastructure & Development Operations. The Senior Technology Site Reliability Engineer(...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Site Reliability Engineer

Site Reliability Engineer

Tekgence Private Ltd • Jersey City, NJ, United States
serp_jobs.job_card.full_time
serp_jobs.filters_job_card.quick_apply
Site Reliability Engineer Location : Jersey City, NJ Day 1 onsite Hybrid Required skills Python,...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days
Senior Energy Engineer

Senior Energy Engineer

Willdan Group, Inc. • New York, NY, United States
serp_jobs.job_card.full_time
Enica Engineering, a division of Willdan Group Inc, provides services built on its industry-leading controls expertise and unique, hands-on project delivery. We offer services in project development...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Remote Side Hustle Developer

Remote Side Hustle Developer

Finance Buzz • Keansburg, New Jersey, US
serp_jobs.filters.remote
serp_jobs.job_card.full_time +1
This position is for individuals who want to develop a side income stream while still working full time.You will test different small-scale remote opportunities, learn what works, and grow what pro...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior Fullstack Engineer, Detect

Senior Fullstack Engineer, Detect

Sage • New York, New York, United States
serp_jobs.job_card.full_time
Sage is on a mission to improve care and quality of life for older adults, starting with those residing in senior living facilities. Falls are the leading cause of injury-related death among adults ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Staff Site Reliability Engineer

Staff Site Reliability Engineer

Altana AI • New York, NY, United States
serp_jobs.job_card.full_time
AI can be a powerful tool for good in the world – at Altana we apply AI to the world’s largest organized body of supply chain data to power a more resilient, more secure, and more sustainable model...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Site Reliability Manager

Site Reliability Manager

Macmillan Learning • New York, NY, United States
serp_jobs.job_card.full_time
The Site Reliability Manager (SRM) maintains the availability, reliability, and performance of internal applications and SaaS platforms. This role involves managing incidents, optimizing system perf...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Business Development Manager

Business Development Manager

The Kiely Family of Companies • Eatontown, NJ, US
serp_jobs.job_card.full_time
Since 1952, Kiely Family of Companies has been building lasting relationships and delivering innovative design-build solutions that put our customers’ success first.Recognized on the ENR 400,...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior Engineer

Senior Engineer

Columbia University • New York, NY, United States
serp_jobs.job_card.full_time
Job Type : Officer of Administration.Salary Range : $150,000 - $159,000.The salary of the finalist selected for this role will be set based on a variety of factors, including but not limited to depar...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Sr Lead Software Engineer, Back End / SRE - Shopping (Remote-Eligible)

Sr Lead Software Engineer, Back End / SRE - Shopping (Remote-Eligible)

Capital One • New York, NY, US
serp_jobs.filters.remote
serp_jobs.job_card.full_time +1
Sr Lead Software Engineer, Back End / SRE - Shopping (Remote-Eligible).Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, col...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
DevOps / Site Reliability Engineer

DevOps / Site Reliability Engineer

Noblesoft Technologies • Englewood Cliffs, NJ, United States
serp_jobs.job_card.full_time
serp_jobs.filters_job_card.quick_apply
Job Title : DevOps / Site Reliability Engineer Location : Englewood Cliffs, NJ Mandatory skill : CI / CD, AW...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days