Talent.com
Site Reliability Engineering (SRE) Architect

QTech • Atlanta, Georgia, USA

Job Title : Site Reliability Engineering (SRE) Architect

Location : Atlanta, Georgia (Hybrid)

Long Term Contract

Looking for W2 candidates. No C2C.

Job Description :

As an SRE Architect, you will be a pivotal technical leader responsible for designing, building, and evolving the foundational systems and practices that ensure the reliability, scalability, performance, and efficiency of our critical services. Moving beyond day-to-day operations, you will focus on the strategic architectural direction of the SRE function, defining standards, blueprints, and frameworks that enable development teams and fellow SRE operations teams to build and operate highly resilient systems. You will leverage deep expertise in software engineering, distributed systems, cloud infrastructure, and SRE principles to influence technology choices, establish best practices, and foster a proactive culture of reliability across the organization, extending well beyond the observability pillar.

Key Responsibilities :

1. Reliability Strategy & Design :

o Architect and design highly available, scalable, secure, and cost-effective infrastructure and application patterns on AWS

o Define and evangelize SRE best practices, standards, and blueprints for service design, deployment, monitoring, and operational readiness across the engineering organization

o Review the current observability implementation to identify gaps and define the steps needed to reach the next level of observability maturity, providing deep insight into system health and behaviour

o As overall maturity grows, lead the definition and implementation strategy for Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets for critical services

2. Platform Architecture & Automation :

o Design solutions to systematically reduce operational toil through automation and improved system design

o Evaluate current SRE tools and automation frameworks (e.g. CI / CD pipelines, Infrastructure as Code modules, automated incident remediation, chaos engineering platforms) and recommend enhancements that improve overall capability

o Evaluate, prototype, and recommend new technologies, tools, and methodologies to enhance system reliability, developer productivity, and operational efficiency

3. Technical Leadership & Consultation :

o Act as a senior technical advisor and subject matter expert on reliability, scalability, and performance for development and platform teams

o Provide architectural guidance during the design phase of new services and features to ensure reliability principles are embedded early (shift-left)

o Mentor and coach other SREs and engineers, fostering technical excellence and adherence to SRE principles

o Lead architectural reviews and production readiness assessments for critical systems

4. Resilience :

o Lead blameless postmortems for significant incidents, ensuring root causes are identified and systemic architectural improvements are prioritized and implemented

o Architect and advocate for resilience patterns (e.g. circuit breaking, rate limiting, graceful degradation, chaos engineering) within applications and infrastructure

Required Qualifications :

Proven experience in an architectural role designing solutions for reliability, scalability, and performance

Deep understanding and practical application of SRE principles (SLIs / SLOs, error budgets, toil reduction, automation, incident management, postmortems)

Expertise in cloud computing platforms (e.g. AWS), including infrastructure, networking, and security services

Strong experience with containerization and orchestration technologies (Kubernetes, Docker, serverless computing)

Solid experience designing and implementing observability solutions (e.g. Dynatrace, Prometheus, Grafana, ELK / EFK Stack, Jaeger, OpenTelemetry)

Strong programming / scripting skills (e.g. Python, Go, Bash) for automation and tool development

Excellent analytical, problem-solving, and strategic thinking skills

Strong communication, collaboration, and leadership skills, with the ability to influence technical direction across teams

Preferred Qualifications :

Experience designing and implementing chaos engineering practices and platforms

Best Regards

Tarun K

Phone : 1-

Email :

Key Skills :

Fashion Retail,Highway Design,Apache Web Server,Atl,CAD CAM,ABAP

Employment Type : Full Time

Experience : years

Vacancy : 1
