Job Title : Site Reliability Engineer III
Location : Arlington, TX 2 days Onsite
Contract to hire Visa Independent consultants
JOB SUMMARY
Job Scope :
- Collaboration / Architecture / Development Partnering with Architecture / Development Teams, Ensuring Applications Highly Available / Reliable / Performant at Global Scale
- Reliability Guidance Collaborating with Architecture Team, Ensuring Reliability Factors are Accounted for in Business Features / Enablers
- SLO - Service-Level Operations / SLI Service-Level Indicators Management / Implementation Guiding Development Teams in Understanding Established Service Level Objectives / Consequences | Implementing Appropriate SLIs to Support Objectives
- Troubleshooting / Problem Resolution Collaborating with Development Team Members to Swarm / Troubleshoot / Resolve Problems
- Root Cause Analysis / Solution Planning Guiding Ad-Hoc Teams to Brainstorm Solutions | Build Implementation Plans Based on Root Cause Analysis of Production Issues
- Automation / Optimization Designing / Building Automated Solutions to Optimize Application / Service / Platform Uptime with Minimal Human Intervention
- Standards / Mentorship Implementing / Helping Create Standards / Best Practices | Mentoring Team Members to Drive Adoption Across Development Teams
Job Requirements :
Programming / Scripting Background Java / C# (.NET MVC / .NET Core) / Go | PowerShell / BashSite Reliability Engineer Identifying / Delivering Automation Solutions, Ensuring HA / ResiliencySLO - Service-Level Operations / SLI Service-Level Indicators Management Defining / Implementing / Evaluating SLOs / SLIs | Associated ConsequencesPipeline Automation Azure DevOps (YAML / ARM) / Terraform / Jenkins / Chef / Octopus Deploy | Designing / Building / Optimizing Automated Pipelines with Automated Testing / Automated Security ControlsDevOps / Containerization AKS (Azure Kubernetes Service) / Kubernetes (Open Source) / DockerDatabase Design / Optimization Oracle / MS SQL Server / NoSQL (CosmosDB) | Designing / Evolving Database Schemas | Performing Query Performance Analysis | Indexing to Deliver Scalable / Performant ServicesCode Scanning SonarQube / Checkmarx | Configurations / CI / CD Integrations / Running Scans / Triaging, etc.Test Automation Xamarin UITest / SpecFlow / DevTest / Selenium / Test Data Manager / Postman / Maven / TestNG / JMeterRoot Cause Analysis / Problem Management Performing Root Cause Analysis / Managing ProblemsSCRUM / Agile Leadership Working in SCRUM / Agile Teams | Demonstrated Success Leading ImprovementsTechnical Skills Requirements :
Proficiency in :
C#.NETSQLAzure expertise :AKS (Azure Kubernetes Service)Azure MonitoringAzure Certifications ( preferred, not required )Serverless architectureStorage and Service BusDesign and architecture experience