Ensure operational excellence. Independently drive the triaging and service restoration of all high-impact incidents to minimize the mean time to service restoration and impact the business. Demonstrate end-to-end ownership.
Partner with infrastructure teams to design and implement intelligent incident routing, enhanced monitoring / alerting capabilities and automated service restoration processes. Take proactive measures to prevent high impactful incidents.
Achieve and maintain the continuity of Hartford and third-party assets that support a business function. Accountable for keeping the IT application and infrastructure metadata repositories current.
System Thinking, end-to-end and broad understanding of enterprise architectures and distributed systems.
Highly collaborative, partners with peers, stakeholders with a passion about delighting customers.
Hands on experience with Performance and Observability tools such as Splunk ITSI (IT Service Intelligence), Dynatrace, Splunk, CloudWatch, CloudTrail, and related tools.
Strong solution architecture orientation to enable expedient troubleshooting, issue-resolution and root-cause removal in a hybrid cloud environment.
Experience with continuous integration and DevOps methodologies, preferred tools such as GitHub, Jenkins, Nexus, Rally, SonarQube, Akamai etc.
Keeps abreast with new market technologies and adept at learning and adopting new models. Promotes and applies continuous learning.
Knowledge of complex traditional and modern enterprise architectures and systems. Strong hybrid cloud experience (private and public) across various service delivery models – SRE, IaaS, PaaS, SaaS.
Effective communication (verbally and written) / collaboration / negotiation skill, working in a diverse team cross business unit