Site-Reliability Engineer

Axiom Software Solutions Limited
Phoenix, Arizona, United States
Job type: Full-time

Job Description:

  • Minimum 3-5 years of service reliability / operations experience running large-scale, high-performance applications in a hybrid environment (on-prem and cloud).
  • Minimum 3-5 years of experience writing automation scripts and building dashboards for application performance management to manage transaction journeys.
  • 2-4 years of experience working with programming languages such as Go, Python, Java, Rust, etc.
  • Working knowledge of one or more databases: Oracle, PL/SQL, SQL Server, Redis, ClickHouse, PostgreSQL, MongoDB, or any time-series database.
  • At least 2 years of experience transitioning platforms to the cloud and containerization: GCP, AWS and Rancher (or CloudFormation, Azure and OpenShift).
  • Experience maintaining containerized applications in GKE / RKE / AKE environments.
  • Experience implementing cloud observability using OTEL (OpenTelemetry) to enable real-time monitoring, distributed tracing and incident resolution.
  • Experience working with a GraphQL framework (Apollo, Prisma, Hasura, etc.).
  • Experience applying knowledge of networking protocols and concepts such as TCP/IP, HTTP, DNS, load balancing and service mesh to troubleshoot issues in high-pressure situations.
  • Proven experience managing application availability and building creative solutions to automate repetitive activities and improve gating and detection for applications at every touchpoint of a 24x7 high-availability platform exposed to critical clients and customers.
  • Working knowledge of monitoring tools: Splunk, AppDynamics, Grafana / Prometheus and Dynatrace.
  • Experience with tools like Rally, Confluence and other CI/CD extenders.
  • Hands-on experience implementing in-memory caching solutions. Experience with Redis is a plus.
  • Excellent debugging skills across a variety of integrated technical platforms on an API gateway.
  • Hands-on experience with GCS, Cloud SQL, PL/SQL and Spanner.
  • Monitor and troubleshoot HashiCorp Vault environments, ensuring minimal downtime and rapid recovery from incidents.
  • Working knowledge of Vertex AI, Gen AI and BigQuery.