Talent.com
(Senior) Software Engineer, Infrastructure (Kubernetes Platform)
(Senior) Software Engineer, Infrastructure (Kubernetes Platform)Pony.ai • Fremont, CA, United States
(Senior) Software Engineer, Infrastructure (Kubernetes Platform)

(Senior) Software Engineer, Infrastructure (Kubernetes Platform)

Pony.ai • Fremont, CA, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous mobility technologies and services at a rapidly expanding footprint of sites around the world. Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony.ai is an industry leader in the commercialization of autonomous driving and is committed to developing the safest autonomous driving capabilities on a global scale. Pony.ai’s leading position has been recognized, with CNBC ranking Pony.ai #10 on its CNBC Disruptor list of the 50 most innovative and disruptive tech companies of 2022. In June 2023, Pony.ai was recognized on the XPRIZE and Bessemer Venture Partners inaugural “XB100” 2023 list of the world’s top 100 private deep tech companies, ranking #12 globally. As of August 2023, Pony.ai has accumulated nearly 21 million miles of autonomous driving globally. Pony.ai went public at NASDAQ in November 2024.

Responsibilities

As a (Senior) Kubernetes Engineer, you will :

Design, operate, and optimize Kubernetes clusters across hybrid cloud environments (public cloud and on-prem datacenter).

Support diverse workloads including large-scale model training and low-latency inference services.

Develop, maintain, and extend Kubernetes platform features (operators, CRDs, APIs) to automate and productize internal use cases.

Own cluster lifecycle management including upgrades, patching, configuration, and governance.

Define and enforce best practices for service deployments, security policies, and operational guidelines.

Contribute to observability and SRE practices to ensure reliability at scale (SLOs, incident reviews, metrics-driven improvements).

Collaborate with storage, compute, and networking teams (CNI, ingress, service discovery) to enhance automation, availability, and performance.

Provide technical mentorship, documentation, and on-call support for cluster-related incidents.

Requirements

Bachelor’s degree in Computer Science, Engineering, or related field, or equivalent experience.

3+ years of hands-on experience managing Kubernetes clusters in production (EKS / GKE / AKS and / or bare-metal).

Strong Linux systems background and distributed systems fundamentals (scheduling, reliability, scaling).

Proven experience with hybrid cloud environments (AWS, GCP, Azure, and on-prem).

Expertise in containerization (Docker) and Infrastructure-as-Code tools (Terraform, Helm, Ansible, or similar).

Experience developing and maintaining Kubernetes platform features (operators, CRDs, APIs).

Solid knowledge of Kubernetes networking (CNI, ingress, service discovery), storage, and compute integrations.

Strong understanding of security best practices (RBAC, network policies, secrets).

Effective communication skills and ability to work cross-functionally in a fast-paced environment.

Preferred Experience

Programming skills in Go and / or Python for operator development, platform automation, and tooling.

Experience with observability and SRE practices (Prometheus, Grafana, ELK, Datadog; SLOs, incident response, postmortems).

Familiarity with workloads common to AI / ML systems (training, inference).

Compensation and Benefits

Base Salary Range : $120,000 - $240,000 Annually

Compensation may vary outside of this range depending on many factors, including the candidate’s qualifications, skills, competencies, experience, and location. Base pay is one part of the Total Compensation and this role may be eligible for bonuses / incentives and restricted stock units.

Also, we provide the following benefits to the eligible employees :

Health Care Plan (Medical, Dental & Vision)

Retirement Plan (Traditional and Roth 401k)

Life Insurance (Basic, Voluntary & AD&D)

Paid Time Off (Vacation & Public Holidays)

Family Leave (Maternity, Paternity)

Short Term & Long Term Disability

Free Food & Snacks

Please click here () for our privacy disclosure.

serp_jobs.job_alerts.create_a_job

Senior Software Engineer Infrastructure • Fremont, CA, United States

Job_description.internal_linking.related_jobs
Senior Kubernetes & Infrastructure Engineer

Senior Kubernetes & Infrastructure Engineer

Third Wave Automation • Union City, California, United States
serp_jobs.job_card.full_time
Third Wave Automation is a rapidly growing startup that has demonstrated its core technology components, proven its market fit, and just closed its Series C funding. If you are excited about cutting...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Senior Software Engineer - Cloud Infrastructure

Senior Software Engineer - Cloud Infrastructure

General Motors • Sunnyvale, CA, United States
serp_jobs.job_card.full_time
At General Motors, our product teams are redefining mobility.Through a human-centered design process, we create vehicles and experiences that are designed not just to be seen, but to be felt.We're ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Senior Infrastructure Engineer InfraOps

Senior Infrastructure Engineer InfraOps

BitGo • Palo Alto, California, USA
serp_jobs.job_card.full_time
BitGo is the leading infrastructure provider of digital asset solutions delivering custody wallets staking trading financing and settlement services from regulated cold storage.Since our founding i...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Senior Software Engineer (Go, or C and C++ )

Senior Software Engineer (Go, or C and C++ )

Purple Drive • Sunnyvale, CA, United States
serp_jobs.job_card.full_time
Senior Software Engineer - Linux / Kubernetes.We are seeking a highly experienced.Linux driver development, Kubernetes operations, and backend programming. Kubernetes operations, API servers, and life...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior Kubernetes Software Engineer

Senior Kubernetes Software Engineer

Broadcom Corporation • Palo Alto, CA, United States
serp_jobs.job_card.full_time
If you are a first time user, please create your candidate login account before you apply for a job.If you already have a Candidate Account, please Sign-In before you apply.Senior Kubernetes Softwa...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Senior Kubernetes Software Engineer

Senior Kubernetes Software Engineer

Broadcom Inc. • Palo Alto, CA, US
serp_jobs.job_card.full_time
Leverage common patterns to develop fixes and features for Kubernetes and CNCF projects • Design customer-oriented and community-aligned features by building consensus through Key Enhancement Propos...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
Senior Software Engineer - Cloud Infrastructure

Senior Software Engineer - Cloud Infrastructure

WeRide.ai • San Jose, CA, United States
serp_jobs.job_card.full_time
Established in 2017, WeRide (NASDAQ : WRD) is a leading global commercial-stage company that develops autonomous driving technologies from Level 2 to Level 4. WeRide is the only tech company in the w...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Sr. DevOps Engineer

Sr. DevOps Engineer

Supermicro • San Jose, CA, United States
serp_jobs.job_card.full_time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior Infrastructure Engineer - InfraOps

Senior Infrastructure Engineer - InfraOps

Bitgo • Palo Alto, California, United States
serp_jobs.job_card.full_time
BitGo is the leading infrastructure provider of digital asset solutions, delivering custody, wallets, staking, trading, financing, and settlement services from regulated cold storage.Since our foun...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Senior DevOps Engineer

Senior DevOps Engineer

Intellipro Group • Palo Alto, California, United States
serp_jobs.job_card.full_time
Salary Range / Rate (Currency) : .Job Summary (Responsibilities and Requirements) : .You will continue to develop and empower a diverse team of developers, providing technical guidance and direction an...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior Software Engineer, Cloud Infrastructure

Senior Software Engineer, Cloud Infrastructure

Nuro • Mountain View, CA, United States
serp_jobs.job_card.full_time
Senior Software Engineer, Cloud Infrastructure.Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world's most scalable ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior Software Engineer l, Core Infrastructure

Senior Software Engineer l, Core Infrastructure

Moveworks.ai • Mountain View, CA, United States
serp_jobs.job_card.full_time
As a senior member of the Core Infrastructure team, you will be responsible for architecting the next generation of the Moveworks AI infrastructure. As Moveworks grows fast, the infrastructure team ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior Infrastructure Linux & DevOps Engineer

Senior Infrastructure Linux & DevOps Engineer

Matrix Precise, Inc. • Pleasanton, California, United States
serp_jobs.job_card.full_time
Infra Linux Engineer’s primary function will be to advance the infrastructure team from a traditional infrastructure methodology to an infrastructure as code approach. You will be responsible for ma...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Software Engineer - E5 (Kubernetes)

Software Engineer - E5 (Kubernetes)

Whatfix • San Jose, CA, United States
serp_jobs.job_card.full_time
Whatfix is an AI platform advancing the "userization" of enterprise applications, empowering companies to maximize the ROI of their digital investments. As AI reshapes roles, workflows, and human-ma...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior DevOps Engineer

Senior DevOps Engineer

Jobot • San Jose, CA, US
serp_jobs.job_card.full_time
REMOTE Senior Site Reliability Engineer / Senior Dev Ops Engineer Needed for Growing Fintech Startup!.This Jobot Job is hosted by : Reed Kellick. Are you a fit? Easy Apply now by clicking the "Apply ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior Software Engineer, Core Infrastructure

Senior Software Engineer, Core Infrastructure

Waymo • Mountain View, CA, United States
serp_jobs.job_card.full_time
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted