Talent.com
Software Engineer - Infrastructure - X
Software Engineer - Infrastructure - XXai • Palo Alto, California, United States
Software Engineer - Infrastructure - X

Software Engineer - Infrastructure - X

Xai • Palo Alto, California, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.

Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity.

We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important.

All engineers and researchers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the role

We’re looking for exceptional infrastructure engineers to redefine what is possible in large-scale AI infrastructure and build the supercomputing platform that powers xAI’s ambitious goals. The initial 200,000 GPU phase of the Colossus supercluster in Memphis was only the beginning of xAI’s infrastructure roadmap, and we need world-class infra engineers to build the future of large-scale AI.

As a Software Engineer on the Infrastructure team, you’ll work across multiple workstreams to design, build, and maintain cutting-edge distributed systems that enable our developers to scale our compute and data platforms and ensure efficiency, reliability, and performance at an unprecedented scale.

This role offers the opportunity to shape the backbone of xAI’s technology.

What You'll Do

  • Enhance developer experience by modernizing build systems, reducing build times, and implementing tools like Bazel in a monorepo environment. Design and improve CI / CD pipelines using Buildkite, Argo, and Kubernetes, create developer portals, and assist application teams in migrating to Kubernetes while defining the next-generation tech stack.
  • Scale compute infrastructure on Kubernetes by building controllers, admission plugins, and supporting systems that empower teams to leverage Kubernetes effectively.
  • Design and maintain one of the largest traffic shaping and load balancing deployments using Envoy, while building service meshes and service discovery systems to handle massive scale.
  • Scale data platforms and observability systems (logging, tracing, metrics) to support exabyte-scale data processing and provide deep insights into system performance.
  • Drive reliability, standardization, and performance by building and refining systems with a pedantic focus on quality and scalability.
  • Manage and optimize large-scale storage systems, including key-value stores, relational databases, and network file systems or object stores (open-source, cloud-managed, and in-house solutions).
  • Contribute to miscellaneous quality-of-life improvements that empower developers and streamline workflows.

Who You Are

  • 2+ years of industry experience working with large-scale, high-throughput distributed systems, compute platforms, or data infrastructure.
  • Proficient in Golang, Rust, Python, or similar languages.
  • (Optional) Familiarity with modern developer tools (Bazel, Buildkite, Argo, Kubernetes), service meshes (Envoy), or observability frameworks.
  • (Optional) Experience with large-scale storage systems (KV stores, RDBMS, object stores) or Kubernetes ecosystem development (controllers, plugins).
  • Passionate about reliability, performance optimization, and building systems that scale seamlessly.
  • Tech Stack

  • Golang, Python, Rust, gRPC
  • Kubernetes, Bazel, Buildkite, Argo, Envoy
  • Observability tools, service meshes, and large-scale storage systems
  • Interview Process

    After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15-30 minutes phone interview, during which a member of our team will ask technical questions. If you clear the phone interview, you will proceed to next steps :

  • Deep dive coding challenge
  • Meet and greet with the wider team
  • Our goal is to complete the process within one week. All interviews will be conducted in person when applicable.

    Location

    The role is based in Palo Alto. Candidates are expected to be located near the Bay Area or open to relocation.

    Annual Salary Range

    $180,000 - $440,000 USD

    xAI is an equal opportunity employer and does not unlawfully discriminate based on race, color, religion, ethnicity, ancestry, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, age, disability, medical conditions, genetic information, marital status, military or veteran status, or any other applicable legally protected characteristics.

    Qualified applicants with arrest or conviction records will be considered for employment in accordance with all applicable federal, state, and local laws, including the San Francisco Fair Chance Ordinance, Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act.

    For Los Angeles County (unincorporated) Candidates :

    xAI reasonably believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of a conditional offer of employment :

  • Access to information technology systems and confidential information, including proprietary and trade secret information, and / or user data;
  • Interacting with internal and / or external clients and colleagues; and
  • Exercising sound judgment.
  • California Consumer Privacy Act (CCPA) Notice

    serp_jobs.job_alerts.create_a_job

    Software Engineer Infrastructure • Palo Alto, California, United States

    Job_description.internal_linking.related_jobs
    Flight Software Infrastructure Engineer

    Flight Software Infrastructure Engineer

    Reliable Robotics • Mountain View, CA, United States
    serp_jobs.job_card.permanent
    We're building safety-enhancing technology for aviation that will save lives.Automated aviation systems will enable a future where air transportation is safer, more convenient and fundamentally tra...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, Core Infrastructure

    Software Engineer, Core Infrastructure

    Moveworks • Mountain View, California, United States
    serp_jobs.job_card.full_time
    As a member of the Core Infrastructure team, you will be responsible for architecting the next generation of the Moveworks AI infrastructure. As Moveworks grows fast, the infrastructure team is task...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, ML Infra

    Software Engineer, ML Infra

    Newsbreak • Mountain View, California, United States
    serp_jobs.job_card.full_time
    NewsBreak is redefining the way users interact with local news and their communities.By bridging local users, local content creators, and local businesses, our mission is to foster safer, more vibr...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, Platform

    Software Engineer, Platform

    Obsidian Security • Palo Alto, California, United States
    serp_jobs.job_card.full_time
    Founded in 2017, Obsidian Security was created to close a critical gap : securing the SaaS applications where modern business happens—platforms like Microsoft 365, Salesforce, and hundreds more.Back...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer - Developer Infrastructure

    Software Engineer - Developer Infrastructure

    Applied Intuition • Mountain View, California, United States
    serp_jobs.job_card.full_time
    Applied Intuition is the vehicle intelligence company that accelerates the global adoption of safe, AI-driven machines.Founded in 2017, Applied Intuition delivers the toolchain, Vehicle OS, and aut...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, Build Infrastructure

    Software Engineer, Build Infrastructure

    Doordash Usa • Sunnyvale, California, United States
    serp_jobs.job_card.full_time
    The Build Infrastructure team drives improvements in build tooling to accelerate development and reduce friction for engineers across DoorDash. We support and scale the core systems behind developer...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer

    Software Engineer

    Cisco Systems, Inc. • Milpitas, CA, United States
    serp_jobs.job_card.full_time
    Please note this posting is to advertise potential job opportunities.This exact role may not be open today but could open in the near future. When you apply, a Cisco representative may contact you d...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Software Engineer

    Software Engineer

    Supermicro • San Jose, CA, United States
    serp_jobs.job_card.full_time
    Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Engineer 1

    Engineer 1

    Sheraton Four Points Pleasanton • Pleasanton, California, United States
    serp_jobs.job_card.full_time
    Compensation Type : Hourly Highgate Hotels : .Highgate is a leading real estate investment and hospitality management company with over $15 billion of assets under management and a global portfolio of...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Software Infrastructure & Platform Engineer

    Software Infrastructure & Platform Engineer

    PsiQuantum • Palo Alto, CA, United States
    serp_jobs.job_card.full_time
    PsiQuantum'smission is to build the first useful quantum computers-machines capable of delivering the breakthroughs the field has long promised. Since our founding in 2016, our singular focus has be...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer - Perception Infrastructure

    Software Engineer - Perception Infrastructure

    Pony.ai • Fremont, California, United States
    serp_jobs.job_card.full_time
    Founded in 2016 in Silicon Valley, Pony.Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony. CNBC Disruptor list of the 50 most innovative and disruptive tech comp...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Software Engineer, Infrastructure - X

    Software Engineer, Infrastructure - X

    Xai • Palo Alto, California, United States
    serp_jobs.job_card.full_time
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, Infrastructure

    Software Engineer, Infrastructure

    Matroid • Palo Alto, California, United States
    serp_jobs.job_card.full_time
    Matroid is a full-service computer vision company that has developed an end-to-end platform allowing enterprise customers to rapidly train and. EO, IR, X-Ray, CT, OCT, and others.Founded in 2016 by ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Software Engineer - AI Agent Infrastructure (Healthcare)

    Senior Software Engineer - AI Agent Infrastructure (Healthcare)

    Honey Health • Sunnyvale, CA, US
    serp_jobs.job_card.full_time
    Honey Health is the all-in-one AI back office for primary and specialty care.Our AI agents autonomously handle core back-office jobs, such as aggregating patients data, processing orders and prescr...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Software Engineer - Test Infrastructure

    Software Engineer - Test Infrastructure

    Muon Space • Mountain View, California, United States
    serp_jobs.job_card.full_time +1
    Software Test Infrastructure Engineer.You will be responsible for designing, building, and maintaining the software test infrastructure used to test the hardware components, such as avionics comput...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer

    Software Engineer

    Orvixengr • San Jose, California, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Note : The role is strictly for candidates within the United States.We are seeking a highly skilled and passionate.In this role, you will be responsible for designing, developing, testing, and maint...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer -Distributed Systems

    Software Engineer -Distributed Systems

    Rubrik • Palo Alto, California, United States
    serp_jobs.job_card.full_time
    Data protection needs for large enterprises are evolving into a varied usage of private / public clouds.While Rubrik has built incredibly successful solutions for both, our technical architecture nee...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer II

    Software Engineer II

    Lead Bank • Sunnyvale, California, United States
    serp_jobs.job_card.full_time
    Lead is a fintech building banking infrastructure for embedded financial products and services.We operate an FDIC-insured bank headquartered in Kansas City, Missouri. Additionally, we have offices i...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted