Talent.com
Staff Software Engineer, AI / ML Infrastructure

Staff Software Engineer, AI / ML Infrastructure

Chan Zuckerberg InitiativeRedwood City, CA (Hybrid)
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

The Team

Our Central Tech team provides technology and security support for CZI, the Biohub Network, and our grantees. We believe that Engineering and Security are most effective when in sync and learning from each other on a daily basis. Our AI Infrastructure Engineering team enables our AI Research teams to achieve their goals faster and more securely. We leverage technology to automate manual processes, constantly innovate to optimize operations, provide first-class support, and build solutions to enable the scale and execution of our business partners' strategies and initiatives.

The Opportunity

The AI / ML and Data Engineering Infrastructure organization works on building shared tools and platforms to be used across all of the Chan Zuckerberg Initiative and CZ Biohub, partnering and supporting the work of a wide range of Research Scientists, Data Scientists, AI Research Scientists, as well as a broad range of Engineers focusing on Education and Science domain problems. Members of the central technology’s infrastructure engineering team have an impact on all of CZI's initiatives by enabling the technology solutions used by other engineering teams at CZI to scale. A person in this role will build these technology solutions and help to cultivate a culture of shared best practices and knowledge around AI / ML infrastructure.

What You'll Do

  • Lead the design and delivery of secure, scalable, and high-performance AI / ML compute infrastructure.
  • Architect and implement containerized AI / ML platforms using Kubernetes for heterogeneous, distributed environments.
  • Integrate on-prem (High Performance Compute) and cloud-based AI platforms with GPU clusters to support pre-training, training, fine-tuning, and inference workflows.
  • Define and execute systems integration strategies to maximize performance, scalability, and security for AI workloads.
  • Enable research teams to effectively use AI platforms through best practices in lifecycle management and deployment.
  • Solve complex challenges in scaling AI workflows and optimizing model training and inference pipelines.

What You'll Bring

  • BS / MS in Computer Science or related field, or equivalent experience, with 8+ years in coding and systems architecture / design across AI / ML and core infrastructure.
  • Proven proficiency in a systems language (C, C++, C#, Go, Rust, Java, Scala) and a scripting language (Python, PHP, Ruby).
  • Expertise in cloud platforms (AWS, GCP, Azure) and hybrid environments, including on-premises and colocation hosting.
  • Strong experience in AI / ML platform operation technologies (e.g. Slrum, Sunk, Run : ai, Kubeflow)
  • Advanced skills in scaling and securing containerized applications on Kubernetes, including custom container development and CI / CD integration.
  • Working knowledge of Nvidia CUDA, AI / ML custom libraries, and Linux systems optimization / administration.
  • Compensation

    The Redwood City, CA and New York City, NY base pay range for this role is $270,000.00 - $371,800.00

    The Chicago, IL base pay range for this role $230,000.00 - $315,700.00

    New hires are typically hired into the lower portion of the range, enabling employee growth in the range over time. Actual placement in range is based on job-related skills and experience, as evaluated throughout the interview process.

    Work Mode

    As we grow, we’re excited to strengthen in-person connections and cultivate a collaborative, team-oriented environment. This role is a hybrid position requiring you to be onsite for at least 60% of the working month, approximately 3 days a week, with specific in-office days determined by the team’s manager. The exact schedule will be at the hiring manager's discretion and communicated during the interview process.

    Benefits for the Whole You

    We’re thankful to have an incredible team behind our work. To honor their commitment, we offer a wide range of benefits to support the people who make all we do possible.

  • CZI provides a generous employer match on employee 401(k) contributions to support planning for the future.
  • Annual benefit for employees that can be used most meaningfully for them and their families, such as housing, student loan repayment, childcare, commuter costs, or other life needs.
  • CZI Life of Service Gifts are awarded to employees to “live the mission” and support the causes closest to them.
  • Paid time off to volunteer at an organization of your choice.
  • Funding for select family-forming benefits.
  • Relocation support for employees who need assistance moving to the Bay Area
  • If you’re interested in a role but your previous experience doesn’t perfectly align with each qualification in the job description, we still encourage you to apply as you may be the perfect fit for this or another role.

    Explore our , , and at .

    serp_jobs.job_alerts.create_a_job

    Software Engineer Infrastructure • Redwood City, CA (Hybrid)

    Job_description.internal_linking.related_jobs
    • serp_jobs.job_card.promoted
    Senior Staff Software Engineer, Managed AI

    Senior Staff Software Engineer, Managed AI

    Crusoe Energy Systems LLCSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    As a Senior Staff Software Engineer on the Managed AI team at Crusoe, you'll have a pivotal role in shaping the architecture and scalability of our next-generation AI inference platform.You will le...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Applied AI Engineer – ML for Systems & Infrastructure

    Senior Applied AI Engineer – ML for Systems & Infrastructure

    Databricks Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Senior Applied AI Engineer – ML for Systems & Infrastructure.The Applied AI team at Databricks sits at the forefront of advancing GenAI-powered products. Over the past years, we’ve launched Databric...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    ML Infrastructure Engineer (Staff / Principal)

    ML Infrastructure Engineer (Staff / Principal)

    Menlo VenturesBurlingame, CA, United States
    serp_jobs.job_card.full_time
    We’re a tight-knit team of proven drug hunters, deep learning researchers, and software engineers united by a common mission — drive AI innovation in biochemistry, discovering and developing ground...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Software Engineer, Compute

    Staff Software Engineer, Compute

    FalSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    You are an experienced software engineer who thrives on building large scale computation platforms.You have deep expertise in backend systems that orchestrate workloads and route requests efficient...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Software Engineer (Agentic AI & Data)

    Staff Software Engineer (Agentic AI & Data)

    MLabsSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Remote (San Francisco Bay Area preferred).Remote (US - West Coast hours preferred).We are pioneering the future of agentic AI to transform the property management industry, a market worth over $200...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Machine Learning Engineer — Infrastructure

    Machine Learning Engineer — Infrastructure

    Fundamental Research LabsMenlo Park, CA, United States
    serp_jobs.job_card.full_time
    Machine Learning Infrastructure Engineer.AI : from high-performance inference engines to the underlying agent technologies and large-scale compute clusters that keep everything running.You’ll collab...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Staff Software Engineer, Machine Learning Platform

    Staff Software Engineer, Machine Learning Platform

    Menlo VenturesSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    As a Staff Engineer on the ML Platform team at Chime, you'll architect and lead development of scalable ML infrastructure used to fight fraud, personalize experiences, and power intelligent decisio...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    AI / ML Engineer, Staff

    AI / ML Engineer, Staff

    LimohealthSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    At Charta, we're pioneering a transformative approach to healthcare administration and patient care through the power of generative AI. Our mission is to revolutionize this critical yet often cumber...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Machine Learning Engineer, AI Platform

    Staff Machine Learning Engineer, AI Platform

    General MotorsSunnyvale, CA, United States
    serp_jobs.job_card.full_time
    Remote : This role is based remotely but if you live within a 50-mile radius of Mountain View, you are expected to report to that location three times a week, at minimum. We are seeking an experience...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Machine Learning Engineer - Infrastructure

    Machine Learning Engineer - Infrastructure

    Fundamental Research LabsMenlo Park, CA, US
    serp_jobs.job_card.full_time
    Machine Learning Infrastructure Engineer.AI : from high-performance inference engines to the underlying agent technologies and large-scale compute clusters that keep everything running.You'll collab...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Staff Software Engineer – AI Systems

    Staff Software Engineer – AI Systems

    airbnb, Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Software Engineer (Agentic AI & Data)

    Staff Software Engineer (Agentic AI & Data)

    MLabs LtdSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Location : Remote (San Francisco Bay Area preferred).Work Arrangement : Remote (US - West Coast hours preferred).We are pioneering the future of agentic AI to transform the property management indust...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Software Engineer, AI / ML GenAI, Workspace

    Senior Software Engineer, AI / ML GenAI, Workspace

    Google Inc.Sunnyvale, CA, United States
    serp_jobs.job_card.full_time
    Senior Software Engineer, AI / ML GenAI, Workspace.Bachelor’s degree or equivalent practical experience.LLMs, Multi-Modal, Large Vision Models) or with GenAI-related concepts (e.Experience with distr...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Staff Software Engineer, AI

    Staff Software Engineer, AI

    Social Finance, Inc. (SoFi)San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Shape a brighter financial future with us.Together with our members, we’re changing the way people think about and interact with personal finance. We’re a next-generation financial services company ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Software Engineer, AI / ML GenAI, Google Workspace

    Senior Software Engineer, AI / ML GenAI, Google Workspace

    Google Inc.Sunnyvale, CA, United States
    serp_jobs.job_card.full_time
    Senior Software Engineer, AI / ML GenAI, Google Workspace.Bachelor’s degree or equivalent practical experience.LLMs, Multi-Modal, Large Vision Models) or with GenAI-related concepts (language modelin...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Staff Software Engineer – AI Agents

    Staff Software Engineer – AI Agents

    GoodLeap, LLCSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    About GoodLeap : GoodLeap is a technology company delivering best-in-class financing and software products for sustainable solutions, from solar panels and batteries to energy-efficient HVAC, heat p...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Staff / Senior Software Engineer, Applied AI

    Staff / Senior Software Engineer, Applied AI

    MLabs LtdSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Our client is a mission-driven health tech company building AI agents to scale patient care without needing more clinicians. They are at the forefront of applying AI to create modern, powerful, and ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Software Engineer, Managed AI

    Software Engineer, Managed AI

    Crusoe Energy Systems LLCSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Crusoe's mission is to accelerate the abundance of energy and intelligence.We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, spe...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Software Engineer, Machine Learning Menlo Park, CA • Software Engineering • Engineering +2 more[...]

    Software Engineer, Machine Learning Menlo Park, CA • Software Engineering • Engineering +2 more[...]

    MetaMenlo Park, CA, United States
    serp_jobs.job_card.full_time
    Meta), formerly known as Facebook Inc.When Facebook launched in 2004, it changed the way people connect.Apps and services like Messenger, Instagram, and WhatsApp further empowered billions around t...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff / Senior Software Engineer, Applied AI

    Staff / Senior Software Engineer, Applied AI

    MLabsSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Our client is a mission-driven health tech company building AI agents to scale patient care without needing more clinicians. They are at the forefront of applying AI to create modern, powerful, and ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days