Talent.com
Staff ML Engineer - Infrastructure
ChipStack • San Jose, California, United States
Posted 30 days ago
Job type: Full-time

About Us

Chips are at the center of today's tech-driven world. But how we design them has not changed in decades, while their complexity and specialization have skyrocketed due to increasing performance demands from applications like AI. We want to change that.

Our team is small, technical, and fast-moving. We’ve built and shipped at the intersection of AI, EDA, and systems software, with deep roots at companies like Qualcomm, Nvidia, Google, Meta, and the Allen Institute for AI. We’re backed by top investors including Khosla Ventures, Cerberus, and Clear Ventures, and already deployed with 10+ innovative customers—from Fortune 100s to cutting-edge AI silicon startups.

About This Role

This role offers a unique opportunity to be part of the founding team at ChipStack, where we are reinventing how modern silicon chips are designed. You will work alongside highly experienced chip designers who have built complex chips, ML scientists who have trained LLMs at scale, and top-notch infrastructure and software engineers. You will get to leverage your experience building ML and data infrastructure and apply it to some of the hardest problems in chip design.

About You

You want to be at a startup because you love to be at the center of all the dynamism that a startup offers.

You are willing to put in the hours and go the extra mile to ensure every customer has an exceptional experience.

You are self-motivated with a sense of urgency and can operate independently without much guidance.

You are not afraid of difficult problems and enjoy venturing into areas you have not explored before.

This Role

We’re looking for a strong, experienced ML Infrastructure Engineer to join our founding team, someone with experience designing and scaling ML infrastructure and training pipelines. You’ll be responsible for building the core infrastructure that enables training, fine-tuning, evaluation, and deployment of LLMs across cloud and on-premise environments. Your work will directly impact product capabilities and speed of iteration.

What's needed

5+ years of experience in ML infrastructure or adjacent roles

Deep expertise in Python and experience with training frameworks like PyTorch or TensorFlow

Strong systems engineering skills and experience with distributed training, data pipelines, and performance optimization

Experience deploying ML models to production (REST APIs, batch jobs, streaming pipelines)

Proficiency with cloud platforms (e.g., GCP, AWS) and containerized systems (Docker, Kubernetes)

Experience managing GPU / TPU workloads efficiently

Good communication skills and the ability to work directly with engineers and customers

Prior experience training or fine-tuning LLMs

Experience setting up observability, monitoring, and evaluation pipelines for ML models

What's good to have

Exposure to chip design fundamentals (via coursework or elsewhere)

Experience at an early-stage startup

Our Culture

Challenge the status quo: We are innovators who challenge the status quo and push forward our vision of the world.

Strong opinions, loosely held: We are low on ego, but high on collaboration. We are okay with being wrong and are always open to learning.

Ship fast, ship quality: We ruthlessly prioritize what matters. We build a few things, but at lightning speed and with high quality.

Proud of our craft: Attention to detail is in our DNA. We take pride in what we build and ensure it exceeds the high standards of the semiconductor industry.
