Talent.com
Staff ML Engineer - Infrastructure
Staff ML Engineer - InfrastructureChipStack • San Jose, California, United States
serp_jobs.error_messages.no_longer_accepting
Staff ML Engineer - Infrastructure

Staff ML Engineer - Infrastructure

ChipStack • San Jose, California, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

About Us

Chips are at the center of today's tech-driven world. But how we design them has not changed in decades, while their complexity and specialization have skyrocketed due to increasing performance demands from applications like AI. We want to change that.

Our team is small, technical, and fast-moving. We’ve built and shipped at the intersection of AI, EDA, and systems software, with deep roots at companies like Qualcomm, Nvidia, Google, Meta, and the Allen Institute for AI. We’re backed by top investors including Khosla Ventures, Cerberus, and Clear Ventures, and already deployed with 10+ innovative customers—from Fortune 100s to cutting-edge AI silicon startups.

About This Role

This role offers a unique opportunity to be part of the founding team at ChipStack, where we are reinventing how modern silicon chips are designed. You will work alongside highly experienced chip designers who have built complex chips, ML scientists who have trained LLMs at scale, and top-notch infrastructure and software engineers. You will get to leverage your experience building ML and data infrastructure and apply it to some of the hardest problems in chip design.

About You

You want to be at a startup because you love to be at the center of all the dynamism that a startup offers.

You are willing to put in the hours and go the extra mile to ensure every customer has an exceptional experience.

You are self-motivated with a sense of urgency and can operate independently without much guidance.

You are not afraid of difficult problems and enjoy venturing into areas you have not explored before.

This Role

We’re looking for a strong, experienced ML Infrastructure Engineer to join our founding team. We are seeking someone with experience designing and scaling ML infrastructure and training pipelines. You’ll be responsible for building the core infrastructure that enables training, fine-tuning, evaluation, and deployment of LLMs across cloud and on-premise environments. Your work will directly impact product capabilities and speed of iteration.

What's needed

5+ years of experience in ML infrastructure or adjacent roles

Deep expertise in Python and experience with training frameworks like PyTorch or TensorFlow

Strong systems engineering skills and experience with distributed training, data pipelines, and performance optimization

Experience deploying ML models to production (REST APIs, batch jobs, streaming pipelines)

Proficiency with cloud platforms (e.g., GCP, AWS) and containerized systems (Docker, Kubernetes)

Experience managing GPU / TPU workloads efficiently

Good communication skills and the ability to work directly with engineers and customers

Prior experience training or fine-tuning LLMs

Experience setting up observability, monitoring, and evaluation pipelines for ML models

What's good to have

Exposure to chip design fundamentals (via coursework or elsewhere)

Experience at an early-stage startup

Our Culture

Challenge status quo : We are innovators who can challenge the status quo and push forward our vision of the world.

Strong opinions, loosely held : We are low on ego, but high on collaboration. We are okay to be wrong and are always open to learning.

Ship fast, ship quality : We ruthlessly prioritize what matters. We build a few things, but at lightning speed with high quality.

Proud of our craft : Attention to detail is in our DNA. We take pride in what we build and ensure they exceed the high standards of the semiconductor industry.

serp_jobs.job_alerts.create_a_job

Staff Infrastructure Engineer • San Jose, California, United States

Job_description.internal_linking.related_jobs
Staff Thermal Engineer

Staff Thermal Engineer

Supermicro • San Jose, CA, United States
serp_jobs.job_card.full_time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Senior Staff Engineer

Senior Staff Engineer

Cloudera • San Jose, CA, United States
serp_jobs.job_card.full_time
At Cloudera, we empower people to transform complex data into clear and actionable insights.With as much data under management as the hyperscalers, we're the preferred data partner for the top comp...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Sr. Staff ML Platform Engineer (TLM)

Sr. Staff ML Platform Engineer (TLM)

Earnin • Mountain View, California, United States
serp_jobs.job_card.full_time
As one of the first pioneers of earned wage access, our passion at EarnIn is building products that deliver real-time financial flexibility for those with the unique needs of living paycheck to pay...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Staff Cloud Infrastructure Engineer

Staff Cloud Infrastructure Engineer

Zscaler • San Jose, California, United States
serp_jobs.filters.remote
serp_jobs.job_card.full_time
Zscaler accelerates digital transformation so our customers can be more agile, efficient, resilient, and secure.Our cloud native Zero Trust Exchange platform protects thousands of customers from cy...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Sr. Staff Software Engineer Systems Infrastructure

Sr. Staff Software Engineer Systems Infrastructure

LinkedIn • Mountain View, California, USA
serp_jobs.job_card.full_time
This role will be based in Mountain View CA or Bellevue WA.At LinkedIn our approach to flexible work is centered on trust and optimized for culture connection clarity and the evolving needs of our ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Software Engineer, ML Infra

Software Engineer, ML Infra

Newsbreak • Mountain View, California, United States
serp_jobs.job_card.full_time
NewsBreak is redefining the way users interact with local news and their communities.By bridging local users, local content creators, and local businesses, our mission is to foster safer, more vibr...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Staff Infrastructure / DevOps Engineer

Staff Infrastructure / DevOps Engineer

Gatik Ai • Mountain View, California, United States
serp_jobs.job_card.full_time
Gatik, the leader in autonomous middle-mile logistics, is revolutionizing the B2B supply chain with its autonomous transportation-as-a-service (ATaaS) solution and prioritizing safe, consistent del...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Staff Systems Software Engineer, Infrastructure Platform

Staff Systems Software Engineer, Infrastructure Platform

GM • Mountain View, California, USA
serp_jobs.job_card.full_time
The Infrastructure Engineering organisation at GM is building a cloud-native platform that transforms how developers interact with automotive test hardware. This platform treats physical benches mob...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Staff ML Engineer, Cross‑Team Recommendations

Staff ML Engineer, Cross‑Team Recommendations

Pinterest • Palo Alto, CA, United States
serp_jobs.job_card.full_time
A leading visual discovery platform is seeking a highly motivated Staff ML Engineer to work as a cross-team technical leader. This role involves innovating on large-scale machine learning recommenda...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Product Infrastructure Engineer - Site Reliability

Product Infrastructure Engineer - Site Reliability

Zyphra • Palo Alto, California, United States
serp_jobs.job_card.full_time
Infrastructure Engineer - Site Reliability.Your work will be essential to ensuring the reliability and reproducibility of ML workloads, the safety and control of deployments, and the long-term main...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
AI / ML System Software Engineer, Staff

AI / ML System Software Engineer, Staff

D-matrix • Santa Clara, California, United States
serp_jobs.job_card.full_time
AI to power the transformation of technology.We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. We value humility and believe in direct communic...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Senior / Staff Software Engineer, Machine Learning Infrastructure

Senior / Staff Software Engineer, Machine Learning Infrastructure

Nuro • Mountain View, California, United States
serp_jobs.job_card.full_time
Nuro is a self-driving technology company on a mission to make autonomy accessible to all.Founded in 2016, Nuro is building the world’s most scalable driver, combining cutting-edge AI with automoti...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Staff ML Engineer - AI-Powered Observability Platform

Staff ML Engineer - AI-Powered Observability Platform

Cisco Systems • San Jose, California, United States
serp_jobs.job_card.full_time
A global technology company is looking for a seasoned software engineer to enhance AI capabilities within their observability platform. Candidates should have a strong background in AI / ML systems, c...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
Machine Learning Infrastructure Engineer

Machine Learning Infrastructure Engineer

Institute Of Foundation Models • Sunnyvale, California, United States
serp_jobs.job_card.full_time
About the Institute of Foundation Models.We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
Staff Integration Engineer

Staff Integration Engineer

PsiQuantum • Palo Alto, CA, United States
serp_jobs.job_card.full_time
PsiQuantum'smission is to build the first useful quantum computers-machines capable of delivering the breakthroughs the field has long promised. Since our founding in 2016, our singular focus has be...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
Senior ML Platform Engineer : Scale LLM Infrastructure

Senior ML Platform Engineer : Scale LLM Infrastructure

GEICO • Palo Alto, CA, United States
serp_jobs.job_card.full_time
A leading insurance company in California is seeking a Senior ML Platform Engineer to enhance their machine learning infrastructure. This role involves designing scalable systems for Large Language ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
Staff ML Engineer / Applied Scientist

Staff ML Engineer / Applied Scientist

Typeface • Palo Alto, California, United States
serp_jobs.job_card.full_time
Typeface is on a mission to help everyone express their unique imagination.We believe technology is a creative partner that empowers any company to tell their unique stories faster and easier than ...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
ML Infrastructure Engineer — Scale Generative Models

ML Infrastructure Engineer — Scale Generative Models

Apple Inc. • Cupertino, CA, United States
serp_jobs.job_card.full_time
A leading technology company in Cupertino, California, is seeking a ML Infrastructure Engineer to design and optimize the systems that power large-scale model training. The ideal candidate will have...serp_jobs.internal_linking.show_more
serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted