Talent.com
Staff ML Engineer - Infrastructure

Staff ML Engineer - Infrastructure

ChipStackSan Jose, California, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

About Us

Chips are at the center of today's tech-driven world. But how we design them has not changed in decades, while their complexity and specialization have skyrocketed due to increasing performance demands from applications like AI. We want to change that.

Our team is small, technical, and fast-moving. We’ve built and shipped at the intersection of AI, EDA, and systems software, with deep roots at companies like Qualcomm, Nvidia, Google, Meta, and the Allen Institute for AI. We’re backed by top investors including Khosla Ventures, Cerberus, and Clear Ventures, and already deployed with 10+ innovative customers—from Fortune 100s to cutting-edge AI silicon startups.

About This Role

This role offers a unique opportunity to be part of the founding team at ChipStack, where we are reinventing how modern silicon chips are designed. You will work alongside highly experienced chip designers who have built complex chips, ML scientists who have trained LLMs at scale, and top-notch infrastructure and software engineers. You will get to leverage your experience building ML and data infrastructure and apply it to some of the hardest problems in chip design.

About You

You want to be at a startup because you love to be at the center of all the dynamism that a startup offers.

You are willing to put in the hours and go the extra mile to ensure every customer has an exceptional experience.

You are self-motivated with a sense of urgency and can operate independently without much guidance.

You are not afraid of difficult problems and enjoy venturing into areas you have not explored before.

This Role

We’re looking for a strong, experienced ML Infrastructure Engineer to join our founding team. We are seeking someone with experience designing and scaling ML infrastructure and training pipelines. You’ll be responsible for building the core infrastructure that enables training, fine-tuning, evaluation, and deployment of LLMs across cloud and on-premise environments. Your work will directly impact product capabilities and speed of iteration.

What's needed

5+ years of experience in ML infrastructure or adjacent roles

Deep expertise in Python and experience with training frameworks like PyTorch or TensorFlow

Strong systems engineering skills and experience with distributed training, data pipelines, and performance optimization

Experience deploying ML models to production (REST APIs, batch jobs, streaming pipelines)

Proficiency with cloud platforms (e.g., GCP, AWS) and containerized systems (Docker, Kubernetes)

Experience managing GPU / TPU workloads efficiently

Good communication skills and the ability to work directly with engineers and customers

Prior experience training or fine-tuning LLMs

Experience setting up observability, monitoring, and evaluation pipelines for ML models

What's good to have

Exposure to chip design fundamentals (via coursework or elsewhere)

Experience at an early-stage startup

Our Culture

Challenge status quo : We are innovators who can challenge the status quo and push forward our vision of the world.

Strong opinions, loosely held : We are low on ego, but high on collaboration. We are okay to be wrong and are always open to learning.

Ship fast, ship quality : We ruthlessly prioritize what matters. We build a few things, but at lightning speed with high quality.

Proud of our craft : Attention to detail is in our DNA. We take pride in what we build and ensure they exceed the high standards of the semiconductor industry.

serp_jobs.job_alerts.create_a_job

Staff Engineer Infrastructure • San Jose, California, United States

Job_description.internal_linking.related_jobs
  • serp_jobs.job_card.promoted
Staff Infrastructure Engineer

Staff Infrastructure Engineer

ScribdSan Francisco, CA, United States
serp_jobs.job_card.full_time
At Scribd (pronounced “scribbed”), our mission is to spark human curiosity.Join our team as we create a world of stories and knowledge, democratize the exchange of ideas and information, and empowe...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
  • serp_jobs.job_card.promoted
Staff Software Engineer - Core Infrastructure

Staff Software Engineer - Core Infrastructure

6SenseSan Francisco, CA, United States
serp_jobs.job_card.full_time
Staff Software Engineer - Core Infrastructure.B2B organizations create revenue by predicting customers most likely to buy and recommending the best course of action to engage anonymous buying teams...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Senior Engineer, ML Infrastructure

Senior Engineer, ML Infrastructure

CoreWeaveSunnyvale, CA, US
serp_jobs.job_card.permanent
CoreWeave is the AI Hyperscaler™, delivering a cloud platform of cutting edge services powering the next wave of AI.Our technology provides enterprises and leading AI labs with the most perfo...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
ML Infrastructure Engineer (Staff / Principal)

ML Infrastructure Engineer (Staff / Principal)

Menlo VenturesBurlingame, CA, United States
serp_jobs.job_card.full_time
We’re a tight-knit team of proven drug hunters, deep learning researchers, and software engineers united by a common mission — drive AI innovation in biochemistry, discovering and developing ground...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Staff Infrastructure Engineer, Discovery Team

Staff Infrastructure Engineer, Discovery Team

Menlo VenturesSan Francisco, CA, United States
serp_jobs.job_card.full_time
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Machine Learning Infrastructure Engineer

Machine Learning Infrastructure Engineer

Greylock PartnersSan Francisco, CA, United States
serp_jobs.job_card.full_time
Machine Learning Infrastructure Engineer — join early B2C investment to help build large-scale ML infrastructure for a cutting-edge AI-first mobile product. Founders have experience building iconic ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Software Engineer, ML Infrastructure - Training Platform

Software Engineer, ML Infrastructure - Training Platform

Scale AI, Inc.San Francisco, California, United States
serp_jobs.job_card.full_time
Scale is looking for an AI / ML Infrastructure Engineer to join our Machine Learning Infrastructure team to build out our Training Platform. You will partner closely with Machine Learning researchers ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
ML Infrastructure Engineer

ML Infrastructure Engineer

PhizenixMenlo Park, CA, US
serp_jobs.job_card.full_time +1
Menlo Park, CA | On-Site | Full-Time / Direct Hire.Client Opportunity | Through Phizenix.Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an AI startup pioneering ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Tech Lead Manager, Safeguards ML Infrastructure

Tech Lead Manager, Safeguards ML Infrastructure

AnthropicSan Francisco, CA, US
serp_jobs.job_card.full_time
Anthropic's mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Machine Learning Engineer — Infrastructure

Machine Learning Engineer — Infrastructure

Fundamental Research LabsMenlo Park, CA, United States
serp_jobs.job_card.full_time
Machine Learning Infrastructure Engineer.AI : from high-performance inference engines to the underlying agent technologies and large-scale compute clusters that keep everything running.You’ll collab...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Hardcore Engineer - Infrastructure / Supercomputing

Hardcore Engineer - Infrastructure / Supercomputing

xAIPalo Alto, CA, US
serp_jobs.job_card.full_time
AI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
AI / ML Engineer, Staff

AI / ML Engineer, Staff

LimohealthSan Francisco, CA, United States
serp_jobs.job_card.full_time
At Charta, we're pioneering a transformative approach to healthcare administration and patient care through the power of generative AI. Our mission is to revolutionize this critical yet often cumber...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Machine Learning Engineer - Infrastructure

Machine Learning Engineer - Infrastructure

Fundamental Research LabsMenlo Park, CA, US
serp_jobs.job_card.full_time
Machine Learning Infrastructure Engineer.AI : from high-performance inference engines to the underlying agent technologies and large-scale compute clusters that keep everything running.You'll collab...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
ML Engineer

ML Engineer

PhizenixMenlo Park, CA, US
serp_jobs.job_card.full_time +1
Client Opportunity | Through Phizenix.Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an innovative generative AI startup that's developing diffusion-based larg...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Senior Staff Infrastructure Security Engineer

Senior Staff Infrastructure Security Engineer

Promote ProjectSan Francisco, CA, United States
serp_jobs.job_card.full_time
Senior Staff Infrastructure Security Engineer.Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated, purpose-built AI infrastructure s...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Infrastructure Engineer

Infrastructure Engineer

Mercor, Inc.San Francisco, CA, United States
serp_jobs.job_card.full_time
We use our platform to source, vet, and onboard expert contractors who help train AI models in a wide variety of domains. Our technology is so effective it’s used by all of the top 5 AI labs.We scal...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Tech Lead Manager, Safeguards ML Infrastructure

Tech Lead Manager, Safeguards ML Infrastructure

Menlo VenturesSan Francisco, CA, United States
serp_jobs.job_card.full_time
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
ML Infrastructure Engineer

ML Infrastructure Engineer

Symbolica AISan Francisco, CA, US
serp_jobs.job_card.full_time
Symbolica is an AI research lab pioneering the application of category theory to enable logical reasoning in machines.We're a well-resourced, nimble team of experts on a mission to bridge the g...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Member of Technical Staff : ML Infrastructure, Platform Engineer

Member of Technical Staff : ML Infrastructure, Platform Engineer

essential AISan Francisco, CA, US
serp_jobs.job_card.full_time
Essential AI is building an open platform to fuel and accelerate AI breakthroughs globally.Our open models, robust tooling, reproducible pipelines, and evaluation frameworks are designed for collab...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Staff Infrastructure Engineer

Staff Infrastructure Engineer

hackeroneSan Francisco, California, United States
serp_jobs.job_card.full_time
HackerOne is a global leader in offensive security solutions.Our HackerOne Platform combines AI with the ingenuity of the largest community of security researchers to find and fix security, privacy...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days