Talent.com
ML Infrastructure Engineer

ML Infrastructure Engineer

PhizenixMenlo Park, California, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
  • serp_jobs.job_card.permanent
job_description.job_card.job_description

ML Infrastructure Engineer

Menlo Park, CA | On-Site | Full-Time / Direct Hire

Client Opportunity | Through Phizenix

Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an AI startup pioneering diffusion-based large language models—built for faster generation, multimodal integration, and scalable enterprise deployment.

We’re looking for a ML Infrastructure Engineer  to help build the infrastructure that powers large-scale model training and real-time inference. You’ll collaborate with world-class researchers and engineers to design high-performance, distributed systems that bring advanced LLMs into production.

Responsibilities

Design and manage distributed infrastructure for ML training at scale

Optimize model serving systems for low-latency inference

Build automated pipelines for data processing, model training, and deployment

Implement observability tools to monitor performance in production

Maximize resource utilization across GPU clusters and cloud environments

Translate research requirements into robust, scalable system designs

Must-Haves

PhD in Computer Science, Engineering, or a related field (or equivalent experience)

Strong foundation in software engineering, systems design, and distributed systems

Experience with cloud platforms (AWS, GCP, or Azure)

Proficient in Python and at least one systems-level language (C++ / Rust / Go)

Hands-on experience with Docker, Kubernetes, and CI / CD workflows

Familiarity with ML frameworks like PyTorch or TensorFlow from a systems perspective

Understanding of GPU programming and high-performance infrastructure

Nice-to-Haves

Experience with large-scale ML training clusters and GPU orchestration

Knowledge of LLM-serving tools (vLLM, TensorRT, ONNX Runtime)

Experience with distributed training strategies (e.g., data / model / pipeline parallelism)

Familiarity with orchestration tools like Kubeflow or Airflow

Background in performance tuning, system profiling, and MLOps best practices

At Phizenix , we’re committed to supporting diverse and inclusive teams. This is your chance to shape the systems that power the next generation of AI innovation. Let’s build the future—together.

California Pay Range

$180,000 - $200,000 USD

serp_jobs.job_alerts.create_a_job

Infrastructure Engineer • Menlo Park, California, United States

Job_description.internal_linking.related_jobs
  • serp_jobs.job_card.promoted
ML Infrastructure Engineer (Staff / Principal)

ML Infrastructure Engineer (Staff / Principal)

Menlo VenturesBurlingame, CA, United States
serp_jobs.job_card.full_time
We’re a tight-knit team of proven drug hunters, deep learning researchers, and software engineers united by a common mission — drive AI innovation in biochemistry, discovering and developing ground...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Machine Learning Infrastructure Engineer

Machine Learning Infrastructure Engineer

Ambience Healthcare, Inc.San Francisco, CA, US
serp_jobs.job_card.full_time
Ambience Healthcare is the leading AI platform for documentation, coding, and clinical workflow, built to reduce administrative burden and protect revenue integrity at the point of care.Trusted by ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Technical Lead, Multimodal Infrastructure

Technical Lead, Multimodal Infrastructure

OpenAISan Francisco, CA, United States
serp_jobs.job_card.full_time
The Multimodal Research team at OpenAI is building the next generation of AI systems that can understand and generate content across multiple modalities—including text, audio, images, and video.The...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Machine Learning Infrastructure Engineer

Machine Learning Infrastructure Engineer

Greylock PartnersSan Francisco, CA, United States
serp_jobs.job_card.full_time
Machine Learning Infrastructure Engineer — join early B2C investment to help build large-scale ML infrastructure for a cutting-edge AI-first mobile product. Founders have experience building iconic ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Tech Lead Manager, Safeguards ML Infrastructure

Tech Lead Manager, Safeguards ML Infrastructure

AnthropicSan Francisco, CA, US
serp_jobs.job_card.full_time
Anthropic's mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Founding Machine Learning Infrastructure Engineer

Founding Machine Learning Infrastructure Engineer

NomadicML Inc.San Francisco, CA, United States
serp_jobs.job_card.full_time
Harvard, where they both did research in the intersection of computation and evaluations.Between them, they have authored multiple published papers in the machine learning domain and hold numerous ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Machine Learning Engineer — Infrastructure

Machine Learning Engineer — Infrastructure

Fundamental Research LabsMenlo Park, CA, United States
serp_jobs.job_card.full_time
Machine Learning Infrastructure Engineer.AI : from high-performance inference engines to the underlying agent technologies and large-scale compute clusters that keep everything running.You’ll collab...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
ML Infrastructure Engineering Manager, SafeguardsSan Francisco, CA

ML Infrastructure Engineering Manager, SafeguardsSan Francisco, CA

AnthropicSan Francisco, CA, US
serp_jobs.job_card.full_time
ML Infrastructure Engineering Manager, Safeguards.Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for socie...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Hardcore Engineer - Infrastructure / Supercomputing

Hardcore Engineer - Infrastructure / Supercomputing

xAIPalo Alto, CA, US
serp_jobs.job_card.full_time
AI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Machine Learning Engineer - Infrastructure

Machine Learning Engineer - Infrastructure

Fundamental Research LabsMenlo Park, CA, US
serp_jobs.job_card.full_time
Machine Learning Infrastructure Engineer.AI : from high-performance inference engines to the underlying agent technologies and large-scale compute clusters that keep everything running.You'll collab...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Lead Infrastructure Engineer - Remote

Lead Infrastructure Engineer - Remote

BigCommerce Pty.San Francisco, CA, United States
serp_jobs.filters.remote
serp_jobs.job_card.full_time
Lead Infrastructure Engineer - Remote page is loaded.Lead Infrastructure Engineer - Remote.Apply remote type Remote locations United States - Remote San Francisco, CA Austin, TX time type Full time...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Software Engineer (Infrastructure)

Software Engineer (Infrastructure)

GreptileSan Francisco, CA, United States
serp_jobs.job_card.full_time
K – $210K • $75K – $125K Equity • Up to $25,000 in relocation assistance.Greptile is an AI code reviewer that catches bugs and anti-patterns in pull requests with complete context of the codebase.H...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Infrastructure Engineers wanted

Infrastructure Engineers wanted

RustsyndiSan Francisco, CA, United States
serp_jobs.job_card.full_time
Infrastructure Engineers wanted at EdgeDB.Join EdgeDB, an open-source database built on top of Postgres, and help scale out our cloud infrastructure. As an SRE / Infrastructure Engineer at EdgeDB, you...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
ML Infrastructure Engineer

ML Infrastructure Engineer

PhizenixMenlo Park, CA, US
serp_jobs.job_card.full_time +1
Menlo Park, CA | On-Site | Full-Time / Direct Hire.Looking for ML Infra experts (Bay Area preferred) with deep experience in CUDA, GPU optimization, VLLMs, and LLM inference—pure language focus...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
Infrastructure Engineer

Infrastructure Engineer

Mercor, Inc.San Francisco, CA, United States
serp_jobs.job_card.full_time
We use our platform to source, vet, and onboard expert contractors who help train AI models in a wide variety of domains. Our technology is so effective it’s used by all of the top 5 AI labs.We scal...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Infrastructure Engineer - Developer Productivity

Infrastructure Engineer - Developer Productivity

Recruiting From ScratchSan Francisco, CA, United States
serp_jobs.job_card.full_time
Who is Recruiting from Scratch : .Recruiting from Scratch is a talent firm that focuses on placing the best candidate for our clients. Our team is 100% remote and we work with teams across North Ameri...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Tech Lead Manager, Safeguards ML Infrastructure

Tech Lead Manager, Safeguards ML Infrastructure

Menlo VenturesSan Francisco, CA, United States
serp_jobs.job_card.full_time
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
  • serp_jobs.job_card.promoted
Machine Learning Engineer - Infrastructure

Machine Learning Engineer - Infrastructure

ZipRecruiterSan Francisco, CA, United States
serp_jobs.job_card.full_time
Nextdoor (NYSE : NXDR) is the essential neighborhood network.Neighbors, public agencies, and businesses use Nextdoor to connect around local information that matters in more than 340,000 neighborhoo...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
ML Infrastructure Engineer

ML Infrastructure Engineer

Symbolica AISan Francisco, CA, US
serp_jobs.job_card.full_time
Symbolica is an AI research lab pioneering the application of category theory to enable logical reasoning in machines.We're a well-resourced, nimble team of experts on a mission to bridge the g...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
  • serp_jobs.job_card.promoted
  • serp_jobs.job_card.new
AI Infrastructure Engineer

AI Infrastructure Engineer

StackAISan Francisco, CA, United States
serp_jobs.job_card.full_time
As a Series A company, your work will be foundational, enabling safe, efficient, and reliable AI workflows from end to end. Design and implement scalable backend architectures for AI workloads (infe...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours