Talent.com
Senior ML Infrastructure Engineer

Senior ML Infrastructure Engineer

Hippocratic AiPalo Alto, California, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

About Us :

Hippocratic AI has developed a safety-focused Large Language Model (LLM) for healthcare. The company believes that a safe LLM can dramatically improve healthcare accessibility and health outcomes in the world by bringing deep healthcare expertise to every human. No other technology has the potential to have this level of global impact on health.

Why Join Our Team :

Innovative Mission : We are developing a safe, healthcare-focused large language model (LLM) designed to revolutionize health outcomes on a global scale.

Visionary Leadership : Hippocratic AI was co-founded by CEO Munjal Shah, alongside a group of physicians, hospital administrators, healthcare professionals, and artificial intelligence researchers from leading institutions, including El Camino Health, Johns Hopkins, Stanford, Microsoft, Google, and NVIDIA.

Strategic Investors : We have raised a total of $278 million in funding, backed by top investors such as Andreessen Horowitz, General Catalyst, Kleiner Perkins, NVIDIA’s NVentures, Premji Invest, SV Angel, and six health systems.

World-Class Team : Our team is composed of leading experts in healthcare and artificial intelligence, ensuring our technology is safe, effective, and capable of delivering meaningful improvements to healthcare delivery and outcomes.

Position Overview :

We are seeking a skilled ML Infrastructure Engineer to help design, build, and maintain a robust orchestration platform for managing a diverse set of Large Language Models (LLMs). The ideal candidate will have hands-on experience with infrastructure orchestration tools such as Kubernetes and Terraform, as well as a strong understanding of multi-cloud environments. This role offers the opportunity to work on cutting-edge technologies and play a key part in scaling our AI infrastructure.

Key Responsibilities : Infrastructure Development & Maintenance :

  • Build and maintain infrastructure for deploying and managing LLMs at scale.
  • Implement automated processes using Kubernetes and Infrastructure as Code (IAC) tools like Terraform.

Orchestration Platform Support :

  • Contribute to the development and optimization of an orchestration platform for managing a heterogeneous set of LLMs.
  • Monitor and troubleshoot issues in the platform to ensure high availability and performance.
  • Cloud Integration :

  • Deploy and manage resources across multiple cloud platforms (e.g., AWS, Azure, Google Cloud).
  • Optimize cloud resource usage for cost efficiency and scalability.
  • Collaboration :

  • Work closely with ML engineers and DevOps teams to ensure smooth deployment and operation of AI models.
  • Provide feedback on system designs and recommend improvements to infrastructure workflows.
  • Performance Monitoring :

  • Implement tools and processes to monitor system health, identify bottlenecks, and improve model lifecycle management.
  • Perform capacity planning to support growing infrastructure needs.
  • Qualifications : Technical Skills :

  • 3-5 years of experience in infrastructure engineering, DevOps, or a related field.
  • Experience with enterprise GPUs such as H200, H100, A100

  • Proficiency with Kubernetes, Terraform, and other IAC tools.
  • Familiarity with multi-cloud environments and cloud-native services (e.g., AWS Lambda, Google Cloud Run, Azure Functions).
  • Programming skills in Python, Bash, or a similar language for automation and scripting.
  • Basic understanding of ML workflows and frameworks like TensorFlow, PyTorch, or Hugging Face is a plus.
  • Soft Skills :

  • Strong problem-solving skills and attention to detail.
  • Good communication and collaboration abilities to work effectively with cross-functional teams.
  • Eagerness to learn new technologies and improve existing systems.
  • Education & Experience :

  • Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent work experience).
  • serp_jobs.job_alerts.create_a_job

    Senior Engineer Infrastructure • Palo Alto, California, United States

    Job_description.internal_linking.related_jobs
    • serp_jobs.job_card.promoted
    Senior Engineer, ML Infrastructure

    Senior Engineer, ML Infrastructure

    CoreWeaveSunnyvale, CA, US
    serp_jobs.job_card.permanent
    CoreWeave is the AI Hyperscaler™, delivering a cloud platform of cutting edge services powering the next wave of AI.Our technology provides enterprises and leading AI labs with the most perfo...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Staff Infrastructure Engineer

    Staff Infrastructure Engineer

    VirtualVocationsSunnyvale, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Staff Infrastructure & Automation Engineer.Key Responsibilities Design, build, and operate cloud-based infrastructure with a focus on Infrastructure as Code and CI / CD m...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    ML Infrastructure Engineer (Staff / Principal)

    ML Infrastructure Engineer (Staff / Principal)

    Menlo VenturesBurlingame, CA, United States
    serp_jobs.job_card.full_time
    We’re a tight-knit team of proven drug hunters, deep learning researchers, and software engineers united by a common mission — drive AI innovation in biochemistry, discovering and developing ground...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Technical Lead, Multimodal Infrastructure

    Technical Lead, Multimodal Infrastructure

    OpenAISan Francisco, CA, United States
    serp_jobs.job_card.full_time
    The Multimodal Research team at OpenAI is building the next generation of AI systems that can understand and generate content across multiple modalities—including text, audio, images, and video.The...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    ML Infrastructure Engineer with GCP

    ML Infrastructure Engineer with GCP

    iSoftTek Solutions IncMountain View, CA, US
    serp_jobs.job_card.full_time
    Job Title : ML Infrastructure Engineer with GCP.Location : Mountain View, CA [Needs to be onsite for 1 week once in a quarter on your own expenses]. Note : Only PST and MST candidates are required.Expe...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Software Engineer, ML Infrastructure - Training Platform

    Software Engineer, ML Infrastructure - Training Platform

    Scale AI, Inc.San Francisco, California, United States
    serp_jobs.job_card.full_time
    Scale is looking for an AI / ML Infrastructure Engineer to join our Machine Learning Infrastructure team to build out our Training Platform. You will partner closely with Machine Learning researchers ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    • serp_jobs.job_card.new
    Senior Infrastructure Engineer - Bellevue or San Francisco

    Senior Infrastructure Engineer - Bellevue or San Francisco

    AircallSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Aircall is the world’s leading integrated customer communications and intelligence platform for growing businesses.Trusted by over 20,000 companies worldwide, Aircall unifies voice and digital chan...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    • serp_jobs.job_card.promoted
    Tech Lead Manager, Safeguards ML Infrastructure

    Tech Lead Manager, Safeguards ML Infrastructure

    AnthropicSan Francisco, CA, US
    serp_jobs.job_card.full_time
    Anthropic's mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Lead Infrastructure Engineer - Remote

    Lead Infrastructure Engineer - Remote

    BigCommerce Pty.San Jose, CA, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Lead Infrastructure Engineer - Remote page is loaded.Lead Infrastructure Engineer - Remote.Apply remote type Remote locations United States - Remote San Francisco, CA Austin, TX time type Full time...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Software Engineer - Infrastructure

    Senior Software Engineer - Infrastructure

    QualifiedSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Qualified is the Agentic Marketing Platform for B2B companies.With Piper the AI SDR Agent, Qualified offers a whole new way to grow inbound pipeline. Piper operates across both the website and email...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Infrastructure Engineer

    Infrastructure Engineer

    VirtualVocationsFremont, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Member of Technical Staff, Infrastructure & Data.Key Responsibilities Build, manage, and scale GPU infrastructure using tools like Kubernetes, Terraform, or Pulumi Mai...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Cloud Infrastructure Engineer

    Cloud Infrastructure Engineer

    VirtualVocationsSunnyvale, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Cloud Infrastructure Engineer.Key Responsibilities Automate and maintain services for high availability, resiliency, scalability, security, and cost optimization Devel...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior Software Engineer, Infrastructure

    Senior Software Engineer, Infrastructure

    SentrySan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Bad software is everywhere, and we’re tired of it.Sentry is on a mission to help developers write better software faster so we can get back to enjoying technology. With more than $217 million in fun...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Infrastructure Engineer

    Senior Infrastructure Engineer

    VirtualVocationsSan Francisco, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Senior Staff Infrastructure Engineer.Key Responsibilities Design and operate multi-tenant architectures for global enterprise customers Develop scalable CI / CD pipeline...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior GPU Infrastructure Engineer

    Senior GPU Infrastructure Engineer

    VirtualVocationsSanta Clara, California, United States
    serp_jobs.job_card.full_time
    A company is looking for a Senior GPU Infrastructure Engineer II.Key Responsibilities Contribute to the Bare Metal GPU product by implementing security and operational best practices across infra...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    Infrastructure Engineer

    Infrastructure Engineer

    Mercor, Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    We use our platform to source, vet, and onboard expert contractors who help train AI models in a wide variety of domains. Our technology is so effective it’s used by all of the top 5 AI labs.We scal...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Tech Lead Manager, Safeguards ML Infrastructure

    Tech Lead Manager, Safeguards ML Infrastructure

    Menlo VenturesSan Francisco, CA, United States
    serp_jobs.job_card.full_time
    Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    ML Infrastructure Engineer

    ML Infrastructure Engineer

    Symbolica AISan Francisco, CA, US
    serp_jobs.job_card.full_time
    Symbolica is an AI research lab pioneering the application of category theory to enable logical reasoning in machines.We're a well-resourced, nimble team of experts on a mission to bridge the g...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Senior Infrastructure Engineer

    Senior Infrastructure Engineer

    Macroscope Inc.San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Macroscope aims to be the source of truth of what's happening for any company that builds software.Our mission is to give leaders clarity and engineers time. We help leaders understand how their pro...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Senior Infrastructure Engineer

    Senior Infrastructure Engineer

    PRAGMATIKESan Francisco, CA, US
    serp_jobs.job_card.full_time
    Cambridge, MA (Eastern Time / UTC -4).We are hiring at Pragmatike to expand our team and drive the growth of our internal projects. Our focus is on developing cutting-edge solutions in Cloud Computi...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30