Talent.com
AIML - Staff ML Infrastructure Engineer, ML Platform & Technology - Pre-training Compute
AIML - Staff ML Infrastructure Engineer, ML Platform & Technology - Pre-training ComputeApple Inc. • San Francisco, CA, United States
AIML - Staff ML Infrastructure Engineer, ML Platform & Technology - Pre-training Compute

AIML - Staff ML Infrastructure Engineer, ML Platform & Technology - Pre-training Compute

Apple Inc. • San Francisco, CA, United States
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

AIML - Staff ML Infrastructure Engineer, ML Platform & Technology - Pre-training Compute

San Francisco Bay Area, California, United States Machine Learning and AI

Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other’s ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It’s the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you’ll do more than join something — you’ll add something!

Description

As an engineer on ML Compute team, your work will include : - Drive large-scale pre-training initiatives to support cutting‑edge foundation models, focusing on resiliency, efficiency, scalability, and resource optimization.- Enhance distributed training techniques for foundation models.- Research and implement new patterns and technologies to improve system performance, maintainability, and design.- Optimize execution and performance of workloads built with JAX, PyTorch, XLA and CUDA on large distributed systems.- Leverage high‑performance networking technologies such as NCCL for GPU collectives and TPU interconnect (ICI / Fabric) for large‑scale distributed training.- Architect a robust MLOps platform to streamline and automate pretraining operations.- Operationalize large‑scale ML workloads on Kubernetes, ensuring distributed trainings are robust, efficient, and fault‑tolerant.- Lead complex technical projects, defining requirements and tracking progress with team members.- Collaborate with cross‑functional engineers to solve large‑scale ML training challenges.- Mentor engineers in areas of your expertise, fostering skill growth and knowledge sharing.- Cultivate a team centered on collaboration, technical excellence, and innovation.

Minimum Qualifications

  • Bachelors in Computer Science, engineering, or a related field
  • 6+ years of hands‑on experience in building scalable backend systems for training and evaluation of machine learning models
  • Proficient in relevant programming languages, like Python or Go
  • Strong expertise in distributed systems, reliability and scalability, containerization, and cloud platforms
  • Proficient in cloud computing infrastructure and tools : Kubernetes, Ray, PySpark
  • Ability to clearly and concisely communicate technical and architectural problems, while working with partners to iteratively find

Preferred Qualifications

  • Advance degrees in Computer Science, engineering, or a related field
  • Proficient in working with and debugging accelerators, like : GPU, TPU, AWS Trainium
  • Proficient in ML training and deployment frameworks, like : JAX, Tensorflow, PyTorch, TensorRT, vLLM
  • At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $181,100 and $318,400, and your base pay will depend on your skills, qualifications, experience, and location.

    Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including : Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses — including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation.

    Note : Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

    Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.

    #J-18808-Ljbffr

    serp_jobs.job_alerts.create_a_job

    Ml Engineer • San Francisco, CA, United States

    Job_description.internal_linking.related_jobs
    Senior ML Infra Architect for Scalable AI Training

    Senior ML Infra Architect for Scalable AI Training

    Amazon • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    A leading tech company in San Francisco is seeking a Sr.Machine Learning Engineer to lead the development of next-generation ML training infrastructure. This high-impact role requires over 8 years o...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    AI Safety ML Infrastructure Engineer

    AI Safety ML Infrastructure Engineer

    Virtue AI • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    An innovative AI security startup in San Francisco seeks an Applied Machine Learning Engineer to develop robust ML systems that address AI safety challenges. The ideal candidate will have a degree i...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Clinical Lead, Internal Medicine

    Clinical Lead, Internal Medicine

    Ethos Veterinary Health • Redwood City, CA, US
    serp_jobs.job_card.full_time
    Clinical Lead, Internal Medicine.Full-Time | Variable weekdays with some weekends.SAGE Veterinary Centers is a nationally recognized leader in specialty and emergency medicine.At our Redwood City l...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    AI Infrastructure Engineer, Model Serving Platform

    AI Infrastructure Engineer, Model Serving Platform

    Scale AI • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    As a software engineer on the ML Infrastructure team, you will work on developing the platform for orchestrating post-training and model evaluation jobs. At Scale, we are constantly developing new d...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Staff ML Engineer - AI-Powered Observability Platform

    Staff ML Engineer - AI-Powered Observability Platform

    Cisco Systems • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    A leading technology company seeks an experienced software engineer to join their team focused on AI innovations.The role emphasizes developing scalable cloud-based systems and includes responsibil...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    AIML - Staff ML Infrastructure Engineer, ML Platform & Technology - Pre-training Compute

    AIML - Staff ML Infrastructure Engineer, ML Platform & Technology - Pre-training Compute

    Apple Inc. • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    AIML - Staff ML Infrastructure Engineer, ML Platform & Technology - Pre-training Compute.San Francisco Bay Area, California, United States Machine Learning and AI. Apple is where individual imaginat...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior ML Platform Engineer - Training & Inference

    Senior ML Platform Engineer - Training & Inference

    Zoox • Foster City, CA, United States
    serp_jobs.job_card.full_time
    A tech company specializing in autonomous vehicles is seeking an experienced ML Infrastructure Engineer to build scalable ML training frameworks and lead the design of a robust ML platform.Candidat...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    ML Infrastructure Engineer (Menlo Park)

    ML Infrastructure Engineer (Menlo Park)

    Strativ Group • Menlo Park, CA, United States
    serp_jobs.job_card.full_time
    We are partnered with a Stealth AI Lab (backed by top-tier investors and advised by pioneering figures in generative and interactive media) that is hiring a Staff ML Infrastructure Engineer.This co...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Manager, REMS Data Programmer

    Senior Manager, REMS Data Programmer

    Jazz Pharmaceuticals • Menlo Park, California, USA
    serp_jobs.job_card.full_time
    If you are a current Jazz employee please apply via the Internal Career site.Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Travel Echo Tech - $1,670 to $1,851 per week in Menlo Park, CA

    Travel Echo Tech - $1,670 to $1,851 per week in Menlo Park, CA

    AlliedTravelCareers • Menlo Park, CA, US
    serp_jobs.job_card.full_time
    AlliedTravelCareers is working with LRS Healthcare to find a qualified Echo Tech in Menlo Park, California, 94025!.Ready to start your next travel adventure? LRS Healthcare offers a full benefits p...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    ML Infrastructure Engineer (Staff / Principal)

    ML Infrastructure Engineer (Staff / Principal)

    Genesis Molecular AI • Burlingame, CA, United States
    serp_jobs.job_card.full_time
    We’re a tight-knit team of proven drug hunters, deep learning researchers, and software engineers united by a common mission — drive AI innovation in biochemistry, discovering and developing ground...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior / Staff ML Engineer, Recommendations Systems

    Senior / Staff ML Engineer, Recommendations Systems

    Grow Therapy • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Grow Therapy is on a mission to serve as the trusted partner for therapists growing their practice, and patients accessing high‑quality care. Powered by technology, we are a three‑sided marketplace ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Staff ML Engineer - Healthcare AI Platform Leader (Hybrid / Remote)

    Staff ML Engineer - Healthcare AI Platform Leader (Hybrid / Remote)

    Headspace Sourcing • San Francisco, CA, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    A leading mental health technology company is looking for a Staff Software Engineer specializing in Machine Learning to develop AI solutions that enhance mental healthcare.This role emphasizes tech...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Staff ML Engineer

    Staff ML Engineer

    Google • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Google's software engineers develop the next‑generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle in...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Machine Learning Infrastructure Engineer

    Machine Learning Infrastructure Engineer

    Character.AI • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Machine Learning Infrastructure Engineer.Machine Learning Infrastructure Engineer.Machine Learning Infrastructure Engineer. Machine Learning Infrastructure Engineer.Get AI-powered advice on this job...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Staff ML Engineer — LLMs & Production AI

    Senior Staff ML Engineer — LLMs & Production AI

    Rippling • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    A prominent tech company based in San Francisco is looking for a Senior Staff Machine Learning Engineer to drive innovation in their products. You will collaborate across teams to develop and implem...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Staff ML Infrastructure Engineer - Scale & Inference

    Staff ML Infrastructure Engineer - Scale & Inference

    Snap Inc. • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    A leading tech company is seeking a Software Engineer for ML Infrastructure in San Francisco.This role involves designing high-performance systems for machine learning workloads, collaborating with...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    Staff ML Engineer

    Staff ML Engineer

    Grindr LLC • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    This is a hybrid role based in our San Francisco or Palo Alto offices (Palo Alto preferred) and will require you to be in the office on Tuesdays and Thursdays. What’s So Interesting About This Role?...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted