Talent.com
Senior ML Inference Platform Engineer
Senior ML Inference Platform EngineerAION • Seattle, WA, US
Senior ML Inference Platform Engineer

Senior ML Inference Platform Engineer

AION • Seattle, WA, US
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
  • serp_jobs.filters_job_card.quick_apply
job_description.job_card.job_description

About AION

AION is building the next generation of AI cloud platform by transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance, AION democratizes access to compute power for AI training, fine-tuning, inference, data labeling, and full stack AI / ML lifecycle.

Led by high-pedigree founders with previous exits, AION is well-funded by major VCs with strategic global partnerships. Headquartered in the US with global presence, the company is building its initial core team across India, London and Seattle.

Who You Are

You're an ML systems engineer who's passionate about building high-performance inference infrastructure. You don't need to be an expert in everything - this field is evolving too rapidly for that - but you have strong fundamentals and the curiosity to dive deep into optimization challenges. You thrive in early-stage environments where you'll learn cutting-edge techniques while building production systems. You think systematically about performance bottlenecks and are excited to push the boundaries of what's possible in AI infrastructure.

Requirements

Key Responsibilities

  • Build and optimize LLM inference systems working towards 2-4x performance improvements over standard frameworks like vLLM and TensorRT-LLM.
  • Implement modern inference optimizations including KV-cache management, dynamic batching, speculative decoding, compression and quantization strategies.
  • Develop GPU optimization solutions using CUDA, with opportunities to learn advanced techniques like Triton kernel development and CUDA graphs.
  • Design model evaluation and benchmarking systems to assess performance across reasoning, coding, and safety metrics.
  • Research and integrate trending open-source models (DeepSeek R1, Qwen 3, Llama 4, Mistral variants) with optimized configurations.
  • Build performance monitoring and profiling tools for GPU cluster analysis, bottleneck identification, and cost optimization.
  • Create cost-performance optimization strategies that balance throughput, latency, and infrastructure costs.
  • Explore agent orchestration capabilities for multi-step reasoning and tool integration workflows.
  • Collaborate with tech and product teams to identify optimization opportunities and translate them into production improvements.

Skills & Experience

  • High agency individual looking to own and influence product architecture and company direction
  • 3+ years of software engineering experience with focus on performance-critical systems and production deployments.
  • Strong Python expertise and working knowledge of C++ for performance optimization.
  • Working understanding of deep learning fundamentals including transformer architectures, attention mechanisms, and neural network training / inference.
  • Hands-on experience of model serving and deployment techniques.
  • Experience with at least one modern inference framework (vLLM, TensorRT-LLM, SGLang or similar) in a production setting.
  • Hands-on experience with PyTorch including model development, training loops, and basic distributed computing concepts.
  • Understanding of distributed systems concepts including load balancing, auto-scaling, and fault tolerance.
  • Basic GPU programming experience with CUDA or willingness to quickly learn GPU optimization techniques.
  • Strong debugging and performance profiling skills for identifying and resolving system bottlenecks.
  • Benefits

  • Join the ground floor of a mission-driven AI startup revolutionizing compute infrastructure.
  • Work with a high-caliber, globally distributed team backed by major VCs.
  • Competitive compensation and benefits.
  • Fast-paced, flexible work environment with room for ownership and impact.
  • Hybrid model : 3 days in-office, 2 days remote with flexibility to work remotely for part of the year.
  • In case you got any questions about the role please reach out to hiring manager on linkedin or X .

    serp_jobs.job_alerts.create_a_job

    Senior Engineer Platform • Seattle, WA, US

    Job_description.internal_linking.related_jobs
    Senior ML Platform Engineer — Scalable AI Infrastructure

    Senior ML Platform Engineer — Scalable AI Infrastructure

    Apple Inc. • Seattle, WA, United States
    serp_jobs.job_card.full_time
    A leading technology company is seeking a Machine Learning Engineer in Seattle to design and operate large-scale distributed systems for intelligence and search experiences.This role involves optim...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    ML Research Engineer, ML Systems

    ML Research Engineer, ML Systems

    Scale AI, Inc. • Seattle, WA, United States
    serp_jobs.job_card.full_time
    Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and opera...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Hive • Seattle, Washington, United States
    serp_jobs.job_card.full_time
    Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organizations.The company...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Continua • Seattle, Oregon, USA
    serp_jobs.job_card.full_time
    AI agent that users can invite into group conversations to make planning coordination and information retrieval effortless. Funded by Google and Bessemer Venture Partners Continua is developing the ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    ML Engineer : Distributed Inference & Serving Systems

    ML Engineer : Distributed Inference & Serving Systems

    ByteDance • Seattle, WA, United States
    serp_jobs.job_card.full_time
    A leading technology company in Seattle is looking for a Machine Learning Engineer to design and implement distributed inference infrastructure. The ideal candidate will have a strong background in ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior ML Engineer

    Senior ML Engineer

    Truveta • Seattle, Washington, United States
    serp_jobs.job_card.full_time
    Truveta is the world’s first health provider led data platform with a vision of Saving Lives with Data.Our mission is to enable researchers to find cures faster, empower every clinician to be an ex...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior ML Engineer - Real-Time, Large-Scale Travel Platform

    Senior ML Engineer - Real-Time, Large-Scale Travel Platform

    Expedia, Inc. • Seattle, WA, United States
    serp_jobs.job_card.full_time
    A leading travel technology company in Seattle is seeking a Machine Learning Engineer III to design and scale intelligent systems for their global travel marketplace. You will collaborate with cross...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Platform Engineer - AI / ML Microservices (Remote)

    Senior Platform Engineer - AI / ML Microservices (Remote)

    Medium • Seattle, WA, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    A technology company specializing in AI is seeking a Senior Software Engineer to design and develop software components integrated into cloud architectures. Responsibilities include building automat...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior ML Model Integration & Platform Engineer

    Senior ML Model Integration & Platform Engineer

    Apple • Seattle, WA, United States
    serp_jobs.job_card.full_time
    Join the Apple Service Engineering (ASE) team and drive innovation that matters! The ASE team builds and provides systems and infrastructure that fuel Apple's services. As part of this team, you wil...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_1_day • serp_jobs.job_card.promoted
    Senior ML Platform Engineer : SDKs, Frameworks & Serving

    Senior ML Platform Engineer : SDKs, Frameworks & Serving

    Apple Inc. • Seattle, WA, United States
    serp_jobs.job_card.full_time
    A leading technology company is seeking a Senior Machine Learning Platform Engineer in Seattle, WA.This role involves building core platform capabilities for model management, integrating with comp...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    ML Engineer

    ML Engineer

    Catalyst Labs • Seattle, Oregon, USA
    serp_jobs.job_card.full_time
    Is a rapidly growing Tier 1 VC backed startup based in New York with $60 million in funding revolutionizing how outside sales and service teams work. Their AI technology captures and analyzes real-w...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior ML Engineer, Applied Research — Remote

    Senior ML Engineer, Applied Research — Remote

    Pinterest • Seattle, WA, United States
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    A leading social media platform in Seattle is seeking a Machine Learning Engineer to innovate with advanced algorithms that personalize user experiences. The ideal candidate has over 4 years of expe...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Principal ML Engineer - Content Relevance & Recommendations

    Principal ML Engineer - Content Relevance & Recommendations

    Minimal • Seattle, WA, United States
    serp_jobs.job_card.full_time
    A leading technology company in Seattle is looking for a Principal Machine Learning Engineer to enhance their content relevance systems. The ideal candidate will have extensive experience in machine...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior AI / ML Engineer – Remote

    Senior AI / ML Engineer – Remote

    UnitedHealth Group • Seattle, Washington, US
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives.The work you do with our team will directly improve health outcomes by connect...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior ML Engineer - Ranking & Recommendations (RSU Eligible)

    Senior ML Engineer - Ranking & Recommendations (RSU Eligible)

    Snap Inc. • Seattle, WA, United States
    serp_jobs.job_card.full_time
    A leading technology company in Seattle is seeking a Machine Learning Engineer to create impactful models that drive user and advertiser value. The role requires strong machine learning skills, coll...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Principal ML Engineer - Content Relevance & Recommendations

    Principal ML Engineer - Content Relevance & Recommendations

    Snap Inc. • Seattle, WA, United States
    serp_jobs.job_card.full_time
    A forward-thinking technology company in Seattle is seeking a Principal Machine Learning Engineer to drive the technical roadmap for content recommendation systems. You will collaborate with diverse...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Software Engineer, Ads ML Infrastructure - USDS

    Senior Software Engineer, Ads ML Infrastructure - USDS

    Tik Tok • Seattle, WA, United States
    serp_jobs.job_card.full_time
    About the team The ads system at TikTok USDS operates on a massive scale and serves millions of advertisers, clients and influencers across the world. The quality of the ads system highly depends on...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior AI / ML Software Engineer for Spaceflight

    Senior AI / ML Software Engineer for Spaceflight

    Blue Origin LLC • Seattle, WA, United States
    serp_jobs.job_card.full_time
    A space technology company is seeking a skilled Software Engineer III - AI / ML to join its Machine Learning team.You will design and implement scalable AI services, collaborate with stakeholders, an...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted