Talent.com
Software Engineer - Model Performance
Software Engineer - Model PerformanceAI Fund • San Francisco, CA, United States
serp_jobs.error_messages.no_longer_accepting
Software Engineer - Model Performance

Software Engineer - Model Performance

AI Fund • San Francisco, CA, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Overview

Join to apply for the Software Engineer - Model Performance role at AI Fund .

Are you passionate about advancing the application of artificial intelligence? We are looking for a Software Engineer focused on ML performance to join our dynamic team. This role is ideal for someone who thrives in a fast-paced startup environment and is eager to make significant contributions to the exciting field of LLM inference. If you are a backend engineer who thrives on making things faster and is excited about open-source ML models, we look forward to your application.

About Baseten

Baseten provides the infrastructure, tooling, and expertise needed to bring great AI products to market - fast. Backed by top investors including

Responsibilities

  • Implement, refine, and productionize cutting-edge techniques (quantization, speculative decoding, kv cache reuse, chunked prefill and LoRA) for ML model inference and infrastructure.
  • Deep dive into underlying codebases of TensorRT, PyTorch, TensorRT-LLM, vllm, sglang, CUDA, and other libraries to debug ML performance issues.
  • Apply and scale optimization techniques across a wide range of ML models, particularly large language models.
  • Collaborate with a diverse team to design and implement innovative solutions.
  • Own projects from idea to production.

Qualifications

  • Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field.
  • Experience with one or more general-purpose programming languages, such as Python or C++.
  • Familiarity with LLM optimization techniques (e.g., quantization, speculative decoding, continuous batching).
  • Strong familiarity with ML libraries, especially PyTorch, TensorRT, or TensorRT-LLM.
  • Demonstrated interest and experience in LLMs.
  • Deep understanding of GPU architecture.
  • Bonus :
  • Proficiency in enhancing the performance of software systems, particularly in the context of large language models (LLMs).

  • Experience with CUDA or similar technologies.
  • Deep understanding of software engineering principles and a proven track record of developing and deploying AI / ML inference solutions.
  • Experience with Docker and Kubernetes.
  • Benefits

  • Competitive compensation package.
  • This is a unique opportunity to be part of a rapidly growing startup in one of the most exciting engineering fields of our era.
  • An inclusive and supportive work culture that fosters learning and growth.
  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
  • Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

    At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

    Job Details

  • Seniority level : Entry level
  • Employment type : Full-time
  • Job function : Engineering and Information Technology
  • Industries : Venture Capital and Private Equity Principals
  • Location : San Francisco, CA

    Salary : $160,000.00-$180,000.00 per year

    #J-18808-Ljbffr

    serp_jobs.job_alerts.create_a_job

    Engineer Performance • San Francisco, CA, United States

    Job_description.internal_linking.related_jobs
    Software Engineer (AI Performance)

    Software Engineer (AI Performance)

    Gimlet Labs, Inc • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Gimlet Labs is building the foundation for the next generation of AI applications.As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck.Gimlet is redefi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Software Engineer - Infrastructure, Machine Learning

    Senior Software Engineer - Infrastructure, Machine Learning

    Baton • San Francisco, California, United States
    serp_jobs.job_card.full_time
    With $10B in freight under management, our technology reaches every part of the U.We design and ship category-defining software that enables Ryder and its 50,000+ customers—including some of the wo...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer - Model API's

    Software Engineer - Model API's

    Baseten • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and Zed. By uniting applied AI research, flexible inf...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Software Engineer, Machine Learning

    Senior Software Engineer, Machine Learning

    Planet Labs PBC • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    We believe in using space to help life on Earth.Planet designs, builds, and operates the largest constellation of imaging satellites in history. This constellation delivers an unprecedented dataset ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Software Engineer, SystemML - Scaling / Performance

    Software Engineer, SystemML - Scaling / Performance

    META • Menlo Park, CA, United States
    serp_jobs.job_card.full_time
    In this role, you will be a member of the Network.AI Software team and part of the bigger DC networking organization.The team develops and owns the software stack around NCCL (NVIDIA Collective Com...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Software Engineer, Performance Optimization

    Software Engineer, Performance Optimization

    Fireworks Ai • Redwood City, California, United States
    serp_jobs.job_card.full_time
    Here at Fireworks, we’re building the future of generative AI infrastructure.Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference.We’...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    AI Infrastructure Engineer, Model Serving Platform

    AI Infrastructure Engineer, Model Serving Platform

    Scale AI, Inc. • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and product...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Lead Performance Modelling Engineer - Systems & Simulators

    Lead Performance Modelling Engineer - Systems & Simulators

    Flux • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    A leading technology company in San Francisco is seeking a Staff Performance Modelling Engineer to develop analytical and simulation models that drive architecture evolution.The ideal candidate wil...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Software Engineer, Scientific Models (Full-Stack)

    Software Engineer, Scientific Models (Full-Stack)

    Benchling • San Francisco, California, USA
    serp_jobs.job_card.full_time
    Biotechnology is rewriting life as we know it from the medicines we take to the crops we grow the materials we wear and the household goods that we rely on every day. But moving at the new speed of ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Software Engineer - Machine Learning- Publica by IAS

    Senior Software Engineer - Machine Learning- Publica by IAS

    Publica • San Francisco, California, United States
    serp_jobs.job_card.full_time
    At Publica, engineers have a unique opportunity to work on a platform that handles billions of requests per hour in one of the fastest growing areas in Ad Tech : Connected Television.Engineers at Pu...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, Machine Learning Infrastructure

    Software Engineer, Machine Learning Infrastructure

    Datologyai • Redwood City, California, United States
    serp_jobs.job_card.full_time
    Companies want to train their own large models on their own data.The current industry standard is to train on a random sample of your data, which is inefficient at best and actively harmful to mode...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Staff Software Engineer, Machine Learning

    Staff Software Engineer, Machine Learning

    Mlabs • Burlingame, California, United States
    serp_jobs.job_card.full_time
    Staff Software Engineer, Machine Learning.Burlingame, CA (On-site, 4 days a week).We are a rapidly growing AI company applying. We're looking for a highly skilled and experienced.Staff Software Engi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Software Engineer - Machine Learning

    Senior Software Engineer - Machine Learning

    Celonis • Redwood City, California, United States
    serp_jobs.job_card.full_time
    We're Celonis, the global leader in Process Intelligence technology and one of the world's fastest-growing SaaS firms.We believe there is a massive opportunity to unlock productivity by placing AI,...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Performance Modelling Engineer

    Performance Modelling Engineer

    PageBolt WordPress • San Francisco, CA, United States
    serp_jobs.job_card.permanent
    We’re searching for a Staff Performance Modelling Engineer to create and own the analytical and simulation models that steer OTPU architecture and software evolution. You will build functional simul...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Software Engineer (AI Performance)

    Software Engineer (AI Performance)

    Gimlet Labs • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Gimlet Labs is building the foundation for the next generation of AI applications.As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck.Gimlet is redefi...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer, Model Inference

    Software Engineer, Model Inference

    Openai • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Our team brings OpenAI’s most capable research and technology to the world through our products.We empower consumers, enterprise and developers alike to use and access our start-of-the-art AI model...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Software Engineer - Machine Learning Platform

    Software Engineer - Machine Learning Platform

    Snowflake • Menlo Park, California, United States
    serp_jobs.job_card.full_time
    The Snowflake Machine Learning Platform team’s mission is to enable customers to bring their ML / AI workload to Snowflake. Our customers want to leverage ML / AI to extract business values from ever in...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Software Engineer, Model Serving

    Senior Software Engineer, Model Serving

    Databricks Inc. • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted