Software Engineer - Model PerformanceAI Fund • San Francisco, CA, United States

serp_jobs.error_messages.no_longer_accepting

Software Engineer - Model Performance

AI Fund • San Francisco, CA, United States

job_description.job_card.variable_days_ago

serp_jobs.job_preview.job_type

serp_jobs.job_card.full_time

job_description.job_card.job_description

Overview

Join to apply for the Software Engineer - Model Performance role at AI Fund .

Are you passionate about advancing the application of artificial intelligence? We are looking for a Software Engineer focused on ML performance to join our dynamic team. This role is ideal for someone who thrives in a fast-paced startup environment and is eager to make significant contributions to the exciting field of LLM inference. If you are a backend engineer who thrives on making things faster and is excited about open-source ML models, we look forward to your application.

About Baseten

Baseten provides the infrastructure, tooling, and expertise needed to bring great AI products to market - fast. Backed by top investors including

Responsibilities

Implement, refine, and productionize cutting-edge techniques (quantization, speculative decoding, kv cache reuse, chunked prefill and LoRA) for ML model inference and infrastructure.
Deep dive into underlying codebases of TensorRT, PyTorch, TensorRT-LLM, vllm, sglang, CUDA, and other libraries to debug ML performance issues.
Apply and scale optimization techniques across a wide range of ML models, particularly large language models.
Collaborate with a diverse team to design and implement innovative solutions.
Own projects from idea to production.

Qualifications

Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field.

Experience with one or more general-purpose programming languages, such as Python or C++.

Familiarity with LLM optimization techniques (e.g., quantization, speculative decoding, continuous batching).

Strong familiarity with ML libraries, especially PyTorch, TensorRT, or TensorRT-LLM.

Demonstrated interest and experience in LLMs.

Deep understanding of GPU architecture.

Bonus :

Proficiency in enhancing the performance of software systems, particularly in the context of large language models (LLMs).

Experience with CUDA or similar technologies.

Deep understanding of software engineering principles and a proven track record of developing and deploying AI / ML inference solutions.

Experience with Docker and Kubernetes.

Benefits

Competitive compensation package.

This is a unique opportunity to be part of a rapidly growing startup in one of the most exciting engineering fields of our era.

An inclusive and supportive work culture that fosters learning and growth.

Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

Job Details

Seniority level : Entry level

Employment type : Full-time

Job function : Engineering and Information Technology

Industries : Venture Capital and Private Equity Principals

Location : San Francisco, CA

Salary : $160,000.00-$180,000.00 per year

#J-18808-Ljbffr

serp_jobs.job_alerts.create_a_job

Engineer Performance • San Francisco, CA, United States

Job_description.internal_linking.related_jobs

Senior Software Engineer - Infrastructure, Machine Learning

Baton • San Francisco, California, United States

serp_jobs.job_card.full_time

With $10B in freight under management, our technology reaches every part of the U.We design and ship category-defining software that enables Ryder and its 50,000+ customers—including some of the wo...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Senior Software Engineer, Machine Learning

Planet Labs PBC • San Francisco, CA, United States

serp_jobs.job_card.full_time

We believe in using space to help life on Earth.Planet designs, builds, and operates the largest constellation of imaging satellites in history. This constellation delivers an unprecedented dataset ...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted

Software Engineer, Performance Optimization

Fireworks Ai • Redwood City, California, United States

serp_jobs.job_card.full_time

Here at Fireworks, we’re building the future of generative AI infrastructure.Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference.We’...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Software Engineer, Fullstack

Ambient.ai • Redwood City, California, United States

serp_jobs.job_card.full_time

Build a safer world with us, one incident at a time.AI-powered physical security platform helping the world’s leading enterprises reduce risk, improve operational efficiency, and gain critical insi...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted

Fullstack Software Engineer, Pricing Lifecycle

Orb • San Francisco, California, United States

serp_jobs.job_card.full_time

Orb is transforming how modern AI and software companies monetize at scale.We've built the next-generation billing infrastructure that turns complex usage-based pricing into competitive advantage.O...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted

Software Engineer

Confidential • Redwood City, California, United States

serp_jobs.job_card.full_time

C3 Energy is looking for data engineers to develop and implement the next generation of analytics for the smart grid.We are building a platform able to handle the extremely large amount of data gen...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

AI Infrastructure Engineer, Model Serving Platform

Scale AI, Inc. • San Francisco, CA, United States

serp_jobs.job_card.full_time

As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and product...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Senior Software Engineer - Machine Learning Platform

Snowflake • Menlo Park, California, United States

serp_jobs.job_card.full_time

The Snowflake Machine Learning Platform team’s mission is to enable customers to bring their machine learning and deep learning workloads to Snowflake. Our customers want to build powerful models wi...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Senior Software Engineer - Machine Learning- Publica by IAS

Publica • San Francisco, California, United States

serp_jobs.job_card.full_time

At Publica, engineers have a unique opportunity to work on a platform that handles billions of requests per hour in one of the fastest growing areas in Ad Tech : Connected Television.Engineers at Pu...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Software Engineer (Fullstack)

Onecrew • San Francisco, California, United States

serp_jobs.job_card.full_time

OneCrew is the leading unified platform helping paving contractors estimate accurately, manage crews effectively, and track profitability in real-time. We eliminate the costly mistakes and wasted ti...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Software Engineer (AI Performance)

Gimlet Labs • San Francisco, California, United States

serp_jobs.job_card.full_time

Gimlet Labs is building the foundation for the next generation of AI applications.As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck.Gimlet is redefi...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Software Engineer, Machine Learning Infrastructure

Datologyai • Redwood City, California, United States

serp_jobs.job_card.full_time

Companies want to train their own large models on their own data.The current industry standard is to train on a random sample of your data, which is inefficient at best and actively harmful to mode...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Software Engineer, Model Inference

Openai • San Francisco, California, United States

serp_jobs.job_card.full_time

Our team brings OpenAI’s most capable research and technology to the world through our products.We empower consumers, enterprise and developers alike to use and access our start-of-the-art AI model...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Staff Software Engineer, Machine Learning

Mlabs • Burlingame, California, United States

serp_jobs.job_card.full_time

Staff Software Engineer, Machine Learning.Burlingame, CA (On-site, 4 days a week).We are a rapidly growing AI company applying. We're looking for a highly skilled and experienced.Staff Software Engi...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Senior Software Engineer - Machine Learning

Celonis • Redwood City, California, United States

serp_jobs.job_card.full_time

We're Celonis, the global leader in Process Intelligence technology and one of the world's fastest-growing SaaS firms.We believe there is a massive opportunity to unlock productivity by placing AI,...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted

Senior Machine Learning Engineer (Modeling), Support

Block • San Francisco, California, United States

serp_jobs.job_card.full_time

Block is one company built from many blocks, all united by the same purpose of economic empowerment.The blocks that form our foundational teams — People, Finance, Counsel, Hardware, Information Sec...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Software Engineer (ML Platform)

Anyscale • San Francisco, California, United States

serp_jobs.job_card.full_time

Ray in their tech stacks to accelerate the progress of AI applications out into the real world.With Anyscale, we’re building the best place to run Ray, so that any developer or data scientist can s...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Backend Engineer, Models

Meter • San Francisco, California, United States

serp_jobs.job_card.full_time

Networking is one of the most fundamental industries in all of technology.For the first time, Meter has unified the full networking stack. and now we are making it autonomous.Meter Zero is our neur...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted