CUDA Kernel Optimizer ML Engineer

Mercor • San Francisco, California, USA

1) Role Overview

Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while maintaining correctness and reproducibility.

2) Key Responsibilities

Develop, tune, and benchmark CUDA kernels for tensor and operator workloads.

Optimize for occupancy, memory coalescing, instruction-level parallelism, and warp scheduling.

Profile and diagnose performance bottlenecks using Nsight Systems, Nsight Compute, and comparable tools.

Report performance metrics, analyze speedups, and propose architectural improvements.

Collaborate asynchronously with PyTorch Operator Specialists to integrate kernels into production frameworks.

Produce well-documented, reproducible benchmarks and performance write-ups.

3) Ideal Qualifications

Deep expertise in CUDA programming, GPU architecture, and memory optimization.

Proven ability to achieve quantifiable performance improvements across hardware generations.

Proficiency with mixed precision, Tensor Core usage, and low-level numerical stability considerations.

Familiarity with frameworks like PyTorch, TensorFlow, or Triton (not required but beneficial).

Strong communication skills and independent problem-solving ability.

Demonstrated open-source, research, or performance benchmarking contributions.

4) More About the Opportunity

Ideal for independent contractors who thrive in performance-critical, systems-level work.

Engagements focus on measurable, high-impact kernel optimizations and scalability studies.

Work is fully remote and asynchronous; deliverables are outcome-driven.

Access to shared benchmarking infrastructure and reproducibility tooling via Mercor support resources.

5) Compensation & Contract Terms

Typical range: $120-$250 / hour, depending on scope, specialization, and results achieved. Payments will be based on accepted task output rather than a flat hourly rate.

Structured as a contract-based engagement, not an employment relationship.

Compensation tied to measurable deliverables or agreed milestones.

Confidentiality, IP, and NDA terms as defined per engagement.

6) Application Process

Submit a brief overview of prior CUDA optimization experience, profiling results, or performance reports.

Include links to relevant GitHub repos, papers, or benchmarks, if available.

Indicate your hourly rate, time availability, and preferred engagement length.

Selected experts may complete a small paid pilot kernel optimization project.

7) About Mercor

Mercor connects domain experts with top AI research and technology organizations through project-based contracts.

Contractors operate independently with full flexibility over methods, timelines, and tools.

Our mission is to help top engineers and researchers access frontier technical work without rigid employment structures.

Employment Type : Full Time

Experience : years

Vacancy : 1
