Research Engineer, Frontier Evals & Environments - FinanceOpenAI • San Francisco, CA, United States

Research Engineer, Frontier Evals & Environments - Finance

OpenAI • San Francisco, CA, United States

job_description.job_card.variable_days_ago

serp_jobs.job_preview.job_type

serp_jobs.job_card.full_time

job_description.job_card.job_description

Research Engineer, Frontier Evals & Environments - Finance

Join to apply for the Research Engineer, Frontier Evals & Environments - Finance role at OpenAI

About The Team

The Frontier Evals team builds north star model evaluations to drive progress towards safe AGI / ASI. This team builds ambitious evaluations to measure and steer our models, and creates self‑improvement loops to steer our training, safety, and launch decisions. Some of the team's open‑sourced evaluations include SWE‑bench Verified, MLE‑bench, PaperBench, and SWE‑Lancer, and the team built and ran frontier evaluations for GPT4o, o1, o3, GPT 4.5, ChatGPT Agent, and GPT5. If you are interested in feeling firsthand the fast progress of our models, and steering them towards good, this is the team for you.

About You

We seek exceptional research engineers that can push the boundaries of our frontier models in the finance domain. We are looking for those who will help shape AI evaluations of financial reasoning and related capabilities, and will own individual threads within this endeavor end‑to‑end.

In This Role, You'll

Identify important model capabilities, skills, and behaviors that are crucial to financial workflows, and design methods to quantify performance in these areas
Own and pursue a research agenda to identify an important model capability (especially as it relates to financial reasoning) and build evals to measure it
Continuously refine evaluations of frontier AI models to assess the extent of frontier capabilities

We Expect You To

Have strong engineering and statistical analysis skills (with at least 2–3 years of full‑time technical experience)

Be passionate about Excel spreadsheets and / or finance

Be detail‑oriented and thorough

Be a team player / willing to do a variety of tasks to move the team forward

Be passionate and knowledgeable about AGI / ASI measurement

Be able to operate effectively in a dynamic and extremely fast‑paced research environment as well as scope and deliver projects end‑to‑end

It Would Be Great If You Also Have

Prior background / domain expertise in finance, especially investment banking or private equity (e.g., through internships, prior jobs)

An ability to work cross‑functionally

Excellent communication skills

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general‑purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement .

Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers : we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment : protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non‑public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non‑compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Compensation Range : $200K - $370K

#J-18808-Ljbffr

serp_jobs.job_alerts.create_a_job

Research Engineer • San Francisco, CA, United States

Job_description.internal_linking.related_jobs

Clinical Research Associate II - Temporary

Bio-Rad Laboratories • Hercules, CA, United States

serp_jobs.job_card.full_time

As a CRA at Bio-Rad, you will play a vital role in ensuring the successful conduct of clinical trials from initiation to closeout. You will collaborate with numerous investigators, study coordinator...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Founding Audio AI Research Engineer

David AI • San Francisco, California, United States

serp_jobs.job_card.full_time

David AI is the first audio data research company.We bring an R&D approach to data–developing datasets with the same rigor AI labs bring to models. Speech is versatile, accessible, and.To unlock the...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Remote Geopolitics Forecaster - AI Trainer ($105-$125 per hour)

Mercor • Redwood City, California, US

serp_jobs.filters.remote

serp_jobs.job_card.full_time

Role Overview Mercor is collaborating with a leading AI lab on a cutting-edge research initiative involving top superforecasters from around the world. We’re seeking geopolitical experts to contribu...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted

Postdoctoral Employee - Department of Astronomy

University of California-Berkeley • Berkeley, CA, United States

serp_jobs.job_card.full_time

The UC postdoc salary scales set the minimum pay determined by experience level at appointment.See the following table for the current salary scale for this position : https : / / www.A reasonable estim...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Postdoc - Microbiome - Innovative Genomics Institute

InsideHigherEd • Berkeley, California, United States

serp_jobs.job_card.full_time

Postdoc - Microbiome - Innovative Genomics Institute.The UC postdoc salary scales set the minimum pay determined by experience level at appointment. See the following table(s) for the current salary...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted

Founding Research Engineer

Camfer • San Francisco, CA, United States

serp_jobs.job_card.full_time

At Camfer, our research engineers are training models to intelligently interpret and edit parametric CAD designs in 3D space. This is a cutting-edge challenge that requires innovative problem-solvin...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted

Research Engineer, AI Safety & Alignment

Character.ai • Redwood City, California, United States

serp_jobs.job_card.full_time

Joining us as a Research Engineer, you'll be at the forefront of tackling one of the most critical challenges in AI today : safety and alignment. Your work will be pivotal in understanding and mitiga...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Remote Finance Director - AI Trainer ($50-$60 / hour)

Data Annotation • Redwood City, California

serp_jobs.filters.remote

serp_jobs.job_card.full_time +1

We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Machine Learning Engineer

Skild AI • San Mateo, Pennsylvania, United States

serp_jobs.job_card.full_time

At Skild AI, we are building the world's first general purpose robotic intelligence that is robust and adapts to unseen scenarios without failing. We believe massive scale through data-driven machin...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Grants & Financial Analyst

University of California - Riverside • Oakland, CA, United States

serp_jobs.job_card.full_time

Reporting to the Financial Operations Manager, provide leadership and support for the financial management of the Departments of Biochemistry, Nematology, Plant Pathology and Microbiology, and the ...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Founding Research Engineer - Frontier AI & RL Systems

Appliedcompute • San Francisco, CA, United States

serp_jobs.job_card.full_time

A tech startup specializing in AI seeks a Founding Research Engineer to train large language models and develop novel methods for agentic training. This position demands expertise in ML frameworks l...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted

Founding Research Engineer

The LLM Data Company • San Francisco, CA, United States

serp_jobs.job_card.full_time

The LLM Data Company (YC X25) provides post-training data and RL environments to foundation model labs and frontier applied AI companies. Tier 1 VCs and are growing 200%+ month-over-month.Design and...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted

Forward Deployed AI Engineer

Datologyai • Redwood City, California, United States

serp_jobs.job_card.full_time

But a large portion of training compute is wasted training on data that are already learned, irrelevant, or even harmful, leading to worse models that cost more to train and deploy.At DatologyAI, w...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted

Research Engineer, Post-Training Evals

Lambda • San Francisco, California, United States

serp_jobs.job_card.full_time

In 2012, Lambda started with a crew of AI engineers publishing research at top machine-learning conferences.We began as an AI company built by AI engineers. Today, we're on a mission to be the world...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Research Engineer

Decagon • San Francisco, California, United States

serp_jobs.job_card.full_time

Decagon is building the most advanced conversational AI agents for the enterprise.Since starting the company, we've been on a tear, winning over customers like. Duolingo, Notion, Rippling, Eventbrit...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Research Engineer, Frontier Evals & Environments - Finance

OpenAI • San Francisco, CA, United States

serp_jobs.job_card.full_time

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted

Senior Research Engineer

FAR.AI • Berkeley, California, United States

serp_jobs.job_card.full_time

AI research institute dedicated to ensuring advanced AI is safe and beneficial for everyone.Our mission is to facilitate breakthrough AI safety research, advance global understanding of AI risks an...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted

Research Engineer

Anyscale • San Francisco, California, United States

serp_jobs.job_card.full_time

Ray in their tech stacks to accelerate the progress of AI applications out into the real world.With Anyscale, we’re building the best place to run Ray, so that any developer or data scientist can s...serp_jobs.internal_linking.show_more

serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted