Talent.com
Research Engineer, Frontier Evals & Environments - Finance
OpenAI • San Francisco, CA, United States
Job type: Full-time

About The Team

The Frontier Evals team builds north star model evaluations to drive progress towards safe AGI / ASI. This team builds ambitious evaluations to measure and steer our models, and creates self‑improvement loops to steer our training, safety, and launch decisions. Some of the team's open‑sourced evaluations include SWE‑bench Verified, MLE‑bench, PaperBench, and SWE‑Lancer, and the team built and ran frontier evaluations for GPT-4o, o1, o3, GPT-4.5, ChatGPT Agent, and GPT-5. If you are interested in feeling firsthand the fast progress of our models, and steering them towards good, this is the team for you.

About You

We seek exceptional research engineers who can push the boundaries of our frontier models in the finance domain. We are looking for those who will help shape AI evaluations of financial reasoning and related capabilities, and will own individual threads within this endeavor end‑to‑end.

In This Role, You'll

  • Identify important model capabilities, skills, and behaviors that are crucial to financial workflows, and design methods to quantify performance in these areas
  • Own and pursue a research agenda to identify an important model capability (especially as it relates to financial reasoning) and build evals to measure it
  • Continuously refine evaluations of frontier AI models to assess the extent of frontier capabilities

We Expect You To

  • Have strong engineering and statistical analysis skills (with at least 2–3 years of full‑time technical experience)
  • Be passionate about Excel spreadsheets and / or finance
  • Be detail‑oriented and thorough
  • Be a team player / willing to do a variety of tasks to move the team forward
  • Be passionate and knowledgeable about AGI / ASI measurement
  • Be able to operate effectively in a dynamic and extremely fast‑paced research environment as well as scope and deliver projects end‑to‑end

It Would Be Great If You Also Have

  • Prior background / domain expertise in finance, especially investment banking or private equity (e.g., through internships, prior jobs)
  • An ability to work cross‑functionally
  • Excellent communication skills

About OpenAI

    OpenAI is an AI research and deployment company dedicated to ensuring that general‑purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

    We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

    For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.

    Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non‑public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

    To notify OpenAI that you believe this job posting is non‑compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

    We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

    OpenAI Global Applicant Privacy Policy

    At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

    Compensation Range: $200K - $370K
