- Evaluate scientific correctness, conceptual rigor, and depth of LLM-generated responses across biology and biomedical domains.
- Review outputs involving experimental design, data interpretation, and theoretical frameworks in biological sciences.
- Identify factual inaccuracies, reasoning errors, and conceptual misunderstandings in model outputs.
- Benchmark model performance on advanced biological and interdisciplinary research problems.
- Work independently and asynchronously using proprietary evaluation tools.
### Requirements
- PhD (candidate or recipient) or Master’s degree in Biology, Molecular Biology, Biochemistry, Biotechnology, Bioinformatics, or a closely related field.
- Strong command of graduate-level biological concepts, experimental reasoning, and data interpretation.
- Excellent written communication and analytical abilities.
- Ability to work independently in a remote, asynchronous setting.
### Role Details
- Part-time (20 hours / week)
- Remote and asynchronous work environment
- Flexible schedule to accommodate global contributors
### Compensation
- Contractor position via Mercor
- $20–$30 / hour, depending on expertise and domain depth
- Weekly payments through Stripe Connect
### About Mercor
Mercor is a San Francisco-based company connecting top professionals with leading AI initiatives. Investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey.