Talent.com
Machine Learning Scientist/Sr Scientist, Federated Benchmarking & Validation Engineering
Machine Learning Scientist/Sr Scientist, Federated Benchmarking & Validation EngineeringEli Lilly • San Francisco, CA, US
serp_jobs.error_messages.no_longer_accepting
Machine Learning Scientist / Sr Scientist, Federated Benchmarking & Validation Engineering

Machine Learning Scientist / Sr Scientist, Federated Benchmarking & Validation Engineering

Eli Lilly • San Francisco, CA, US
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

JOB DESCRIPTION

At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We’re looking for people who are determined to make life better for people around the world.

Purpose

Lilly TuneLab is an AI-powered drug discovery platform that provides biotech companies with access to machine learning models trained on Lilly's extensive proprietary pharmaceutical research data. Through federated learning, the platform enables Lilly to build models on broad, diverse datasets from across the biotech ecosystem while preserving partner data privacy and competitive advantages. This collaborative approach accelerates drug discovery by creating continuously improving AI models that benefit both Lilly and our biotech partners.

The Machine Learning Scientist / Sr Scientist, Federated Benchmarking & Validation Engineering plays an essential role within the TuneLab platform, responsible for identifying, assessing, and implementing cutting-edge algorithmic solutions that leverage diverse datasets while ensuring data privacy and security for our biotech partners. This position requires comprehensive knowledge in small molecule drug development, ADME / Tox, antibody engineering, and / or genetic medicine, combined with expertise in data science and statistical analysis to develop sophisticated models utilizing federated learning. This position will be instrumental in advancing both Lilly's pipeline and our partners' drug discovery efforts by designing critical algorithms and workflows that expedite the creation of transformative therapies.

This role centers on constructing robust validation frameworks for federated models, creating privacy-preserving test sets across partner datasets, establishing standardized benchmarks against public datasets, and ensuring model reproducibility and generalization in diverse deployment scenarios.

Key Responsibilities

  • Federated Test Set Design : Architect and implement privacy-preserving protocols for constructing representative test sets across distributed partner datasets, ensuring statistical validity while maintaining data isolation.
  • Benchmark Suite Development : Create comprehensive benchmark suites covering small molecules (ADMET, solubility, permeability), antibodies (affinity, stability, immunogenicity), and RNA therapeutics (stability, delivery, off-target effects).
  • Cross-Domain Validation : Develop validation strategies that assess model generalization across different experimental protocols, cell lines, species, and therapeutic indications while respecting partner data boundaries.
  • Public Dataset Integration : Systematically benchmark federated models against public datasets (ChEMBL, PubChem, PDB, Therapeutic Antibody Database) to establish performance baselines and identify gaps.
  • Validation Frameworks : Implement time-split or proper scaffold-split validation protocols that assess model performance on prospective data, simulating real-world deployment scenarios and detecting concept drift.
  • Reproducibility Infrastructure : Build robust MLOps pipelines ensuring complete reproducibility of federated experiments, including versioning of data snapshots, model checkpoints, and hyperparameter configurations.
  • Statistical Rigor : Design statistically powered validation studies accounting for multiple testing, hierarchical data structures, and non-independent observations common in drug discovery datasets.
  • Performance Profiling : Develop comprehensive performance profiling across diverse molecular scaffolds, target classes, and property ranges, identifying systematic biases and failure modes.
  • Platform Integration : Collaborate with engineering teams to integrate validation frameworks with the TuneLab federated learning platform built on NVIDIA FLARE, ensuring scalable and automated testing across partner networks.

Basic Qualifications

  • PhD in Computational Biology, Bioinformatics, Cheminformatics, Computer Science, Statistics, or related field from an accredited college or university
  • Minimum of 2 years of experience in the biopharmaceutical industry or related fields, with demonstrated expertise in drug discovery and early development
  • Strong foundation in experimental design, statistical validation, and hypothesis testing
  • Experience with ML model validation, cross-validation strategies, and performance metrics
  • Proficiency in data engineering, pipeline development, and automation
  • Additional Preferences

  • Experience with federated learning platforms and distributed computing
  • Knowledge of regulatory requirements for AI / ML in pharmaceutical development
  • Expertise in ADMET assay development and validation
  • Understanding of antibody engineering and characterization methods
  • Familiarity with RNA therapeutic design and delivery systems
  • Experience with clinical biomarker validation and translational research
  • Proficiency in workflow orchestration tools (Airflow, Kubeflow, Prefect)
  • Strong knowledge of containerization and cloud computing (Docker, Kubernetes)
  • Publications on model validation, benchmarking, or reproducibility
  • Experience with GxP compliance and quality management systems
  • Exceptional attention to detail and commitment to scientific rigor
  • Strong technical writing skills for regulatory documentation
  • Portfolio mindset balancing rigorous validation with rapid deployment for partner value
  • This role is based at a Lilly site in Indianapolis, South San Francisco, or Boston with up to 10% travel (attendance expected at key industry conferences). Relocation is provided.

    Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form (

  • ) for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response.
  • Lilly is proud to be an EEO Employer and does not discriminate on the basis of age, race, color, religion, gender identity, sex, gender expression, sexual orientation, genetic information, ancestry, national origin, protected veteran status, disability, or any other legally protected status.

    Our employee resource groups (ERGs) offer strong support networks for their members and are open to all employees. Our current groups include : Africa, Middle East, Central Asia Network, Black Employees at Lilly, Chinese Culture Network, Japanese International Leadership Network (JILN), Lilly India Network, Organization of Latinx at Lilly (OLA), PRIDE (LGBTQ+ Allies), Veterans Leadership Network (VLN), Women’s Initiative for Leading at Lilly (WILL), enAble (for people with disabilities). Learn more about all of our groups.

    Actual compensation will depend on a candidate’s education, experience, skills, and geographic location.  The anticipated wage for this position is

    $151,500 - $244,200

    Full-time equivalent employees also will be eligible for a company bonus (depending, in part, on company and individual performance). In addition, Lilly offers a comprehensive benefit program to eligible employees, including eligibility to participate in a company-sponsored 401(k); pension; vacation benefits; eligibility for medical, dental, vision and prescription drug benefits; flexible benefits (e.g., healthcare and / or dependent day care flexible spending accounts); life insurance and death benefits; certain time off and leave of absence benefits; and well-being benefits (e.g., employee assistance program, fitness benefits, and employee clubs and activities).Lilly reserves the right to amend, modify, or terminate its compensation and benefit programs in its sole discretion and Lilly’s compensation practices and guidelines will apply regarding the details of any promotion or transfer of Lilly employees.

    #WeAreLilly

    Explore Location Close the popup

    Apply Now Save job

    serp_jobs.job_alerts.create_a_job

    Machine Learning Scientist • San Francisco, CA, US

    Job_description.internal_linking.related_jobs
    Manager, Machine Learning Research Scientist, GenAI

    Manager, Machine Learning Research Scientist, GenAI

    Scale AI, Inc. • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Scale AI accelerates the development of AI systems by providing the data, infrastructure, and tooling that power the most advanced models in the world. Our teams operate at the intersection of cutti...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Sr. Machine Learning Engineer (Recommendation Systems)

    Sr. Machine Learning Engineer (Recommendation Systems)

    Philo • San Francisco, California, United States
    serp_jobs.job_card.full_time
    At Philo, we’re a group of technology and product people who set out to build the future of television, marrying the best in modern technology with the most compelling medium ever invented — in sho...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Curai • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Curai Health is an AI-powered virtual clinic on a mission to improve access to care at scale.As the pioneer in deploying machine learning into clinical workflows, Curai Health enables its dedicated...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Jobot • San Francisco, CA, US
    serp_jobs.job_card.full_time
    Entry Level ML Engineer Needed for Growing AI Startup!.This Jobot Job is hosted by : Reed Kellick.Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume.Salary : ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Machine Learning Engineer / Researcher (SF)

    Senior Machine Learning Engineer / Researcher (SF)

    Medal, Highlight, & General Intuition • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Senior Machine Learning Engineer / Researcher.Highlight AI is a cutting-edge desktop assistant designed to enhance productivity by seamlessly integrating with your workflow.We empower users to inte...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Founding Machine Learning Engineer

    Founding Machine Learning Engineer

    Fermàt • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Commerce brands to transform clicks into conversions with highly-personalized , 1 : 1 dynamic shopping experiences.We've raised $30M+ to date and are backed by Bain Capital Ventures, Greylock, QED, a...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Suki • Redwood City, California, United States
    serp_jobs.job_card.full_time
    The Future of Healthcare Needs You.At Suki, we’re building technology that listens, understands, and gets out of the way — so clinicians can get back to being clinicians. AI to automate clinical doc...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Troveo Ai • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Troveo is building the next-generation data platform to train AI video models.Troveo offers the world’s largest library of AI video training data, featuring millions of hours of licensed video cont...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Orchard Robotics • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Series A startup backed by top VCs like Quiet Capital, Shine Capital, and General Catalyst.We're securing America’s food supply by building the AI farmer that automates our nation’s farms.We've rai...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Machine Learning Research Engineer - Robotics

    Machine Learning Research Engineer - Robotics

    Scale Ai • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Scale’s Robotics business unit is dedicated to solving the data bottleneck in Physical AI.This position will be a key contributor in conducting applied research in Robotics and developing ML pipeli...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Founding Data Scientist / Machine Learning Engineer

    Founding Data Scientist / Machine Learning Engineer

    Palladio AI • San Francisco Bay, California, United States
    serp_jobs.job_card.full_time
    Seeking Founding Data Scientists and Machine Learning Engineers.Imagine Multiplying Your Impact.You've unlocked major wins in your career - you've shipped models, moved key metrics, and proved what...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior / Staff Machine Learning Engineer

    Senior / Staff Machine Learning Engineer

    Dexterity • Redwood City, California, United States
    serp_jobs.job_card.full_time
    At Dexterity, we believe robots can positively transform the world.Our breakthrough technology frees people to do the creative, inspiring, problem-solving jobs that humans do best by enabling robot...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Rainesdev • San Francisco, California, United States
    serp_jobs.job_card.full_time
    We're a leading partner in social commerce, collaborating with major athletic wear, footwear, and electronics brands to expand their influencer -driven sales channels. Having achieved significant re...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Latent • San Francisco, California, United States
    serp_jobs.job_card.full_time
    Latent is building the intelligence infrastructure for American healthcare.Our products are already helping hospitals and clinics dramatically increase workflow output, speed up patient access to m...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Machine Learning Engineer South San Francisco CA

    Machine Learning Engineer South San Francisco CA

    Esrhealthcare • San Bruno, California, United States
    serp_jobs.job_card.full_time
    Machine Learning Engineer (Operations).South San Francisco CA (Hybrid, 3 days / week) (Not remote).Strong understanding of machine learning concepts, algorithms, and best practices.Proven experience ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Sr. Machine Learning Engineer, Applied Research Science

    Sr. Machine Learning Engineer, Applied Research Science

    Pinterest • San Francisco, CA, United States
    serp_jobs.job_card.full_time
    Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Data Scientist- Machine Learning

    Senior Data Scientist- Machine Learning

    Sofi • San Francisco, California, United States
    serp_jobs.job_card.full_time
    The Risk Data Science team is looking for a Sr Staff Data Scientist to develop advanced machine learning models, guide measurement, strategy, and data-driven decision making to support credit under...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Sr. Machine Learning Engineer

    Sr. Machine Learning Engineer

    Unity Technologies • San Francisco, California, United States
    serp_jobs.job_card.full_time
    At Unity, we’re committed to building a culture grounded in Empathy, Respect, and Opportunity.Within our fast-paced and collaborative environment, we’re tackling complex challenges that drive meani...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted