Talent.com
AI/LLM Evaluation & Alignment Software Engineer
AI/LLM Evaluation & Alignment Software EngineerLeo Tech Services • Austin, TX, United States
serp_jobs.error_messages.no_longer_accepting
AI / LLM Evaluation & Alignment Software Engineer

AI / LLM Evaluation & Alignment Software Engineer

Leo Tech Services • Austin, TX, United States
job_description.job_card.variable_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

At LeoTech, we are passionate about building software that solves real-world problems in the Public Safety sector. Our software has been used to help the fight against continuing criminal enterprises, drug trafficking organizations, identifying financial fraud, disrupting sex and human trafficking rings and focusing on mental health matters to name a few.

Role

  • This is a remote, WFH role.
  • As an AI / LLM Evaluation & Alignment Engineer on our Data Science team, you will play a critical role in ensuring that our Large Language Model (LLM) and Agentic AI solutions are accurate, safe, and aligned with the unique requirements of public safety and law enforcement workflows. You will design and implement evaluation frameworks, guardrails, and bias-mitigation strategies that give our customers confidence in the reliability and ethical use of our AI systems. This is an individual contributor (IC) role that combines hands-on technical engineering with a focus on responsible AI deployment. You will work closely with AI engineers, product managers, and DevOps teams to establish standards for evaluation, design test harnesses for generative models, and operationalize quality assurance processes across our AI stack.

Core Responsibilities

  • Build and maintain evaluation frameworks for LLMs and generative AI systems tailored to public safety and intelligence use cases.
  • Design guardrails and alignment strategies to minimize bias, toxicity, hallucinations, and other ethical risks in production workflows.
  • Partner with AI engineers and data scientists to define online and offline evaluation metrics (e.g., model drifts, data drifts, factual accuracy, consistency, safety, interpretability).
  • Implement continuous evaluation pipelines for AI models, integrated into CI / CD and production monitoring systems.
  • Collaborate with stakeholders to stress test models against edge cases, adversarial prompts, and sensitive data scenarios.
  • Research and integrate third-party evaluation frameworks and solutions; adapt them to our regulated, high-stakes environment.
  • Work with product and customer-facing teams to ensure explainability, transparency, and auditability of AI outputs.
  • Provide technical leadership in responsible AI practices, influencing standards across the organization.
  • Contribute to DevOps / MLOps workflows for deployment, monitoring, and scaling of AI evaluation and guardrail systems (experience with Kubernetes is a plus).
  • Document best practices and findings, and share knowledge across teams to foster a culture of responsible AI innovation.
  • What We Value

  • Bachelor's or Master's in Computer Science, Artificial Intelligence, Data Science, or related field.
  • 3-5+ years of hands-on experience in ML / AI engineering, with at least 2 years working directly on LLM evaluation, QA, or safety.
  • Strong familiarity with evaluation techniques for generative AI : human-in-the-loop evaluation, automated metrics, adversarial testing, red-teaming.
  • Experience with bias detection, fairness approaches, and responsible AI design.
  • Knowledge of LLM observability, monitoring, and guardrail frameworks e.g Langfuse, Langsmith
  • Proficiency with Python and modern AI / ML / LLM / Agentic AI libraries (LangGraph, Strands Agents, Pydantic AI, LangChain, HuggingFace, PyTorch, LlamaIndex).
  • Experience integrating evaluations into DevOps / MLOps pipelines, preferably with Kubernetes, Terraform, ArgoCD, or GitHub Actions.
  • Understanding of cloud AI platforms (AWS, Azure) and deployment best practices.
  • Strong problem-solving skills, with the ability to design practical evaluation systems for real-world, high-stakes scenarios.
  • Excellent communication skills to translate technical risks and evaluation results into insights for both technical and non-technical stakeholders.
  • Technologies We Use

  • Cloud & Infrastructure : AWS (Bedrock, SageMaker, Lambda), Azure AI, Kubernetes (EKS), Terraform, ArgoCD.
  • LLMs & Evaluation : HuggingFace, OpenAI API, Anthropic, LangChain, LlamaIndex, Ragas, DeepEval, OpenAI Evals.
  • Observability & Guardrails : Langfuse, GuardrailsAI.
  • Backend & Data : Python (primary), ElasticSearch, Kafka, Airflow.
  • DevOps & Automation : GitHub Actions, CodePipeline.
  • What You Can Expect

  • Work from home opportunity
  • Enjoy great team camaraderie.
  • Thrive on the fast pace and challenging problems to solve.
  • Modern technologies and tools.
  • Continuous learning environment.
  • Opportunity to communicate and work with people of all technical levels in a team environment.
  • Grow as you are given feedback and incorporate it into your work.
  • Be part of a self-managing team that enjoys support and direction when required.
  • 3 weeks of paid vacation - out the gate!!
  • Competitive Salary.
  • Generous medical, dental, and vision plans.
  • Sick, and paid holidays are offered.
  • $135,000 - $160,000 a year

    Please note the national salary range listed in the job posting reflects the new hire salary range across levels and U.S. locations that would be applicable to the position. The final salary will be commensurate with the candidate's accepted hiring level and work location. Also, this range represents base salary only and does not include equity, or benefits if applicable.

    LeoTech is an equal opportunity employer and does not discriminate on the basis of any legally protected status.

    serp_jobs.job_alerts.create_a_job

    Software Engineer • Austin, TX, United States

    Job_description.internal_linking.related_jobs
    AIML Engineer

    AIML Engineer

    innovitusa • Austin, Texas, USA
    serp_jobs.job_card.full_time
    We are seeking an AI Engineer to drive innovation in our SDLC processes using artificial intelligence and automation.This role is ideal for an engineer passionate about automation and applying AI / M...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    AWS Development Engineer with AIML

    AWS Development Engineer with AIML

    Cortex consultants LLC • Austin, Texas, USA
    serp_jobs.job_card.full_time
    Title : AWS Development Engineer with AI / ML.Location : Austin TX Day 1 onsite only locals.Overall 10-15 years of experience in a fast paced dev. Solid programming with at least one software programmin...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    AI / ML Engineer

    AI / ML Engineer

    innovitusa • Austin, TX, Texas, USA
    serp_jobs.job_card.full_time
    MessageBody"> Hiring : W2 Candidates Only 🛂 Visa : Open to any visa type with valid work auth...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days
    UI Developer with AI / ML - W2 Contract

    UI Developer with AI / ML - W2 Contract

    Axiom Software Solutions Limited • Austin, TX, US
    serp_jobs.job_card.full_time
    serp_jobs.filters_job_card.quick_apply
    Position : UI Developer with AI / ML.Location : Sunnyvale, CA / Austin, TX – Need Locals Only.We are seeking a talented UI Developer with a strong background in Artificial Intelligence and Machine Lea...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30
    Remote Machine Learning Engineer - AI Trainer ($80-$120 per hour)

    Remote Machine Learning Engineer - AI Trainer ($80-$120 per hour)

    Mercor • Austin, Texas, US
    serp_jobs.filters.remote
    serp_jobs.job_card.part_time
    At Mercor, we’re building the talent engine that helps leading labs and research orgs move AI forward.Our latest initiative focuses on benchmarking and improving model performance and training spee...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Mitratech Holdings, Inc. • Austin, TX, United States
    serp_jobs.job_card.full_time
    At Mitratech, we are a team of.Legal, Risk, Compliance, and HR functions of companies the world over.We are a close-knit, globally dispersed team that thrives in an ecosystem that supports individu...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    ML Engineer

    ML Engineer

    Axiom Software Solutions Limited • Austin, TX, US
    serp_jobs.job_card.full_time
    serp_jobs.filters_job_card.quick_apply
    Location : Sunnyvale CA or Austin, TX (Hybrid).A 5-10 years experienced machine learning engineer to build efficient, data-driven artificial intelligence systems that advance our predictive automat...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30
    Remote Senior Machine Learning Engineer - LLM Evaluation / Task Creations (India Based) - AI Trainer ($21-$21 per hour)

    Remote Senior Machine Learning Engineer - LLM Evaluation / Task Creations (India Based) - AI Trainer ($21-$21 per hour)

    Mercor • Pflugerville, Texas, US
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Role Description • • Mercor is hiring on behalf of a leading AI research lab to bring on highly skilled • •Machine Learning Engineers • • with a proven record of building, training, and evaluating high-...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_hours • serp_jobs.job_card.promoted • serp_jobs.job_card.new
    GenAI NLP ML Engineer

    GenAI NLP ML Engineer

    Cloudious LLC • Austin, Texas, USA
    serp_jobs.job_card.full_time
    Operationalize complex ML and GenAI models.LLMs prompt engineering embeddings).Retrieval Augmented Generation.Lead Platform and DevOps : CI / CD containerization observability and environment automati...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    ML Engineer

    ML Engineer

    Catalyst Labs • Austin, Texas, USA
    serp_jobs.job_card.full_time
    Is a rapidly growing Tier 1 VC backed startup based in New York with $60 million in funding revolutionizing how outside sales and service teams work. Their AI technology captures and analyzes real-w...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    R&D Machine Learning Engineer (Engineering Scientist Associate)

    R&D Machine Learning Engineer (Engineering Scientist Associate)

    University of Texas at Austin • Austin, TX, United States
    serp_jobs.job_card.full_time
    R&D Machine Learning Engineer (Engineering Scientist Associate).Development of novel machine learning algorithms for application to sonar and underwater acoustics, as well as the accompanying data ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

    Remote CUDA Kernel Optimizer - ML Engineer - AI Trainer ($120-$250 per hour)

    Mercor • Pflugerville, Texas, US
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    Role Overview • • Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization, performance profiling, and numerical efficiency. These professionals possess a deep mental model o...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Remote Open Source Developers - AI Trainer ($90-$120 per hour)

    Remote Open Source Developers - AI Trainer ($90-$120 per hour)

    Mercor • Austin, Texas, US
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time
    We’re looking for • •open-source contributors • • and • •experienced engineers • • who understand how to review, maintain, and troubleshoot live repositories. Who You Are • • - An • •open-source developer or...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    AI ML Engineer

    AI ML Engineer

    Numentica LLC • Austin, Texas, USA
    serp_jobs.job_card.full_time
    Were looking for a sharp fast-moving AI / ML engineer who thrives in ambiguity and gets excited about building.Youll be tackling greenfield projects across various ML domains - whether thats NLP.The ...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Machine Learning Engineer - Sr. Consultant level

    Machine Learning Engineer - Sr. Consultant level

    Visa • Austin, TX, United States
    serp_jobs.job_card.full_time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    Lead Machine Learning Engineer

    Lead Machine Learning Engineer

    Visa • Austin, TX, United States
    serp_jobs.job_card.full_time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_30 • serp_jobs.job_card.promoted
    GenAI / Python Engineer (W2-Only) (Austin)

    GenAI / Python Engineer (W2-Only) (Austin)

    ClifyX • Austin, TX, US
    serp_jobs.job_card.part_time +1
    Visa (GC / USC) • • • • • • • • • • • • • • • • • • • • • • • • •.Location : SCV or Austin (Hybrid-Onsite).Strong understanding of LLMs, generative AI, and transformer-based architectures. Experience with Python, data analysis...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted
    Gen AI Architect

    Gen AI Architect

    Flexon Technologies Talent360.ai • Austin, TX, United States
    serp_jobs.job_card.full_time
    Location : Sunnyvale, CA or Austin, TX.Machine Learning Implementations.Experience in Structured Data Modelling, NLP, Time Series Modelling. Good Understanding of LLM concepts (RAG, Prompting, Few Sh...serp_jobs.internal_linking.show_more
    serp_jobs.last_updated.last_updated_variable_days • serp_jobs.job_card.promoted