Talent.com
HPC Engineer

Avalore, LLC • Annapolis Junction, MD, US
Posted 30 days ago
Job type
  • Full-time
Job description

What you will be doing

Playing a key role in defining and operating some of the most complex compute platforms that the client brings to bear against complex problems. These systems enable complex analysis, simulation, and modeling, leveraging massively parallel computing and disparate holdings of very large data sets to answer difficult questions. To do this, you will assist users in deploying jobs to these systems and harnessing their capabilities to produce answers in the form of analytic products, models, and simulations. This mission enablement is at the heart of the hardest problems to solve.

  • Responsible for the normal day-to-day HPC operations and maintenance of the HPC systems
  • Provide day-to-day systems administration for NVIDIA GPUs, Commodity Cluster Systems and Cray HPC environments
  • Perform system monitoring, software installations, debugging, upgrades, health checks, and identification / implementation of automated business processes
  • Provide assessments, on-going performance analysis and recommendations for future architectures
  • Responsible for operating all the host systems for the analysis
  • Works in a liaison role, linking the analysts and their specialty codes and applications to the computing systems that are focused on yielding in-depth, technically sound results
  • Oversees analytic applications running on a clustered HPC fabric including CPU and GPU systems
  • Manage job submission for clients' applications and codes using MPI / OpenMPI
  • Provide in-depth analytic results to achieve a best-tool-for-the-job approach
  • Partners with data scientists, engineers, and analysts conducting specialized scientific and engineering analysis.
  • Escalate issues and problems to hardware support and / or engineering management as necessary
  • Responsible for continuous performance analysis and tuning the HPC environment
  • Assist with the identification, troubleshooting, and repair of software problems impacting performance of implemented HPC solutions
  • Perform installation of software patches including upgrades to operating systems and firmware
  • Assist with the resolution of trouble tickets and software problems identified by system users
  • Identify and expand services and functionalities offered in HPC environment
  • Be a primary point of contact to resolve any hardware or software malfunctions, including working with service personnel as necessary
  • Review system logs to identify and resolve software and systems related issues
  • Prepare reports related to the operational efficiency of the hardware and execution of users' jobs
  • Experience with MPI / OpenMPI, SLURM, and Linux Operating Systems essential
  • Prior experience as a Systems Administrator essential, with a preference for experience working with clustered systems including GPUs in the hardware stack
  • Experience with high-speed networking and CUDA preferred
  • Software integration experience a plus
  • Other duties could be required to support the customer’s mission

Requirements

  • Minimum of 6 years demonstrated on-the-job experience
  • Demonstrated on-the-job experience with integrating functionality from disparate systems via scripting / tooling / automation
  • Demonstrated on-the-job experience with the Sponsor's system security environment and requirements
  • Demonstrated experience leading systems architecture, operations, maintenance and administration
  • Clearance: Active TS / SCI with an appropriate current polygraph is required to be considered for this role. Ability to receive privileged access rights.

Benefits

Eligibility requirements apply.

  • Employer-Paid Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (401k, IRA) with a generous matching program
  • Life Insurance (Basic, Voluntary & AD&D)
  • Paid Time Off (Vacation, Sick & Public Holidays)
  • Short Term & Long Term Disability
  • Training & Development
  • Employee Assistance Program