Talent.com
Senior Manager, Professional Services HPC Deployment
No longer accepting applications
NVIDIA • Remote, TX, US
Posted 30 days ago
Job type
  • Full-time
  • Remote
Job description

NVIDIA is in search of an HPC Deployment Manager to bolster its Professional Services division. Across academia and industry, NVIDIA's products are driving ground-breaking advancements in deep learning, data analytics, and the optimization of data centers. Join our team, where we are at the forefront of building some of the world's largest and fastest data centers! We seek an individual capable of supervising the deployment of cutting-edge InfiniBand and Ethernet technologies with a team of AI and HPC experts. This role demands strong interpersonal abilities and a customer-centric approach.

The chosen candidate will engage with clients, collaborators, and internal units to assess, delineate, and complete large-scale AI / HPC initiatives. They will orchestrate the day-to-day operations, guidance, and cultivation of a multi-layered team of HPC service professionals. This entails ensuring the timely delivery of a varied spectrum of AI HPC data center projects. Furthermore, this role offers an opportunity to thrive within a fast-paced, inventive, and technologically sophisticated atmosphere, emphasizing unparalleled performance and the exploration of an array of novel hardware and software technologies in AI supercomputing.

What you will be doing:

Directs and supervises the HPC service engineering functions in designing, developing, installing, and validating hardware and software for customer AI high-performance computing (HPC) systems.

Leads, manages, mentors, and builds a high-performing HPC service engineering team to deliver innovative advances in high-performance computing AI systems.

Leads the planning, implementation, and performance of our HPC projects. Improves the integrity of system service bring-up and related activities by applying groundbreaking technical and operational knowledge to configure and maintain HPC AI network and server platforms.

Drives the HPC team's hardware and software deployment; plans, develops, and deploys procedures for system validation.

Leads team activities and drives test plans, custom scripts, and testing procedures for customers' HPC AI system implementations to ensure operational reliability.

Supports the HPC Engineering team, working with other internal collaborators to develop and run a well-rounded strategy for delivering service quality and continuous service improvement. Supports governance for software engineering through the implementation of standards and quality measures.

Leads team member development, helping them set and achieve goals for their career growth. Develops an inclusive environment that values team member differences, creating a sense of belonging and appreciation. Contributes to a culture of trust and clarity.

Builds strong relationships with NVIDIA leaders, customers, partners, and collaborators. Works closely with them to identify, implement, and support NVIDIA's leading AI solutions engineering, staying current with industry standards and innovations. Provides input on process optimization, department budgeting, and the monitoring and management of resources.

Serves as the domain authority with customers, from planning calls through implementation.

What we need to see:

8+ years of overall experience in IT, high-performance computing, or another related field; 3+ years of experience in a management or leadership role.

Demonstrated expertise in HPC system design, configuration, and planning.

Proficiency with low-latency / high-bandwidth interconnect infrastructure (InfiniBand and Ethernet).

Expertise with HPC system software, including cluster management / provisioning tools and job schedulers (Slurm, Salt, xCAT).

Proficiency with shared and distributed memory parallelism (OpenMP, MPI, NCCL and HPL) and accelerators (GPUs).

Strong scripting ability (Bash, Perl, Python, etc.) and experience with programming fundamentals.

Expertise in administering, supervising, and maintaining secure Linux / Unix operating systems (CentOS, Solaris).

Experience establishing processes for maintaining system performance, managing best-in-class standards, and familiarity with cloud computing and container technologies.

Ability to understand and work with large, sophisticated systems, identify and resolve problems, manage performance, and troubleshoot infrastructure-related network issues.

Expertise with multi-vendor hardware / software management, security, and network / Internet protocols.

Strong communication and social skills, with the ability to provide detailed information and high-level summaries to management-level individuals and groups, present the business side of technical topics to non-technical audiences, and develop positive working relationships and strong rapport with team members.

Bachelor's degree in computer science, information systems, or a related field, or equivalent experience.

Solid knowledge of HPC storage.

Exemplary communication and interpersonal skills, with the ability to present the business side of technical topics to non-technical audiences and to communicate persuasively and build effective relationships with various stakeholders and diverse individuals and groups.

Ways to stand out from the crowd:

InfiniBand experience.

Experience with GPU-focused hardware / software.

Experience with MPI.

Automation tooling background (Ansible, Salt, Puppet, etc.).

Experience with Ethernet and storage technologies such as Lustre or GPFS.

The base salary range is 208,000 USD - 327,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

