General Information
Req # : WD00091301
Career area : Hardware Engineering
Country / Region : United States of America
State : North Carolina
City : Morrisville
Date : Friday, November 14, 2025
Working time : Full-time
Additional Locations :
Why Work at Lenovo
We are Lenovo. We do what we say. We own what we do. We WOW our customers.
Lenovo is a US$69 billion revenue global technology powerhouse, ranked #196 in the Fortune Global 500, and serving millions of customers every day in 180 markets. Focused on a bold vision to deliver Smarter Technology for All, Lenovo has built on its success as the world's largest PC company with a full-stack portfolio of AI-enabled, AI-ready, and AI-optimized devices (PCs, workstations, smartphones, tablets), infrastructure (server, storage, edge, high-performance computing, and software-defined infrastructure), software, solutions, and services. Lenovo's continued investment in world-changing innovation is building a more equitable, trustworthy, and smarter future for everyone, everywhere. Lenovo is listed on the Hong Kong stock exchange under Lenovo Group Limited (HKSE: 992) (ADR: LNVGY).
To find out more, visit www.lenovo.com and read the latest news via our StoryHub.
The Lenovo AI Technology Center (LATC), Lenovo's global AI Center of Excellence, is driving our transformation into an AI-first organization. We are assembling a world-class team of researchers, engineers, and innovators to position Lenovo and its customers at the forefront of the generational shift toward AI. Lenovo is one of the world's leading computing companies, delivering products across the entire technology spectrum, spanning wearables, smartphones (Motorola), laptops (ThinkPad, Yoga), PCs, workstations, servers, and services and solutions. This unmatched breadth gives us a unique canvas for AI innovation, including the ability to rapidly deploy cutting-edge foundation models and to enable flexible, hybrid-cloud, and agentic computing across our full product portfolio.
To this end, we are building the next wave of AI core technologies and platforms that leverage and evolve with the fast-moving AI ecosystem, including novel model and agentic orchestration and collaboration across mobile, edge, and cloud resources. This space is evolving fast, and so are we. If you're ready to shape AI at a truly global scale, with products that touch every corner of life and work, there's no better time to join us. #LATC
Description and Requirements
Lenovo is seeking a Software Engineer to lead the integration and implementation of Small Language Model (SLM) inferencing for our next-generation AI systems on our Windows laptop and desktop computers. If you are passionate about making Smarter Technology for All, come help us realize our Hybrid AI vision!
Responsibilities :
Design, implement, and maintain core agent runtimes responsible for dynamic model loading and lifecycle management; scheduling, prioritization, and queuing of model inference requests; and information retrieval, data preprocessing, and context preparation pipelines
Develop system integrations to support interoperability between Windows applications, services, and AI runtime components
Implement security and privacy controls, including process isolation, sandboxing, audit logging, and compliance with enterprise-grade software security standards
Optimize runtime performance for latency, throughput, and memory footprint across heterogeneous compute platforms (CPU, GPU, NPU) and across various vendor AI frameworks (such as OpenVINO, Ryzen AI, and QNN)
Embed safety and interpretability features into the stack, including guardrails, telemetry, and explainability mechanisms
Contribute to platform reliability, including fault tolerance, error handling, and automated recovery strategies
Collaborate with cross-functional teams (AI researchers, product managers, QA, and DevOps) to deliver robust, production-ready solutions
Mentor junior engineers by providing technical guidance, conducting code reviews, and supporting onboarding
Author technical documentation and best practices, contributing to internal knowledge bases, design proposals, and architecture reviews
Required Qualifications :
Expertise in Windows development, including Win32 APIs, DLL development, and system-level programming
Strong understanding of Windows software security best practices (e.g., secure coding, privilege management, sandboxing)
Experience with GGML, GGUF, and llama.cpp for local model inference
Familiarity with ONNX, OpenVINO, Ryzen AI, and QNN runtimes
Proficiency in C/C++ for client and systems development
Working knowledge of Python and deep learning frameworks such as PyTorch
Strong problem-solving and debugging skills in complex, multi-threaded environments
Preferred Qualifications :
Experience with Kotlin Multiplatform (KMP) or other cross-platform development frameworks
Contributions to open-source AI runtimes, libraries, or Windows utilities
Background in performance optimization, compiler toolchains, or low-level hardware acceleration
We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, religion, sexual orientation, gender identity, national origin, status as a veteran, disability, or any other federal, state, or local protected class.
AI Engineer • Morrisville, NC, United States