Senior Solution Architect – AI / GPU Cloud
We are seeking a Senior Solution Architect to design GPU‑cloud and AI infrastructure solutions, lead PoCs and benchmarks, guide customers through deployment, and partner closely with engineering and operations teams at GMI Cloud.
About GMI Cloud
GMI Cloud is a fast‑growing AI infrastructure company backed by Headline VC. We operate hundreds of megawatts of AI‑ready data center capacity across North America and a growing AI Factory footprint in Asia, delivering a full spectrum of services from GPU compute to AI model inference API solutions. As an NVIDIA Reference Platform Cloud Partner, our infrastructure meets the highest standards for performance, security, and scalability in AI deployments.
Role Overview
As a Solution Architect, you will be the primary technical interface for our enterprise and hyperscaler accounts and help customers build AI without limits.
Key Responsibilities
- Serve as the primary technical point‑of‑contact for enterprise and hyperscaler customers.
- Deeply understand customer AI / ML / HPC workloads, scaling requirements, and deployment models.
- Architect GPU clusters, storage, networking, and orchestration solutions tailored to customer needs.
- Lead proof‑of‑concepts, benchmarks, and workshops demonstrating performance, reliability, and scalability.
- Produce technical proposals, architecture diagrams, capacity plans, and cost / performance recommendations.
- Translate complex technical issues into clear actions for both engineering and business stakeholders.
- Guide customers through onboarding, cluster setup, performance tuning, and scaling.
- Partner with internal infra, DC ops, and engineering teams to ensure smooth delivery and implementation.
- Identify optimization opportunities in customer workloads (GPU utilization, networking, scheduling, cost).
- Act as a trusted advisor on GPU / AI infrastructure best practices, roadmap, and long‑term planning.
- Maintain regular technical check‑ins, capacity reviews, and performance reviews with customers.
- Gather customer feedback and collaborate with product / engineering to improve our platform.
Required Qualifications
Technical Background
5–10+ years in cloud infrastructure, GPU cloud, HPC, AI / ML infrastructure, or data center engineering.Strong understanding of distributed training & inference architectures, Kubernetes, Slurm, or other cluster / orchestration systems, NVIDIA GPU stack (H100 / H200 / B200 / GB200 or similar), InfiniBand / high‑speed networking, and storage architectures for AI workloads.Customer‑Facing Skills
Experience working directly with enterprise or hyperscaler technical teams.Ability to simplify complex infra concepts for both technical and non‑technical audiences.Strong communication, solution‑design, and project coordination skills.Soft Skills
Self‑starter, ownership mindset, excellent follow‑through.Comfortable working in a fast‑moving, high‑growth environment.Strong problem‑solving and “architect + advisor” mentality.Preferred Qualifications (Nice to Have)
Hands‑on with large‑scale GPU deployments (multi‑node, multi‑cluster).Exposure to hyperscaler capacity planning or AI infrastructure procurement teams.Experience with multi‑region or global GPU deployments (US + APAC / Taiwan).Why Join GMI Cloud
Work directly with some of the world’s most advanced AI organizations.Architect and deliver multi‑MW GPU clusters at global scale.Influence product roadmap and partner closely with NVIDIA and top‑tier data center providers.High‑impact role with significant ownership and career growth.#J-18808-Ljbffr