HPC Cluster Architect

Remote Full-time

HPC Cluster Architect
Location: UK / Remote
Reporting to: Head of Infrastructure
Department: Infrastructure
ABOUT NEXGEN CLOUD:
NexGen Cloud is the company behind Hyperstack, a full-stack AI cloud serving tens of thousands of customers from AI researchers to enterprises running the world's most compute-intensive workloads. We deliver on-demand and private GPU infrastructure to teams who treat performance as a requirement, not a feature.
We're a tight-knit, fast-moving team working at the cutting edge of AI cloud infrastructure. We practice what we preach, equipping our people with AI at every level so we can solve harder problems, ship faster, and keep raising the bar for what enterprise GPU infrastructure looks like.
THE ROLE: HPC Cluster Architect
This role exists because NexGen Cloud is winning large-scale dedicated GPU cluster contracts and needs someone who can own the full architecture cycle — from first customer conversation to production deployment. This is a capability that doesn’t exist yet in a dedicated role; we’re building it now because the pipeline demands it.. You’ll have direct ownership over cluster architecture across compute, networking, storage, and physical design — translating customer requirements into production-ready, commercially optimised GPU deployments.
Role positioning: This is a senior hands-on role for someone who has lived and breathed HPC cluster design — and who wants to be the technical authority, not one voice in a committee. You’ll own designs end-to-end and see them go live.
WHAT YOU’LL BE DOING
Rather than a long checklist, here’s what success in this role looks like:

Own end-to-end cluster architecture for large-scale NVIDIA GPU deployments — from customer requirement through rack layouts, BOM, power and cooling design, to production handover
Design high-performance network fabrics across compute (InfiniBand, RDMA, NVLink/NVSwitch), storage, and WAN — defining topology, oversubscription models, and scaling strategies
Engage directly with OEMs and vendors — validating hardware configurations, reviewing quotes, and ensuring designs are both technically sound and commercially optimised
Provide technical oversight during deployment and bring-up — supporting hardware validation, performance testing, and acting as escalation point for complex integration issues
Act as a senior technical leader across Solutions Architecture, Cloud Engineering, and data centre partners — contributing to standardised reference designs and building out the HPC engineering function

ABOUT YOU:
We’re more interested in how you think and work than in a perfect CV. You’ll likely bring a combination of the following:
Essential

Proven experience designing and delivering GPU-based HPC or AI clusters at scale — covering the full lifecycle from design through procurement, deployment, and validation

Deep hands-on knowledge of NVIDIA GPU platforms (H100/H200/B-series) and NVIDIA reference architectures
Strong InfiniBand/RDMA design experience — topology, performance tuning, and high-performance Ethernet fabrics
Solid grounding in Linux systems, PCIe topology, NUMA alignment, and server-level performance considerations
Background from an OEM, hyperscaler, neo-cloud, or enterprise/research HPC environment — with demonstrable exposure to the full design-to-deployment lifecycle
Confident engaging with customers, vendors, OEMs, and internal engineering teams as a technical authority — able to translate complex design trade-offs into clear decisions

Nice to Have

Experience with Spectrum-X or next-generation Ethernet fabrics
Prior involvement in large-scale cluster deployments (1,000+ GPUs) and performance benchmarking (NCCL, MLPerf)
Exposure to both air-cooled and liquid-cooled HPC environments, and/or automation/infrastructure-as-code

WHAT WE OFFER

Competitive salary and annual discretionary bonus scheme
Employee wellbeing benefits
25 days of holiday, plus public holidays
Flexible working arrangements (remote or hybrid, depending on role and location)
Real ownership and autonomy, with the trust to take initiative and experiment
The opportunity to make a visible, meaningful impact as we scale
Clear career progression and growth opportunities in a fast-growing company
A collaborative, international culture built on trust, transparency, and ownership
The chance to help shape NexGen Cloud’s team, culture, and future alongside ambitious, mission-driven colleagues

MORE INFORMATION
Head over to our NexGen Cloud careers page to view current opening and follow us on LinkedIn and X to learn more about our journey, newest releases and hear exciting news in the neocloud space.

Apply Now

Experienced Remote Data Entry Specialist – Contributing to the Magic of blithequark through Accurate Data Management and Analysis

Remote Full-time

Experienced Remote Customer Service Representative – Delivering Exceptional Pet Parent Experiences for arenaflex in Kentucky

Remote Full-time

HPC Cluster Architect

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

USPS Office Helper

Amazon Delivery Driver

Experienced Entry-Level Data Entry Clerk – Remote Opportunity for Career Growth and Development

[Remote-Position] Certified Nursing Assistants (CNA) | Night

Experienced Customer Service Representative – Beverage Delivery Space

Experienced Full Stack Customer Service Representative – Medicaid Member Support

Experienced Remote Data Entry Specialist – Contributing to the Magic of blithequark through Accurate Data Management and Analysis

Experienced Remote Customer Service Representative – Delivering Exceptional Pet Parent Experiences for arenaflex in Kentucky

Experienced Full Stack Data Entry Specialist – Remote Opportunity at blithequark

Intermediate Product Manager

IT Field Support Specialist (HT I) (Government) Columbia, Maryland

HPC Cluster Architect

Similar Opportunities

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

USPS Office Helper

Amazon Delivery Driver

**Experienced Entry-Level Data Entry Clerk – Remote Opportunity for Career Growth and Development**

[Remote-Position] Certified Nursing Assistants (CNA) | Night

**Experienced Customer Service Representative – Beverage Delivery Space**

**Experienced Full Stack Customer Service Representative – Medicaid Member Support**

Experienced Remote Data Entry Specialist – Contributing to the Magic of blithequark through Accurate Data Management and Analysis

Experienced Remote Customer Service Representative – Delivering Exceptional Pet Parent Experiences for arenaflex in Kentucky

**Experienced Full Stack Data Entry Specialist – Remote Opportunity at blithequark**

Intermediate Product Manager

IT Field Support Specialist (HT I) (Government) Columbia, Maryland

Experienced Entry-Level Data Entry Clerk – Remote Opportunity for Career Growth and Development

Experienced Customer Service Representative – Beverage Delivery Space

Experienced Full Stack Customer Service Representative – Medicaid Member Support

Experienced Full Stack Data Entry Specialist – Remote Opportunity at blithequark