DevOps Architect

Remote Full-time

About the position

Tenstorrent is leading the industry in cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists has developed a high-performance RISC-V CPU from scratch and shares a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
Tenstorrent is building multi-megawatt AI data centers with thousands of accelerators. We are seeking a DevOps Architect to define the next-generation cluster control plane that provisions, operates, and secures large-scale AI training and inference environments.
This is a foundational architecture role. You will define how clusters are configured, orchestrated, monitored, and secured at scale.
This role is hybrid, based out of Austin, TX; Santa Clara, CA; or Toronto, ON.
We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.

Responsibilities
• Define the end-to-end architecture for the AI cluster control plane, covering provisioning, configuration, lifecycle management, and monitoring
• Architect scalable systems for system, network, and storage provisioning across multi-thousand accelerator environments
• Establish telemetry, logging, metrics, tracing, and alerting frameworks with operational guardrails
• Define workload placement, resource allocation, scheduling, and preemption policies to maximize accelerator utilization
• Integrate authentication, authorization, account management, key management, backup, checkpointing, and DCIM infrastructure into a secure multi-tenant environment

Requirements
• 10+ years designing and operating enterprise, HPC, or large-scale data center infrastructure
• Deep expertise in cloud-native and bare-metal infrastructure management
• Strong hands-on experience with Infrastructure-as-Code tools such as Terraform, Ansible, and Helm
• Experience building and operating observability stacks including Prometheus, Grafana, ELK or EFK, and OpenTelemetry
• Strong understanding of networking, storage systems, accelerator resource management, and security models including RBAC, IAM, TLS, and secrets management

Benefits
• Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.
