AI Infrastructure You Actually Own

Q: When will Alpha Cube ship?

First-run units are scheduled to begin shipping later this year. Reservations are filled in queue order; we'll confirm a delivery window with you once your unit enters assembly.

Q: How does the air-gapped security actually work?

Alpha Cube ships with no wireless radios, and the only network surface is a wired Ethernet jack you control. Inbound files move through an air-lock workflow before release into the live cube. Authentication is local, so an outside service cannot lock you out of your own compute.

Q: Which models does Vault OS support?

Vault OS runs frontier open-weight models locally, including Llama, Mistral, and Qwen, and accepts your own fine-tunes. It also manages agentic AI, model deployment, chat sessions, and background jobs such as training, evaluation, and ingestion.

Q: What is the refund or cancellation policy for pre-orders?

Pre-order deposits are non-refundable. They reserve your place in the production queue and fund the long-lead components your unit will ship with. The remaining balance is invoiced before shipment.

Q: How does the ROI compare to cloud API spending?

For a team of heavy AI users, a single Alpha Cube typically pays for itself within the first 12 to 18 months versus blended cloud spend, then continues running at zero marginal token cost for the life of the hardware.
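As a rough check on that range, the sketch below divides a tier's list price by the monthly cloud spend it would replace. It uses figures quoted elsewhere on this page ($42,950 and $56,950 list prices, $300 per heavy user per month, 10 and 20 heavy users per cube). The function name is illustrative, and ignoring power, maintenance, and staff time is a simplifying assumption rather than anything the page states.

```python
def payback_months(hardware_cost: float, users: int,
                   spend_per_user_month: float = 300.0) -> float:
    """Months until cumulative cloud spend equals the one-time hardware cost.

    Assumption: cloud spend is flat per user per month and the cube has
    no running costs. Both are simplifications for illustration only.
    """
    if users <= 0 or spend_per_user_month <= 0:
        raise ValueError("users and spend_per_user_month must be positive")
    return hardware_cost / (users * spend_per_user_month)


if __name__ == "__main__":
    print(f"Alpha Cube, 10 users: {payback_months(42_950, 10):.1f} months")
    print(f"Alpha Cube Pro, 20 users: {payback_months(56_950, 20):.1f} months")
```

Under these defaults a fully used Alpha Cube breaks even after roughly 14.3 months, inside the 12 to 18 month range. A team with fewer than 10 heavy users would take proportionally longer.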

Q: What happens if a cube fails or needs service?

The internal chassis slides out on a rail system for direct access to components. The enclosure is designed for serviceability without specialized tools.

Vault delivers AI compute at your fingertips, built to the same standard as the data it protects. Plug it into your network, run cutting-edge AI models locally, and eliminate data exposure from the outside world.

Pre-order → See the platform

Threat Model

Your AI. Your Data. Your Network.

Every cloud-based AI model you use sends your prompts, your documents, and your proprietary data to infrastructure you do not control. Once that data leaves your hands, you cannot pull it back. For regulated industries, that exposure can mean loss of attorney-client privilege, compliance drift, or disclosure of financial data and medical records. Vault Alpha Cube puts the compute in your business, behind your firewall, and under your authority.

Physical Isolation

No wireless radios. Ethernet only.

Air-lock Hardware Quarantine

Inbound media is staged, scanned, and released into the cube on your terms.
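The stage, scan, release sequence can be pictured with a small sketch. This is not Vault OS code: the function names, the directory layout, the `scan` callback, and the explicit `approved` flag are all hypothetical, chosen only to show a file held in staging until a scan passes and an operator releases it.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path
from typing import Callable, Optional


def sha256_of(path: Path) -> str:
    """Hash a file in chunks so large media does not have to fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def release_if_clean(staged: Path, live_dir: Path,
                     scan: Callable[[Path], bool],
                     approved: bool) -> Optional[Path]:
    """Move a staged file into live_dir only if it is approved and scans clean.

    Hypothetical policy: both conditions must hold, otherwise the file
    stays in staging and None is returned.
    """
    if not (approved and scan(staged)):
        return None
    live_dir.mkdir(parents=True, exist_ok=True)
    target = live_dir / staged.name
    shutil.move(str(staged), str(target))
    return target


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        staged = Path(tmp) / "staging" / "contract.pdf"
        staged.parent.mkdir()
        staged.write_bytes(b"example payload")
        print("staged digest:", sha256_of(staged))
        # Toy scan for illustration: reject executables by extension.
        released = release_if_clean(staged, Path(tmp) / "live",
                                    scan=lambda p: p.suffix != ".exe",
                                    approved=True)
        print("released to:", released)
```

Recording the digest before release gives an operator something to compare against later. A real scanner would replace the toy extension check.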

Physical Access Control

Locking chassis. Tamper-evident seals.

Local Authentication

No SSO round-trips through someone else's identity provider.

Vault OS · Agent

Meet GEM, an agent you can trust.

GEM is the agent surface of Vault OS. It reads your private files, drives background jobs, trains and evaluates models, and answers in your own language, all without your data ever leaving the cube.

Vault OS models screen for local model deployment and fine-tuning

Vault OS insights screen highlighting anomalies in private data

Angled render of the Vault Alpha Cube local AI appliance

Two Models

Same chassis.
Total security.

Same 18-inch anodized aluminum chassis. Same air-gapped architecture. Two compute footprints, sized to how hard you intend to push it.

Alpha Cube

Two RTX 5090s, a 32-core Threadripper Pro, 256 GB of RAM, and 8 TB of fast local storage. Enough to run frontier-class open-weight models — including 70B-parameter chat models — for a full team of heavy users without ever touching the internet.
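For readers sizing a deployment, a back-of-the-envelope estimate of weight memory shows how a 70B-parameter model relates to the available GPU memory. The sketch assumes 32 GB of VRAM per RTX 5090 and counts model weights only. KV cache, activations, and runtime overhead are left out, and the page does not say which precision or quantization Vault OS uses, so the bit widths below are illustrative assumptions.

```python
GPU_VRAM_GB = 32  # memory per RTX 5090 card


def weight_gb(params_billion: float, bits_per_weight: int) -> float:
    """Approximate memory for model weights only, in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


def weights_fit(params_billion: float, bits_per_weight: int, gpus: int) -> bool:
    """True if the weights alone fit in the combined VRAM of `gpus` cards.

    Assumption: weights can be split evenly across cards. Real headroom
    is smaller once KV cache and activations are included.
    """
    return weight_gb(params_billion, bits_per_weight) <= gpus * GPU_VRAM_GB


if __name__ == "__main__":
    for bits in (16, 8, 4):
        size = weight_gb(70, bits)
        print(f"70B @ {bits}-bit: {size:.0f} GB | "
              f"2 GPUs: {weights_fit(70, bits, 2)} | "
              f"4 GPUs: {weights_fit(70, bits, 4)}")
```

By this estimate a 70B model needs about 140 GB of weights at 16-bit, 70 GB at 8-bit, and 35 GB at 4-bit, against 64 GB of combined VRAM on two cards and 128 GB on four.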

Alpha Cube Pro

Doubles the GPU count to four RTX 5090s with dual power supplies, for organizations running larger models, longer training jobs, or more concurrent agents. Same chassis, twice the headroom.

Whichever you start with, the security promise is identical: nothing leaves the building.

The Thesis

Built around the physics of security.

Data that never leaves a building cannot escape from one. The hardware, the OS, the network posture — every decision serves that single rule.

The Platform

One cube.
The whole stack.

01 / Air-Gapped

Vault Alpha Cube isolated in an offline environment.

Air-Gapped by Architecture

Disconnected by design. Every model, prompt, document, and inference stays physically behind your own firewall. Nothing about your work product is ever uploaded, mirrored, or telemetered.

02 / Dense Compute

Interior compute stack concept for Alpha Cube Pro.

Dense Compute, Small Footprint

Frontier-grade performance in an 18-inch cube. The Alpha Cube Pro scales to four RTX 5090 GPUs and 256 GB of RAM — enough headroom for 70B-parameter models and concurrent agent workloads.

03 / Display

Alpha Cube device display showing local system status.

Information at a Glance

An on-device AMOLED display surfaces what's running, who's connected, and how hot the silicon is.

04 / Clustering

Multiple Vault Alpha Cubes connected as a local compute cluster.

Clustering

Run multiple cubes side-by-side and Vault OS pools their compute as a single elastic resource. Need more? Try the calculator below ↓

05 / Token Economics

Token economics visual comparing local inference to recurring cloud usage.

Token Usage

Cloud AI means hitting usage caps mid-job, paying again for every retry, and watching costs scale with the productivity gains they were supposed to buy you. Alpha Cube turns inference cost from a recurring tax into a one-time purchase of hardware you own.

06 / Vault OS

Choose your models with Vault OS

Run frontier open-weight models — Llama, Mistral, Qwen — or upload your own fine-tunes. Vault OS handles deployment, agent orchestration, and background training jobs.

Exposure Audit

Calculator

How much are you leaking? Estimate your annual cloud-AI cost and the Vault configuration that replaces it.

AI-active employees

Override assumptions

AI spend / user / yr

Adjust this if your team spends more or less than the default $3,600 per heavy AI user.

Heavy daily AI use—coding agents, document research, model evaluation—typically lands around $300/mo across Cursor, Claude, Copilot, and API overage.

Annual cloud spend

Vault one-time cost

3-year savings

5-year savings

Your data stays in your building. Replace variable token billing with one-time hardware you own.

Pre-order now →

* Blended estimate of $3,600/heavy user/yr based on public list pricing for Cursor, Claude, Copilot and Anthropic API overage as of 2026-05. Capacity defaults assume 10 heavy concurrent users per Alpha Cube and 20 per Alpha Cube Pro.
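The arithmetic behind the calculator can be approximated from the defaults in the footnote and the list prices on this page. The sketch below is an assumption about how those numbers combine, not the page's actual calculator: it sizes a deployment with a single tier, rounds the cube count up, and defines savings as cumulative cloud spend minus the one-time hardware cost. All names are illustrative.

```python
import math

SPEND_PER_USER_YEAR = 3_600  # default blended estimate from the footnote
TIERS = {
    "Alpha Cube": {"price": 42_950, "users": 10},
    "Alpha Cube Pro": {"price": 56_950, "users": 20},
}


def estimate(employees: int, tier: str = "Alpha Cube",
             spend_per_user_year: float = SPEND_PER_USER_YEAR) -> dict:
    """Estimate cloud spend, cube count, and savings for one tier.

    Assumptions: every AI-active employee counts as a heavy user, cubes
    are bought in whole units, and running costs are ignored.
    """
    if employees < 0:
        raise ValueError("employees must be non-negative")
    spec = TIERS[tier]
    cubes = math.ceil(employees / spec["users"])
    annual = employees * spend_per_user_year
    one_time = cubes * spec["price"]
    return {
        "cubes": cubes,
        "annual_cloud_spend": annual,
        "vault_one_time_cost": one_time,
        "savings_3yr": 3 * annual - one_time,
        "savings_5yr": 5 * annual - one_time,
    }


if __name__ == "__main__":
    for name in TIERS:
        print(name, estimate(25, name))
```

For 25 AI-active employees this gives $90,000 in annual cloud spend against three Alpha Cubes ($128,850) or two Alpha Cube Pros ($113,900), so the cheaper configuration depends on how evenly the headcount divides into each tier's capacity.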

Reserve

Reserve your Alpha Cube

First production run. Limited units. Each cube is assembled, tested, and validated before it ships.

Tier 01 · Standard

Alpha Cube

$42,950

2× NVIDIA RTX 5090
32-core Threadripper Pro · 256 GB RAM
8 TB local NVMe storage
Single PSU · standard power

For teams running frontier 70B-class models with strong concurrency. The default starting point for most organizations.

Tier 02 · Pro

Alpha Cube Pro

$56,950

4× NVIDIA RTX 5090
32-core Threadripper Pro · 256 GB RAM
8 TB local NVMe storage
Dual PSU · dual-outlet or 240V

Doubles the compute headroom for organizations training larger models, running heavier concurrent workloads, or hosting more agents.

Both share an identical aluminum enclosure. Reserve with a non-refundable deposit; remaining balance invoiced before shipment.

FAQ

Everything you need to know.

When will Alpha Cube ship?

How does the air-gapped security actually work?

Which models does Vault OS support?

What is the refund or cancellation policy for pre-orders?

How does the ROI compare to cloud API spending?

What happens if a cube fails or needs service?

Pre-order → Talk to us
