The embedding API built for production.

87ms median latency. Three quality tiers. Zero trust security. Dedicated NVIDIA DGX infrastructure.

or explore Forge features →
NVIDIA Inception Program badge Program Membership Member of NVIDIA Inception
Powered by DGX infrastructure
playground.voxell.ai

MASH Sort: Up to 9x faster GPU sorting on NVIDIA Blackwell. Benchmarked across 100M–3B keys.

Read benchmarks →
NVIDIA Inception Program badge
Member of NVIDIA Inception
Program Member
CUDA
Native
DGX Spark
Tested
Engineering

The technical foundations behind Voxell's products.

View all engineering articles →
Get In Touch

Building systems where latency and consistency matter? We'd like to hear about your challenges.

24h reply • NDA ok • No IP needed