Verda is SOC 2 Type II compliant Learn more

Verda Blog

NVFP4 Explained: How to Build a GEMM Kernel for B200 in CuTeDSL
NEW AI research

NVFP4 Explained: How to Build a GEMM Kernel for B200 in CuTeDSL

NVFP4 Explained: How NVIDIA Blackwell Unlocks Low-Precision Floating Point
NEW AI research

NVFP4 Explained: How NVIDIA Blackwell Unlocks Low-Precision Floating Point

Multi-Head Latent Attention: Benefits in Memory and Computation

Multi-Head Latent Attention: Benefits in Memory and Computation

AI research
FLUX on B200 vs H100: Real-Time Image Inference with WaveSpeedAI

FLUX on B200 vs H100: Real-Time Image Inference with WaveSpeedAI

AI research
DeepSeek-V3 + SGLang: Inference Optimization

DeepSeek-V3 + SGLang: Inference Optimization

AI research
DeepSeek + SGLang: Multi-Head Latent Attention

DeepSeek + SGLang: Multi-Head Latent Attention

AI research
Multi Data Center Training: Prime Intellect

Multi Data Center Training: Prime Intellect

AI research