FlashKDA Open-Sourced: 2.5x Faster Kimi Delta Attention on H200 GPUs (2026)
Moonshot AI has open-sourced FlashKDA, a high-performance implementation of Kimi Delta Attention that delivers up to 2.5x faster inference on Hopper GPUs. Built with CUTLASS and optimized for variable-length batching, it integrates seamlessly into the flash-linear-attention ecosystem.
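To make "variable-length batching" concrete, here is a minimal sketch of the packing convention that attention kernels of this kind commonly use: sequences of different lengths are concatenated into one flat buffer, and a list of cumulative offsets marks where each one starts and ends, so no padding is needed. This is not FlashKDA's API. The helper names `pack_varlen` and `unpack_varlen` are hypothetical, and the assumption that FlashKDA consumes offsets in this form is ours, not something the announcement states.

```python
from itertools import accumulate


def pack_varlen(sequences):
    """Concatenate variable-length sequences into one flat list.

    Returns (packed, cu_seqlens), where cu_seqlens[i] is the offset at which
    sequence i starts and cu_seqlens[-1] is the total token count.
    Hypothetical helper for illustration only.
    """
    packed = [tok for seq in sequences for tok in seq]
    cu_seqlens = [0, *accumulate(len(seq) for seq in sequences)]
    return packed, cu_seqlens


def unpack_varlen(packed, cu_seqlens):
    """Recover the original sequences from the packed buffer and offsets."""
    return [packed[start:end] for start, end in zip(cu_seqlens, cu_seqlens[1:])]


# Three sequences of lengths 3, 1 and 2 share one buffer with no padding.
batch = [[11, 12, 13], [21], [31, 32]]
packed, cu_seqlens = pack_varlen(batch)
restored = unpack_varlen(packed, cu_seqlens)
```

With padding, the same batch would occupy 3 x 3 = 9 slots. Packed, it occupies 6, and a kernel can use the offsets to keep each sequence's recurrent state separate.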






















