FireAttention V2: 12x faster to make Long Contexts practical for Online Inference

By Dmytro Ivchenko | 6/20/2024
