DeepSeek releases ‘sparse attention’ model that cuts API costs in half

Researchers at DeepSeek released a new experimental model designed to have dramatically lower inference costs when used in long-context operations.

LEAVE A REPLY

Your email address will not be published. Required fields are marked *