¿Quién creó LongCat-2.0?

LongCat-2.0 fue publicado por LongCat en Hugging Face.

LongCat-2.0

Name: LongCat-2.0
Author: LongCat

Modelo de lenguaje de contexto largo LongCat-2.0 de Meituan para comprensión y generación de documentos extensos.

Model Introduction

We introduce LongCat-2.0, a large-scale MoE language model with 1.6 trillion total parameters and ~48 billion activated per token — a substantial step up from previous LongCat models, accompanied by several architectural improvements.

Both the full training run and the large-scale deployment are built entirely on AI ASIC superpods. Pretraining spans millions of accelerator-hours across more than 35 trillion tokens, with no rollbacks or irrecoverable loss spikes — demonstrating that we have the capability to conduct frontier-scale training on alternative hardware platforms.

To strengthen the model on long-horizon tasks, we introduce LongCat Sparse Attention and train LongCat-2.0 on hundreds of billions of tokens of 1M-context data. Together with dedicated post-training, this gives LongCat-2.0 strong performance on coding and agentic tasks.

[!NOTE] 🏋️ Model weights coming soon — stay tuned!

Autor

LongCat

Organización

meituan-longcat

Detalles

Descargas0

Me gusta152

AccesoCódigo Abierto

Tendencia148

Creado30 jun 2026

Actualizado30 jun 2026

Ver en Hugging Face

Entiende todo el contexto.

Regístrate para leer casos de estudio completos, acceder a métricas detalladas y recibir todos los reportes.