A

led-base-16384

Otherby Ai2·Model page

LED base 16384 is Ai2's Longformer Encoder-Decoder model for summarizing and processing documents of up to 16,384 tokens.

Share:

Model Card

Introduction

Allenai's Longformer Encoder-Decoder (LED).

As described in Longformer: The Long-Document Transformer by Iz Beltagy, Matthew E. Peters, Arman Cohan, led-base-16384 was initialized from bart-base since both models share the exact same architecture. To be able to process 16K tokens, bart-base's position embedding matrix was simply copied 16 times.

This model is especially interesting for long-range summarization and question answering.

Fine-tuning for down-stream task

This notebook shows how led-base-16384 can effectively be fine-tuned on a downstream task.

Author
A
Ai2
Organization · ✓
allenai
Details
Downloads18.1K
Likes51
AccessOpen Source
Licenseapache-2.0
Librarytransformers
CreatedMar 2, 2022
UpdatedJan 24, 2023
View on Hugging Face
Languages
en
Get the full context.

Sign up to read complete case studies, access detailed metrics, and unlock all use cases.

led-base-16384 — AI Model Details | Applied