¿Quién creó layoutlmv2-base-uncased?

layoutlmv2-base-uncased fue publicado por Microsoft en Hugging Face.

layoutlmv2-base-uncased

Name: layoutlmv2-base-uncased
Author: Microsoft

LayoutLMv2-base de Microsoft es un modelo preentrenado para comprensión de documentos que modela conjuntamente texto, diseño y características visuales.

Descripción del Modelo

Multimodal (text + layout/format + image) pre-training for document AI

The documentation of this model in the Transformers library can be found here.

Microsoft Document AI | GitHub

Introduction

LayoutLMv2 is an improved version of LayoutLM with new pre-training tasks to model the interaction among text, layout, and image in a single multi-modal framework. It outperforms strong baselines and achieves new state-of-the-art results on a wide variety of downstream visually-rich document understanding tasks, including , including FUNSD (0.7895 → 0.8420), CORD (0.9493 → 0.9601), SROIE (0.9524 → 0.9781), Kleister-NDA (0.834 → 0.852), RVL-CDIP (0.9443 → 0.9564), and DocVQA (0.7295 → 0.8672).

LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding Yang Xu, Yiheng Xu, Tengchao Lv, Lei Cui, Furu Wei, Guoxin Wang, Yijuan Lu, Dinei Florencio, Cha Zhang, Wanxiang Che, Min Zhang, Lidong Zhou, ACL 2021

Autor

Microsoft

Organización · ✓

microsoft

Detalles

Descargas559.4K

Me gusta68

AccesoCódigo Abierto

Licenciacc-by-nc-sa-4.0

Libreríatransformers

Creado2 mar 2022

Actualizado16 sept 2022

Ver en Hugging Face

Idiomas

Entiende todo el contexto.

Regístrate para leer casos de estudio completos, acceder a métricas detalladas y recibir todos los reportes.