Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

LayoutLM - Pre-training of Text and Layout for Document Image Understanding

BIMSA via YouTube

Overview

Explore a 48-minute conference talk by Yiheng Xu from BIMSA on LayoutLM, a groundbreaking pre-training model for document image understanding. Discover how LayoutLM innovatively combines text and layout information from scanned documents, addressing a crucial gap in traditional NLP pre-training techniques. Learn about the model's unique approach to jointly processing textual content and spatial layout, enhancing its effectiveness in tasks like information extraction from scanned documents. Gain insights into how LayoutLM incorporates visual features to further enrich its understanding of document structure. Understand the significance of this pioneering framework that, for the first time, integrates text and layout learning for document-level pre-training, potentially revolutionizing various real-world document processing applications.

Syllabus

Yiheng Xu: LayoutLM: Pre-training of Text and Layout for Document Image Understanding #ICBS2024

Taught by

BIMSA

Reviews

Start your review of LayoutLM - Pre-training of Text and Layout for Document Image Understanding

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.