Skip to main navigation Skip to search Skip to main content

Architecture-Agnostic Curriculum Learning for Document Understanding: Empirical Evidence from Text-Only and Multimodal Paradigms

  • École de technologie supérieure
  • University of Bari

Research output: Contribution to Book/Report typesContribution to conference proceedingspeer-review

Abstract

This study investigates whether the efficiency gains afforded by curriculum learning transfer across architecturally distinct document understanding models. Through a series of 24 controlled experiments comparing BERT (text-only) and LayoutLMv3 (multimodal) on the FUNSD and CORD benchmarks, we demonstrate that progressive data scheduling following a 33%→67%→100% trajectory yields consistent training speedups of 34.9% for BERT and 29.2% for LayoutLMv3, both achieving statistical significance at p < 0.001. The modest 5.8 percentage point differential between these architectures provides compelling evidence for architectureagnostic curriculum benefits. Notably, both model families exhibit approximately twofold higher final training loss while maintaining equivalent downstream F1 performance, suggesting that the loss differential reflects optimization dynamics rather than representational degradation. Extended validation across six document domains confirms the cross-domain transferability of these findings. These results indicate that curriculum learning operates primarily at the data distribution level, enabling practitioners to deploy uniform progressive schedules across heterogeneous model portfolios without requiring architecture-specific tuning.

Original languageEnglish
Title of host publicationProceedings of the 18th International Conference on Agents and Artificial Intelligence
EditorsAna Paula Rocha, Mattias Wahde, H. Jaap van den Herik
PublisherScience and Technology Publications, Lda
Pages694-703
Number of pages10
ISBN (Print)9789897587962
DOIs
Publication statusPublished - 2026
Externally publishedYes
Event18th International Conference on Agents and Artificial Intelligence, ICAART 2026 - Marbella, Spain
Duration: 5 Mar 20268 Mar 2026

Publication series

NameInternational Conference on Agents and Artificial Intelligence
Volume1
ISSN (Print)2184-3589
ISSN (Electronic)2184-433X

Conference

Conference18th International Conference on Agents and Artificial Intelligence, ICAART 2026
Country/TerritorySpain
CityMarbella
Period5/03/268/03/26

!!!Keywords

  • BERT
  • Curriculum Learning
  • Document Understanding
  • LayoutLM
  • Multimodal Learning
  • Training Efficiency
  • Transfer Learning

Fingerprint

Dive into the research topics of 'Architecture-Agnostic Curriculum Learning for Document Understanding: Empirical Evidence from Text-Only and Multimodal Paradigms'. These topics are generated from the title and abstract of the publication. Together, they form a unique fingerprint.

Cite this