Architecture-Agnostic Curriculum Learning for Document Understanding: Empirical Evidence from Text-Only and Multimodal Paradigms

  • École de technologie supérieure
  • University of Bari

Research output: Chapter in book, report, or conference proceedings › Conference contribution › Peer-reviewed

Abstract

This study investigates whether the efficiency gains afforded by curriculum learning transfer across architecturally distinct document understanding models. Through a series of 24 controlled experiments comparing BERT (text-only) and LayoutLMv3 (multimodal) on the FUNSD and CORD benchmarks, we demonstrate that progressive data scheduling following a 33%→67%→100% trajectory yields consistent training speedups of 34.9% for BERT and 29.2% for LayoutLMv3, both achieving statistical significance at p < 0.001. The modest 5.8 percentage point differential between these architectures provides compelling evidence for architecture-agnostic curriculum benefits. Notably, both model families exhibit approximately twofold higher final training loss while maintaining equivalent downstream F1 performance, suggesting that the loss differential reflects optimization dynamics rather than representational degradation. Extended validation across six document domains confirms the cross-domain transferability of these findings. These results indicate that curriculum learning operates primarily at the data distribution level, enabling practitioners to deploy uniform progressive schedules across heterogeneous model portfolios without requiring architecture-specific tuning.
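
As an illustration of the 33%→67%→100% progressive schedule mentioned in the abstract, the following minimal Python sketch computes which training samples would be visible at each stage. It is not the authors' implementation. The function name `curriculum_stages`, the assumption that samples arrive already sorted in the order they should be introduced, and the rounding rule are all hypothetical choices made for this example. The abstract does not state how samples are ordered or how training steps are divided among stages.

```python
from typing import List, Sequence


def curriculum_stages(
    ordered_indices: Sequence[int],
    fractions: Sequence[float] = (0.33, 0.67, 1.0),
) -> List[List[int]]:
    """Return the sample indices visible to the model at each stage.

    Assumption: `ordered_indices` is already sorted in the order samples
    should be introduced. The ordering criterion is not given in the abstract.
    """
    if not ordered_indices:
        raise ValueError("ordered_indices must not be empty")
    if any(not 0.0 < f <= 1.0 for f in fractions):
        raise ValueError("each fraction must lie in (0, 1]")
    if list(fractions) != sorted(fractions):
        raise ValueError("fractions must be non-decreasing")

    n = len(ordered_indices)
    stages = []
    for fraction in fractions:
        # Round to the nearest sample count, but always expose at least one.
        cutoff = max(1, round(fraction * n))
        stages.append(list(ordered_indices[:cutoff]))
    return stages


# Hypothetical usage: a 100-sample dataset whose indices are pre-sorted.
stages = curriculum_stages(list(range(100)))
stage_sizes = [len(stage) for stage in stages]
print(stage_sizes)
```

Because each stage is a prefix of the next, earlier samples remain available in later stages. A training loop would iterate over the stages in order and build a data loader from each index list. Nothing in the sketch depends on the model architecture, which is consistent with the abstract's claim that the schedule acts at the data distribution level.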

Original language: English
Title of host publication: Proceedings of the 18th International Conference on Agents and Artificial Intelligence
Editors: Ana Paula Rocha, Mattias Wahde, H. Jaap van den Herik
Publisher: Science and Technology Publications, Lda
Pages: 694-703
Number of pages: 10
ISBN (print): 9789897587962
Publication status: Published - 2026
Externally published: Yes
Event: 18th International Conference on Agents and Artificial Intelligence, ICAART 2026 - Marbella, Spain
Duration: 5 March 2026 to 8 March 2026

Publication series

Name: International Conference on Agents and Artificial Intelligence
Volume: 1
ISSN (print): 2184-3589
ISSN (electronic): 2184-433X

Conference

Conference: 18th International Conference on Agents and Artificial Intelligence, ICAART 2026
Country/Territory: Spain
City: Marbella
Period: 5/03/26 to 8/03/26
