An Empirical Study on Hugging Face Trends, Topics and Challenges on Stack Overflow

Research output: Contribution to Book/Report typesContribution to conference proceedingspeer-review

Abstract

Hugging Face (HF) has emerged as a pivotal platform for the Machine Learning (ML) community, functioning as a central hub where developers collaborate, share models, and exchange datasets. By offering a vast repository of pre-trained models (PTMs), HF has democratized access to advanced ML resources, promoting model reuse and accelerating the development of ML-based systems. Despite its rapid adoption in recent years, there remains a limited understanding of the challenges developers encounter when working with HF in general and PTMs in particular. Understanding these challenges is crucial for guiding future research and developing support strategies for the software engineering community. Consequently, in this study we investigate HF-related Stack Overflow (SO) posts, one of the most popular discussion platforms for developers, to uncover the relevance of the topics, key challenges, and trends in HF-related discussions. This understanding will help future studies and the HF community improve the use of HF by focusing on the challenges developers face according to the prevalence and complexity of each of these challenges. To do so, we apply a topic modeling technique to categorize the topics discussed in SO posts that are related to HF. We then assess the popularity and difficulty of these topics to gain deeper insight into the specific challenges developers encounter. Our findings reveal an average annual growth rate of 31.3% in the number of HF-related questions on SO from 2019 to 2024. Furthermore, we identify eight major topics, with the usage and understanding of large language models (LLMs) being the most popular, while the distributed computing and resource management of PTMs stands out as the most challenging topic for developers.

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE 49th Annual Computers, Software, and Applications Conference, COMPSAC 2025
EditorsHossain Shahriar, Kazi Shafiul Alam, Hiroyuki Ohsaki, Stelvio Cimato, Miriam Capretz, Shamem Ahmed, Sheikh Iqbal Ahamed, AKM Jahangir Alam Majumder, Munirul Haque, Tomoki Yoshihisa, Alfredo Cuzzocrea, Michiharu Takemoto, Nazmus Sakib, Marwa Elsayed
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1297-1307
Number of pages11
ISBN (Electronic)9798331574345
DOIs
Publication statusPublished - 2025
Event49th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2025 - Toronto, Canada
Duration: 8 Jul 202511 Jul 2025

Publication series

NameProceedings - 2025 IEEE 49th Annual Computers, Software, and Applications Conference, COMPSAC 2025

Conference

Conference49th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2025
Country/TerritoryCanada
CityToronto
Period8/07/2511/07/25

!!!Keywords

  • Challenges
  • Hugging Face
  • Stack Overflow
  • Topic Modeling

Fingerprint

Dive into the research topics of 'An Empirical Study on Hugging Face Trends, Topics and Challenges on Stack Overflow'. These topics are generated from the title and abstract of the publication. Together, they form a unique fingerprint.

Cite this