GHAminer: An Open Source Tool to Extract GitHub Actions Build Metrics

Research output: Contribution to Book/Report typesContribution to conference proceedingspeer-review

Abstract

GitHub Actions (GHA) has become among the most popular Continuous Integration (CI) platforms in open-source software (OSS) and commercial projects. Collecting such build data remains crucial for practitioners and researchers to allow build performance monitoring, optimization and improvement. However, mining GHA builds to collect build-related data and metrics remains challenging and time-consuming. This paper introduces GHAminer, an open-source tool designed to collect build-related metrics for GitHub Actions. GHAminer covers various aspects of data such as the build-related code changes and tests, the build duration and status (e.g., passed, failed, timeout, etc.), and repository metadata, which would be useful for practitioners and researchers to make data-driven decisions to enhance CI efficiency and quality. The tool has a modular architecture that supports efficient data extraction with minimal API load. Specifically, it consists of a set of modules that are related to repository information collection, build analysis, commit history analysis, and build log parsing. We evaluate the performance of GHAminer on a representative sample of 3,151 OSS projects. Results show that GHAminer is efficient in handling projects of various sizes with relatively stable performance to collect build data for larger projects. GHAminer is publicly available with a demo video at: https:lIgithub.com/stilab-ets/GHAminer

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages834-838
Number of pages5
ISBN (Electronic)9798331535100
DOIs
Publication statusPublished - 2025
Event32nd IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2025 - Montreal, Canada
Duration: 4 Mar 20257 Mar 2025

Publication series

NameProceedings - 2025 IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2025

Conference

Conference32nd IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2025
Country/TerritoryCanada
CityMontreal
Period4/03/257/03/25

!!!Keywords

  • GitHub Actions
  • continuous integration
  • open source tools
  • software build
  • software mining
  • software quality

Fingerprint

Dive into the research topics of 'GHAminer: An Open Source Tool to Extract GitHub Actions Build Metrics'. These topics are generated from the title and abstract of the publication. Together, they form a unique fingerprint.

Cite this