Are All Code Reviews the Same? Identifying and Assessing the Impact of Merge Request Deviations

Research output: Chapter in book, report, or conference proceedings › Conference contribution › Peer-reviewed

Abstract

Code review is a fundamental practice in software engineering, ensuring code quality, fostering collaboration, and reducing defects. While research has extensively examined various aspects of this process, most studies assume that all code reviews follow a standardized evaluation workflow. However, our industrial partner, which uses the Merge Request (MR) mechanism for code review, reports that this assumption does not always hold in practice. Many MRs serve alternative purposes beyond rigorous code evaluation. These MRs often bypass the standard review process, requiring minimal oversight. We refer to these cases as deviations, as they disrupt expected workflow patterns. For example, work-in-progress (WIP) MRs may be used as draft implementations without any intention of being reviewed, MRs with very large changes are often created for code rebases, and library updates typically involve dependency version changes that require minimal or no review effort. We hypothesize that overlooking MR deviations can lead to biased analytics and reduce the reliability of machine learning (ML) models used to explain the code review process. This study addresses these challenges by first identifying MR deviations. Our findings show that deviations occur in up to 37.02% of MRs across seven distinct categories. In addition, we develop a detection approach leveraging few-shot learning, achieving up to 91% accuracy in identifying these deviations. Furthermore, we examine the impact of removing MR deviations on ML models that predict code review completion time. Removing deviations significantly improves model performance in 53.33% of cases, with improvements of up to 2.25 times. Their exclusion also significantly affects model interpretation, strongly altering overall feature-importance rankings in 47% of cases and top-k rankings in 60% of cases. Our contributions include: (1) a clear definition and categorization of MR deviations, (2) a novel AI-based detection method leveraging few-shot learning, and (3) an empirical analysis of the impact of excluding MR deviations on ML models explaining code review completion time. Our approach helps practitioners streamline review workflows, allocate reviewer effort more effectively, and derive more reliable insights from MR analytics.
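
The abstract does not spell out the few-shot pipeline, so the following is only a minimal, prototype-based sketch of how such a deviation detector could be approached, not the authors' implementation. The embedding model name (all-MiniLM-L6-v2), the 0.5 similarity threshold, and the sample MR titles are assumptions made for illustration; only the deviation categories (WIP drafts, rebases, library updates) come from the abstract.

```python
# Minimal sketch: prototype-based few-shot detection of MR deviations.
# ASSUMPTIONS: the embedding model, the threshold, and the example MR
# titles below are illustrative; the paper's actual few-shot setup is
# not described in this record.
import numpy as np
from sentence_transformers import SentenceTransformer

# Few-shot "support set": a couple of labeled MR titles per deviation
# category (categories taken from the abstract; titles are hypothetical).
SUPPORT = {
    "wip_draft": [
        "WIP: experiment with new cache layer",
        "Draft: sketch of retry logic, do not review",
    ],
    "rebase": [
        "Rebase feature/login onto main",
        "Rebase branch after upstream merge",
    ],
    "library_update": [
        "Bump lodash from 4.17.20 to 4.17.21",
        "Update dependency versions to latest patch releases",
    ],
}

model = SentenceTransformer("all-MiniLM-L6-v2")

def _unit(v: np.ndarray) -> np.ndarray:
    """L2-normalize along the last axis so dot products are cosines."""
    return v / np.linalg.norm(v, axis=-1, keepdims=True)

# One prototype per category: the (re-normalized) mean embedding of
# that category's few labeled examples.
prototypes = {
    label: _unit(_unit(model.encode(examples)).mean(axis=0))
    for label, examples in SUPPORT.items()
}

def classify(mr_title: str, threshold: float = 0.5):
    """Return (category, score) for the nearest prototype, or
    (None, score) when the MR looks like a regular review."""
    emb = _unit(model.encode([mr_title]))[0]
    label, score = max(
        ((lbl, float(emb @ proto)) for lbl, proto in prototypes.items()),
        key=lambda pair: pair[1],
    )
    return (label, score) if score >= threshold else (None, score)

print(classify("Bump numpy from 1.26.4 to 2.0.0"))      # expect library_update
print(classify("Fix race condition in job scheduler"))  # expect (None, ...)
```

The idea mirrors prototypical-network-style few-shot classification: each deviation category is summarized by the mean embedding of a handful of labeled examples, and a new MR is flagged as a deviation only when it is sufficiently close to some prototype.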

Original language: English
Title: Proceedings - 2025 IEEE International Conference on Software Maintenance and Evolution, ICSME 2025
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 308-320
Number of pages: 13
ISBN (electronic): 9798331595876
DOIs:
Status: Published - 2025
Event: 41st IEEE International Conference on Software Maintenance and Evolution, ICSME 2025 - Auckland, New Zealand
Duration: 7 Sept 2025 - 12 Sept 2025

Publication series

Name: Proceedings - 2025 IEEE International Conference on Software Maintenance and Evolution, ICSME 2025

Conference

Conference: 41st IEEE International Conference on Software Maintenance and Evolution, ICSME 2025
Country/Territory: New Zealand
City: Auckland
Period: 7/09/25 - 12/09/25
