Revealing the impact of COVID-19 on mental health through machine learning

Salah Bouktif; Akib Mohi Ud Din Khanday; Ali Ouni

doi:10.1093/jamiaopen/ooag013

Research output: Contribution to journal › Journal Article › peer-review

Abstract

Objective: The COVID-19 pandemic caused a major worldwide health crisis that significantly affected mental well-being. This study assesses the resilience of pre-pandemic depression-level prediction models when applied to COVID-19 era data. We leverage Machine Learning (ML) and Explainable Artificial Intelligence (XAI) techniques to identify the key factors behind the shifts in depression levels during the pandemic, and we aim to align the identified factors with interventions and preparedness for future pandemics.

Materials and methods: We follow a data-driven methodology based on National Health Interview Survey (NHIS) household survey data covering the years 2019-2022. The NHIS data are used to build both the pre-pandemic (2019) and COVID-19 (2020-2022) models in our comparative evaluation. The ML techniques are supported (1) upstream, by feature selection methods that reduce both irrelevance and the high dimensionality of data of a social nature, and (2) downstream, by an XAI-based approach that gives insight into the pandemic-associated phenomena that most affected the mental health of individuals. Our experiments use over 100 000 entries across the 4 yearly datasets, with an 80%-20% training/testing split for model building and evaluation.

Results: Classifiers trained solely on pre-COVID-19 data performed poorly when applied to COVID-19 era data. Conversely, models retrained on pandemic-specific data demonstrated high performance. In particular, the Random Forest (RF) classifier achieved the best performance, with an average accuracy of 98.10% across the COVID-19 era datasets. For the identification of key depression factors, XAI techniques provided actionable insights, revealing that features such as Delayed Medical Care, Family Poverty, Participation in Social Activities, and Marital Status were the most influential contributors to depression during the pandemic.

Discussion and conclusion: The significant decline in the performance of pre-pandemic models on COVID-19 data reveals the profound impact of the pandemic on mental health and highlights the need for new predictive models tailored to crisis circumstances. The RF model built on pandemic-era data performed accurately during the COVID-19 era, with an accuracy of 98.1%. XAI techniques confirmed that factors such as delayed medical care, family poverty, job loss, and reduced social involvement were critical drivers of the decline in mental health during the pandemic.
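
The evaluation protocol described in the abstract (an 80%-20% split, a model trained on one period and tested on another, then retrained on the later period) can be sketched roughly as follows. This is an illustration only, not the authors' pipeline: the data are synthetic, the feature names `delayed_care` and `poverty` are hypothetical stand-ins for survey variables, the way the label changes between periods is an assumption made for the demo, and a one-feature threshold rule replaces the Random Forest so that the sketch runs with the Python standard library alone.

```python
import random


def make_period(n, shifted, seed):
    """Generate synthetic (features, label) rows for one survey period."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        delayed_care = rng.random()  # hypothetical feature scaled to [0, 1]
        poverty = rng.random()       # hypothetical feature scaled to [0, 1]
        # Assumption for the demo: the label follows `poverty` before the
        # shift and `delayed_care` after it.
        driver = delayed_care if shifted else poverty
        rows.append(((delayed_care, poverty), 1 if driver > 0.5 else 0))
    return rows


def split_80_20(rows, seed):
    """Shuffle a copy of the rows and return (train, test) in an 80/20 ratio."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = (len(shuffled) * 4) // 5
    return shuffled[:cut], shuffled[cut:]


def fit_stump(rows):
    """Pick the (feature index, threshold) pair with the best training accuracy.

    A stand-in for the Random Forest named in the abstract, not a replacement.
    """
    best_acc, best_model = -1.0, (0, 0.5)
    for j in range(len(rows[0][0])):
        for k in range(1, 20):
            model = (j, k / 20)
            acc = accuracy(model, rows)
            if acc > best_acc:
                best_acc, best_model = acc, model
    return best_model


def accuracy(model, rows):
    """Fraction of rows where the threshold rule agrees with the label."""
    j, threshold = model
    hits = sum((x[j] > threshold) == bool(y) for x, y in rows)
    return hits / len(rows)


if __name__ == "__main__":
    pre_train, pre_test = split_80_20(make_period(2000, False, seed=0), seed=0)
    covid_train, covid_test = split_80_20(make_period(2000, True, seed=1), seed=1)

    pre_model = fit_stump(pre_train)      # trained on the earlier period only
    covid_model = fit_stump(covid_train)  # retrained on the later period

    print("pre-period model on pre-period test:", accuracy(pre_model, pre_test))
    print("pre-period model on later-period test:", accuracy(pre_model, covid_test))
    print("retrained model on later-period test:", accuracy(covid_model, covid_test))
```

In this toy setup the model fitted on the earlier period scores near chance on the later period, while the retrained model recovers, which mirrors the qualitative pattern the abstract reports without reproducing any of its figures.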

Original language	English
Article number	ooag013
Journal	JAMIA Open
Volume	9
Issue number	1
DOIs	https://doi.org/10.1093/jamiaopen/ooag013
Publication status	Published - 1 Feb 2026
Externally published	Yes

Keywords

COVID-19
anxiety
depression
explainable artificial intelligence
machine learning

Access to Document

10.1093/jamiaopen/ooag013

Cite this

@article{d2f519e91be64b949683dd182da576e4,

title = "Revealing the impact of COVID-19 on mental health through machine learning",

abstract = "Objective The COVID-19 pandemic caused a major health crisis worldwide significantly impacting mental well-being. In this study, our objective is to assess the resilience of pre-pandemic depression level prediction models when applied to COVID-19 era data. We leverage advanced Machine Learning (ML) and Explainable Artificial Intelligence (XAI) techniques to identify the key factors impacting the shifts in depression levels during the pandemic. We aim to align the later identification with interventions and preparedness for future pandemics. Materials and methods We use, in this study, a data-driven methodology using National Health Interview Survey (NHIS) household survey data, explicitly covering the years 2019-2022. The NHIS data is used to build both the pre-pandemic (2019) and COVID-19 (2020-2022) models discussed in our comparative evaluation. Various ML techniques are supported (1) upstream, using feature selection methods to reduce both irrelevance and the high dimensionality of social-nature data, and (2) downstream, by an XAI-based approach to gain insight into the pandemic-associated phenomena that mostly impacted the mental health of individuals. In our empirical experiments, we use over 100 000 entries across the 4 yearly datasets, where we apply an 80\%-20\% training/testing split for models building and evaluation. Results The outcomes of our empirical study show that classifiers trained solely on pre-COVID-19 data performed poorly when applied to COVID-19 era data. Conversely, models retrained on pandemic-specific data demonstrated high performance. In particular, the Random Forest (RF) classifier achieved the best performance, recording an average accuracy of 98.10\% across the COVID-19 era datasets. 
With respect to the depression key factors{\textquoteright} identification, XAI techniques provided actionable insights, revealing that features such as Delayed Medical Care, Family Poverty, Participation in Social Activities, and Marital Status were the most influential factors contributing to depression challenges during the pandemic. Discussion and conclusion The significant decline in the performance of pre-pandemic models on COVID-19 data reveals the profound impact of the pandemic on mental health, highlighting the need for new predictive models tailored to crisis circumstances. The built RF model, uses appropriate pandemic data, performed accurately during the COVID-19 era with an accuracy of 98.1\%. XAI techniques confirmed that factors such as delayed medical care, family poverty, job loss, and reduced social involvement were critical drivers that impacted the decline in mental health during the pandemic.",

keywords = "COVID-19, anxiety, depression, explainable artificial intelligence, machine learning",

author = "Salah Bouktif and Khanday, {Akib Mohi Ud Din} and Ali Ouni",

note = "Publisher Copyright: {\textcopyright} The Author(s) 2026. Published by Oxford University Press on behalf of the American Medical Informatics Association.",

year = "2026",

month = feb,

day = "1",

doi = "10.1093/jamiaopen/ooag013",

language = "English",

volume = "9",

journal = "JAMIA Open",

issn = "2574-2531",

publisher = "Oxford University Press",

number = "1",

}

TY - JOUR

T1 - Revealing the impact of COVID-19 on mental health through machine learning

AU - Bouktif, Salah

AU - Khanday, Akib Mohi Ud Din

AU - Ouni, Ali

N1 - Publisher Copyright: © The Author(s) 2026. Published by Oxford University Press on behalf of the American Medical Informatics Association.

PY - 2026/2/1

Y1 - 2026/2/1

N2 - Objective The COVID-19 pandemic caused a major health crisis worldwide significantly impacting mental well-being. In this study, our objective is to assess the resilience of pre-pandemic depression level prediction models when applied to COVID-19 era data. We leverage advanced Machine Learning (ML) and Explainable Artificial Intelligence (XAI) techniques to identify the key factors impacting the shifts in depression levels during the pandemic. We aim to align the later identification with interventions and preparedness for future pandemics. Materials and methods We use, in this study, a data-driven methodology using National Health Interview Survey (NHIS) household survey data, explicitly covering the years 2019-2022. The NHIS data is used to build both the pre-pandemic (2019) and COVID-19 (2020-2022) models discussed in our comparative evaluation. Various ML techniques are supported (1) upstream, using feature selection methods to reduce both irrelevance and the high dimensionality of social-nature data, and (2) downstream, by an XAI-based approach to gain insight into the pandemic-associated phenomena that mostly impacted the mental health of individuals. In our empirical experiments, we use over 100 000 entries across the 4 yearly datasets, where we apply an 80%-20% training/testing split for models building and evaluation. Results The outcomes of our empirical study show that classifiers trained solely on pre-COVID-19 data performed poorly when applied to COVID-19 era data. Conversely, models retrained on pandemic-specific data demonstrated high performance. In particular, the Random Forest (RF) classifier achieved the best performance, recording an average accuracy of 98.10% across the COVID-19 era datasets. 
With respect to the depression key factors’ identification, XAI techniques provided actionable insights, revealing that features such as Delayed Medical Care, Family Poverty, Participation in Social Activities, and Marital Status were the most influential factors contributing to depression challenges during the pandemic. Discussion and conclusion The significant decline in the performance of pre-pandemic models on COVID-19 data reveals the profound impact of the pandemic on mental health, highlighting the need for new predictive models tailored to crisis circumstances. The built RF model, uses appropriate pandemic data, performed accurately during the COVID-19 era with an accuracy of 98.1%. XAI techniques confirmed that factors such as delayed medical care, family poverty, job loss, and reduced social involvement were critical drivers that impacted the decline in mental health during the pandemic.

KW - COVID-19

KW - anxiety

KW - depression

KW - explainable artificial intelligence

KW - machine learning

UR - https://www.scopus.com/pages/publications/105030721206

U2 - 10.1093/jamiaopen/ooag013

DO - 10.1093/jamiaopen/ooag013

M3 - Journal Article

AN - SCOPUS:105030721206

SN - 2574-2531

VL - 9

JO - JAMIA Open

JF - JAMIA Open

IS - 1

M1 - ooag013

ER -
