General Information
    • ISSN: 1793-8201 (Print), 2972-4511 (Online)
    • Abbreviated Title: Int. J. Comput. Theory Eng.
    • Frequency: Quarterly
    • DOI: 10.7763/IJCTE
    • Editor-in-Chief: Prof. Mehmet Sahinoglu
    • Associate Editor-in-Chief: Assoc. Prof. Alberto Arteta, Assoc. Prof. Engin Maşazade
    • Managing Editor: Ms. Mia Hu
    • Abstracting/Indexing: Scopus (Since 2022), INSPEC (IET), CNKI,  Google Scholar, EBSCO, etc.
    • Average Days from Submission to Acceptance: 192 days
    • APC: 800 USD
    • E-mail: editor@ijcte.org
    • Journal Metrics:
    • SCImago Journal & Country Rank
Article Metrics in Dimensions

IJCTE 2023 Vol.15(1): 1-9
DOI: 10.7763/IJCTE.2023.V15.1326

Identifying the Most Relevant Attributes to Explain Peaks of COVID-19 Infections and Deaths by Machine Learning Methods

Gabriel Pena, Juliana Gambini, and Nestor R. Barraza*

Manuscript received September 7, 2022; revised November 1, 2022; accepted December 29, 2022.

Abstract—One of the key factors related to assessing the spreading speed of a given disease is to determine the peak of infections, the point after which a wave starts to mitigate as the daily number of cases goes down. This issue has attracted the attention of scientists for the last two years in relation to the COVID-19 pandemic. At the present time, since several waves have affected most countries, there is plenty of information at our disposal: date and magnitude of contagion peaks; country-related data such as population density, gdp per capita, etc.; among other relevant status metrics at the dates of peaks, like vaccination, mobility, use of mask, occupied hospital beds, etc. Thus, finding which of those attributes are relevant and ranking them becomes an interesting field for research. In this work, we apply a filtering technique to identify peaks on the reported data and then perform feature selection algorithms with the peak magnitude as output. A comparative ranking of the attributes is thus obtained for several countries and for different waves in the same country. As pre-processing tasks, we performed a normalization and a conversion from numerical to categorical values on the output variable. As a result, a grouping of countries and waves is obtained, from where important information can be extracted. Our results contribute with knowledge for predicting and monitoring the spreading of diseases and become a relevant tool for health institutions and authorities.

Index Terms—COVID-19, peak of infections, feature selection, clustering, machine learning, K-means, random forest, Boruta

G. Pena is with the Technology and Science Department of the Universidad Nacional de Tres de Febrero, Argentina.
J. Gambini is with the Technological Institute of Buenos Aires (ITBA) and the Technology and Science Department of the Universidad Nacional de Tres de Febrero, Argentina.
N. R. Barraza is with the Technology and Science Department of the Universidad Nacional de Tres de Febrero and the School of Engineering of the University of Buenos Aires, Argentina.
*Correspondence: nbarraza@untref.edu.ar

[PDF]

Cite:Gabriel Pena, Juliana Gambini, and Nestor R. Barraza, "Identifying the Most Relevant Attributes to Explain Peaks of COVID-19 Infections and Deaths by Machine Learning Methods," International Journal of Computer Theory and Engineering vol. 15, no. 1, pp. 1-9, 2023.

Copyright © 2023 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).


Copyright © 2008-2024. International Association of Computer Science and Information Technology. All rights reserved.