Time series analysis for modeling and predicting confirmed cases of influenza a in Algeria

Djillali Seba; Себа Джиллали; N. Benaklef; Бенаклеф Н.; K. Belaide; Белаиде К.

doi:10.15789/2220-7619-TSA-17693

Анализ временных рядов для моделирования и прогнозирования подтвержденных случаев гриппа а в Алжире

Авторы: Себа Д.¹, Бенаклеф Н.², Белаиде К.²
Учреждения:
1. Высшая школа информатики
2. Университет Беджаи
Выпуск: Том 15, № 1 (2025)
Страницы: 168-172
Раздел: КРАТКИЕ СООБЩЕНИЯ
Дата подачи: 14.06.2024
Дата принятия к публикации: 29.07.2024
Дата публикации: 30.04.2025
URL: https://iimmun.ru/iimm/article/view/17693
DOI: https://doi.org/10.15789/2220-7619-TSA-17693
ID: 17693

Цитировать

Полный текст

Аннотация
Полный текст
Об авторах
Список литературы
Дополнительные файлы
Статистика

Аннотация

Грипп А является подтипом вируса гриппа, который в первую очередь поражает птиц и млекопитающих, вызывая респираторные заболевания, и характеризуется способностью быстро мутировать, что приводит к появлению разнообразия штаммов и периодическим пандемиям. Настоящая статья посвящена изучению в Алжире распространения и прогнозированию подтвержденных случаев гриппа А, высокоинфекционного заболевания, которое вызывает широко распространенные заболевания и смертность как в Алжире, так и во всем мире. Материалы и методы. Для прогнозирования подтвержденных случаев гриппа А были применены несколько статистических моделей, включая ARIMA, Seasonal ARIMA (SARIMA), ETS, BATS и широко признанный метод машинного обучения RNN. Далее мы провели сравнительное исследование с использованием показателей производительности для оценки указанных моделей. Результаты. Для определения наиболее эффективной модели проводилась оценка среднеквадратической ошибки. Наши результаты показывают, что RNN превзошел другие модели благодаря своей способности обрабатывать сложные шаблоны, включая сезонные компоненты и наличию памяти. SARIMA и BATS также показали хорошие результаты благодаря своей способности управлять сезонными закономерностями. Напротив, ARIMA и ETS показали самые плохие результаты. Вывод. В приводимом исследовании использовался комплексный подход для разработки модели прогнозирования подтвержденных случаев гриппа A в Алжире. Полученные результаты расширяют наше понимание потенциального будущего распространения данного заболевания и способствуют эффективным стратегиям управления рисками.

Ключевые слова

грипп A, прогнозирование, управление рисками, временные ряды, BATS, Алжир

Полный текст

Inroduction

Influenza A is a viral infection that affects the respiratory system. It is one of the four types of influenza viruses and can cause symptoms such as cough, body aches, and sore throat. Highly contagious, Influenza A spreads through tiny droplets of bodily fluid released during coughing, sneezing, or talking. Symptoms often include fever, chills, fatigue, and other related discomforts.

In the early 20th century, scientific knowledge was advanced enough to predict the recurrence of influenza, which had twice reached pandemic levels in the late 19th century. However, it was largely ineffective in mitigating the devastating impact of the 1918 pandemic. Since then, humanity has made significant strides against the disease, developing the capability to design and produce vaccines and antiviral drugs to prevent or lessen infections.

The World Health Organization (WHO) estimates that globally there are 3–5 million cases of severe illness and 290 000–650 000 deaths annually due to influenza-related respiratory conditions.

Nowadays, predicting Influenza A helps minimize the health, economic, and social impacts of the virus by enabling proactive and well-coordinated responses. The debate on forecasting Influenza A involves researchers from various disciplines who use a range of methodologies, including statistical, machine learning, and deep learning techniques, referencing various studies such as Goldstein et al.[7], Xu et al.[14], Zheng et al. [16], Kandula et al. [8], Khan et al. [9], Cheng et al. [4], Wolk et al. [13], Xue et al. [15], Boostani et al. [3], Al-Qaness et al. [2], Seba et al. [12].

This topic has been treated in the Algerian context by several works such as [6] and [11]. The objective of our work is to predict the behavior of new cases in Algeria using time series analysis. We employ two widely recognized approaches from the literature: statistical methods and machine learning techniques for time series analysis.

Methods and Materials

Descriptive Data

The epidemiology of seasonal influenza is well defined in many parts of the world, especially in developed countries. However, in other regions, much less is known about the epidemiology of Influenza A, notably in Algeria (Fig. 1).

Figure 1. Monthly confirmed cases of influenza A

We collect data from: Our world in data-Influenza. We observed that Influenza A exhibits a seasonal winter pattern because the cold, dry air of winter provides ideal conditions for the virus’s prolonged survival. The reduced humidity during this season enhances the likelihood of infection. The significant decrease in Influenza cases from 2020 to 2022 can be attributed to several factors related to the COVID-19 pandemic: Implementing public health strategies, including mask wearing, hand hygiene, social distancing, and lockdowns on a large scale, substantially decreased the spread of respiratory infections, including Influenza A.

Changes in the individual’s awareness of safety amid the COVID-19 outbreak [10]. Analyzing monthly confirmed cases involves treating the data as a time series. Throughout the literature, various approaches such as statistical, machine learning methods have commonly been employed.

Preprocessing data analysis

Our data does not contain any missing values but contains seasonal patterns.

We employ the Augmented Dickey-Fuller (ADF) test to assess the stationarity of the present time series. A p-value of 0.01, which is smaller than the significance level of 0.05, indicates that the time series is stationary.

For data to be considered stationary, the statistical characteristics of the system must remain constant over time. This does not mean that the values of each data point must be identical; rather, the overall behavior of the data should remain consistent.

We introduce the Partial Autocorrelation Function (PACF) and the Autocorrelation Function (ACF) to measure the memory of the most effective model from the ACF and PACF plots, we observe that the autocorrelation decays exponentially, indicating that the data has short memory. Additionally, we note the presence of seasonal components (Fig. 2).

Figure 2. ACF and PACF plots

Methodology

Various time series analysis models and techniques are employed to determine the most efficient method for handling validated cases of influenza A, utilizing both statistical and machine learning models.

We take our data as a time series and split it into two separate sets: the training set and the test set. The test set, comprising 15% of the data, is used to validate the best model, while the training set consists of the remaining 85%. We evaluate a model’s performance using the Root Mean Square Error (RMSE).

$R M S E = \sqrt{\frac{1}{n}_{i = 1}^{n} {(y_{i} - z_{i})}^{2}}$ (1)

Results

Predictive models

ARIMA and ARFIMA model

Autoregressive integrated moving average (ARIMA) models predict future values based on past values, it gauges the strength of one dependent variable relative to other changing variables.

A stochastic process (Xt)t≥0 is said to be an ARIMA(p, d, q) an integrated mixture autoregressive moving average model if it satisfies the following equation:

ϕ(L)(1 − L)^dX_t = θ(L)ε_t ∀t ≥0 (2)

where d ∈ N, L is lag operator, ε_t ∼ N (0, σ ²) i.i.d. errors, with σ ² < ∞.

ϕ(L) = (1- ϕ₁L - ... - ϕ_pL^p) with ϕ_p ≠ 0

θ(L) = (1- θ₁L - ... - θ_qL^q) with θ_q 0

Seasonal ARIMA, is an extension of ARIMA that explicitly supports univariate time series data with a seasonal component.

There are four seasonal elements that are not part of ARIMA that must be configured; they are: P: Seasonal autoregressive order, D: Seasonal difference order, Q: Seasonal moving average order, m: The number of time steps for a single seasonal period SARIMA(p,d,q)(P,D,Q)m

ϕ^′(L)ϕ(L)(1 − L^m)^d(1 − L)^dX_t = θ^′(L)θ(L)ε_t ∀t ≥ 0 (3)

ETS model

The ETS models are time series models with an underlying state space model consisting of a level component, a trend component (T), a seasonal component (S), and an error term (E). This method produces forecasts that are weighted averages of past observations where the weights of older observations exponentially decrease.

BATS model

The BATS (Exponential smoothing state space model with Box-Cox transformation, ARMA errors, Trend and Seasonal components) model is a time series forecasting model that was proposed by De Livera et al. [5].

Box-Cox Transformation component is used to transform the data to achieve normality and stabilize the variance. The ARMA (Autoregressive Moving Average) Errors component is used to model the residuals of the time series data, which are assumed to be independent and identically distributed. Finally, the Seasonal component is used to model the seasonal patterns in the data.

Recurrent Neural Network RNN model

A Recurrent Neural Network (RNN) model for regression is a type of neural network designed to process sequential data by maintaining a memory of previous inputs.

Sequential Data Handling: RNNs are ideal for tasks where data points are dependent on previous ones, due to their ability to maintain information over sequences.

Memory: RNNs have internal memory (hidden states) that captures information from previous time steps, allowing them to learn patterns and dependencies over time.

Structure: An RNN consists of layers of neurons where each neuron receives inputs not only from the current time step but also from its own previous output.

Backpropagation Through Time (BPTT): The training of RNNs involves a variation of backpropagation (BPTT), which updates weights by considering the entire sequence of data.

Empirical Results

Here, we illustrate the predicted results graphically using several models “Statistical and machine learning models” (Fig. 3).

Figure 3. Predicting confirmed cases of Influenza A in Algeria

We show numerical results using this measure such as RMSE, a smaller RMSE and MAE indicates better performance (Tabl.).

Table. Measure of performance

Models	ARIMA	SARIMA	ETS	BATS	RNN
RMSE	78.319	75.176	80.353	74.088	71.335

Discussion

We have applied statistical models: ARIMA, SARIMA, BATS, and ETS, as well as a machine learning model, RNN.

ARIMA is not suitable for modeling and predicting this data due to the lack of seasonal patterns. Therefore, we opted for SARIMA, which can handle seasonal patterns. SARIMA performed well in forecasting the confirmed cases in winter 2023 but exhibit poor performance for the winter of 2024 due to its short persistence.

ETS had the worst performance. We applied simple exponential smoothing but could not use multiplicative errors and seasonal components due to negative values. Similarly, the additive case also resulted in negative values.

BATS performed well due to its ability to handle seasonal components and its treatment of errors as ARMA, meaning they are autocorrelated (dependent), which is more realistic compared to the independent errors assumed by ARIMA and SARIMA. To improve the results, we replaced negative values with zeroes.

The RNN model performed the best due to its capacity to handle complex patterns, including nonlinear and periodic trends, and its ability to overcome the problem of memory. However, it had problems modeling the rest of the year except for the wintertime.

Suggestions for reducing the number of confirmed cases of Influenza A

A combination of public health initiatives, personal efforts, and preventative measures are needed to decrease the number of confirmed cases of influenza A. The following are some crucial procedures:

Vaccination: Boost the use of yearly influenza vaccinations, which aim to protect people against the most prevalent strains that emerge each season.

Public Health Campaigns: Organize awareness programs to inform people about the value of immunizations, good hand cleanliness, and proper respiratory protocol.

Social Distancing: Take steps to avoid close proximity in crowded regions, particularly during the prime time of the flu season. This can involve advising people to stay away from crowded places, work from home, and keep a safe distance from other people.

Surveillance and Early Detection: Implement robust surveillance systems to: Monitor Influenza Activity: Track the spread and evolution of influenza strains in real-time. Identify Outbreaks: Quickly detect and respond to outbreaks to contain their spread.

Data Sharing: Collaborate with international health organizations for data sharing and coordinated response efforts.

Conclusion

To summarize, this work examined the behavior of confirmed Influenza A cases in Algeria and applied various statistical and machine learning models to predict the future behavior of this phenomenon. This approach enhances our understanding of the disease’s future trends. Based on our findings, we have suggested recommendations to help reduce the number of confirmed Influenza A cases.

Acknowledgement

We acknowledge the support of “Direction Générale de la Recherche Scientifique et du Développement Technologique DGRSDT”.MESRS ALGERIA.

Об авторах

Джиллали Себа

Высшая школа информатики

Автор, ответственный за переписку.
Email: d.seba@esi-sba.dz

Doctor in Mathematics, Assistant Professor, Laboratory of Applied Mathematics, Department of Mathematics

Алжир, г. Сиди-Бель-Аббес

Н. Бенаклеф

Университет Беджаи

Email: d.seba@esi-sba.dz

аспирант по математике, специальность «Вероятность и статистика», лаборатория прикладной математики в Университете

Алжир, г. Беджая

К. Белаиде

Университет Беджаи

Email: d.seba@esi-sba.dz

д.мат.н., профессор, лаборатория прикладной математики, факультет математики

Алжир, г. Беджая

Список литературы

Ali S.T., Cowling B.J. Influenza virus: tracking, predicting, and forecasting. Annu Rev. Public Health, 2021, vol. 42, pp. 43–57. doi: 10.1146/annurev-publhealth-010720-021049
Al-Qaness M.A.A., Ewees A.A., Fan H., Abd Elaziz M. Optimized forecasting method for weekly influenza confirmed cases. Int. J. Environ. Res. Public Health., 2020, vol. 17, no. 10: 3510. doi: 10.3390/ijerph17103510
Boostani R., Rismanchi M., Khosravani A., Rashidi L., Kouchaki S. Presenting a hybrid method in order to predict the 2009 pandemic influenza A (H1N1). J. Health. Med. Inform., 2012, vol. 3, no. 1, pp. 31–43. doi: 10.4172/2157-7420.1000112
Cheng H.Y., Wu Y.C., Lin M.H., Liu Y.L., Tsai Y.Y., Wu J.H., Pan K.H., Ke C.J., Chen C.M., Liu D.P., Lin I.F., Chuang J.H. Applying machine learning models with an ensemble approach for accurate real-time influenza forecasting in Taiwan: development and validation study. J. Med. Internet Res., 2020, vol. 22, no. 8: e15394. doi: 10.2196/15394
De Livera A.M., Hyndman R.J., Snyder R.D. Forecasting time series with complex seasonal patterns using exponential smoothing. J. Am. Stat. Assoc., 2011, vol. 106, no. 496, pp. 1513–1527. doi: 10.1198/jasa.2011.tm09771
Feradi F., Bouhata R., Kalla M.I., Kalla M. Assessing avian influenza vulnerability using geographically weighted regression, Batna Algeria. The Arab. World. Geographer, 2023, vol. 26, no. 1, pp. 76–87. doi: 10.5555/1480-6800-26.1.76
Goldstein E., Cobey S., Takahashi S., Miller J.C., Lipsitch M. Predicting the epidemic sizes of influenza A/H1N1, A/H3N2, and B: a statistical method. PLoS Med., 2011, vol. 8, no. 7: e1001051. doi: 10.1371/journal.pmed.1001051
Kandula S., Yamana T., Pei S., Yang W., Morita H., Shaman J. Evaluation of mechanistic and statistical methods in forecasting influenza-like illness. J R Soc. Interface, 2018, vol. 15, no. 144: 20180174. doi: 10.1098/rsif.2018.0174
Khan M.A., Abidi W.U.H., Ghamdi M.A.A., Almotiri S.H., Saqib S., Alyas T., Khan K.M., Mahmood N. Forecast the influenza pandemic using machine learning. Comput. Mater. Contin., 2021, vol. 66, no. 1, pp. 331–340. doi: 10.32604/cmc.2020.012148
Lu Y., Wang Y., Shen C., Luo J., Yu W. Decreased incidence of influenza during the COVID-19 pandemic. Int. J. Gen. Med., 2022, vol. 15, pp. 2957–2962. doi: 10.2147/IJGM.S343940
Mejia K., Viboud C., Santillana M. Leveraging Google search data to track influenza outbreaks in Africa. Gates Open Research, 2019, vol. 3, no. 1653: 1653. doi: 10.12688/gatesopenres.13072.1
Seba D., Belaide K. Forecasting infection fatality rate of COVID-19: measuring the efficiency of several hybrid models. Russian Journal of Infection and Immunity, 2024, vol. 14, no. 2, pp. 313–319. doi: 10.15789/2220-7619-FIF-17548
Wolk D.M., Lanyado A., Tice A.M., Shermohammed M., Kinar Y., Goren A., Chabris C.F., Meyer M.N., Shoshan A., Abedi V. Prediction of influenza complications: development and validation of a machine learning prediction model to improve and expand the identification of vaccine-hesitant patients at risk of severe influenza complications. J. Clin. Med., 2022, vol. 11, no. 15: 4342. doi: 10.3390/jcm11154342
Xu Q., Gel Y.R., Ramirez Ramirez L.L., Nezafati K., Zhang Q., Tsui K.L. Forecasting influenza in Hong Kong with Google search queries and statistical model fusion. PLoS One, 2017, vol. 12, no. 5: e0176690. doi: 10.1371/journal.pone.0176690
Xue H., Bai Y., Hu H., Liang H. Regional level influenza study based on Twitter and machine learning method. PLoS One, 2019, vol. 14, no. 4: e0215600. doi: 10.1371/journal.pone.0215600
Zheng Y., Wang K., Zhang L., Wang L. Study on the relationship between the incidence of influenza and climate indicators and the prediction of influenza incidence. Environ. Sci. Pollut. Res. Int., 2021, vol. 28, no. 1, pp. 473–481. doi: 10.1007/s11356-020-10523-7

Дополнительные файлы

Доп. файлы

Действие

1. JATS XML

Скачать

2. Рисунок 1. Ежемесячные подтвержденные случаи гриппа А

Скачать (82KB)

Метаданные

3. Рисунок 2. Графики ACF и PACF

Скачать (165KB)

Метаданные

4. Рисунок 3. Прогнозирование подтвержденных случаев гриппа А в Алжире

Скачать (90KB)

Метаданные

Имя пользователя
Пароль
Запомнить меня

Забыли пароль?	Регистрация

Имя пользователя
Пароль
Запомнить меня

Забыли пароль?	Регистрация

Том 15, № 3 (2025)

Анализ временных рядов для моделирования и прогнозирования подтвержденных случаев гриппа а в Алжире

Полный текст

Аннотация

Ключевые слова

Полный текст

Inroduction

Methods and Materials

Descriptive Data

Preprocessing data analysis

Methodology

Results

Predictive models

ARIMA and ARFIMA model

ETS model

BATS model

Recurrent Neural Network RNN model

Empirical Results

Discussion

Suggestions for reducing the number of confirmed cases of Influenza A

Conclusion

Acknowledgement

Об авторах

Джиллали Себа

Н. Бенаклеф

К. Белаиде

Список литературы

Дополнительные файлы

Данный сайт использует cookie-файлы