Kumaraswamy Inverted Topp–Leone Distribution with Applications to COVID-19 Data

2021-12-14 09:57AmalHassanEhabAlmetwallyandGamalIbrahim
Computers Materials&Continua 2021年7期

Amal S.Hassan,Ehab M.Almetwally and Gamal M.Ibrahim

1Faculty of Graduate Studies for Statistical Research,Cairo University,Giza,12613,Egypt

2Faculty of Business Administration,Delta University of Science and Technology,Mansoura,35511,Egypt

3High Institute for Management Sciences,Belqas,35511,Egypt

Abstract: In this paper, an attempt is made to discover the distribution of COVID-19 spread in different countries such as; Saudi Arabia, Italy,Argentina and Angola by specifying an optimal statistical distribution for analyzing the mortality rate of COVID-19.A new generalization of the recently inverted Topp Leone distribution,called Kumaraswamy inverted Topp–Leone distribution,is proposed by combining the Kumaraswamy-G family and the inverted Topp–Leone distribution.We initially provide a linear representation of its density function.We give some of its structure properties,such as quantile function,median,moments,incomplete moments,Lorenz and Bonferroni curves,entropies measures and stress-strength reliability.Then,Bayesian and maximum likelihood estimators for parameters of the Kumaraswamy inverted Topp–Leone distribution under Type-II censored sample are considered.Bayesian estimator is regarded using symmetric and asymmetric loss functions.As analytical solution is too hard, behaviours of estimates have been done viz Monte Carlo simulation study and some reasonable comparisons have been presented.The outcomes of the simulation study confirmed the efficiencies of obtained estimates as well as yielded the superiority of Bayesian estimate under adequate priors compared to the maximum likelihood estimate.Application to COVID-19 in some countries showed that the new distribution is more appropriate than some other competitive models.

Keywords: Kumaraswamy-G family; maximum likelihood; Bayesian method;COVID-19; moments; quantile function; stress-strength reliability

1 Introduction

The inverted distributions are of great importance due to their applicability in many fields like; biological sciences, life testing problems, etc.The density and hazard rate shapes of inverted distributions exhibit dissimilar structure than matching the non-inverted distributions.Applications of inverted distributions have been discussed with various researchers, so the reader can refer to [1–8] among others.

Recently, [9] provided the inverted Topp–Leone (ITL) distribution with the following probability density function (pdf)

where,υis the shape parameter.The associated cumulative distribution function (cdf) is given by

Extensions and generalizations of probability distributions have been regarded by many researchers to enhance flexibility in modelling variety of data in many fields.A well-notable family of adding parameters is the Kumaraswamy-G (K-G) proposed in [10].They defined the cdf and the pdf of K-G as follows:

and,

whereG(x), andg(x)are the baseline cdf and pdf,δ,ϑ >0, are shape parameters.A physical clarification of the K-G (3) and (4), forδandϑpositive integers, is as follows.Consider a system is made ofϑindependent items and that each item is made up ofδindependent sub-items.Suppose the system fails if any ofϑitems fails and that each item fails if all of the sub-items fail.LetZj1,Zj2,...,Zjδdenote the life times of the sub-items within thejth component,j=1,...,ϑwith common cdfG.LetZjdenote the lifetime of thejth item,j=1,...,ϑand let Z denote the lifetime of the entire system.Then the cdf ofZis given by

In this work, we provide and study a generalization of ITL model, the so called Kumaraswamy inverted Topp–Leone (KITL) distribution.Using (2) in (4), the cdf of KITL distribution is

where,ς≡(υ,δ,ϑ), a random variable with cdf (6) will be denoted by Z~KITL(υ,δ,ϑ).Forδ=ϑ=1, the KITL distribution provides ITL distribution provided in [9].The pdf of KITL is given by

The KITL density function can exhibit different behavior for different parameters values (Fig.1).

Figure 1:Density function of the KITL distribution

The hazard rate function of KITL distribution is given as follows

Plots of the hazard rate function (hrf) of KITL distribution for specific values of parameters are shown in Fig.2.We conclude that the hrf of KITL distribution has the increasing, decreasing and upside-down shape.

Figure 2:The hrf of the KITL distribution

We are motivated to suggest the KITL model according to:(a) Produce new useful form of ITL with three parameters; (b) discuss several statistical properties (c) introduce more flexible model with decreasing, increasing, and upside-down hazard rate shapes; (d) able to model the COVID-19 data, in Saudi Arabia, Italy, Argentina and Angola, than some other distributions.This article is addressed as follows.Section 2 deals with some important properties.Maximum likelihood (ML) and Bayesian estimators of parameters in presence of Type II censored (T2C) samples are given in Sections 3 and 4 respectively.Monte Carlo simulation is provided in Section 5.Analysis to COVID-19 data sets is carried in Section 6, and conclusions are presented in Section 7.

2 Signifcant Statistical Measures

Here, some significant properties of KITL distribution, specifically, linear representation of the pdf, quantile function, moments, Rényi andϖ-entropies, mean residual life, stress-strength reliability are derived.

2.1 Useful Formulae

Here, an important mathematical formula of KITL distribution is provided.Consider the binomial theorem

in the pdf (7), we obtain

Again, employ the binomial expansion in (10), then

2.2 Quantile Function and Median

The KITL distribution is easily simulated by inverting (6) as follows:If U has a uniform distribution on (0, 1), then Z can be obtained from

L=andQ(u)is the quantile function of the KITL distribution.Hence, the medianzMof the distribution is derived by substitutingu=0.5 in (12).

2.3 Moments Measures

Thenth moment for KITL distribution about zero is given by using pdf (11) as follows

which gives

where,Λs,k,ℓ=2υ(k+1)ψs,kand B(·,·)is the beta function.For,n=1, 2, 3, 4 we obtain the first four moments around origin.Tab.1 gives the basic moments measures for particular values of parameters.

Table 1:Some moments values of the KITL distribution

2.4 Incomplete and Conditional Moments

Therth incomplete moment, say Ξr(z)ofZis obtained from (11) as follows

whereβ(·,·,x)is the incomplete beta function.Settingr=1 in (15), we obtain the first incomplete moment as follows

The Lorenz and Bonferroni curves are useful applications of the first incomplete moment defined byLo(p)=Ξ1(P)/E(P)andBo(p)=Lo(p)/F(p)respectively.The mean residual life is another application of Ξ1(t)defined bym1(t)=[1 −Ξ1(t)]/S(t)−t.

2.5 Rényi and ϖ-Entropies

Here, we obtain Rényi andϖ-entropies.The Rényi entropyR(η)of a random variableZis defined by

where,η>0 andη1.Substituting (7) in (17), then after some mathematical abbreviations of(f (z;ς))η, we get that:

Substituting (18) in (17), then we obtain the Rényi entropy of KITL distribution as follows:

Theϖ-entropy, sayR(ϖ), is determined by the following relation

Theϖ-entropy of the KITL model will be

2.6 Stress-Strength Reliability

The stress-strength reliability (SSR) is defined as the probability that the system is strong frequently to beat the stress applied on it.Consider thatX1andX2are independent stress and strength random variables following the KITL(υ,δ1,ϑ1), and KITL(υ,δ2,ϑ2)distributions,respectively.Then, the SSR of the KITL distribution is defined by

Using (6) and (7) in (22), then we get

Using the binomial expansion in last equation and after simplification we have

3 Maximum Likelihood Estimation

Here, the ML estimators of the model parameters are determined via T2C scheme.Let z1:n,z2:n,...,zr:nis of T2C sample of size r from a life test of n items whose lifetimes have the KITL distribution with parametersδ,ϑandυ.Regarding T2C, the test is stopped at specified number of failure r before all n items have failed.Then, the log-likelihood function based on censored observed sample is given by

ħj:n=The partial derivatives of lnL(z), denoted by lnℓ, with respect to the

model parametersδ,ϑ, andυare

The ML estimators of parameters are determined by solving the non-linear Eqs.(26)–(28).

4 Bayesian Estimation

Here, we discuss the Bayesian estimation of the parameters of the KITL distribution.The Bayesian estimator is considered under squared error (SE) loss function which can be defined as;

and linear exponential (LINEX) loss function which can be expressed as

wherehreflects the direction and degree of asymmetry.

Assuming that the prior distribution ofδ,ϑ,υdenoted byπ(δ),π(ϑ),π(υ)have an independent gamma prior distribution.The joint gamma prior density ofδ,ϑ,υcan be written as

Based on the following likelihood function of the KITL distribution

and the joint prior density (31), the joint posterior of the KITL distribution with parametersδ,ϑandυis

Then the joint posterior can be written as

To obtain the Bayesian estimators, we can use the Markov Chain Monte Carlo (MCMC)approach.An important sub-class of the MCMC techniques is Gibbs sampling and more general Metropolis within Gibbs samplers.The Metropolis-Hastings (M-H) algorithm together with the Gibbs sampling are the two most popular example of a MCMC method.It’s similar to acceptance rejection sampling, the M-H algorithms consider that, to each iteration of the algorithm, a candidate value can be generated from the KITL distributions.We use the M-H within Gibbs sampling steps to generate random samples from conditional posterior densities of(δ,ϑ,υ)as follows:

and

This satisfied the kids, but not the husband. The next day he purchased half a dozen young lilacs bushes and planted them around their yard, and several times since then he has added more.

The Bayesian estimates based on SE and LINEX loss functions are obtained in simulation section.For more information, please see as an example [11–13].

5 Simulation Study

A simulation study for KITL model is conducted for samples of sizesn=20, 50, 100 and the parameters are estimated under complete and T2C samples.The number of failure items;r, is selected for two levels of censoring (LC), as 70% and 90%.10000 iterations are made to compute the ML estimate (MLE), bias and mean square error (MSE).The observed outcomes are listed in Tabs.2–4.

Table 2:Bias and MSE of the MLE and Bayesian estimate for KITL model for complete sample

Table 3:Bias and MSE of the MLE and Bayes estimate for KITL model under T2C at LC=70%

Table 4:Bias and MSE of the MLE and Bayes estimate for KITL model under T2C at LC=90%

From the above tables, we conclude the following

i.As the sample sizenincreases, the bias decreases.

ii.As the sample sizenincreases, the MSE decreases.

iii.As the value ofυincreases, the bias and MSE increase.

iv.As the value ofδincreases, the bias and MSE increase.

v.As the value ofϑincreases, the bias and MSE increases.

vi.As the level of censoring increases, the bias and MSE decrease.

6 Analysis to COVID-19 Data

In this section, the KITL distribution is fitted to more famous fields of survival times of COVID-19 data with different country including Saudi Arabia, Italy, Argentina, Angola as well as March precipitation data.The data are available at https://covid19.who.int/.Reference [14]used this link to find data of COVID-19 for Egypt.Reference [15] used a deep neural network approach to train networks for estimating the optimal parameters of an SIR model endemicity of COVID-19 in Spain.The KITL model is compared with other some competitive models as, ITL,inverse Weibull (IW), inverse Lomax (IL), inverse Kumaraswamy (IK) and Topp Leone inverted Kumaraswamy (TLIK) distributions (see [16]).

Tabs.5–9 provide values of Cramér–von Mises (W∗), Anderson–Darling (A∗) and Kolmogorov–Smirnov (KS) statistics for all models fitted based on five real data sets.In addition,these tables contain the MLEs and standard errors (SEs) (appear in parentheses) of the parameters for the considered models.We compare the fits of the KITL model with the ITL, IW, IL, IK and TLIK models (see Tabs.5–9).The fitted KITL, pdf and cdf of the five data sets are displayed in Figs.3–7, respectively.These figures indicate that the KITL distribution gets the lowest values of W∗, A∗, KS among all fitted models.

6.1 Argentina Data

The following COVID-19 data represent the daily new deaths which belong to Argentina in 65 days recorded from 1 June to 4 August 2020:20, 11, 19, 10, 18, 27, 27, 14, 14, 28, 19, 24, 31,30, 17, 23, 20, 24, 43, 25, 25, 13, 24, 33, 36, 39, 43, 25, 25, 28, 38, 27, 53, 40, 50, 37, 33, 79, 52,53, 42, 38, 31, 41, 67, 61, 85, 61,71, 42, 35, 145, 80, 111, 105, 125, 66, 43, 126, 118, 111, 155, 77,69, and 55.

Tab.5 gives the MLEs, SEs and the statistics measures for all models.Tab.5 shows that the KITL model gives the smallest values for the K-S, W∗and A∗statistics among all fitted models.

Table 5:MLE and statistical measures for COVID-19 data in Argentina

Furthermore, we plot the histogram, estimated pdf plots for all models for data of Argentina in Fig.3.

6.2 Saudi Arabia Data

The following COVID-19 data belong to Saudi Arabia in 109 days recorded from 17 April to 4 August 2020 (data of daily new cases):762, 1088, 1122, 1132, 1141, 1147, 1158, 1172, 1197,1223, 1258, 1266, 1289, 1325, 1344, 1351, 1357, 1362, 1552, 1573, 1581, 1595, 1618, 1629, 1644,1645, 1686, 1687, 1701, 1704, 1759, 1793, 1815, 1869, 1877, 1881, 1897, 1905, 1911 ,1912, 1931,1966, 1968, 1975, 1993, 2039, 2171, 2201, 2235, 2238, 2307, 2331, 2378, 2399, 2429, 2442, 2476,2504, 2509, 2532, 2565, 2591, 2593, 2613, 2642, 2671, 2691, 2692, 2736, 2764, 2779, 2840 2852,2994, 3036, 3045, 3121, 3123, 3139, 3159, 3183, 3288, 3366, 3369, 3372, 3379, 3383, 3392, 3393,3402, 3580, 3717, 3733, 3921, 3927, 3938, 3941, 3943, 3989, 4128, 4193, 4207, 4233, 4267, 4301,4387, 4507, 4757, 4919.

Tab.6 gives the MLEs, SEs and the statistics measures for all models for Saudi Arabia data.We conclude that the KITL is an adequate model for these data compared to other models.

Table 6:MLE and statistical measures for COVID-19 data in Saudi Arabia country

Furthermore, the histogram and estimated cdf plots for all models for data of Saudi Arabia are plotted in Fig.4.

Figure 4:The histogram and estimated cdf for all models of COVID-19 in Saudi Arabia country

6.3 Italy Data

The considered COVID-19 data belong to Italy of 111 days that are recorded from 1 April to 20 July 2020.This data formed of daily new deaths divided by daily new cases.The data are as follows:0.2070, 0.1520, 0.1628, 0.1666, 0.1417, 0.1221, 0.1767, 0.1987, 0.1408, 0.1456, 0.1443,0.1319, 0.1053, 0.1789, 0.2032, 0.2167, 0.1387, 0.1646, 0.1375, 0.1421, 0.2012, 0.1957, 0.1297,0.1754, 0.1390, 0.1761, 0.1119, 0.1915, 0.1827, 0.1548, 0.1522, 0.1369, 0.2495, 0.1253, 0.1597,0.2195, 0.2555, 0.1956, 0.1831, 0.1791, 0.2057, 0.2406, 0.1227, 0.2196, 0.2641, 0.3067, 0.1749,0.2148, 0.2195, 0.1993, 0.2421, 0.2430, 0.1994, 0.1779, 0.0942, 0.3067, 0.1965, 0.2003, 0.1180,0.1686, 0.2668, 0.2113, 0.3371, 0.1730, 0.2212, 0.4972, 0.1641, 0.2667, 0.2690, 0.2321, 0.2792,0.3515, 0.1398, 0.3436, 0.2254, 0.1302, 0.0864, 0.1619, 0.1311, 0.1994, 0.3176, 0.1856, 0.1071,0.1041, 0.1593, 0.0537, 0.1149, 0.1176, 0.0457, 0.1264, 0.0476, 0.1620, 0.1154, 0.1493, 0.0673,0.0894, 0.0365, 0.0385, 0.2190, 0.0777, 0.0561, 0.0435, 0.0372, 0.0385, 0.0769, 0.1491, 0.0802,0.0870, 0.0476, 0.0562, 0.0138.

Tab.7 provides the MLEs, SEs and the statistics measures for all models for Italy data.We conclude that the KITL is an adequate model for these data compared to other models.

Also, the histogram and estimated cdf plots for all models for data of Italy country are plotted in Fig.5.

Figure 5:The histogram and estimated cdf for all models of COVID-19 in Italy

Table 7:MLE and statistical measures for COVID-19 data in Italy country

Table 8:MLE and statistical measures for COVID-19 data in Angola

Table 9:MLE and statistical measures for March precipitation data

6.4 Angola Data

The considered COVID19 data represent the daily new cases which are belonging to Angola of 27 days recorded from 8 July to 3 August 2020.The data are as follows:33, 10, 62, 4, 21, 23,19, 16, 35, 31, 31, 49, 18, 44, 30, 33, 39, 29, 36, 16, 18, 50, 78, 31, 39, 16, 116.

Tab.8 presents the MLEs, SEs and the statistics measures for all models for Angola data.We conclude that the KITL is an adequate model for these data compared to other models.

Fig.6 gives the histogram and estimated cdf plots for all models for data of Angola country.

Figure 6:The histogram and estimated cdf for all models of COVID-19 in Angola

6.5 March Precipitation Data in Minneapolis/St Paul

Reference [17] reported data that contain 30 observations of the March precipitation (in inches) in Minneapolis/St Paul.The observed values are:0.77, 1.74, 0.81, 1.20, 1.95, 1.20, 0.47,1.43, 3.37, 2.20, 3.00, 3.09, 1.51, 2.10, 0.52, 1.62, 1.31, 0.32, 0.59, 0.81, 2.81, 1.87, 1.18, 1.35, 4.75,2.48, 0.96, 1.89, 0.90, 2.05.

Tab.9 presents the MLEs, SEs and the statistics measures for all models for March precipitation data.We conclude that the KITL is an adequate model for these data compared to other models.Fig.7 gives the histogram and estimated cdf plots for all models for data of March precipitation.

Figure 7:CDF and PDF for different distribution for March precipitation data

7 Conclusions

This article formulates a generalization of inverted Topp–Leone distribution, named as Kumaraswamy inverted Topp–Leone distribution.Some statistical properties of the KITL distribution are provided.Bayesian and ML methods of estimation are considered.The Bayesian estimator is deduced under LINEX and SE loss functions.Monte Carlo simulation study is designed to assess the performance of estimates.Generally, we conclude that the Bayesian estimates are preferable than the corresponding other estimates in approximately most of the situations.Five real data of COVID-19 obtained from Saudi Arabia, Italy, Argentina, and Angola as well as March precipitation data are considered and they showed that KITL distribution is an adequate model for these data compared with other competitive distributions.

Funding Statement:The authors received no specific funding for this study.

Conficts of Interest:The authors declare that they have no conflicts of interest to report regarding the present study.