Acessibilidade / Reportar erro

Closed form expressions for moments of the beta Weibull distribution

Abstracts

The beta Weibull distribution was first introduced by Famoye et al. (2005) and studied by these authors and Lee et al. (2007). However, they do not give explicit expressions for the moments. In this article, we derive explicit closed form expressions for the moments of this distribution, which generalize results available in the literature for some sub-models. We also obtain expansions for the cumulative distribution function and Rényi entropy. Further, we discuss maximum likelihood estimation and provide formulae for the elements of the expected information matrix. We also demonstrate the usefulness of this distribution on a real data set.

beta Weibull distribution; expected information matrix; maximum likelihood; moment; Weibull distribution


A distribuição beta Weibull (BW) foi primeiramente introduzida por Famoye et al. (2005), e estudada por estes autores e Lee et al. (2007). No entanto, eles não fornecem expressões explícitas para os momentos. Neste artigo, nós obtemos expressões explícitas, em forma fechada, para os momentos desta distribuição, o que generaliza resultados disponíveis na literatura para alguns sub-modelos. Nós obtemos expansões para a função de distribuição acumulada e entropia de Rényi. Além disso, discutimos sobre estimação por máxima verossimilhança e fornecemos fórmulaspara os elementos da matriz de informação de Fisher. Nós também mostramos a utilidade desta distribuição em um conjunto de dados reais.

distribuição Beta Weibull; matriz de informação de Fisher; máxima verossimilhança; momento; distribuição Weibull


MATHEMATICAL SCIENCES

Closed form expressions for moments of the beta Weibull distribution

Gauss M. CordeiroI; Alexandre B. SimasII; Borko D. Stošic'III

IDepartamento de Estatística, Universidade Federal de Pernambuco Cidade Universitária, 50740-540 Recife, PE, Brasil

IIDepartamento de Matemática, Universidade Federal da Paraíba, Cidade Universitária Campus I, 58051-970 João Pessoa, PB, Brasil

IIIDepartamento de Estatística e Informática, Universidade Federal Rural de Pernambuco Rua Dom Manoel de Medeiros s/n, Dois Irmãos, 52171-900 Recife, PE, Brasil

Correspondence to Correspondence to: Gauss Moutinho Cordeiro E-mail: gausscordeiro@uol.com.br

ABSTRACT

The beta Weibull distribution was first introduced by Famoye et al. (2005) and studied by these authors and Lee et al. (2007). However, they do not give explicit expressions for the moments. In this article, we derive explicit closed form expressions for the moments of this distribution, which generalize results available in the literature for some sub-models. We also obtain expansions for the cumulative distribution function and Rényi entropy. Further, we discuss maximum likelihood estimation and provide formulae for the elements of the expected information matrix. We also demonstrate the usefulness of this distribution on a real data set.

Key words: beta Weibull distribution, expected information matrix, maximum likelihood, moment, Weibull distribution.

RESUMO

A distribuição beta Weibull (BW) foi primeiramente introduzida por Famoye et al. (2005), e estudada por estes autores e Lee et al. (2007). No entanto, eles não fornecem expressões explícitas para os momentos. Neste artigo, nós obtemos expressões explícitas, em forma fechada, para os momentos desta distribuição, o que generaliza resultados disponíveis na literatura para alguns sub-modelos. Nós obtemos expansões para a função de distribuição acumulada e entropia de Rényi. Além disso, discutimos sobre estimação por máxima verossimilhança e fornecemos fórmulaspara os elementos da matriz de informação de Fisher. Nós também mostramos a utilidade desta distribuição em um conjunto de dados reais.

Palavras-chave: distribuição Beta Weibull, matriz de informação de Fisher, máxima verossimilhança, momento, distribuição Weibull.

INTRODUCTION

The Weibull distribution is a popular distribution widely used for analyzing lifetime data. We work with the beta Weibull (BW) distribution because of the wide applicability of theWeibull distribution and the fact that it extends some recent developed distributions. We derive expansions for this distribution function and explicit closed form expressions for its moments. An application is illustrated to a real data set with the hope that it will attract more applications in reliability, biology and other areas of research.

The BW distribution stems from the following general class: if G denotes the cumulative distribution function (cdf) of a random variable, then a generalized class of distributions can be defined by

for a > 0 and b > 0, where

is the incomplete beta function ratio, By (a, b) is the incomplete beta function, B (a, b) = Γ (a) Γ (b) / Γ (a +b) is the beta function, and Γ (·) is the gamma function. This class of generalized distributions hasbeen receiving increased attention over the last years, in particular after the recent works of Eugene et al. (2002) and Jones (2004). Eugene et al. (2002) introduced which is known as the beta normal distributionby taking G (x) in (1) to be the cumulative function of the normal distribution with parameters µ and σ. The only properties of the beta normal distribution known are some first moments derived by Eugene et al. (2002) and some more general moment expressions derived by Gupta and Nadarajah (2004). Morerecently, Nadarajah and Kotz (2004) were able to provide closed form expressions for the moments, the asymptotic distribution of the extreme order statistics and the estimation procedure for the beta Gumbel distribution. Another distribution that happens to belong to (1) is the log (F) (or beta logistic) distribution, which has been around for over 20 years (Brown et al. 2002), even if it did not originate directly from (1).

While the transformation (1) is not analytically tractable in the general case, the formulae related with the BW are manageable (as it is shown in this paper), and with the use of modern computer resources with analytic and numerical capabilities, may turn into adequate tools comprising the arsenal of applied statisticians. The current work represents an advance in the direction traced by Nadarajah and Kotz (2006), contrary to their belief that some mathematical properties of the BW distribution are not tractable.

Replacing G (x) by the cdf of the Weibull distribution with parameters c and λ in (1) yields the cdf of the BW distribution

for x > 0, a > 0, b > 0, c > 0 and λ> 0. The probability density function (pdf) and the hazard rate function corresponding to (2) are

and

respectively. Simulation from (3) is easy: if B is a random number following a beta distribution with parameters a and b, then X = {- log (1 - B) }1/c/λ will follow a BW distribution with parameters a, b, cand λ. Some mathematical properties of the BW distribution are given by Famoye et al. (2005) and Leeet al. (2007).

Graphical representations of equations (3) and (4) for some choices of parameters a and b, fixed c = 3 and λ = 1, are given in Figures 1 and 2, respectively. The Weibull distribution corresponds to the particular case a =b = 1. Thus, the Weibull distribution is generalized by a family of curves shown in these figures with a variety of shapes.



The rest of the paper is organized as follows. In Section 2, we derive some expansions for the cdf of the BW distribution, and point out some special cases that have been considered in the literature.

In Section 3, we obtain explicit closed form expressions for the moments. We also present plots of the skewness and kurtosis for selected parameter values. Section 4 provides an expansion for the moment generating function (mgf). In Section 5, we obtain a closed-form expression for the Rényi entropy. InSection 6, we discuss the maximum likelihood estimation and provide the elements of the Fisher information matrix. In Section 7, an application to real data is presented. Section 8 deals with a log-beta Weibull distribution. In Section 9, we provide some conclusions. In the Appendix, two identities required in Section 3 are derived.

EXPANSIONS FOR THE DISTRIBUTION FUNCTION

The BW distribution is an extended model to analyze more complex data and generalizes some recent distributions in the literature. In particular, the BW distribution contains the exponentiated Weibull distribution (for instance, see Mudholkar et al. 1995, Mudholkar and Hutson 1996, Nassar and Eissa 2003, Nadarajah and Gupta 2005, Choudhury 2005) as special cases when b = 1. The Weibull distribution (with parameters c and λ) is clearly a special case for a =b = 1. When a = 1, (3) follows a Weibull distribution with parameters λ b1/c and c. The beta exponential distribution (Nadarajah and Kotz 2006) is also a special case for c = 1.

In what follows, we provide two simple formulae for the cdf (2), depending on whether the parameter a > 0 is a real non-integer or an integer, which may be used for further analytical or numerical analysis. Starting from the explicit expression for the cdf (2)

and setting (λy) = u gives

If a > 0 is a real non-integer, we have

It follows that

Hence, we obtain

For positive real non-integer a, the expansion (6) may be used for further analytical and/or numerical studies. For integer a, we change the formula in (5) to the binomial expansion to obtain

When both a and b =n -a + 1 are integers, the relation of the incomplete beta function to the binomial expansion yields

It can be found in the Wolfram Functions Site1 1 http://functions.wolfram.com/GammaBetaErf/BetaRegularized/03/01/ that, for integer a,

and for integer b,

Hence, if a is an integer, we have another equivalent form for (7)

For integer values of b, we obtain

Finally, if a = 1/2 and b = 1/2, we have

The special cases (8) and (9) were discussed generally by Jones (2004), and the expansions (6) and (10) reduce to Nadarajah and Kotz's (2006) results for the beta exponential distribution by setting c = 1. Clearly, the expansions for the BW density function are obtained from (6) and (7) by simple differentiation. Hence, the BW density function can be expressed in a mixture form of Weibull density functions.

MOMENTS

The moments are used for computing skewness and kurtosis and for computing moment estimators (MM), which could at least be used as starting values for maximum likelihood estimation. Let X be aBW random variable following the density function (3). We derive explicit expressions for the moments of X. First, we introduce the following notation (for any real d and a and b positive)

The change of variables x =-log (z) immediately yields S1, b, a =B (a, b). On the other hand, the change of variables x = (λy) c gives the following relation

from which it follows that

or, equivalently, for any real r,

relating Sr/c+1, b, a to the r th generalized moment of the BW distribution.

First, we consider the integral (11) when d is an integer. Let U be a random variable following the Beta (b, a) distribution with pdf fU(·) and W =- log (U). Further, let FU (·) and FW (·) be the cdfs of U and W , respectively. It is easy to see that FW (x) = 1 - FU(e-x ). Further, some properties of the Lebesgue-Stiltjes integral lead to

Thus, the values of Sd, b, afor integer values of d can be found from the moments of W if they are known. However, the moment generating function (mgf) MW (t) = E (etW) of W can be expressed as

This formula is well defined for t b. However, we are only interested in the limit t → 0 and, therefore, this expression can be used for the current purpose. We can write

From equations (13) and (14), for any positive integer k, we obtain a general formula

As particular cases, we can see directly from equation (14) that

and

and using (13), we obtain

and

The same results can also be obtained directly from equation (15).

Note that the formula for S1, b, a matches the one just given after equation (12). Since the BW distribution for c = 1 reduces to the beta exponential distribution, the above formulae for E (Xc) and E (X2c) reduce to the corresponding ones obtained by Nadarajah and Kotz (2006).

Our main goal here is to give the rth moment of X for every positive integer r. In fact, in what follows, we obtain the rth generalized moment for every real r, which may be used for further theoretical or numerical analysis. To this end, we need to obtain a formula for Sd, b, athat holds for every positive real d. In the appendix we show that, for any d > 0, the following identity holds for positive real non-integer a

and that, when a is positive integer,

is satisfied.

From equations (13) and (16), the rth generalized moment of X for positive real non-integer a canbe written as

When a > 0 is an integer, we obtain

When a =b = 1, X follows a Weibull distribution and formula (19) reduces to

which is precisely the rth moment of the Weibull distribution with parameters λ and c. Equations (15),

(18) and (19) represent the main results of this section, which may serve as a starting point for applications in particular cases, as well as for further research.

Graphical representations of skewness and kurtosis for some choices of parameter b as a function of parameter a, and for some choices of parameter a as a function of parameter b, for fixed λ = 1 and c = 3, are given in Figures 3 and 4, and 5 and 6 , respectively. Figures 3 and 4 show that the skewness and kurtosis curves cross at a = 1, whereas Figures 5 and 6 show that both skewness and kurtosis are independent of bfor a = 1. In addition, the Weibull distribution (a =b = 1) appears as a single point in Figures 3-6.



We now consider the estimation of the model parameters by the methods of moments. For the moments estimation, let

By equating the theoretical moments (18) with the sample moments, we obtain the equations:

for r = 1, . . ., 4. The method of moments estimators is the simultaneous solutions of these equations.

Here, we provide an expansion for the mgf of the BW distribution. We have

where the last expression comes from equation (12). For real non-integer a > 0, we obtain from equation (16)

and for integer a > 0 using (17),

The expression for the mgf obtained by Choudhury (2005) is a particular case of equation (20) corresponding to a = θ, λ = 1/α and b = 1. When c = 1, we have

Setting y = exp (-λx) in the above integral yields

and using the definition of the beta function in equation (21) yields

which is precisely the expression (3. 1) obtained by Nadarajah and Kotz (2006).

RÉNYI ENTROPY

The entropy of a random variable X is a measure of uncertainty variation. The Rényi entropy is defined as IR (δ) = (1 - δ) -1 log, where δ> 0 and δ ≠= 1. We have

and, if γ (a - 1) + 1 > 0 is a real non-integer, we expand {1 - e- (λx) }γ (a-1) as in (5) to obtain

If γ (a - 1) + 1 > 0 is an integer, the sum stops at γ (a - 1). Thus, if (γ - 1) (c - 1) > 0, an expression for Rényi entropy is given by

It is important to derive an expression for Rényi entropy because it serves as a measure of shape of a distribution and can be used to compare the tails and shapes of various frequently used densities, whereas the traditional kurtosis measure is not applicable; for further details, see Song (2001).

ESTIMATION AND INFORMATION MATRIX

Let Y be a random variable with the BW distribution (3). The log-likelihood for a single observation y ofY is given by

The corresponding components of the score vector are:

and

The maximum likelihood estimates (MLEs) can be calculated by making equations (22) - (25) equal tozero. These equations can be solved numerically for a, b, c and λ. We can use iterative techniques suchas a Newton-Raphson type algorithm to obtain the estimates of these parameters. From E (∂ℓ/∂b) = 0, equation (23) yields

which agrees with the previous calculations.

For interval estimation of (a, b, c, λ) and hypothesis tests, the expected information matrix isrequired. For expressing the elements of this matrix, it is convenient to introduce an extension of the integral (11)

so that we have

As before, let W =- log (U ), where U is a random variable following the Beta (b, a) distribution. Then,

Hence, the equation

relates the integral Td, b, a, eto expected values.

To simplify the expressions for some elements of the expected information matrix, the followingidentities are useful:

and

which can be easily proved.

Explicit expressions for the elements of the information matrix K , obtained using Maple and Mathematica algebraic manipulation software (we have used both for double checking the obtained expressions), are given below in terms of the integrals (11) and (26):

And

The integrals Si, j, kand Ti, j, k, l in the expected information matrix are easily determined numerically usingMAPLE and MATHEMATICA for any parameter values.

The observed information matrix could also replace the expected information matrix. The latter would possibly not depend on complicated integral computations and could be used in interval and testing implementation. Second derivatives for the log-likelihood could also be numerically computed.

Under the conditions that are fulfilled for parameters in the interior of the parameter space, but not on the boundary, the asymptotic distribution of the MLEs â, , ĉ and is multivariate normal N4 (0, K -1). The estimated multivariate normal N4 (0, -1) distribution can be used to construct approximate confidence intervals and confidence regions for the individual parameters and for the hazard rate and survival functions. The likelihood ratio (LR) statistic is useful for comparing the BW distribution with some of its special sub-models.

We can compute the maximum values of the unrestricted and restricted log-likelihoods to construct the LR statistics for testing some sub-models of the BW distribution. For example, we may use the LR statistic to check if the fit using the BW distribution is statistically "superior" to a fit using the exponentiated Weibull (EW) or Weibull distributions for a given data set. Mudholkar et al. (1995), in their discussion of the classical bus-motor-failure data, noted the curious aspect in which the larger EW distribution provides an inferior fit as compared to the smaller Weibull distribution.

APPLICATION TO REAL DATA

In this section we compare the results of fitting the BW and Weibull distributions to the data set studied by Meeker and Escobar (1998, p. 383), who give the times of failure and running for a sample of devices from a field-tracking study of a larger system. At a certain point in time, 30 units were installed in normal service conditions. Two causes of failure were observed for each unit that failed: the failure caused by an accumulation of randomly occurring damage from power-line voltage spikes during electric storms, and the failure caused by normal product wear. The times are: 275, 13, 147, 23, 181, 30, 65, 10, 300, 173, 106, 300, 300, 212, 300, 300, 300, 2, 261, 293, 88, 247, 28, 143, 300, 23, 300, 80, 245, and 266.

The MLEs and the maximized log-likelihood BW for the BW distribution are:

â = 0. 0785, = 0. 0659, ĉ= 7. 9355, = 0. 004987 and BW =-169. 919,

whereas the MLEs and the maximized log-likelihood W for the Weibull distribution are:

c˜= 1. 2650, = 0. 005318 and W =-184. 3138.

The LR statistic for testing the hypothesis a =b = 1 (namely, Weibull versus BW distribution) is then w = 28. 7896, which indicates that the Weibull distribution should be rejected. As an alternative test, we can use the Wald statistic. The asymptotic covariance matrix of the MLEs for the BW distribution, which comes from the inverse of the expected information matrix, is given by

The value of the Wald statistic is W = 38. 4498, again signalizing that the BW distribution conforms to these data. In Figure 7 we display the density of both Weibull and BW distributions fitted and the histogram of the data, where it is seen that the BW model captures the apparent bi-modality of the data.


THE LOG-BETA WEIBULL DISTRIBUTION

Let X be a random variable having the BW density function (3). The random variable Y = log (X) has a log-beta Weibull (LBW) distribution whose density function, say f (y) = f (y; a, b, σ, µ), which is parameterized in terms of σ = c-1 and µ =- log (λ), is given by

where -∞ y, µ < ∞ and σ> 0. The corresponding survival function is

We can define the standardized random variable Z = (Y - µ) /σ with the density function

for -∞ z ∞. the standard extreme-value distribution corresponds to the particular choice of a =b = 1. A LBW regression model can be constructed from (27), and the parameters can be estimated by maximum likelihood. However, this will be conducted in a future research.

CONCLUSION

The Weibull distribution, having exponential and Rayleigh distributions as special cases, is a very popular distribution for modeling lifetime data and the phenomenon with monotone failure rates. We study the beta Weibull (BW) distribution, which represents a generalization of several distributions previously considered in the literature, such as the exponentiated Weibull (EW) distribution (Mudholkar et al. 1995, Mudholkar and Hutson 1996, Nassar and Eissa 2003, Nadarajah and Gupta 2005, Choudhury 2005) obtained for b = 1. The Weibull distribution (with parameters c and λ) is also another particular case for a = 1 and b = 1. When a = 1, the BW distribution reduces to a Weibull distribution with parameters λ b1/c and c. The beta exponential distribution is also an important special case for c = 1.

The BW distribution provides a rather general and flexible framework for statistical analysis. It unifies several previously proposed families of distributions, therefore yielding a general overview of these families for theoretical studies, and it also provides a rather flexible mechanism for fitting a wide spectrum of real world data sets.

We derive explicit expressions for the moments of the BW distribution and an expansion for the moment generating function. We also derive the Rényi entropy. These expressions are manageable and, with the use of modern computer resources with analytic and numerical capabilities, may turn into adequate tools comprising the arsenal of applied statisticians. We discuss the estimation procedure by maximum likelihood and derive the information matrix. We demonstrate an application to real data.

We note that our analysis is related only to uncensored lifetime data and the censored case is not dealt with. We could try to extend some parts of the article to the censored situations in a future research. However, we will not obtain closed form expressions for the moments, and the integral forms will depend on numerical software. Maximum likelihood estimation and the observed information matrix using numerical methods would be a possibility.

Finally, a Bayesian analysis as developed by Cancho et al. (1999) for the EW model could possibly be extended to the more general situation of the BW distribution.

ACKNOWLEDGMENTS

We gratefully acknowledge the grants from Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq). The authors are also grateful to the editor and the two referees for helpful comments and suggestions.

Manuscript received on September 15, 2009; accepted for publication on July 5, 2010

APPENDIX

In what follows, we derive the identities (16) and (17). We start from

which yields

and setting z = e-x gives

For real non-integer a, we have

Also, for real p > -1 and real q, we have

Hence,

and, finally, we arrive at

which represents the identity (16).

Now, let a > 0 be an integer; then, from equation (28), we have

Using (29) we obtain

and therefore we arrive at

which represents the identity (17).

  • BROWN BW, SPEARS FM AND LEVY LB. 2002. The log F: a distribution for all seasons. Computation Stat 17: 47-58.
  • CANCHO VG, BOLFARINE H AND ACHCAR JA. 1999. A Bayesian analysis for the exponentiated-Weibull distribution. J Appl Statist 8: 227-242.
  • CHOUDHURY A. 2005. A simple derivation of moments of the exponentiated Weibull distribution. Metrika 62: 17-22.
  • EUGENE N, LEE C AND FAMOYE F. 2002. Beta-normal distribution and its applications. Commun Stat-Theor M 31: 497-512.
  • FAMOYE F, LEE C AND OLUMOLADE O. 2005. The Beta-Weibull Distribution. J Stat Theory Appl 4: 121-136.
  • GUPTA AK AND NADARAJAH S. 2004. On the moments of the beta normal distribution. Commun Stat-Theor M 33: 1-13.
  • JONES MC. 2004. Families of distributions arising from distributions of order statistics. Test 13: 1-43.
  • LEE C, FAMOYE F AND OLUMOLADE O. 2007. Beta-Weibull Distribution: Some Properties and Applications to Censored Data. J Mod Appl Stat Methods 6: 173-186.
  • MEEKER WQ AND ESCOBAR LA. 1998. Statistical Methods for Reliability Data. J Wiley, New York.
  • MUDHOLKAR GS, SRIVASTAVA DK AND FREIMER M. 1995. The exponentiated Weibull family. Technometrics 37: 436-45.
  • MUDHOLKAR GS AND HUTSON AD. 1996. The exponentiated Weibull family: some properties and a flood data application. Commun Stat-Theor M 25: 3059-3083.
  • NADARAJAH S AND GUPTA AK. 2005. On the moments of the exponentiated Weibull distribution. Commun tat-Theor M 34: 253-256.
  • NADARAJAH S AND KOTZ S. 2004. The beta Gumbel distribution. Math Probl Eng 10: 323-332.
  • NADARAJAH S AND KOTZ S. 2006. The beta exponential distribution. Reliab Eng Syst Safe 91: 689-697.
  • NASSAR MM AND EISSA FH. 2003. On the exponentiated Weibull distribution. Commun Stat-Theor M 32: 1317-1336.
  • SONG KS. 2001. Rényi information, loglikelihood and an intrinsic distribution measure. J Stat Plan Infer 93: 51-69.
  • Correspondence to:

    Gauss Moutinho Cordeiro
    E-mail:
  • 1
  • Publication Dates

    • Publication in this collection
      03 June 2011
    • Date of issue
      June 2011

    History

    • Received
      15 Sept 2009
    • Accepted
      05 July 2010
    Academia Brasileira de Ciências Rua Anfilófio de Carvalho, 29, 3º andar, 20030-060 Rio de Janeiro RJ Brasil, Tel: +55 21 3907-8100 - Rio de Janeiro - RJ - Brazil
    E-mail: aabc@abc.org.br