The Asymmetric Power-Student-t Model for Censored and Truncated Data

TOVAR-FALÓN, ROGER; BOLFARINE, HELENO; MARTÍNEZ-FLÓREZ, GUILLERMO

doi:10.1590/0001-3765202120190920

Abstract

In this paper, we propose the power Student-t regression model for censored (limited) observations, which extends the Student-t censored regression model. The extension is based on the asymmetric and heavy-tailed power Student-t distribution. The score functions and the expected information matrix are given, and parameter estimation by the likelihood approach is discussed. Two simulation studies are conducted to evaluate parameter recovery and properties of the model and, finally, two applications to a real data set are reported to demonstrate the usefulness of the new methodology.

Key words
Censored regression model; Fisher information matrix; maximum likelihood estimation; power Student-$t$ distribution

INTRODUCTION

Regression models in which the response variable is censored or limited are common in many fields: clinical trials, econometric analysis, social phenomena and engineering studies, among others. In clinical trials, for example, in the first phases of development of new vaccines, antibody concentration values are often left-censored because of the detection limit of the assay, which lacks sensitivity when concentrations are near zero; see Moulton & Halsey 1995. In the study of social phenomena, research on extramarital behavior, where the variable of interest is the number of extramarital affairs in the previous year, can lead to a left-censored variable (Fair 1978). In econometric analysis, the ordinary Tobit model (Tobin 1958) is commonly used to study the labor force participation of married women. In this case, the observed response is the wage rate, which is typically considered as censored at zero, i.e., positive wage rates are registered for working women, whereas the observed wage rates for non-working women are zero; see Mroz 1987.

In situations such as those discussed above, where censored regression (CR) models are proposed, it is common to assume a normal distribution for the error term. However, this assumption may be unsuitable or unrealistic because of atypical observations or a high (or low) degree of skewness and kurtosis in the response variable, which the normal model is unable to capture. Considerable interest has therefore centered on relaxing the normality assumption for the errors in CR models. In this context, several authors have proposed alternatives to the normal censored regression (NCR) model, widely known in the literature as the Tobit model. Arellano-Valle et al. 2012, for example, extend the classical Tobit model by introducing the Student-$t$ censored regression (TCR) model, which can be suitable when the response variable has heavy tails and kurtosis greater than that of the normal model. Another extension of the Tobit model was proposed by Martínez-Flórez et al. 2013, who assume that the random errors follow a power-normal (PN) distribution (Gupta & Gupta 2008). The novelty of this proposal is the incorporation of a shape parameter that relaxes the assumption of symmetric (normal) errors and accommodates left- and right-skewed forms of the error term in CR models. More recently, Garay et al. 2017 proposed a family of censored regression models based on the symmetric class of scale mixtures of normal (SMN) distributions, which includes the TCR model of Arellano-Valle et al. 2012. This family also includes the Pearson type VII (PVII), slash (SL), power exponential (PE), contaminated normal (CN) and normal (N) distributions. In addition to being robust, these models have been shown to be useful for detecting atypical observations in CR models.

Although some proposals take into account the problem of atypical observations in censored regression models, most of them assume symmetric errors, and few studies capture departures from symmetry in the error distribution. One exception is Martínez-Flórez et al. 2013, who base their work on the ability of alpha-power models to fit data whose distribution presents high or low asymmetry and/or kurtosis.

Within this class of alpha-power models, Zhao & Kim 2016 proposed an extension of the Student-$t$ model by defining the power-Student-$t$ (PT) distribution as an alternative to the skew-$t$ model of Azzalini & Capitanio 2003 for fitting skewed and heavy-tailed data. The PT model, which extends the power-normal model of Gupta & Gupta 2008, seems to be useful when, in the presence of atypical observations, the data present a higher degree of skewness and kurtosis than the PN model can accommodate.

In this paper, we propose a censored regression model under the assumption that the errors follow a PT distribution (hereafter, the PTCR model). The PT assumption gives flexibility for accommodating left- and right-skewed forms, as well as kurtosis greater or smaller than that of the Student-$t$ distribution; hence, the PTCR model extends the TCR model. Inference is conducted by the maximum likelihood (ML) approach and its large-sample properties. An application to a real data set shows that the proposed model can be very useful in practice.

The rest of this paper is organized as follows. Section “The Power Student-t Distribution” presents a brief review of the main properties of the PT distribution. In Section “Power-Student-t Model for Censored and Truncated Data”, we introduce the censored and truncated PT models. Section “Censored Power-Student-t Regression Model” introduces the PTCR model; there, the ML equations and the observed and expected information matrices are given. Section “Simulation Study” presents the results of a simulation study that reveals the good performance of the estimation approach. The PTCR model is fitted to a data set of housewives' wages in Section “Real Data Application”, revealing that the data set in question can be fitted by the PTCR model as well as by a CR model in which the observational errors have an SMN distribution (SMNCR model).

THE POWER STUDENT-T DISTRIBUTION

In this section, we present the PT distribution and review some of its main characteristics and properties. The PT model was introduced by Zhao & Kim 2016 as an alternative to the skew-$t$ model for fitting data with high indices of asymmetry and kurtosis in addition to heavy tails.

Definition 1. The random variable $X$ is said to have a PT distribution with shape parameter $α$ and degrees of freedom $ν$ if $X$ has probability density function (PDF) given by

f_{P T} (x; α, ν) = α f_{T} (x; ν) [F_{T} (x; ν)]^{α - 1},

(1)

for $x \in ℝ$ and $α > 0$ . Functions $f_{T} (\cdot; ν)$ and $F_{T} (\cdot; ν)$ are the PDF and cumulative distribution function (CDF) of the standard Student- $t$ distribution.

A random variable with density $f_{P T} (x; α, ν)$ is denoted by $X \sim P T (α, ν)$. Figure 1 displays some forms of the PDF of the PT distribution for selected values of $α$. Note from the figure that the parameter $α$ controls the skewness and kurtosis of the distribution. The CDF of the PT model is given by

F_{P T} (x; α, ν) = [F_{T} (x; ν)]^{α},

(2)

for $x \in ℝ$. Some properties of the PT distribution follow from Definition 1.

Figure 1. Density function of $P T (α, 10)$ for $α = 5$ (solid line), $α = 2$ (dashed line), $α = 1$ (dotted line), and $α = 0.5$ (dotted-dashed line).
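Equations (1) and (2) are simple to evaluate numerically. The following is a minimal sketch using only the Python standard library; the function names (`t_pdf`, `t_cdf`, `pt_pdf`, `pt_cdf`) are illustrative and not from the paper, and the Student-$t$ CDF is approximated by Simpson's rule purely to keep the example dependency-free (a library routine such as `scipy.stats.t.cdf` would normally be preferred, especially in the far tails).

```python
import math


def t_pdf(x, nu):
    """Standard Student-t density f_T(x; nu)."""
    log_const = (math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)
                 - 0.5 * math.log(nu * math.pi))
    return math.exp(log_const - (nu + 1) / 2 * math.log1p(x * x / nu))


def t_cdf(x, nu, n=2000):
    """F_T(x; nu) by composite Simpson's rule on [0, |x|] plus symmetry.

    Assumption of this sketch: moderate |x|; accuracy degrades far in the tails.
    """
    a = abs(x)
    if a == 0.0:
        return 0.5
    h = a / n  # n must be even for Simpson's rule
    total = t_pdf(0.0, nu) + t_pdf(a, nu)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * t_pdf(i * h, nu)
    area = total * h / 3
    return 0.5 + area if x > 0 else 0.5 - area


def pt_pdf(x, alpha, nu):
    """Equation (1): alpha * f_T(x; nu) * F_T(x; nu)^(alpha - 1)."""
    return alpha * t_pdf(x, nu) * t_cdf(x, nu) ** (alpha - 1)


def pt_cdf(x, alpha, nu):
    """Equation (2): F_T(x; nu)^alpha."""
    return t_cdf(x, nu) ** alpha


if __name__ == "__main__":
    # The same values of alpha as in Figure 1, with nu = 10.
    for alpha in (5, 2, 1, 0.5):
        print(alpha, pt_pdf(0.0, alpha, 10), pt_cdf(0.0, alpha, 10))
```

With $α = 1$ the two functions reduce to the Student-$t$ PDF and CDF, which gives a quick sanity check of the implementation.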

Proposition 1. Let $X \sim P T (α, ν)$ , then

(i) if $α = 1$ , $X$ follows Student- $t$ distribution and we write $X \sim T (ν)$ ,
(ii) if $ν \to \infty$ , $X$ converges to power-normal (PN) model with parameter $α$ . The PDF is given by

f_{P N} (x; α) = α ϕ (x) [Φ (x)]^{α - 1}, x \in ℝ .

(3)

More details on the PN distribution can be found in Gupta & Gupta 2008 and Pewsey et al. 2012.
(iii) if $α = 1$ and $ν \to \infty$, $X$ converges to the standard normal distribution.

Proof. Parts (i)-(iii) follow directly from the definition of the PT distribution. ◻

Proposition 2. Let $X \sim P T (α, ν)$ , then for $k \in ℕ$

E [X^{k}] = E [(F_{T}^{- 1} (Y; ν))^{k}],

(4)

where $Y$ has a beta distribution with parameters $α$ and 1, and $F_{T}^{- 1} (\cdot; ν)$ denotes the inverse of the function $F_{T} (\cdot; ν)$.

Proof. We have by definition that

E [X^{k}] = \int_{ℝ} x^{k} α f_{T} (x; ν) [F_{T} (x; ν)]^{α - 1} d x

thus, letting $y = F_{T} (x; ν)$ , then $x = F_{T}^{- 1} (y; ν)$ , it follows that

E [X^{k}] = \int_{0}^{1} (F_{T}^{- 1} (y; ν))^{k} α y^{α - 1} d y

which is the expected value of $(F_{T}^{- 1} (Y; ν))^{k}$, where $Y$ follows a beta distribution with parameters $α$ and 1. ◻
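The change of variable in the proof can also be read as a statement about distributions. The short derivation below is a sketch of that reading; the inverse-transform sampler in the last line is an illustration that follows from equation (2) and is not stated in the source.

```latex
% For 0 < y < 1, using that F_T(.; ν) is continuous and strictly increasing
% and that F_{PT}(x; α, ν) = [F_T(x; ν)]^α by equation (2):
P (F_{T} (X; ν) \leq y) = P (X \leq F_{T}^{-1} (y; ν))
  = [F_{T} (F_{T}^{-1} (y; ν); ν)]^{α} = y^{α},
% which is the CDF of a Beta(α, 1) random variable, so Y = F_T(X; ν) ~ Beta(α, 1).
% Conversely, if U ~ Uniform(0, 1), then P(U^{1/α} \leq y) = y^{α}, hence
X = F_{T}^{-1} (U^{1/α}; ν) \sim P T (α, ν).
```

This gives one simple way to simulate PT variates whenever the Student-$t$ quantile function is available.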

The expected value, variance, indices of asymmetry and kurtosis of the PT model can be found by using the expressions

(i) $E [X] = μ_{1}$
(ii) $V [X] = μ_{2} - μ_{1}^{2}$
(iii) $γ_{1} = \frac{μ_{3} - 3 μ_{1} μ_{2} + 2 μ_{1}^{3}}{{(μ_{2} - μ_{1}^{2})}^{3 / 2}}$
(iv) $γ_{2} = \frac{μ_{4}'}{{(μ_{2}')}^{2}}$

where $μ_{r} = E [X^{r}]$ and $μ_{r}' = E {[X - E (X)]}^{r}$ . Table I presents the values of the asymmetry coefficient of the PT model for some values of the $ν$ parameter and for values of $α$ in the range of 0.1 to 100000.

Table I. Skewness of the $P T (α, ν)$ model for values of $α$ ranging from 0.1 to 100000.

Definition 2. Let $X \sim P T (α, ν)$ . The PT density of location and scale is defined as the distribution of $Z = ξ + η X$ , for $ξ \in ℝ$ and $η > 0$ . The corresponding PDF is given by

f_{P T} (z; ξ, η, α, ν) = \frac{α}{η} f_{T} (\frac{z - ξ}{η}; ν) [F_{T} (\frac{z - ξ}{η}; ν)]^{α - 1},

(5)

for $z \in ℝ$ . We denote this extension as $Z \sim P T (ξ, η, α, ν)$ , and we have that $P T (0, 1, α, ν) \equiv P T (α, ν)$ .

The $k$ th moment of the random variable $Z$ is given by

E [Z^{k}] = \sum_{j = 0}^{k} \binom{k}{j} ξ^{j} η^{k - j} μ_{k - j},

(6)

where $μ_{j}$ is the $j$th moment of a random variable $X \sim P T (α, ν)$. Zhao & Kim 2016 derived the information matrix for the location-scale version and showed that it is non-singular when $α = 1$ for small values of the parameter $ν$ (i.e., $ν ≪ 30$). When $ν$ tends to $+ \infty$, the PT distribution converges to the PN model, and Pewsey et al. 2012 showed that the PN model has a non-singular information matrix. This result guarantees that the regularity conditions for the likelihood approach are satisfied; hence, under the PT model, symmetry can be tested by using the ordinary large-sample properties of the likelihood ratio statistic.

POWER-STUDENT-T MODEL FOR CENSORED AND TRUNCATED DATA

Motivated by the ability of the PT distribution to fit data with high indices of asymmetry and kurtosis, in this section we introduce the censored PT and truncated PT models, denoted by CPT and TPT, respectively.

Definition 3 (Censored PT Model). Suppose that the random variable $Y$ follows a $P T (ξ, η, α, ν)$ distribution. Let $Y_{1}, \dots, Y_{n}$ be a random sample of size $n$ from $Y$, where only those values of $Y_{i}$ greater than a constant $k_{i}$ are recorded, and for values $Y_{i} \leq k_{i}$ only the value $k_{i}$ is registered. The observed values $Y_{i}^{o}$ can be written as

Y_{i}^{o} = \begin{cases} k_{i}, & \text{if } Y_{i} \leq k_{i}, \\ Y_{i}, & \text{if } Y_{i} > k_{i}, \end{cases}

for $i = 1, \dots, n$. The resulting sample is said to be a censored power-Student-$t$ (CPT) sample.

From Definition 3 it follows that $P (Y_{i}^{o} = k_{i}) = P (Y_{i} \leq k_{i}) = [F_{T} (\frac{k_{i} - ξ}{η}; ν)]^{α}$, and for the observations $Y_{i}^{o} = Y_{i}$ the distribution of $Y_{i}^{o}$ is the same as that of $Y_{i}$, i.e., $Y_{i}^{o} \sim P T (ξ, η, α, ν)$. For convenience, we work with left-censored data; however, the following results can be extended to other types of censoring.

Maximum Likelihood Estimation for CPT Model

Let $Y_{1}^{o}, \dots, Y_{n}^{o}$ be a random sample from the censored $P T (ξ, η, α, ν)$ distribution (censored at $k_{i}$). To perform statistical inference for the parameter vector $𝛉 = {(ξ, η, α, ν)}^{⊤}$ by the ML method, we use the reparameterization of Olsen 1978. Thus, letting $γ = σ ξ$ and $σ = 1 / η$, the log-likelihood function for the new vector $𝛗 = {(γ, σ, α, ν)}^{⊤}$, given the observed sample $Y^{o}$, can be written as

\begin{aligned} ℓ (φ; Y^{o}) = {} & α \sum_{i = 1}^{n} (1 - d_{i}) \log F_{T} (c_{i}; ν) \\ & + \sum_{i = 1}^{n} d_{i} \{ \log α + \log σ + \log f_{T} (z_{i}; ν) + (α - 1) \log F_{T} (z_{i}; ν) \} \end{aligned}

(7)

where $c_{i} = σ k_{i} - γ$, $z_{i} = σ y_{i} - γ$, and $d_{i}$ is an indicator variable defined as $d_{i} = 1$ if $Y_{i}^{o} = Y_{i}$ and $d_{i} = 0$ if $Y_{i}^{o} = k_{i}$. The components of the score function $𝐔 (𝛗)$ are obtained by differentiating the log-likelihood function in (7) with respect to $γ$, $σ$, $α$ and $ν$, which gives

\begin{aligned} U (γ) = {} & - α \sum_{i = 1}^{n} (1 - d_{i}) r (c_{i}; ν) \\ & + \sum_{i = 1}^{n} d_{i} \{ (\frac{ν + 1}{ν}) (1 + \frac{z_{i}^{2}}{ν})^{- 1} z_{i} - (α - 1) r (z_{i}; ν) \} \\ U (σ) = {} & α \sum_{i = 1}^{n} (1 - d_{i}) r (c_{i}; ν) k_{i} \\ & + \sum_{i = 1}^{n} d_{i} \{ \frac{1}{σ} - (\frac{ν + 1}{ν}) (1 + \frac{z_{i}^{2}}{ν})^{- 1} z_{i} y_{i} + (α - 1) r (z_{i}; ν) y_{i} \} \\ U (α) = {} & \sum_{i = 1}^{n} (1 - d_{i}) \log F_{T} (c_{i}; ν) + \sum_{i = 1}^{n} d_{i} \{ \frac{1}{α} + \log F_{T} (z_{i}; ν) \} \\ U (ν) = {} & \frac{α}{2} \sum_{i = 1}^{n} (1 - d_{i}) \{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - b_{01} (c_{i}; ν) - \frac{c_{i} r (c_{i}; ν)}{ν} \} \\ & + \frac{1}{2} \sum_{i = 1}^{n} d_{i} \{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - \frac{1}{ν} - \log (1 + \frac{z_{i}^{2}}{ν}) + (\frac{ν + 1}{ν}) (1 + \frac{z_{i}^{2}}{ν})^{- 1} \frac{z_{i}^{2}}{ν} \} \\ & + \frac{α - 1}{2} \sum_{i = 1}^{n} d_{i} \{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - b_{01} (z_{i}; ν) - \frac{z_{i} r (z_{i}; ν)}{ν} \} \end{aligned}

where $r (x; ν) = f_{T} (x; ν) / F_{T} (x; ν)$, $ψ (x) = \frac{d}{d x} \log Γ (x)$ is the digamma function and $b_{m n} (c_{m}; ν + m)$ is the truncated moment defined as

b_{m n} (c_{m}; ν + m) = \int_{- \infty}^{c_{m}} s^{m} \{ \log (1 + \frac{s^{2}}{ν + m}) \}^{n} \frac{f_{T} (s; ν + m)}{F_{T} (c_{m}; ν + m)} d s,

(8)

with $c_{m} = \sqrt{\frac{ν + m}{ν}} c$ and $c_{0} = c$. The moments $b_{m n}$ in (8) are obtained by numerical integration, for example, by using the integrate function of R (R Development Core Team 2018). The ML estimates of the parameters $ξ$, $η$, $α$ and $ν$ in the censored PT model are obtained by using iterative algorithms based on the score functions and then applying the inverse transformation $ξ = γ / σ$ and $η = 1 / σ$. To obtain the standard errors of the ML estimates, one should compute the information matrix $I_{φ}$, whose elements are given by

I_{φ} (i, k) = - E [\frac{\partial^{2} ℓ (φ; Y^{o})}{\partial φ_{i} \partial φ_{k}}], i, k = 1, \dots, 4

where $𝛗 = {(γ, σ, α, ν)}^{⊤}$. Since the second-order derivatives and their expectations under the PT distribution are not straightforward to obtain, the information matrix $I_{φ}$ has to be computed numerically. Thus, we use the observed information matrix for calculating the standard errors in the rest of the paper. To recover the information matrix $I_{θ}$ of the original parameterization $𝛉 = {(ξ, η, α, ν)}^{⊤}$, we use

I_{θ} = (\partial φ / \partial θ)^{⊤} I_{φ} (\partial φ / \partial θ),

where the Jacobian matrix is

\frac{\partial 𝛗}{\partial 𝛉} = \begin{pmatrix} \frac{1}{η} & - \frac{ξ}{η^{2}} & 0 & 0 \\ 0 & - \frac{1}{η^{2}} & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}

(9)
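The log-likelihood (7) and the back-transformation to $(ξ, η)$ can be sketched as follows. This is a minimal illustration using only the Python standard library, not the authors' implementation: the function names are hypothetical, a response equal to its censoring point $k_{i}$ is taken to mean $d_{i} = 0$, and the Student-$t$ CDF is approximated by Simpson's rule to avoid external dependencies.

```python
import math


def t_pdf(x, nu):
    """Standard Student-t density f_T(x; nu)."""
    log_const = (math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)
                 - 0.5 * math.log(nu * math.pi))
    return math.exp(log_const - (nu + 1) / 2 * math.log1p(x * x / nu))


def t_cdf(x, nu, n=2000):
    """F_T(x; nu) by composite Simpson's rule on [0, |x|] plus symmetry."""
    a = abs(x)
    if a == 0.0:
        return 0.5
    h = a / n
    total = t_pdf(0.0, nu) + t_pdf(a, nu)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * t_pdf(i * h, nu)
    area = total * h / 3
    return 0.5 + area if x > 0 else 0.5 - area


def cpt_loglik(gamma, sigma, alpha, nu, y_obs, k):
    """Log-likelihood (7) in the parameterization (gamma, sigma, alpha, nu).

    y_obs[i] <= k[i] is treated as a left-censored observation (d_i = 0).
    """
    ll = 0.0
    for y, k_i in zip(y_obs, k):
        if y <= k_i:  # censored: alpha * log F_T(c_i; nu), c_i = sigma*k_i - gamma
            ll += alpha * math.log(t_cdf(sigma * k_i - gamma, nu))
        else:  # observed: log of the PT density at z_i = sigma*y_i - gamma
            z = sigma * y - gamma
            ll += (math.log(alpha) + math.log(sigma) + math.log(t_pdf(z, nu))
                   + (alpha - 1) * math.log(t_cdf(z, nu)))
    return ll


def to_original(gamma, sigma):
    """Inverse transformation: xi = gamma / sigma, eta = 1 / sigma."""
    return gamma / sigma, 1.0 / sigma


if __name__ == "__main__":
    # Hypothetical toy data: two values censored at k = 0 and three observed.
    y_obs = [0.0, 0.8, 0.0, 2.1, 1.3]
    k = [0.0] * 5
    print(cpt_loglik(0.5, 1.2, 1.5, 5, y_obs, k))
    print(to_original(0.5, 1.2))
```

A function of this form can be passed to any general-purpose optimizer; the analytic scores above are then useful as gradients or as a check on numerical derivatives.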

Definition 4 (Truncated PT Model). Let $X$ be a random variable with distribution $X \sim P T (ξ, η, α, ν)$ . Let $a, b \in ℝ$ with $a < b$ , such that $P (a < X < b) > 0$ . It is said that random variable $Y$ has a truncated power-Student- $t$ (TPT) distribution in the interval $(a, b)$ , if $Y$ has the same distribution as $X ∣ X \in (a, b)$ . In this case, we write $Y \sim {T P T}_{(a, b)} (ξ, η, α, ν)$ .

As a consequence of the Definition 4, the PDF of TPT distribution can be obtained as

\begin{aligned} f_{P T} (y ∣ (a, b)) = {} & \frac{α}{η} f_{T} (\frac{y - ξ}{η}; ν) [F_{T} (\frac{y - ξ}{η}; ν)]^{α - 1} \\ & \times \{ [F_{T} (\frac{b - ξ}{η}; ν)]^{α} - [F_{T} (\frac{a - ξ}{η}; ν)]^{α} \}^{- 1} \end{aligned}

if $a < y < b$, and $f_{P T} (y ∣ (a, b)) = 0$ otherwise. Now suppose that, before the sample is drawn, the distribution of $Y$ is truncated at the value $k$, so that only observations with $Y \leq k$ can be selected. Then the random variable $Y$ has PDF given by

f_{P T} (y ∣ y \leq k) = \frac{α}{η} f_{T} (\frac{y - ξ}{η}; ν) [F_{T} (\frac{y - ξ}{η}; ν)]^{α - 1} [F_{T} (\frac{k - ξ}{η}; ν)]^{- α}

(10)

for $- \infty < y \leq k$ .

Maximum Likelihood Estimation for TPT Model

Given a sample $Y = (Y_{1}, \dots, Y_{n})^{⊤}$ from the TPT distribution truncated at the value $k$, the log-likelihood function for the vector $𝛗 = {(γ, σ, α, ν)}^{⊤}$, where $γ = σ ξ$ and $σ = 1 / η$, is given by

\begin{aligned} ℓ (φ; Y) = {} & - n_{1} α \log F_{T} (c; ν) \\ & + \sum_{y_{i} \leq k} \{ \log α + \log σ + \log f_{T} (z_{i}; ν) + (α - 1) \log F_{T} (z_{i}; ν) \} \end{aligned}

(11)

where $c = σ k - γ$, $z_{i} = σ y_{i} - γ$, and $n_{1}$ is the number of observations in the sample such that $- \infty < y_{i} \leq k$. Differentiating the log-likelihood function (11) with respect to the components of the vector $𝛗$ gives the following elements of the score function:

\begin{aligned} U (γ) = {} & n_{1} α r (c; ν) + \sum_{y_{i} \leq k} \{ (\frac{ν + 1}{ν}) (1 + \frac{z_{i}^{2}}{ν})^{- 1} z_{i} - (α - 1) r (z_{i}; ν) \} \\ U (σ) = {} & - n_{1} α r (c; ν) k + \sum_{y_{i} \leq k} \{ \frac{1}{σ} - (\frac{ν + 1}{ν}) (1 + \frac{z_{i}^{2}}{ν})^{- 1} z_{i} y_{i} + (α - 1) r (z_{i}; ν) y_{i} \} \\ U (α) = {} & - n_{1} \log F_{T} (c; ν) + \sum_{y_{i} \leq k} \{ \frac{1}{α} + \log F_{T} (z_{i}; ν) \} \\ U (ν) = {} & - \frac{n_{1} α}{2} \{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - b_{01} (c; ν) - \frac{c r (c; ν)}{ν} \} \\ & + \frac{1}{2} \sum_{y_{i} \leq k} \{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - \frac{1}{ν} - \log (1 + \frac{z_{i}^{2}}{ν}) + (\frac{ν + 1}{ν}) (1 + \frac{z_{i}^{2}}{ν})^{- 1} \frac{z_{i}^{2}}{ν} \} \\ & + \frac{α - 1}{2} \sum_{y_{i} \leq k} \{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - b_{01} (z_{i}; ν) - \frac{z_{i} r (z_{i}; ν)}{ν} \} \end{aligned}

where $r (x; ν) = f_{T} (x; ν) / F_{T} (x; ν)$, $ψ (x) = \frac{d}{d x} \log Γ (x)$ is the digamma function and $b_{m n} (c_{m}; ν + m)$ is given by (8). To obtain the ML estimates of the parameters $ξ$, $η$, $α$ and $ν$ in the TPT model, we proceed as for the CPT model, using iterative methods based on the Newton-Raphson algorithm with the score functions. We use the observed information matrix for calculating the standard errors and, to recover the information matrix $I_{θ}$ of the original parameterization $𝛉 = {(ξ, η, α, ν)}^{⊤}$, we use $I_{θ} = (\partial φ / \partial θ)^{⊤} I_{φ} (\partial φ / \partial θ)$, where $\partial 𝛗 / \partial 𝛉$ is given in (9).

CENSORED POWER-STUDENT-T REGRESSION MODEL

In this section, we introduce the censored power-Student- $t$ regression model, which is denoted by PTCR. This model results from the consideration of the observed random variable $Y_{i}^{o} = D_{i} Y_{i}$ , where $D_{i} = I_{(0, + \infty)} (Y_{i})$ and $Y_{i} = x_{i}^{⊤} β + ε_{i}$ , with $ε_{i} \overset{i i d}{\sim} P T (0, η, α, ν)$ , for $i = 1, \dots, n$ ; i.e.,

Y_{i}^{o} = \begin{cases} x_{i}^{⊤} β + ε_{i}, & \text{if } Y_{i} > 0, \\ 0, & \text{if } Y_{i} \leq 0, \end{cases}

(12)

where $𝛃$ is a vector of dimension $p$ of unknown parameters, $x_{i} = (x_{i 1}, \dots, x_{i p})^{⊤}$ , for $i = 1, \dots, n$ , are vectors of known covariates, and $ε_{i}$ , for $i = 1, \dots, n$ , are independent random variables with PT distribution with location parameter 0, scale parameter $η$ , shape parameter $α$ , and degrees of freedom $ν$ . This assumption is equivalent to considering that unobserved random variables $Y_{1}, \dots, Y_{n}$ are independent with $Y_{i} \sim P T (x_{i}^{⊤} β, η, α, ν)$ , that is, with PDF given by

g (y_{i}; θ) = \frac{α}{η} f_{T} (\frac{y_{i} - x_{i}^{⊤} β}{η}; ν) {[F_{T} (\frac{y_{i} - x_{i}^{⊤} β}{η}; ν)]}^{α - 1}

for $i = 1, \dots, n$, where $𝛉 = {(𝛃^{⊤}, η, α, ν)}^{⊤}$. The contribution to the likelihood function of observations with $Y_{i}^{o} = 0$ is given by

P (Y_{i}^{o} = 0) = [F_{T} (- \frac{x_{i}^{⊤} β}{η}; ν)]^{α},

and for observations with $Y_{i}^{o} > 0$ we have $Y_{i}^{o} \sim P T (x_{i}^{⊤} β, η, α, ν)$. Therefore, the likelihood function of the PTCR model based on the observed sample $Y^{o} = (Y_{1}^{o}, \dots, Y_{n}^{o})^{⊤}$ is given by

\begin{aligned} L (θ; Y^{o}) = {} & \prod_{i = 1}^{n} [F_{T} (- \frac{x_{i}^{⊤} β}{η}; ν)]^{α (1 - d_{i})} \\ & \times \{ \frac{α}{η} f_{T} (\frac{y_{i} - x_{i}^{⊤} β}{η}; ν) [F_{T} (\frac{y_{i} - x_{i}^{⊤} β}{η}; ν)]^{α - 1} \}^{d_{i}} \end{aligned}

(13)

where

d_{i} = \begin{cases} 1, & \text{if } Y_{i}^{o} > 0, \\ 0, & \text{if } Y_{i}^{o} = 0. \end{cases}

Model (12) can be extended to the situation where the censoring point for observation $i$ is a known value $k_{i}$ instead of zero, i.e.,

Y_{i}^{o} = \begin{cases} x_{i}^{⊤} β + ε_{i}, & \text{if } Y_{i} > k_{i}, \\ k_{i}, & \text{if } Y_{i} \leq k_{i}, \end{cases}

(14)

for $i = 1, \dots, n$ . Note that, by making $Y_{i}^{o *} = Y_{i}^{o} - k_{i}$ , $x_{i}^{*} = (x_{i}^{⊤}, k_{i})^{⊤}$ and $𝛃^{*} = {(𝛃^{⊤}, - 1)}^{⊤}$ , we have the previous model in (12), hence, the results of the inference based on the ML method can be used to fit the more general model in (14).
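The reduction of (14) to (12) can be written out in one line. The display below is a worked restatement of the substitution described above, using only the symbols already defined; the closing remark about fixing the last coefficient is an interpretation rather than a statement from the source.

```latex
Y_{i} - k_{i} = x_{i}^{⊤} β - k_{i} + ε_{i}
  = (x_{i}^{⊤}, k_{i}) (β^{⊤}, - 1)^{⊤} + ε_{i}
  = x_{i}^{* ⊤} β^{*} + ε_{i},
\qquad
Y_{i} > k_{i} \iff Y_{i} - k_{i} > 0,
% so that
Y_{i}^{o *} = Y_{i}^{o} - k_{i} =
\begin{cases} x_{i}^{* ⊤} β^{*} + ε_{i}, & \text{if } Y_{i} - k_{i} > 0, \\ 0, & \text{if } Y_{i} - k_{i} \leq 0, \end{cases}
```

which has the form of (12). Under this reading, the last component of $𝛃^{*}$ is held fixed at $- 1$ rather than estimated.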

Proposition 3. Consider the model (12) with assumption $ε_{i} \overset{i i d}{\sim} P T (0, η, α, ν)$ , for $i = 1, \dots, n$ , then

(i) if $α = 1$, model (12) reduces to the Student-$t$ censored regression (TCR) model;
(ii) if $ν \to \infty$, model (12) converges to the power-normal censored regression (PNCR) model; see Martínez-Flórez et al. 2013;
(iii) if $α = 1$ and $ν \to \infty$, model (12) converges to the usual normal censored regression (NCR) model, that is, the Tobit model.

Proof. Parts (i)-(iii) follow directly from the definition of the PTCR model. ◻

Moments

Proposition 4. The mean and variance for the $i$ th observed response in PTCR model are given by

\begin{aligned} E [Y_{i}^{o}] & = η (c_{i} + m_{1} (c_{i}; ν)) (1 - [F_{T} (- c_{i}; ν)]^{α}) \end{aligned}

(15)

and

\begin{aligned} V a r [Y_{i}^{o}] = {} & η^{2} (c_{i}^{2} + 2 c_{i} m_{1} (c_{i}; ν)) [F_{T} (- c_{i}; ν)]^{α} (1 - [F_{T} (- c_{i}; ν)]^{α}) \\ & + η^{2} (m_{2} (c_{i}; ν) - [m_{1} (c_{i}; ν)]^{2} (1 - [F_{T} (- c_{i}; ν)]^{α})) (1 - [F_{T} (- c_{i}; ν)]^{α}) \end{aligned}

(16)

respectively, where $c_{i} = μ_{i} / η$ with $μ_{i} = x_{i}^{⊤} β$ and

m_{r} (c; ν) = \frac{1}{1 - [F_{T} (- c; ν)]^{α}} \int_{F_{T} (- c; ν)}^{1} [F_{T}^{- 1} (u; ν)]^{r} α u^{α - 1} d u, \quad r = 1, 2 .

Note that $m_{r} (c; ν)$ is the moment $E [{(F_{T}^{- 1} (U; ν))}^{r}]$ of a random variable $U$ with distribution $B e t a (α, 1)$ truncated to the interval $(F_{T} (- c; ν), 1)$.

Proof. The mean and variance for the $i$ th observed response in PTCR model can be obtained by noting that $Y_{i}^{o} = D_{i} Y_{i}$ , where $D_{i} = I_{(0, + \infty)} (Y_{i})$ and $Y_{i} = μ_{i} + η Z_{i}$ , with $μ_{i} = x_{i}^{⊤} β$ and $Z_{i} \overset{iid}{\sim} P T (α, ν)$ , $i = 1, \dots, n$ . We have

\begin{aligned} E [Y_{i}^{o}] & = E [Y_{i} ∣ Y_{i} > 0] P (Y_{i} > 0) \\ & = E [μ_{i} + η Z_{i} ∣ μ_{i} + η Z_{i} > 0] P (μ_{i} + η Z_{i} > 0) \\ & = (μ_{i} + η E [Z_{i} ∣ Z_{i} > - \frac{μ_{i}}{η}]) P (Z_{i} > - \frac{μ_{i}}{η}) \\ & = η (\frac{μ_{i}}{η} + E [Z_{i} ∣ Z_{i} + \frac{μ_{i}}{η} > 0]) (1 - P (Z_{i} \leq - \frac{μ_{i}}{η})) \\ & = η (c_{i} + E [Z_{i} ∣ Z_{i} + c_{i} > 0]) (1 - [F_{T} (- c_{i}; ν)]^{α}) \end{aligned}

By Lemma 1 in Appendix A, we have

E [Y_{i}^{o}] = η (c_{i} + m_{1} (c_{i}; ν)) (1 - [F_{T} (- c_{i}; ν)]^{α})

To obtain the variance of $Y_{i}^{o}$ , note that

\begin{aligned} E [(Y_{i}^{o})^{2}] & = E [Y_{i}^{2} ∣ Y_{i} > 0] P (Y_{i} > 0) \\ & = E [(μ_{i} + η Z_{i})^{2} ∣ μ_{i} + η Z_{i} > 0] P (μ_{i} + η Z_{i} > 0) \\ & = (μ_{i}^{2} + 2 η μ_{i} E [Z_{i} ∣ Z_{i} > - \frac{μ_{i}}{η}] + η^{2} E [Z_{i}^{2} ∣ Z_{i} > - \frac{μ_{i}}{η}]) P (Z_{i} > - \frac{μ_{i}}{η}) \\ & = η^{2} (\frac{μ_{i}^{2}}{η^{2}} + 2 \frac{μ_{i}}{η} E [Z_{i} ∣ Z_{i} > - \frac{μ_{i}}{η}] + E [Z_{i}^{2} ∣ Z_{i} > - \frac{μ_{i}}{η}]) (1 - P (Z_{i} \leq - \frac{μ_{i}}{η})) \\ & = η^{2} (c_{i}^{2} + 2 c_{i} E [Z_{i} ∣ Z_{i} + c_{i} > 0] + E [Z_{i}^{2} ∣ Z_{i} + c_{i} > 0]) (1 - [F_{T} (- c_{i}; ν)]^{α}) \end{aligned}

Using Lemma 1 in Appendix A, it follows that

E [(Y_{i}^{o})^{2}] = η^{2} (c_{i}^{2} + 2 c_{i} m_{1} (c_{i}; ν) + m_{2} (c_{i}; ν)) (1 - [F_{T} (- c_{i}; ν)]^{α})

thus, by calculating $V a r [Y_{i}^{o}] = E [{(Y_{i}^{o})}^{2}] - (E [Y_{i}^{o}])^{2}$ and after some algebraic manipulations, we obtain

\begin{aligned} V a r [Y_{i}^{o}] = {} & η^{2} (c_{i}^{2} + 2 c_{i} m_{1} (c_{i}; ν)) [F_{T} (- c_{i}; ν)]^{α} (1 - [F_{T} (- c_{i}; ν)]^{α}) \\ & + η^{2} (m_{2} (c_{i}; ν) - [m_{1} (c_{i}; ν)]^{2} (1 - [F_{T} (- c_{i}; ν)]^{α})) (1 - [F_{T} (- c_{i}; ν)]^{α}) \end{aligned}

◻

It is important to note that, if $ν$ tends to infinity, then (15) and (16) converge to the mean and variance of the PNCR model (Martínez-Flórez et al. 2013), i.e., when $ν \to \infty$,

\begin{aligned} E [Y_{i}^{o}] = {} & η (c_{i} + m_{1} (c_{i})) (1 - [Φ (- c_{i})]^{α}) \\ V a r [Y_{i}^{o}] = {} & η^{2} (c_{i}^{2} + 2 c_{i} m_{1} (c_{i})) [Φ (- c_{i})]^{α} (1 - [Φ (- c_{i})]^{α}) \\ & + η^{2} (m_{2} (c_{i}) - [m_{1} (c_{i})]^{2} (1 - [Φ (- c_{i})]^{α})) (1 - [Φ (- c_{i})]^{α}) \end{aligned}

where

m_{r} (c) = \frac{1}{1 - [Φ (- c)]^{α}} \int_{Φ (- c)}^{1} [Φ^{- 1} (u)]^{r} α u^{α - 1} d u,

with $Φ (\cdot)$ the CDF of the standard normal distribution and $Φ^{- 1} (\cdot)$ its inverse. It is also worth noting that, if $α = 1$ and $ν \to \infty$, we have $m_{1} (c_{i}; ν) \to ϕ (c_{i}) / Φ (c_{i})$ and $m_{2} (c_{i}; ν) \to (Φ (c_{i}) - c_{i} ϕ (c_{i})) / Φ (c_{i})$, thus

\begin{aligned} E [Y_{i}^{o}] = {} & η (c_{i} Φ (c_{i}) + ϕ (c_{i})) \\ V a r [Y_{i}^{o}] = {} & η^{2} (c_{i}^{2} Φ (c_{i}) + 2 c_{i} ϕ (c_{i})) (1 - Φ (c_{i})) \\ & + η^{2} (Φ (c_{i}) - ϕ (c_{i}) [c_{i} + ϕ (c_{i})]) \end{aligned}

which are the mean and variance, respectively, of the Tobit model (Tobin 1958).
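The Tobit limit is easy to check numerically. The sketch below, which uses only the Python standard library, evaluates the closed-form mean and variance for $Y_{i} \sim N (μ_{i}, η^{2})$ censored at zero and compares them with a seeded simulation; the function names, sample size and seed are illustrative choices, not part of the paper.

```python
import math
import random


def norm_pdf(x):
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2 * math.pi)


def norm_cdf(x):
    """Standard normal CDF via the error function."""
    return 0.5 * (1 + math.erf(x / math.sqrt(2)))


def tobit_mean_var(mu, eta):
    """Mean and variance of Y^o = max(Y, 0), Y ~ N(mu, eta^2), with c = mu / eta."""
    c = mu / eta
    Phi, phi = norm_cdf(c), norm_pdf(c)
    mean = eta * (c * Phi + phi)
    var = eta ** 2 * ((c * c * Phi + 2 * c * phi) * (1 - Phi)
                      + Phi - phi * (c + phi))
    return mean, var


def simulate_censored_normal(mu, eta, n=200_000, seed=1):
    """Monte Carlo estimate of the same two quantities (illustrative check)."""
    rng = random.Random(seed)
    draws = [max(rng.gauss(mu, eta), 0.0) for _ in range(n)]
    m = sum(draws) / n
    v = sum((d - m) ** 2 for d in draws) / (n - 1)
    return m, v


if __name__ == "__main__":
    print(tobit_mean_var(1.0, 2.0))
    print(simulate_censored_normal(1.0, 2.0))
```

For $μ_{i} = 0$ and $η = 1$ the formulas give $E [Y_{i}^{o}] = 1 / \sqrt{2 π}$ and $V a r [Y_{i}^{o}] = 1 / 2 - 1 / (2 π)$, the familiar moments of $\max (Z, 0)$ for a standard normal $Z$.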

Maximum Likelihood Estimation

The ML method is applied by using the reparameterization of Olsen 1978. Let $𝛄 = σ 𝛃$, $σ = 1 / η$ and

d_{i} = \begin{cases} 1, & \text{if } Y_{i}^{o} > k_{i}, \\ 0, & \text{if } Y_{i}^{o} = k_{i}, \end{cases}

then the log-likelihood function for $𝛗 = {(𝛄^{⊤}, σ, α, ν)}^{⊤}$ obtained from (13) under the new parameterization is given by

\begin{aligned} ℓ (φ; Y^{o}) = {} & α \sum_{i = 1}^{n} (1 - d_{i}) \log \{ 1 - F_{T} (c_{i}; ν) \} \\ & + \sum_{i = 1}^{n} d_{i} \{ \log α + \log σ + \log f_{T} (z_{i}; ν) + (α - 1) \log F_{T} (z_{i}; ν) \} \end{aligned}

(17)

where $z_{i} = σ y_{i} - c_{i}$, with $c_{i} = x_{i}^{⊤} γ$. The components of the score function $𝐔 (𝛗)$ are obtained by differentiating $ℓ (φ; Y^{o})$ with respect to the components $𝛄$, $σ$, $α$ and $ν$. After some algebraic manipulations, the following components of the score function are obtained:

\begin{aligned} U (γ) = {} & - α \sum_{i = 1}^{n} (1 - d_{i}) r (- c_{i}; ν) x_{i} \\ & + \sum_{i = 1}^{n} d_{i} \{ (\frac{ν + 1}{ν}) (1 + \frac{z_{i}^{2}}{ν})^{- 1} z_{i} - (α - 1) r (z_{i}; ν) \} x_{i} \end{aligned}

(18)

U (σ) = \sum_{i = 1}^{n} d_{i} \{ \frac{1}{σ} - (\frac{ν + 1}{ν}) (1 + \frac{z_{i}^{2}}{ν})^{- 1} z_{i} y_{i} + (α - 1) r (z_{i}; ν) y_{i} \}

(19)

U (α) = \sum_{i = 1}^{n} (1 - d_{i}) \log \{ 1 - F_{T} (c_{i}; ν) \} + \sum_{i = 1}^{n} d_{i} \{ \frac{1}{α} + \log F_{T} (z_{i}; ν) \}

(20)

\begin{aligned} U (ν) = {} & - \frac{α}{2} \sum_{i = 1}^{n} (1 - d_{i}) \{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - b_{01} (c_{i}; ν) - \frac{c_{i} r (c_{i}; ν)}{ν} \} R (c_{i}; ν) \\ & + \frac{1}{2} \sum_{i = 1}^{n} d_{i} \{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - \frac{1}{ν} - \log (1 + \frac{z_{i}^{2}}{ν}) \} \\ & + \frac{1}{2} \sum_{i = 1}^{n} d_{i} \{ (\frac{ν + 1}{ν}) (1 + \frac{z_{i}^{2}}{ν})^{- 1} \frac{z_{i}^{2}}{ν} \} \\ & + \frac{α - 1}{2} \sum_{i = 1}^{n} d_{i} \{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - b_{01} (z_{i}; ν) - \frac{z_{i} r (z_{i}; ν)}{ν} \} \end{aligned}

(21)

where $r (x; ν) = f_{T} (x; ν) / F_{T} (x; ν)$ , $R (x; ν) = F_{T} (x; ν) / (1 - F_{T} (x; ν))$ , $ψ (x) = \frac{d}{d x} \log Γ (x)$ is the digamma function and $b_{m n} (c_{m}; ν + m)$ is the truncated moment defined by (8). Note that if $α = 1$ , equations (18)-(21) reduce to the score functions of the TCR model (Arellano-Valle et al. 2012), while if $α = 1$ and $ν \to \infty$ , then $r (c; ν) \to r (c) = ϕ (c) / Φ (c)$ , $R (c; ν) \to R (c) = Φ (c) / (1 - Φ (c))$ and $U (ν) \to 0$ , so that equations (18) and (19) reduce to the score functions of the Tobit model.
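As a numerical sanity check, the log-likelihood (17) and the score component (20) can be coded directly and compared by finite differences. The sketch below is only illustrative: the function names are hypothetical, the Student-t CDF is approximated by Simpson quadrature (an assumption made to keep the example dependency-free, not the authors' implementation), and `d[i] = 1` marks an observed response while `d[i] = 0` marks one left-censored at zero.

```python
import math

def t_pdf(x, nu):
    """Density of the standard Student-t distribution with nu degrees of freedom."""
    log_c = (math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)
             - 0.5 * math.log(nu * math.pi))
    return math.exp(log_c - (nu + 1) / 2 * math.log1p(x * x / nu))

def t_cdf(x, nu, steps=2000):
    """Student-t CDF as 0.5 + integral of the density from 0 to x.

    Assumption: composite Simpson rule with an even number of steps;
    adequate for moderate |x|, not a production-quality routine.
    """
    if x == 0.0:
        return 0.5
    h = x / steps  # signed step, so negative x gives a negative integral
    total = t_pdf(0.0, nu) + t_pdf(x, nu)
    for j in range(1, steps):
        total += (4 if j % 2 else 2) * t_pdf(j * h, nu)
    return 0.5 + total * h / 3

def loglik(gamma, sigma, alpha, nu, y, X, d):
    """Log-likelihood (17) in the parameterization phi = (gamma, sigma, alpha, nu)."""
    total = 0.0
    for yi, xi, di in zip(y, X, d):
        c = sum(g * v for g, v in zip(gamma, xi))  # c_i = x_i' gamma
        if di == 0:
            total += alpha * math.log(1.0 - t_cdf(c, nu))
        else:
            z = sigma * yi - c                      # z_i = sigma * y_i - c_i
            total += (math.log(alpha) + math.log(sigma) + math.log(t_pdf(z, nu))
                      + (alpha - 1) * math.log(t_cdf(z, nu)))
    return total

def score_alpha(gamma, sigma, alpha, nu, y, X, d):
    """Score component U(alpha) from equation (20)."""
    total = 0.0
    for yi, xi, di in zip(y, X, d):
        c = sum(g * v for g, v in zip(gamma, xi))
        if di == 0:
            total += math.log(1.0 - t_cdf(c, nu))
        else:
            total += 1.0 / alpha + math.log(t_cdf(sigma * yi - c, nu))
    return total
```

A central difference of `loglik` in `alpha` should agree with `score_alpha` up to rounding error, which gives a quick check that the two expressions are consistent.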

The elements of the observed information matrix $J_{φ}$ for the PTCR model, denoted by $j_{𝛗_{i} 𝛗_{j}}$ , are obtained from the second partial derivatives of the log-likelihood function (17), i.e., $j_{φ_{i} φ_{j}} = - \partial^{2} ℓ (φ; Y^{o}) / \partial φ_{i} \partial φ_{j}$ , while the expected information matrix is obtained as $I_{φ} = E [J_{φ}]$ , which involves truncated expected values that have no closed form and must be computed numerically. Appendix B presents the expressions for the elements of the matrices $I_{φ}$ and $J_{φ}$ . The expected information matrix $I_{θ}$ of the original parameterization $𝛉 = {(𝛃, η, α, ν)}^{⊤}$ can be recovered by using $I_{θ} = (\partial φ / \partial θ)^{⊤} I_{φ} (\partial φ / \partial θ)$ , where

\frac{\partial φ}{\partial θ} = \left( \begin{array}{cccc} \frac{1}{η} I_{p} & - \frac{1}{η^{2}} β & 0 & 0 \\ 0 & - \frac{1}{η^{2}} & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{array} \right)

Finally, the ML estimates for $𝛉 = {(𝛃, η, α, ν)}^{⊤}$ can be obtained using iterative methods based on the Newton-Raphson algorithm from the score functions (18)-(21) and applying the inverse transformation $𝛃 = 𝛄 / σ$ and $η = 1 / σ$ . Estimates of the variances of the estimators can be obtained by evaluating the inverse of the observed information matrix, $J_{φ}^{- 1}$ , at the ML estimates $\hat{𝛗} = {(\hat{𝛄}, \hat{σ}, \hat{α}, \hat{ν})}^{⊤}$ and then applying the change of parameterization given above.
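The change of parameterization can be sketched in a few lines. The helper names below are hypothetical and matrices are stored as plain nested lists; the code only illustrates the inverse transformation $𝛃 = 𝛄 / σ$ , $η = 1 / σ$ and the product $(\partial φ / \partial θ)^{⊤} I_{φ} (\partial φ / \partial θ)$ with the Jacobian displayed above.

```python
def to_original(gamma, sigma):
    """Map (gamma, sigma) back to (beta, eta) via beta = gamma / sigma, eta = 1 / sigma."""
    eta = 1.0 / sigma
    return [g * eta for g in gamma], eta

def jacobian_phi_theta(beta, eta):
    """Jacobian d(phi)/d(theta) for phi = (gamma, sigma, alpha, nu), theta = (beta, eta, alpha, nu)."""
    p = len(beta)
    m = p + 3
    jac = [[0.0] * m for _ in range(m)]
    for r in range(p):
        jac[r][r] = 1.0 / eta             # d gamma_r / d beta_r
        jac[r][p] = -beta[r] / eta ** 2   # d gamma_r / d eta
    jac[p][p] = -1.0 / eta ** 2           # d sigma / d eta
    jac[p + 1][p + 1] = 1.0               # alpha is unchanged
    jac[p + 2][p + 2] = 1.0               # nu is unchanged
    return jac

def transform_information(info_phi, jac):
    """Return jac^T * info_phi * jac, the information matrix in the theta parameterization."""
    m = len(jac)
    left = [[sum(jac[k][i] * info_phi[k][j] for k in range(m)) for j in range(m)]
            for i in range(m)]
    return [[sum(left[i][k] * jac[k][j] for k in range(m)) for j in range(m)]
            for i in range(m)]
```

For example, `to_original([4.0, 7.0], 0.5)` returns `([8.0, 14.0], 2.0)`.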

Model Selection and Residual Analysis

In this section some criteria for the selection of the best-fitted model and a methodology for residual analysis are proposed.

Model Selection

Several model selection tools are in general use, such as the Akaike information criterion (AIC) (Akaike 1974), the Bayesian information criterion (BIC) (Schwarz 1978) and the corrected AIC (AICc) (Sugiura 1978), which are defined by

- 2 ℓ (\hat{θ} ∣ y) + c (k, n)

where $c (k, n)$ is a penalty term that depends on the number $k$ of free parameters estimated in the model and on the number $n$ of observations in the sample. For the AIC one has $c (k, n) = 2 k$ , for the BIC $c (k, n) = k \log (n)$ and for the AICc $c (k, n) = 2 k + 2 k (k + 1) / (n - k - 1)$ , that is, the AIC penalty plus a finite-sample correction. For each criterion, the model with the smallest value is regarded as the best fitted.
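The three criteria are simple functions of the maximized log-likelihood. The following sketch uses a hypothetical helper name and takes the AICc in its standard form (the AIC plus the correction $2 k (k + 1) / (n - k - 1)$ ); the numbers in the usage note are made up for illustration and are not results from the paper.

```python
import math

def information_criteria(loglik, k, n):
    """Return AIC, BIC and AICc for a fit with maximized log-likelihood `loglik`,
    k free parameters and n observations (requires n > k + 1 for the AICc)."""
    aic = -2.0 * loglik + 2.0 * k
    bic = -2.0 * loglik + k * math.log(n)
    aicc = aic + 2.0 * k * (k + 1) / (n - k - 1)  # finite-sample correction
    return {"AIC": aic, "BIC": bic, "AICc": aicc}
```

With an illustrative `loglik = -100.0`, `k = 3` and `n = 50`, the AIC is 206.0 and the AICc is slightly larger, since the correction term is positive whenever `n > k + 1`.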

Residual Analysis

Residual analysis aims to detect atypical observations and to assess the assumptions of the model. It may include formal tests for departures from those assumptions as well as informal plots that display general characteristics of the residuals.

Following Garay et al. 2017 and Arellano-Valle et al. 2012, in this work we consider the transformed martingale residuals $r_{M T i}$ proposed by Barros et al. 2010 as a diagnostic tool to evaluate deviations from the postulated model for the response variable, as well as to detect the presence of atypical observations. The residuals are defined as

r_{M T i} = \operatorname{sign} (r_{M i}) \sqrt{- 2 [r_{M i} + δ_{i} \log (δ_{i} - r_{M i})]}, \quad i = 1, \dots, n

where $r_{M i} = δ_{i} + \log S (y_{i}; \hat{𝛉})$ is the martingale residual proposed by Ortega et al. 2003, $δ_{i} = 0, 1$ indicates whether the $i$ th observation is censored or not, respectively, $\operatorname{sign} (r_{M i})$ denotes the sign of $r_{M i}$ and $S (y_{i}; \hat{𝛉}) = P_{\hat{𝛉}} (Y_{i} > y_{i})$ represents the survival function evaluated at $y_{i}$ , where $\hat{𝛉}$ is the ML estimate of $𝛉$ .

As suggested by Garay et al. 2017, this type of standardized residual is used because it is symmetrically distributed around zero, which facilitates the construction of simulated envelopes with little computational effort and is useful for detecting an incorrect specification of the model, as well as the presence of atypical observations.
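The residual definition can be sketched as follows, assuming the fitted survival probabilities $S (y_{i}; \hat{𝛉})$ have already been computed elsewhere. The function names are hypothetical, and the convention that the term $δ_{i} \log (δ_{i} - r_{M i})$ equals zero for censored observations ( $δ_{i} = 0$ ) is made explicit in the code.

```python
import math

def martingale_residual(delta, surv):
    """r_M = delta + log S, with delta = 0 (censored) or 1 (observed) and 0 < S <= 1."""
    if not 0.0 < surv <= 1.0:
        raise ValueError("survival probability must lie in (0, 1]")
    return delta + math.log(surv)

def transformed_martingale_residual(delta, surv):
    """r_MT = sign(r_M) * sqrt(-2 * [r_M + delta * log(delta - r_M)])."""
    r_m = martingale_residual(delta, surv)
    if delta == 1:
        if surv >= 1.0:
            raise ValueError("an observed response needs S < 1")
        inner = r_m + math.log(delta - r_m)  # delta - r_M = -log S > 0
    else:
        inner = r_m                          # the delta * log(.) term vanishes
    sign = (r_m > 0) - (r_m < 0)
    return sign * math.sqrt(max(0.0, -2.0 * inner))
```

For a censored observation the residual is never positive, because $r_{M i} = \log S \le 0$ ; for an observed response it can take either sign.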

SIMULATION STUDIES

Simulation Study 1: Robustness of the Maximum Likelihood Estimates

In this section, we compare the performance of the estimates for the PTCR model in the presence of outliers in the response variable. Following Garay et al. 2017 and Mattos et al. 2018, we performed a simulation study based on the NCR model. Specifically, we considered (12) with $x_{i}^{⊤} = (1, x_{i})$ and $ε_{i} \sim N (0, η^{2})$ for $i = 1, \dots, n$ . As in Garay et al. 2017, we generated 1000 artificial samples of size $n = 300$ , considering $𝛃 = {(β_{1}, β_{2})}^{⊤} = {(2, 3.5)}^{⊤}$ , $η = 2$ and fixing the left censoring level at $p = 10, 20$ and 30% (that is, 10, 20 and 30% of the observations in each data set were left censored, respectively). We generated the values $x_{i}$ , for $i = 1, \dots, n$ , independently from a uniform distribution on the interval (2, 20). These values were kept fixed throughout the simulations.
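One way to generate a single artificial sample of this kind is sketched below. The function name is hypothetical, and the censoring mechanism is an assumption: the threshold is taken as the empirical order statistic that leaves a fraction $p$ of the latent responses at or below it, which may differ from the mechanism used by the authors. For simplicity the covariate is drawn inside the function; in the study it is generated once and reused.

```python
import random

def simulate_ncr(n=300, beta=(2.0, 3.5), eta=2.0, p=0.10, seed=1):
    """Simulate one left-censored sample from the normal censored regression model."""
    rng = random.Random(seed)
    x = [rng.uniform(2.0, 20.0) for _ in range(n)]
    # Latent responses y* = beta_1 + beta_2 * x + eps, with eps ~ N(0, eta^2).
    y_star = [beta[0] + beta[1] * xi + rng.gauss(0.0, eta) for xi in x]
    n_cens = int(round(p * n))
    # Assumption: censor the n_cens smallest latent responses at their maximum.
    cutoff = sorted(y_star)[n_cens - 1] if n_cens > 0 else float("-inf")
    y = [max(v, cutoff) for v in y_star]           # observed (possibly censored) response
    d = [0 if v <= cutoff else 1 for v in y_star]  # 1 = observed, 0 = left-censored
    return x, y, d, cutoff
```

Calling `simulate_ncr(p=0.20, seed=7)` returns 300 covariate values, the observed responses, the censoring indicators and the threshold, with 60 of the responses censored.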

To assess how much the ML estimates are influenced by the presence of outliers, we replaced the observation $y_{150}$ by $y_{150} (δ) = y_{150} + δ$ , with $δ = 1, 2, \dots, 10$ . Let ${\hat{β}}_{i} (δ)$ and ${\hat{β}}_{i}$ be the ML estimates of $β_{i}$ with and without contamination, respectively, $i = 1, 2$ . We are particularly interested in the relative changes

RC ({\hat{β}}_{i} (δ)) = \left| \frac{{\hat{β}}_{i} (δ) - {\hat{β}}_{i}}{{\hat{β}}_{i}} \right|

We define the relative changes for $η$ analogously. For each replication we obtained the parameter estimates with and without outliers, under the PTCR model. Table II and Figure 2 depict the average values of the relative changes across all samples and different censoring levels.
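The relative change and its average over replications are straightforward to compute once the two sets of estimates are available. In this sketch the function names are hypothetical, and each pair holds the estimate with contamination followed by the estimate without it for one replication.

```python
def relative_change(est_contaminated, est_clean):
    """RC = |(estimate with contamination - estimate without) / estimate without|."""
    return abs((est_contaminated - est_clean) / est_clean)

def average_relative_change(pairs):
    """Average RC over replications; `pairs` holds (contaminated, clean) estimates."""
    return sum(relative_change(a, b) for a, b in pairs) / len(pairs)
```

For instance, an estimate that moves from 3.5 to 3.6 after contamination has a relative change of about 0.029.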

Table II
Average relative changes in the estimates for different contaminations $δ$ and censoring levels $p$ .

Figure 2
Simulation study 1. Average relative changes in the estimates for different contaminations $δ$ and censoring levels.

We observe that the influence increases dramatically as $δ$ increases for $p = 10 %$ , especially for the parameter. However, for $p = 20$ and 30% these measures vary little, which indicates that the PTCR model is more robust to discrepant observations in these cases.

Simulation Study 2: Asymptotic properties

To study the performance of the ML estimator $\hat{𝛉} = {({\hat{𝛃}}^{⊤}, \hat{η}, \hat{α}, \hat{ν})}^{⊤}$ , a Monte Carlo simulation study with sample sizes $n =$ 150, 300, 750, and 1000 is presented. We considered the PTCR model defined in Section “Censored Power-Student-t Regression Model” with $x_{i}^{⊤} = (1, x_{i})$ for $i = 1, \dots, n$ . The true values of the parameters were taken as $𝛃 =$ (2,1.5) $^{⊤}$ , $η =$ 1.5, $α =$ 0.4, 2.5 and $ν =$ 3.0. We also considered censoring levels equal to $p =$ 0, 10, 25 and 45 $%$ . The covariate $x_{i}$ was generated from a uniform distribution $U$ (0.1,20), as considered in Garay et al. 2017. For each combination of parameters, sample sizes and censoring levels, 2000 samples of the PTCR model were generated with errors $ε_{i} \sim P T (0, η, α, ν)$ . To evaluate the performance of the estimators, the absolute value of the relative bias (RB) and the mean square error (MSE) were considered, which are given by

RB ({\hat{θ}}_{i}) = \left| \frac{1}{2000} \sum_{j = 1}^{2000} (\frac{{\hat{θ}}_{i}^{(j)}}{θ_{i}} - 1) \right|, \quad MSE ({\hat{θ}}_{i}) = \frac{1}{2000} \sum_{j = 1}^{2000} {({\hat{θ}}_{i}^{(j)} - θ_{i})}^{2},

respectively, where ${\hat{θ}}_{i}^{(j)}$ is the estimate of $θ_{i}$ for the $j$ th sample, for $θ_{i} \in 𝛉 = {(𝛃^{⊤}, η, α, ν)}^{⊤}$ . The ML estimates of the parameters were calculated by using the optim function of R (R Development Core Team 2018). The optimization of the likelihood function was done using iterative methods based on the Newton-Raphson algorithm by using the score functions.
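Given the estimates of one parameter across the Monte Carlo replications, the two performance measures can be computed as in the sketch below. The function names are hypothetical, and the relative bias is reported in absolute value, as stated in the text.

```python
def relative_bias(estimates, true_value):
    """Absolute value of the average of (estimate / true value - 1) over replications."""
    return abs(sum(e / true_value - 1.0 for e in estimates) / len(estimates))

def mean_squared_error(estimates, true_value):
    """Average squared deviation of the estimates from the true value."""
    return sum((e - true_value) ** 2 for e in estimates) / len(estimates)
```

With the illustrative estimates `[1.4, 1.6, 1.5, 1.7]` of a parameter whose true value is 1.5, the relative bias is about 0.033 and the MSE is 0.015.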

It can be seen from Table III that RB and MSE tend to decrease as $n$ increases, indicating that estimates based on the ML method have good asymptotic properties. This pattern is the same for all the censoring levels $p$ under consideration. Note that when the sample size is $n = 150$ , the estimates of the $β_{0}$ parameter are unstable (in terms of MSE) because they are affected by the bias of the asymmetry parameter $α$ ; however, as the sample size increases, the estimates become more stable. This problem is common in models of this type (see, for example, Martínez-Flórez et al. 2013), so we recommend moderate to large sample sizes for them.

Table III
Performance of the ML estimators of $β_{0}$ , $β_{1}$ , η, α and ν for the PTCR model.

REAL DATA APPLICATIONS

Application 1: Wage rate

To illustrate the proposed model, we consider a data set described by Mroz 1987. The data set consists of 753 married white women aged between 30 and 60 years in 1975, of whom 428 worked at some point during that year. The response variable used in this application is the wage rate, a measure of the wife's wage known as the average hourly earnings. In the data set, the wage rate is set equal to zero for the wives who did not work in 1975. Therefore, these observations are considered left-censored at zero.

The covariates considered were the wife’s age $(X_{1})$ , years of schooling $(X_{2})$ , the number of children younger than six years old in the household $(X_{3})$ , and the number of children between six and nineteen years old $(X_{4})$ . These data were analyzed previously by Arellano-Valle et al. 2012 using a TCR model and later by Garay et al. 2017 using the Scale Mixture of Normal Censored Regression (SMNCR) models. We analyzed the data set by fitting a PTCR model and we compare our proposal with the SMNCR models of Garay et al. 2017: the Student- $t$ censored regression model (TCR) (Arellano-Valle et al. 2012), the Slash censored regression model (SLCR), and the normal censored regression model (NCR), that is, the usual Tobit model. Table IV shows the skewness and kurtosis indexes for the complete data and also for the uncensored observations. Notice that the values of the skewness and kurtosis indexes justify using the PTCR model.

Table IV
Statistical summary for wage rate data.

Table V presents the parameter estimates together with their corresponding standard errors (SE) for the PTCR, TCR, SLCR and NCR models. To fit the PTCR model, we use the R code in Appendix C. Table VI presents some model selection criteria, together with the values of the log-likelihood. According to the AIC, BIC and AICc criteria, the PTCR model seems to yield a better fit to Mroz’s data than the SMNCR models (TCR and SLCR models) and the usual Tobit model (NCR model), supporting the contention of a departure from symmetry of the errors. Also, the SE of the PTCR model are smaller than the SE of the SMNCR and NCR models.

Table V
Parameters and standard errors (SE) of the PTCR, TCR, SLCR and NCR models fitted to Wage rate data.

Table VI
Wage rate data. Model selection criteria.

A more emphatic indication that an asymmetric model should be considered comes from testing the hypothesis of a TCR model against an asymmetric alternative (the PTCR model), that is,

H_{0} : α = 1 versus H_{1} : α \neq 1,

by using the likelihood ratio (LR) statistic, $- 2 \log (Λ) = - 2 (ℓ_{T C R} (\hat{𝛉}) - ℓ_{P T C R} (\hat{𝛉}))$ . For the data set under study this gives $- 2 \log (Λ) = 52.41$ , which exceeds the 5% critical value of the chi-square distribution with one degree of freedom, $χ_{1, 95 %}^{2} = 3.84$ . This is an indication that the PTCR model fits Mroz’s data better than the ordinary TCR model.
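The test can be reproduced from two maximized log-likelihoods. The sketch below uses a hypothetical function name and relies on the identity $P (χ_{1}^{2} > x) = \operatorname{erfc} (\sqrt{x / 2})$ , which holds only for one degree of freedom. The log-likelihood values in the usage note are placeholders chosen to give a statistic of 52.41; they are not the values fitted in the paper.

```python
import math

def lr_test_one_df(loglik_null, loglik_alt):
    """Likelihood ratio statistic and p-value for a single restriction (here alpha = 1)."""
    stat = -2.0 * (loglik_null - loglik_alt)
    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(max(stat, 0.0) / 2.0))
    return stat, p_value
```

Calling `lr_test_one_df(-1000.0, -973.795)` gives a statistic of about 52.41 and a p-value far below 0.05, so the restriction `alpha = 1` would be rejected at the 5% level.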

Finally, in order to check for any incorrect specification in the assumptions of the fitted model, the simulated envelope graphs for the transformed martingale residuals are shown in Figure 3. This figure indicates that the PTCR model is, apparently, more suitable for fitting these data than the SMNCR models. It can also be observed that the SMNCR models with heavy tails fit the data better than the NCR model, since few observations fall outside their envelopes.

Figure 3
Wage rate data. Envelopes of transformed martingale residuals for PTCR, TCR, SLCR and NCR models.

Application 2: Stellar Abundances Data

The second censored dataset is described in Santos et al. 2002 and is available in the R package astrodatR (Feigelson 2014) under the name Stellar abundances. These data were analyzed by Mattos et al. 2018 using the Scale Mixture of Skew Normal Censored Regression (SMSNCR) models. We analyzed the data set by fitting a PTCR model and, again, we compare our proposal with the SMNCR models of Garay et al. 2017.

The dataset consists of measurements for 68 solar-type stars. For our analysis we followed Mattos et al. 2018 and considered:

log N(Be) as the response variable, which represents the log of the abundance of beryllium scaled to Sun’s abundance (i.e. the Sun has $log N (B e) = 0.0$ )
$T_{eff} / 1000$ as the explanatory variable, which represents the effective stellar surface temperature (in kelvin).

In astronomical research, a previously identified sample of objects (stars, galaxies, quasars, X-ray sources, etc.) is observed at some new wavebands. According to Feigelson 2014, due to limited sensitivities, some objects may be undetected, leading to upper limits in their derived luminosities. For this dataset we have 12 left-censored data points, i.e. 12 undetected beryllium measurements, which represent 19.35% of the observations.

Table VII presents the ML estimates for the parameters of the four models, i.e. the PTCR, TCR, SLCR and NCR models, together with their corresponding standard errors. Table VIII compares the fit of the four models using the model selection criteria (AIC, AICc and BIC). Note that, again, the heavy-tailed PTCR model has a better fit than the TCR, SLCR and NCR models. The QQ-plots and envelopes for the martingale residuals are shown in Figure 4. This figure clearly indicates that the PTCR, TCR and SLCR models are more suitable for modeling the current data than the NCR model, since no observations fall outside their envelopes.

Table VII
Parameters and standard errors (SE) of the PTCR, TCR, SLCR and NCR models fitted to stellar abundances data.

Table VIII
Stellar abundances data. Model selection criteria.

Figure 4
Stellar abundances data. Envelopes of transformed martingale residuals for PTCR, TCR, SLCR and NCR models.

CONCLUSIONS

In this paper, an asymmetric alternative to the Student- $t$ censored regression model of Arellano-Valle et al. 2012 and to the SMNCR models of Garay et al. 2017 has been developed. It is based on the new family of asymmetric and heavy-tailed power- $t$ distributions (Zhao & Kim 2016). Moreover, the ordinary Tobit model and the Student- $t$ censored regression model are special cases. The observed and expected information matrices are obtained analytically, allowing for the direct implementation of inference for this type of model. The problem of estimating the parameters in the model is dealt with by using the maximum likelihood approach, which is also used for developing large sample properties of the estimators. The likelihood ratio statistic can be used to test the TCR model against the PTCR model, since the TCR model is a special case of the model entertained. Applications to the Wage rate data and the Stellar Abundances data indicate that the PTCR model can be a useful alternative to the TCR and SMNCR models.

The proposed PT distribution can also be considered in statistical models based on the scale mixtures of normal family, such as those of Maleki & Nematollahi 2017, to improve their fit. In addition, applying the same construction of an asymmetric distribution to the symmetric version of the Skew-Reflected-Gompertz distribution, recently introduced by Hosseinzadeh et al. 2019, can be considered as future work.

ACKNOWLEDGMENTS

REFERENCES

AKAIKE H. 1974. A new look at statistical model identification. IEEE Trans Automat Contr AU-19(4): 716–722.
ARELLANO-VALLE RB, CASTRO LM, GONZÁLEZ-FARÍAS G & MUÑOZ-GAJARDO KA. 2012. Student- $t$ Censored Regression Model: Properties and Inference. Stat Methods Appl 21(4): 453–473. doi:10.1007/s10260-012-0199-y.
AZZALINI A & CAPITANIO A. 2003. Distributions generated by perturbation of symmetry with emphasis on a multivariate skew $t$ -distribution. J R Stat Soc Series B Stat Methodol 65(2): 367–389.
BARROS M, GALEA M, GONZÁLEZ M & LEIVA V. 2010. Influence Diagnostics in the Tobit Censored Response Model. Stat Methods Appl 19(3): 379–397. doi:10.1007/s10260-010-0135-y.
FAIR RC. 1978. A theory of extramarital affairs. J Polit Econ 86(1): 45–61.
FEIGELSON ED. 2014. astrodatR: Astronomical Data. R package v. 0.1. URL https://cran.r-project.org/web/packages/astrodatR/.
GARAY AM, LACHOS VH, BOLFARINE H & CABRAL CRB. 2017. Linear Censored Regression Models with Scale Mixtures of Normal Distributions. Stat Pap 58(1): 247–278. doi:10.1007/s00362-015-0696-9.
GUPTA RD & GUPTA RC. 2008. Analyzing skewed data by power–normal model. Test 17: 197–210.
HOSSEINZADEH A, MALEKI M, KHODADADI Z & CONTRERAS-REYES JE. 2019. The Skew-Reflected-Gompertz distribution for analyzing the symmetric and asymmetric data. J Comput Appl Math 349: 132–141.
MALEKI M & NEMATOLLAHI AR. 2017. Autoregressive Models with Mixture of Scale Mixtures of Gaussian innovations. Iran J Sci Technol Trans A Sci 41(4): 1099–1107.
MARTÍNEZ-FLÓREZ G, BOLFARINE H & GÓMEZ HW. 2013. The alpha–power tobit model. Commun Stat Theory Methods 42(4): 633–643.
MATTOS T, GARAY AM & LACHOS VH. 2018. Likelihood-based inference for censored linear regression models with scale mixtures of skew-normal distributions. J Appl Stat 45(11): 2039–2066.
MOULTON LH & HALSEY NA. 1995. A Mixture Model With Detection Limits for Regression Analyses of Antibody Response to Vaccine. Biometrics 51: 1570–1578.
MROZ TA. 1987. The Sensitivity of an Empirical Model of Married Women’s Hours of Work to Economic and Statistical Assumptions. Econometrica 55(4): 765–799. doi:10.2307/1911029.
OLSEN RJ. 1978. Note on the Uniqueness of the Maximum Likelihood Estimator for the Tobit Model. Econometrica 46(5): 1211–1215.
ORTEGA EM, BOLFARINE H & PAULA GA. 2003. Influence Diagnostics in Generalized Log-Gamma Regression Models. Comput Stat Data Anal 42: 165–186.
PEWSEY A, GÓMEZ HW & BOLFARINE H. 2012. Likelihood–based inference for power distributions. Test 21(4): 775–789.
R DEVELOPMENT CORE TEAM. 2018. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Vienna, Austria. ISBN 3-900051-07-0. URL http://www.R-project.org.
SANTOS N, LÓPEZ RG, ISRAELIAN G, MAYOR M, REBOLO R, GARCÍA-GIL A, TAORO MP DE & RANDICH S. 2002. Beryllium Abundances in Stars Hosting Giant Planets. Astron Astrophys 386: 1028–1038.
SCHWARZ G. 1978. Estimating the dimension of a model. Ann Stat 6(2): 461–464.
SUGIURA N. 1978. Further Analysis of the Data by Akaike’s Information Criterion and the Finite Corrections. Commun Stat Theory Methods 7(1): 13–26. doi:10.1080/03610927808827599.
TOBIN J. 1958. Estimation of relationship for limited dependent variables. Econometrica 26(1): 24–36.
ZHAO J & KIM HM. 2016. Power $t$ distribution. Commun Stat Appl Methods 23(4): 321–334.

APPENDIX A LEMMAS

Lemma 1. Let $Z \sim P T (α, ν)$ , then $E [Z^{k} ∣ Z + c > 0] = m_{k} (c; ν)$ , with

m_{k} (c; ν) = \frac{1}{1 - {F_{T} (- c; ν)}^{α}} \int_{F_{T} (- c; ν)}^{1} {[F_{T}^{- 1} (u; ν)]}^{k} α u^{α - 1} d u .

where $F_{T}^{- 1} (\cdot; ν)$ is the inverse of $F_{T} (\cdot; ν)$ .

Lemma 2. Let $Z \sim P T (α, ν)$ , and define $r (Z; ν) = f_{T} (Z; ν) / F_{T} (Z; ν)$ . Then

(i) $E \{ {[r (Z; ν)]}^{k} {(1 + \frac{Z^{2}}{ν})}^{- m / 2} Z^{n} ∣ Z + c > 0 \} = \frac{{[f_{T} (0; ν)]}^{k}}{1 - {F_{T} (- c; ν)}^{α}} a_{k m n} (- c; ν)$ , where

a_{k m n} (- c; ν) = \int_{F_{T} (- c; ν)}^{1} {[F_{T}^{- 1} (u; ν)]}^{n} {\left\{ 1 + \frac{{[F_{T}^{- 1} (u; ν)]}^{2}}{ν} \right\}}^{- \frac{k (ν + 1) + m}{2}} α u^{α - k - 1} d u

(ii) $E \{ {[r (Z; ν)]}^{k} \log (1 + \frac{Z^{2}}{ν}) Z^{n} ∣ Z + c > 0 \} = \frac{{[f_{T} (0; ν)]}^{k}}{1 - {F_{T} (- c; ν)}^{α}} a_{k n} (- c; ν)$ , where

a_{k n} (- c; ν) = \int_{F_{T} (- c; ν)}^{1} \log \left\{ 1 + \frac{{[F_{T}^{- 1} (u; ν)]}^{2}}{ν} \right\} {[F_{T}^{- 1} (u; ν)]}^{n} α u^{α - k - 1} d u

(iii) $E \{ {[r (Z; ν)]}^{k} {[b_{01} (Z; ν)]}^{m} Z^{n} ∣ Z + c > 0 \} = \frac{{[f_{T} (0; ν)]}^{k}}{1 - {F_{T} (- c; ν)}^{α}} b_{k m n} (- c; ν)$ , where

b_{k m n} (- c; ν) = \int_{F_{T} (- c; ν)}^{1} {[b_{01} (F_{T}^{- 1} (u; ν))]}^{m} {[F_{T}^{- 1} (u; ν)]}^{n} α u^{α - k - 1} d u

with $b_{01} (c; ν) = \int_{- \infty}^{c} \log (1 + \frac{s^{2}}{ν}) \frac{f_{T} (s; ν)}{F_{T} (c; ν)} d s$ .

The proofs of Lemmas 1 and 2 are straightforward and follow directly from the definition of expected value.
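As an illustration, Lemma 1 can be sketched in one step, assuming that the PT density is $f_{P T} (z; α, ν) = α f_{T} (z; ν) {F_{T} (z; ν)}^{α - 1}$ , so that $P (Z + c > 0) = 1 - {F_{T} (- c; ν)}^{α}$ , and using the substitution $u = F_{T} (z; ν)$ , $d u = f_{T} (z; ν) d z$ :

\begin{aligned} E [Z^{k} ∣ Z + c > 0] & = \frac{1}{1 - {F_{T} (- c; ν)}^{α}} \int_{- c}^{\infty} z^{k} α f_{T} (z; ν) {F_{T} (z; ν)}^{α - 1} d z \\ & = \frac{1}{1 - {F_{T} (- c; ν)}^{α}} \int_{F_{T} (- c; ν)}^{1} {[F_{T}^{- 1} (u; ν)]}^{k} α u^{α - 1} d u = m_{k} (c; ν) \end{aligned}

The identities in Lemma 2 follow from the same substitution once ${[r (z; ν)]}^{k}$ is written as ${[f_{T} (z; ν)]}^{k} {[F_{T} (z; ν)]}^{- k}$ with $f_{T} (z; ν) = f_{T} (0; ν) {(1 + z^{2} / ν)}^{- (ν + 1) / 2}$ .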

APPENDIX B INFORMATION MATRICES FOR PTCR MODEL

Observed Information Matrix

The elements of the observed information matrix $𝐉 (𝛗)$ for the PTCR model are given by

\begin{aligned}
j_{γ γ^{⊤}} = {} & α \sum_{i = 1}^{n} (1 - d_{i}) \Big\{ r (- c_{i}; ν) - (\frac{ν + 1}{ν}) {(1 + \frac{c_{i}^{2}}{ν})}^{- 1} c_{i} \Big\} r (- c_{i}; ν) x_{i} x_{i}^{⊤} \\
& + (\frac{ν + 1}{ν}) \sum_{i = 1}^{n} d_{i} \Big\{ {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} - \frac{2}{ν} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} z_{i}^{2} \Big\} x_{i} x_{i}^{⊤} \\
& + (α - 1) \sum_{i = 1}^{n} d_{i} \Big\{ r (z_{i}; ν) + (\frac{ν + 1}{ν}) {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} z_{i} \Big\} r (z_{i}; ν) x_{i} x_{i}^{⊤} \\
j_{γ σ} = {} & - (\frac{ν + 1}{ν}) \sum_{i = 1}^{n} d_{i} \Big\{ {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} - \frac{2}{ν} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} z_{i}^{2} \Big\} y_{i} x_{i} \\
& - (α - 1) \sum_{i = 1}^{n} d_{i} \Big\{ r (z_{i}; ν) + (\frac{ν + 1}{ν}) {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} z_{i} \Big\} r (z_{i}; ν) y_{i} x_{i} \\
j_{γ ν} = {} & \frac{α}{2} \sum_{i = 1}^{n} (1 - d_{i}) \Big\{ (ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2})) (1 + R (c_{i}; ν)) \\
& - \frac{1}{ν} (1 + c_{i} r (c_{i}; ν) R (c_{i}; ν)) - R (c_{i}; ν) b_{01} (c_{i}; ν) - \log (1 + \frac{c_{i}^{2}}{ν}) \\
& + (\frac{ν + 1}{ν}) {(1 + \frac{c_{i}^{2}}{ν})}^{- 1} \frac{c_{i}^{2}}{ν} \Big\} r (- c_{i}; ν) x_{i} \\
& + \frac{1}{ν^{2}} \sum_{i = 1}^{n} d_{i} \Big\{ {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} z_{i} - (\frac{ν + 1}{ν}) {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} z_{i}^{3} \Big\} x_{i} \\
& + \frac{α - 1}{2} \sum_{i = 1}^{n} d_{i} \Big\{ (\frac{ν + 1}{ν}) {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} \frac{z_{i}^{2}}{ν} + b_{01} (z_{i}; ν) \\
& - \log (1 + \frac{z_{i}^{2}}{ν}) - \frac{1}{ν} (1 - z_{i} r (z_{i}; ν)) \Big\} r (z_{i}; ν) x_{i}
\end{aligned}

\begin{aligned}
j_{γ α} = {} & \sum_{i = 1}^{n} (1 - d_{i}) r (- c_{i}; ν) x_{i} + \sum_{i = 1}^{n} d_{i} r (z_{i}; ν) x_{i} \\
j_{σ σ} = {} & \sum_{i = 1}^{n} d_{i} \Big\{ \frac{1}{σ^{2}} + (\frac{ν + 1}{ν}) [{(1 + \frac{z_{i}^{2}}{ν})}^{- 1} - \frac{2}{ν} {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} z_{i}^{2}] y_{i}^{2} \Big\} \\
& + (α - 1) \sum_{i = 1}^{n} d_{i} \Big\{ (\frac{ν + 1}{ν}) {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} z_{i} + r (z_{i}; ν) \Big\} r (z_{i}; ν) y_{i}^{2} \\
j_{σ ν} = {} & - \frac{1}{ν^{2}} \sum_{i = 1}^{n} d_{i} \Big\{ {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} z_{i} - (\frac{ν + 1}{ν}) {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} z_{i}^{3} \Big\} y_{i} \\
& - \frac{α - 1}{2} \sum_{i = 1}^{n} d_{i} \Big\{ (\frac{ν + 1}{ν}) {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} \frac{z_{i}^{2}}{ν} + b_{01} (z_{i}; ν) \\
& - \log (1 + \frac{z_{i}^{2}}{ν}) - \frac{1}{ν} (1 - z_{i} r (z_{i}; ν)) \Big\} r (z_{i}; ν) y_{i} \\
j_{σ α} = {} & - \sum_{i = 1}^{n} d_{i} r (z_{i}; ν) y_{i} \\
j_{ν α} = {} & \frac{1}{2} \sum_{i = 1}^{n} (1 - d_{i}) \Big\{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - b_{01} (c_{i}; ν) - \frac{c_{i}}{ν} r (c_{i}; ν) \Big\} R (c_{i}; ν) \\
& - \frac{1}{2} \sum_{i = 1}^{n} d_{i} \Big\{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - b_{01} (z_{i}; ν) - \frac{z_{i}}{ν} r (z_{i}; ν) \Big\} \\
j_{α α} = {} & \sum_{i = 1}^{n} \frac{d_{i}}{α^{2}}
\end{aligned}

\begin{aligned}
j_{ν ν} = {} & \frac{α}{4} \sum_{i = 1}^{n} (1 - d_{i}) {\Big\{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - b_{01} (c_{i}; ν) - \frac{c_{i}}{ν} r (c_{i}; ν) \Big\}}^{2} R (c_{i}; ν) (1 + R (c_{i}; ν)) \\
& - \frac{α}{4} \sum_{i = 1}^{n} (1 - d_{i}) \Big\{ ψ_{1} (\frac{ν}{2}) - ψ_{1} (\frac{ν + 1}{2}) - \frac{2}{ν (ν + 1)} + b_{01}^{2} (c_{i}; ν) \\
& - \frac{1}{ν} b_{01} (c_{i}; ν) + \frac{1}{ν} (2 b_{01} (c_{i}; ν) - \frac{ν + 3}{ν (ν + 1)}) c_{i} r (c_{i}; ν) \\
& + \frac{1}{ν + 2} \frac{F_{T} (c_{2 i}; ν + 2)}{F_{T} (c_{i}; ν)} b_{21} (c_{2 i}; ν + 2) - b_{02} (c_{i}; ν) \\
& + \frac{1}{ν} [(\frac{ν + 1}{ν}) {(1 + \frac{c_{i}^{2}}{ν})}^{- 1} \frac{c_{i}^{2}}{ν} - \log (1 + \frac{c_{i}^{2}}{ν}) + \frac{c_{i}}{ν} r (c_{i}; ν)] c_{i} r (c_{i}; ν) \Big\} R (c_{i}; ν) \\
& - \frac{1}{4} \sum_{i = 1}^{n} d_{i} \Big\{ ψ_{1} (\frac{ν + 1}{2}) - ψ_{1} (\frac{ν}{2}) + \frac{2}{ν^{2}} - \frac{4}{ν^{3}} {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} z_{i}^{2} \\
& + \frac{2}{ν^{3}} (\frac{ν + 1}{ν}) {(1 + \frac{z_{i}^{2}}{ν})}^{- 2} z_{i}^{4} \Big\} \\
& + \frac{α - 1}{4} \sum_{i = 1}^{n} d_{i} \Big\{ ψ_{1} (\frac{ν}{2}) - ψ_{1} (\frac{ν + 1}{2}) - \frac{2}{ν (ν + 1)} + b_{01}^{2} (z_{i}; ν) \\
& - \frac{1}{ν} b_{01} (z_{i}; ν) + \frac{1}{ν} (2 b_{01} (z_{i}; ν) - \frac{ν + 3}{ν (ν + 1)}) z_{i} r (z_{i}; ν) \\
& + \frac{1}{ν + 2} \frac{F_{T} (z_{2 i}; ν + 2)}{F_{T} (z_{i}; ν)} b_{21} (z_{2 i}; ν + 2) - b_{02} (z_{i}; ν) \\
& + \frac{1}{ν} [(\frac{ν + 1}{ν}) {(1 + \frac{z_{i}^{2}}{ν})}^{- 1} \frac{z_{i}^{2}}{ν} - \log (1 + \frac{z_{i}^{2}}{ν}) + \frac{z_{i}}{ν} r (z_{i}; ν)] z_{i} r (z_{i}; ν) \Big\}
\end{aligned}

Expected Information Matrix

The expected information matrix $I_{φ}$ is obtained by taking $I_{φ} = E [J_{φ}]$ , which involves the calculation of truncated moments that have no closed form and must be obtained numerically. The elements $I_{𝛗_{i} 𝛗_{j}}$ of the expected information matrix are given by

\begin{aligned}
I_{γ γ} = {} & \sum_{i = 1}^{n} \Big\{ r (- c_{i}; ν) - (\frac{ν + 1}{ν}) {(1 + \frac{c_{i}^{2}}{ν})}^{- 1} c_{i} \Big\} f_{P T} (- c_{i}; α, ν) x_{i} x_{i}^{⊤} \\
& + (\frac{ν + 1}{ν}) \sum_{i = 1}^{n} \Big\{ a_{020} (- c_{i}; ν) - \frac{2}{ν} a_{042} (- c_{i}; ν) \Big\} x_{i} x_{i}^{⊤} \\
& + (α - 1) f_{T} (0; ν) \sum_{i = 1}^{n} \Big\{ (\frac{ν + 1}{ν}) a_{121} (- c_{i}; ν) + f_{T} (0; ν) a_{200} (- c_{i}; ν) \Big\} x_{i} x_{i}^{⊤} \\
I_{γ σ} = {} & - \frac{1}{σ} (\frac{ν + 1}{ν}) \sum_{i = 1}^{n} \{ a_{021} (- c_{i}; ν) + c_{i} a_{020} (- c_{i}; ν) \} x_{i} \\
& + \frac{2}{σ} (\frac{ν + 1}{ν^{2}}) \sum_{i = 1}^{n} \{ a_{043} (- c_{i}; ν) + c_{i} a_{042} (- c_{i}; ν) \} x_{i} \\
& - (f_{T} (0; ν))^{2} (\frac{α - 1}{σ}) \sum_{i = 1}^{n} \{ a_{201} (- c_{i}; ν) + c_{i} a_{200} (- c_{i}; ν) \} x_{i} \\
& - f_{T} (0; ν) (\frac{α - 1}{σ}) (\frac{ν + 1}{ν}) \sum_{i = 1}^{n} \{ a_{122} (- c_{i}; ν) + c_{i} a_{121} (- c_{i}; ν) \} x_{i} \\
I_{γ ν} = {} & \frac{α}{2} \sum_{i = 1}^{n} \Big\{ (ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2})) (1 + R (c_{i}; ν)) \\
& - \frac{1}{ν} (1 + c_{i} r (c_{i}; ν) R (c_{i}; ν)) - R (c_{i}; ν) b_{01} (c_{i}; ν) - \log (1 + \frac{c_{i}^{2}}{ν}) \\
& + (\frac{ν + 1}{ν}) {(1 + \frac{c_{i}^{2}}{ν})}^{- 1} \frac{c_{i}^{2}}{ν} \Big\} f_{P T} (- c_{i}; α, ν) x_{i} \\
& + \frac{1}{ν^{2}} \sum_{i = 1}^{n} \Big\{ a_{021} (- c_{i}; ν) - (\frac{ν + 1}{ν}) a_{043} (- c_{i}; ν) \Big\} x_{i} \\
& + f_{T} (0; ν) (\frac{α - 1}{2}) \sum_{i = 1}^{n} \Big\{ (\frac{ν + 1}{ν}) a_{122} (- c_{i}; ν) - a_{10} (- c_{i}; ν) \\
& + b_{110} (- c_{i}; ν) - \frac{1}{ν} (a_{100} (- c_{i}; ν) - f_{T} (0; ν) a_{201} (- c_{i}; ν)) \Big\} x_{i}
\end{aligned}

\begin{aligned} I_{γ α} = {} & \frac{1}{α} \sum_{i = 1}^{n} f_{P T} (- c_{i}; α, ν) x_{i} + f_{T} (0; ν) \sum_{i = 1}^{n} a_{100} (- c_{i}; ν) x_{i} \\
I_{σ σ} = {} & \frac{1}{σ^{2}} \sum_{i = 1}^{n} \Big\{ 1 + (\frac{ν + 1}{ν}) [a_{022} (- c_{i}; ν) + 2 c_{i} a_{021} (- c_{i}; ν) \\
& \quad + c_{i}^{2} a_{020} (- c_{i}; ν) - \frac{2}{ν} (a_{044} (- c_{i}; ν) + 2 c_{i} a_{043} (- c_{i}; ν) \\
& \quad + c_{i}^{2} a_{042} (- c_{i}; ν))] \Big\} \\
& + f_{T} (0; ν) (\frac{α - 1}{σ^{2}}) \sum_{i = 1}^{n} \Big\{ (\frac{ν + 1}{ν}) [a_{123} (- c_{i}; ν) + 2 c_{i} a_{122} (- c_{i}; ν) \\
& \quad + c_{i}^{2} a_{121} (- c_{i}; ν)] + f_{T} (0; ν) [a_{202} (- c_{i}; ν) + 2 c_{i} a_{201} (- c_{i}; ν) \\
& \quad + c_{i}^{2} a_{200} (- c_{i}; ν)] \Big\} \\
I_{σ α} = {} & - \frac{f_{T} (0; ν)}{σ} \sum_{i = 1}^{n} \Big\{ a_{101} (- c_{i}; ν) + c_{i} a_{100} (- c_{i}; ν) \Big\} \\
I_{α α} = {} & \frac{1}{α^{2}} \sum_{i = 1}^{n} (1 - {F_{T} (- c_{i}; ν)}^{α}) \\
I_{σ ν} = {} & - \frac{1}{σ ν^{2}} \sum_{i = 1}^{n} \Big\{ a_{022} (- c_{i}; ν) + c_{i} a_{021} (- c_{i}; ν) \\
& \quad - (\frac{ν + 1}{ν}) [a_{044} (- c_{i}; ν) + c_{i} a_{043} (- c_{i}; ν)] \Big\} {F_{T} (- c_{i}; ν)}^{α} \\
& - f_{T} (0; ν) (\frac{α - 1}{2 σ}) \sum_{i = 1}^{n} \Big\{ (\frac{ν + 1}{ν^{2}}) [a_{123} (- c_{i}; ν) + c_{i} a_{122} (- c_{i}; ν)] \\
& \quad - a_{11} (- c_{i}; ν) - c_{i} a_{10} (- c_{i}; ν) + b_{111} (- c_{i}; ν) + c_{i} b_{110} (- c_{i}; ν) \\
& \quad - \frac{1}{ν} [a_{101} (- c_{i}; ν) + c_{i} a_{100} (- c_{i}; ν)] \\
& \quad + \frac{1}{ν} [f_{T} (0; ν) a_{202} (- c_{i}; ν) + c_{i} f_{T} (0; ν) a_{201} (- c_{i}; ν)] \Big\} \\
I_{ν α} = {} & \frac{1}{2 α} \sum_{i = 1}^{n} \Big\{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - b_{01} (c_{i}; ν) - \frac{c_{i}}{ν} r (c_{i}; ν) \Big\} \\
& \quad \times {(r (- c_{i}; ν))}^{- 1} f_{P T} (- c_{i}; α, ν) \\
& - \frac{1}{2} \sum_{i = 1}^{n} \Big\{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - \frac{b_{010} (- c_{i}; ν)}{1 - {F_{T} (- c_{i}; ν)}^{α}} \\
& \quad - \frac{f_{T} (0; ν) a_{101} (- c_{i}; ν)}{ν (1 - {F_{T} (- c_{i}; ν)}^{α})} \Big\} (1 - {F_{T} (- c_{i}; ν)}^{α}) \end{aligned}

\begin{aligned} I_{ν ν} = {} & \frac{α}{4} \sum_{i = 1}^{n} \Big\{ ψ (\frac{ν + 1}{2}) - ψ (\frac{ν}{2}) - b_{01} (c_{i}; ν) - \frac{c_{i}}{ν} r (c_{i}; ν) \Big\}^{2} \\
& \quad \times R (c_{i}; ν) (1 + R (c_{i}; ν)) {F_{T} (- c_{i}; ν)}^{α} \\
& - \frac{α}{4} \sum_{i = 1}^{n} \Big\{ ψ_{1} (\frac{ν}{2}) - ψ_{1} (\frac{ν + 1}{2}) - \frac{2}{ν (ν + 1)} + b_{01}^{2} (c_{i}; ν) \\
& \quad - \frac{1}{ν} b_{01} (c_{i}; ν) + \frac{1}{ν} (2 b_{01} (c_{i}; ν) - \frac{ν + 3}{ν (ν + 1)}) c_{i} r (c_{i}; ν) \\
& \quad - b_{02} (c_{i}; ν) + \frac{F_{T} (c_{2 i}; ν + 2) b_{21} (c_{2 i}; ν + 2)}{(ν + 2) F_{T} (c_{i}; ν)} \\
& \quad + \frac{1}{ν} [(\frac{ν + 1}{ν}) {(1 + \frac{c_{i}^{2}}{ν})}^{- 1} \frac{c_{i}^{2}}{ν} \\
& \quad - \log (1 + \frac{c_{i}^{2}}{ν}) + \frac{c_{i}}{ν} r (c_{i}; ν)] c_{i} r (c_{i}; ν) \Big\} R (c_{i}; ν) {F_{T} (- c_{i}; ν)}^{α} \\
& - \frac{1}{4} \sum_{i = 1}^{n} \Big\{ ψ_{1} (\frac{ν + 1}{2}) - ψ_{1} (\frac{ν}{2}) + \frac{2}{ν^{2}} - \frac{4 a_{022} (- c_{i}; ν)}{ν^{3} (1 - {F_{T} (- c_{i}; ν)}^{α})} \\
& \quad + \frac{2}{ν^{3}} (\frac{ν + 1}{ν}) \frac{a_{044} (- c_{i}; ν)}{(1 - {F_{T} (- c_{i}; ν)}^{α})} \Big\} (1 - {F_{T} (- c_{i}; ν)}^{α}) \\
& + \frac{α - 1}{4} \sum_{i = 1}^{n} \Big\{ ψ_{1} (\frac{ν}{2}) - ψ_{1} (\frac{ν + 1}{2}) - \frac{2}{ν (ν + 1)} \\
& \quad + \frac{b_{020} (- c_{i}; ν)}{1 - {F_{T} (- c_{i}; ν)}^{α}} - \frac{b_{010} (- c_{i}; ν)}{ν (1 - {F_{T} (- c_{i}; ν)}^{α})} \\
& \quad + \frac{1}{ν} [\frac{2 f_{T} (0; ν) b_{111} (- c_{i}; ν)}{1 - {F_{T} (- c_{i}; ν)}^{α}} - \frac{ν + 3}{ν (ν + 1)} \frac{f_{T} (0; ν) a_{101} (- c_{i}; ν)}{(1 - {F_{T} (- c_{i}; ν)}^{α})}] \\
& \quad - E [b_{02} (z_{i}; ν)] + (\frac{1}{ν + 2}) E [\frac{F_{T} (z_{2 i}; ν + 2)}{F_{T} (z_{i}; ν)} b_{21} (z_{2 i}; ν + 2)] \\
& \quad + \frac{f_{T} (0; ν)}{ν (1 - {F_{T} (- c_{i}; ν)}^{α})} [(\frac{ν + 1}{ν^{2}}) a_{123} (- c_{i}; ν) - a_{11} (- c_{i}; ν) \\
& \quad + \frac{1}{ν} f_{T} (0; ν) a_{202} (- c_{i}; ν)] \Big\} (1 - {F_{T} (- c_{i}; ν)}^{α}) \end{aligned}

where $b_{01}$, $b_{02}$ and $b_{21}$ are given by (8) and must be obtained numerically. The components $a_{k m n}$, $a_{k n}$ and $b_{k m n}$ are given in Lemma 2.
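
As an illustration of the numerical work these expressions require, the simplest element, $I_{α α} = α^{-2} \sum_{i} (1 - {F_{T} (- c_{i}; ν)}^{α})$, needs only the Student-$t$ distribution function. The sketch below is written in Python for illustration and is not the authors' R implementation. It assumes the standardized censoring points $c_{i}$ are supplied directly, and the function names `t_pdf`, `t_cdf` and `info_alpha_alpha` are hypothetical. The CDF is approximated with composite Simpson quadrature so that only the standard library is needed.

```python
import math


def t_pdf(x, nu):
    """Student-t density with nu degrees of freedom."""
    log_c = (math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)
             - 0.5 * math.log(nu * math.pi))
    return math.exp(log_c - (nu + 1) / 2 * math.log1p(x * x / nu))


def t_cdf(x, nu, n=2000):
    """Student-t CDF: Simpson's rule on [0, |x|] plus symmetry (n must be even)."""
    if x == 0:
        return 0.5
    a = abs(x)
    h = a / n
    s = t_pdf(0.0, nu) + t_pdf(a, nu)
    for k in range(1, n):
        s += (4 if k % 2 else 2) * t_pdf(k * h, nu)
    half = s * h / 3
    return 0.5 + half if x > 0 else 0.5 - half


def info_alpha_alpha(c, alpha, nu):
    """I_alpha_alpha = alpha^-2 * sum_i (1 - F_T(-c_i; nu)^alpha).

    `c` is assumed to hold the standardized censoring points c_i.
    """
    return sum(1 - t_cdf(-ci, nu) ** alpha for ci in c) / alpha ** 2


if __name__ == "__main__":
    # Made-up inputs, purely to show the call.
    print(info_alpha_alpha([0.5, -0.2, 1.1], alpha=1.5, nu=4.0))
```

The remaining elements involve the truncated moments $a_{k m n}$, $a_{k n}$ and $b_{k m n}$, which could be approximated by the same kind of one-dimensional quadrature once their integrands from Lemma 2 are coded.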

APPENDIX C R CODE TO FIT THE PTCR MODEL

Publication Dates

Publication in this collection
11 Oct 2021
Date of issue
2021

History

Received
11 Aug 2019
Accepted
22 Oct 2019

This is an open-access article distributed under the terms of the Creative Commons Attribution License

[1] AKAIKE H. 1974. A new look at statistical model identification. IEEE Trans Automat Contr AU-19(4): 716–722.

[2] ARELLANO-VALLE RB, CASTRO LM, GONZÁLEZ-FARÍAS G & MUÑOZ-GAJARDO KA. 2012. Student-$t$ Censored Regression Model: Properties and Inference. Stat Methods Appl 21(4): 453–473. doi:10.1007/s10260-012-0199-y.

[3] AZZALINI A & CAPITANIO A. 2003. Distributions generated by perturbation of symmetry with emphasis on a multivariate skew $t$-distribution. J R Stat Soc Series B Stat Methodol 65(2): 367–389.

[4] BARROS M, GALEA M, GONZÁLEZ M & LEIVA V. 2010. Influence Diagnostics in the Tobit Censored Response Model. Stat Methods Appl 19(3): 379–397. doi:10.1007/s10260-010-0135-y.

[5] FAIR RC. 1978. A theory of extramarital affairs. J Polit Econ 86(1): 45–61.

[6] FEIGELSON ED. 2014. astrodatR: Astronomical Data. R package v. 0.1. URL https://cran.r-project.org/web/packages/astrodatR/

[7] GARAY AM, LACHOS VH, BOLFARINE H & CABRAL CRB. 2017. Linear Censored Regression Models with Scale Mixtures of Normal Distributions. Stat Pap 58(1): 247–278. doi:10.1007/s00362-015-0696-9.

[8] GUPTA RD & GUPTA RC. 2008. Analyzing skewed data by power–normal model. Test 17: 197–210.

[9] HOSSEINZADEH A, MALEKI M, KHODADADI Z & CONTRERAS-REYES JE. 2019. The Skew-Reflected-Gompertz distribution for analyzing the symmetric and asymmetric data. J Comput Appl Math 349: 132–141.

[10] MALEKI M & NEMATOLLAHI AR. 2017. Autoregressive Models with Mixture of Scale Mixtures of Gaussian innovations. Iran J Sci Technol Trans A Sci 41(4): 1099–1107.

[11] MARTÍNEZ-FLÓREZ G, BOLFARINE H & GÓMEZ HW. 2013. The alpha–power tobit model. Commun Stat Theory Methods 42(4): 633–643.

[12] MATTOS T, GARAY AM & LACHOS VH. 2018. Likelihood-based inference for censored linear regression models with scale mixtures of skew-normal distributions. J Appl Stat 45(11): 2039–2066.

[13] MOULTON LH & HALSEY NA. 1995. A Mixture Model With Detection Limits for Regression Analyses of Antibody Response to Vaccine. Biometrics 51: 1570–1578.

[14] MROZ TA. 1987. The Sensitivity of an Empirical Model of Married Women’s Hours of Work to Economic and Statistical Assumptions. Econometrica 55(4): 765–799. doi:10.2307/1911029.

[15] OLSEN RJ. 1978. Note on the Uniqueness of the Maximum Likelihood Estimator for the Tobit Model. Econometrica 46(5): 1211–1215.

[16] ORTEGA EM, BOLFARINE H & PAULA GA. 2003. Influence Diagnostics in Generalized Log-Gamma Regression Models. Comput Stat Data Anal 42: 165–186.

[17] PEWSEY A, GÓMEZ HW & BOLFARINE H. 2012. Likelihood–based inference for power distributions. Test 21(4): 775–789.

[18] R DEVELOPMENT CORE TEAM. 2018. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Vienna, Austria. ISBN 3-900051-07-0. URL http://www.R-project.org

[19] SANTOS N, LÓPEZ RG, ISRAELIAN G, MAYOR M, REBOLO R, GARCÍA-GIL A, TAORO MP DE & RANDICH S. 2002. Beryllium Abundances in Stars Hosting Giant Planets. Astron Astrophys 386: 1028–1038.

[20] SCHWARZ G. 1978. Estimating the dimension of a model. Ann Stat 6(2): 461–464.

[21] SUGIURA N. 1978. Further Analysis of the Data by Akaike’s Information Criterion and the Finite Corrections. Commun Stat Theory Methods 7(1): 13–26. doi:10.1080/03610927808827599.

[22] TOBIN J. 1958. Estimation of relationship for limited dependent variables. Econometrica 26(1): 24–36.

[23] ZHAO J & KIM HM. 2016. Power $t$ distribution. Commun Stat Appl Methods 23(4): 321–334.

$ν$	$min$	$max$
4	-9.3423	5.5913
5	-7.5147	3.5101
6	-7.3825	2.7668
7	-4.6073	2.3721
8	-4.1468	2.1228
9	-3.8410	1.9490
10	-3.4826	1.8202
12	-2.2654	1.6409
15	-2.0131	1.4752
20	-1.8802	1.3208
25	-1.8245	1.2329
30	-1.6372	1.1760
50	-1.1158	1.0664
100	-1.0401	0.9864
500	-0.9390	0.9264

Cens.	$R C (\cdot)$	$δ$
Level		1	2	3	4	5	6	7	8	9	10
	$R C (β_{1})$	0.00194	0.02610	0.01427	0.05336	0.08081	0.09474	0.12469	0.15654	0.15260	0.15643
10%	$R C (β_{2})$	0.00010	0.00004	0.00009	0.00022	0.00024	0.00032	0.00045	0.00060	0.00064	0.00080
	$R C (η)$	0.00046	0.01137	0.00780	0.02271	0.03341	0.03780	0.04594	0.05558	0.05101	0.04596
	$R C (β_{1})$	0.01337	0.01963	0.00945	0.06445	0.12935	0.10080	0.09735	0.07929	0.04515	0.03117
20%	$R C (β_{2})$	0.00010	0.00008	0.00030	0.00031	0.00002	0.00018	0.00040	0.00075	0.00124	0.00157
	$R C (η)$	0.00219	0.00020	0.00160	0.00849	0.00984	0.00615	0.00417	0.00024	0.00910	0.01333
	$R C (β_{1})$	0.02466	0.00670	0.10623	0.06087	0.09179	0.15043	0.05653	0.19465	0.03994	0.16353
30%	$R C (β_{2})$	0.00062	0.00190	0.00016	0.00109	0.00006	0.00066	0.00019	0.00015	0.00012	0.00123
	$R C (η)$	0.00176	0.00860	0.01234	0.00391	0.00881	0.01215	0.00324	0.01675	0.00467	0.01425

		$α = 0.40$								$α = 2.5$
		$p = 0 %$		$p = 10 %$		$p = 25 %$		$p = 45 %$		$p = 0 %$		$p = 10 %$		$p = 25 %$		$p = 45 %$
$n$	$\hat{𝛉}$	RB	MSE	RB	MSE	RB	MSE	RB	MSE	RB	MSE	RB	MSE	RB	MSE	RB	MSE
150	${\hat{β}}_{1}$	0.010	0.982	0.019	1.127	0.022	0.746	0.050	0.657	0.092	0.700	0.132	0.813	0.138	0.913	0.155	1.200
	${\hat{β}}_{2}$	0.002	0.083	0.002	0.070	0.002	0.047	0.001	0.041	0.002	0.014	0.002	0.026	0.003	0.034	0.005	0.055
	$\hat{η}$	0.030	0.278	0.035	0.260	0.028	0.225	0.033	0.215	0.022	0.207	0.018	0.231	0.015	0.248	0.004	0.275
	$\hat{α}$	0.039	0.119	0.040	0.112	0.041	0.102	0.044	0.098	0.309	1.778	0.301	1.878	0.297	1.913	0.315	2.020
	$\hat{ν}$	0.309	2.673	0.296	2.428	0.255	2.125	0.223	1.980	0.188	1.735	0.196	1.840	0.211	2.057	0.216	2.228
300	${\hat{β}}_{1}$	0.010	0.839	0.025	0.787	0.014	0.504	0.025	0.441	0.061	0.552	0.086	0.601	0.094	0.689	0.122	0.922
	${\hat{β}}_{2}$	0.001	0.062	0.001	0.051	0.001	0.033	0.001	0.029	0.001	0.011	0.001	0.018	0.001	0.024	0.002	0.038
	$\hat{η}$	0.020	0.203	0.020	0.184	0.014	0.155	0.016	0.147	0.018	0.150	0.018	0.168	0.016	0.178	0.016	0.205
	$\hat{α}$	0.035	0.091	0.031	0.080	0.026	0.072	0.026	0.068	0.195	1.223	0.184	1.343	0.206	1.472	0.262	1.715
	$\hat{ν}$	0.186	1.739	0.160	1.610	0.123	1.263	0.102	1.127	0.093	0.973	0.107	1.025	0.109	1.026	0.128	1.175
750	${\hat{β}}_{1}$	0.002	0.588	0.003	0.481	0.006	0.303	0.009	0.263	0.024	0.219	0.037	0.346	0.044	0.415	0.064	0.574
	${\hat{β}}_{2}$	0.001	0.046	0.001	0.031	0.001	0.021	0.001	0.018	0.001	0.009	0.001	0.012	0.001	0.014	0.001	0.024
	$\hat{η}$	0.008	0.136	0.007	0.111	0.005	0.097	0.006	0.092	0.005	0.077	0.006	0.099	0.007	0.108	0.008	0.128
	$\hat{α}$	0.009	0.042	0.009	0.045	0.008	0.042	0.008	0.040	0.084	0.599	0.070	0.640	0.089	0.781	0.128	1.024
	$\hat{ν}$	0.052	0.715	0.041	0.602	0.034	0.534	0.029	0.502	0.020	0.512	0.035	0.498	0.040	0.531	0.048	0.603
1000	${\hat{β}}_{1}$	0.001	0.417	0.001	0.394	0.004	0.262	0.008	0.227	0.010	0.167	0.026	0.289	0.031	0.347	0.044	0.474
	${\hat{β}}_{2}$	0.001	0.031	0.001	0.025	0.001	0.017	0.001	0.015	0.001	0.009	0.001	0.010	0.001	0.012	0.001	0.020
	$\hat{η}$	0.001	0.091	0.002	0.095	0.003	0.082	0.004	0.078	0.003	0.069	0.005	0.083	0.005	0.093	0.006	0.107
	$\hat{α}$	0.008	0.035	0.001	0.040	0.006	0.036	0.006	0.035	0.068	0.489	0.051	0.519	0.063	0.609	0.089	0.778
	$\hat{ν}$	0.035	0.605	0.036	0.513	0.027	0.444	0.004	0.036	0.013	0.293	0.028	0.401	0.031	0.435	0.038	0.487

	$n$	Mean	SE	Skewness	Kurtosis
Complete Data	753	2.3746	3.2418	2.7778	15.7967
Non Censored Data	428	4.1777	3.3103	3.0801	16.8212

	PTCR		TCR		SLCR		NCR
$𝛉$	Estimate	SE	Estimate	SE	Estimate	SE	Estimate	SE
$β_{0}$	-8.8412	0.3785	-1.0474	1.1549	-1.1445	1.4377	-2.7510	1.7334
$β_{1}$	-0.1331	0.0004	-0.1108	0.0204	-0.1084	0.0232	-0.1046	0.0276
$β_{2}$	0.4887	0.0050	0.6475	0.0658	0.6434	0.0635	0.7281	0.0831
$β_{3}$	-3.4010	0.1869	-3.1636	0.3878	-3.0958	0.3785	-3.0264	0.4406
$β_{4}$	-0.3053	0.0149	-0.2964	0.1247	-0.2946	0.1210	-0.2143	0.1527
$η$	4.1538	0.1320	3.2616	0.1731	2.5958	0.3071	4.5760	0.1697
$ν$	6.5899	0.9014	4.2000	0.8068	1.4061	0.4361	–	–
$α$	12.8760	0.9871	–	–	–	–	–	–

Criterion	PTCR	TCR	SLCR	NCR
$ℓ (𝛉)$	-1413.942	-1440.145	-1436.286	-1481.655
AIC	2843.883	2894.291	2886.573	2975.311
AICc	2828.077	2880.441	2872.723	2963.424
BIC	2880.876	2926.659	2918.941	3003.055
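
The AIC and BIC entries in the table above can be checked against the log-likelihood with the standard definitions $AIC = -2ℓ + 2k$ and $BIC = -2ℓ + k \log n$. The sketch below, in Python for illustration, is a check under stated assumptions rather than the authors' code. It assumes $k = 8$ for the PTCR fit (the number of rows in the PTCR estimates table: $β_{0}, …, β_{4}$, $η$, $ν$, $α$) and $n = 753$ (the complete-data sample size in the descriptive table). It covers AIC and BIC only, and the helper names `aic` and `bic` are hypothetical.

```python
import math


def aic(loglik, k):
    """Akaike information criterion: -2*loglik + 2*k."""
    return -2.0 * loglik + 2.0 * k


def bic(loglik, k, n):
    """Schwarz (Bayesian) information criterion: -2*loglik + k*log(n)."""
    return -2.0 * loglik + k * math.log(n)


if __name__ == "__main__":
    # PTCR column: log-likelihood from the table; k and n are assumptions
    # read off the estimates and descriptive tables.
    loglik, k, n = -1413.942, 8, 753
    print("AIC:", round(aic(loglik, k), 2))
    print("BIC:", round(bic(loglik, k, n), 2))
```

Under these assumptions both values agree with the tabulated PTCR entries up to rounding of the reported log-likelihood.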

Criterion	PTCR	TCR	SLCR	NCR
$ℓ (𝛉)$	3.688	-6.635	-25.096	-38.281
AIC	2.624	21.269	58.192	82.562
AICc	-6.408	13.905	50.827	76.937
BIC	13.722	30.148	67.069	89.220

	PTCR		TCR		SLCR		NCR
$𝛉$	Estimate	SE	Estimate	SE	Estimate	SE	Estimate	SE
$β_{0}$	-1.9692	0.0658	-1.8923	0.1482	-0.5441	0.8733	-1.6478	0.5941
$β_{1}$	0.5353	0.0021	0.5091	0.0046	0.2719	0.0262	0.4372	0.0180
$η$	0.0608	0.0002	0.0873	0.0004	0.1987	0.0254	0.3843	0.0014
$ν$	2.0891	0.4530	0.8335	0.0419	1.0567	0.1652	–	–
$α$	0.3265	0.0084	–	–	–	–	–	–