On Posterior Properties of the Two Parameter Gamma Family of Distributions

RAMOS, PEDRO L.; DEY, DIPAK K.; LOUZADA, FRANCISCO; RAMOS, EDUARDO

doi:10.1590/0001-3765202120190826

Abstract

The gamma distribution has been used extensively in many areas of application. In this paper, considering a Bayesian analysis, we provide necessary and sufficient conditions to check whether or not improper priors lead to proper posterior distributions. Further, we discuss sufficient conditions to verify that the obtained posterior moments are finite. An interesting aspect of our findings is that one can check whether the posterior is proper or improper, and whether its posterior moments are finite, by looking directly at the behavior of the proposed improper prior. To illustrate the proposed methodology, these results are applied to different objective priors.

Key words
Gamma distribution; improper prior; objective prior; posterior property

1 - INTRODUCTION

The gamma distribution is one of the best-known distributions used in statistical analysis. It arises naturally in many areas such as environmental analysis, reliability analysis, clinical trials, signal processing and other physical situations. Let $X$ be a non-negative random variable with the gamma distribution given by

$f (x | α, β) = \frac{β^{α}}{Γ (α)} x^{α - 1} e^{- β x},$ (1)

where $α > 0$ and $β > 0$ are unknown shape and scale parameters, respectively, and $Γ (ϕ) = \int_{0}^{\infty} e^{- x} x^{ϕ - 1} d x$ is the gamma function.
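As a quick illustration of density (1), the following Python sketch evaluates it on the log scale using only the standard library. The function names `gamma_logpdf` and `gamma_pdf` are hypothetical and not taken from the paper; note that in (1) the parameter $β$ multiplies $x$ in the exponent, so the code treats it as a rate.

```python
import math


def gamma_logpdf(x, alpha, beta):
    """Log of density (1): beta^alpha / Gamma(alpha) * x^(alpha - 1) * exp(-beta * x).

    The sketch restricts itself to x > 0 so that log(x) is defined.
    """
    if x <= 0 or alpha <= 0 or beta <= 0:
        raise ValueError("x, alpha and beta must all be positive")
    return (alpha * math.log(beta) - math.lgamma(alpha)
            + (alpha - 1.0) * math.log(x) - beta * x)


def gamma_pdf(x, alpha, beta):
    """Density (1), obtained by exponentiating the log-density."""
    return math.exp(gamma_logpdf(x, alpha, beta))
```

Working on the log scale avoids overflow in $Γ (α)$ for large $α$, which matters later when the likelihood is a product of $n$ such terms.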

Commonly used frequentist methods of inference for the gamma distribution are standard in the statistical literature. In the Bayesian approach, where a prior distribution must be assigned, different objective priors for the gamma distribution have been discussed by Miller 1980, Sun & Ye 1996, Berger et al. 2015 and Louzada & Ramos 2018. Although these priors are constructed by formal rules (see Kass & Wasserman 1996, Ramos et al. 2019), they are improper, i.e., they do not correspond to proper probability distributions, and they could lead to improper posteriors, which is undesirable. Northrop & Attalides 2016 argued that “… there is no general theory providing simple conditions under which an improper prior yields a proper posterior for a particular model, so this must be investigated case-by-case”. In this study, under the assumption that the obtained sample is independent and identically distributed (iid), we overcome this problem by providing simple necessary and sufficient conditions to check whether or not these objective priors lead to proper posterior distributions. Even if the posterior distribution is proper, the posterior moments of the parameters can be infinite. We therefore also provide sufficient conditions to verify that the posterior moments are finite. As a result, one can easily check whether the obtained posterior is proper or improper, and whether its posterior moments are finite, by looking directly at the behavior of the improper prior. Our proposed methodology is illustrated on more than ten objective priors, such as independent uniform priors, Jeffreys’ rule (Kass & Wasserman 1996), Jeffreys’ prior (Jeffreys 1946), the maximal data information (MDI) prior (Zellner 1977, 1984), reference priors (Berger et al. 2015) and matching priors (Mukerjee & Dey 1993 and Tibshirani 1989), to list a few. Finally, the effects of these priors on the posterior distribution are compared via numerical simulation.
It is worth mentioning that we only consider improper objective priors; when prior information is available, one may consider the use of an elicited prior (see, for instance, Dey & Moala 2018).

The remainder of this paper is organized as follows. Section 2 presents a theorem that provides necessary and sufficient conditions for the posterior distributions to be proper, and also sufficient conditions to check whether the posterior moments of the parameters are finite. Section 3 applies our main theorem to different objective priors. In Section 4, a simulation study is conducted in order to identify the most efficient estimation procedure. Finally, Section 5 summarizes the study.

2 - PROPER POSTERIOR

Let $X_{1}, \dots, X_{n}$ be an iid sample where $X_{i} \sim$ Gamma $(α, β)$ , and let $𝛉 = (α, β)$ . Then the joint posterior distribution for $𝛉$ is given by the product of the likelihood function and the prior distribution $π (𝛉)$ divided by a normalizing constant $d (𝐱)$ , resulting in

$p (𝛉 | 𝐱) = \frac{π (𝛉)}{d (𝐱)} \frac{β^{n α}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} \exp \left\{- β \sum_{i = 1}^{n} x_{i}\right\},$ (2)

where

$d (𝐱) = \int_{𝒜} \frac{π (𝛉) β^{n α}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} \exp \left\{- β \sum_{i = 1}^{n} x_{i}\right\} d 𝛉$ (3)

and $𝒜 = (0, \infty) \times (0, \infty)$ is the parameter space of $𝛉$ . For any prior distribution of the form $π (𝛉) \propto π_{1} (β) π_{2} (α)$ , our purpose is to find necessary and sufficient conditions for this class of posteriors to be proper, i.e., $d (𝐱) < \infty$ . The following definitions and propositions will be useful to attain this objective. In what follows, we let $\bar{ℝ}$ denote the extended real number line $ℝ \cup \{- \infty, \infty\}$ , and the subscript $*$ in $ℝ$ and $\bar{ℝ}$ will denote the exclusion of $0$ from these sets.

Definition 2.1. Let $g : 𝒰 \to {\bar{ℝ}}_{*}^{+}$ and $h : 𝒰 \to {\bar{ℝ}}_{*}^{+}$ , where $𝒰 \subset ℝ$ . We say that $g (x) \propto h (x)$ if there exists $c_{0} \in ℝ_{*}^{+}$ and $c_{1} \in ℝ_{*}^{+}$ such that $c_{0} h (x) \leq g (x) \leq c_{1} h (x)$ for every $x \in 𝒰$ .

Definition 2.2. Let $a \in \bar{ℝ}$ , $g : 𝒰 \to ℝ^{+}$ and $h : 𝒰 \to ℝ^{+}$ , where $𝒰 \subset ℝ$ . We say that $g (x) \underset{x \to a}{\propto} h (x)$ if

$\underset{x \to a}{\liminf} \frac{g (x)}{h (x)} > 0 \quad \text{and} \quad \underset{x \to a}{\limsup} \frac{g (x)}{h (x)} < \infty .$

The meaning of the relations $g (x) \underset{x \to a^{+}}{\propto} h (x)$ and $g (x) \underset{x \to a^{-}}{\propto} h (x)$ for $a \in ℝ$ is defined analogously.

Note that, from the above definition, if for some $c \in ℝ_{*}^{+}$ we have that ${lim}_{x \to a} \frac{g (x)}{h (x)} = c$ , then it will follow that $g (x) \underset{x \to a}{\propto} h (x)$ . The following proposition is a direct consequence of the above definition.

Proposition 2.3. For $a \in \bar{ℝ}$ and $r \in ℝ$ , let $f_{1} (x) \underset{x \to a}{\propto} f_{2} (x)$ and $g_{1} (x) \underset{x \to a}{\propto} g_{2} (x)$ . Then we have that

$f_{1} (x) g_{1} (x) \underset{x \to a}{\propto} f_{2} (x) g_{2} (x) \quad \text{and} \quad f_{1} {(x)}^{r} \underset{x \to a}{\propto} f_{2} {(x)}^{r} .$

The following proposition gives us a relation between Definition 2.1 and Definition 2.2.

Proposition 2.4. Let $g : (a, b) \to ℝ^{+}$ and $h : (a, b) \to ℝ^{+}$ be continuous functions on $(a, b) \subset ℝ$ , where $a \in \bar{ℝ}$ and $b \in \bar{ℝ}$ . Then $g (x) \propto h (x)$ if and only if $g (x) \underset{x \to a}{\propto} h (x)$ and $g (x) \underset{x \to b}{\propto} h (x)$ .

Proposition 2.5. Let $g : (a, b) \to ℝ^{+}$ and $h : (a, b) \to ℝ^{+}$ be continuous functions in $(a, b) \subset ℝ$ , where $a \in \bar{ℝ}$ and $b \in \bar{ℝ}$ , and let $c \in (a, b)$ . Then, if either $g (x) \underset{x \to a}{\propto} h (x)$ or $g (x) \underset{x \to b}{\propto} h (x)$ , it will follow respectively that

$\int_{a}^{c} g (x) d x \propto \int_{a}^{c} h (x) d x \quad \text{or} \quad \int_{c}^{b} g (x) d x \propto \int_{c}^{b} h (x) d x .$

Theorem 2.6. Let the behavior of $π (β)$ be given by $π (β) \propto β^{c}$ , for some $c \in ℝ$ . Then we have that:

i) If $c < - 1$ , then the posterior distribution (2) is improper.
ii) If $c \geq - 1$ and ${lim}_{α \to 0^{+}} π (α) α^{s} = \infty$ $\forall s \in ℕ$ , then the posterior distribution (2) is improper.
iii) If $c \geq - 1$ and the behavior of $π (α)$ is given by

$π (α) \underset{α \to 0^{+}}{\propto} α^{s_{0}} \quad \text{and} \quad π (α) \underset{α \to \infty}{\propto} α^{s_{\infty}},$

where $s_{0} \in ℝ$ and $s_{\infty} \in ℝ$ , then the posterior distribution (2) is proper if and only if $n > - s_{0}$ in case $c = - 1$ , and is proper if and only if $n > - s_{0} - 1$ in case $c > - 1$ .

Proof. See Appendix A. ◻
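The case analysis of Theorem 2.6 lends itself to a small programmatic check. The Python sketch below is only an illustration and is not part of the original paper; the function name `posterior_is_proper` is hypothetical. It reads the conditions as: improper when $c < - 1$ ; proper if and only if $n > - s_{0}$ when $c = - 1$ ; proper if and only if $n > - s_{0} - 1$ when $c > - 1$ . It assumes that $π (α)$ has power-law behavior at both $0^{+}$ and $\infty$ , so the case of a prior growing faster than any power near zero is not covered.

```python
def posterior_is_proper(c, s0, n):
    """Check properness of the posterior using the power-law cases of Theorem 2.6.

    c  : exponent such that pi(beta) is proportional to beta^c
    s0 : exponent such that pi(alpha) behaves like alpha^s0 as alpha -> 0+
    n  : sample size (positive integer)

    Assumption: pi(alpha) also behaves like a power of alpha as alpha -> infinity.
    """
    if n < 1:
        raise ValueError("n must be a positive integer")
    if c < -1:
        return False           # first case: always improper
    if c == -1:
        return n > -s0         # third case with c = -1
    return n > -s0 - 1         # third case with c > -1
```

For example, the Jeffreys rule prior of Section 3.2 corresponds to `c = -1` and `s0 = -1`, so the check fails for `n = 1` and passes for `n = 2`, in agreement with Theorem 3.2.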

Theorem 2.7. Let $π (α, β) = π (α) π (β)$ , and suppose the behavior of $π (β)$ and $π (α)$ is given by

$π (β) \propto β^{c}, \quad π (α) \underset{α \to 0^{+}}{\propto} α^{s_{0}} \quad \text{and} \quad π (α) \underset{α \to \infty}{\propto} α^{s_{\infty}},$

for $c \in ℝ$ , $s_{0} \in ℝ$ and $s_{\infty} \in ℝ$ . If the posterior relative to $π (α, β)$ is proper, then the posterior means of $α$ and $β$ , as well as all higher-order posterior moments, are finite for this prior.

Proof. Since the posterior is proper, by Theorem 2.6 we have that $c \geq - 1$ , and moreover $n > - s_{0} - 1$ if $c > - 1$ and $n > - s_{0}$ if $c = - 1$ .

Now let $π^{*} (α, β) = α π (α, β)$ . Then $π^{*} (α, β) = π^{*} (α) π (β)$ , where $π^{*} (α) = α π (α)$ , and it follows that

$π (β) \propto β^{c}, \quad π^{*} (α) \underset{α \to 0^{+}}{\propto} α^{s_{0} + 1} \quad \text{and} \quad π^{*} (α) \underset{α \to \infty}{\propto} α^{s_{\infty} + 1} .$

Therefore, since $c \geq - 1$ , and since $n > - s_{0} - 1 > - (s_{0} + 1) - 1$ if $c > - 1$ and $n > - s_{0} > - (s_{0} + 1)$ if $c = - 1$ , it follows from Theorem 2.6 that the posterior

$\begin{matrix} π^{*} (α, β) \frac{β^{n α}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} \exp \left\{- β \sum_{i = 1}^{n} x_{i}\right\} \end{matrix}$

relative to the prior $π^{*} (α, β)$ is proper. Therefore

$\begin{matrix} E [α | 𝐱] = \frac{1}{d (𝐱)} \int_{0}^{\infty} \int_{0}^{\infty} α π (α, β) \frac{β^{n α}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} \exp \left\{- β \sum_{i = 1}^{n} x_{i}\right\} d β d α < \infty . \end{matrix}$

Proceeding analogously it also follows that

$\begin{matrix} E [β | 𝐱] = \frac{1}{d (𝐱)} \int_{0}^{\infty} \int_{0}^{\infty} β π (α, β) \frac{β^{n α}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} \exp \left\{- β \sum_{i = 1}^{n} x_{i}\right\} d β d α < \infty . \end{matrix}$

Therefore we have proved that if a prior $π (α, β)$ satisfying the assumptions of the theorem leads to a proper posterior, then the priors $α π (α, β)$ and $β π (α, β)$ also lead to proper posteriors, and it follows by induction that $α^{r} β^{s} π (α, β)$ also leads to a proper posterior for any $r$ and $s$ in $ℕ$ , which concludes the proof. ◻

Proposition 2.8. Suppose $π_{i} (α, β)$ leads to a proper posterior for $n \in ℕ$ and $i = 1, \dots, m$ , and consider the constants $k_{i} \geq 0$ for $i = 1, \dots, m$ . Then

i) $\sum_{i = 1}^{m} k_{i} π_{i} (α, β)$ leads to a proper posterior;
ii) $\prod_{i = 1}^{m} π_{i} {(α, β)}^{k_{i}}$ leads to a proper posterior if additionally $\sum_{i = 1}^{m} k_{i} = 1$ .

Proof. Item $i)$ is a direct consequence of the linearity of the Lebesgue integral, while item $i i)$ is a direct consequence of Hölder’s inequality. ◻

3 - APPLICATION

In this section, we apply the proposed theorems to different objective priors.

3.1 - Uniform prior

A simple noninformative prior can be obtained considering uniform priors contained in the interval $(0, \infty)$ . This prior usually is not attractive due to its lack of invariance to reparameterisation. The uniform prior is given by $π_{1} (α, β) \propto 1$ . The joint posterior distribution for $α$ and $β$ , produced by the uniform prior, is

$\begin{matrix} π_{1} (α, β | 𝐱) \propto \frac{β^{n α}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} exp {- β \sum_{i = 1}^{n} x_{i}} . \end{matrix}$ (4)

Theorem 3.1. The posterior distribution (4) is proper for any sample size, in which case the posterior moments for $α$ and $β$ are finite.

Proof. Since $π (β) = β^{0}$ and $π (α) = α^{0}$ , it follows that $c = 0$ and $s_{0} = s_{\infty} = 0$ are valid constants for application of Theorem 2.6. Thus, since $c > - 1$ and $n > - s_{0} - 1 = - 1$ for all $n \in ℕ$ , the result follows from Theorem 2.6 and Theorem 2.7. ◻

The marginal posterior distribution for $α$ is

$\begin{matrix} π_{1} (α | 𝐱) & \propto \frac{1}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} \int_{0}^{\infty} β^{n α} exp {- β \sum_{i = 1}^{n} x_{i}} d β \propto \frac{α Γ (n α)}{Γ {(α)}^{n}} {(\frac{\sqrt[n]{\prod_{i = 1}^{n} x_{i}}}{\sum_{i = 1}^{n} x_{i}})}^{n α} . \end{matrix}$

The conditional posterior distribution for $β$ is given by

$π_{1} (β | α, 𝐱) \sim Gamma (n α + 1, \sum_{i = 1}^{n} x_{i}) .$ (5)

3.2 - Jeffreys rule

Jeffreys considered different procedures for constructing objective priors. For $θ \in (0, \infty)$ (see Kass & Wasserman 1996), Jeffreys suggested the prior $π (θ) = θ^{- 1}$ . The main justification for this choice was its invariance under power transformations of the parameters. Since the parameters of the Gamma distribution are contained in the interval $(0, \infty)$ , the prior using the Jeffreys rule (Miller 1980) is

$π_{2} (α, β) \propto \frac{1}{α β} .$ (6)

The joint posterior distribution for $α$ and $β$ produced by the Jeffreys rule prior is given by

$\begin{matrix} π_{2} (α, β | 𝐱) \propto \frac{β^{n α - 1}}{α Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} exp {- β \sum_{i = 1}^{n} x_{i}} . \end{matrix}$ (7)

Theorem 3.2. The posterior density (7) is proper if and only if $n \geq 2$ , in which case the posterior moments for $α$ and $β$ are finite.

Proof. Since $π (β) = β^{- 1}$ and $π (α) = α^{- 1}$ , then $c = - 1$ and $s_{0} = s_{\infty} = - 1$ are valid constants for application of Theorem 2.6. Thus, since $c = - 1$ , and since the inequality $n > - s_{0} = 1$ holds if and only if $n \geq 2$ , the result follows from Theorem 2.6 and Theorem 2.7. ◻

The marginal posterior distribution for $α$ is given by

$\begin{matrix} π_{2} (α | 𝐱) \propto \frac{Γ (n α)}{α Γ {(α)}^{n}} {(\frac{\sqrt[n]{\prod_{i = 1}^{n} x_{i}}}{\sum_{i = 1}^{n} x_{i}})}^{n α} . \end{matrix}$

The conditional posterior distribution for $β$ is

$π_{2} (β | α, 𝐱) \sim Gamma (n α, \sum_{i = 1}^{n} x_{i}) .$ (8)
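The marginal for $α$ and the conditional (8) for $β$ suggest a simple two-step way to draw from the joint posterior (7): draw $α$ from its marginal, then $β$ given $α$ . The Python sketch below is one possible illustration under stated assumptions, not the procedure used in the paper: the marginal of $α$ is approximated on a finite grid truncated at a hypothetical `alpha_max`, and the names `log_marginal_alpha` and `sample_posterior` are invented for this example.

```python
import math
import random


def log_marginal_alpha(alpha, xs):
    """Unnormalized log of pi_2(alpha | x) under the Jeffreys rule prior:
    Gamma(n*alpha) / (alpha * Gamma(alpha)^n) * (G / S)^(n*alpha),
    with G the geometric mean and S the sum of the observations."""
    n = len(xs)
    log_geo_mean = sum(math.log(x) for x in xs) / n
    log_sum = math.log(sum(xs))
    return (math.lgamma(n * alpha) - math.log(alpha) - n * math.lgamma(alpha)
            + n * alpha * (log_geo_mean - log_sum))


def sample_posterior(xs, draws=1000, grid_size=2000, alpha_max=50.0, seed=0):
    """Draw (alpha, beta) pairs: alpha from a grid approximation of its marginal,
    then beta | alpha from the Gamma conditional (8)."""
    n = len(xs)
    if n < 2:
        raise ValueError("the posterior (7) is proper only for n >= 2")
    rng = random.Random(seed)
    step = alpha_max / grid_size
    grid = [(k + 0.5) * step for k in range(grid_size)]   # midpoints avoid alpha = 0
    log_w = [log_marginal_alpha(a, xs) for a in grid]
    top = max(log_w)
    weights = [math.exp(lw - top) for lw in log_w]         # stabilised weights
    alphas = rng.choices(grid, weights=weights, k=draws)
    total = sum(xs)
    # (8) has shape n*alpha and rate sum(x_i); gammavariate expects a scale,
    # so the rate is inverted.
    return [(a, rng.gammavariate(n * a, 1.0 / total)) for a in alphas]
```

The grid truncation is a convenience for the sketch; a Metropolis step or adaptive rejection sampling for $α$ would avoid choosing `alpha_max` by hand.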

3.3 - Jeffreys prior

In a further study, Jeffreys 1946 proposed a general rule to obtain an objective prior. This prior is obtained through the square root of the determinant of the Fisher information matrix $I (α, β)$ and has been widely used due to its invariance property under one-to-one transformations. For the Gamma distribution, the Jeffreys prior (see Miller 1980) is given by

$π_{3} (α, β) \propto \frac{\sqrt{α ψ' (α) - 1}}{β} .$ (9)

The joint posterior distribution for $α$ and $β$ produced by the Jeffreys prior is

$\begin{matrix} π_{3} (α, β | 𝐱) \propto \frac{β^{n α - 1} \sqrt{α ψ' (α) - 1}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} exp {- β \sum_{i = 1}^{n} x_{i}} . \end{matrix}$ (10)

Theorem 3.3. The posterior density (10) is proper for any sample size, in which case the posterior moments for $α$ and $β$ are finite.

Proof. Here, we have $π (β) = β^{- 1}$ . Following Abramowitz & Stegun 1972, we have that ${lim}_{z \to 0^{+}} \frac{ψ' (z)}{z^{- 2}} = 1$ and thus

$\begin{matrix} lim_{α \to 0^{+}} \frac{\sqrt{α ψ' (α) - 1}}{α^{- \frac{1}{2}}} = lim_{α \to 0^{+}} \sqrt{\frac{ψ' (α)}{α^{- 2}} - α} = 1, \end{matrix}$

which implies that

$\begin{matrix} \sqrt{α ψ' (α) - 1} \underset{α \to 0^{+}}{\propto} α^{- \frac{1}{2}} . \end{matrix}$

Moreover, following Abramowitz & Stegun 1972, we also have that

$ψ' (z) = \frac{1}{z} + \frac{1}{2 z^{2}} + O (\frac{1}{z^{3}})$ as $z \to \infty$ , and thus

$\begin{matrix} \frac{α ψ' (α) - 1}{α^{- 1}} = \frac{1}{2} + O (\frac{1}{α}) \Rightarrow \lim_{α \to \infty} \frac{\sqrt{α ψ' (α) - 1}}{α^{- \frac{1}{2}}} = \frac{1}{\sqrt{2}}, \end{matrix}$

which implies that

$\begin{matrix} \sqrt{α ψ' (α) - 1} \underset{α \to \infty}{\propto} α^{- \frac{1}{2}} . \end{matrix}$

Therefore, $c = - 1$ and $s_{0} = s_{\infty} = - \frac{1}{2}$ are valid constants for application of Theorem 2.6, and since $n > - s_{0} = \frac{1}{2}$ for all $n \geq 1$ , the posterior is proper for any sample size and the posterior moments are finite by Theorems 2.6 and 2.7. ◻
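The two limits used in this proof can be checked numerically. The Python sketch below is an illustrative check only; since the standard library has no trigamma function, `trigamma` is a hypothetical helper built from the recurrence $ψ' (x) = ψ' (x + 1) + x^{- 2}$ and the usual asymptotic series, and `jeffreys_ratio` is likewise an invented name.

```python
import math


def trigamma(x):
    """psi'(x) for x > 0: shift x upward with psi'(x) = psi'(x + 1) + 1/x^2,
    then apply the asymptotic series for large arguments."""
    if x <= 0:
        raise ValueError("x must be positive")
    total = 0.0
    while x < 10.0:
        total += 1.0 / (x * x)
        x += 1.0
    inv = 1.0 / x
    inv2 = inv * inv
    # 1/x + 1/(2 x^2) + 1/(6 x^3) - 1/(30 x^5) + 1/(42 x^7)
    total += inv + 0.5 * inv2 + inv * inv2 * (1.0 / 6 - inv2 * (1.0 / 30 - inv2 / 42))
    return total


def jeffreys_ratio(alpha):
    """sqrt(alpha * psi'(alpha) - 1) divided by alpha^(-1/2)."""
    return math.sqrt(alpha * trigamma(alpha) - 1.0) * math.sqrt(alpha)


small = jeffreys_ratio(1e-4)   # should be close to 1
large = jeffreys_ratio(1e4)    # should be close to 1 / sqrt(2)
```

Both ratios stay bounded away from zero and infinity, which is all that the relation $\underset{α \to a}{\propto}$ requires.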

The conditional posterior distribution for $β$ is (8). The marginal posterior distribution for $α$ is given by

$π_{3} (α | 𝐱) \propto \frac{Γ (n α) \sqrt{α ψ' (α) - 1}}{Γ {(α)}^{n}} {(\frac{\sqrt[n]{\prod_{i = 1}^{n} x_{i}}}{\sum_{i = 1}^{n} x_{i}})}^{n α} .$

3.4 - Miller prior

Miller 1980 discussed three objective priors for the parameters of the gamma distribution, where the first two were the Jeffreys rule and the Jeffreys prior. However, the author chose a third prior on the grounds that it requires fewer computational subroutines. This prior is given by

$π_{4} (α, β) \propto \frac{1}{β} .$ (11)

Note that much progress has been made in computational analysis, and many of these computational limitations have been overcome, especially after Gelfand & Smith 1990 successfully applied Gibbs sampling in Bayesian analysis.

The joint posterior distribution for $α$ and $β$ produced by the Miller’s prior is

$\begin{matrix} π_{4} (α, β | 𝐱) \propto \frac{β^{n α - 1}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} exp {- β \sum_{i = 1}^{n} x_{i}} . \end{matrix}$ (12)

Theorem 3.4. The posterior density (12) is proper for any sample size, in which case the posterior moments for $α$ and $β$ are finite.

Proof. Since $π (β) = β^{- 1}$ and $π (α) = α^{0}$ , then $c = - 1$ and $s_{0} = s_{\infty} = 0$ are valid constants for application of Theorem 2.6. Therefore, since $c = - 1$ and $n > - s_{0} = 0$ for all $n \in ℕ$ , the result follows directly from Theorem 2.6 and Theorem 2.7. ◻

The conditional posterior distribution for $β$ is (8). The marginal posterior distribution for $α$ is given by

$\begin{matrix} π_{4} (α | 𝐱) \propto \frac{Γ (n α)}{Γ {(α)}^{n}} {(\frac{\sqrt[n]{\prod_{i = 1}^{n} x_{i}}}{\sum_{i = 1}^{n} x_{i}})}^{n α} . \end{matrix}$

3.5 - Reference prior

Bernardo 1979 proposed maximizing the expected Kullback-Leibler divergence between the posterior distribution and the prior to obtain an objective prior. This yields a class of non-informative priors known as reference priors. The reference prior provides posterior distributions with interesting properties such as invariance under one-to-one transformations, consistent marginalization and consistent sampling properties (Bernardo 2005). The procedure to obtain reference priors is described as follows.

Corollary 3.5. Bernardo 2005: Let $𝛉 = (θ_{1}, θ_{2})$ be the vector of parameters and let $p (θ_{1}, θ_{2} | 𝐱)$ be the posterior distribution, with asymptotic normal distribution and dispersion matrix $S (θ_{1}, θ_{2}) = I^{- 1} (θ_{1}, θ_{2})$ , and let $H (θ_{1}, θ_{2}) = S^{- 1} (θ_{1}, θ_{2})$ . Moreover, let $θ_{1}$ be the parameter of interest and $θ_{2}$ the nuisance parameter. Then, if the parameter space of $θ_{2}$ is independent of $θ_{1}$ and if the functions $s_{1, 1} (θ_{1}, θ_{2})$ and $h_{2, 2} (θ_{1}, θ_{2})$ factorize in the form $s_{1, 1}^{- \frac{1}{2}} (θ_{1}, θ_{2}) = f_{1} (θ_{1}) g_{1} (θ_{2})$ and $h_{2, 2}^{\frac{1}{2}} (θ_{1}, θ_{2}) = f_{2} (θ_{1}) g_{2} (θ_{2})$ , it will follow that $π_{θ_{1}} (θ_{1}, θ_{2}) \propto f_{1} (θ_{1}) g_{2} (θ_{2})$ and that there is no need for compact approximations.

3.5.1 - Reference prior when $α$ is the parameter of interest

From Corollary 3.5 the reference prior when $α$ is the parameter of interest and $β$ is the nuisance parameter is given by

$π_{5} (α, β) \propto \frac{1}{β} \sqrt{\frac{α ψ' (α) - 1}{α}} .$ (13)

Therefore, the joint posterior distribution for $α$ and $β$ , produced by the reference prior (13) is given by

$π_{5} (α, β | 𝐱) \propto \sqrt{\frac{α ψ' (α) - 1}{α}} \frac{β^{n α - 1}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} exp {- β \sum_{i = 1}^{n} x_{i}} .$ (14)

Theorem 3.6. The posterior density (14) is proper if and only if $n \geq 2$ , in which case the posterior moments for $α$ and $β$ are finite.

Proof. We proved in Theorem 3.3 that $\sqrt{α ψ' (α) - 1} \underset{α \to 0^{+}}{\propto} α^{- \frac{1}{2}}$ and $\sqrt{α ψ' (α) - 1} \underset{α \to \infty}{\propto} α^{- \frac{1}{2}}$ . It follows that

$\sqrt{\frac{α ψ' (α) - 1}{α}} \underset{α \to 0^{+}}{\propto} α^{- 1} \quad \text{and} \quad \sqrt{\frac{α ψ' (α) - 1}{α}} \underset{α \to \infty}{\propto} α^{- 1} .$

Then $c = - 1$ and $s_{0} = s_{\infty} = - 1$ , so that $n > - s_{0} = 1$ if and only if $n \geq 2$ , and the result follows directly from Theorems 2.6 and 2.7. ◻

The conditional posterior distribution for $β$ is (8). The marginal posterior distribution for $α$ is given by

$π_{5} (α | 𝐱) \propto \sqrt{\frac{α ψ' (α) - 1}{α}} \frac{Γ (n α)}{Γ {(α)}^{n}} {(\frac{\sqrt[n]{\prod_{i = 1}^{n} x_{i}}}{\sum_{i = 1}^{n} x_{i}})}^{n α} .$

3.5.2 - Reference prior when $β$ is the parameter of interest

The reference prior when $β$ is the parameter of interest and $α$ is the nuisance parameter is given by

$π_{6} (α, β) \propto \frac{\sqrt{ψ' (α)}}{β} .$ (15)

The joint posterior distribution for $α$ and $β$ , produced by the reference prior (15) is given by

$π_{6} (α, β | 𝐱) \propto β^{n α - 1} \frac{\sqrt{ψ' (α)}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} exp {- β \sum_{i = 1}^{n} x_{i}} .$ (16)

Theorem 3.7. The posterior density (16) is proper if and only if $n \geq 2$ , in which case the posterior moments for $α$ and $β$ are finite.

Proof. Following Abramowitz & Stegun 1972, we have that ${lim}_{α \to 0^{+}} \frac{ψ' (α)}{α^{- 2}} = 1$ and ${lim}_{α \to \infty} \frac{ψ' (α)}{α^{- 1}} = 1$ . Thus, $\sqrt{ψ' (α)} \underset{α \to 0^{+}}{\propto} α^{- 1}$ and $\sqrt{ψ' (α)} \underset{α \to \infty}{\propto} α^{- \frac{1}{2}}$ . Therefore we conclude that $c = - 1$ , $s_{0} = - 1$ , $s_{\infty} = - \frac{1}{2}$ are valid constants for application of Theorem 2.6. Thus, since $n > - s_{0} = 1$ if and only if $n > 1$ , the result follows from Theorem 2.6 and Theorem 2.7. ◻

The conditional posterior distribution for $β$ is (8). The marginal posterior distribution for $α$ is given by

$π_{6} (α | 𝐱) \propto \sqrt{ψ' (α)} \frac{Γ (n α)}{Γ {(α)}^{n}} {(\frac{\sqrt[n]{\prod_{i = 1}^{n} x_{i}}}{\sum_{i = 1}^{n} x_{i}})}^{n α} .$

There are different ways to derive the same reference priors in the presence of nuisance parameters, e.g., Liseo 1993, Sun & Ye 1996 and Moala et al. 2013.

3.5.3 - Overall reference prior

The reference priors presented so far consider the presence of nuisance parameters. However, in many situations we are simultaneously interested in all parameters of the model. Sun & Ye 1996 considered the Bar-Lev & Reiser 1982 two-parameter exponential family and presented a straightforward procedure to derive overall reference priors. Since the gamma distribution can be expressed in the form of Bar-Lev and Reiser’s two-parameter exponential family, the overall reference prior (Berger et al. 2015) is given by

$π_{7} (α, β) \propto \frac{1}{β} \sqrt{\frac{α ψ' (α) - 1}{α}}$ (17)

which is the same as the reference prior when $α$ is the parameter of interest and $β$ is the nuisance parameter.

3.6 - Maximal Data Information prior

Zellner 1977, 1984 introduced another objective prior whose information is weak compared with the information in the data. This prior is known as the Maximal Data Information (MDI) prior and can be obtained by solving

$π_{8} (α, β) \propto exp (\int_{0}^{\infty} log (f (t | α, β)) f (t | α, β) d t) .$ (18)

Therefore, the MDI prior (18) for the Gamma distribution (1) is given by

$π_{8} (α, β) \propto \frac{β}{Γ (α)} exp {(α - 1) ψ (α) - α} .$ (19)

The joint posterior distribution for $α$ and $β$ , produced by the MDI prior, is

$\begin{matrix} π_{8} (α, β | 𝐱) & \propto \frac{β^{n α + 1}}{Γ {(α)}^{n + 1}} {\prod_{i = 1}^{n} x_{i}^{α}} exp {- β \sum_{i = 1}^{n} x_{i} + (α - 1) ψ (α) - α} . \end{matrix}$ (20)

Moala et al. 2013 argued that the posterior distribution (20) is improper. However, the authors did not present a proof of this result. The following theorem gives a rigorous proof that confirms the conjecture.

Theorem 3.8. The joint posterior density (20) is improper for any $n \in ℕ$ .

Proof. Following Abramowitz & Stegun 1972, ${lim}_{α \to 0^{+}} \frac{Γ (α)}{α^{- 1}} = 1$ and ${lim}_{α \to 0^{+}} \frac{ψ (α)}{α^{- 1}} = - 1$ . Thus,

$\begin{matrix} lim_{α \to 0^{+}} \frac{π (α)}{α^{s_{0}}} & = lim_{α \to 0^{+}} \frac{\frac{1}{Γ (α)} e^{(α - 1) ψ (α) - α}}{α^{s_{0}}} = lim_{α \to 0^{+}} \frac{α^{- 1}}{Γ (α)} \frac{e^{(α - 1) ψ (α) - α}}{e^{α^{- 1}}} \frac{e^{α^{- 1}}}{α^{s_{0} - 1}} \\ = lim_{α \to 0^{+}} 1 \times e^{α ψ (α) - α} e^{- ψ (α) - α^{- 1}} \frac{e^{α^{- 1}}}{α^{s_{0} - 1}} = lim_{α \to 0^{+}} e^{\frac{ψ (α)}{α^{- 1}} - α} e^{- ψ (α + 1)} \frac{e^{α^{- 1}}}{α^{s_{0} - 1}} \\ = e^{- 1} e^{- ψ (1)} lim_{α \to 0^{+}} \frac{e^{α^{- 1}}}{α^{s_{0} - 1}} = e^{- 1} e^{- ψ (1)} lim_{u \to \infty} \frac{e^{u}}{u^{- s_{0} + 1}} = \infty . \end{matrix}$ (21)

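Since $π (β) = β$ gives $c = 1 \geq - 1$ , and since the computation in (21) holds for every real $s_{0}$ , we have in particular ${lim}_{α \to 0^{+}} π (α) α^{s} = \infty$ for every $s \in ℕ$ (take $s_{0} = - s$ ). The result then follows from the second case of Theorem 2.6. ◻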
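The divergence in (21) can also be observed numerically. The Python sketch below is an illustration under stated assumptions: `digamma` is a hypothetical standard-library-only helper based on the recurrence $ψ (x) = ψ (x + 1) - x^{- 1}$ and the usual asymptotic series, and `log_scaled` evaluates $\log [π (α) α^{s}]$ for the $α$ -part of the MDI prior (19), $π (α) = \exp \{(α - 1) ψ (α) - α\} / Γ (α)$ .

```python
import math


def digamma(x):
    """psi(x) for x > 0: shift x upward with psi(x) = psi(x + 1) - 1/x,
    then apply the asymptotic series for large arguments."""
    if x <= 0:
        raise ValueError("x must be positive")
    total = 0.0
    while x < 10.0:
        total -= 1.0 / x
        x += 1.0
    inv = 1.0 / x
    inv2 = inv * inv
    # log(x) - 1/(2x) - 1/(12 x^2) + 1/(120 x^4) - 1/(252 x^6)
    total += math.log(x) - 0.5 * inv - inv2 * (1.0 / 12 - inv2 * (1.0 / 120 - inv2 / 252))
    return total


def log_mdi_alpha_part(alpha):
    """log of exp((alpha - 1) * psi(alpha) - alpha) / Gamma(alpha)."""
    return (alpha - 1.0) * digamma(alpha) - alpha - math.lgamma(alpha)


def log_scaled(alpha, s):
    """log of pi(alpha) * alpha^s; it keeps growing as alpha -> 0+ for any fixed s."""
    return log_mdi_alpha_part(alpha) + s * math.log(alpha)


values = [log_scaled(a, 5) for a in (1e-1, 1e-2, 1e-3)]
```

Even after multiplying by $α^{5}$ , the values increase as $α$ decreases, reflecting the $e^{1 / α}$ growth that no power of $α$ can compensate.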

3.6.1 - Modified MDI prior

Moala et al. 2013 introduced a modified maximal data information (MMDI) prior given by

$π_{9} (α, β) \propto \frac{β}{Γ (α)} exp {(α - 1) \frac{ψ (α)}{Γ (α)} - α} .$ (22)

The joint posterior distribution for $α$ and $β$ , produced by the MMDI prior, is

$\begin{matrix} π_{9} (α, β | 𝐱) & \propto \frac{β^{n α + 1}}{Γ {(α)}^{n + 1}} {\prod_{i = 1}^{n} x_{i}^{α}} exp {- β \sum_{i = 1}^{n} x_{i} + (α - 1) \frac{ψ (α)}{Γ (α)} - α} . \end{matrix}$ (23)

Theorem 3.9. The posterior density (23) is proper for every $n \in ℕ$ , in which case the posterior moments for $α$ and $β$ are finite.

Proof. Following Abramowitz & Stegun 1972, ${lim}_{α \to 0^{+}} \frac{Γ (α)}{α^{- 1}} = 1$ and ${lim}_{α \to 0^{+}} \frac{ψ (α)}{α^{- 1}} = - 1$ . Thus ${lim}_{α \to 0^{+}} \frac{ψ (α)}{Γ (α)} = - 1$ and

$\begin{matrix} \lim_{α \to 0^{+}} \frac{π_{9} (α)}{α} & = \lim_{α \to 0^{+}} \frac{\frac{1}{Γ (α)} e^{(α - 1) \frac{ψ (α)}{Γ (α)} - α}}{α} = \lim_{α \to 0^{+}} \frac{α^{- 1}}{Γ (α)} e^{(α - 1) \frac{ψ (α)}{Γ (α)} - α} \\ & = 1 \times e^{(- 1) (- 1) - 0} = e > 0 . \end{matrix}$ (24)

On the other hand, ${lim}_{α \to \infty} \frac{ψ (α)}{log (α)} = 1$ and by the Stirling approximation (see Abramowitz & Stegun 1972) we have ${lim}_{α \to \infty} \frac{Γ (α)}{α^{α - \frac{1}{2}} e^{- α}} = \sqrt{2 π}$ and ${lim}_{α \to \infty} \frac{Γ (α)}{α^{2}} = \infty$ . Then

$\begin{matrix} \lim_{α \to \infty} \frac{π_{9} (α)}{α^{\frac{1}{2} - α}} & = \lim_{α \to \infty} \frac{\frac{1}{Γ (α)} e^{(α - 1) \frac{ψ (α)}{Γ (α)} - α}}{α^{\frac{1}{2} - α}} = \lim_{α \to \infty} \frac{α^{α - \frac{1}{2}} e^{- α}}{Γ (α)} e^{(α - 1) \frac{ψ (α)}{Γ (α)}} \\ & = \frac{1}{\sqrt{2 π}} \lim_{α \to \infty} e^{(1 - \frac{1}{α}) \frac{ψ (α)}{\log (α)} \frac{\log (α)}{α} \frac{α^{2}}{Γ (α)}} = \frac{1}{\sqrt{2 π}} e^{1 \times 1 \times 0 \times 0} = \frac{1}{\sqrt{2 π}} > 0 . \end{matrix}$ (25)

Now, define

$π_{9}^{*} (α) = \begin{cases} α, & \text{if } α \leq 1 \\ α^{\frac{1}{2} - α}, & \text{if } α > 1 \end{cases} \quad \text{and} \quad χ (α) = \begin{cases} α, & \text{if } α \leq 1 \\ α^{- \frac{1}{2}}, & \text{if } α > 1 . \end{cases}$ (26)

Then, from (24) and (25) we have $π_{9} (α) \underset{α \to 0^{+}}{\propto} π_{9}^{*} (α)$ and $π_{9} (α) \underset{α \to \infty}{\propto} π_{9}^{*} (α)$ , which implies that $π_{9} (α) \propto π_{9}^{*} (α)$ from Proposition 2.4. However, $π_{9}^{*} (α) \leq χ (α)$ and the prior $π_{9} (β) χ (α) = β χ (α)$ leads to a proper posterior as well as posterior moments for every $n \in ℕ$ by Theorem 2.6 and Theorem 2.7. Therefore $α^{r} β^{s} π_{9} (α, β) \propto α^{r} β^{s} π_{9} (β) π_{9}^{*} (α) \leq α^{r} β^{s} π_{9} (β) χ (α)$ also leads to a proper posterior for every $n \in ℕ$ , $s \in ℕ$ and $r \in ℕ$ which proves the result. ◻

The marginal posterior distribution for $α$ is given by

$π_{9} (α | 𝐱) \propto \frac{Γ (n α + 2)}{Γ {(α)}^{n + 1}} \exp \left\{(α - 1) \frac{ψ (α)}{Γ (α)} - α\right\} {(\frac{\sqrt[n]{\prod_{i = 1}^{n} x_{i}}}{\sum_{i = 1}^{n} x_{i}})}^{n α} .$

The conditional posterior distribution for $β$ is given by

$π_{9} (β | α, 𝐱) \sim Gamma (n α + 2, \sum_{i = 1}^{n} x_{i}) .$

3.7 - Tibshirani priors

Tibshirani 1989 discussed an alternative method to derive a class of objective priors $π (θ_{1}, θ_{2})$ where $θ_{1}$ is the parameter of interest so that the credible interval for $θ_{1}$ has coverage error $O$ ( $n^{- 1}$ ) in the frequentist sense, i.e.,

$P [θ_{1} \leq θ_{1}^{1 - α} (π; X) | (θ_{1}, θ_{2})] = 1 - α - O (n^{- 1}),$ (27)

where $θ_{1}^{1 - α} (π; X) | (θ_{1}, θ_{2})$ denotes the $(1 - α)$ th quantile of the posterior distribution of $θ_{1}$ . The priors satisfying (27) are known as matching priors up to $O (n^{- 1})$ . Mukerjee & Dey 1993 discussed necessary and sufficient conditions for a class of Tibshirani priors to be matching priors up to $o (n^{- 1})$ .

Sun & Ye 1996 proved that the reference prior (13) is also a Tibshirani prior when $α$ is the parameter of interest and $β$ is the nuisance parameter, and the Tibshirani prior when $β$ is the parameter of interest and $α$ is the nuisance parameter, with order $O (n^{- 1})$ . They also proved that when $α$ is the parameter of interest, there is no matching prior up to order $o (n^{- 1})$ . Finally, they presented a Tibshirani prior when $β$ is the parameter of interest that is a matching prior up to order $o (n^{- 1})$ ; this prior is given by

$π_{10} (α, β) \propto \frac{α ψ' (α) - 1}{β \sqrt{α}} .$ (28)

The joint posterior distribution for $α$ and $β$ , produced by the Tibshirani prior (28) is given by

$π_{10} (α, β | 𝐱) \propto \frac{(α ψ' (α) - 1)}{\sqrt{α}} \frac{β^{n α - 1}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} exp {- β \sum_{i = 1}^{n} x_{i}} .$ (29)

Theorem 3.10. The posterior density (29) is proper if and only if $n \geq 2$ , in which case the posterior moments for $α$ and $β$ are finite.

Proof. We proved in Theorem 3.3 that $\sqrt{α ψ' (α) - 1} \underset{α \to 0^{+}}{\propto} α^{- \frac{1}{2}}$ and that $\sqrt{α ψ' (α) - 1} \underset{α \to \infty}{\propto} α^{- \frac{1}{2}}$ . From that, it follows that

$\frac{α ψ' (α) - 1}{\sqrt{α}} \underset{α \to 0^{+}}{\propto} \frac{α^{- 1}}{α^{\frac{1}{2}}} = α^{- \frac{3}{2}} \quad \text{and} \quad \frac{α ψ' (α) - 1}{\sqrt{α}} \underset{α \to \infty}{\propto} \frac{α^{- 1}}{α^{\frac{1}{2}}} = α^{- \frac{3}{2}} .$

Thus $c = - 1$ and $s_{0} = s_{\infty} = - \frac{3}{2}$ , so that $n > - s_{0} = \frac{3}{2}$ if and only if $n \geq 2$ , and the result follows directly from Theorem 2.6 and Theorem 2.7. ◻

The conditional posterior distribution for $β$ is (8). The marginal posterior distribution for $α$ is given by

$π_{10} (α | 𝐱) \propto \frac{(α ψ' (α) - 1)}{\sqrt{α}} \frac{Γ (n α)}{Γ {(α)}^{n}} {(\frac{\sqrt[n]{\prod_{i = 1}^{n} x_{i}}}{\sum_{i = 1}^{n} x_{i}})}^{n α} .$

3.8 - Consensus prior

A rather natural approach to find an objective prior is to start with a collection of objective priors and take its average. Berger et al. 2015 discussed this prior averaging approach under the two most natural averages, the geometric mean and the arithmetic mean.

3.8.1 - Geometric mean

Let $π_{i} (α, β), i = 3, 5, 6, 7, 10$ be a collection of objective priors. These priors were selected due to their invariance property under one-to-one transformations. Then, our geometric mean (GM) prior is given by

$π_{11} (α, β) \propto \frac{1}{β} \sqrt[5]{\frac{{(α ψ' (α) - 1)}^{\frac{5}{2}} ψ' {(α)}^{\frac{1}{2}}}{α^{\frac{3}{2}}}} \propto \frac{1}{β} \frac{\sqrt{α ψ' (α) - 1} \sqrt[10]{ψ' (α)}}{α^{\frac{3}{10}}} .$ (30)

Note that, since our prior was constructed as a geometric mean of one-to-one invariant priors then such prior has also invariance property under one-to-one transformations.
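The simplification in (30) can be verified numerically by comparing the fifth root of the product of the $α$ -parts of $π_{3}, π_{5}, π_{6}, π_{7}, π_{10}$ with the closed form on the right-hand side. The Python sketch below is an illustrative check only; `trigamma`, `alpha_parts`, `geometric_mean_prior` and `simplified_gm_prior` are hypothetical names, and the trigamma helper uses a recurrence plus asymptotic series because the standard library does not provide one.

```python
import math


def trigamma(x):
    """psi'(x) for x > 0 via psi'(x) = psi'(x + 1) + 1/x^2 and an asymptotic series."""
    if x <= 0:
        raise ValueError("x must be positive")
    total = 0.0
    while x < 10.0:
        total += 1.0 / (x * x)
        x += 1.0
    inv = 1.0 / x
    inv2 = inv * inv
    total += inv + 0.5 * inv2 + inv * inv2 * (1.0 / 6 - inv2 * (1.0 / 30 - inv2 / 42))
    return total


def alpha_parts(alpha):
    """alpha-dependent factors of the five priors (the common 1/beta is dropped)."""
    t = trigamma(alpha)
    d = alpha * t - 1.0
    return [
        math.sqrt(d),           # pi_3, Jeffreys prior (9)
        math.sqrt(d / alpha),   # pi_5, reference prior (13)
        math.sqrt(t),           # pi_6, reference prior (15)
        math.sqrt(d / alpha),   # pi_7, overall reference prior (17)
        d / math.sqrt(alpha),   # pi_10, Tibshirani prior (28)
    ]


def geometric_mean_prior(alpha):
    """Fifth root of the product of the five alpha-parts."""
    return math.prod(alpha_parts(alpha)) ** 0.2


def simplified_gm_prior(alpha):
    """Closed form on the right-hand side of (30), without the 1/beta factor."""
    t = trigamma(alpha)
    return math.sqrt(alpha * t - 1.0) * t ** 0.1 / alpha ** 0.3
```

The two functions should agree up to floating-point rounding for any $α > 0$ , since the exponents of $α ψ' (α) - 1$ , $ψ' (α)$ and $α$ in the product are $\frac{5}{2}$ , $\frac{1}{2}$ and $- \frac{3}{2}$ respectively.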

The joint posterior distribution for $α$ and $β$ , produced by the consensus prior, is

$\begin{matrix} π_{11} (α, β | 𝐱) \propto \frac{ψ' {(α)}^{\frac{1}{10}} \sqrt{α ψ' (α) - 1}}{α^{\frac{3}{10}}} \frac{β^{n α - 1}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} exp {- β \sum_{i = 1}^{n} x_{i}} . \end{matrix}$ (31)

Theorem 3.11. The posterior density (31) is proper if and only if $n \geq 2$ , in which case the posterior moments for $α$ and $β$ are finite.

Proof. The result follows directly from Proposition 2.8 and Theorem 2.7. ◻

The conditional posterior distribution for $β$ is (8). The marginal posterior distribution for $α$ is given by

$\begin{matrix} π_{11} (α | 𝐱) \propto \frac{ψ' {(α)}^{\frac{1}{10}} \sqrt{α ψ' (α) - 1}}{α^{\frac{3}{10}}} \frac{Γ (n α)}{Γ {(α)}^{n}} {(\frac{\sqrt[n]{\prod_{i = 1}^{n} x_{i}}}{\sum_{i = 1}^{n} x_{i}})}^{n α} . \end{matrix}$

3.8.2 - Arithmetic mean

Let $π_{i} (α, β), i = 3, 5, 6, 7, 10$ be a collection of objective priors. Then, our arithmetic mean (AM) prior is given by

$π_{12} (α, β) \propto \frac{π_{12} (α)}{β}$

where

$π_{12} (α) = (\frac{2 \sqrt{α ψ' (α) - 1} + \sqrt{α ψ' (α)} + \sqrt{α^{2} ψ' (α) - α} + α ψ' (α) - 1}{\sqrt{α}}) .$

The joint posterior distribution for $α$ and $β$ , produced by the consensus prior, is

$\begin{matrix} π_{12} (α, β | 𝐱) \propto π_{12} (α) \frac{β^{n α - 1}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} exp {- β \sum_{i = 1}^{n} x_{i}} . \end{matrix}$ (32)

Theorem 3.12. The posterior density (32) is proper if and only if $n \geq 2$ , in which case the posterior moments for $α$ and $β$ are finite.

Proof. The result follows directly from Theorems 2.7 and 2.8. ◻

The conditional posterior distribution for $β$ is (8). The marginal posterior distribution for $α$ is given by

$\begin{matrix} π_{12} (α | 𝐱) \propto π_{12} (α) \frac{Γ (n α)}{Γ {(α)}^{n}} {(\frac{\sqrt[n]{\prod_{i = 1}^{n} x_{i}}}{\sum_{i = 1}^{n} x_{i}})}^{n α} . \end{matrix}$
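Viewed as a function of $β$ alone, the joint posterior (32) is proportional to $β^{n α - 1} exp (- β \sum_{i = 1}^{n} x_{i})$, the kernel of a gamma density with shape $n α$ and rate $\sum_{i = 1}^{n} x_{i}$. Assuming the conditional posterior (8) has exactly this form, $β$ can be drawn directly once a value of $α$ is available. A minimal Python sketch, in which the function name `draw_beta`, the seed and the toy numbers are hypothetical:

```python
import random

def draw_beta(alpha, x, rng):
    """Draw beta | alpha, x from Gamma(shape = n * alpha, rate = sum(x))."""
    shape = len(x) * alpha
    rate = sum(x)
    # random.gammavariate takes (shape, scale), so the scale is 1 / rate.
    return rng.gammavariate(shape, 1.0 / rate)

rng = random.Random(2021)
x = [1.2, 0.7, 2.9, 1.8, 0.4, 3.3, 1.1, 2.2]  # toy data, not from the paper
alpha = 2.0                                   # e.g. one draw from the marginal of alpha
draws = [draw_beta(alpha, x, rng) for _ in range(20000)]
mc_mean = sum(draws) / len(draws)
exact_mean = len(x) * alpha / sum(x)          # mean of Gamma(n * alpha, sum(x))
print(f"Monte Carlo mean {mc_mean:.3f}, exact mean {exact_mean:.3f}")
```

In a full sampler, each draw of $α$ from its marginal posterior would be paired with one such draw of $β$.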

4 - NUMERICAL EVALUATION

A simulation study is presented to compare the influence of the different objective priors on the posterior distributions and to select an objective prior that returns good results in terms of the mean relative error (MRE) and the mean square error (MSE), given by

$\mathrm{MRE}_{i} = \frac{1}{N} \sum_{j = 1}^{N} \frac{{\hat{θ}}_{i, j}}{θ_{i}} \quad \text{and} \quad \mathrm{MSE}_{i} = \sum_{j = 1}^{N} \frac{{({\hat{θ}}_{i, j} - θ_{i})}^{2}}{N}, \quad i = 1, 2,$

where $𝛉 = (α, β)$ and $N = 10,000$ is the number of estimates obtained through the posterior means of $α$ and $β$ . The $95\%$ coverage probabilities ( $CP_{95\%}$ ) of the credibility intervals for $α$ and $β$ are also evaluated. Under this approach, the best estimators are those with MRE closest to one and MSE closest to zero. In addition, for a large number of experiments at the $95\%$ level, the frequency of intervals that cover the true values of $𝛉$ should be close to $95\%$ .
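The three criteria can be written as short functions. The Python sketch below is only an illustration of the definitions above, not the authors' R implementation; the function names and the four toy estimates and intervals are hypothetical.

```python
def mre(estimates, true_value):
    """Mean relative error: average of estimate / true value."""
    return sum(e / true_value for e in estimates) / len(estimates)

def mse(estimates, true_value):
    """Mean square error: average squared deviation from the true value."""
    return sum((e - true_value) ** 2 for e in estimates) / len(estimates)

def coverage(intervals, true_value):
    """Fraction of (lower, upper) intervals that contain the true value."""
    hits = sum(1 for lower, upper in intervals if lower <= true_value <= upper)
    return hits / len(intervals)

# Toy values standing in for N posterior means and credibility intervals.
alpha_true = 2.0
alpha_hat = [2.1, 1.9, 2.4, 1.6]
alpha_ci = [(1.5, 2.8), (1.2, 2.5), (2.05, 3.1), (1.0, 2.2)]

print(mre(alpha_hat, alpha_true), mse(alpha_hat, alpha_true),
      coverage(alpha_ci, alpha_true))
```

In the simulation study each list would hold $N = 10,000$ entries, one per simulated sample.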

The results were computed using the software R. Sample sizes $n = (10, 20, \dots, 120)$ were considered and, for reasons of space, the results are presented only for $𝛉 = ((4, 2), (0.5, 5))$ ; the results were similar for other choices of $α$ and $β$ . Using MCMC methods, we computed the posterior means of $α$ and $β$ and the credibility intervals for both parameters. In terms of decision theory, we considered the squared error loss function (SELF), under which the Bayes estimator is the posterior mean. Moreover, the posterior mean is finite for $n \geq 2$ and is optimal under the Kullback-Leibler divergence. Tables I and II, available in Appendix B, present the MREs, MSEs and $CP_{95\%}$ for the different estimators of $α$ and $β$ .

From these results, for both parameters the posterior mean under the Tibshirani prior performs better than those obtained with the other priors in terms of MRE and MSE. The better performance of this approach is also confirmed by the coverage probabilities of the credibility intervals. It is worth mentioning that the near-nominal frequentist coverage of the Tibshirani prior is a consequence of its construction. Although we have presented only two scenarios for the parameters, the results were similar for other choices of $𝛉$ . Overall, we conclude that the posterior distribution obtained with the Tibshirani prior should be used to make inference on the parameters of the Gamma distribution.

5 - DISCUSSION

In this study, we presented a theorem that provides simple conditions under which an improper prior yields a proper posterior for the Gamma distribution. Further, we provided sufficient conditions to verify whether the posterior moments of the parameters are finite. An interesting aspect of our findings is that one can check whether the posterior is proper or improper, and whether its posterior moments are finite, by looking directly at the behavior of the proposed improper prior.

The proposed methodology was applied to different objective priors. The MDI prior was the only one that yielded an improper posterior for every sample size. An extensive simulation study showed that the posterior distribution obtained under the Tibshirani prior provided more accurate results in terms of MRE, MSE and coverage probabilities. Therefore, this posterior distribution should be used to make inference on the unknown parameters of the Gamma distribution. This study can be extended to other distributions. For instance, in a homogeneous Poisson process, the lengths of the inter-arrival times can be modeled using an exponential distribution Exp( $λ$ ) with the following hierarchical structure

$\begin{matrix} y_{1}, \dots, y_{n} \sim f (y | λ) \\ λ \sim \mathrm{Gamma} (α, β) \\ π (α, β) \propto π (α) π (β) . \end{matrix}$

In this case we have a posterior distribution $π (λ, α, β | 𝐲)$ that depends on three parameters (see Papadopoulos 1989). Although the results presented here cannot be used to select the best prior, due to the additional parameter $λ$ , the same approach will be considered in further research.

ACKNOWLEDGMENTS

The authors are thankful to the Editorial Board and two reviewers for their valuable comments and suggestions which led to this improved version. Pedro L. Ramos is grateful to the São Paulo State Research Foundation (FAPESP Proc. 2017/25971-0). Eduardo Ramos acknowledges financial support from the São Paulo State Research Foundation (FAPESP Proc. 2019/27636-9). Francisco Louzada is supported by the Brazilian agencies CNPq (grant number 301976/2017-1) and FAPESP (grant number 2013/07375-0).

REFERENCES

ABRAMOWITZ M & STEGUN IA. 1972. Handbook of Mathematical Functions. 10th ed. Washington, D.C.: NBS, p. 1046.
BAR-LEV SK & REISER B. 1982. An exponential subfamily which admits UMPU tests based on a single test statistic. Ann Stat 979-989.
BERGER JO, BERNARDO JM & SUN D. 2015. Overall objective priors. Bayesian Anal 10(1): 189-221.
BERNARDO JM. 1979. Reference posterior distributions for Bayesian inference. J Roy Stat Soc B p. 113-147.
BERNARDO JM. 2005. Reference analysis. Handb Stat 25: 17-90.
DEY S & MOALA FA. 2018. Objective and subjective prior distributions for the Gompertz distribution. An Acad Bras Cienc 90: 2643-2661.
FOLLAND GB. 1999. Real analysis: modern techniques and their applications. 2nd ed. New York: Wiley, 408 p.
GELFAND AE & SMITH AF. 1990. Sampling-based approaches to calculating marginal densities. J Am Stat Assoc 85(410): 398-409.
JEFFREYS H. 1946. An invariant form for the prior probability in estimation problems. P Roy Soc A-Math Phy 186(1007): 453-461.
KASS RE & WASSERMAN L. 1996. The selection of prior distributions by formal rules. J Am Stat Assoc 91(435): 1343-1370.
LISEO B. 1993. Elimination of nuisance parameters with reference priors. Biometrika 80(2): 295-304.
LOUZADA F & RAMOS PL. 2018. Efficient closed-form maximum a posteriori estimators for the gamma distribution. J Stat Comput Sim 88(6): 1134-1146.
MILLER RB. 1980. Bayesian analysis of the two-parameter gamma distribution. Technometrics 22(1): 65-69.
MOALA FA, RAMOS PL & ACHCAR JA. 2013. Bayesian Inference for Two-Parameter Gamma Distribution Assuming Different Noninformative Priors. Rev Colomb Estad 36(2): 321-338.
MUKERJEE R & DEY DK. 1993. Frequentist validity of posterior quantiles in the presence of a nuisance parameter: higher order asymptotics. Biometrika 80(3): 499-505.
NORTHROP P & ATTALIDES N. 2016. Posterior propriety in Bayesian extreme value analyses using reference priors. Stat Sinica 26(2).
PAPADOPOULOS AG. 1989. A hierarchical approach to the study of the exponential failure model. Commun Stat-Theor M 18(12): 4375-4392.
RAMOS PL, ALMEIDA MP, TOMAZELLA VL & LOUZADA F. 2019. Improved Bayes estimators and prediction for the Wilson-Hilferty distribution. An Acad Bras Cienc 91: e20190002.
SUN D & YE K. 1996. Frequentist validity of posterior quantiles for a two-parameter exponential family. Biometrika 83(1): 55-65.
TIBSHIRANI R. 1989. Noninformative priors for one parameter of many. Biometrika 76(3): 604-608.
ZELLNER A. 1977. Maximal Data Information Prior Distributions. New Meth Appli Bay Meth 211-232.
ZELLNER A. 1984. Maximal Data Information Prior Distributions. Bas Iss Econ, 334 p.

APPENDIX A

PROOF OF THEOREM 2.7

Proof. Let

$\begin{matrix} d (𝐱) & \propto \int_{ℬ} \frac{π (α) β^{n α + c}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} exp {- β \sum_{i = 1}^{n} x_{i}} d 𝚯 \end{matrix}$ (32)

Since $π (α) β^{n α + c} Γ {(α)}^{- n} \prod_{i = 1}^{n} x_{i}^{α} exp (- β \sum_{i = 1}^{n} x_{i}) \geq 0$ , by the Fubini-Tonelli Theorem (see Folland 1999) we have

$\begin{matrix} d (𝐱) & \propto \int_{ℬ} \frac{π (α) β^{n α + c}}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} exp {- β \sum_{i = 1}^{n} x_{i}} d 𝚯 \\ = \int_{0}^{\infty} \frac{π (α)}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} \int_{0}^{\infty} β^{n α + c} exp {- β \sum_{i = 1}^{n} x_{i}} d β d α . \end{matrix}$ (33)
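The inner integral over $β$ in (33) has a closed form whenever $n α + c > - 1$ . As a sketch of this standard step (the substitution variable $t$ is introduced here only for illustration), setting $t = β \sum_{i = 1}^{n} x_{i}$ gives

$\int_{0}^{\infty} β^{n α + c} \exp (- β \sum_{i = 1}^{n} x_{i}) d β = \frac{1}{{(\sum_{i = 1}^{n} x_{i})}^{n α + c + 1}} \int_{0}^{\infty} t^{n α + c} e^{- t} d t = \frac{Γ (n α + c + 1)}{{(\sum_{i = 1}^{n} x_{i})}^{n α + c + 1}} .$

When $n α + c \leq - 1$ the integrand behaves like $β^{n α + c}$ near zero and the integral diverges.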

The rest of the proof is divided into three cases, which are given below:

Case i): Suppose $c < - 1$ . Notice that $\int_{0}^{\infty} x^{k - 1} e^{- h x} d x = \infty$ for any $k \leq 0$ and $h \in ℝ$ . Then, for $0 < α < \frac{- c - 1}{n}$ we have $n α + c < n \frac{(- c - 1)}{n} + c = - 1$ , and it follows that

$\begin{matrix} d (𝐱) & \propto \int_{0}^{\infty} \frac{π (α)}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} \int_{0}^{\infty} β^{n α + c} exp {- β \sum_{i = 1}^{n} x_{i}} d β d α \\ \geq \int_{0}^{\frac{- c - 1}{n}} \frac{π (α)}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} \int_{0}^{\infty} β^{n α + c} exp {- β \sum_{i = 1}^{n} x_{i}} d β d α \\ = \int_{0}^{\frac{- c - 1}{n}} \frac{π (α)}{Γ {(α)}^{n}} {\prod_{i = 1}^{n} x_{i}^{α}} \times \infty d α = \int_{0}^{\frac{- c - 1}{n}} \infty d α = \infty . \end{matrix}$

and Case i) is proved.

Now suppose $c \geq - 1$ . Denoting

$v (α) = \frac{π (α) Γ (n α + c + 1)}{Γ {(α)}^{n}} \quad \text{and} \quad q (𝐱) = \log (\frac{\frac{1}{n} \sum_{i = 1}^{n} x_{i}}{\sqrt[n]{\prod_{i = 1}^{n} x_{i}}}),$

we have that $q (𝐱) > 0$ by the inequality of the arithmetic and geometric means (which is strict whenever the $x_{i}$ are not all equal), and

$\begin{matrix} d (𝐱) & = \int_{0}^{\infty} v (α) \frac{{(\prod_{i = 1}^{n} x_{i})}^{α}}{{(\sum_{i = 1}^{n} x_{i})}^{n α + c + 1}} d α \propto \int_{0}^{\infty} v (α) \frac{1}{n^{n α}} \frac{{(\sqrt[n]{\prod_{i = 1}^{n} x_{i}})}^{n α}}{{(\frac{1}{n} \sum_{i = 1}^{n} x_{i})}^{n α}} d α = \int_{0}^{\infty} v (α) n^{- n α} e^{- n q (𝐱) α} d α \\ = \int_{0}^{1} v (α) n^{- n α} e^{- n q (𝐱) α} d α + \int_{1}^{\infty} v (α) n^{- n α} e^{- n q (𝐱) α} d α = d_{0} (𝐱) + d_{\infty} (𝐱), \end{matrix}$

where $d_{0} (𝐱) = \int_{0}^{1} v (α) n^{- n α} e^{- n q (𝐱) α} d α$ and $d_{\infty} (𝐱) = \int_{1}^{\infty} v (α) n^{- n α} e^{- n q (𝐱) α} d α$ .

Then $d (𝐱) < \infty$ if and only if $d_{0} (𝐱) < \infty$ and $d_{\infty} (𝐱) < \infty$ . These results lead us to the two remaining cases.
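The rewriting of the data-dependent factor in terms of $q (𝐱)$ can be checked numerically. The following Python sketch compares, on the log scale, ${(\prod_{i = 1}^{n} x_{i})}^{α} / {(\sum_{i = 1}^{n} x_{i})}^{n α}$ with $n^{- n α} e^{- n q (𝐱) α}$ ; the helper name `q_stat`, the toy data and the value of $α$ are hypothetical.

```python
import math

def q_stat(x):
    """q(x) = log(arithmetic mean / geometric mean) of the sample."""
    n = len(x)
    arithmetic_mean = sum(x) / n
    log_geometric_mean = sum(math.log(v) for v in x) / n
    return math.log(arithmetic_mean) - log_geometric_mean

x = [1.2, 0.7, 2.9, 1.8, 0.4, 3.3, 1.1, 2.2]  # toy data, not from the paper
n, alpha = len(x), 1.7                        # arbitrary positive alpha
q = q_stat(x)

# log of (prod x_i)^alpha / (sum x_i)^(n * alpha)
lhs = alpha * sum(math.log(v) for v in x) - n * alpha * math.log(sum(x))
# log of n^(-n * alpha) * exp(-n * q(x) * alpha)
rhs = -n * alpha * math.log(n) - n * q * alpha

print(q, lhs, rhs)
```

Because $q (𝐱)$ is strictly positive for any sample whose values are not all equal, the factor $e^{- n q (𝐱) α}$ decays exponentially in $α$ .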

Case ii): Suppose $c \geq - 1$ and ${lim}_{α \to 0^{+}} π (α) α^{s} = \infty$ $\forall s \in ℕ$ . From Abramowitz & Stegun (1972), we have $Γ (z) \underset{z \to 0^{+}}{\propto} \frac{1}{z}$ . Then, if $c = - 1$

$\begin{matrix} d_{0} (𝐱) & = \int_{0}^{1} \frac{π (α) Γ (n α)}{Γ {(α)}^{n}} n^{- n α} e^{- n q (𝐱) α} d α \propto \int_{0}^{1} \frac{π (α) \frac{1}{n α}}{\frac{1}{α^{n}}} \times 1 \times 1 d α \\ \propto \int_{0}^{1} π (α) α^{n - 1} d α = \int_{1}^{\infty} π (u^{- 1}) u^{- n - 1} d u = \infty, \end{matrix}$

where the last equality comes from the fact that $\lim_{u \to \infty} π (u^{- 1}) u^{- n - 1} = \lim_{α \to 0^{+}} π (α) α^{n + 1} = \infty$ . Therefore, $d (𝐱) = \infty$ if $c = - 1$ .

On the other hand, if $c > - 1$ then $n α + c + 1 > 0$ for $α > 0$ , which implies $Γ (n α + c + 1) \underset{α \to 0^{+}}{\propto} 1$ and

$\begin{matrix} d_{0} (𝐱) & = \int_{0}^{1} \frac{π (α) Γ (n α + c + 1)}{Γ {(α)}^{n}} n^{- n α} e^{- n q α} d α \propto \int_{0}^{1} \frac{π (α)}{\frac{1}{α^{n}}} \times 1 \times 1 d α \\ = \int_{0}^{1} π (α) α^{n} d α = \int_{1}^{\infty} π (u^{- 1}) u^{- n - 2} d u = \infty . \end{matrix}$

Therefore, $d (𝐱) = \infty$ if $c > - 1$ , and Case ii) is proved.

Case iii): Suppose that $c \geq - 1$ and the behavior of $π (α)$ is given by

$π (α) \underset{α \to 0^{+}}{\propto} α^{s_{0}} \quad \text{and} \quad π (α) \underset{α \to \infty}{\propto} α^{s_{\infty}},$

where $s_{0} \in ℝ$ and $s_{\infty} \in ℝ$ . Following Abramowitz & Stegun 1972, p. 260, we obtain that $Γ (z) \underset{z \to \infty}{\propto} z^{z - \frac{1}{2}} e^{- z}$ and $Γ (z + a) \underset{z \to \infty}{\propto} Γ (z) z^{a}$ for $a \in ℝ^{+}$ . Then $Γ (n α + c + 1) \underset{α \to \infty}{\propto} Γ (n α) {(n α)}^{c + 1}$ and

$\begin{matrix} v (α) & = \frac{π (α) Γ (n α + c + 1)}{Γ {(α)}^{n}} \underset{α \to \infty}{\propto} \frac{α^{s_{\infty}} {(n α)}^{n α - \frac{1}{2}} e^{- n α} {(n α)}^{c + 1}}{α^{n α - \frac{n}{2}} e^{- n α}} \\ \propto \frac{α^{s_{\infty} + c + 1} {(n α)}^{n α - \frac{1}{2}}}{α^{n α - \frac{n}{2}}} \propto α^{s_{\infty} + c + \frac{n + 1}{2}} n^{n α} . \end{matrix}$

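Therefore

$\begin{matrix} d_{\infty} (𝐱) & = \int_{1}^{\infty} v (α) n^{- n α} e^{- n q (𝐱) α} d α \propto \int_{1}^{\infty} α^{s_{\infty} + c + \frac{n + 1}{2}} e^{- n q (𝐱) α} d α \\ & = \frac{Γ (s_{\infty} + c + \frac{n + 3}{2}, n q (𝐱))}{{(n q (𝐱))}^{s_{\infty} + c + \frac{n + 3}{2}}} < \infty, \end{matrix}$

where the last equality follows from the substitution $t = n q (𝐱) α$ and $Γ (a, y) = \int_{y}^{\infty} t^{a - 1} e^{- t} d t$ denotes the upper incomplete gamma function, which is finite for every $a \in ℝ$ when $y > 0$ ; i.e., $d_{\infty} (𝐱) < \infty$ for all $s_{\infty} \in ℝ$ . Therefore $d (𝐱) < \infty \Leftrightarrow d_{0} (𝐱) < \infty$ .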
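The growth rate $v (α) \propto α^{s_{\infty} + c + \frac{n + 1}{2}} n^{n α}$ used in this case can be checked numerically. The Python sketch below takes the illustrative choice $π (α) = α^{s}$ and prints the difference between $\log v (α)$ and the logarithm of the claimed rate, which should settle to a constant as $α$ grows. The values of `n`, `c` and `s` are hypothetical, and the limiting constant is the one implied by Stirling's formula with its $\sqrt{2 π}$ factor, worked out here rather than taken from the text.

```python
import math

def log_v(alpha, n, c, s):
    """log v(alpha) for the illustrative prior pi(alpha) = alpha**s."""
    return (s * math.log(alpha)
            + math.lgamma(n * alpha + c + 1.0)
            - n * math.lgamma(alpha))

def log_rate(alpha, n, c, s):
    """log of alpha**(s + c + (n + 1) / 2) * n**(n * alpha)."""
    return (s + c + (n + 1) / 2.0) * math.log(alpha) + n * alpha * math.log(n)

n, c, s = 5, -1.0, -0.9  # hypothetical sample size and exponents

# Constant implied by Gamma(z) ~ sqrt(2 * pi) * z**(z - 1/2) * exp(-z).
stirling_const = (c + 0.5) * math.log(n) + 0.5 * (1 - n) * math.log(2 * math.pi)

alphas = (1e1, 1e2, 1e3, 1e4)
diffs = [log_v(a, n, c, s) - log_rate(a, n, c, s) for a in alphas]
for a, d in zip(alphas, diffs):
    print(f"alpha = {a:>8.0f}  log v - log rate = {d:.6f}")
print(f"Stirling constant = {stirling_const:.6f}")
```

Using `math.lgamma` keeps the computation on the log scale, since $Γ (n α)$ itself overflows double precision for even moderate $α$ .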

Now, following the same steps as in Case ii), if $c = - 1$ we have

$\begin{matrix} d_{0} (𝐱) & = \int_{0}^{1} \frac{π (α) Γ (n α)}{Γ {(α)}^{n}} n^{- n α} e^{- n q (𝐱) α} d α \propto \int_{0}^{1} \frac{α^{s_{0}} \frac{1}{n α}}{\frac{1}{α^{n}}} d α \propto \int_{0}^{1} α^{s_{0} + n - 1} d α, \end{matrix}$

i.e., $d (𝐱) < \infty$ if and only if $n > - s_{0}$ when $c = - 1$ . On the other hand, if $c > - 1$ ,

$\begin{matrix} d_{0} (𝐱) & = \int_{0}^{1} \frac{π (α) Γ (n α + c + 1)}{Γ {(α)}^{n}} n^{- n α} e^{- n q (𝐱) α} d α \propto \int_{0}^{1} \frac{α^{s_{0}}}{\frac{1}{α^{n}}} d α = \int_{0}^{1} α^{s_{0} + n} d α, \end{matrix}$

i.e., $d (𝐱) < \infty$ if and only if $n > - s_{0} - 1$ when $c > - 1$ , and the proof is completed. ◻
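The conditions obtained in Cases i) and iii) can be collected into a small helper. The Python sketch below is only a reading of the proof above, for priors of the form $π (α) β^{c}$ with $π (α) \propto α^{s_{0}}$ near zero and polynomial behavior at infinity. The function name is hypothetical, and the exponents used in the example ( $s_{0} = - 1$ for the GM prior (30) and $s_{0} = - 3 / 2$ for the AM prior) are worked out here from $ψ' (α) \approx α^{- 2}$ as $α \to 0^{+}$ , not taken from the text.

```python
def posterior_is_proper(n, c, s0):
    """Propriety condition read off from Cases i) and iii) of the proof.

    Assumes a prior pi(alpha) * beta**c with pi(alpha) behaving like
    alpha**s0 near zero and like a power of alpha at infinity.
    """
    if c < -1:
        return False       # Case i): the integral over beta diverges
    if c == -1:
        return n > -s0     # Case iii) with c = -1
    return n > -s0 - 1     # Case iii) with c > -1

# Priors proportional to pi(alpha) / beta correspond to c = -1.
for n in (1, 2, 3):
    gm = posterior_is_proper(n, c=-1, s0=-1.0)  # assumed exponent, GM prior
    am = posterior_is_proper(n, c=-1, s0=-1.5)  # assumed exponent, AM prior
    print(n, gm, am)
```

Under these assumed exponents the helper reproduces the requirement $n \geq 2$ stated in Theorems 3.11 and 3.12.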

APPENDIX B

Table I
The $CP_{95\%}$ from the estimates of $α$ and $β$ considering different values of $n$ with N = 10,000 simulated samples.

Table II
The MRE (MSE) for the estimates of $α$ and $β$ considering different sample sizes.

Publication Dates

Publication in this collection: 03 Dec 2021
Date of issue: 2021

History

Received: 22 Nov 2019
Accepted: 8 Feb 2020

This is an open-access article distributed under the terms of the Creative Commons Attribution License

Table I
$𝛉$	n	Uniform		Jeffreys’ Rule		Jeffreys’ Prior		Miller		Reference $α$		Reference $β$		MDIP		Tibshirani		Consensus GM		Consensus AM
		$α$	$β$	$α$	$β$	$α$	$β$	$α$	$β$	$α$	$β$	$α$	$β$	$α$	$β$	$α$	$β$	$α$	$β$	$α$	$β$
$α = 2$, $β = 0.5$	10	0.892	0.891	0.950	0.953	0.942	0.948	0.928	0.936	0.948	0.953	0.943	0.948	0.969	0.964	0.950	0.957	0.949	0.953	0.922	0.928
	20	0.907	0.908	0.946	0.949	0.941	0.945	0.938	0.937	0.944	0.949	0.942	0.946	0.960	0.956	0.947	0.951	0.945	0.949	0.931	0.932
	30	0.918	0.916	0.948	0.950	0.944	0.945	0.937	0.939	0.948	0.951	0.946	0.946	0.958	0.949	0.950	0.950	0.949	0.949	0.933	0.936
	40	0.923	0.921	0.948	0.946	0.944	0.944	0.938	0.942	0.947	0.947	0.943	0.944	0.953	0.948	0.949	0.948	0.948	0.947	0.937	0.938
	50	0.929	0.928	0.950	0.950	0.948	0.947	0.942	0.944	0.951	0.949	0.948	0.948	0.955	0.948	0.950	0.951	0.949	0.949	0.941	0.941
	60	0.929	0.928	0.947	0.946	0.943	0.944	0.941	0.940	0.944	0.946	0.944	0.943	0.948	0.945	0.945	0.946	0.946	0.946	0.938	0.938
	70	0.934	0.931	0.946	0.948	0.943	0.947	0.942	0.943	0.946	0.948	0.944	0.947	0.948	0.948	0.944	0.948	0.946	0.947	0.938	0.938
	80	0.934	0.934	0.948	0.947	0.947	0.948	0.943	0.944	0.946	0.949	0.946	0.945	0.948	0.946	0.949	0.948	0.946	0.948	0.940	0.942
	90	0.938	0.936	0.949	0.949	0.948	0.948	0.944	0.943	0.948	0.948	0.948	0.947	0.950	0.945	0.948	0.949	0.949	0.947	0.944	0.943
	100	0.938	0.933	0.945	0.945	0.945	0.944	0.943	0.940	0.944	0.944	0.944	0.943	0.946	0.942	0.946	0.945	0.944	0.944	0.943	0.941
	110	0.941	0.939	0.948	0.951	0.950	0.950	0.945	0.945	0.946	0.948	0.947	0.950	0.949	0.948	0.948	0.948	0.948	0.947	0.944	0.944
	120	0.942	0.943	0.950	0.951	0.949	0.950	0.948	0.950	0.948	0.953	0.948	0.950	0.950	0.950	0.950	0.952	0.949	0.951	0.946	0.947
$α = 4$, $β = 2$	10	0.886	0.885	0.948	0.951	0.942	0.942	0.928	0.931	0.949	0.952	0.942	0.942	0.920	0.942	0.951	0.954	0.949	0.949	0.918	0.920
	20	0.907	0.905	0.947	0.946	0.942	0.941	0.935	0.933	0.947	0.944	0.943	0.940	0.923	0.940	0.950	0.946	0.947	0.944	0.928	0.928
	30	0.917	0.914	0.952	0.950	0.948	0.947	0.939	0.939	0.952	0.949	0.949	0.947	0.921	0.938	0.950	0.949	0.951	0.948	0.934	0.933
	40	0.923	0.920	0.947	0.946	0.946	0.943	0.939	0.939	0.947	0.948	0.945	0.944	0.925	0.939	0.946	0.948	0.948	0.947	0.935	0.935
	50	0.924	0.926	0.947	0.947	0.944	0.945	0.939	0.940	0.945	0.947	0.946	0.944	0.929	0.943	0.948	0.946	0.945	0.947	0.936	0.937
	60	0.932	0.930	0.950	0.950	0.948	0.946	0.942	0.943	0.951	0.949	0.947	0.947	0.933	0.940	0.952	0.950	0.950	0.948	0.940	0.941
	70	0.929	0.929	0.945	0.946	0.945	0.944	0.940	0.938	0.946	0.946	0.943	0.946	0.931	0.939	0.946	0.947	0.945	0.944	0.938	0.938
	80	0.936	0.936	0.950	0.952	0.949	0.949	0.944	0.946	0.950	0.951	0.949	0.949	0.937	0.942	0.950	0.952	0.948	0.951	0.943	0.945
	90	0.934	0.938	0.946	0.948	0.945	0.948	0.941	0.945	0.944	0.949	0.945	0.947	0.931	0.943	0.944	0.949	0.944	0.948	0.940	0.943
	100	0.941	0.939	0.949	0.952	0.949	0.949	0.947	0.948	0.951	0.951	0.951	0.951	0.936	0.942	0.949	0.950	0.950	0.952	0.946	0.945
	110	0.940	0.939	0.949	0.949	0.949	0.949	0.947	0.947	0.949	0.950	0.947	0.948	0.937	0.943	0.949	0.950	0.950	0.951	0.944	0.946
	120	0.940	0.937	0.949	0.945	0.947	0.944	0.944	0.943	0.947	0.945	0.947	0.944	0.936	0.941	0.946	0.947	0.947	0.945	0.944	0.942

Table II
$𝛉$	n	Uniform	Jeffreys’ Rule	Jeffreys’ Prior	Miller	Reference $α$	Reference $β$	MDIP	Tibshirani	Consensus GM	Consensus AM
$α = 2$	10	1.336(1.305)	1.130(0.655)	1.175(0.768)	1.232(0.928)	1.124(0.646)	1.169(0.762)	1.082(0.304)	1.067(0.542)	1.131(0.664)	1.252(1.014)
	20	1.209(0.598)	1.080(0.354)	1.109(0.397)	1.144(0.457)	1.076(0.351)	1.105(0.394)	1.070(0.230)	1.041(0.312)	1.081(0.358)	1.156(0.487)
	30	1.153(0.360)	1.059(0.234)	1.080(0.256)	1.106(0.288)	1.056(0.232)	1.077(0.254)	1.061(0.179)	1.030(0.211)	1.060(0.236)	1.114(0.302)
	40	1.122(0.256)	1.048(0.177)	1.064(0.191)	1.084(0.211)	1.045(0.176)	1.061(0.190)	1.054(0.149)	1.025(0.164)	1.048(0.179)	1.091(0.220)
	50	1.100(0.195)	1.039(0.142)	1.053(0.151)	1.070(0.164)	1.038(0.141)	1.051(0.151)	1.049(0.124)	1.021(0.133)	1.040(0.143)	1.075(0.170)
	60	1.084(0.157)	1.032(0.120)	1.044(0.126)	1.058(0.136)	1.031(0.119)	1.042(0.125)	1.042(0.109)	1.016(0.113)	1.032(0.120)	1.062(0.140)
	70	1.071(0.127)	1.026(0.100)	1.036(0.105)	1.049(0.111)	1.025(0.099)	1.034(0.104)	1.036(0.093)	1.012(0.095)	1.026(0.100)	1.052(0.114)
	80	1.067(0.110)	1.027(0.088)	1.036(0.092)	1.047(0.097)	1.026(0.087)	1.035(0.092)	1.038(0.083)	1.015(0.083)	1.028(0.088)	1.050(0.100)
	90	1.057(0.094)	1.022(0.077)	1.030(0.080)	1.040(0.084)	1.021(0.077)	1.029(0.080)	1.032(0.074)	1.011(0.074)	1.022(0.077)	1.043(0.086)
	100	1.055(0.086)	1.022(0.071)	1.030(0.074)	1.039(0.078)	1.021(0.071)	1.028(0.074)	1.032(0.069)	1.012(0.068)	1.023(0.071)	1.041(0.079)
	110	1.047(0.074)	1.018(0.063)	1.024(0.065)	1.033(0.068)	1.017(0.062)	1.023(0.065)	1.027(0.061)	1.009(0.061)	1.018(0.063)	1.035(0.069)
	120	1.045(0.069)	1.018(0.059)	1.024(0.060)	1.031(0.063)	1.017(0.058)	1.023(0.060)	1.027(0.058)	1.009(0.057)	1.018(0.059)	1.033(0.064)
$β = 0.5$	10	1.395(0.107)	1.157(0.053)	1.204(0.062)	1.262(0.074)	1.151(0.053)	1.198(0.062)	1.160(0.033)	1.093(0.044)	1.159(0.054)	1.283(0.081)
	20	1.246(0.049)	1.098(0.029)	1.127(0.032)	1.163(0.037)	1.094(0.028)	1.123(0.032)	1.121(0.023)	1.058(0.025)	1.099(0.029)	1.175(0.039)
	30	1.181(0.030)	1.073(0.019)	1.094(0.021)	1.121(0.023)	1.070(0.019)	1.091(0.021)	1.101(0.017)	1.044(0.018)	1.074(0.020)	1.129(0.025)
	40	1.143(0.022)	1.059(0.015)	1.075(0.016)	1.096(0.018)	1.056(0.015)	1.072(0.016)	1.085(0.014)	1.035(0.014)	1.059(0.015)	1.102(0.018)
	50	1.118(0.016)	1.048(0.011)	1.061(0.012)	1.079(0.013)	1.046(0.011)	1.059(0.012)	1.074(0.011)	1.029(0.011)	1.048(0.012)	1.084(0.014)
	60	1.100(0.013)	1.041(0.010)	1.052(0.010)	1.067(0.011)	1.039(0.010)	1.050(0.010)	1.065(0.010)	1.025(0.009)	1.041(0.010)	1.071(0.011)
	70	1.085(0.011)	1.034(0.008)	1.044(0.009)	1.056(0.009)	1.032(0.008)	1.042(0.009)	1.057(0.008)	1.020(0.008)	1.034(0.008)	1.060(0.009)
	80	1.080(0.009)	1.034(0.007)	1.043(0.008)	1.054(0.008)	1.033(0.007)	1.042(0.008)	1.055(0.007)	1.021(0.007)	1.034(0.007)	1.057(0.008)
	90	1.068(0.008)	1.027(0.006)	1.035(0.007)	1.045(0.007)	1.026(0.006)	1.034(0.007)	1.047(0.006)	1.016(0.006)	1.027(0.006)	1.048(0.007)
	100	1.064(0.007)	1.027(0.006)	1.034(0.006)	1.043(0.006)	1.026(0.006)	1.033(0.006)	1.046(0.006)	1.017(0.006)	1.027(0.006)	1.046(0.006)
	110	1.055(0.006)	1.022(0.005)	1.028(0.005)	1.036(0.005)	1.021(0.005)	1.027(0.005)	1.039(0.005)	1.012(0.005)	1.022(0.005)	1.039(0.006)
	120	1.053(0.006)	1.022(0.005)	1.028(0.005)	1.036(0.005)	1.021(0.005)	1.027(0.005)	1.039(0.005)	1.014(0.005)	1.022(0.005)	1.038(0.005)
$α = 4$	10	1.348(5.558)	1.128(2.743)	1.179(3.256)	1.238(3.930)	1.124(2.724)	1.177(3.243)	0.841(0.779)	1.066(2.290)	1.134(2.810)	1.270(4.420)
	20	1.217(2.522)	1.079(1.467)	1.111(1.659)	1.148(1.909)	1.077(1.458)	1.109(1.651)	0.879(0.616)	1.040(1.302)	1.083(1.493)	1.167(2.086)
	30	1.159(1.535)	1.059(0.986)	1.082(1.084)	1.109(1.217)	1.057(0.982)	1.081(1.081)	0.902(0.507)	1.030(0.897)	1.062(1.000)	1.123(1.308)
	40	1.127(1.136)	1.048(0.787)	1.067(0.850)	1.088(0.934)	1.047(0.784)	1.066(0.849)	0.919(0.452)	1.026(0.730)	1.050(0.796)	1.099(0.993)
	50	1.105(0.832)	1.040(0.601)	1.055(0.642)	1.072(0.698)	1.039(0.599)	1.054(0.642)	0.929(0.380)	1.021(0.563)	1.042(0.606)	1.081(0.736)
	60	1.088(0.673)	1.033(0.508)	1.045(0.537)	1.060(0.577)	1.032(0.506)	1.045(0.536)	0.936(0.346)	1.017(0.480)	1.034(0.511)	1.068(0.603)
	70	1.077(0.562)	1.029(0.436)	1.040(0.459)	1.053(0.489)	1.028(0.435)	1.040(0.458)	0.943(0.311)	1.015(0.416)	1.030(0.440)	1.060(0.510)
	80	1.069(0.475)	1.027(0.376)	1.037(0.394)	1.048(0.418)	1.026(0.375)	1.036(0.393)	0.949(0.277)	1.015(0.359)	1.028(0.379)	1.054(0.433)
	90	1.061(0.410)	1.023(0.333)	1.031(0.346)	1.041(0.365)	1.022(0.332)	1.031(0.345)	0.952(0.255)	1.011(0.319)	1.023(0.334)	1.047(0.377)
	100	1.053(0.350)	1.018(0.290)	1.027(0.301)	1.036(0.315)	1.018(0.290)	1.026(0.300)	0.954(0.232)	1.009(0.280)	1.019(0.291)	1.040(0.325)
	110	1.049(0.320)	1.018(0.269)	1.025(0.278)	1.034(0.291)	1.017(0.268)	1.025(0.278)	0.958(0.218)	1.009(0.261)	1.019(0.270)	1.038(0.299)
	120	1.046(0.292)	1.017(0.247)	1.024(0.256)	1.032(0.267)	1.017(0.247)	1.023(0.255)	0.962(0.203)	1.009(0.240)	1.018(0.249)	1.035(0.273)
$β = 2$	10	1.377(1.616)	1.142(0.798)	1.194(0.942)	1.253(1.130)	1.138(0.793)	1.191(0.939)	0.877(0.198)	1.079(0.667)	1.148(0.817)	1.285(1.265)
	20	1.235(0.728)	1.088(0.423)	1.120(0.476)	1.157(0.545)	1.086(0.420)	1.118(0.474)	0.903(0.164)	1.049(0.375)	1.092(0.430)	1.177(0.593)
	30	1.174(0.446)	1.066(0.286)	1.090(0.313)	1.117(0.350)	1.064(0.285)	1.088(0.313)	0.921(0.138)	1.037(0.260)	1.069(0.290)	1.131(0.375)
	40	1.138(0.327)	1.053(0.226)	1.072(0.243)	1.093(0.266)	1.052(0.225)	1.071(0.243)	0.933(0.124)	1.030(0.210)	1.055(0.228)	1.104(0.282)
	50	1.115(0.240)	1.045(0.172)	1.060(0.184)	1.078(0.200)	1.044(0.172)	1.059(0.184)	0.942(0.104)	1.026(0.161)	1.047(0.174)	1.087(0.210)
	60	1.096(0.193)	1.037(0.145)	1.050(0.153)	1.065(0.164)	1.036(0.145)	1.049(0.153)	0.947(0.095)	1.021(0.137)	1.038(0.146)	1.072(0.171)
	70	1.083(0.160)	1.032(0.124)	1.043(0.130)	1.056(0.138)	1.031(0.123)	1.042(0.130)	0.952(0.086)	1.018(0.118)	1.033(0.125)	1.063(0.144)
	80	1.076(0.136)	1.030(0.108)	1.040(0.113)	1.051(0.119)	1.029(0.107)	1.039(0.112)	0.958(0.077)	1.018(0.103)	1.031(0.108)	1.057(0.123)
	90	1.067(0.116)	1.026(0.094)	1.035(0.098)	1.045(0.103)	1.025(0.094)	1.034(0.098)	0.961(0.070)	1.015(0.090)	1.027(0.094)	1.050(0.106)
	100	1.058(0.100)	1.021(0.083)	1.029(0.086)	1.038(0.090)	1.020(0.083)	1.028(0.086)	0.961(0.064)	1.011(0.080)	1.022(0.083)	1.043(0.092)
	110	1.053(0.091)	1.020(0.076)	1.027(0.079)	1.035(0.082)	1.019(0.076)	1.027(0.079)	0.964(0.060)	1.011(0.074)	1.020(0.077)	1.040(0.084)
	120	1.051(0.085)	1.020(0.071)	1.026(0.074)	1.034(0.077)	1.019(0.071)	1.026(0.074)	0.968(0.057)	1.011(0.069)	1.020(0.072)	1.038(0.079)
