Maximum likelihood estimation¶

Time series model¶

a specification of the joint distribution of ${z_{𝑡}}$

p (z; θ)

definition: Likelihood function

L (θ | z) = p (z; θ)

Note

The likelihood function is identical in functional form to the PDF of $z$ , $p (z; θ)$ , but is interpreted as a function of $θ$ , for a given value of $z$ , rather than as a function of $z$ for a given value of $θ$ .

definition: Log-likelihood function

ℓ (θ | z) = \log (L (θ | z)

The maximum likelihood estimator (MLE)¶

\begin{array}{r} \begin{aligned} \hat{θ} & = \underset{θ}{argmax} L (θ | z) \\ = \underset{θ}{argmax} ℓ (θ | z) \end{aligned} \end{array}

Rationale for MLE¶

For a given $θ$ , the value of $p (z; θ) d z$ evaluated at the observed sample $z$ tells us what is the probability of observing a sample in a small neighborhood around the actual $z$ for that value of $θ$ . Compared to the MLE $\hat{θ}$ , any other value of $θ$ is associated with a pdf that assigns a lower probability of observing such a sample. Therefore, $\hat{θ}$ is the value most supported by the observed sample.

Note

Difference between ML estimator and ML estimate:

estimator: $\hat{θ}$ as a function of a generic sample $z$
estimate: the value $\hat{θ}$ at a particular sample $z$

Score¶

S_{T} (θ) = \frac{\partial}{\partial θ} ℓ (θ | z)

describes the steepness of log-likelihood function

MLE $\hat{θ}$ solves

S_{T} (\hat{θ}) = 0

Observed Fisher information¶

\begin{array}{r} \begin{aligned} I_{T} (\hat{θ}) & = - \frac{\partial}{\partial θ} S_{T} (θ) |_{\hat{θ}} \\ = - \frac{\partial^{2}}{\partial θ \partial θ^{'}} ℓ (θ | z) |_{\hat{θ}} \end{aligned} \end{array}

describes the curvature of the log-likelihood function at the maximum $\hat{θ}$

measures how much information about $θ$ we have at the MLE.

Expected Fisher information¶

I_{T} (θ) = E [S_{T} (θ) S_{T} (θ)^{'}]

I_{T} (θ) = - E [\frac{\partial}{\partial θ} S_{T} (θ)] = E [I_{T} (θ)]

expected curvature of the log-likelihood function

measures how much information about $θ$ we can expect to have

Consistency and asymptotic normality of MLE¶

Assumption: $z$ is a draw from $p (z; θ_{0})$ , $θ_{0}$ - true value of $θ$

$\hat{θ}$ is consitent estimator of $θ_{0}$

\hat{θ} ⟶ θ_{0}

$\hat{θ}$ is asymptotically normally distributed

\sqrt{T} (\hat{θ} - θ_{0}) ⟶ N (0, I_{\infty}^{- 1} (θ_{0}))

where

I_{\infty} (θ) = \lim_{T \to \infty} \frac{1}{T} I_{T} (θ)

\hat{θ} \overset{a}{\sim} N (θ_{0}, \frac{1}{T} I_{\infty}^{- 1} (θ_{0}))

Maximum likelihood estimation

Contents