Maximum likelihood estimation
Contents
Maximum likelihood estimation¶
Time series model¶
a specification of the joint distribution of \(\{z_𝑡\}\)
definition: Likelihood function
Note
The likelihood function is identical in functional form to the PDF of \(\mathbf{z}\), \(p(\mathbf{z};\boldsymbol \theta)\), but is interpreted as a function of \(\boldsymbol \theta\), for a given value of \(\mathbf{z}\), rather than as a function of \(\mathbf{z}\) for a given value of \(\boldsymbol \theta\).
definition: Log-likelihood function
The maximum likelihood estimator (MLE)¶
Rationale for MLE¶
For a given \(\boldsymbol \theta\), the value of \(p(\mathbf{z}; \boldsymbol \theta) d \mathbf{z}\) evaluated at the observed sample \(\mathbf{z}\) tells us what is the probability of observing a sample in a small neighborhood around the actual \(\mathbf{z}\) for that value of \(\boldsymbol \theta\). Compared to the MLE \(\hat {\boldsymbol \theta}\), any other value of \(\boldsymbol \theta\) is associated with a pdf that assigns a lower probability of observing such a sample. Therefore, \(\hat {\boldsymbol \theta}\) is the value most supported by the observed sample.
Note
Difference between ML estimator and ML estimate:
estimator: \(\hat {\boldsymbol \theta}\) as a function of a generic sample \(\mathbf{z}\)
estimate: the value \(\hat {\boldsymbol \theta}\) at a particular sample \(\mathbf{z}\)
Score¶
describes the steepness of log-likelihood function
MLE \(\hat {\boldsymbol \theta}\) solves
Observed Fisher information¶
describes the curvature of the log-likelihood function at the maximum \(\hat {\boldsymbol \theta}\)
measures how much information about \(\boldsymbol \theta\) we have at the MLE.
Expected Fisher information¶
expected curvature of the log-likelihood function
measures how much information about \(\boldsymbol \theta\) we can expect to have
Consistency and asymptotic normality of MLE¶
Assumption: \(\mathbf{z}\) is a draw from \( p(\mathbf{z}; \boldsymbol \theta_0)\), \(\boldsymbol \theta_0\) - true value of \(\boldsymbol \theta\)
\(\hat {\boldsymbol \theta}\) is consitent estimator of \(\boldsymbol \theta_0\)
\(\hat {\boldsymbol \theta}\) is asymptotically normally distributed
where