Comparison of the EM, CEM and SEM algorithms in the estimation of finite mixtures of linear mixed models: a simulation study

Finite mixture models are a widely known method for modelling data that arise from a heterogeneous population. Within the family of mixtures of regression models, mixtures of linear mixed models have also been applied in different areas since, besides taking into consideration the heterogeneity in t...

Full description

Saved in:

Bibliographic Details
Published in	Computational statistics Vol. 36; no. 4; pp. 2507 - 2533
Main Authors	Novais, Luísa, Faria, Susana
Format	Journal Article
Language	English
Published	Berlin/Heidelberg Springer Berlin Heidelberg 01.12.2021 Springer Nature B.V
Subjects	Algorithms Economic Theory/Quantitative Economics/Mathematical Methods Heterogeneity Iterative methods Mathematics and Statistics Maximization Maximum likelihood estimates Maximum likelihood estimation Maximum likelihood estimators Optimization Original Paper Parameters Performance evaluation Probabilistic models Probability and Statistics in Computer Science Probability Theory and Stochastic Processes Regression models Statistics Maximum likelihood estimation Iterative algorithms Mixture models
Online Access	Get full text
ISSN	0943-4062 1613-9658 1613-9658
DOI	10.1007/s00180-021-01088-1

Cover

More Information
Summary:	Finite mixture models are a widely known method for modelling data that arise from a heterogeneous population. Within the family of mixtures of regression models, mixtures of linear mixed models have also been applied in different areas since, besides taking into consideration the heterogeneity in the population, they also allow to take into account the correlation between observations from the same individual. One of the main issues in mixture models concerns the estimation of the parameters. Maximum likelihood estimation is one of the most used methods in the estimation of the parameters for mixture models. However, the maximization of the log-likelihood function in mixture models is complex, producing in many cases infinite solutions whereby the maximum likelihood estimator may not exist, at least globally. For this reason, it is common to resort to iterative methods, in particular to the Expectation-Maximization (EM) algorithm. However, the slow convergence and the selection of initial values are two of biggest issues of the EM algorithm, the reason why some modified versions of this algorithm have been developed over the years. In this article we compare the performance of the EM, Classification EM (CEM) and Stochastic EM (SEM) algorithms in the estimation of the parameters for mixtures of linear mixed models. In order to evaluate their performance, we carry out a simulation study and a real data application. The results show that the CEM algorithm is the least computationally demanding algorithm, although the three algorithms provide similar maximum likelihood estimates for the parameters.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	0943-4062 1613-9658 1613-9658
DOI:	10.1007/s00180-021-01088-1