In mixture with varying concentrations model (MVC) one deals with a nonhomogeneous sample which consists of subjects belonging to a fixed number of different populations (mixture components). The population which a subject belongs to is unknown, but the probabilities to belong to a given component are known and vary from observation to observation. The distribution of subjects’ observed features depends on the component which it belongs to.
Generalized estimating equations (GEE) for Euclidean parameters in MVC models are considered. Under suitable assumptions the obtained estimators are asymptotically normal. A jackknife (JK) technique for the estimation of their asymptotic covariance matrices is described. Consistency of JK-estimators is demonstrated. An application to a model of mixture of nonlinear regressions and a real life example are presented.
Principal Component Analysis (PCA) is a classical technique of dimension reduction for multivariate data. When the data are a mixture of subjects from different subpopulations one can be interested in PCA of some (or each) subpopulation separately. In this paper estimators are considered for PC directions and corresponding eigenvectors of subpopulations in the nonparametric model of mixture with varying concentrations. Consistency and asymptotic normality of obtained estimators are proved. These results allow one to construct confidence sets for the PC model parameters. Performance of such confidence intervals for the leading eigenvalues is investigated via simulations.
We consider a mixture with varying concentrations in which each component is described by a nonlinear regression model. A modified least squares estimator is used to estimate the regressions parameters. Asymptotic normality of the derived estimators is demonstrated. This result is applied to confidence sets construction. Performance of the confidence sets is assessed by simulations.
A general jackknife estimator for the asymptotic covariance of moment estimators is considered in the case when the sample is taken from a mixture with varying concentrations of components. Consistency of the estimator is demonstrated. A fast algorithm for its calculation is described. The estimator is applied to construction of confidence sets for regression parameters in the linear regression with errors in variables. An application to sociological data analysis is considered.
We consider a finite mixture model with varying mixing probabilities. Linear regression models are assumed for observed variables with coefficients depending on the mixture component the observed subject belongs to. A modification of the least-squares estimator is proposed for estimation of the regression coefficients. Consistency and asymptotic normality of the estimates is demonstrated.