dc.description.abstract | Based on cluster measurements, our research interest mainly focuses on seriating the uncertainty in the degree of health or functioning of the body for collected subjects. For this problem, a latent variable is used to represent an unobserved seriation. In this thesis, some adequately and widely used joint models of a latent variable and cluster measurements are proposed to predict the most possible occurring value of a latent variable, which is taken in
our seriation procedure. Since a latent variable is considered in modeling, a popular expectation and maximization (EM) algorithm is implemented for the estimation of parameters in the observed likelihood function. Moreover, a parametric bootstrapping method
is considered to generate latent values and bootstrap samples, which are used to estimate seriation indices such as the correlation and concordance proportion in the evaluation of seriation. To examine the performance of the developed procedures, a class of simulations is conducted. From the numerical studies, we further detect that the evaluation indices computed based on the maximum likelihood estimators or the true parameters are very close although the accuracy of estimators relies on the sample size. Thus, a computationally efficient approach is proposed to estimate seriation indices. Finally, the seriation and evaluation procedures are applied to a CD4 depletion study. | en |
dc.relation.reference | Breslow, N. E. and Clayton, D. G. (1993). Approximate
inference in generalized linear mixed models. Journal of the
American Statistical Association. 88, 125-134.
Breslow, N. E. and Lin, X. (1995). Bias correction in
generalized linear mixed models with a single component of
dispersion. Biometrika. 82, 81-92.
Casella, G., and Robert, C. P. (1996).
Rao-Blackwellisation of sampling schemes. Biometrika. 83, 81-94.
Chiang, C. T. (2005). Comparisons between simultaneous
and componentwisw splines for varying-coefficient models.
Annals of the Institute of Statistical Mathematics. To appear.
Dempster, A. P., Laird, N. M., and Rubin, D. B. (1977).
Maximum likelihood from incomplete observations. Journal of
the Royal Statistical Society, Series B. 39, 1-38.
Efron, B. and Tibshirani, R. J. (1993). An
introduction to the bootstrap. Chapman and Hall, New York.
Holland, P. W. and Wainer, H. (1993). Differential
item functioning. Lawrence Erlbaum associates, New Jersey.
Kaslow, R. A., Ostrow, D. G., Detels, R., Phair, J. P.,
Polk, B. F. and Rinaldo, C. R. (1987). The multicenter AIDS cohort study: Rationale, organization and selected characteristics of the participants. American Journal of Epidemiology. 126,310-318.
Larsen, K. (2004). Joint analysis of time-to-event and
multiple binary indicators of latent classes. Biometrics. 60, 85-92.
Lin, X. and Breslow, N. E. (1996). Bias correction in
generalized linear mixed models with multiple components of
dispersion. Jornal of the American Statistical Association.
91, 1007-1016.
Lin, H., McCulloch, C. E., Turnbull, B. W., Slate, E. H.,
and Clark, L. C. (2000). A latent class mixed model for analysing biomarker trajectories with irregularly scheduled observations. Statistics in Medicine. 19, 1303-1318.
Wu, C. O. and Chiang, C. T. (2000). Kernel estimation and
its applications of a varying coefficient model with longitudinal data. Statistica Sinica. 10, 433-456.
Wu, J. (1983). On the convergence properties of the EM
algorithm. The Annals of Statistics. 1, 95-103 | en |