使用Lasso-Cp選取線性模型解釋變數之探討

黃信雄; Huang, Hsin-Hsiung

DC 欄位	值	語言
dc.contributor	陳宏	en
dc.contributor	臺灣大學：數學研究所	zh_TW
dc.contributor.author	黃信雄	zh
dc.contributor.author	Huang, Hsin-Hsiung	en
dc.creator	黃信雄	zh
dc.creator	Huang, Hsin-Hsiung	en
dc.date	2006	en
dc.date.accessioned	2007-11-28T02:20:21Z	-
dc.date.accessioned	2018-06-28T09:08:16Z	-
dc.date.available	2007-11-28T02:20:21Z	-
dc.date.available	2018-06-28T09:08:16Z	-
dc.date.issued	2006	-
dc.identifier	en-US	en
dc.identifier.uri	http://ntur.lib.ntu.edu.tw//handle/246246/59449	-
dc.description.abstract	當線性回歸模型中的自變數極多時, 正規化是個常用的辦法來達到降低被選取回歸模型複雜度之目的。Lasso (Tibshirani, 1996) 被認為是可以達到選取模型參數精簡目的之正規化方法。當線性回歸模型中的自變數為么正且自變數個數及樣本數個數相近時, 本論文探討使用Lasso 與Cp辦法選擇重要自變數的操作性質。考慮的操作性質, 包含了被選取自變數的個數及被選取真實自變數佔被選取自變數的比例。當Lasso 與Cp作為多重假設檢定辦法時, 這些結論也適用之。	zh_TW
dc.description.abstract	When the number of predictors in a linear regression model is large, regularization is a commonly used method to reduce the complexity of the fitted model. LASSO (Tibshirani, 1996) is being advocated as a useful regulation method for achieving sparsity or parsimony of resulting fitted model. In this thesis, we study the operating characteristics of LASSO coupled with Mallows’Cp on identifying the orthonormal predictor variables of linear regression when the number of predictors and the number of the observation are of the same magnitude. The characteristics includes the chosen number of predictors and the proportion of correctly identified predictors. This result can be useful in multiple testing.	en
dc.description.tableofcontents	目錄口試委員會審定書.......................................i 誌謝...................................................ii 中文摘要...............................................iii 英文摘要...............................................iv 第一章Introduction.....................................1 第二章Lasso............................................3 第一節Multiple Hypothesis Testing......................4 第二節Regression.......................................9 第三章Random walk induced by Mallows'Cp................11 第四章Estimate of the degrees of freedom...............16 第五章Null and Sparse Models...........................23 第六章Simulation Studies When n = m and Xnm = Im.......25 第一節Study 1: Null Model..............................26 第二節Study 2: The spacings determined by 2exp(1)-2....27 第三節Study 3: Sparse Model with Cp of Penalty 2.......28 第四節Study 4: Sparse Model with Cp of Penalty 4.......31 第五節Study 5: Effect on Penalty 4 and 2 under Abundant Models.................................................33 第七章Simulation Studies When n~5m and XTnmXnm=Im......36 第一節Study 6: Null Model..............................36 第二節Study 7: Effect on Penalty 4 and 2 under Sparse Model..................................................37 第三節Study 8: Effect on Penalty 4 and 2 under Abundant Models.................................................39 第八章Conclusions and Discussions......................41 參考文獻...............................................44	zh_TW
dc.format.extent	704517 bytes	-
dc.format.mimetype	application/pdf	-
dc.language	en-US	en
dc.language.iso	en_US	-
dc.subject	最小角度回歸	en
dc.subject	Least angle regression	en
dc.subject	Forward selection	en
dc.title	使用Lasso-Cp選取線性模型解釋變數之探討	zh
dc.title	Study on the Lasso Method for Variable Selection in Linear Regression Model with Mallows' Cp	en
dc.type	thesis	en
dc.identifier.uri.fulltext	http://ntur.lib.ntu.edu.tw/bitstream/246246/59449/1/ntu-95-R93221018-1.pdf	-
dc.relation.reference	[1] Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control 19, 716-723. [2] Balkema, A.A. and de Haan, L. (1972). On R. von Mises’ condition for the domain of attraction of exp(¡e−x). Annals of Mathematical Statistics. 43, 1352-1354. [3] Benjamini, Y. and Hochberg, Y. (1995). Controlling the False Discovery Rate:a Practical and Powerful Approach to Multiple Hypothesis Testing. Journal of the Royal Statistical Society, Series B, 57, 289-300. [4] David, H.A. and Nagaraja, H.N. (2003). Order Statistics. Third Edition. Wiley Interscience. [5] de Haan, L. (1970). On regular variation and its application to the weak convergence of sample extremes. Thesis, University of Amsterdam, Mathematical Centre tract, 32, 296, 299, 301. [6] Donoho, D. and Johnstone, I. (1994). Ideal spatial adaptation by wavelet shrinkage. Biometrika, 81, 425-455. [7] Efron, B., Hastie, T., Johnstone, I., and Tibshirani, R. (2004). Least angle regression (with discussion). Annals of Statistics, 32, 407-499. [8] Gnedenko, B. (1943). Sur la distribution limite du terme maximum d’une serie aleatoire. Annals of Mathematics, 44, 423-453. [9] Hoerl, A.E. and Kennard, R.W. (1970). Ridge regression: Biased estimation for nonorthogonal problems.Technometrics, 12, 55-67. [10] Kolmogorov, A.N. (1933). Grundgebriffe der Wahrscheinlichkeitsrechnung. Berlin: Springer-Verlag (English trans. 1950). Foundation of the Theory of Probability. New York: Chelsea. 2nd ed. (1974) in German and Russian. [11] Li, K.C. (2005). Likelihood of false positives in hypotheses with strongest evidence from multiple testing: the p-value memoryless conversion approach. Technical report. [12] Li, K.C. (1985). From Stein’s unbiased risk estimates to the method of generalized cross validation. Annals of Statistics, 13, 1352-1377. [13] Mallows, C.L. (1973). Some comments on Cp. Technometrics. 15, 661-675. [14] Murray, W., Gill, P. and Wright, M. (1981). Practical Optimization. New York:Academic Press. [15] Pyke, R. (1965). Spacings. Journal of the Royal Statistical Society, Series B 27, 395-436. [16] Schwartz, G. (1978). Estimating the dimension of a model. Annals of Statistics 6, 461-464. [17] Stein, C.M. (1981). Estimation of the Mean of a Multivariate Normal Distribution. Annals of Statistics 9, 1135-1151. [18] Tibshirani, R. (1996). Regression Shrinkage and Selection via the Lasso. Journal of the Royal Statistical Society, Series B 58, 267-288. [19] von Mises, R. (1936). La distribution de la plus grande de n valeurs. Rev. Math. Union Interbalcanique, 1141-160. [20] Woodrofe, M. (1982). On Model Selection and the ARC Sin Laws. Annals of Statistics, 10, 1182-1194. [21] Zhang, P. (1992) On the Distributional Properties of Model Selection Criteria. Journal of the American Statistical Association 87, 732-737. [22] Zou, H., Hastie, T. and Tibshirani, R. (2004). On the “Degrees of Freedom” of the Lasso. Technical Report. [23] Zou, H. (2006). The Adaptive Lasso and Its Oracle Properties. Journal of the American Statistical Association 101, 1418-1429.	en
item.languageiso639-1	en_US	-
item.fulltext	with fulltext	-
item.grantfulltext	open	-
item.openairetype	thesis	-
item.openairecristype	http://purl.org/coar/resource_type/c_46ec	-
item.cerifentitytype	Publications	-
顯示於：	數學系

文件中的檔案：

檔案	描述	大小	格式
ntu-95-R93221018-1.pdf		23.53 kB	Adobe PDF	檢視/開啟

顯示文件簡單紀錄

Page view(s) 5

172

checked on 2024/4/20

下載 5

checked on 2024/4/20

Google Scholar^TM

檢查

TAIR相關文章

文件中的檔案：

Page view(s) 5

下載 5

Google ScholarTM

Google Scholar^TM