2006-08-012024-05-17https://scholars.lib.ntu.edu.tw/handle/123456789/692033摘要:依據各試驗單位的特性建&#63991;樣本分&#63952;法則, 可細分為&#63864;個步驟: (1) 篩選&#63849;個可明確判別群集的變&#63849;, (2) &#63965;用選擇的變&#63849;建&#63991;最佳分&#63952;法則。其中步驟 (1) 通稱為特徵篩選 (feature selection), 當可供做為分&#63952;依據的變&#63849;個&#63849;很多時, 選擇少&#63849;幾個足以適當分&#63952;樣本的變&#63849;除&#63930;有助於&#64009;低成本, 挑選適當的變&#63849;也是準確分&#63952;樣本的關鍵 (Sima et al., 2005)。變&#63849;篩選標準可歸&#63952;為相關性&#64001;&#63870;與錯分&#63841;&#64001;&#63870;&#63864;大&#63952;。錯分&#63841;&#64001;&#63870;與分&#63952;模式有關, &#63847;同分&#63952;模式結果相&#63842;。相關性&#64001;<br> Abstract: The problem of classification is to assign objects to one of the mutually exclusive subgroups in the population based on the object's characteristics. To build a precise rule of classification, a two-step procedure is usually performed on the training dataset: (1) selecting a few features that are most informative in the sense of decision making; (2) deriving the formula that outputs optimal allocation of objects. Selecting appropriate features is particularly essential for a successful classification. Recent methods of feature selection consider either the misclassification rate of objects given information of a set of variables, or the association between variables and class label. The former yields inconsistent results for different settings of classifiers. The later is subject to the choice of association measures. The analysis of actual data from a study of breast cancer gene expression is included. Hsing et al. (2005) has proposed a new measure of association, the coefficient of intrinsic dependence, or CID. The CID captures not only linear but general association among variables. It was also demonstrated that CID is capable of putting variables in appropriate order according to their degree of association to the target variable even when sample size is small. This research will broaden the work of Hsing et al. (2005) by applying CID in feature selection. It will be followed by construction of Bayes classifiers and comparisons to conventional methods.本質相關係&#63849分&#63952法則特徵篩選微陣&#63900CIDclassificationfeature selectionmicroarray利用本質相關係數建立樣本分類法則之可行性評估