https://scholars.lib.ntu.edu.tw/handle/123456789/118697
標題: | 低階多項式資料映射與支持向量機 Low-degree Polynomial Mapping of Data for SVM |
作者: | 張瀠文 Chang, Yin-Wen |
關鍵字: | 分解方法;低階多項式映射;核心函數;支持向量機;decomposition methods;low-degree polynomial mapping;kernel functions;support vector machines | 公開日期: | 2009 | 摘要: | 支持向量機常使用非線性映射函數將資料點映到高維度空間,以便於將非線性分布的資料點分類。核心函數則可以解決映射到高維度空間之後的資料向量有過多特徵而難以訓練的問題。然而,在大規模資料上訓練支持向量機所需的時間相當長。本論文根據最近在訓練大規模線性支持向量機(不使用非線性核心的支持向量機)上的研究,提出一個在所需訓練時間與測試準確度之間取得平衡的方法。本研究將線性支持向量機的快速訓練方法應用於以低階多項式映射函數展開的資料上。此方法有快速訓練的好處,同時可以達到與使用高度非線性核心函數相近的測試準確度。實驗顯示在特定大規模資料集上,所提方法確實可以以較少的時間得出相近的準確度。 Non-linear mapping functions have long been used in SVM to transform data into a higher dimensional space, allowing the classifier to separate non-linearly distributed data instances. Kernel tricks are used to avoid the problem of a huge number of features of the mapped data point. However, the training/testing for large data is often time consuming. Following the recent advances in training large linear SVM (i.e., SVM without using nonlinear kernels), this work proposes a method that strikes a balance between the training/testing speed and the testing accuracy. We apply the fast training method for linear SVM to the expanded form of data under low-degree polynomial mappings. The method enjoys the fast training/testing, but may achieve testing accuracy close to that of using highly nonlinear kernels. Empirical experiments show that the proposed method is useful for certain large-scale data sets. |
URI: | http://ntur.lib.ntu.edu.tw//handle/246246/185422 |
顯示於: | 資訊工程學系 |
檔案 | 描述 | 大小 | 格式 | |
---|---|---|---|---|
ntu-98-R96922032-1.pdf | 23.32 kB | Adobe PDF | 檢視/開啟 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。