Analysis and Implementation of Large-scale Linear RankSVM in Distributed Environments
Date Issued
2016
Author(s)
Huang, Wei-Lun
Abstract
Linear rankSVM is a useful method for quickly producing a baseline model in learning to rank. Although its parallelization has been investigated and implemented on GPUs, a single machine may still be unable to handle large-scale data sets. In this thesis, we propose a distributed trust region Newton method for training L2-loss linear rankSVM with two kinds of parallelization. We carefully discuss techniques for reducing the communication cost and speeding up the computation, and we compare the two kinds of parallelization on dense and sparse data sets. Experiments show that our distributed methods are much faster than the single-machine method on two kinds of data sets: one in which the number of instances is much larger than the number of features, and the other in which the opposite holds.
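The objective the abstract refers to can be sketched on a single machine. This is a minimal illustration, not the thesis's distributed implementation: the function name, the naive pairwise enumeration, and the toy data are assumptions. L2-loss linear rankSVM minimizes f(w) = 0.5 w'w + C * sum over preference pairs (i, j) within the same query, with y_i > y_j, of max(0, 1 - w'(x_i - x_j))^2.

```python
import numpy as np

def rank_svm_objective(w, X, y, qid, C=1.0):
    """L2-loss linear rankSVM objective (illustrative, single machine).

    Enumerates every preference pair (i, j) within a query where
    y[i] > y[j]; real solvers avoid this O(pairs) loop with
    order-statistic tree techniques, and a distributed solver would
    also sum the loss across machines.
    """
    scores = X @ w
    loss = 0.0
    for q in np.unique(qid):
        idx = np.where(qid == q)[0]
        for i in idx:
            for j in idx:
                if y[i] > y[j]:
                    # squared hinge loss on the pairwise margin
                    xi = max(0.0, 1.0 - (scores[i] - scores[j]))
                    loss += xi * xi
    return 0.5 * (w @ w) + C * loss
```

A trust region Newton method would minimize this f(w) by approximately solving a Newton system at each iteration; in the distributed setting, the gradient and Hessian-vector products are computed locally on each machine's data partition and combined with a reduction, which is where the communication cost discussed in the abstract arises.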
Subjects
Learning to rank
Ranking support vector machines
Large-scale learning
Linear model
Distributed Newton method
Type
thesis
File(s)
Name
ntu-105-R02922041-1.pdf
Size
23.32 KB
Format
Adobe PDF
Checksum
(MD5):e70328aaa5f8b087f6c71f831974fc1f