https://scholars.lib.ntu.edu.tw/handle/123456789/309065
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kao, H.-Y. | en_US |
dc.contributor.author | Ho, J.-M. | en_US |
dc.contributor.author | MING-SYAN CHEN | - |
dc.creator | Kao, H.-Y.;Ho, J.-M.;Chen, M.-S. | - |
dc.date.accessioned | 2018-09-10T04:53:03Z | - |
dc.date.available | 2018-09-10T04:53:03Z | - |
dc.date.issued | 2004 | - |
dc.identifier.uri | http://www.scopus.com/inward/record.url?eid=2-s2.0-2942525697&partnerID=MN8TOARS | - |
dc.identifier.uri | http://scholars.lib.ntu.edu.tw/handle/123456789/309065 | - |
dc.description.abstract | Due to the growth of dynamic page generation techniques, the amount and the complexity of Web pages have been increasing explosively, as has the information contained within Web pages. Redundant and irrelevant information is distributed and mixed throughout a page, making it difficult to automatically identify the useful information in that page. Consequently, we propose an information hierarchy in this paper, and, from that hierarchy, we can extract the significance and the relationship value of information contained within a Web page. We can then use this hierarchical structure to create a new browsing process. Our DOM-based Information Space Adsorption (DOMISA) system applies information theory to map information in a page into an information space, and our gradient tree adsorption (GTA) process uses the document object model (DOM) trees of pages to build information hierarchies. Experiments on several commercial news Web sites show high precision and recall rates achieved by DOMISA in determining information clusters of pages, which validates its practical applicability to Web sites. | - |
dc.language | en | en |
dc.relation.ispartof | SIAM Proceedings Series | - |
dc.source | AH-Scopus to ORCID | - |
dc.title | DOMISA: DOM-based information space adsorption for web information hierarchy mining | - |
dc.type | conference paper | en |
dc.identifier.doi | 10.1137/1.9781611972740.29 | - |
dc.identifier.scopus | 2-s2.0-2942525697 | - |
item.fulltext | no fulltext | - |
item.grantfulltext | none | - |
dc.relation.pages | 312-320 | - |
item.openairecristype | http://purl.org/coar/resource_type/c_5794 | - |
item.cerifentitytype | Publications | - |
item.openairetype | conference paper | - |
crisitem.author.dept | Electrical Engineering | - |
crisitem.author.dept | Computer Science and Information Engineering | - |
crisitem.author.dept | Communication Engineering | - |
crisitem.author.dept | Networking and Multimedia | - |
crisitem.author.orcid | 0000-0002-0711-8197 | - |
crisitem.author.parentorg | College of Electrical Engineering and Computer Science | - |
crisitem.author.parentorg | College of Electrical Engineering and Computer Science | - |
crisitem.author.parentorg | College of Electrical Engineering and Computer Science | - |
crisitem.author.parentorg | College of Electrical Engineering and Computer Science | - |
Appears in Collections: | Department of Electrical Engineering |
Items in the IR system are protected by copyright, with all rights reserved, unless their copyright terms are specifically indicated.