Estimating the Number of Topic Specific Websites Based on IP Address-Using Pornographic Websites as An Example
Date Issued
2006
Date
2006
Author(s)
Chen, Rung-Tzuo
Language
zh-TW
Abstract
It is known that the number of pornographic websites increases as the Web expands. Estimating the number of pornographic websites online remains a major challenge. This paper proposes a statistical method for estimating the actual number of pornographic websites within a given confidence interval and error range.
To develop a more systematic and reliable estimation method, we use the IP address as our unit of measurement, instead of the more commonly used domain name or webpage, to define a pornographic website. We use keywords, database matches, and link analysis to determine whether a website contains pornographic content. Based on simple random sampling, we estimate the current number of pornographic websites at 69,077, at a 95% confidence level and with an error within 10%.
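The abstract does not give the sample size or the exact estimator used, so the following is only a minimal sketch of how a total can be estimated from a simple random sample of IP addresses. It assumes a normal approximation for the sampled proportion with a finite population correction. The function name `estimate_total` and all numbers in the usage lines are hypothetical and are not taken from the thesis.

```python
import math


def estimate_total(population_size, sample_size, positives, z=1.96):
    """Estimate a population total from a simple random sample.

    population_size: number of IP addresses in the sampling frame (assumed known)
    sample_size:     number of IP addresses sampled without replacement
    positives:       sampled addresses classified as hosting the target topic
    z:               normal quantile (1.96 corresponds to a 95% confidence level)

    Returns (estimated_total, margin_of_error, relative_error).
    """
    p_hat = positives / sample_size
    # Finite population correction for sampling without replacement.
    fpc = (population_size - sample_size) / (population_size - 1)
    # Standard error of the sampled proportion.
    se = math.sqrt(p_hat * (1 - p_hat) / sample_size * fpc)
    total = population_size * p_hat
    margin = z * se * population_size
    rel_error = margin / total if total > 0 else float("inf")
    return total, margin, rel_error


# Hypothetical numbers, for illustration only.
total, margin, rel_error = estimate_total(1_000_000, 10_000, 100)
print(f"estimated total: {total:.0f} +/- {margin:.0f} "
      f"(relative error {rel_error:.1%})")
```

Under these assumptions, the relative error shrinks roughly with the square root of the number of positive hits in the sample, so reaching an error within 10% for a rare category requires a large sample.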
Subjects
pornographic websites
website count estimation
unit of website measurement
Estimating the Number of Topic Specific Websites
Type
thesis
File(s)
Name
ntu-95-R93922090-1.pdf
Size
23.31 KB
Format
Adobe PDF
Checksum
(MD5):02939dd268c84f983823de715be7c1db