File(s) under permanent embargo
Methodology for extracting quasi-random samples from world wide web domains
journal contribution
posted on 2009-01-01, 00:00 authored by M Featherstone, Stewart Adam, P Borstorff
The purpose of this paper is to describe a process for sampling specific domain name zones on the World Wide Web. Because of the size of the Web, sampling strategies must be employed to model and study the Web business environment effectively. This paper discusses the various efforts made to sample the Web, which ranged from random generation of Internet Protocol addresses and domain names to the process finally employed to create descriptive models of the dot-com domain name zones. The paper suggests that sampling the Web's top-level domains offers a reasonable alternative for business researchers because it requires only familiarity with simple Web utilities, such as the File Transfer Protocol (FTP), to obtain initial domain name listings.
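As an illustration only, the idea of drawing a sample from a domain name listing can be sketched in Python. This is not the authors' procedure, which the abstract does not specify. The sketch assumes the listing has already been obtained and reduced to one domain name per line, and it uses reservoir sampling (a swapped-in, standard technique) so the full listing never has to be held in memory. The function name `sample_domains`, the `;` comment convention, and the example names are hypothetical.

```python
import random
from typing import Iterable, List, Optional


def sample_domains(lines: Iterable[str], k: int,
                   seed: Optional[int] = None) -> List[str]:
    """Return up to k names drawn uniformly at random from a listing.

    Uses reservoir sampling (Algorithm R): a single pass over the
    input, keeping at most k names in memory at any time.
    Assumption: one domain name per line; blank lines and lines
    starting with ';' are skipped.
    """
    rng = random.Random(seed)
    reservoir: List[str] = []
    seen = 0  # number of usable names read so far
    for line in lines:
        name = line.strip().lower()
        if not name or name.startswith(";"):
            continue
        seen += 1
        if len(reservoir) < k:
            # Fill the reservoir with the first k names.
            reservoir.append(name)
        else:
            # Keep the new name with probability k / seen by
            # overwriting a uniformly chosen slot.
            j = rng.randrange(seen)
            if j < k:
                reservoir[j] = name
    return reservoir


if __name__ == "__main__":
    # Hypothetical in-memory listing standing in for a downloaded file.
    listing = [f"site{i}.com" for i in range(10_000)]
    print(sample_domains(listing, k=5, seed=42))
```

In practice the `lines` argument could be an open file handle for the downloaded listing, since the function only iterates over it once.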