Focused web crawlers: new target markets

by chris bose on June 6, 2011

A focused web crawler needs two inputs to run: a similarity set and a starting URL. The similarity set is the set of URLs that resemble the target market closely enough to guide the crawl. The starting URL is where the crawl begins.
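As a rough illustration of how these two inputs work together, here is a minimal sketch in Python. It is not any particular crawler's implementation. It assumes that "resembles" can be approximated by word overlap (Jaccard similarity) between a page and the pages in the similarity set, and that a best-first frontier is acceptable. The names `focused_crawl`, `fetch`, `threshold` and the toy `TOY_WEB` pages are all hypothetical. `fetch` stands in for real page retrieval so the sketch runs without network access.

```python
import heapq
import re


def tokens(text):
    """Return the set of lowercase words in a page's text."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Jaccard overlap of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def focused_crawl(start_url, similarity_set, fetch, threshold=0.2, max_pages=50):
    """Best-first crawl from start_url, keeping pages that resemble the similarity set.

    fetch(url) is a hypothetical callable returning (text, outgoing_links).
    """
    # Profile of the target market: one token set per similarity-set page.
    profiles = [tokens(fetch(u)[0]) for u in similarity_set]

    def score(text):
        page = tokens(text)
        return max((jaccard(page, p) for p in profiles), default=0.0)

    frontier = [(-1.0, start_url)]  # heapq is a min-heap, so priorities are negated
    seen = {start_url}
    relevant = []
    fetched = 0
    while frontier and fetched < max_pages:
        _, url = heapq.heappop(frontier)
        text, links = fetch(url)
        fetched += 1
        s = score(text)
        if s >= threshold and url not in similarity_set:
            relevant.append((url, s))
        # Follow links only from the starting URL or from pages on target.
        if s < threshold and url != start_url:
            continue
        for link in links:
            if link not in seen:
                seen.add(link)
                heapq.heappush(frontier, (-s, link))
    return relevant


# Toy in-memory "web" (made-up pages) so the sketch is self-contained.
TOY_WEB = {
    "http://dir.example/start": ("technology directory index",
                                 ["http://a.example", "http://b.example"]),
    "http://client.example": ("optical sensor modules for industrial automation", []),
    "http://a.example": ("industrial automation optical sensor supplier",
                         ["http://c.example"]),
    "http://b.example": ("recipes for chocolate cake", ["http://d.example"]),
    "http://c.example": ("optical sensor calibration for automation lines", []),
    "http://d.example": ("cake decorating tips", []),
}

found = focused_crawl(
    start_url="http://dir.example/start",
    similarity_set={"http://client.example"},
    fetch=lambda url: TOY_WEB[url],
)
for url, s in found:
    print(url, round(s, 2))
```

In this toy run the crawl keeps the two sensor-related pages and never follows the link out of the off-topic cake page, which is the point of focusing: the similarity set decides which branches are worth expanding.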

Initial similarity sets come from your client and prospect database.
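One way to picture this step, as a sketch only: pull the website field out of each client and prospect record, normalize it, and drop duplicates. The record layout (a dict with a `website` key), the function names, and the sample data are assumptions made for illustration, not a description of any real database schema.

```python
from urllib.parse import urlsplit


def normalize(url):
    """Reduce a URL to scheme plus lowercase host so duplicates collapse.

    Returns None when no host can be recovered.
    """
    url = (url or "").strip()
    if "://" not in url:
        url = "http://" + url  # assume http when the record omits a scheme
    parts = urlsplit(url)
    host = parts.hostname  # already lowercased, port removed
    return f"{parts.scheme}://{host}" if host else None


def build_similarity_set(records):
    """Collect the distinct, normalized site URLs from client/prospect records."""
    urls = set()
    for record in records:
        site = normalize(record.get("website"))
        if site:
            urls.add(site)
    return urls


# Made-up records standing in for rows from a client and prospect database.
records = [
    {"name": "Acme", "website": "www.Acme.example/products"},
    {"name": "Acme EU", "website": "http://www.acme.example"},
    {"name": "NoSite", "website": ""},
    {"name": "Beta", "website": "https://beta.example/"},
]
similarity_set = build_similarity_set(records)
print(sorted(similarity_set))
```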

A useful way to identify new target markets in high-technology areas is to LISTEN to directories and take your direction from their categories.

