HELP! * GREY=local LOCAL HTML version of Foils prepared July 28,1995

Foil 7 Web Robots (also called Spiders, Web Worms or Web Wanderers)

From HPDC95 Web Search Presentation HPDC95 Pentagon City -- August 1,1995. by Geoffrey C. Fox * See also color IMAGE

Definition: web robots are a class of software programs that traverse network hosts gathering information from and about resources -- lists, information and collections
Problems Addressed
  • No pre-defined data set or fields for a document on the Web
  • Manual traversal of the Web is virtually impossible
  • Contents of the resources change as also the hyperlinks
  • Improperly maintained Web sites produce dead links
Limitation: Information generated Òon-the-flyÓ (e.g. by CGI scripts) cannot be retrieved by Web rebots



Northeast Parallel Architectures Center, Syracuse University, npac@npac.syr.edu

If you have any comments about this server, send e-mail to webmaster@npac.syr.edu.

Page produced by wwwfoil on Mon Feb 17 1997