HELP! * GREY=local LOCAL HTML version of Foils prepared July 28,1995

Foil 9 Implementation Issues of Web Rebots

From HPDC95 Web Search Presentation HPDC95 Pentagon City -- August 1,1995. by Geoffrey C. Fox * See also color IMAGE

Past Implementations
  • Browsing - A small list was maintained at CERN for browsing through all available servers
  • Listing - Lists of references to resources on the Web, such as Yahoo, HCC NetServices List, NCSA MetaIndex etc
  • Searching - Searchable databases like the GNA Meta Library
Present Implementations
  • Automatic Collection - Automatically retrieve, a fixed set of documents that the Robot has been programmed to parse regularly like the NCSA What new list
  • Automatic Discovery - Exploit automation, analyze and store the documents encountered
  • Automatic Indexing - Automatically index the set of documents cached
  • Web robot can be a standalone program plugged into a search system or a built-in gathering component in a indexing/search engine



Northeast Parallel Architectures Center, Syracuse University, npac@npac.syr.edu

If you have any comments about this server, send e-mail to webmaster@npac.syr.edu.

Page produced by wwwfoil on Mon Feb 17 1997