Local HTML version of foils prepared July 28, 1995

Foil 5: Major components in a web search system

From HPDC95 Web Search Presentation, HPDC95, Pentagon City, August 1, 1995, by Geoffrey C. Fox

an information gathering and filtering subsystem
  • gather source data from remote and local web repositories into a local indexing database. Sources (searchable text-based information on the Internet): Web space, including information on HTTP, FTP, Gopher, and WAIS servers; Usenet newsgroups; various on-line databases, e.g. periodicals
  • information conversion and extraction
  • handle updates to the source data
  • usually implemented by a "Web Robot"
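
The gathering steps above (gather, convert and extract, handle updates) can be sketched as a small robot loop. This is a minimal illustration, not the design of any particular robot. The names `gather`, `TextExtractor`, `fetch`, and `store` are hypothetical, and `fetch` is assumed to be any callable that returns a page's HTML as a string, or None if the page is unavailable. Update handling is assumed to be a simple content-digest comparison.

```python
import hashlib
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class TextExtractor(HTMLParser):
    """Conversion/extraction step: collect text chunks and outgoing links."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_data(self, data):
        # Keeps all character data, including any script/style text (sketch only).
        if data.strip():
            self.chunks.append(data.strip())


def gather(start_url, fetch, store, max_pages=100):
    """Breadth-first gather from start_url into the local store.

    fetch: hypothetical callable url -> HTML string, or None if unavailable.
    store: dict acting as the local indexing database.
    Returns the URLs whose extracted text is new or has changed.
    """
    queue = deque([start_url])
    seen = {start_url}
    changed = []
    fetched = 0
    while queue and fetched < max_pages:
        url = queue.popleft()
        html = fetch(url)
        fetched += 1
        if html is None:
            continue
        parser = TextExtractor()
        parser.feed(html)
        text = " ".join(parser.chunks)
        digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
        old = store.get(url)
        # Update handling: re-store a page only when its content digest differs.
        if old is None or old["digest"] != digest:
            store[url] = {"digest": digest, "text": text}
            changed.append(url)
        for link in parser.links:
            target = urljoin(url, link)
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return changed
```

Passing `fetch` in as a parameter keeps the sketch independent of any one protocol. An HTTP, FTP, or Gopher fetcher could be substituted without changing the loop.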
an indexer
  • meta-data
  • determines the scope and accuracy of the search engine
  • size of the indexed database
  • common indexing schemes: object names, selected keywords, full text
  • pre-computed indexing, automatic indexers
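
The three common indexing schemes listed above can be compared with a small pre-computed inverted index. This is a sketch under stated assumptions: the document layout (a dict with `keywords` and `text` fields), the stopword list, and the names `tokenize`, `terms_for`, `build_index`, and `search` are all illustrative and not taken from any specific indexer.

```python
import re
from collections import defaultdict

# Assumed, illustrative stopword list for the full-text scheme.
STOPWORDS = {"a", "an", "and", "the", "of", "in", "to", "for", "on"}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def terms_for(url, doc, scheme):
    """Return the terms one document contributes under the given scheme."""
    if scheme == "object_names":
        # Index only the object (file) name at the end of the URL.
        return tokenize(url.rsplit("/", 1)[-1])
    if scheme == "keywords":
        # Index only the selected keywords (the meta-data).
        return tokenize(" ".join(doc.get("keywords", [])))
    if scheme == "full_text":
        # Index every word of the body except stopwords.
        return [t for t in tokenize(doc["text"]) if t not in STOPWORDS]
    raise ValueError("unknown scheme: " + scheme)


def build_index(docs, scheme):
    """Pre-compute an inverted index: term -> set of URLs containing it."""
    index = defaultdict(set)
    for url, doc in docs.items():
        for term in terms_for(url, doc, scheme):
            index[term].add(url)
    return index


def search(index, query):
    """Return the URLs that contain every term of the query (AND semantics)."""
    postings = [index.get(term, set()) for term in tokenize(query)]
    return set.intersection(*postings) if postings else set()
```

The scheme fixes the trade-off named on the foil. Object names and selected keywords give a small index with narrow scope. Full text gives a larger index that answers more queries.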



Northeast Parallel Architectures Center, Syracuse University, npac@npac.syr.edu


Page produced by wwwfoil on Mon Feb 17 1997