Local HTML version of Foils prepared July 28, 1995

Foil 4 Challenges

From HPDC95 Web Search Presentation, HPDC95 Pentagon City -- August 1, 1995, by Geoffrey C. Fox

Data Volume
  • Estimated total Web size: 30 GB across 5 million documents, growing daily
  • Requires more sophisticated information search interfaces than browsing and organizing by hyperlinks
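
As a rough illustration of the volume estimate above, the two figures on the foil imply an average document size of about 6 kB. The sketch below assumes decimal gigabytes (1 GB = 10^9 bytes), which the foil does not specify; the function and constant names are hypothetical and chosen only for this example.

```python
# Figures from the foil; treating "GB" as decimal gigabytes is an assumption.
TOTAL_BYTES = 30 * 10**9   # estimated total Web size
DOCUMENTS = 5 * 10**6      # estimated number of documents


def average_document_size(total_bytes: int, documents: int) -> float:
    """Return the mean size in bytes per document."""
    return total_bytes / documents


avg_bytes = average_document_size(TOTAL_BYTES, DOCUMENTS)
print(f"{avg_bytes / 1000:.1f} kB per document")
```

With binary gigabytes (2^30 bytes) the result would be slightly larger, roughly 6.4 kB, so the figure is only an order-of-magnitude guide.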
Data Diversity
  • The Web is a huge distributed database: unstructured, non-relational, hierarchical (multimedia) information entities in various data formats (MIME): HTML, plain text, PostScript, LaTeX, images, audio/video clips, etc.
  • Web repositories are heterogeneous, inconsistent, and incomplete
User Base
  • Users differ in query patterns, search topics, and response-time requirements
  • Rapid daily growth in the number of users and search requests



Northeast Parallel Architectures Center, Syracuse University, npac@npac.syr.edu

Page produced by wwwfoil on Mon Feb 17 1997