Find this at http://www.npac.syr.edu/users/gcf/hpdc95websearch/

Web Search Presentation for HPDC95 Tutorial

Given by Geoffrey C. Fox at HPDC95, Pentagon City, on August 1, 1995. Foils prepared July 28, 1995.

This was prepared for a tutorial at the HPDC-4 Conference
It starts with motivation and identification of the four components of a Web Search system -- Information Gathering and Filtering, Indexing, Searching and User Interface
Web Robots (gatherers) are reviewed, followed by a detailed discussion of three examples: Lycos, FreeWAIS and Harvest -- the associated demonstrations also include Oracle Free text search
We end with a discussion of future technologies including natural language front ends, distributed queries, metadata, caching and artificial intelligence


Table of Contents for Web Search Presentation for HPDC95 Tutorial


001 Web Search
002 Abstract of Web Search Presentation
003 Motivations
004 Challenges
005 Major components in a web search system
006 Major components in a web search system (cont.)
007 Web Robots (also called Spiders, Web Worms or Web Wanderers)
008 Major uses of Web Robots 
009 Implementation Issues of Web Robots
010 Costs & Dangers of Using Web Robots
011 Costs & Dangers of Using Web Robots (con't)
012 Examples of Web Search Systems
013 Lycos
014 Total Volume and Data Capture in Lycos
015 Data Content in Lycos
016 FreeWAIS - Wide Area Information Server
017 Indexing in FreeWAIS 
018 Search in FreeWAIS
019 Harvest System
020 Harvest Architecture
021 Harvest Overview
022 Harvest Gatherer
023 Customized Content Extraction (Essence)
024 Summary Object Interchange Format (SOIF)
025 Harvest Broker
026 Distributed Gatherer-Broker Arrangement
027 Index & Search in Harvest
028 Cache in Harvest
029 Performance of Harvest -- Gatherer
030 Performance of Search in Harvest -- Glimpse
031 Implementation of Harvest Standalone
032 Implementations of Harvest with Other Systems -- Continued
033 Future Technologies in Web Search 
034 Future Technologies in Web Search - NLP
035 Future Technologies in Web Search - DQ
036 Future Technologies in Web Search - MDF
037 Future Technologies in Web Search - AI
038 Future Technologies in Web Search - CSCM
039 More about Web Search Systems and Web Robots-- Yahoo
040 More about Web Search Systems and Web Robots
041 Recent Capital Ventures in Web Search Business


Northeast Parallel Architectures Center, Syracuse University, npac@npac.syr.edu


Page produced by wwwfoil on Mon Feb 17 1997