Global HTML version of Foils prepared 15 March 1996

Foil 19 Remarks on CPS713 Case Study III) Topic C: Datamining in Parallel and Distributed Databases

From Case Studies of Computational Science -- Overview of Initial Information Area Applications CPSP713 (714 Prototype) -- Autumn Semester 1994, by Geoffrey C. Fox

Databases contain data which is converted to Information by Datamining
This use of a database is often called a Data warehouse
You extract data and then apply Decision Support tools, which are essentially Optimization systems, to extract Information
High Visibility Commercial Applications are:
  • Using customer purchase information to optimize store layout: which products should be placed where, and when.
  • Using credit card data to plan optimal mailings with "offers" that customers are likely to accept. For instance, credit card records may show that a customer is a football fan who likes to spend Xmas in Florida.
    • An August mailing would then discount a combination of a Florida trip with Syracuse University Football tickets
  • Using Medicare data to identify fraudulent practices, which show up as anomalies (e.g. doctors claiming to see unusually many patients in a day)
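
The Medicare example above amounts to outlier detection on per-provider counts. The following is a minimal sketch of one way this could be done, not the method of any particular system: it flags providers whose patients-per-day count has a large modified z-score (based on the median and the median absolute deviation). The function name `flag_anomalous_providers`, the provider ids, the counts, and the threshold of 3.5 are all assumptions made for illustration.

```python
from statistics import median


def flag_anomalous_providers(daily_patients, threshold=3.5):
    """Return the sorted ids of providers with unusually HIGH patient counts.

    daily_patients maps a (hypothetical) provider id to the number of
    patients claimed in one day. A provider is flagged when its modified
    z-score, 0.6745 * (count - median) / MAD, exceeds the threshold.
    """
    counts = list(daily_patients.values())
    med = median(counts)
    # Median absolute deviation: a spread estimate that is not
    # dominated by the very outliers we are trying to find.
    mad = median(abs(c - med) for c in counts)
    if mad == 0:
        # All counts (or at least half of them) are identical;
        # the score is undefined, so flag nothing.
        return []
    flagged = []
    for provider, count in daily_patients.items():
        score = 0.6745 * (count - med) / mad
        if score > threshold:
            flagged.append(provider)
    return sorted(flagged)


# Made-up illustrative data, not real claims.
claims = {"dr_a": 22, "dr_b": 25, "dr_c": 19, "dr_d": 24, "dr_e": 21, "dr_f": 140}
print(flag_anomalous_providers(claims))  # prints ['dr_f']
```

A real system would work on far more data and more features than a single daily count, which is where parallel and distributed databases become relevant.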
Optimization tools will be those we study in Case Study I)
  • Thinking Machines produced a package (called Darwin originally) featuring Genetic algorithms, Neural Nets etc.


Northeast Parallel Architectures Center, Syracuse University, npac@npac.syr.edu


Page produced by wwwfoil on Tue Feb 18 1997