Title Page with PIs, departments, institutions, mailing
and e-mail, phone, fax
One page abstract give key words or a list of disciplines/sub-disciplines
covered
There is general expectation that computational science will make increasing and critical use of distributed heterogeneous metacomputing environments. However this implies the development and integration of new technologies from several different disciplines and their testing on complex applications. In this proposal we bring together experts from distributed computing, virtual environments, high speed networking and web based collaboration. These are integrated in terms of carefully chosen applications with which we will be able to motivate and assess the technology base. We believe that any successful system must be able to harness multiple distributed computers, data analysis and visualization subsystems as well as a distributed team of users. This requires a multidisciplinary approach in any successful approach to this problem. This scenario can be expected both for academic applications (as seen today in nationally distributed Grand Challenges) and for Industrial problems as envisaged in multidisciplinary analysis and design for next generation manufacturing.
We build on the Advanced Resource Management System
(ARMS, from the Cornell Theory Center), an advanced metacomputing
resource management tool that has already been designed to handle
both compute and visualization systems. The latter will span virtual
environments from advanced CAVETM capabilities at
the Cornell Theory Center and the University of Houston, as well
as conventional workstation-based analysis. A centerpiece of this
proposal will be the integration of advanced networking with ARMS
which will be lead valuable insight to Internet 2 and NGI as this
develops. TANGOsim (Syracuse University) is a recently developed
Java Server-based collaborative environment that includes an event-driven
simulator with the conventional services offered in comparable
systems offered by the National Center for Supercomputing Applications(Habanero)
and IBM. TANGOsim links applications written in any language.
It can be integrated naturally with the compute services offered
by ARMS, which is also based on a similar Web technology base.
These base technologies will be tied together with industrial
(medical, seismic, and manufacturing) and academic applications
involving all members of the proposal team. We have included a
strong networking research group from Cornell, which will allow
us to assess the results of effort from application, computing,
and networking perspectives. We believe that we have put together
a team that combines leading-edge expertise and technologies in
the diverse components needed in future distributed metacomputing
environments. Further, in each case, base technologies (ARMS,
TANGOsim, virtual environments) are well developed and this proposal
will only fund their integration.
The partners in this proposal are the Cornell Theory Center (CTC), the University of Houston (UH) and Syracuse University (NPAC/Syracuse). In the following we describe the component technologies ARMS, TANGOsim, Virtual Environments and Networking. These are integrated by the applications described at the end with a significant assessment effort.
Advanced Resource Management System (ARMS)
The Cornell Theory Center (CTC) has been developing
an advanced resource management and scheduling system for distributed,
heterogeneous computational, visualization, network, and data
acquisition resources. This system, called ARMS, deterministically
schedules applications on these resources so that network connectivity
and quality of service are ensured. ARMS is a technology enabler,
in that it provides access for applications to these distributed
resources in a seamless, cohesive fashion. This will allow researchers
from all areas to focus on their areas of expertise without being
concerned with the logistics of running their applications or
experiments on geographically distributed heterogeneous resources.
ARMS includes a two-level intelligent scheduling system that uses
an Informix distributed database. The database maintains status
and availability information, which is then used to schedule jobs
with specific computing, synthetic environment, software, data,
and networking resource requirements. The foundation for ARMS
is based on key components from the EASY-IBM LoadLeveler application
programming interface (API) project . Through a minimal set of
API calls, all the necessary information about local and remote
computational resources and user job requirements can be queried
reliably. This API is being extended to work with other major
scheduling systems such as Platform Computing's LSF.
ARMS provides secure, survivable resource management
and job scheduling. Local resource monitoring agents will be developed
that can relay status and availability information to the ARMS
database using the Ensemble system developed at Cornell University.
Ensemble, is a reliable networking protocol that guarantees the
arrival of information and also its ordered arrival. ARMS relies
on DCE/DFS for cross-realm authentication where we are working
closely with Lawrence Livermore National Laboratory. The growing
complexity of distributed environments suggests that distributed
system monitoring tools and navigational tools will be extremely
important for both systems administrators and users. We propose
the development of Web-based Java tools for ARMS which will allow
location of resources and monitoring of their status and availability.
ARMS is being tested in several testbeds. Development
work is being done primarily at CTC on its 512 node IBM SP system
and SGI Power Onyx systems. As code matures it is deployed to
a campus-based distributed system with ATM network connectivity.
In addition, mature ARMS code has been tested on a national testbed
including CTC, Pennsylvania State University, the University of
California at Los Angeles, and the University of Maryland. The
geographical distance between these sites has proven to be an
excellent test of ARMS' capabilities. The reliable and deterministic
service that ARMS will be able to guarantee will make it an essential
part of the NGI. It will provide a mechanism to predict and control
network traffic - a feature that is missing in the current Internet.
Our working closely with key distributed application developers
will ensure the ultimate usability of the ARMS system
Network Infrastructure
Our team has considerable experience in ATM network
technology, with both CTC and Syracuse operating multiple FORE
switches and an IBM 8260 operating in a production environment
for over two years. Initially, we propose to use this infrastructure
by setting up a virtual network between specified resources which
would be completely under the control of ARMS. This will be done
by creating Permanent Virtual Paths (PVPs) with guaranteed bandwidth
in a full mesh between computing resources- ARMS will have full
control of dynamically allocating appropriate Permanent Virtual
Circuits (PVCs) or Switched Virtual Circuits (SVCs) within the
allocated bandwidth of the PVPs. Once tested in the local area,
this concept can be extended, subject to approval, across the
vBNS network to resources at the collaborating sites.
An interesting case arises when there are multiple
alternative paths between two systems- one of which may have higher
bandwidth but also higher latency. Different requirements of
different parts of a particular ìjobî may be transported
over different paths based on their bandwidth and latency requirements.
Early simulations of path latencies can be accomplished by creating
ìloopedî virtual paths out into and back from the
vBNS network.
Collaboration
Recent Web technology developments open the way for
new collaborative computing environments that integrate traditional
video conferencing, distributed simulation as developed by the
DMSO office(SIMNET), and emerging ideas of computational steering.
In this proposal we will link the ARMS system with TANGOsim developed
at NPAC. TANGOsim uses Java Servers to manage Web collaboration
in the traditional fashion already seen in Habanero (NCSA) and
Shaking Hands (IBM but originating in NPAC). The Java Server
supplies session management with multicasting allowing shared
applications between multiple users. TANGOsim was built to support
command and control with scripted environments allowing real and
virtual members in the command teams. This is implemented as an
event driven simulator replacing the simple session manager used
in the other Web collaboration systems.
Since ARMS and TANGOsim are built on a modern Web
backbone we can integrate them so that ARMS provides the asynchronous
scheduling and TANGOsim the synchronous collaborative computational
steering, linking in each case distributed computers, databases,
visualization and multiple collaborating computational scientists.
TANGOsim has been developed with other funding for command and
control and distance education. Here we propose funding for its
integration with ARMS and the special features needed for distributed
computational steering. TANGOsim is implemented as a Java Server
linking a net of Java Applets with a special downloadable plug-in
providing the critical connectivity. It already supports client
applications written in Java, JavaScript, and C++ (and hence other
traditional languages). We have demonstrated a prototype visualization
client for a 3D GIS (Geographical Information System) with a custom
VRML browser written in C++. Other major capabilities of TANGOsim
include a video teleconferencing module and integration with a
state-of-the-art digital video server that supports efficient
multicast services. Further over the next few months, we will
add a database backend (using the new JDBC Java Database Connectivity)
which will allow one to log the complete multimedia collaborative
session. TANGOsim provides traditional basic servicesó
chatboard, shared whiteboard and multimedia mail. Further we have
ported several interesting educational applications, the WebWisdom
dissemination systems and some interesting Physics education applets
, to collaborative mode. These will be used in our first application
area described later.
This proposal involves the integration of ARMS and
TANGOsim with at a lower level TANGOsim integrating the components
of a synchronous computational steering environment. The details
of this integration will depend on the evolution of Web technology
but we expect to continue the basic architecture built around
Netscape's LiveConnect on the client and Java Servers. Note that
TANGOsim can be viewed as "just integration technology"
for collaboration systems "just" provide integration
between the involved clients.
Virtual Environments
Recently, higher bandwidth communications channels
have made collaborative science and engineering using shared virtual
environments practical. Research efforts have focused on two
main areas: software infrastructure and application development.
Researchers concentrating on software infrastructure have developed
software architectures, network protocols, and data-sharing algorithms
to enable distributed, shared, virtual environments. Application
developers have built specific applications to serve as proofs-of-concept
for the effectiveness of distributed, shared virtual environments.
To achieve real-time collaboration over long distances, two approaches
have emerged: (1) rendering graphics locally on similar platforms
and exchanging state data at high rates over a communications
link and; (2) rendering graphics at a single facility and delivering
those graphics, over communications channels, to remote users.
We propose to work with both approaches. UH will develop high
performance remote rendering software, based on work initially
done at SGI and ongoing at the UH with Darpa support. This will
support the remote "writing" of frame buffers so that
a powerful graphics supercomputer at one site can effectively
drive the displays of inexpensive workstations on the scientist's
or engineer's desktop. In this effort, much of the focus will
be on enhancing interaction and collaboration including use of
acoustic and haptic interfaces.
Another major use of visualization will be its application
as a user interface to large-scale mathematical models and simulations.
We will seek to make possible the more effective use of such
models and simulations executing on supercomputers, locally or
remotely. This will be accomplished by allowing scientists and
engineers to construct and work directly with near-real-time meaningful
representations of the physical systems that they are attempting
to understand.
Linking this technology with ARMS and TANGOsim (which
supports such custom viewers) allows us to deliver this environment
and to multiple distributed users by creating an effective interface
and giving the deterministic control of the scheduling of the
multiple resources required.
Integration and Assessment
We propose to deploy a distributed collaborative
computing environment that focuses on high level user services.
Little experience exists on either the useful features of such
a system or on the needed infrastructure capabilities in areas
of network, parallel multimedia database for session logs, and
computational services including visualization. Thus our proposal
will be structured as a set of application experiments described
below where we will carefully assess both the value of particular
system features and needed performance and functionality of underlying
infrastructure. This assessment will use traditional computer
and network performance tools augmented by enhancements in TANGOsim
to log message traffic through the session manger. Here we will
follow the strategy exploited in our Virtual Programming Laboratory
of using a web implementation of Illinois's Pablo system with
,performance data logged in TANGOsim's backend database using
the SDDF data format.
Selected Test Applications
Our approach is applicable to general large scale
computational problems but here we focus on two general areas
-- one from education and the other a classic grand challenge.
Our assessment of these two areas should allow our work to generalize
more broadly.
The Virtual Classroom
To test the usability of the ARMS system for the
development of distributed programming environments, we have been
working on several prototype applications and collaborative tools
layered on top of ARMS. CTC and Syracuse have collaborated on
the development of a secure Web-based scheduler interface that
provides ARMS users with a common interface for submitting jobs
to the ARMS scheduler and for browsing and editing DFS filespace,
no matter where the jobs reside. "Smart Make" facility,
another interface under development, will allow users to submit
source code to the ARMS scheduler, which in turn will locate the
necessary software and computing resources for the code, build
the correct executable, and store metadata on the executable pertaining
to its resource requirements.
CTC is already a leader in the area of asynchronous
Web-based education through its Virtual Workshop. Syracuse has
already integrated its HPCC and Web course system WebWisdom with
Tango to deliver training in classic synchronous style. This application
is particularly demanding of network responsiveness for users.
Within the context of this educational framework CTC and Syracuse
have collaborated on the development of a secure Web-based interface
that provides users with a common interface for submitting jobs
to the ARMS scheduler and for browsing and editing DFS filespace,
no matter where the jobs reside. Until this point in time the
Virtual Workshops have been asynchronous but with the integration
and development of TANGOsim in the Virtual Workshop environment,
we will create a truly Virtual Classroom environment. ARMS will
enable advanced reservation of networking resources so that during
a Virtual Workshop or class session users will be able to interact
as they would in a physical meeting or class room. We will use
the digital server technology developed by NPAC which is already
being used in their ATM K-12 experiment -- the Living SchoolBook.
Applications for visualization
The application that we propose for metacomputing
environments is shared virtual workspaces. Such a metacomputing
application can also be viewed as a metacomputing technology,
which in itself should be demonstrated and measured in terms of
end-user feasibility and value.
As end-user applications we propose 3-D medical imaging and video, and 3-D seismic data sets for exploration and reservoir modeling, and engineering design in the manufacturing industry. These applications are selected because first they involve large, complex, 3-D data sets, second the interpretation of the data sets are typically done by parties in different geographical locations, third the end--user has strong demands on quality of representation, ease of perception, and performance ("quality-of-service"). University of Houston has excellent relationships to all three user categories, and a realistic assessment of accomplishments can be achieved.