Our belief in the PET program is that both synchronous and asynchronous training is appropriate in the DoD HPC community depending on the circumstances. In some cases, "students" will be motivated enough, and have sufficient background to successfully self-study available material, while at other times they will need the structure, educational support, and interactivity provided by the synchronous mode of operation.
Within the PET program, we do plan to investigate async. collab. tools further, but at present it appears that the mildly augmented set of basic tools are quite effective.
One particular area that does seem to be under-exploited is the use of web-linked databases. This is not because the technology isn't available, but rather because the tools and techniques are often unfamiliar to computational scientists compounded by the fact that they are tools rather than a finished products. A few examples of this technology are already available or under development within the PET program: the domain-specific search engine mentioned above, as well as several examples in the "Training Infrastructure" area, described briefly below. But users (MSRC users, PET partners, and others) need to be encouraged to think more about applications of this technology to problems in their CTA.
An asynchronous training resource can be created relatively easily by capturing a live presentation of the material. Simply recording the presentation on videotape is an obvious approach that has been used at most of the MSRCs at least to some extent. Some experiments have also been done with videotaping the instructor without a class present, and with editing the video to produce a more refined presentation. Typically, however, the management and duplication of video tapes has required too much of the Training staff's time to consider this an efficient approach. And in practice such videos, are not often used. One site, with a video library of roughly 250 different events (trainings, workshops, seminars), reports around one use or outside request per month.
By digitizing the a/v stream, it can be distributed easily on the web. This makes the captured lecture a more cost effective and easier training resource to produce and distribute. This has been done in a number of cases using RealNetworks' proprietary RealPlayer system, usually from a videotape recording, and requiring manual synchronization between the a/v stream and the slide changes. Under the PET program, Syracuse University has developed an integrated hardware/software system which goes a step further in the automation of the process. The "LecCorder" is a PC system with a commercial MPEG-1 encoder board. It can take a live a/v feed, i.e. from a class, and produce an archival-quality MPEG-1 digital audio/video strean, as well as capturing the instructor's slide changes during the presentation. Off-line, the high-quality, high-bandwidth a/v stream can be "down-converted" into more network-friendly formats such as H.261 and H.263 (videoconferencing standards) or RealPlayer. Combined with the "clickstream" and the lecture slides themselves, an asynchronous training resource can be produced and published on the web quickly and with almost no human intervention. Lectures captured using LecCorder and other methods are available or planned at all of the MSRCs; there is no data yet on the extent of their use. It is worth noting that these approaches require high-quality recordings to work from -- s-video or better; standard VHS recordings will generally not produce acceptable results.
With a greater investment of time, it is possible to develop online material of greater depth, as an alternative to augmenting it with recorded lectures. This is the approach taken in the Cornell Theory Center's (CTC) Virtual Workshop series, which has also been used by the PET program. Besides having material which is meant to truely stand on its own, the Virtual Workshops are run in specific time frames, during which on-line and telephone-based consulting regarding the class are available. The PET program has sponsored a number of Virtual Workshops, for which CTC has charged $20,000 each for up to 75-100 participants. Although relatively expensive, this approach has proven fairly popular, and more than 170 DoD users have taken advantage of them. CTC has about 40 modules (each equivalent to a single lecture or lab) from which a course can be designed. To our knowledge no one from the PET program has discussed with CTC the possibility of developing new modules, but this would presumably be a fairly expensive proposition given the depth and quality of the material.
Another asynchronous training tool of note is the CD-ROM of educational material on High Performance Computing which was compiled with support from the CEWES MSRC PET program and distributed at the 1998 DoD HPC Users Group Conference at Rice University. More than 300 of these discs have been distributed so far, and anecdotal evidence indicates that they have been well received, including many requests for discs from individuals who did not pick one up at the Conference or didn't attend. The second edition of this resource, which through additional and refined material has expanded to two discs, will be distributed at the 1999 DoD HPC UGC.
In the beginning of the PET program, mbone tools were used to extend trainings to remote sites. This system offers audio/video conferencing and a whiteboard. The system suffered from a number of problems which eventually lead to its abandonment as a distance training tool. Many people found that they did not have access to the multicast networking capability, the quality of the a/v transmission (dependent on the quality of service of the network) often left much to be desired, and there were practical obstacles to using the whiteboard to present lecture slides from both the viewpoint of the instructor and of the students. Mbone tools were originally developed for unix platforms, and little if any development of this system is still going on. As a result, Windows and Macintosh platforms are not well supported, and even for more recent unix systems it can be hard to obtain the necessary drivers.
More recently, PET activities have centered around the Tango Interactive system as a general, extensible framework for both education and collaboration. Development of Tango was begun under a DARPA initiative in 1996 as a C4I tool, and it has been refined and enhanced over the last several years, with support from various sources including the PET programs. It offers a much broader range of collaborative tools than mbone, including a number that were specifically developed for educational use. It supports Windows PCs (95, 98, NT, etc.) as well as several unix platforms and although it's actual performance is subject to the quality of service of the underlying network, most observers agree that it provides better quality audio and video than mbone in poor network environments. It is worth noting that the user perception of any synchronous collaboration/training system, including Tango, can be strongly affected by the network quality of service of the underlying connections.
The initial application of Tango in the PET program was in distance education. This is a fairly well structured type of collaborative interaction, in which the instructor needs an understanding of the use and limits of the collaborative framework, while the students generally do not need as much. With the students located together in an electronic classroom, an appropriately trained support person can provide the required Tango expertise on the receiving end. Using this approach, Syracuse University in New York is now in its fourth semester of delivering regular, semester-long academic credit classes to Jackson State University in Mississippi. And in the current semester, the recipient base has expanded to include Clark-Atlanta University and Mississippi State University as well as an individual at the Waterways Experiment Station. Jackson State University has also begun using the same tools to deliver a course to Morgan State University in Maryland.
The experience gained in these experiments has been critical in guiding work on the Tango system, especially in pointing up where robustness needed improvement. It is also important to realize that Tango is merely a tool for education and collaboration, and that in addition to insuring that the tool functions as intended, it must also be used effectively. The on-going educational work involving Tango has also allowed us to explore some of the sociological factors which make this form of instruction different from traditional face-to-face educational settings. With this experience we have been able to modify and improve our methods to provide a better educational experience, and we are developing enough experience to begin looking forward to issues particular to the training situation, and to less structured environments.
Training is similar to the traditional academic environment in terms of being structured, but because of the compressed time frame, is less forgiving of the occasional problem, and provides less time for students to become comfortable with the tools. In conjunction with the July 1998 release of version 1.0 of Tango Interactive, considered to be the first one to be appropriate for general deployment, the technology was transitioned into PET training activities.
In a collaborative effort involving the Ohio Supercomputer Center (OSC), Syracuse University, and the CEWES MSRC PET program, two prototype distance training classes were taught using Tango Interactive to deliver them to remote sites. In September 1998, a day-long training on Fortran90 was taught at the CEWES MSRC training room and delivered to the training facilities at the ARL MSRC (the HEAT Center) and OSC. In January 1999, a two-day class in OpenMP was presented at the CEWES MSRC training room and delivered to all three other MSRCs, the NRL Distributed Center, and OSC. In both cases, Syracuse University also monitored the class, but did not participate as students. As shown in Table 1, the January class reached more than 30 students in one of the most geographically distributed uses of Tango to date. OSC is also increasingly using Tango to deliver trainings it offers for its own users, and for trainings sponsored by the National Computational Science Alliance (NCSA). Once such class in February on "Java for Scientific Computing" was delivered from Ohio University to the Ohio Supercomputer Center and the Alliance ACCESS Center.
We are also extending this technology to other events similar in structure
to training, such as academic-style seminars: in April, one of the authors
(Fox) recently used Tango to present a seminar simultaneously to all four
MSRCs without leaving Syracuse. The presentation was also recorded with
our LecCorder system and has already been requested by several people unable
to attend the original event. Integration of recording capabilities directly
into Tango is also on the drawing board.
Event | |||||||
Site | Fortran90 | OpenMP | Java | Seminar | |||
28 Sep '98 | 26-27 Jan '99 | 23 Feb '99 | 6 Apr '99 | ||||
ACCESS | - | - | 25 | - | |||
ARL | 5 | 10 | - | 14 | |||
ASC | - | 1 | - | 4 | |||
CEWES | 13 | 10 | - | 16 | |||
NAVO | - | 2 | - | 7 | |||
NRL-DC | - | 2 | - | - | |||
OSC | 6 | 9 | 6 | - | |||
Ohio U | - | - | 9 | - | |||
TOTAL | 24 | 34 | 40 | 41 |
The ASC Training Database will help simplfy the currently labor-intensive tasks around training courses: registration, reminders, account setup and removal, and assessment. It integrates with existing MSRC procedures and data structures, and automates many tasks that are currently done manually. Both of these tools are designed to assist instructors and PET/MSRC staff, and have only a small exposure to the actual user community, but represent additional ways that web/network technology can be used to support and enhance the PET program's ability to provide training to the DoD user community. It is also worth noting that both of these projects are, at their core, web-linked databases, and thereby provide additional examples of how web-linked database technology might be used to support communication and collaboration among DoD users.
One important activity was a study of the implications of universal access. The shared event model of collaboration used by Tango Interactive naturally allows the curriculum material to be rendered separately on each client. This could allow one to deliver classes with one set of clients emphasizing graphical display (with perhaps variable resolution reflecting available network bandwidth) and another set sonification of the material for visually impaired users.
Tango Interactive from NPAC and Habanero from NCSA are probably the leading collaboration systems in academia built around the so called shared event model. It is worth noting that since 1 January 1999, NPAC has averaged 50 distinct downloads per week (with more than 20% of those downloads being Tango servers) and this software is part of Netscape's list of "approved" plug-ins available from their site.
A small set of basic and very familiar tools seem adequate to support asynchronous remote collaboration and training (especially e-mail and the web). There are more technically sophisticated tools available, but their value in practice is not clear -- there is strong evidence to suggest that the level and quality of integration with basic tools like e-mail and the web is more important than new capabilities introduced by the sophisticated tools.
Not surprisingly then, for asynchronous training, it is the educational content that requires the bulk of the effort. The PET program has examined a number of approaches to facilitate the rapid, low-cost development of asynchronous training content by capturing live presentations and publishing them on the web. At the other end of the spectrum, it is also possible to invest quite a bit of effort in developing courseware that can be accessed at varying levels of depth. The PET program has been experimenting with this approach by contracting the opportunitiy to participate in several of the Cornell Theory Center Virtual Workshops. Internally, several other training classes of greater depth are under development or on the drawing board.
For synchronous training and collaboration, it is more the required tools, and their general unfamiliarity to the user community, which has required us to go slowly with their introduction. Our initial experience has lead us to a five point plan for deployment of these tools into the DoD HPC community, working from more to less structured environments. Implementation of the plan is well underway, with basic education and training activities now positioned to become a routine part of PET training. The plan will ultimately lead to both routine collaborative use of Tango and direct to the desktop delivery of trainings.