A Comprehensive Taxonomy and System Architecture for ATM Host-Network Interfaces

Bo-kyun Na

May 12, 2000

Current Network Systems

The current networks are:

The single service-independent network is:

Review of Host-Network Interface Systems

Definition: A network I/O to connect host to network.

With TCP/IP suites,

Figure

Problems of Current HNI Systems

Conventional Classifications of HNIs

*** The Number of Data Transfer copies

*** The Number of Data Transfer Copies

Figure

Limitations of Current Taxonomies

Research Objectives and Approaches

HNI taxonomy is motivated and directed to:

Skillicorn's Taxonomy on Computer Systems

System Model

Two-context taxonomy, as the approaches:

Performance Model

Illustrative Example: Jaguar

Continue...

Figure

Continue...

Figure

Syntactic Notation of Conventional HNIs

This table shows our syntax notation for host-network interface designs that have been reported in the literature.

Figure

Design of Intelligent HNI

Design Requirements

Taxonomy-based Design Approach

Diagram for architectural context of intelligent HNI

Figure

Diagram for protocol context of intelligent HNI

Figure

Formal Performance Metrics: 1

Figure

Formal Performance Metrics: 2

Transmitting Performance of Intelligent HNI

Throughput
=
48 byte/cell ×8 bit/byte
(3 Lsmp +3 SfmP) TmP + (6 LfmC + 7 SfmC +6 computation) TmC
=
384
(11 + 5) (30.3 ×10-9) + (8 + 9 + 6)(25 ×10-9)
=
362.33 Mbps

Transmitting Performance of Conventional OSIRIS HNI

Throughput
=
48 byte/cell ×8 bit/byte
(Lsmp+Ssmp) Tmp + (6 Lsmc + 7 SfmC + 6computation) TmC
=
384
(7 + 7) (30.3 ×10-9) + (22 + 9 + 6)(25 ×10-9)
=
284.61 Mbps

Segmentation Performance of intelligent HNI: worst case

Figure

Segmentation Performance of intelligent HNI: best case

Figure

Receiving Performance of intelligent HNI

Throughput
=
48 Bytes
( 6 Lfmc + 6 Sfmc +12 computation) TuC
=
384
(8 + 8 + 12) 25 ×10-9
=
548.57 Mbps

Receiving Performance of OSIRIS HNI

Throughput
=
48 Bytes
3Sfmp ×Tmp + ( 6 Lsmc + 6 Sfmc + 12 comp) TuC
=
384
5 ×30.3 ×10-9 + (22 + 8 + 12) 25 ×10-9
=
319.60 Mbps

Reassembly Performance of intelligent HNI

Figure

Answer to Research Questions

Future Works

Skillicorn's Taxonomy

Using his taxonomic scheme, Skillicorn established a single category of 28 classes:

class nIP nDP IP-DP IP-IM DP-DM DP-DP Name
3 0 n - - n-n n×n loosely coupled dataflow
4 0 n - - n ×n - tightly coupled dataflow
6 1 n 1-1 1-1 1-1 - von Neumann uniprocessor
8 1 n 1-n 1-1 n-n n ×n type 1 array processor
9 1 n 1-n 1-1 n×n - type 2 array processor
13 n n n-n n-n n-n - separate von Neumann uniprocessors
14 n n n-nn-nn-n n×n loosely coupled von Neumann
15 n n n-nn-nn-n - tightly coupled von Neumann

The Processing Capability of HNI's

Memory Access

According to the address-mapping spaces and data accessing mechanisms, David Banks and et al classifies HNIs as mapping:

Host Access

D. Henry and C. Joerg proposes four categories:

Architectural Context

continue...

Protocol-Implementation Context

Packetization/Depacketization

Data Copying

0-copy from user space on main memory to network

Figure

1-copy from user space of main memory to network buffer memory, then to network

After data is copied to network buffer memory, HNI provides some other protocol processing such as CRC and composing of header fields.

Figure

1-copy from user space of main memory to kernel space of main memory

Figure

2-copy from user space to kernel space of main memory then to network buffer memory

Transport protocol processing on kernel space except CRC checking which is supported on host-network interface.

Figure

Flow Control

Error Handling

Routing/Switching

Illustrative Example: AURORA

Continue...

Figure

Continue...

Figure

Simple HNI Design

Design Requirements

In architectural context:

In protocol context:

Taxonomy-based Design Approach

Diagram for Archtiectural context of simple HNI

Figure

Diagram for protocol context of simple HNI

Figure

Transmitting Performance of simple HNI

To send 1 payload data (384 bits),
6 slow loads and 6 slow stores through the 64-bit PCI system bus.

For the header information,
1 slow store (since less than 1 fast load for the header information and CRC field), 5 arithmatic and 1 branch instructions.

Throughput
=
48 byte/cell ×8 bit/byte
(6 Lsm p + 7 Ssmp + 6 computation) Tmp
=
384
(22 + 22 + 6) (30.3 ×10-9)
=
248.50 Mbps

Segmentation Performance of simple HNI

Figure

Receiving Performance of simple HNI

Reassembly by a lightly loaded host for an ATM cell: 6 loads (7-2-2-2-7-2) by 64 bit DMA B/W; 3 (7-2-2) stores by 128 bit memory B/W.

Throughput
=
48 Bytes
( 6 Lsmp + 3 Ssmp +12 computation) Tup
=
384
(22 + 11 + 12) 30.3 ×10-9
=
281.63  Mbps

In heavy loaded host: additional 3 slow loads from main memory.

Throughput
=
48 Bytes
( 6 Lsmp + 3 Lsmp + 3Ssmp + 12 computation) Tup
=
384
(22 + 11 + 11 + 12) 30.3 ×10-9
=
226.31  Mbps

Reassembly Performance of simple HNI

Figure


File translated from TEX by TTH, version 2.33.
On 10 May 2000, 20:41.