Performance

This section presents performance measurements of gms_net and sequential file access using the GMS/Trapeze prototype in Digital Unix 4.0. We measure all the RPC variants presented in order to illustrate the costs and benefits of the gms_net and Trapeze mechanisms discussed in the previous sections. The file access tests are intended to show the rate at which a GMS/Trapeze client can source and sink data to network storage servers using these communication mechanisms for payload transfer and asynchronous read-ahead. A secondary goal is to show the effect of the operating system kernel interface chosen to read or write file data at these speeds, which are close to the limits of the hardware.

The systems used for these measurements are Alcor and Miata, two DEC Alpha platforms based on the 21164 CPU. Our Alcors (AlphaStation 500 and 600) are clocked at 266 MHz, and use a CIA host-PCI bridge (ASIC pass 2). Miata (PWS 500au) is a newer 500 MHz machine with a Pyxis bridge (ASIC pass 257). All systems are equipped with M2F-PCI32 Myrinet LAN adapters connected through an 8-port switch (M2F-SW8). Both the CIA and Pyxis I/O bridges deliver almost the full bandwidth of the 32-bit 33MHz PCI standard (132 MB/s) in one direction; however, Alcor receives at half-bandwidth and Miata sends at half-bandwidth. Most of the experiments in this section involve a one-way high-volume data transfer: to circumvent the bridge bottlenecks Alcor is always the sender and Miata is always the receiver. Release of the improved Miata-II is imminent, but it is not yet available at the time of this writing.

Table 1: RPC microbenchmark results for gms_net on an AlphaStation/Myrinet network.

Msg/Payload		16/0			16/4096		16/8192
		$\textstyle \parbox{\hackwidth}{\centerline{Latency}}$	$\textstyle \parbox{\hackwidth}{\centerline{Bandwidth}}$		$\textstyle \parbox{\hackwidth}{\centerline{Latency}}$	$\textstyle \parbox{\hackwidth}{\centerline{Bandwidth}}$	$\textstyle \parbox{\hackwidth}{\centerline{Latency}}$	$\textstyle \parbox{\hackwidth}{\centerline{Bandwidth}}$
		( $\mu$ sec)	(msgs/sec)		( $\mu$ sec)	(MB/sec)	( $\mu$ sec)	(MB/sec)
				Solicited	127.6	32.1	165.9	48.8
	Wait	73.7	13600	Unsolicited	134.0	30.6	169.6	48.0
				Unsol+Copy	167.4	24.5	217.2	37.5
				Solicited	125.8	78.9	163.5	95.1
2-way	Cont	69.0	64500	Unsolicited	128.0	79.2	166.7	97.4
				Unsol+Copy	160.8	69.7	210.8	90.5
				Solicited	131.6	76.9	169.8	94.9
	Defer	73.1	64500	Unsolicited	135.2	70.7	170.5	96.7
				Unsol+Copy	170.0	50.4	218.3	92.9
				Solicited	163.4	25.1	201.9	40.3
	Wait	109.5	9150	Unsolicited	168.8	24.3	205.9	39.7
				Unsol+Copy	201.8	20.3	254.0	32.2
				Solicited	162.6	79.4	199.5	94.7
3-way	Cont	104.6	64600	Unsolicited	163.6	78.8	202.3	97.1
				Unsol+Copy	195.2	63.6	246.4	91.3
				Solicited	166.6	77.6	205.7	95.5
	Defer	108.9	64500	Unsolicited	169.6	65.7	206.1	96.3
				Unsol+Copy	204.0	47.9	254.4	88.1