Arrakis: The Operating System is the Control Plane

Simon Peter∗  Jialin Li∗  Irene Zhang∗  Dan R. K. Ports∗  Doug Woos∗
Arvind Krishnamurthy∗  Thomas Anderson∗  Timothy Roscoe†

∗University of Washington  †ETH Zurich

Abstract

Recent device hardware trends enable a new approach to the design of network server operating systems. In a traditional operating system, the kernel mediates access to device hardware by server applications, to enforce process isolation as well as network and disk security. We have designed and implemented a new operating system, Arrakis, that splits the traditional role of the kernel in two. Applications have direct access to virtualized I/O devices, allowing most I/O operations to skip the kernel entirely, while the kernel is re-engineered to provide network and disk protection without kernel mediation of every operation. We describe the hardware and software changes needed to take advantage of this new abstraction, and we illustrate its power by showing improvements of 2–5× in latency and 9× in throughput for a popular persistent NoSQL store relative to a well-tuned Linux implementation.

1 Introduction

Reducing the overhead of the operating system process abstraction has been a longstanding goal of systems design. This issue has become particularly salient with modern client-server computing. The combination of high speed Ethernet and low latency persistent memories is considerably raising the efficiency bar for I/O intensive software. Many servers spend much of their time executing operating system code: delivering interrupts, demultiplexing and copying network packets, and maintaining file system meta-data. Server applications often perform very simple functions, such as key-value table lookup and storage, yet traverse the OS kernel multiple times per client request.

These trends have led to a long line of research aimed at optimizing kernel code paths for various use cases: eliminating redundant copies in the kernel [45], reducing the overhead for large numbers of connections [27], protocol specialization [44], resource containers [8, 39], direct transfers between disk and network buffers [45], interrupt steering [46], system call batching [49], hardware TCP acceleration, etc. Much of this has been adopted in mainline commercial OSes, and yet it has been a losing battle: we show that the Linux network and file system stacks have latency and throughput many times worse than that achieved by the raw hardware.

Twenty years ago, researchers proposed streamlining packet handling for parallel computing over a network of workstations by mapping the network hardware directly

into user space [19, 22, 54]. Although commercially unsuccessful at the time, the virtualization market has now led hardware vendors to revive the idea [6, 38, 48], and also extend it to disks [52, 53].

This paper explores the OS implications of removing the kernel from the data path for nearly all I/O operations. We argue that doing this must provide applications with the same security model as traditional designs; it is easy to get good performance by extending the trusted computing base to include application code, e.g., by allowing applications unfiltered direct access to the network/disk.

We demonstrate that operating system protection is not contradictory with high performance. For our prototype implementation, a client request to the Redis persistent NoSQL store has 2× better read latency, 5× better write latency, and 9× better write throughput compared to Linux. We make three specific contributions:

  • We give an architecture for the division of labor between the device hardware, kernel, and runtime for direct network and disk I/O by unprivileged processes, and we show how to efficiently emulate our model for I/O devices that do not fully support virtualization (§3).
  • We implement a prototype of our model as a set of modifications to the open source Barrelfish operating system, running on commercially available multi-core computers and I/O device hardware (§3.8).
  • We use our prototype to quantify the potential benefits of user-level I/O for several widely used network services, including a distributed object cache, Redis, an IP-layer middlebox, and an HTTP load balancer (§4). We show that significant gains are possible in terms of both latency and scalability, relative to Linux, in many cases without modifying the application programming interface; additional gains are possible by changing the POSIX API (§4.3).

2 Background

We first give a detailed breakdown of the OS and application overheads in network and storage operations today, followed by a discussion of current hardware technologies that support user-level networking and I/O virtualization.

To analyze the sources of overhead, we record timestamps at various stages of kernel and user-space processing. Our experiments are conducted on a six machine cluster consisting of 6-core Intel Xeon E5-2430 (Sandy Bridge) systems at 2.2 GHz running Ubuntu Linux 13.04.


                          Linux                               Arrakis
                          Receiver running  CPU idle          Arrakis/P        Arrakis/N
Network stack   in        1.26 (37.6%)      1.24 (20.0%)      0.32 (22.3%)     0.21 (55.3%)
                out       1.05 (31.3%)      1.42 (22.9%)      0.27 (18.7%)     0.17 (44.7%)
Scheduler                 0.17 (5.0%)       2.40 (38.8%)      -                -
Copy            in        0.24 (7.1%)       0.25 (4.0%)       0.27 (18.7%)     -
                out       0.44 (13.2%)      0.55 (8.9%)       0.58 (40.3%)     -
Kernel crossing return    0.10 (2.9%)       0.20 (3.3%)       -                -
                syscall   0.10 (2.9%)       0.13 (2.1%)       -                -
Total                     3.36 (σ=0.66)     6.19 (σ=0.82)     1.44 (σ<0.01)    0.38 (σ<0.01)

Table 1: Sources of packet processing overhead in Linux and Arrakis. All times are averages over 1,000 samples, given in μs (standard deviation given for totals). Arrakis/P uses the POSIX interface, Arrakis/N uses the native Arrakis interface.

The systems have an Intel X520 (82599-based) 10Gb Ethernet adapter and an Intel MegaRAID RS3DC RAID controller with 1GB of flash-backed DRAM in front of a 100GB Intel DC S3700 SSD. All machines are connected to a 10Gb Dell PowerConnect 8024F Ethernet switch. One system (the server) executes the application under scrutiny, while the others act as clients.

2.1 Networking Stack Overheads

Consider a UDP echo server implemented as a Linux process. The server performs recvmsg and sendmsg calls in a loop, with no application-level processing, so it stresses packet processing in the OS. Figure 1 depicts the typical workflow for such an application. As Table 1 shows, operating system overhead for packet processing falls into four main categories.

  • Network stack costs: packet processing at the hardware, IP, and UDP layers.
  • Scheduler overhead: waking up a process (if necessary), selecting it to run, and context switching to it.
  • Kernel crossings: from kernel to user space and back.
  • Copying of packet data: from the kernel to a user buffer on receive, and back on send.

Of the total 3.36 μs (see Table 1) spent processing each packet in Linux, nearly 70% is spent in the network stack. This work is mostly software demultiplexing, security checks, and overhead due to indirection at various layers. The kernel must validate the header of incoming packets and perform security checks on arguments provided by the application when it sends a packet. The stack also performs checks at layer boundaries.

Scheduler overhead depends significantly on whether the receiving process is currently running. If it is, only 5% of processing time is spent in the scheduler; if it is not, the time to context-switch to the server process from the idle process adds an extra 2.2 μs and a further 0.6 μs slowdown in other parts of the network stack.
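For concreteness, the server measured here is nothing more than a receive/transmit loop. A minimal sketch of such a UDP echo server against the standard POSIX socket API (our code, not the authors' benchmark tool; the port number is arbitrary):

```c
/* Minimal UDP echo server of the kind measured above: a recvmsg/
 * sendmsg loop with no application-level processing. A sketch, not
 * the authors' benchmark code; the port number is arbitrary. */
#include <arpa/inet.h>
#include <netinet/in.h>
#include <sys/socket.h>
#include <sys/types.h>
#include <sys/uio.h>

int main(void) {
    int fd = socket(AF_INET, SOCK_DGRAM, 0);
    struct sockaddr_in addr = { 0 };
    addr.sin_family = AF_INET;
    addr.sin_port = htons(9000);
    addr.sin_addr.s_addr = htonl(INADDR_ANY);
    bind(fd, (struct sockaddr *)&addr, sizeof(addr));

    char buf[2048];
    struct sockaddr_in peer;
    for (;;) {
        struct iovec iov = { .iov_base = buf, .iov_len = sizeof(buf) };
        struct msghdr msg = { .msg_name = &peer,
                              .msg_namelen = sizeof(peer),
                              .msg_iov = &iov, .msg_iovlen = 1 };
        ssize_t n = recvmsg(fd, &msg, 0); /* kernel copies packet in */
        if (n < 0)
            continue;
        iov.iov_len = (size_t)n;
        sendmsg(fd, &msg, 0);             /* kernel copies packet out */
    }
}
```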

Cache and lock contention issues on multicore systems add further overhead and are exacerbated by the fact that incoming messages can be delivered on different queues by the network card, causing them to be processed by different CPU cores—which may not be the same as the cores on which the user-level process is scheduled, as depicted in Figure 1. Advanced hardware support such as accelerated receive flow steering [4] aims to mitigate this cost, but these solutions themselves impose non-trivial setup costs [46].

By leveraging hardware support to remove kernel mediation from the data plane, Arrakis can eliminate certain categories of overhead entirely, and minimize the effect of others. Table 1 also shows the corresponding overhead for two variants of Arrakis. Arrakis eliminates scheduling and kernel crossing overhead entirely, because packets are delivered directly to user space. Network stack processing is still required, of course, but it is greatly simplified: it is no longer necessary to demultiplex packets for different applications, and the user-level network stack need not validate parameters provided by the user as extensively as a kernel implementation must. Because each application has a separate network stack, and packets are delivered to cores where the application is running, lock contention and cache effects are reduced.

In the Arrakis network stack, the time to copy packet data to and from user-provided buffers dominates the processing cost, a consequence of the mismatch between the POSIX interface (Arrakis/P) and NIC packet queues. Arriving data is first placed by the network hardware into a network buffer and then copied into the location specified by the POSIX read call. Data to be transmitted is moved into a buffer that can be placed in the network hardware queue; the POSIX write can then return, allowing the user memory to be reused before the data is sent. Although researchers have investigated ways to eliminate this copy from kernel network stacks [45], as Table 1 shows, most of the overhead for a kernel-resident network stack is elsewhere. Once the overhead of traversing the kernel is


                   Read hit                         Durable write
                   Linux            Arrakis/P       Linux              Arrakis/P
epoll              2.42 (27.91%)    1.12 (27.52%)   2.64 (1.62%)       1.49 (4.73%)
recv               0.98 (11.30%)    0.29 (7.13%)    1.55 (0.95%)       0.66 (2.09%)
Parse input        0.85 (9.80%)     0.66 (16.22%)   2.34 (1.43%)       1.19 (3.78%)
Lookup/set key     0.10 (1.15%)     0.10 (2.46%)    1.03 (0.63%)       0.43 (1.36%)
Log marshaling     -                -               3.64 (2.23%)       2.43 (7.71%)
write              -                -               6.33 (3.88%)       0.10 (0.32%)
fsync              -                -               137.84 (84.49%)    24.26 (76.99%)
Prepare response   0.60 (6.92%)     0.64 (15.72%)   0.59 (0.36%)       0.10 (0.32%)
send               3.17 (36.56%)    0.71 (17.44%)   5.06 (3.10%)       0.33 (1.05%)
Other              0.55 (6.34%)     0.46 (11.30%)   2.12 (1.30%)       0.52 (1.65%)
Total              8.67 (σ=2.55)    4.07 (σ=0.44)   163.14 (σ=13.68)   31.51 (σ=1.91)
99th percentile    15.21            4.25            188.67             35.

Table 2: Overheads in the Redis NoSQL store for memory reads (hits) and durable writes (legend in Table 1).

In Arrakis, we use SR-IOV, the IOMMU, and supporting adapters to provide direct application-level access to I/O devices. This is a modern implementation of an idea which was implemented twenty years ago with U-Net [54], but generalized to flash storage and Ethernet network adapters. To make user-level I/O stacks tractable, we need a hardware-independent device model and API that captures the important features of SR-IOV adapters [31, 40, 41, 51]; a hardware-specific device driver matches our API to the specifics of the particular device. We discuss this model in the next section, along with potential improvements to the existing hardware to better support user-level I/O.

Remote Direct Memory Access (RDMA) is another popular model for user-level networking [48]. RDMA gives applications the ability to read from or write to a region of virtual memory on a remote machine directly from user-space, bypassing the operating system kernel on both sides. The intended use case is for a parallel program to be able to directly read and modify its data structures even when they are stored on remote machines.

While RDMA provides the performance benefits of user-level networking to parallel applications, it is challenging to apply the model to a broader class of client-server applications [21]. Most importantly, RDMA is point-to-point. Each participant receives an authenticator providing it permission to remotely read/write a particular region of memory. Since clients in client-server computing are not mutually trusted, the hardware would need to keep a separate region of memory for each active connection. Therefore we do not consider RDMA operations here.

3 Design and Implementation

Arrakis has the following design goals:

  • Minimize kernel involvement for data-plane operations: Arrakis is designed to limit or remove kernel mediation for most I/O operations. I/O requests are routed to and from the application's address space without requiring kernel involvement and without sacrificing security and isolation properties.
  • Transparency to the application programmer: Arrakis is designed to significantly improve performance without requiring modifications to applications written to the POSIX API. Additional performance gains are possible if the developer can modify the application.
  • Appropriate OS/hardware abstractions: Arrakis' abstractions should be sufficiently flexible to efficiently support a broad range of I/O patterns, scale well on multicore systems, and support application requirements for locality and load balance.

In this section, we show how we achieve these goals in Arrakis. We describe an ideal set of hardware facilities that should be present to take full advantage of this architecture, and we detail the design of the control plane and data plane interfaces that we provide to the application. Finally, we describe our implementation of Arrakis based on the Barrelfish operating system.

3.1 Architecture Overview

[Figure 3: Arrakis architecture. The storage controller maps VSAs to physical storage.]

Arrakis targets I/O hardware with support for virtualization, and Figure 3 shows the overall architecture. In this paper, we focus on hardware that can present multiple instances of itself to the operating system and the applications running on the node. For each of these virtualized device instances, the underlying physical device provides unique memory mapped register files, descriptor queues, and interrupts, hence allowing the control plane to map each device instance to a separate protection domain. The device exports a management interface that is accessible from the control plane in order to create or destroy virtual device instances, associate individual instances with network flows or storage areas, and allocate shared resources to the different instances. Applications conduct I/O



through their protected virtual device instance without requiring kernel intervention. In order to perform these operations, applications rely on a user-level I/O stack that is provided as a library. The user-level I/O stack can be tailored to the application as it can assume exclusive access to a virtualized device instance, allowing us to remove any features not necessary for the application's functionality. Finally, (de-)multiplexing operations and security checks are not needed in this dedicated environment and can be removed.

The user naming and protection model is unchanged. A global naming system is provided by the control plane. This is especially important for sharing stored data. Applications implement their own storage, while the control plane manages naming and coarse-grain allocation, by associating each application with the directories and files it manages. Other applications can still read those files by indirecting through the kernel, which hands the directory or read request to the appropriate application.

3.2 Hardware Model

A key element of our work is to develop a hardware-independent layer for virtualized I/O—that is, a device model providing an “ideal” set of hardware features. This device model captures the functionality required to implement in hardware the data plane operations of a traditional kernel. Our model resembles what is already provided by some hardware I/O adapters; we hope it will provide guidance as to what is needed to support secure user-level networking and storage.

In particular, we assume our network devices provide support for virtualization by presenting themselves as multiple virtual network interface cards (VNICs) and that they can also multiplex/demultiplex packets based on complex filter expressions, directly to queues that can be managed entirely in user space without the need for kernel intervention. Similarly, each storage controller exposes multiple virtual storage interface controllers (VSICs) in our model. Each VSIC provides independent storage command queues (e.g., of SCSI or ATA format) that are multiplexed by the hardware. Associated with each such virtual interface card (VIC) are queues and rate limiters.

VNICs also provide filters and VSICs provide virtual storage areas. We discuss these components below.

Queues: Each VIC contains multiple pairs of DMA queues for user-space send and receive. The exact form of these VIC queues could depend on the specifics of the I/O interface card. For example, it could support a scatter/gather interface to aggregate multiple physically-disjoint memory regions into a single data transfer. For NICs, it could also optionally support hardware checksum offload and TCP segmentation facilities. These features enable I/O to be handled more efficiently by performing additional work in hardware. In such cases, the Arrakis system offloads operations and further reduces overheads.

Transmit and receive filters: A transmit filter is a predicate on network packet header fields that the hardware will use to determine whether to send the packet or discard it (possibly signaling an error either to the application or the OS). The transmit filter prevents applications from spoofing information such as IP addresses and VLAN tags and thus eliminates kernel mediation to enforce these security checks. It can also be used to limit an application to communicate with only a pre-selected set of nodes.

A receive filter is a similar predicate that determines which packets received from the network will be delivered to a VNIC and to a specific queue associated with the target VNIC. For example, a VNIC can be set up to receive all packets sent to a particular port, so both connection setup and data transfers can happen at user-level. Installation of transmit and receive filters is a privileged operation performed via the kernel control plane.
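To make the filter abstraction concrete, the following is a conceptual sketch of a receive filter expressed as mask-and-compare rules over packet header words, the form of matching §3.4 describes; all names here are ours, not an Arrakis or NIC interface:

```c
/* Conceptual sketch of a receive filter as mask-and-compare rules
 * over packet header words. All names are illustrative; this is not
 * an Arrakis or NIC-specific interface. */
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

struct match_rule {
    uint16_t offset; /* byte offset into the packet header */
    uint32_t mask;   /* bits the rule constrains */
    uint32_t value;  /* required value of those bits */
};

/* The hardware delivers a packet to the filter's queue only if every
 * rule matches, e.g. "dst IP in 10.0.0.0/8 AND dst UDP port == 9000". */
static bool filter_matches(const uint8_t *hdr,
                           const struct match_rule *rules, int nrules) {
    for (int i = 0; i < nrules; i++) {
        uint32_t word;
        memcpy(&word, hdr + rules[i].offset, sizeof(word));
        if ((word & rules[i].mask) != rules[i].value)
            return false;
    }
    return true;
}
```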

Virtual storage areas: Storage controllers need to provide an interface via their physical function to map virtual storage areas (VSAs) to extents of physical drives, and associate them with VSICs. A typical VSA will be large enough to allow the application to ignore the underlying multiplexing—e.g., multiple erasure blocks on flash, or cylinder groups on disk. An application can store multiple sub-directories and files in a single VSA, providing precise control over multi-object serialization constraints. A VSA is thus a persistent segment [13]. Applications reference blocks in the VSA using virtual offsets, converted by hardware into physical storage locations. A VSIC may have multiple VSAs, and each VSA may be mapped into multiple VSICs for interprocess sharing.

Bandwidth allocators: This includes support for resource allocation mechanisms such as rate limiters and pacing/traffic shaping of I/O. Once a frame has been removed from a transmit rate-limited or paced queue, the next time another frame could be fetched from that queue is regulated by the rate limits and the inter-packet pacing controls associated with the queue. Installation of these controls is also a privileged operation.
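As a worked example of this pacing rule (our arithmetic and names, not a hardware interface), the earliest time the next frame may leave a paced queue follows from the rate limit and the inter-packet gap:

```c
/* Sketch of the pacing rule described above: after a frame leaves a
 * rate-limited queue, the earliest time the next frame may be fetched
 * is bounded by the queue's rate limit and its inter-packet gap.
 * Names and units are illustrative, not a hardware interface. */
#include <stdint.h>

struct paced_queue {
    uint64_t rate_bps;      /* configured rate limit, bits per second */
    uint64_t min_gap_ns;    /* configured inter-packet pacing gap */
    uint64_t next_fetch_ns; /* earliest time the next frame may leave */
};

static void on_frame_sent(struct paced_queue *q, uint64_t now_ns,
                          uint32_t frame_bytes) {
    /* Serialization time of the frame at the configured rate
     * (rate_bps is assumed non-zero for a rate-limited queue). */
    uint64_t serialize_ns =
        (uint64_t)frame_bytes * 8u * 1000000000ull / q->rate_bps;
    uint64_t gap = serialize_ns > q->min_gap_ns ? serialize_ns
                                                : q->min_gap_ns;
    q->next_fetch_ns = now_ns + gap;
}
```

For instance, at a 100 Mb/s rate limit, a 1,500-byte frame yields a serialization gap of 1500 · 8 / 10^8 s = 120 μs before the next fetch.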


3.4 Control Plane Interface

The interface between an application and the Arrakis control plane is used to request resources from the system and direct I/O flows to and from user programs. The key abstractions presented by this interface are VICs, doorbells, filters, VSAs, and rate specifiers.

An application can create and delete VICs, and associate doorbells with particular events on particular VICs. A doorbell is an IPC end-point used to notify the application that an event (e.g., packet arrival or I/O completion) has occurred, and is discussed below. VICs are hardware resources and so Arrakis must allocate them among applications according to an OS policy. Currently this is done on a first-come-first-served basis, followed by spilling to software emulation (§3.3).

Filters have a type (transmit or receive) and a predicate which corresponds to a convex sub-volume of the packet header space (for example, obtained with a set of mask-and-compare operations). Filters can be used to specify ranges of IP addresses and port numbers associated with valid packets transmitted/received at each VNIC. Filters are a better abstraction for our purposes than a conventional connection identifier (such as a TCP/IP 5-tuple), since they can encode a wider variety of communication patterns, as well as subsuming traditional port allocation and interface specification.

For example, in the “map” phase of a MapReduce job we would like the application to send to, and receive from, an entire class of machines using the same communication end-point, but nevertheless isolate the data comprising the shuffle from other data. As a second example, web servers with a high rate of incoming TCP connections can run into scalability problems processing connection requests [46]. In Arrakis, a single filter can safely express both a listening socket and all subsequent connections to that socket, allowing server-side TCP connection establishment to avoid kernel mediation.

Applications create a filter with a control plane operation. In the common case, a simple higher-level wrapper suffices: filter = create_filter(flags, peerlist, servicelist). flags specifies the filter direction (transmit or receive) and whether the filter refers to the Ethernet, IP, TCP, or UDP header. peerlist is a list of accepted communication peers specified according to the filter type, and servicelist contains a list of accepted service addresses (e.g., port numbers) for the filter. Wildcards are permitted. The call to create_filter returns filter, a kernel-protected capability conferring authority to send or receive packets matching its predicate, and which can then be assigned to a specific queue on a VNIC.

VSAs are acquired and assigned to VSICs in a similar fashion. Finally, a rate specifier can also be assigned to a queue, either to throttle incoming traffic (in the network receive case) or pace outgoing packets and I/O requests. Rate

specifiers and filters associated with a VIC queue can be updated dynamically, but all such updates require mediation from the Arrakis control plane. Our network filters are less expressive than OpenFlow matching tables, in that they do not support priority-based overlapping matches. This is a deliberate choice based on hardware capabilities: NICs today only support simple matching, and to support priorities in the API would lead to unpredictable consumption of hardware resources below the abstraction. Our philosophy is therefore to support expressing such policies only when the hardware can implement them efficiently.
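Putting these pieces together, an application's control-plane setup might look as follows. Only the shape of create_filter(flags, peerlist, servicelist) comes from the text above; the handle types, flag constants, and the attach call are assumptions for illustration:

```c
/* Sketch of control-plane setup. Only create_filter(flags, peerlist,
 * servicelist) is described in the text; every other name here is a
 * hypothetical stand-in. */
#include <stdint.h>

typedef uint64_t filter_t; /* kernel-protected capability */
typedef uint64_t queue_t;  /* handle to a VNIC queue */

enum { FILTER_RECEIVE = 1, FILTER_TRANSMIT = 2, FILTER_UDP = 4 };

filter_t create_filter(int flags, const char *peerlist,
                       const char *servicelist);
int filter_attach(queue_t q, filter_t f); /* bind capability to a queue */

void setup_receive_path(queue_t q) {
    /* Accept UDP packets from any peer in 10.0.0.0/8 addressed to
     * service port 9000; wildcards are permitted per the text. */
    filter_t f = create_filter(FILTER_RECEIVE | FILTER_UDP,
                               "10.0.0.0/8", "9000");
    filter_attach(q, f);
}
```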

3.5 File Name Lookup

A design principle in Arrakis is to separate file naming from implementation. In a traditional system, the fully-qualified filename specifies the file system used to store the file and thus its metadata format. To work around this, many applications build their own metadata indirection inside the file abstraction [28]. Instead, Arrakis provides applications direct control over VSA storage allocation: an application is free to use its VSA to store metadata, directories, and file data. To allow other applications access to its data, an application can export file and directory names to the kernel virtual file system (VFS). To the rest of the VFS, an application-managed file or directory appears like a remote mount point—an indirection to a file system implemented elsewhere. Operations within the file or directory are handled locally, without kernel intervention.

Other applications can gain access to these files in three ways. By default, the Arrakis application library managing the VSA exports a file server interface; other applications can use normal POSIX API calls via user-level RPC to the embedded library file server. This library can also run as a standalone process to provide access when the original application is not active. Just like a regular mounted file system, the library needs to implement only functionality required for file access on its VSA and may choose to skip any POSIX features that it does not directly support.

Second, VSAs can be mapped into multiple processes. If an application, like a virus checker or backup system, has both permission to read the application's metadata and the appropriate library support, it can directly access the file data in the VSA. In this case, access control is done for the entire VSA and not per file or directory.

Finally, the user can direct the originating application to export its data into a standard format, such as a PDF file, stored as a normal file in the kernel-provided file system.

The combination of VFS and library code implement POSIX semantics seamlessly. For example, if execute rights are revoked from a directory, the VFS prevents future traversal of that directory's subtree, but existing RPC connections to parts of the subtree may remain intact until closed. This is akin to a POSIX process retaining a


subdirectory as the current working directory—relative traversals are still permitted.

3.6 Network Data Plane Interface

In Arrakis, applications send and receive network packets by directly communicating with hardware. The data plane interface is therefore implemented in an application library, allowing it to be co-designed with the application [43]. The Arrakis library provides two interfaces to applications. We describe the native Arrakis interface, which departs slightly from the POSIX standard to support true zero-copy I/O; Arrakis also provides a POSIX compatibility layer that supports unmodified applications.

Applications send and receive packets on queues, which have previously been assigned filters as described above. While filters can include IP, TCP, and UDP field predicates, Arrakis does not require the hardware to perform protocol processing, only multiplexing. In our implementation, Arrakis provides a user-space network stack above the data plane interface. This stack is designed to minimize latency and maximize throughput. We maintain a clean separation between three aspects of packet transmission and reception.

Firstly, packets are transferred asynchronously between the network and main memory using conventional DMA techniques using rings of packet buffer descriptors.

Secondly, the application transfers ownership of a transmit packet to the network hardware by enqueuing a chain of buffers onto the hardware descriptor rings, and acquires a received packet by the reverse process. This is performed by two VNIC driver functions. send_packet(queue, packet_array) sends a packet on a queue; the packet is specified by the scatter-gather array packet_array, and must conform to a filter already associated with the queue. receive_packet(queue) = packet receives a packet from a queue and returns a pointer to it. Both operations are asynchronous. packet_done(packet) returns ownership of a received packet to the VNIC.

For optimal performance, the Arrakis stack would interact with the hardware queues not through these calls but directly via compiler-generated, optimized code tailored to the NIC descriptor format. However, the implementation we report on in this paper uses function calls to the driver.

Thirdly, we handle asynchronous notification of events using doorbells associated with queues. Doorbells are delivered directly from hardware to user programs via hardware virtualized interrupts when applications are running and via the control plane to invoke the scheduler when applications are not running. In the latter case, higher latency is tolerable. Doorbells are exposed to Arrakis programs via regular event delivery mechanisms (e.g., a file descriptor event) and are fully integrated with existing I/O multiplexing interfaces (e.g., select). They are useful both to notify an application of general availability of packets in receive queues, as well as a

lightweight notification mechanism for I/O completion and the reception of packets in high-priority queues. This design results in a protocol stack that decouples hardware from software as much as possible using the descriptor rings as a buffer, maximizing throughput and minimizing overhead under high packet rates, yielding low latency.

On top of this native interface, Arrakis provides POSIX-compatible sockets. This compatibility layer allows Arrakis to support unmodified Linux applications. However, we show that performance gains can be achieved by using the asynchronous native interface.
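A sketch of the receive path over this interface is shown below. The three driver calls and their ownership rules come from the text; the doorbell_fd() plumbing and all types are our assumptions:

```c
/* Receive loop over the native data-plane calls named above. The
 * driver calls (receive_packet, packet_done, send_packet) and their
 * ownership rules come from the text; doorbell_fd() and the types
 * are assumptions. */
#include <stddef.h>
#include <sys/select.h>

typedef struct queue queue_t;
typedef struct packet packet_t;

void send_packet(queue_t *q, packet_t *pkt_array); /* async transmit */
packet_t *receive_packet(queue_t *q);              /* NULL if queue empty */
void packet_done(packet_t *p);      /* return buffer ownership to VNIC */
int doorbell_fd(queue_t *q);        /* hypothetical: doorbell as an fd */

void rx_loop(queue_t *rxq) {
    int dfd = doorbell_fd(rxq);
    for (;;) {
        fd_set rfds;
        FD_ZERO(&rfds);
        FD_SET(dfd, &rfds);
        /* Doorbell fires when packets arrive in the receive queue;
         * it integrates with select() per the text. */
        select(dfd + 1, &rfds, NULL, NULL, NULL);

        packet_t *p;
        while ((p = receive_packet(rxq)) != NULL) {
            /* ... inspect the packet in place (zero-copy) ... */
            packet_done(p); /* hand the receive buffer back */
        }
    }
}
```

Note that a buffer handed to send_packet may be reused only after the hardware signals send completion (also via a doorbell); §4.3 discusses the bookkeeping this requires in an application.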

3.7 Storage Data Plane Interface

The low-level storage API provides a set of commands to asynchronously read, write, and flush hardware caches at any offset and of arbitrary size in a VSA via a command queue in the associated VSIC. To do so, the caller provides an array of virtual memory ranges (address and size) in RAM to be read/written, the VSA identifier, queue number, and matching array of ranges (offset and size) within the VSA. The implementation enqueues the corresponding commands to the VSIC, coalescing and reordering commands if this makes sense to the underlying media. I/O completion events are reported using doorbells. On top of this, a POSIX-compliant file system is provided.

We have also designed a library of persistent data structures, Caladan, to take advantage of low-latency storage devices. Persistent data structures can be more efficient than a simple read/write interface provided by file systems. Their drawback is a lack of backwards-compatibility to the POSIX API. Our design goals for persistent data structures are that (1) operations are immediately persistent, (2) the structure is robust versus crash failures, and (3) operations have minimal latency.

We have designed persistent log and queue data structures according to these goals and modified a number of applications to use them (e.g., §4.4). These data structures manage all metadata required for persistence, which allows tailoring of that data to reduce latency. For example, metadata can be allocated along with each data structure entry and persisted in a single hardware write operation. For the log and queue, the only metadata that needs to be kept is where they start and end. Pointers link entries to accommodate wrap-arounds and holes, optimizing for linear access and efficient prefetch of entries. By contrast, a filesystem typically has separate inodes to manage block allocation. The in-memory layout of Caladan structures is as stored, eliminating marshaling.

The log API includes operations to open and close a log, create log entries (for metadata allocation), append them to the log (for persistence), iterate through the log (for reading), and trim the log. The queue API adds a pop operation to combine trimming and reading the queue. Persistence is asynchronous: an append operation returns immediately
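A usage sketch of the log follows. The operations (open/close, create entry, append, iterate, trim) are those listed above; all function names, types, and the log_entry_data() accessor are our assumptions, not Caladan's actual API:

```c
/* Usage sketch of the Caladan persistent log. The operations (open,
 * create entry, append, trim) are those listed in the text; the
 * function names, types, and log_entry_data() accessor are our
 * assumptions, not Caladan's actual API. */
#include <stddef.h>
#include <string.h>

typedef struct plog plog_t;
typedef struct plog_entry plog_entry_t;

plog_t *log_open(const char *name);
plog_entry_t *log_entry_create(plog_t *l, size_t size); /* metadata alloc */
void *log_entry_data(plog_entry_t *e);       /* payload area of the entry */
void log_append(plog_t *l, plog_entry_t *e); /* async; returns at once */
void log_trim(plog_t *l, plog_entry_t *upto);
void log_close(plog_t *l);

void persist_update(plog_t *l, const void *data, size_t len) {
    /* Entry metadata is allocated alongside the data, so one hardware
     * write persists both (per the text); completion is reported
     * asynchronously, e.g. via a doorbell. */
    plog_entry_t *e = log_entry_create(l, len);
    memcpy(log_entry_data(e), data, len); /* stored layout == in-memory */
    log_append(l, e);
}
```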

4 Evaluation

  • What are the major contributors to performance overhead in Arrakis and how do they compare to those of Linux (presented in §2)?
  • Does Arrakis provide better latency and throughput for real-world cloud applications? How does the throughput scale with the number of CPU cores for these workloads?
  • Can Arrakis retain the benefits of user-level application execution and kernel enforcement, while providing high-performance packet-level network I/O?
  • What additional performance gains are possible by departing from the POSIX interface?

We compare the performance of the following OS configurations: Linux kernel version 3.8 (Ubuntu version 13.04), Arrakis using the POSIX interface (Arrakis/P), and Arrakis using its native interface (Arrakis/N).

We tuned Linux network performance by installing the latest ixgbe device driver version 3.17.3 and disabling receive side scaling (RSS) when applications execute on only one processor. RSS spreads packets over several NIC receive queues, but incurs needless coherence overhead on a single core. The changes yield a throughput improvement of 10% over non-tuned Linux. We use the kernel-shipped MegaRAID driver version 6.600.18.00-rc1.

Linux uses a number of performance-enhancing features of the network hardware, which Arrakis does not currently support. Among these features is the use of direct processor cache access by the NIC, TCP and UDP segmentation offload, large receive offload, and network packet header splitting. All of these features can be implemented in Arrakis; thus, our performance comparison is weighted in favor of Linux.

4.1 Server-side Packet Processing Performance

We load the UDP echo benchmark from §2 on the server and use all other machines in the cluster as load generators. These generate 1 KB UDP packets at a fixed rate and record the rate at which their echoes arrive. Each experiment exposes the server to maximum load for 20 seconds.

Shown in Table 1, compared to Linux, Arrakis eliminates two system calls, software demultiplexing overhead, socket buffer locks, and security checks. In Arrakis/N, we additionally eliminate two socket buffer copies. Arrakis/P incurs a total server-side overhead of 1.44 μs, 57% less than Linux. Arrakis/N reduces this overhead to 0.38 μs.

The echo server is able to add a configurable delay before sending back each packet. We use this delay to simulate additional application-level processing time at the server. Figure 4 shows the average throughput attained by each system over various such delays; the theoretical line rate is 1.26M pps with zero processing. In the best case (no additional processing time), Arrakis/P achieves 2.3× the throughput of Linux.

[Figure 4: Average UDP echo throughput for packets with 1024-byte payload over various processing times. The top y-axis value shows theoretical maximum throughput on the 10G network. Error bars in this and following figures show min/max measured over 5 repeats of the experiment.]

By departing from POSIX, Arrakis/N achieves 3.9× the throughput of Linux. The relative benefit of Arrakis disappears at 64 μs. To gauge how close Arrakis comes to the maximum possible throughput, we embedded a minimal echo server directly into the NIC device driver, eliminating any remaining API overhead. Arrakis/N achieves 94% of the driver limit.

4.2 Memcached Key-Value Store

Memcached is an in-memory key-value store used by many cloud applications. It incurs a processing overhead of 2–3 μs for an average object fetch request, comparable to the overhead of OS kernel network processing.

We benchmark memcached 1.4.15 by sending it requests at a constant rate via its binary UDP protocol, using a tool similar to the popular memslap benchmark [2]. We configure a workload pattern of 90% fetch and 10% store requests on a pre-generated range of 128 different keys of a fixed size of 64 bytes and a value size of 1 KB, in line with real cloud deployments [7].

To measure network stack scalability for multiple cores, we vary the number of memcached server processes. Each server process executes independently on its own port number, such that measurements are not impacted by scalability bottlenecks in memcached itself, and we distribute load equally among the available memcached instances. On Linux, memcached processes share the kernel-level network stack. On Arrakis, each process obtains its own VNIC with an independent set of packet queues, each controlled by an independent instance of Extaris.

Figure 5 shows that memcached on Arrakis/P achieves 1.7× the throughput of Linux on one core, and attains near line-rate at 4 CPU cores. The slightly lower throughput on all 6 cores is due to contention with Barrelfish system management processes [10]. By contrast, Linux throughput nearly plateaus beyond two cores. A single, multi-threaded memcached instance shows no noticeable throughput difference to the multi-process scenario. This is not surprising as memcached is optimized to scale well.


[Figure 5: Average memcached transaction throughput and scalability. Top y-axis value = 10Gb/s.]

To conclude, the separation of network stack and application in Linux provides only limited information about the application's packet processing and poses difficulty assigning threads to the right CPU core. The resulting cache misses and socket lock contention are responsible for much of the Linux overhead. In Arrakis, the application is in control of the whole packet processing flow: assignment of packets to packet queues, packet queues to cores, and finally the scheduling of its own threads on these cores. The network stack thus does not need to acquire any locks, and packet data is always available in the right processor cache.

Memcached is also an excellent example of the communication endpoint abstraction: we can create hardware filters to allow packet reception and transmission only between the memcached server and a designated list of client machines that are part of the cloud application. In the Linux case, we have to filter connections in the application.

4.3 Arrakis Native Interface Case Study

As a case study, we modified memcached to make use of Arrakis/N. In total, 74 lines of code were changed, with 11 pertaining to the receive side, and 63 to the send side. On the receive side, the changes involve eliminating memcached’s receive buffer and working directly with pointers to packet buffers provided by Extaris, as well as returning completed buffers to Extaris. The changes increase average throughput by 9% over Arrakis/P. On the send side, changes include allocating a number of send buffers to allow buffering of responses until fully sent by the NIC, which now must be done within memcached itself. They also involve the addition of reference counts to hash table entries and send buffers to determine when it is safe to reuse buffers and hash table entries that might otherwise still be processed by the NIC. We gain an additional 10% average throughput when using the send side API in addition to the receive side API.
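The buffer-reuse bookkeeping is standard reference counting: a send buffer may be recycled only after both the application and the NIC completion path have released it. A generic sketch (ours, not memcached's code):

```c
/* Generic sketch of the reference counting described above: a send
 * buffer may be reused only after both the application and the NIC
 * (via send completion) release it. Illustrative, not memcached code. */
#include <stdatomic.h>
#include <stdlib.h>

struct send_buf {
    atomic_int refs; /* one for the app + one per in-flight send */
    char data[2048];
};

static void buf_get(struct send_buf *b) {
    atomic_fetch_add(&b->refs, 1); /* e.g., when handing to the NIC */
}

static void buf_put(struct send_buf *b) {
    /* Last release (application done AND send completed) frees it. */
    if (atomic_fetch_sub(&b->refs, 1) == 1)
        free(b);
}
```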

4.4 Redis NoSQL Store

Redis [18] extends the memcached model from a cache to a persistent NoSQL object store. Our results in Table 2 show that Redis operations—while more laborious than Memcached—are still dominated by I/O stack overheads.

[Figure 6: Average Redis transaction throughput for GET and SET operations. The Arrakis/P [15us] and Linux/Caladan configurations apply only to SET operations.]

Redis can be used in the same scenario as Memcached and we follow an identical experiment setup, using Redis version 2.8.5. We use the benchmarking tool distributed with Redis and configure it to execute GET and SET requests in two separate benchmarks to a range of 65, random keys with a value size of 1,024 bytes, persisting each SET operation individually, with a total concurrency of 1,600 connections from 16 benchmark clients executing on the client machines. Redis is single-threaded, so we investigate only single-core performance.

The Arrakis version of Redis uses Caladan. We changed 109 lines in the application to manage and exchange records with the Caladan log instead of a file. We did not eliminate Redis' marshaling overhead (cf. Table 2). If we did, we would save another 2.43 μs of write latency.

Due to the fast I/O stacks, Redis' read performance mirrors that of Memcached and write latency improves by 63%, while write throughput improves vastly, by 9×. To investigate what would happen if we had access to state-of-the-art storage hardware, we simulate (via a write-delaying RAM disk) a storage backend with 15 μs write latency, such as the ioDrive2 [24]. Write throughput improves by another 1.6×, nearing Linux read throughput.

Both network and disk virtualization is needed for good Redis performance. We tested this by porting Caladan to run on Linux, with the unmodified Linux network stack. This improved write throughput by only 5× compared to Linux, compared to 9× on Arrakis. Together, the combination of data-plane network and storage stacks can yield large benefits in latency and throughput for both read and write-heavy workloads. The tight integration of storage and data structure in Caladan allows for a number of latency-saving techniques that eliminate marshaling overhead and book-keeping of journals for file system metadata, and can offset storage allocation overhead. These benefits will increase further with upcoming hardware improvements.

4.5 HTTP Load Balancer

To aid scalability of web services, HTTP load balancers are often deployed to distribute client load over a number


[Figure 9: Memcached transaction throughput over 5 instances (colors), with and without rate limiting.]

and its performance is 2.6× that of Linux. We also see an interesting effect: the Linux implementation does not scale at all in this configuration. The reason for this is the raw IP sockets, which carry no connection information. Without an indication of which connections to steer to which sockets, each middlebox instance has to look at each incoming packet to determine whether it should handle it. This added overhead outweighs any performance gained via parallelism. In Arrakis, we can configure the hardware filters to steer packets based on packet header information and thus scale until we quickly hit the NIC throughput limit at two cores.

We conclude that Arrakis allows us to retain the safety, abstraction, and management benefits of software development at user-level, while vastly improving the performance of low-level packet operations. Filters provide a versatile interface to steer packet workloads based on arbitrary information stored in packet headers to effectively leverage multi-core parallelism, regardless of protocol specifics.

4.7 Performance Isolation

We show that QoS limits can be enforced in Arrakis, by simulating a simple multi-tenant scenario with 5 memcached instances pinned to distinct cores, to minimize processor crosstalk. One tenant has an SLA that allows it to send up to 100Mb/s. The other tenants are not limited. We use rate specifiers in Arrakis to set the transmit rate limit of the VNIC of the limited process. On Linux, we use queuing disciplines [29] (specifically, HTB [20]) to rate limit the source port of the equivalent process.

We repeat the experiment from §4.2, plotting the throughput achieved by each memcached instance, shown in Figure 9. The bottom-most process (barely visible) is rate-limited to 100Mb/s in the experiment shown on the right hand side of the figure. All runs remained within the error bars shown in Figure 5. When rate-limiting, a bit of the total throughput is lost for both OSes because clients keep sending packets at the same high rate. These consume network bandwidth, even when later dropped due to the rate limit.

We conclude that it is possible to provide the same kind of QoS enforcement—in this case, rate limiting—in Arrakis, as in Linux. Thus, we are able to retain the protection and policing benefits of user-level application execution, while providing improved network performance.

5 Discussion

In this section, we discuss how we can extend the Arrakis model to apply to virtualized guest environments, as well as to interprocessor interrupts.

5.1 Arrakis as Virtualized Guest

Arrakis' model can be extended to virtualized environments. Making Arrakis a host in this environment is straightforward—this is what the technology was originally designed for. The best way to support Arrakis as a guest is by moving the control plane into the virtual machine monitor (VMM). Arrakis guest applications can then allocate virtual interface cards directly from the VMM. A simple way of accomplishing this is by pre-allocating a number of virtual interface cards in the VMM to the guest and letting applications pick only from this pre-allocated set, without requiring a special interface to the VMM. The hardware limits apply to a virtualized environment in the same way as they do in the regular Arrakis environment. We believe the current limits on virtual adapters (typically 64) to be balanced with the number of available processing resources.

5.2 Virtualized Interprocessor Interrupts

To date, most parallel applications are designed assuming that shared memory is (relatively) efficient, while interprocessor signaling is (relatively) inefficient. A cache miss to data written by another core is handled in hardware, while alerting a thread on another processor requires kernel mediation on both the sending and receiving side. The kernel is involved even when signaling an event between two threads running inside the same application. With kernel bypass, a remote cache miss and a remote event delivery are similar in cost at a physical level.

Modern hardware already provides the operating system the ability to control how device interrupts are routed. To safely deliver an interrupt within an application, without kernel mediation, requires that the hardware add access control. With this, the kernel could configure the interrupt routing hardware to permit signaling among cores running the same application, trapping to the kernel only when signaling between different applications.

6 Related Work

SPIN [14] and Exokernel [25] reduced shared kernel components to allow each application to have customized operating system management. Nemesis [15] reduces shared components to provide more performance isolation for multimedia applications. All three mediated I/O in the kernel. Relative to these systems, Arrakis shows that


application customization is consistent with very high performance.

Following U-Net, a sequence of hardware standards such as VIA [19] and Infiniband [30] addressed the challenge of minimizing, or eliminating entirely, operating system involvement in sending and receiving network packets in the common case. To a large extent, these systems have focused on the needs of parallel applications for high-throughput, low-overhead communication. Arrakis supports a more general networking model including client-server and peer-to-peer communication.

Our work was inspired in part by previous work on Dune [11], which used nested paging to provide support for user-level control over virtual memory, and Exitless IPIs [26], which presented a technique to demultiplex hardware interrupts between virtual machines without mediation from the virtual machine monitor.

Netmap [49] implements high throughput network I/O by doing DMAs directly from user space. Sends and receives still require system calls, as the OS needs to do permission checks on every operation. Throughput is achieved at the expense of latency, by batching reads and writes. Similarly, IX [12] implements a custom, per-application network stack in a protected domain accessed with batched system calls. Arrakis eliminates the need for batching by handling operations at user level in the common case.

Concurrently with our work, mTCP uses Intel's DPDK interface to implement a scalable user-level TCP [36]; mTCP focuses on scalable network stack design, while our focus is on the operating system API for general client-server applications. We expect the performance of Extaris and mTCP to be similar. OpenOnload [50] is a hybrid user- and kernel-level network stack. It is completely binary-compatible with existing Linux applications; to support this, it has to keep a significant amount of socket state in the kernel and supports only a traditional socket API. Arrakis, in contrast, allows applications to access the network hardware directly and does not impose API constraints.

Recent work has focused on reducing the overheads imposed by traditional file systems and block device drivers, given the availability of low latency persistent memory. DFS [37] and PMFS [23] are file systems designed for these devices. DFS relies on the flash storage layer for functionality traditionally implemented in the OS, such as block allocation. PMFS exploits the byte-addressability of persistent memory, avoiding the block layer. Both DFS and PMFS are implemented as kernel-level file systems, exposing POSIX interfaces. They focus on optimizing file system and device driver design for specific technologies, while Arrakis investigates how to allow applications fast, customized device access.

Moneta-D [16] is a hardware and software platform for fast, user-level I/O to solid-state devices. The hardware and operating system cooperate to track permissions on hardware extents, while a user-space driver communicates with the device through a virtual interface. Applications interact with the system through a traditional file system. Moneta-D is optimized for large files, since each open operation requires communication with the OS to check permissions; Arrakis does not have this issue, since applications have complete control over their VSAs. Aerie [53] proposes an architecture in which multiple processes communicate with a trusted user-space file system service for file metadata and lock operations, while directly accessing the hardware for reads and data-only writes. Arrakis provides more flexibility than Aerie, since storage solutions can be integrated tightly with applications rather than provided in a shared service, allowing for the development of higher-level abstractions, such as persistent data structures.

7 Conclusion

In this paper, we described and evaluated Arrakis, a new operating system designed to remove the kernel from the I/O data path without compromising process isolation. Unlike a traditional operating system, which mediates all I/O operations to enforce process isolation and resource limits, Arrakis uses device hardware to deliver I/O directly to a customized user-level library. The Arrakis kernel operates in the control plane, configuring the hardware to limit application misbehavior.

To demonstrate the practicality of our approach, we have implemented Arrakis on commercially available network and storage hardware and used it to benchmark several typical server workloads. We are able to show that protection and high performance are not contradictory: end-to-end client read and write latency to the Redis persistent NoSQL store is 2–5× faster and write throughput 9× higher on Arrakis than on a well-tuned Linux implementation.

Acknowledgments

This work was supported by NetApp, Google, and the National Science Foundation. We would like to thank the anonymous reviewers and our shepherd, Emmett Witchel, for their comments and feedback. We also thank Oleg Godunok for implementing the IOMMU driver, Antoine Kaufmann for implementing MSI-X support, and Taesoo Kim for implementing interrupt support into Extaris.

References

[1] http://www.barrelfish.org/.

[2] http://www.libmemcached.org/.

[3] http://haproxy.1wt.eu.

[4] Scaling in the Linux networking stack. https://www.kernel.org/doc/Documentation/networking/scaling.txt.


[33] Intel Corporation. Intel RAID Controllers RS3DC080 and RS3DC040, Aug 2013. Product Brief. http://www.intel.com/content/dam/www/public/us/en/documents/product-briefs/raid-controller-rs3dc-brief.pdf.

[34] Intel Corporation. Intel virtualization technology for directed I/O architecture specification. Technical Report Order Number: D51397-006, Intel Corporation, Sep 2013.

[35] Intel Corporation. NVM Express, revision 1.1a edition, Sep 2013. http://www.nvmexpress.org/wp-content/uploads/NVM-Express-1_1a.pdf.

[36] E. Jeong, S. Woo, M. Jamshed, H. Jeong, S. Ihm, D. Han, and K. Park. mTCP: A highly scalable user-level TCP stack for multicore systems. In NSDI, 2014.

[37] W. K. Josephson, L. A. Bongo, K. Li, and D. Flynn. DFS: A file system for virtualized flash storage. Trans. Storage, 6(3):14:1–14:25, Sep 2010.

[38] P. Kutch. PCI-SIG SR-IOV primer: An introduction to SR-IOV technology. Intel application note, 321211–002, Jan 2011.

[39] I. M. Leslie, D. McAuley, R. Black, T. Roscoe, P. Barham, D. Evers, R. Fairbairns, and E. Hyden. The design and implementation of an operating system to support distributed multimedia applications. IEEE J. Sel. A. Commun., 14(7):1280–1297, Sep 1996.

[40] LSI Corporation. LSISAS2308 PCI Express to 8-Port 6Gb/s SAS/SATA Controller, Feb 2010. Product Brief. http://www.lsi.com/downloads/Public/SAS%20ICs/LSI_PB_SAS2308.pdf.

[41] LSI Corporation. LSISAS3008 PCI Express to 8-Port 12Gb/s SAS/SATA Controller, Feb 2014. Product Brief. http://www.lsi.com/downloads/Public/SAS%20ICs/LSI_PB_SAS3008.pdf.

[42] lwIP. http://savannah.nongnu.org/projects/lwip/.

[43] I. Marinos, R. N. M. Watson, and M. Handley. Network stack specialization for performance. In SIGCOMM, 2014.

[44] D. Mosberger and L. L. Peterson. Making paths explicit in the Scout operating system. In OSDI, 1996.

[45] V. S. Pai, P. Druschel, and W. Zwaenepoel. IO-Lite: A unified I/O buffering and caching system. In OSDI, 1999.

[46] A. Pesterev, J. Strauss, N. Zeldovich, and R. T. Morris. Improving network connection locality on multicore systems. In EuroSys, 2012.

[47] S. Radhakrishnan, Y. Geng, V. Jeyakumar, A. Kabbani, G. Porter, and A. Vahdat. SENIC: Scalable NIC for end-host rate limiting. In NSDI, 2014.

[48] RDMA Consortium. Architectural specifications for RDMA over TCP/IP. http://www.rdmaconsortium.org/.

[49] L. Rizzo. Netmap: A novel framework for fast packet I/O. In USENIX ATC, 2012.

[50] SolarFlare Communications, Inc. OpenOnload. http://www.openonload.org/.

[51] Solarflare Communications, Inc. Solarflare SFN5122F Dual-Port 10GbE Enterprise Server Adapter, 2010.

[52] A. Trivedi, P. Stuedi, B. Metzler, R. Pletka, B. G. Fitch, and T. R. Gross. Unified high-performance I/O: One stack to rule them all. In HotOS, 2013.

[53] H. Volos, S. Nalli, S. Panneerselvam, V. Varadarajan, P. Saxena, and M. M. Swift. Aerie: Flexible file-system interfaces to storage-class memory. In EuroSys, 2014.

[54] T. von Eicken, A. Basu, V. Buch, and W. Vogels. U-Net: A user-level network interface for parallel and distributed computing. In SOSP, 1995.