2001 USENIX Annual Technical Conference, June 25-30, 2001, Boston, MA

Pp. 235–244 of the Proceedings

The X Resize and Rotate Extension - RandR

Jim Gettys
Cambridge Research Laboratory, Compaq Computer Corporation.
Jim.Gettys@Compaq.com -
Keith Packard
XFree86 Core Team, SuSE Inc.
keithp@keithp.com

Abstract

The X Window System protocol, Version 11, was deliberately designed to be extensible, to provide for both anticipated and unanticipated needs. The X11 core did not anticipate that the properties of X server screens might need to change dynamically, as occurs frequently with desktops, laptops and hand held computers not envisioned in the 1980's.

The Resize and Rotate extension (RandR) is a very small set of client and server extensions designed to allow clients to modify the size, accelerated visuals and rotation of an X screen. RandR also has provisions for informing clients when screens have been resized or rotated and it allows clients to discover which visuals have hardware acceleration available.

RandR needs to be discussed in concert with recent developments in X server implementation and the new Render extension to understand the implications of the aggregate. In isolation, RandR seems to provide a limited but useful improvement, but together with the Render extension and reimplementation of the X server rendering code, RandR provides part of a key change in X Window System capabilities. We believe this will enable much easier migration and replication of applications between X servers for pervasive computing. This paper also describes this vision.

Introduction

The X Window System [SG92] extension framework has served us well, allowing significant extensions to the X design over the last 14 years. This extension framework has encouraged new functionality to be introduced, and has encapsulated optional functionality permitting clean non-universal deployment. Probably more importantly, the extension framework has isolated ``bad'' ideas from the core X functionality allowing their eventual atrophy into irrelevance. ¹

At the time X11 was designed, we did not anticipate that the properties of X server screens might need to change dynamically. However, this routinely occurs today with laptops, and handheld computers.

Even most current desktop systems share this need. As there is usually a significant performance and display memory tradeoff between screen resolution and depths, the ability to change display characteristics without restarting your X session is needed by general users (particularly gamers).

Laptops and handheld computers need to change their screen size to drive external monitors at different resolutions than their built in screens. Permitting these portable devices to rotate their display provides for better use of screen real-estate in applications that prefer displays with the rotated aspect ratio. It is convenient to flip a laptop sideways when reading documents intended for paper presentation--the aspect ration more closely matches the document leaving less wasted space. Projectors using mirror systems may find flipped and/or rotated screens very useful. From the applications perspective, rotating the screen is essentially the same as changing the screen size.

We expect that most applications can remain relatively oblivious to screen size changes, though simple modifications may be required in toolkits and window managers. The only real challenge which screen size change presents to most applications is in ensuring that menus stay visible on the screen. Menus are an important special case as they are typically the only user interface elements not managed by the window manager. Applications must remain informed about the size of the screen to ensure that menus do not extend beyond the boundaries of the screen making portions inaccessible to the user. As menus are generally provided by toolkits, rather than directly by applications, simple changes within the toolkits should resolve these problems. Toolkits may also wish to take advantage of acceleration information provided by RandR to maximize performance.

Rendering

The true physical ``depth'' of a display's frame buffer may need to change to either support different screen resolutions due to limitations in the size of display memory or for performance. Since applications currently have a static view of visual types available on a server, and changing visuals dynamically would likely break too many applications, the visuals advertised by the X server will not change when frame buffer depth changes: instead, the rendering of the screen may need to change from hardware accelerated rendering to software.

A key enabler of this extension is the advent of software frame buffer rendering code (called ``fb'', to distinguishes it from the ``mfb'' and ``cfb'' implementations used in older X server implementations) that can support all depths simultaneously (while being dramatically more compact and as fast or faster than the old frame buffer code on current processors). Recent machines are fast enough for this to provide an adequate level of performance for all but the most performance critical applications.

The Render extension enables arbitrary conversions among pixel formats allowing, for example, the display of 24 bit RGB data on a 16 bit display. RandR needs this to convert all pixel formats to one supported by the display hardware.

RandR permits the list of visuals that are accelerated by the hardware to change on the fly. Toolkits and/or occasionally clients may want to follow this information and dynamically select which visual is being used. This extension can inform clients when such changes occur.

Migration and Replication

Migration takes an existing running application and transfers it's display to another device. Replication takes that same application and duplicates it's output (and input) on multiple devices.

Motivation

We believe the ability to migrate applications between X servers is a critical component for the vision of pervasive computing: applications should be able to migrate between displays routinely as users move and interact with handheld, desktop, appliance and projector screens in the global internet environment. Additionally, applications should be able to survive the loss of their server connection. At some future time they should reconnect to either the same or other device.

For example, you should be able to tell an application running on your handheld computer to use a nearby desktop display, keyboard and mouse, or a projector on the wall. This should not require stopping and starting the application. You should be able to go home, and decide to import applications you left running at work. There are obviously security, authentication and authorization problems left to work out, but these are generally independent of the base window system.

Difficulties Migrating X Applications

Migration and replication have traditionally been difficult in X due a number of interrelated factors:

pseudocolor displays - There was no guarantee that the color you needed or even an approximation of it would be available on the destination server.
pixmap depths - servers have typically only supported a few of the permitted possible pixmap depths; the software frame buffer code typically only supported 1, 8 and 32 bit displays, and would not be present if the server did not have the capability to be used at that depth.
frame buffer depth - lack of any capability to emulate non-native frame buffer formats.

These taken together meant that applications and toolkits had to be carefully written to survive migration or replication, and that applications on most common tookits could not migrate at all. Retrofitting the toolkits was very difficult afterwards due to these issues. Server resources (e.g. pixmaps) might need recomputation to alternate depths, and applications often depend on particular visual types be available throughout their execution. Pixel values do not easily map between servers for pseudocolor visual types, and are not even present with pixmaps, which do not have inherent color information.

While migration of applications between X servers has always possible in X, it was so difficult that unless prior thought were taken both toolkits and applications it would not occur. In practice, it is extremely rare, since toolkit writers did not think it important (at least at the beginning of their projects).

As a result, in practice, migration and replication has at best happened rarely and has been fraught with problems. Only a few research toolkits like Trestle [MN91] have been built with these capabilities, and the retrofit into existing toolkits would have been so difficult as to be impossible. In our opinion, this problem has been one of the most painful limitations in X11 protocol's design. Only a limited number of applications have been migratable and replicable, for example, GNU/emacs [Sta00] using the obscure ``make-frame-on-display'' function.

Proxy server approaches [GWY94] came closest to a general solution, but have serious drawbacks. For best performance, applications should be communicating directly with the X server without a proxy. The proxy must reformat much of the X protocol traffic and even still, ensuring that the result remains compliant with the X protocol specification is quite difficult. Even with these difficulties, proxy servers remain useful.

Another problem is the large variation of display sizes, applications that fit easily on a desktop screen may not fit on a handheld or appliance screen. This problem has intensified over the last 10 years as X servers have been deployed on progressively smaller platforms. Proxy server based replication can't make applications adapt to this change, only toolkit based solutions can. Replicated applications are the basis of important real time collaborative applications, and this ability should span handhelds, desktops and larger displays.

Promoting Toolkit Level Migration and Replication

The best solution is for applications and toolkits to support migration and replication themselves. Making it easy for existing toolkit implementers to implement migration and replication is key to migration and replication becoming ubiquitous. The solutions we have outlined provide for more uniformity among the capabilities of different X servers. This increased uniformity should simplify the implementation of migration and replication in existing toolkits and applications that were not designed to cope with multiple disparate X servers.

It has taken more than five years longer for pseudocolor to phase out than we had anticipated. Instead of increasing in capability, display hardware architecture remained relatively constant while costs reduced dramatically. Fortunately, most desktop environments are finally regularly running in truecolor mode. Very few applications now require pseudocolor visual types to function, whereas pseudocolor was was dominant 10 years ago. The dominant applications that people care about now work with fixed color maps. The lack of pseudocolor displays simplifies the translation among color representations.

Additionally, the Render extension moves applications toward a more abstract representation of color, pixel values become much less important as applications focus on color while the extension automatically translates between the various representations of the data.

Modern toolkits like GTK+ 2.0 and Qt [Dal01] now isolate applications entirely from visual representations from X, and should become much easier to adapt to enable server migration and replication. This would enable all new X applications to be used in a fundamentally more interesting fashion. Even applications entirely based on Xt based widgets or other toolkit should be so migratable with some work, as we have carefully avoided violating the invariants found in most applications we are aware of. We strongly advocate the toolkit work to complete this vision.

RandR itself only provides a small part of the solution (the ability to determine which visuals are accelerated after a change event). The deployment of the RandR and the Render extensions in concert with ``fb'', but more importantly the internal X server implementation required to support them results in X servers offering all pixmap depths. With the phase out of pseudocolor visual types, this should dramatically reduce the variability of server configurations encountered by toolkits. Rather than few if any servers supporting all depths, all servers will eventually support all depths. So we expect to see a great increase in uniformity between X server implementations as these servers deploy, greatly easing the problems faced by toolkit implementers. See the section below on Server implementation for details.

As of the time of this writing, we have not demonstrated application migration using these facilities. We plan to do so soon.

Description

Clients can select for RRScreenChange events to be informed if certain properties of a screen have changed.

The root depth, visual and colormap cannot change when a screen is reconfigured to avoid confusing naive clients. Instead, the X server may re-render onto a different depth of framebuffer, and therefore the visuals that can be accelerated by hardware may change. Clients may therefore wish to be informed when these changes occur, and select other visuals for rendering.

Any server supporting this extension will not list the same visual id at more than one depth. This is to make a visual id uniquely identify a depth.

The core protocol allows the same visual ID to be reported for multiple screens, but as the sample X server implementation does not do so and as there is no change in functionality by making this restriction, the RandR extension gains quite a bit of simplicity by enforcing this restriction.

Suppose a non-RandR-aware client has a window on a non-default depth 24 visual, and is switched to a higher resolution, where depth 24 is not supported. How can this work?

In this and similar cases, the window will be rendered using software and updated asynchronously to the display using the Render extension capabilities for blting between depths. Note that your performance may suffer, although experience with the shadow frame buffer implementation in the XFree86 server shows that the hit is not as bad as you might think; the real frame buffer is never being read, only written.

For simplicity, RandR uses the ``acceleration'' Boolean value instead of the more complex integer for visual quality that the Double Buffer Extension [EW94] uses. Applications will search for ``accelerated'' visuals which meet their requirements with the assumption that non-accelerated visuals will be implemented using the method described above.

Implementation

During the implementation of RandR within XFree86, several distinct issues needed addressing:

Multiple Depths. RandR requires that all possible depths be available all of the time.
Rotation. As hardware doesn't normally support this, software will have to rotate all rasterization.
Size and Depth Changing. Changing the size requires recomputation of window clip lists, changing the depth requires reprogramming the hardware.

Each was managed without significant new code by taking advantage of recent advancements within the XFree86 distribution.

Supporting Additional Depths

Because of the pixel-value oriented rasterization model in the core X protocol, the X server is essentially required to render using precisely the same pixel format as is advertised to applications. This implies that when the hardware frame buffer format does not match the visual, the server must store the pixels off screen in their advertised format. This ensures that raster operations that directly manipulate pixel values operate correctly and that applications can continue to use GetImage and retrieve precisely the right values.

This is implemented using the XFree86 shadow frame buffer code. The shadow frame buffer code was originally designed to allow simple X server porting to frame buffers using a format not supported by the frame buffer code. It works by creating a virtual frame buffer in application memory. All rendering operations are directed to this virtual frame buffer and the areas affected by these operations are tracked by the shadow frame buffer code. The hardware frame buffer is periodically updated by copying data from the shadow frame buffer as seen in Figure 1.

**Figure 1:** Shadow Frame Buffer Operation
$\begin{figure}\begin{center} \epsfig{figure=shadow.eps}\end{center}\end{figure}$

The shadow frame buffer replaces the need for a complete rendering implementation with a simple copy operation from the virtual frame buffer to the hardware. While rendering within the the shadow frame buffer cannot be accelerated with the video hardware, the fact that the frame buffer lives in regular memory and not across a PCI or AGP bus means that performance is generally acceptable. Because damaged regions are batched together during periodic updates, the bandwidth of the bus doesn't significantly impact overall performance.

RandR takes this shadow frame buffer implementation and creates a virtual frame buffer for each depth not supported by the hardware. As seen in Figure 2, copying the data from the virtual frame buffer to the hardware involves converting the format of the data to match the hardware, but as the original pixel data are preserved in the shadow frame buffer, the display remains correct even in the presence of complicated raster operations. Depths supported by the hardware remain implemented directly on the hardware with no intermediate shadow frame buffer. While this architecture could use the additional frame buffers to eliminate some window damage, the current implementation doesn't perform this optimization.

**Figure 2:** Supporting Multiple Depths
$\begin{figure}\begin{center} \epsfig{figure=multi.eps}\end{center}\end{figure}$

Using the shadow frame buffer in this way presumes that the X server can already render to the necessary formats. One recent addition to the XFree86 X server is ``fb'', a new implementation of simple frame buffer rendering code. By taking advantage of changes in the relationship between CPU and memory performance, fb packs support for all X pixel sizes into a single implementation. This permits every X server to support all pixel sizes without any effort, something which had been quite difficult in the past. RandR takes advantage of this to provide simultaneous support for all of the displayable pixel sizes.

Displaying the contents of these shadow frame buffers involves converting the pixel values from one format to another format while preserving the RGB values. The Render extension already provides this capability; all operations within that extension accept operands in arbitrary formats. RandR takes advantage of this functionality to perform the format conversion when updating the hardware frame buffer with the contents of the virtual frame buffers in.

Rotating the Screen

As hardware doesn't generally support changing the order that pixels are read out of memory when generating the screen image, rotating the screen involves changing the software to rotate the image within the frame buffer.

There are two possible implementations for this; the most obvious is to rotate all of the rendering requests directed at the screen. This turns out to be somewhat complicated as there are many operations that involve interactions between the screen and off-screen data and each one must rotate the data.

While this may eventually be implemented, the current RandR implementation uses the shadow frame buffer code as described above to keep a copy of the screen in ``normal'' format; updates to the hardware frame buffer rotate the image to orient the view correctly. This implementation causes a slight loss of performance as no rendering is accelerated and also uses additional main memory for a copy of the frame bufffer.

Changing the Screen Size and Depth

The initial implementation for RandR has been done using the Tiny-X architecture within the XFree86 X server. This alternate driver mechanism provides a minimal X server which eases development of new extensions involving interactions between the device drivers and the extension. RandR requires some significant interactions to switch screen sizes and depths. The intent is to migrate this architecture into the core XFree86 X server when the interfaces have stabilized.

Tiny-X already supports dynamically reconfiguring the size and depth of the video hardware; this capability is exported to the user by exposing multiple ``screens'' sharing the same physical video card. Activating a new screen involves disabling rendering to the old screen, reprogramming the video hardware and enabling rendering for windows on the new screen.

Extending this to support RandR is straightforward: disable rendering to the current screen, reprogram the hardware for the new configuration and re-enable rendering. When the depth changes, virtual frame buffers for the old hardware depth become the targets for rendering at those depths. The new depth receives direct access to the hardware while it's virtual frame buffer is disposed of.

X Library Implementation

We envision Xlib will always select for RRScreenChange events on all screens, and that it will change the screen structure automatically when such events arrive. (We will experimentally determine if this seems to confuse clients. By making the restriction that the default depth and visual cannot change when a screen is changed, we believe most clients will not need modification to support screen changes.)

We expect that most clients and toolkits will be oblivious to changes to the screen stucture, as they generally use the values in the connections Display structure directly. By updating on the fly, we believe pop-up menus and other pop up windows will position themselves correctly in the face of screen configuration changes (the issue is ensuring that pop-ups are visible on the reconfigured screen).

Advanced toolkits may wish to use the facilities of the extension to determine which visuals are accelerated, and possibly change to use them.

The RandR Extension

The extension itself is short enough that the inclusion of most of it is worthwhile (we have elided the encoding). The version in this paper is Version .91, and is not (yet) a standard of any kind, and is certain to change before being finalized. We expect it is likely some additional change may be required as we gain implementation experience.

The specification methodology follows that of other X Window System protocol design specifications.

Types

The following types are used in the request and event definitions in subsequent sections:

ROTATION { 0, 90, 180, 270 }. Degrees clockwise rotation from native frame buffer orientation
VISUALSET LISTofVISUALID
VISUALSETID CARD16 index into VISUALSET
SETofVISUALSET LISTofVISUALSETID
SETofVISUALSETID CARD16 index into SETOFVISUALSET
SIZES [width-in-pixels: CARD16
height-in-pixels: CARD16
width-in-mm: CARD16
height-in-mm: CARD16
visual-group-id: SETofVISUALSETID]
SIZESET LISTofSIZES
SIZESETID CARD16 index into SIZESET

Requests

RRQueryVersion

clientMajorVersion CARD16

clientMinorVersion CARD16

serverMajorVersion CARD16

serverMinorVersion CARD16

The version numbers are an escape hatch in case future revisions of the protocol are necessary. The major version must increment for incompatible changes, and the minor version SHOULD increment for small upward compatible changes. Unrecognized requests or extra arguments to a request within a major version MUST be ignored. Barring changes, the major version will be 1, and the minor version will be 0.

A server may support multiple versions of the extension: the reported version is the one which will be used.

RRGetScreenInfo

drawable Drawable

size-set SIZESET

size-set-index SIZESETID

visual-set SETOFVISUALSET

visual-set-index VISUALSETID

accelerated LISTofVISUALSET

rotations-possible LISTofROTATIONS

rotation ROTATION

timestamp TIMESTAMP

Drawable

The set of visuals advertised by the X server can never change. This is to prevent confusion of naive applications that may presume that the visual type and depth of the root is fixed for all time: instead, the X server will rerender the windows on a changed depth, though performance may be degraded unless the client afterwards selects a visual known to be hardware accelerated.

Accelerated is a list of the various possible configurations for hardware assisted visual support on the screen associated with the specified drawable. Hardware assisted visuals can be expected to be more efficient than non-assisted visuals and may provide more timely feedback for graphics requests.

The visual-set-index indicates which visual set is currently being used to render the screen.

This request returns possible size configurations on the screen associated with the specified drawable.

Modern toolkits SHOULD use RHardwareSelectInput to be notified via a RRHardwareVisualChange event, so that they can change visual types and/or depths and continue to receive the benefits of hardware acceleration.

size-set-index, visual-set-index, rotation, indicate which sizeset entry is being used, which visualset entry is being used, the current screen rotation.

The rotations-possible values indicate which rotations are supported by this server for this screen.

The timestamp indicates when the size-set information last changed: requests to set the screen will fail unless the timestamp indicates that the information the client is using is up to date, to ensure clients can be well behaved in the face of race conditions.

Note that much of this information is provided to enable hot-swapping of display cards, which can already occur on handheld computers using PCMCIA and may occur in the future on other busses. Our experience over X's history is that items we thought were static have often become dynamic as technology changes, so we are designing with the presumption that almost anything about the X server could change.

RRSetScreenConfig

draw DRAWABLE

size-set-index SIZESETID

visual-set-index VISUALSETID

rotation ROTATION

timestamp TIMESTAMP

new-timestamp TIMESTAMP

Value, Match, Drawable

Sets the screen to the specified element of the screen set list returned from RRGetScreenInfo, and the specified hardware visual set element.

Sets the hardware visual set to the specified element of the visualsets list from RRQueryHardwareVisuals.

The screen may be rotated by the specified rotation.

If the timestamp of this request is less than the timestamp reported in the latest RRScreenChange event send, the request is ignored. (This could occur if the screen changed since you last made a RRGetScreenInfo request. Rather than allowing an incorrect call to be executed based on stale data, the server will ignore the request.) The new-time-stamp is returned: if it is not equal to the timestamp you provided, you know the request failed and your screen information is stale and should be re-read.

RRScreenChangeSelectInput

window WINDOW

enable BOOL

Value, Window

Requests that RRScreenChange events of screen changes of the screen associated with the drawable be delivered to the specified window. (whew!)

Clients may then choose to create new resources or use other visuals that have hardware acceleration available that take advantage of the new screen configuration.

Events

To reconfigure the root window, use the RRSetScreen request defined above.

RRScreenChangeNotify

root WINDOW

size-set-index SIZESETID

visual-set-index VISUALSETID

rotation ROTATION

hardware BOOL

config-timestamp TIMESTAMP

timestamp TIMESTAMP

This event is delivered to clients selecting for notification with RRScreenChangeSelectInput requests.

This event is sent whenever the screen is changed, or if a new screen configuration becomes available that was not available in the past. The client MUST call RRGetScreenInfo to update its view of possible screen configurations (and get an up to date timestamp required by the RRSetScreen request).

This event is delivered to clients selecting for notification with RRScreenChangeSelectInput to the window. visual-set-index indicates the current visual set and hardware indicates if the specified window's visual is a member of that set. The hardware boolean indicates if the selected window's visual is still accelerated by hardware.

History and Status

This extension has been designed and significant external review input incorporated. A prototype implementation is functioning in the TinyX X implementation.[CP01]

Future Work

The XAA [VF00] implementation of the full XFree86 will need extension to fully support this protocol extension, prototyped in the TinyX framework.

Toolkits will need updating to support RandR to take advantage of accelerated visual type information to ensure highest possible performance.

Window managers will need to support this extension to layout the screen in some fashion when the size changes to ensure applications are appropriately visible.

There are very entertaining user interface possibilities for moving applications between screens, particularly once systems become aware of resources available in their nearby environment. A free, off the wall (or maybe on the wall), idea might be given a handheld computer with accelerometers such as Itsy[HWV$^+$01] you might almost literally throw windows from one screen to another. Other hacks are left to the bizarre nature of your own imagination.

Acknowledgments

The authors would like to thank their respective employers, Compaq and SuSE, for support of open source software development and the XFree86 project for it's continuing advancement of the X Window System. Additional thanks go to Carl Worth and Alexander Guy for their help in editing the manuscript.

Bibliography

CP01: Juliusz Chroboczek and Keith Packard.
Xkdrive - Tiny X server.
The XFree86 Project, Inc., 2001.
XFree86 Release 4.0.3.
Dal01: Matthias Kalle Dalheimer.
Programming with Qt.
O'Reilly & Associates, Inc., second edition, May 2001.
EW94: Ian Elliott and David P. Wiggins.
Double Buffer Extension Protocol.
X consortium standard, X Consortium, Inc., 1994.
GWY94: Daniel Garfinkel, Bruce C. Welti, and Thomas W. Yip.
HP SharedX: A Tool for Real-Time Collaboration.
HP Journal, April 1994.
HWV$^+$01: William R. Hamburgen, Deborah A. Wallach, Marc A. Viredaz, Lawrence S. Brakmo, Carl A. Waldspurger, Joel F. Bartlett, Timothy Mann, and Keith I. Farkas.
Itsy: Stretching the Bounds of Mobile Computing.
IEEE Computer, 34(4):28-35, April 2001.
MN91: Mark Manasse and Greg Nelson.
Trestle Reference Manual.
Research Report 68, Digital Equipment Corporation Systems Research Center, December 1991.
SG92: Robert W. Scheifler and James Gettys.
X Window System.
Digital Press, third edition, 1992.
Sta00: Richard M. Stallman.
GNU Emacs Manual.
Free Software Foundation, fourteenth edition, June 2000.
VF00: Mark Vojkovich and Marc Aurele La France.
XAA.HOWTO.
Technical report, The XFree86 Project, Inc., 2000.

About this document ...

The X Resize and Rotate Extension - RandR

This document was generated using the LaTeX2HTML translator Version 99.2beta6 (1.42)

The command line arguments were:
latex2html -white -transparent -split 0 -nonavigation -mkdir -dir ../randr randr.tex

The translation was initiated by Keith Packard on 2001-05-01

Footnotes

... irrelevance.¹: E.g. PEX, XIE, LBX, along with wide lines and arcs in the core protocol ...

Keith Packard 2001-05-01

This paper was originally published in the Proceedings of the FREENIX Track: 2001 USENIX Annual Technical Conference, June 25-30, 2001, Boston, Masssachusetts, USA

Last changed: 21 June 2001 bleu

Technical Program

Conference index

USENIX Home