Graphical Definitions in R Graphics

by Paul Murrell http://orcid.org/0000-0002-3224-8858

Version 1: original publication


Creative Commons License
This document by Paul Murrell is licensed under a Creative Commons Attribution 4.0 International License.


This document describes a plan to expand the R graphics engine to support graphical definitions: graphical features like gradients, patterns, masks, and clipping paths.

The code changes are being tracked in a subversion branch called R-defs.

This is a living document that will evolve as the code is developed.

The Features Section outlines the features that I would like to target and the Demonstrations Section provides a glimpse of how some new features are currently implemented at the user level (in the R-defs branch).

The Internal details Section has more on the internal design and internal code changes that have been made.

The Issues Section contains some of the major design issues that I have identified so far and the Discussion Section contains some questions and (some) answers that I have already thought a bit about, including the changes that would be imposed on graphics device maintainers.

Table of Contents:

1. Introduction

The R graphics engine only supports a limited set of graphical features: basic shapes like lines, rectangles, and circles; paths (including shapes with holes); text; raster images; and rectangular clipping regions. These features are sufficient to produce a wide range of statistical plots, but there are many things that R graphics cannot do.

For example, the image below contains a set of filled contours within a map of New Zealand. The natural way to achieve this result would be to draw the filled contours and apply a clipping path to the filled contours based on the boundary of New Zealand. That is not something that can be achieved in R graphics (because R graphics only supports rectangular clipping regions).1

contour lines clipped to a map

The plan is to add new graphical features to the R graphics engine, such as clipping paths, in order to expand the range of graphical images that can be produced with R.

2. Issues

There are a number of issues that need to be resolved when adding a new graphics feature to the R graphics engine.

Device-independence:

Can we generate the same output on all R graphics devices?

This horse has already bolted. The output on different R graphics devices has never been identical and there are already graphics features that are not supported on all R graphics devices (e.g., translucent colours are not supported on the PostScript device). The plan is to double-down on this approach and just allow graphics devices to opt out of any or all of the new features.

This is partly because not all devices will be capable of supporting the new features and partly because it makes it easy to roll out the new features; graphics devices can initially opt out of all new features and add support as and when they are able.

SEXPs in graphics devices:

Can we avoid sending R's SEXP structures to graphics devices ?

In the early days, there was an effort to avoid using R's internal C structures within graphics devices (so that they only had to include R's graphics-related header files and they did not have to worry about complications like R's memory management).

This horse has also bolted. From R 2.13.2 (at least), when the cap graphics device procedure was added (to allow raster "screen shots" of current device state), devices have been able to at least return R structures.

The plan is again to double-down. Some of the new features will require sending relatively complex information to the graphics device (e.g., gradient information and pattern information), so rather than define complex new C structures, it makes sense to make use of R's existing flexible structures.

One problem is that an SEXP in C is basically an opaque pointer. How does the device know about changes to the SEXP structure? The proposal is to write a C API for interacting with the SEXP structures and encourage devices to use that. An example has been developed for gradient fills (see the Demonstrations Section).

The feature set:

How to decide which graphics features to support?

This is almost the inverse of the device-independence problem. Do we shoot for the union of all features supported by all possible graphics devices ? Or do we target the feature set of one particular device or format (e.g., SVG) ?

The plan is to take ideas from everywhere. There are features that will only be supported by some formats (e.g., SVG has filters, but Cairo has only very limited filter support, whereas Cairo has mesh patterns, but SVG does not). There are also subtleties between formats, like the 'gradientUnits' in SVG that are not supported in Cairo, but we can hopefully make an interface for these (at least in 'grid') even if the graphics engine only goes as far as Cairo support.

The PDF feature set is probably the most sophisticated (?). For example, most graphics formats have linear and radial gradients, Cairo graphics has "mesh patterns" that SVG does not, but PDF has seven types of "shaders" (type 2=Linear gradients, 3=Radial gradients, 4&5=Gouraud-shaded triangle meshes, 6=Coons patch meshes, and 7=Tensor-product patch meshes) including Function-based shaders (type 1) that define the colour of every point in the image using a mathematical function. Gradient meshes have been proposed for SVG, but it does not look like they will make it into SVG 2.0.

'grid' versus 'graphics'

Will the new features be implemented for 'graphics' or 'grid' or both ?

The new features will be implemented within the graphics engine. This means that, in theory, interfaces can be made for both 'graphics' and 'grid'.

The plan is to focus, initially at least, on 'grid'. This is because 'grid' allows more flexible and interesting definitions, for example, the use of units in locating gradient stops. 'grid' also allows for things like adding features by post-editing because we are able to address graphical objects by name.

It is also possible in the meantime to apply the new features to 'graphics' output via gridGraphics::grid.echo and functions like grid::grid.edit.

Graphics devices

Which graphics devices should implement which features ?

Obviously, only some features can be added to some graphics devices because of limitations in graphics formats and/or graphics libraries. This issue relates more to when support gets added to different graphics devices. The problem is that implementing all possible features on all possible devices is a big job (made more difficult by the development environment on some platforms, like Windows).

The proposal is to be feature-greedy and just focus initially on the Cairo graphics devices. This gives a reasonable coverage (raster on Linux at least, PDF, and SVG) from a single set of code changes. The idea would be to introduce as many features as possible, on the Cairo devices, for 4.0.0 and then fill in the gaps. Once the Cairo changes are implemented, they will hopefully provide a template so that others could contribute with patches for other devices.

A danger is that only implementing for Cairo may "bias" the graphics engine implementation towards that device and cause problems for implementations on other graphics devices. Perhaps implementing for two devices (e.g., also the PDF device) rather than one would be more prudent.

Another issue is that the default, "graphapp", Windows device is NOT going to be able to support these features very easily (if at all). GraphApp does not support any of these features (?) (even in the latest version and not even the underlying Windows GDI supports gradient fills (though it looks like clipping paths are possible). OTOH, implementing features on the Cairo device makes them available on all platforms (?). I am unsure about whether the RStudio plot pane (RStudioGD) will be able to support these features (across platforms).

Third-party graphics devices can of course update as much as they want when they want, though they will have to immediately provide some minimal stubs for the changes to the graphics device API (GraphicsDevice.h).

3. Features

This section briefly describes the set of graphical features that are currently being considered.

Gradients

This includes linear gradients and radial gradients. Gradients can be applied to both fills and strokes (at least on some devices).

Gradient meshes are also supported on several devices (Cairo and PDF).

Patterns

Pattern fills are similar to gradient fills in that they define a "paint" for filling a region. The difference is that a pattern fill is based on "tiling" a smaller image.

There are other sorts of pattern fills in some formats (e.g., function-based shaders in PDF).

Clipping paths

Allow clipping to an arbitrary path rather than just a rectangle. This relies on being able to specify a "path" separate from drawing (stroking or filling) it.

Masks

Similar to clipping except that affect translucency of rendered output (rather than just visible or not). Again, relies on being able to describe paths without drawing them.

Filters

These are essentially raster operations, but a format like SVG allows them to be defined as part of a vector image (they are applied as part of the rasterization of the vector image, for example, when drawing to screen).

Implementing these in the graphics engine is probably too hard. I don't think any of the standard graphics devices will support them (including the Cairo svg device).

These may have to remain the domain of packages like 'gridSVG' (which provides full access to SVG features) or 'magick' (which allows us to apply raster operations to raster images), possibly in combination with the 'rasterize' package.

It looks like Thomas Lin Pedersen has been thinking along similar lines (to the latter) with a 'ggfx' package.

4. Demonstrations

A new R-defs branch has been created for experimenting. So far, that includes the addition of support for linear and radial gradient fills, patterns, clipping paths, and masks on both Cairo graphics devices and the pdf device (except for patterns), with a 'grid' interface. The following set of examples show how the 'grid' interface has been implemented so far.

NOTE: support for graphics definitions has only been implemented for Cairo-based graphics devices and the pdf device so far. Other graphics devices should run, but will not produce output from graphics definitions. Third-party graphics devices will hopefully error out (if they make use of R_GE_checkVersionOrDie), but otherwise they are likely to segfault until they are updated.

On the positive side, Cairo devices and the pdf device can be used on all major platforms, so there is already an option for accessing these new features on all major platforms.

Gradients

The fill gpar can now be a linear gradient (as well as a normal R colour).

A linear gradient is defined by a start point (default: bottom-left) and an end point (default: top-right) and a series of "stops". Each stop is defined by a position on the line from start point to end point, plus a colour. If the first stop is not at the start point, or the last stop is not at the end point, we can specify how the gradient "extends" beyond the first and last stops.

The following code creates a linear gradient that starts one inch off the bottom of the image, proceeds vertically to the top of the image, and transitions from red to yellow then back to red.

When the gradient fill is specified on a grob, the start and end points are relative to the grob bounding box. The following code fills a rectangle in the central quarter of the image with the default linear gradient; the gradient starts at the bottom-left of the rectangle and ends at the top-right of the rectangle.

A gradient fill can also be specified on a viewport, in which case the gradient is relative to the viewport and all subsequent drawing makes use of that gradient (unless a new explicit fill is specified).

In the code below, we push a viewport that fills the whole image and specify the default linear gradient fill, so the gradient is relative to the whole image. We then draw a rectangle, with no explicit fill, so it "inherits" the gradient from the viewport.

In the next example, we have the same viewport, but we draw the rectangle only in the central quarter of the image. This shows that the gradient that the rectangle inherits is relative to the viewport rather than being relative to the rectangle.

The next example shows another variation on inheritance of gpar settings. This time a viewport is pushed with the default linear gradient and we draw a rectangle in the left third of the image that inherits that gradient (relative to the viewport). Then we push another viewport with a green fill and draw a rectangle within that viewport (in the centre of the image) and that rectangle inherits the green fill. Finally, we pop the second viewport and return to the first viewport and restore its linear gradient so that when we draw a rectangle in the right third of the image it again inherits the linear gradient (relative to the viewport).

The next example also demonstrates inheritance of gpar settings, but this time it is inheritance between viewports. We push a viewport with the default gradient, then push another viewport in the central quarter of the image, then draw a rectangle. The second viewport inherits the gradient from the first viewport and then the rectangle inherits the gradient from the second viewport. The important point is that the gradient is relative to the first viewport, which is where it was defined.

The next example just shows that gradients work with translucent colours. We draw the text "Reveal" then over the top draw a rect with a gradient fill that transitions from (opaque) white to transparent.

The next example shows that radial gradients work via the same interface. It is expected that fill patterns would work the same way.

As with linear gradients, we have control over the start and end of the radial gradient, though in the radial case it is a start circle and an end circle. In the code below, we make a gradient that starts near the top right of the image (with white) and ends with a circle that fills the image (black).

The next example shows that we can add a gradient to an existing image, in this case a 'ggplot2' plot. Here we edit the background of the plot panel to add a grey-to-white gradient.

The next example is similar, but demonstrates that, even with gradients only implemented in 'grid', we can still add gradients to 'graphics' plots if we first convert them using the 'gridGraphics' package.

Patterns

The fill gpar can also be a general pattern that is based on a 'grid' grob. The pattern function takes a grob and draws it as the fill for another grob.

Patterns on grobs are relative to the grob.

Patterns on viewports are relative to the viewport.

Patterns are full drawings so can have gradients and clipping (see "Clipping paths" below).

Patterns can "extend" like gradient fills; probably the most typical use of a pattern will draw a simple shape and then specify extend="repeat" to fill a region. To do this, we must specify a size for the pattern (this specifies a "size" for the "tile" that gets repeated). For example, the code below fills a rectangle with a polka-dot pattern by specifying a circle as the pattern and specifying that the pattern size is just a little bit larger than the circle and specifying that the pattern repeats.

Clipping paths

The clip argument for the viewport function can be a 'grid' grob (in addition to the existing "on", which only enforces a rectangular clipping region). The grob defines a path that is used to set the clipping region.

A simple example is clipping output (in this case text) to a circle. The text and the circle are both drawn in grey and then the text is drawn black with the circle used as a clipping path.

A slightly more dramatic example involves clipping text to several circles at once. Note that this still only requires a single grob to specify the clipping path.

The clipping path is relative to the viewport and clipping paths are inherited like the existing clip settings.

In the following code (after drawing everything in grey without any clipping), we push a viewport with three circles as the clipping path, then we draw the text "test one", which gets clipped to the circles. Next, we push another viewport in the bottom third of the image, which inherits the clipping path, and we draw the text "test two", which also gets clipped to the circles. Finally, we pop those two viewports and push a third viewport in the top third of the image; now there is no clipping path in effect (because we popped the viewport with a clipping path), so when we draw the text "test three" the text is not clipped at all.

Clipping paths are just paths (outlines of shapes), they do not reflect, for example, stroke width or fill colours.

Although a clipping path can be an arbitrary 'grid' grob, clipping and masks (see below) are disallowed within a clipping path. Any viewport that sets clip=TRUE or clip=FALSE or sets clip to be a grob, or that sets mask to be a grob, will generate a warning and the clipping setting (or the mask) will be ignored for determining the clipping path.

This constraint has been applied for several reasons: to make life easier for graphics devices; because it is very difficult or impossible to achieve some nested clipping effects on some devices; formats like PDF and SVG enforce limits on what can go in a clipping path; and fancy clipping shapes can be achieved by using a mask instead.

Masks

A 'grid' viewport can now have a mask (in addition to clip). This value should be either "inherit" (the default), in which case the viewport inherits the mask from its parent, "none", in which case the viewport does not mask its content, or a 'grid' grob that defines a new mask for the viewport.

The following example defines a circle grob with a radial gradient fill that starts at black in the centre and gradually transitions to transparent at the circumference. A viewport is pushed with this circle grob as the mask, then a rectangle is drawn half the width of the viewport and filled with solid black. The result is that only the parts of the rectangle where the mask is non-transparent are drawn; the retangle is filtered by the mask.

The following code blends two 'ggplot2' plots using a mask. This time the mask is a linear gradient from black 30% of the way up the page to transparent 70% of the way up the page. First, we draw a plot with the default ggplot2 them, then we draw a black-and-white themed version of the same plot on top, using the mask, so from 30% up the page the black-and-white plot transitions smoothly into the default themed plot.

Here is an example of a relatively complex mask that cannot be achieved with clipping paths (in the current proposal) because it would require nesting clipping paths. The mask consists of two narrow rectangles, but the left rectangle is drawn within a viewport with a clipping path. The same effect could be achieved by replacing the clipping path with a mask (nested masks are ok).

5. Internal details

Gradients

A patternFill property has been added to the R_GE_gcontext (graphical context) in the graphics engine. This property can be either NULL (indicating no pattern fill) or an R object, which represents a reference to one of the current pattern fills on the graphics device.

This property is in addition to the existing fill property. If patternFill is non-NULL then it overrides fill. This makes it easy for a graphics device to ignore patternFill if it wants to.

A grDevices::linearGradient function has been added. This is not meant to be called directly by users; it is there for 'graphics' and 'grid' to use. The idea is the 'graphics' and 'grid' have user-level interfaces to define gradients and those are converted to graphics engine versions of the gradients for drawing. There is also a grDevices::radialGradient.

A setPattern device function has been added to allow the graphics engine to ask a graphics device to create a pattern. The function takes a "Pattern" object (either a linear or radial gradient so far) and returns an R object (a reference to be used in a patternFill call).

(A graphics device can just always return NULL if it generates a new pattern every time setPattern is called ?)

There is also a releasePattern device function so that, e.g., 'grid' can release a pattern that is just set on a grob (and release all patterns on new page). Specify NULL to release all patterns.

NOTE that the "Pattern" sent to setPattern is an SEXP, which means it is sort of an opaque pointer in C code, so the graphics engine provides a C API for extracting information from a "Pattern" object. For example, R_GE_isPattern can be called to check for a pattern fill, R_GE_patternType can be called to get the pattern type, and R_GE_linearGradientNumStops can be called to get the number of stops from a LinearGradient SEXP.

The Cairo graphics device (x11(type="cairo"), png(type="cairo"), cairo_pdf(), and svg()) has implemented setPattern and releasePattern and maintains an array of patterns. It observes the new R_GE_gcontext.gradientFill setting and uses the value as an index into its array of patterns.

The 'grid' package provides its own linearGradient function for defining a linear gradient in 'grid'. One notable difference is that the start and end points for the gradient can be defined using 'grid' units. The gpar function now also allows a GridPattern as the value for the fill property.

When gpar settings are enforced during drawing a grob, a pattern fill is resolved to a graphics engine Pattern (for example, the start and end points for a linear gradient are converted to locations on the graphics device). If the pattern fill is specified on a viewport, the pattern definition is relative to that viewport; if the pattern fill is specified on a grob, then the pattern definition is relative to the bounding box of the grob.

'grid' calls setPattern to "resolve" a pattern and generates a GridResolvedPattern, which contains the reference from setPattern. This allows 'grid' to reuse the pattern reference, e.g., when drawing multiple shapes from a single grob, or when revisiting a viewport (so that it does not have to ask the graphics device for a new pattern for every shape or for every visit to a viewport).

When drawing a grob, 'grid' resolves the pattern relative to the grob, gets the pattern reference and uses that for every shape that is drawn from this grob, then releases the pattern.

When pushing a viewport, 'grid' resolves the pattern relative to the viewport and stores the resolved pattern in the pushed viewport, so that revisits to the viewport can reuse the resolved pattern. The resolved pattern is only stored in the pushed viewport so that replays of the grid display list (e.g., grid.edit) or the graphics engine display list (e.g., device resize or device copy) will always resolve the pattern again.

When starting a new page, 'grid' releases all patterns.

Patterns

Patterns use the same R_GE_context slot as gradients; a drawing can have a gradient OR a pattern fill OR a normal fill. Patterns are passed to graphics devices via the same setPattern device function.

There is a grDevices::tilingPattern function for 'graphics' and 'grid' to use to create a pattern.

'grid' creates a GridTilingPattern object, which is a list with a component, f, that contains a function to draw the grob that defines the pattern, plus the location, size, and extend rule for the pattern.

The Cairo graphics device creates a pattern by running the pattern function within a cairo_push_group and cairo_pop_group. This means that the pattern is drawn on a temporary surface and that drawing is turned into a Cairo pattern to use as the source for filling other drawing.

The same pattern resolution in 'grid' is used as for gradients; the same pattern array on the Cairo device is used as for gradients.

Clipping paths

A setClipPath device function has been added to allow the graphics engine (or graphics systems) to ask a device to define a clipping path. The function takes a path and a ref and should return a reference to the clipping path on the device. All three values are SEXPs, but all the device needs to know about the path is that it is an R function (with no arguments) that the device should call to create the clipping path. More on how that works in a second. All the graphics engine (or graphics systems) need to know about the reference to a clipping path is ... nothing. These references are generated by the device, passed back to the graphics engine, then sent in again to the device. The graphics engine (or graphics systems) only store the references and pass them back to the device; the references only have meaning within the graphics device.

There is also a releaseClipPath device function, which takes a reference to a clipping path so that the graphics engine (or graphics systems) can indicate to the device that a clipping path is no longer required.

When a 'grid' viewport is created with a grob as the clip argument, a "GridClipPath" object is created, which is a function that draws the grob. When a 'grid' viewport is pushed, the clipping path is "resolved" by calling setClipPath on the graphics device (via R and C interface functions in the 'grDevices' package).

The Cairo device maintains an array of clipping paths. When a new clipping path is requested (with a call to setClipPath) the device starts a new path and enters an "append mode" before it runs the R function that defines the clipping path. Cairo device functions that perform drawing (e.g., Cairo_Rect) only append shapes to the current path when the device is in "append mode" (when not in append mode, these device functions start a new path, add the shape to the path, set parameters like colour, then stroke and/or fill the path). After the R function has run, the path is closed and the clipping region is set from the path. The path is also saved in the array of clipping paths and an integer index to array is returned as the reference (as an R integer vector).

The 'grid' package stores the clipping path reference that is returned, along with the original "GridClipPath", creating a "GridResolvedClipPath". This enhanced clipping path object is stored in the pushed viewport so that, when a viewport is revisited, 'grid' can send the reference to the clipping path as well as the clipping path itself in the call to setClipPath, which allows the graphics device to reuse cached clipping paths (if it is set up that way).

The Cairo device checks whether the ref argument to setClipPath is non-NULL, in which case it looks up the relevant clipping path in its array of clipping paths and reuses that, rather than generating a new clipping path.

'grid' releases all clipping paths on a grid.newpage.

Masks

A setMask device function has been added to allow the graphics engine (or graphics systems) to ask a device to define a mask. The function takes a mask and a ref and should return a reference to the mask on the device. All three values are SEXPs, but all the device needs to know about the mask is that it is an R function (with no arguments) that the device should call to create the mask. As with clipping paths, the references are generated by the device, passed back to the graphics engine, then sent in again to the device. The graphics engine (or graphics systems) only store the references and pass them back to the device; the references only have meaning within the graphics device.

There is also a releaseMask device function, which takes a reference to a mask so that the graphics engine (or graphics systems) can indicate to the device that a mask is no longer required.

The Cairo device maintains an array of masks. When a new mask is requested (with a call to setMask) the device starts a new group (cairo_push_group) so that all drawing on the device is on a temporary surface, calls the mask function, then saves the group as a pattern (cairo_pop_group) on the array of masks and returns the index of the mask in that array.

The 'grid' package stores the mask reference that is returned, along with the original "GridMask", creating a "GridResolvedMask". This enhanced mask object is stored in the pushed viewport so that, when a viewport is revisited, 'grid' can send the reference to the mask as well as the mask itself in the call to setMask; this allows the graphics device to reuse cached masks (if it is set up that way).

The Cairo device also keeps track of the current Mask, as an index into its mask array. This starts as -1, which means there is no mask in effect. setMask updates this index, including resetting to -1 if sent a NULL mask.

The 'grid' package calls setMask whenever a viewport is pushed or revisited to update the current mask on the device.

The shape-drawing functions on the Cairo device check whether the mask is non-negative. If so, rather than drawing directly, they draw to a temporary surface (cairo_push_group, cairo_pop_group), collect the result as a pattern, then mask the pattern, using the current mask, to draw on the main device surface (cairo_mask).

The pdf device

Gradients and clipping paths and masks have also been implemented on the pdf graphics device.

The pdf device maintains an array of "definitions", which are basically strings of PDF code.

Resolving a gradient (via setPattern) creates a definition that contains the PDF code defining the gradient and the array index to that definition is returned as the reference. When a shape is drawn with a gradient fill, code something like /Def3 scn is generated to indicate that definition 3 (a gradient) is to be used as the fill colour. On dev.off, definition strings are written out at the end of the PDF file and the link between /Def3 and the appripriate PDF object number is resolved.

Semi-transparent gradients are a bit more complicated because they require creating an additional "soft mask" definition. In this case, the reference returned is two array indices, one for the gradient and one for the soft mask. When a shape is drawn with a semi-transparent gradient fill an additional piece of code, something like /Def4 gs is generated to enforce the soft mask for the gradient fill. Again, the actual definition string for the soft mask is written out at the end of the PDF file, when links between definition names and object numbers are resolved.

Resolving a pattern creates a definition that contains PDF code to define the pattern, with the array index of that definition returned as a reference. The definition of a pattern contains a "stream" of drawing code, which is obtained by evaluating the R function from the pattern and capturing its output within a temporary definition (that is copied to the pattern definition and not written out itself). When a shape is drawn with a pattern fill, the pattern is set as the fill colour as for gradient fills. Pattern definitions are stored and written out at the end of the PDF file, when links between definition names and object numbers are resolved. An added complication with patterns is that the definition is actually only finalised at this point because the pattern must contain its own Resources Dictionary (it cannot just refer to the document's Resource Dictionary because that contains a reference to the pattern itself!).

Resolving a clipping path (via setClipPath) creates a definition that contains the PDF code that draws the clipping path. Again, the array index of that definition is returned as the reference. The PDF code describing the clipping path is obtained by evaluating the R function for the clipping path and capturing the output within the clipping path definition. The setClipPath call also writes the definition string out (plus W n to enforce the clipping path), but it can simply reuse the definition string (rather than regenerating it) if the reference is not NULL. A clipping path definition is NOT written out at the end of the file (because it is written inline whenever a clipping path is set).

A limitation: the fill rule used by the clipping operator follows the device fill rule. So it is possible to add a path with one fill rule, but have the clipping path use a different fill rule.

Resolving a mask (via setMask) creates a definition that contains the PDF code that defines the mask content. Masks are implemented as alpha soft masks. The mask is described by a "stream" of PDF code that is obtained by evaluating the R function for the mask and capturing its output within a temporary definition (as for patterns). The array index of that definition is returned as the reference. Mask definitions are stored and written out at the end of the PDF file, when links between definition names and object numbers are resolved. The PDF device monitors whether a mask is currently defined and, when drawing normally, if a mask is defined it is enforced with code something like /Def5 gs.

Unlike clipping paths, it is valid to nest masks within masks (and gradients and clipping paths within masks); masks are essentially self-contained drawings. This can be accommodated by the PDF device because all of these features are just stored in definitions as the mask content is captured, and then the definitions are written to file at the end, and only references to other definitions are recored in the mask content.

This implementation on the pdf device provides some evidence that the overall design of graphics definitions support has not been too tightly coupled with the Cairo graphics devices, which gives some hope that other graphics devices will be able to implement at least some of these features.

6. Discussion

The argument to augment the R graphics engine with new graphics features is a simple one: R cannot do some things now, so we should make it so it can. But how important is it to allow R to produce more sophisticated graphical output ?

One motivation for these changes is to allow people to produce more of their figures and diagrams via code. One workflow that people employ is to generate a plot using R and then use Adobe Illustrator to "touch up" the plot. This sort of workflow is not automatable, or reproducible, or shareable, etc. If we can help people to do everything via code and remove manual steps from workflows, that is a big win.

A simple example of the usefulness of clipping paths is the ability to clip to (rectangular) rotated viewports (!).

From my own perspective, having better support for sophisticated graphical features in R would allow me to expand on work already done on importing graphics from other systems into R ('grImport', 'grImport2', 'metapost', 'dvir', 'displayEngine'). Without these features, it is not possible to represent images from other systems properly in R.

Anecdotally, I believe that others would like to have the ability to make use of additional graphics features too. As one piece of hard evidence, this work is being partially funded by a donation from R Studio to the University of Auckland Foundation.

Design

The design of the API for these new features has been focused on flexibility - making things possible. This is reflected particularly in the 'grid' API where clipping paths, patterns, and masks are all defined in terms of a grob.

This flexibility is good in the sense that it makes reasonably complex things possible (e.g., masks containing patterns and clipping paths), but it does place a reasonable burden on the graphics devices. For example, graphics devices must be written to allow for recursion; drawing of a shape may trigger drawing of further shapes (e.g., in order to resolve the fill pattern for the original shape).

This flexibility may also impact on efficiency. For example, it is only currently possible to specify a single pattern fill for a grob (which may draw multiple shapes); pattern fills are not vectorised.

As use-cases appear, there may be a need for adding a simpler interface for patterns, masks, and clipping paths, both to allow graphics devices to support a less flexible API (with less effort) and to allow greater efficiency when drawing a large number of shapes.

Fortunately, the current API passes an SEXP as the specification for a pattern or clipping path or mask to resolve, so it should be possible in future to pass other obects via this interface with, say, classes used to distinguish between different sorts of specifications.

External graphics devices

How much work needs to be done to upgrade a graphics device for these changes ?

A device must implement dev->setPattern, but it can just return NULL if unsupported. A device can also just create a new pattern every time. If it returns a meaningful pattern reference, it must do something sensible with that reference when provided via R_GE_gcontent.patternFill.

A device must implement dev->releasePattern, but it can just do nothing.

There is new R_GE_gcontent.patternFill, but a device can just ignore it (because R_GE_gcontext.fill is always set; it will just be "transparent" when a pattern fill has been specified).

A device must implement dev->setClipPath, but can just return NULL. There are three levels of support: none, in which case the device ignores the clipping path request (and returns NULL); non-caching, where the device sets clipping paths, but returns NULL, so it will always generate a new clipping path on every request; caching, where the device sets clipping paths, caches clipping paths, and returns a reference to the cache, and the reference only has to make sense to the device.

A device must implement dev->releaseClipPath, but can just do nothing.

A device must implement dev->setMask, but can just return NULL. As with clipping paths, there are three levels of support: none, non-caching, and caching.

A device must implement dev->releaseMask, but can just do nothing.

As an example of the (minimal) changes necessary to update a device (WITHOUT support for any of the new features), the following diff shows the changes to get the quartz device running on macOS (at r78533).

@@ -371,6 +371,12 @@
+static int      RQuartz_setPattern(SEXP pattern, pDevDesc dd);
+static void     RQuartz_releasePattern(int index, pDevDesc dd);
+static SEXP     RQuartz_setClipPath(SEXP path, SEXP ref, pDevDesc dd);
+static void     RQuartz_releaseClipPath(SEXP ref, pDevDesc dd);
+static SEXP     RQuartz_setMask(SEXP path, SEXP ref, pDevDesc dd);
+static void     RQuartz_releaseMask(SEXP ref, pDevDesc dd);

@@ -430,6 +436,13 @@
+    dev->setPattern      = RQuartz_setPattern;
+    dev->releasePattern  = RQuartz_releasePattern;
+    dev->setClipPath     = RQuartz_setClipPath;
+    dev->releaseClipPath = RQuartz_releaseClipPath;
+    dev->setMask         = RQuartz_setMask;
+    dev->releaseMask     = RQuartz_releaseMask;

@@ -1282,6 +1295,24 @@
+static int RQuartz_setPattern(SEXP pattern, pDevDesc dd) {
+    return -1;
+}
+
+static void RQuartz_releasePattern(int index, pDevDesc dd) {} 
+
+static SEXP RQuartz_setClipPath(SEXP path, SEXP ref, pDevDesc dd) {
+    return R_NilValue;
+}
+
+static void RQuartz_releaseClipPath(SEXP ref, pDevDesc dd) {}
+
+static SEXP RQuartz_setMask(SEXP path, SEXP ref, pDevDesc dd) {
+    return R_NilValue;
+}
+
+static void RQuartz_releaseMask(SEXP ref, pDevDesc dd) {}
  

The plan is to implement at least one caching device (Cairo, plus maybe PDF), a non-caching device (always generates new definition on every use), and a non-implementing device (does not support features). These then provide templates for external device maintainers.

Efficiency

In an unusual move (for me), I have not completely ignored efficiency concerns.

Definitions are resolved every time a graphics system calls the graphics engine because the graphics engine is a "flat" coordinate system whereas the graphics systems (particularly 'grid' units) are dynamic and depend on the current drawing context.

This may present some challenges for implementing graphical definitions efficiently. However, the worst case scenario is something like filling thousands of data symbols with the same gradient fill.

Another issue is revisiting viewports (up/down). This could also generate a lot of redundant recalculations (or very large file sizes). Though not at the same scale as the thousands-of-shapes problem, it could be more common (?). This also can happen during calculation of things like "grobWidth" units.

The inefficiency could bite both in terms of speed and, for file-based devices, memory.

In both cases, allowing the graphics device to cache its definitions should be beneficial.

The plan is to actually do some measurements to demonstrate that there is a problem and that caching is helpful (or not).

TODOs

Neither Cairo nor PDF devices have currently attempted to include text output in clipping paths or masks.

The R graphics engine does not accommodate the idea of a gradient fill within text.

The PDF device does not (yet) allow rasters within patterns or masks.

The Cairo device currently has a fixed limit on the number of definitions allowed (per page).

Issues

What to do about pattern fill on multiple shapes from single grob: grid.rect(width=1:2/3, gp=gpar(gradientFill=linearGradient())) Currently just resolving pattern based on bounding box of all shapes.

The current implementation of pattern fills in 'grid' "nicely" incorporates them into the gpar(fill=) interface. This may have implications for packages that build on 'grid' if they try to manipulate 'grid' gpars in any way, other than just setting them (e.g., 'gridSVG' converts gpar values to SVG display attributes).

These changes introduce some "modal" behaviour (like whether the device is "appending" or "drawing"); how to make sure we elegantly unwind when an error occurs (e.g., when hit an error while playing/recording a clipping path) ? One defence being employed for Cairo and PDF devices is to only produce warnings rather than errors when things go wrong (in the case of PDF, ask the use politely to shut down the PDF device). Also, modes are reset on new page, so it may be possible to recover a corrupted device by starting a new page.

What will the impact be on saved/restored plots ? (the graphics engine display list and snapshots thereof) I have some test cases working within-session. It would certainly be a bad idea to save from R 4 and replay in R 3 (though the reverse should be ok?).

The fact that clipping paths (mostly) and masks can be arbitrary 'grid' grobs places a reasonable burden on the graphics device. The device has to evaluate an R function and capture the resulting output. A possible future path for devices that cannot meet these demands may be to allow alternate objects as clipping paths and masks. Currently, the SEXP sent to the device is an R function, but that could conceivably be limited to simpler R objects, like a simpler description of one or more shapes. That might make the device's job a LOT easier.

PDF viewers: can get some variation between viewers (e.g., evince, xpdf, gs/gv, and firefox). Can also get (bigger) variation between versions of viewers. This bites harder for more complex drawings. For example, nested tiling patterns are NOT good in gv 3.7.4, xpdf 3.04, and evince 3.18.2 (Ubuntu 16.04), but are fine in gv ??, xpdf ??, and evince ?? (Ubuntu 18.04).

'gridSVG'

Reasons for keeping 'gridSVG': still the only way to support some stuff (e.g., Cairo does not support filters); still the only way to retain structure and labelling when export.

The impact on 'gridSVG' may be interesting. There may be differences between the way that features are implemented in 'grid' vs 'gridSVG'. For example, gradient fills might be added to gpar in 'grid', rather than creating a separate class of grob (as in 'gridSVG'). This means that 'gridSVG' may need updating to work with new 'grid' gradients. (Yes!)

'grImport2' is another interesting case - it may still end up with ext=gridSVG as well as being able to draw directly to normal R graphics devices.

Fonts

Proper font support is another issue entirely. Currently addressed by packages like 'showtext' and 'extrafont'. Maybe a future project.

Extensions

It would be nice if the graphics device API allowed for people to write R code that talked directly to the graphics device without having to be controlled by (and filtered through) the graphics engine. This would allow, for example, people to access features on graphics devices that are still not supported by the graphics engine (sort of like what 'showtext' sneakily does now to hijack text rendering; 'tikzDevice' also does something like this for creating "tags" in tikZ output ?). This would be a little like DVI specials (?) TODO: look at what 'showtext' and 'tikzDevice' need to do and how they currently work around the graphics engine. Another example that has been mentioned is hyperlinks.

Probably also best as a separate, future project.

Footnotes

1 It is actually possible to get this particular result currently in R graphics. We can generate a path consisting of the border of the entire image with the border of New Zealand inside it (i.e., a rectangular path with a hole the shape of New Zealand) and fill that with opaque white. However, that sort of workaround is more awkward and often runs into difficulties, for example, if we want to add other output to the image, but not obscure it with the opaque white fill. In general, we can generate may different graphical results just with the existing R graphics features, but we may have to work very hard to get what we want. It could be said that all this proposal is aiming for is greater convenience, but in some cases it is MUCH greater convenience.


Creative Commons License
This document by Paul Murrell is licensed under a Creative Commons Attribution 4.0 International License.