Extending the API

In this vignette we will cover some general guidelines that ensure your anvl functions come without surprises. This is primarily intended for extending the API – either in your own package or contributing to {anvl} itself – but is also helpful when writing your own scripts.

The general guidelines are:

- The function must be pure.
- Consistent input and output types:
  1. The dynamic (arrayish) inputs should accept AnvlArrays as well as R vectors of length 1 and arrays.
  2. The function should only output AnvlArrays.
- The function should work with arbitrary devices.
- The function should (unless there are specific reasons) work in eager and jit mode.
- Use static arguments when you require data-dependent input checks.

Pure Functions

This is extensively covered in the JIT Deep Dive, so we won’t repeat it here. While the subsequent sections mostly address issues that are relevant in eager mode, purity is the primary requirement to enable usage of jit() with your function.

Consistent Input and Output Types

Functions in anvl have dynamic (arrayish) and static (standard R values) inputs. It is often convenient to also pass plain R objects as dynamic inputs and let anvl convert them. The as_anvl_array() and as_anvl_arrays() converters enable this; call them at the top of your function. Besides converting the inputs, they check them for compatibility with respect to device and backend, and throw an error if the inputs do not live on the same device.

Note that this is only really necessary for using your function in eager mode (i.e. without jit()). This is because when a function is wrapped in jit(), {anvl} itself can perform these checks automatically.

The advantage of this input standardization is best illustrated with an example.

Consider a naive implementation of reshaping, which fails when called on an R vector:

library(anvl)
# operand: dynamic, shape: static
nv_reshape_naive <- function(operand, shape) {
  if (!identical(shape(operand), shape)) {
    prim_reshape(operand, shape)
  } else {
    operand
  }
}
nv_reshape_naive(1L, c(2, 2))
#> Error in `UseMethod()`:
#> ! no applicable method for 'shape' applied to an object of class "c('integer', 'numeric')"

This happens because attribute getters such as shape() and dtype() are only implemented for AnvlArrays, not for R vectors. Canonicalizing the inputs at the top of the function ensures that these getters always receive an AnvlArray.
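As a minimal sketch, the naive reshape above could be canonicalized as follows. The name nv_reshape_checked is hypothetical and used only for illustration; the body relies solely on as_anvl_array(), shape() and prim_reshape() as they appear in this vignette.

```r
library(anvl)

# operand: dynamic, shape: static
# `nv_reshape_checked` is a hypothetical name used only for this sketch.
nv_reshape_checked <- function(operand, shape) {
  # Canonicalize first, so that shape() below always receives an AnvlArray.
  operand <- as_anvl_array(operand)
  if (!identical(shape(operand), shape)) {
    prim_reshape(operand, shape)
  } else {
    operand
  }
}

# An R scalar is now accepted; it holds one element, so (1, 1) is a valid target.
nv_reshape_checked(1L, c(1, 1))
```

The only change relative to the naive version is the conversion on the first line of the body; the rest of the logic stays untouched.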

Also, consider this function that converts an input to a specific dtype (or keeps it as-is if dtype is NULL). The problem is that in the no-op case, we return a static R object instead of (as intended) an AnvlArray.

# operand: dynamic, dtype: static
nv_convert_naive <- function(operand, dtype) {
  if (is.null(dtype)) {
    return(operand)
  }
  prim_convert(operand, dtype)
}
nv_convert_naive(1L, "i16")
#> AnvlArray
#>  1
#> [ CPUi16{} ]
nv_convert_naive(1L, NULL)
#> [1] 1

By canonicalizing inputs, such pitfalls can be avoided.
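For the conversion example, a canonicalized variant might look like the sketch below. nv_convert_checked is a hypothetical name; the assumption is that as_anvl_array() turns the R scalar into an AnvlArray, as described earlier in this section, so that the no-op branch no longer leaks a plain R value.

```r
library(anvl)

# operand: dynamic, dtype: static
# `nv_convert_checked` is a hypothetical name used only for this sketch.
nv_convert_checked <- function(operand, dtype) {
  # Convert before branching, so both code paths return an AnvlArray.
  operand <- as_anvl_array(operand)
  if (is.null(dtype)) {
    return(operand)
  }
  prim_convert(operand, dtype)
}

# The no-op case now goes through as_anvl_array() as well.
nv_convert_checked(1L, NULL)
```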

Finally, note that primitives such as prim_convert() already canonicalize their inputs, so if you are only wrapping primitives (or other nv_<op> functions that already canonicalize), you might not have to do this yourself.

When a function takes multiple arrayish inputs, normalize them in a single as_anvl_arrays(...) call covering all of them, so R literals/arrays adopt the device of their AnvlArray siblings instead of landing on the default device.

Arbitrary Devices

In order to ensure that your function works with inputs from arbitrary devices, you need to be careful when creating new constants within your function. Let’s say you are creating your function and working on GPU:

nv_add_one_naive <- function(operand) {
  operand <- as_anvl_array(operand)
  operand + nv_fill(1L, shape(operand), device = "cuda")
}

As long as you are adding ones on a CUDA GPU, this function will work fine! However, if you suddenly use it on the CPU, it will fail, because we can’t add a CPU array to a CUDA array. Constants should always be initialized on the same device as the inputs. If there are multiple inputs and you called as_anvl_arrays() on them at the top, you know that there is only a single device.

One way to achieve this is to simply pass the input’s device to nv_fill():

nv_add_one1 <- function(operand) {
  operand <- as_anvl_array(operand)
  operand + nv_fill(1L, shape(operand), device = device(operand))
}

Another option is to rely on the nv_<op>_like functions. These take an AnvlArray as their first input and use its properties as the defaults for their arguments. In this case, the created array takes its data type, shape and device from the input operand.

nv_add_one2 <- function(operand) {
  operand + nv_fill_like(operand, 1L)
}

Note that when you only want to use a function with jit(), you can omit the device entirely, as jit() is smart enough to place the constant on the correct device.

Static Arguments to Enable Input Checks

One restriction of the XLA compiler is that it does not really allow for runtime checks on the values of dynamic inputs. Let’s say you want to sample from a Bernoulli distribution with probability p. If you make p a dynamic input, you can’t check that it lies within [0, 1], so you need to make it a static input and check its value before converting it to an AnvlArray. Later in the function it will actually be converted, but from XLA’s point of view it is then just a constant within the compiled program, not a dynamic input.

nv_rbernoulli <- function(initial_state, p) {
  initial_state <- as_anvl_array(initial_state)
  stopifnot((p >= 0) && (p <= 1))

  # returns: (state, sample)
  out <- nv_runif(1L, initial_state)
  out_state <- out[[1L]]
  x <- nv_convert(out[[2L]] <= p, "i32")
  list(out_state, x)
}
nv_rbernoulli(nv_rng_state(1), 0.2)[[2L]]
#> AnvlArray
#>  0
#> [ CPUi32{1} ]
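Because p is a static argument, it can be validated with ordinary R code before any array is created. The helper below is a hypothetical, base-R-only sketch (assert_probability is not part of anvl) of a somewhat stricter check than the stopifnot() call above: it also rejects non-numeric, missing, and non-scalar values.

```r
# Hypothetical helper (not part of anvl): validates a static probability
# argument with plain R before any array conversion happens.
assert_probability <- function(p) {
  if (!is.numeric(p) || length(p) != 1L || is.na(p)) {
    stop("`p` must be a single non-missing number.", call. = FALSE)
  }
  if (p < 0 || p > 1) {
    stop("`p` must be in [0, 1], got ", p, ".", call. = FALSE)
  }
  invisible(p)
}

assert_probability(0.2)       # passes silently
try(assert_probability(1.5))  # signals an error
```

A check like this could stand in for the stopifnot() line in nv_rbernoulli(); it only works because p is still a plain R value when the check runs.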