This RFC proposes adding an API to the specification for explicitly broadcasting a list of shapes to a single shape.
Overview
Based on array API comparison data, this API, or some variation of it, is commonly implemented across array libraries.
Currently, the Array API specification only includes broadcast_arrays and broadcast_to which both require array input. The specification lacks APIs for working directly with shapes without needing to create new array instances.
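For illustration (not part of the RFC text), the workaround this forces today looks roughly like this; any conforming namespace works in place of NumPy:

```python
import numpy as xp  # stand-in for any Array API-compatible namespace

# With only broadcast_arrays / broadcast_to available, computing a result
# shape means creating throwaway array instances first.
a = xp.zeros((8, 1, 3))
b = xp.zeros((7, 1))
result_shape = xp.broadcast_arrays(a, b)[0].shape  # (8, 7, 3)
```

This is the `xp.broadcast_arrays(*args)[0].shape` workaround discussed under Notes below.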
Prior Art
- NumPy: https://numpy.org/doc/stable/reference/generated/numpy.broadcast_shapes.html
- added in 2020: ENH: add function to get broadcast shape from a given set of shapes. numpy/numpy#17535
- returns a Tuple
 
- CuPy: simply borrows broadcast_shapes from NumPy: https://github.com/cupy/cupy/blob/a888cc94c79729cf24ebb808d15b9702c0342392/cupy/__init__.py#L302
 
- Dask: da.core.broadcast_shapes exists as a private API only. Supports Dask's bespoke nan's in the shape.
- JAX: follows NumPy
- returns a Tuple
 
- PyTorch: follows NumPy
- returns a Size
 
- TensorFlow: has two APIs for statically and dynamically known shapes
- broadcast_static_shape: https://www.tensorflow.org/api_docs/python/tf/broadcast_static_shape
- broadcast_dynamic_shape: https://www.tensorflow.org/api_docs/python/tf/broadcast_dynamic_shape
- both functions only accept two shape arguments
 
- ndonnx: no API. Shapes can contain None, so one cannot use numpy's implementation.
Proposal
This RFC proposes adding the following API to the specification:
def broadcast_shapes(*shapes: tuple[int | None, ...]) -> tuple[int | None, ...]
in which one or more shapes are broadcast together according to the broadcasting rules enumerated in the specification.
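A minimal pure-Python sketch of the proposed semantics (not normative text; the None handling propagates unknown dimensions conservatively, similar in spirit to the Dask behaviour discussed under Questions below, and is only one possible choice):

```python
def broadcast_shapes(*shapes: tuple[int | None, ...]) -> tuple[int | None, ...]:
    """Broadcast one or more shapes against each other.

    Sketch only: None marks an unknown dimension and is propagated
    conservatively (any unknown input extent makes the output extent unknown).
    """
    ndim = max((len(shape) for shape in shapes), default=0)
    out: list[int | None] = []
    for axis in range(-ndim, 0):
        # Extents of every shape along this axis; missing leading axes count as 1.
        extents = [shape[axis] if -axis <= len(shape) else 1 for shape in shapes]
        known = {e for e in extents if e is not None and e != 1}
        if len(known) > 1:
            raise ValueError(f"shapes {shapes} are not broadcastable")
        if None in extents:
            out.append(None)          # unknown in -> unknown out
        elif known:
            out.append(known.pop())   # the single extent other than 1
        else:
            out.append(1)             # all extents are 1
    return tuple(out)


assert broadcast_shapes((8, 1, 3), (7, 1)) == (8, 7, 3)
assert broadcast_shapes((5,), (1, 5), (4, 1, 5)) == (4, 1, 5)
assert broadcast_shapes((None, 3), (1, 3)) == (None, 3)
```

Whether an unknown extent should propagate conservatively like this, or resolve to the known extent when the other operand is greater than 1, is part of the open question below.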
Questions
- How to handle shapes having unknown dimensions?
  - dask.array.core.broadcast_shapes sets the output size to nan if any of the input shapes are nan on the same axis (see the snippet after this list)
  - ndonnx.broadcast_arrays(a, b) returns arrays with materialized shapes. Note that shape materialization can be a very expensive operation, as it requires materializing the whole graph up to that point. In the case of Dask, which deliberately does not cache intermediate results as a memory management policy, this means computing everything at least twice.
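For concreteness, the Dask behaviour described above (assuming a recent Dask version; broadcast_shapes is a private helper, so this import may change):

```python
import math
from dask.array.core import broadcast_shapes  # private helper, see Prior Art

# An unknown (nan) extent on an axis yields an unknown extent in the result.
broadcast_shapes((math.nan, 3), (1, 3))  # -> (nan, 3)
```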
Notes
The top-level page on broadcasting mentions on the first line, using non-prescriptive language, that broadcasting allows creating views of the inputs:
Broadcasting refers to the automatic (implicit) expansion of array dimensions to be of equal sizes without copying array data
However, no mention of sharing memory is made in broadcast_to or broadcast_arrays.
For the sake of comparison, see the verbiage in asarray(copy=False).
The problem with this ambiguity is that one can work around the lack of broadcast_shapes by calling xp.broadcast_arrays(*args)[0].shape, but there is no strong guarantee that the backend won't deep-copy the inputs.
Note that numpy.broadcast_shapes doesn't work with shapes containing None (ndonnx and hopefully in the future JAX too) or NaN (Dask; non-standard).
I suggest either:
- Add prescriptive verbiage to broadcast_to and broadcast_arrays stating that the output must share memory with the input, or in other words that the operation must be O(1) (see the NumPy illustration after this list), or
- Add broadcast_shapes to the standard, and change the verbiage of the high-level broadcasting page to "typically without copying array data"
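For reference on the first suggestion: in NumPy today broadcast_to already returns a zero-copy view, which is the behaviour that would become normative (a quick way to verify it; other libraries may differ):

```python
import numpy as np

a = np.zeros((3, 1))
b = np.broadcast_to(a, (3, 4))

print(np.shares_memory(a, b))  # True: no data was copied
print(b.strides)               # (8, 0): the expanded axis has stride 0
```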
For the time being I am adding the function to array_api_extra: