Add xtensor broadcast #1489
base: labeled_tensors
Conversation
@ricardoV94 Here's my attempt to rebase on the changes you just force-pushed. Looks like mypy is unhappy; is that something you expected? Other than that, I think this is ready for review.
Yeah, I didn't make mypy pass yet.
pytensor/xtensor/rewriting/shape.py
Outdated
x_tensor = x_tensor.dimshuffle(shuffle_pattern)

# Now we are aligned with target dims and correct ndim
x_tensor = broadcast_to(x_tensor, out.type.shape)
This won't work when the output shape is not statically known. The target shape has to be computed symbolically from the symbolic input shapes.
You can test by having an xtensor with shape=(None,) for a dim that only that tensor has.
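For illustration, here is a minimal self-contained sketch (made-up variables, not code from this PR) of what computing the target shape symbolically can look like with pt.broadcast_shape and pt.broadcast_to, so that it also covers a dim whose length is only known at runtime:

```python
import numpy as np
import pytensor.tensor as pt
from pytensor import function

x = pt.tensor("x", shape=(2, 1))    # statically known shape
y = pt.tensor("y", shape=(None,))   # length only known at runtime

# Build the target shape from the inputs' symbolic shapes rather than
# from the output type's static shape.
target_shape = pt.broadcast_shape(x, y)
x_bc = pt.broadcast_to(x, target_shape)

f = function([x, y], x_bc.shape)
print(f(np.zeros((2, 1)), np.zeros(5)))  # [2 5]
```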
71bc4ef to 41d9be4
@ricardoV94 I think I have symbolic dimensions working. My solution is more complicated than I think any of us would like, but I don't see a simpler one. Maybe you will. Should we continue working on this PR for now, and I'll rebase later?
Here is an idea:

def lower_broadcast(fgraph, node):
    excluded_dims = node.op.exclude
    broadcast_dims = tuple(dim for dim in node.outputs[0].type.dims if dim not in excluded_dims)
    all_dims = broadcast_dims + excluded_dims

    # align inputs with all_dims like we do in other rewrites
    # probably time to refactor this kind of logic into a helper
    inp_tensors = []
    for inp, out in zip(node.inputs, node.outputs, strict=True):
        inp_dims = inp.type.dims
        order = tuple(inp_dims.index(dim) if dim in inp_dims else "x" for dim in all_dims)
        inp_tensors.append(inp.values.dimshuffle(order))

    if not excluded_dims:
        out_tensors = pt.broadcast_arrays(*inp_tensors)
    else:
        out_tensors = []
        all_shape = tuple(pt.broadcast_shape(*inp_tensors))
        assert len(all_shape) == len(all_dims)
        for inp_tensor, out in zip(inp_tensors, node.outputs):
            out_dims = out.type.dims
            out_shape = tuple(length for length, dim in zip(all_shape, all_dims) if dim in out_dims)
            out_tensors.append(pt.broadcast_to(inp_tensor, out_shape))

    new_outs = [as_xtensor(out_tensor, dims=out.type.dims) for out_tensor, out in zip(out_tensors, node.outputs)]
    return new_outs

Btw the base branch is merged; you can rebase / start from it. Note that you don't need to open a new PR: you can force-push your changes to your current remote after cleaning up the branch.
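As an aside, here is a small self-contained illustration (hypothetical shapes and dim names, not part of the PR) of the alignment step referenced in the sketch above: a dimshuffle pattern with "x" entries inserts broadcastable axes so an input lines up with all_dims before broadcasting.

```python
import pytensor.tensor as pt

inp = pt.tensor("inp", shape=(2, 3))   # an input carrying dims ("a", "b")
inp_dims = ("a", "b")
all_dims = ("a", "c", "b")             # target dim order

# Same pattern-building trick as in the sketch: reuse existing axes, add "x" for missing dims
order = tuple(inp_dims.index(dim) if dim in inp_dims else "x" for dim in all_dims)
aligned = inp.dimshuffle(order)

print(order)               # (0, 'x', 1)
print(aligned.type.shape)  # (2, 1, 3)
```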
@ricardoV94 I've added
Your version of I'll work on debugging it, but at the moment it's not clear to me whether there is a small error in your implementation or an actual problem with the logic.
I suspect some wrong assumption about the excluded dims alignment, but the general idea should work.
I think the incorrect assumption is that all outputs have the same shape. When exclude is not empty, they don't, in general.
Actually, there's a logical flaw. Two inputs could have an excluded dim with the same name but different length, in which case they shouldn't be aligned for the broadcast shape. We should add that as a test. Still, the logic for each output should be something like
I didn't assume that; the dimshuffle was supposed to take care of it so that things were put on different axes for broadcasting. Still, as I just wrote, there was a wrong assumption that you could align shared excluded dims. They don't even come out in a uniform order, do they?
I don't think this logical flaw is why the tests are failing, though. We should test that case as well.
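To make the case being discussed concrete, here is a hypothetical example (made-up arrays) using xarray itself as the reference semantics: two inputs share an excluded dim with different lengths, and the excluded dim must not take part in the broadcast.

```python
import numpy as np
import xarray as xr

x = xr.DataArray(np.zeros((2, 3)), dims=("a", "b"))
y = xr.DataArray(np.zeros((2, 5)), dims=("a", "b"))  # same dim "b", different length

# With "b" excluded, xarray does not try to align or broadcast it, so this is valid
x_b, y_b = xr.broadcast(x, y, exclude=["b"])
print(x_b.sizes["b"], y_b.sizes["b"])  # 3 5: each input keeps its own "b" length
```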
@AllenDowney does this work? If not, what case fails?

from pytensor.xtensor.type import xtensor
from pytensor.xtensor.math import second


def broadcast(array, *arrays, exclude=()):
    if isinstance(exclude, str):
        exclude = (exclude,)

    def sum_excluded_dims(array):
        if not exclude:
            return array
        dims = array.dims
        array_exclude = tuple(e for e in exclude if e in dims)
        if not array_exclude:
            return array
        return array.sum(array_exclude)

    def align_excluded_dims(array):
        if not exclude:
            return array
        dims = array.dims
        array_exclude = tuple(e for e in exclude if e in dims)
        if not array_exclude:
            return array
        return array.transpose(..., *array_exclude)

    if not arrays:
        return array

    # Find broadcast shape by doing nested second after excluding via `sum`
    # The sum operation will be removed in rewrites, since only the shape matters
    broadcast_array = sum_excluded_dims(array)
    for other_array in arrays:
        # second is equivalent to `np.broadcast_arrays(x, y)[1]`
        broadcast_array = second(broadcast_array, sum_excluded_dims(other_array))

    # Broadcast each original array with the broadcast_array
    # We further align the excluded dims according to the order given by the user, like xarray does
    return tuple(second(broadcast_array, align_excluded_dims(arr)) for arr in (array, *arrays))


x = xtensor(dims=("a", "b", "c"))
y = xtensor(dims=("a", "d"))
z = xtensor(dims=("a", "f", "b"))

for out in broadcast(x, y, z, exclude=("b", "f")):
    print(out.dims)
# ('a', 'c', 'd', 'b')
# ('a', 'c', 'd')
# ('a', 'c', 'd', 'b', 'f')
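For reference, a quick NumPy analogue (made-up arrays) of the behaviour the comment above attributes to second, namely np.broadcast_arrays(x, y)[1]:

```python
import numpy as np

x = np.zeros((2, 1))
y = np.ones((1, 3))

# "second(x, y)" is described above as returning the second argument
# broadcast against the first, i.e. y's values at the common shape.
second_like = np.broadcast_arrays(x, y)[1]
print(second_like.shape)  # (2, 3)
```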
This replaces #1486. This one is based on a rebased labeled_tensor branch.

📚 Documentation preview 📚: https://pytensor--1489.org.readthedocs.build/en/1489/