.. _creating_an_op:

Creating a new :class:`Op`: Python implementation
=================================================

You may have looked through the library documentation but don't see a function
that does what you want. If you can implement something in terms of existing
:class:`Op`\s, you should do that. A PyTensor function that builds upon existing
expressions will be better optimized, automatically differentiable, and work
seamlessly across different backends.

However, if you cannot implement an :class:`Op` in terms of an existing :class:`Op`,
you have to write a new one. This page shows how to implement some simple
Python-based :class:`Op`\s that perform operations on NumPy arrays.

PyTensor Graphs refresher
-------------------------

.. image:: apply.png
    :width: 500 px

PyTensor represents symbolic mathematical computations as graphs. Those graphs
are bipartite graphs (graphs with two types of nodes): they are composed of
interconnected :ref:`apply` and :ref:`variable` nodes.
:ref:`variable` nodes represent data in the graph, either inputs, outputs, or
intermediary values. As such, inputs and outputs of a graph are lists of PyTensor
:ref:`variable` nodes. :ref:`apply` nodes perform computation on these
variables to produce new variables. Each :ref:`apply` node has a link to an
instance of :class:`Op` which describes the computation to perform. This tutorial
details how to write such an :class:`Op` instance. Please refer to
:ref:`graphstructures` for a more detailed explanation about the graph structure.

:class:`Op`'s basic methods
---------------------------

An :class:`Op` is any Python object that inherits from the :class:`Op` base class.
This section provides an overview of the basic methods you typically have to
implement to make a new :class:`Op`. It does not provide extensive coverage of
all the possibilities you may encounter or need. For that refer to the
:ref:`Op contract <op_contract>`.

.. testcode:: python

    from typing import Any

    from pytensor.graph.basic import Apply, Variable
    from pytensor.graph.fg import FunctionGraph
    from pytensor.graph.op import Op
    from pytensor.graph.type import Type


    class MyOp(Op):
        # Properties attribute
        __props__: tuple[Any, ...] = ()

        # Constructor, usually used only to set Op properties
        def __init__(self, *args):
            pass

        # itypes and otypes attributes are compulsory if make_node method is not defined.
        # They're the type of input and output respectively
        itypes: list[Type] | None = None
        otypes: list[Type] | None = None

        # make_node is compulsory if itypes and otypes are not defined
        # make_node is more flexible: output types can be determined
        # based on the input types and Op properties.
        def make_node(self, *inputs) -> Apply:
            pass

        # Performs the numerical evaluation of Op in Python. Required.
        def perform(self, node: Apply, inputs_storage: list[Any],
                    output_storage: list[list[Any]]) -> None:
            pass

        # Defines the symbolic expression for the L-operator based on the input and output variables
        # and the output gradient variables. Optional.
        def L_op(self, inputs: list[Variable], outputs: list[Variable],
                 output_grads: list[Variable]) -> list[Variable]:
            pass

        # Equivalent to L_op, but with a "technically"-bad name and without outputs provided.
        # It exists for historical reasons. Optional.
        def grad(self, inputs: list[Variable], output_grads: list[Variable]) -> list[Variable]:
            # Same as self.L_op(inputs, self(inputs), output_grads)
            pass

        # Defines the symbolic expression for the R-operator based on the input variables
        # and eval_point variables. Optional.
        def R_op(self, inputs: list[Variable],
                 eval_points: list[Variable | None]) -> list[Variable | None]:
            pass

        # Defines the symbolic expression for the output shape based on the input shapes
        # and, less frequently, the input variables via node.inputs. Optional.
        def infer_shape(self, fgraph: FunctionGraph, node: Apply,
                        input_shapes: list[tuple[Variable, ...]]) -> list[tuple[Variable]]:
            pass

An :class:`Op` has to implement some methods defined in the interface of
:class:`Op`. More specifically, it is mandatory for an :class:`Op` to define either
the method :meth:`make_node` or :attr:`itypes`, :attr:`otypes`, and :meth:`perform`.

:meth:`make_node`
^^^^^^^^^^^^^^^^^

The :meth:`make_node` method creates an :ref:`apply` node representing the
application of the :class:`Op` on the inputs provided. This method is responsible
for three things:

- Checks that the inputs can be converted to :ref:`variable`\s whose types are
  compatible with the current :class:`Op`. If the :class:`Op` cannot be applied on
  the provided input types, it must raise an exception (such as :class:`TypeError`).
- Creates new output :ref:`variable`\s of a suitable symbolic :class:`Type` to
  serve as the outputs of this :class:`Op`'s application.
- Returns an :ref:`apply` instance with the input and output :ref:`variable`\s, and
  itself as the :class:`Op`.

If :meth:`make_node` is not defined, the :attr:`itypes` and :attr:`otypes`
are used by the :class:`Op`'s default :meth:`make_node` method to implement the
functionality mentioned above.
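For instance, here is a minimal sketch of a :meth:`make_node` following the three
steps above. The class is hypothetical and only meant to illustrate the structure
(its :meth:`perform` is omitted); the full, working examples below follow the same
pattern:

.. testcode:: python

    from pytensor.tensor import as_tensor_variable


    class VectorDoubleOp(Op):
        """Hypothetical Op restricted to float64 vectors."""

        __props__ = ()

        def make_node(self, x):
            # 1) Convert the input and validate its type
            x = as_tensor_variable(x)
            if x.type.ndim != 1 or x.type.dtype != "float64":
                raise TypeError("x must be a float64 vector")
            # 2) Create an output Variable of a suitable type (same as x here)
            z = x.type()
            # 3) Return an Apply node linking the Op, its inputs and its outputs
            return Apply(self, [x], [z])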
:meth:`perform`
^^^^^^^^^^^^^^^

The :meth:`perform` method defines the Python implementation of an :class:`Op`.
It takes several arguments:

- ``node`` is a reference to an :ref:`apply` node which was previously
  obtained via the :meth:`make_node` method. It is typically not
  used in a simple :class:`Op`, but it contains symbolic information that
  could be required by a complex :class:`Op`.
- ``inputs`` is a list of references to data which can be operated on using
  non-symbolic statements (i.e., plain Python or NumPy code).
- ``output_storage`` is a list of storage cells where the output
  is to be stored. There is one storage cell for each output of the :class:`Op`.
  The data put in ``output_storage`` must match the type of the
  symbolic output.
  PyTensor may sometimes allow ``output_storage`` elements to persist
  between evaluations, or it may reset ``output_storage`` cells to hold
  a value of ``None``. It can also pre-allocate some memory
  for the :class:`Op` to use. This feature can allow ``perform`` to reuse
  memory between calls, for example. If there is something preallocated in
  the ``output_storage``, it will be of the correct dtype, but can have
  the wrong shape and have any stride pattern.

The output of :meth:`perform` must be determined solely by the inputs. That is to
say, when applied to identical inputs the method must return the same outputs.

:class:`Op`'s auxiliary methods
-------------------------------

There are other methods that can be optionally defined by the :class:`Op`:

:attr:`__props__`
^^^^^^^^^^^^^^^^^

The :attr:`__props__` attribute lists the :class:`Op` instance properties
that influence how the computation is performed. It must be a hashable tuple.
Usually these are set in :meth:`__init__`. If you don't have any properties
that influence the computation, then you will want to set this attribute to the
empty tuple ``()``.

:attr:`__props__` enables the automatic generation of appropriate
:meth:`__eq__` and :meth:`__hash__`. With these defaults, two :class:`Op`\s will
be equal if they have the same values for all the properties listed in
:attr:`__props__`, and equal :class:`Op`\s will have the same hash. When PyTensor
sees two nodes with equal :class:`Op`\s and the same set of inputs, it will assume
the outputs are equivalent and merge the nodes to avoid redundant computation.

When :attr:`Op.__props__` is not specified, two distinct instances of the same
class will not be equal, and each will hash by its ``id``. PyTensor won't merge
nodes with the same class but different instances in this case.

:attr:`__props__` will also generate a suitable :meth:`__repr__` and
:meth:`__str__` for your :class:`Op`.
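As a quick sketch of what these defaults give you (a complete, runnable example
appears in a later section), consider a hypothetical :class:`Op` with a single
``alpha`` property:

.. testcode:: python

    class ScaleOp(Op):
        __props__ = ("alpha",)

        def __init__(self, alpha):
            self.alpha = alpha

    # Equal properties -> equal Ops with equal hashes, so PyTensor may merge
    # nodes that apply them to the same inputs.
    assert ScaleOp(2.0) == ScaleOp(2.0)
    assert hash(ScaleOp(2.0)) == hash(ScaleOp(2.0))
    assert ScaleOp(2.0) != ScaleOp(3.0)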
:meth:`infer_shape`
^^^^^^^^^^^^^^^^^^^

The :meth:`infer_shape` method allows an :class:`Op` to infer the shape of its
output variables without actually computing them.
It takes as input ``fgraph``, a :class:`FunctionGraph`; ``node``, a reference
to the :class:`Op`'s :ref:`apply` node;
and a list of :class:`Variable`\s (e.g. ``i0_shape``, ``i1_shape``, ...)
which are the shapes of the :class:`Op`'s input :ref:`variable`\s.
:meth:`infer_shape` returns a list where each element is a tuple representing
the shape of one output.
This can be helpful when one only needs the shape of the output instead of the
actual outputs, for instance during rewriting procedures.

:meth:`L_op`
^^^^^^^^^^^^

The :meth:`L_op` method is required if you want to differentiate some cost
whose expression includes your :class:`Op`. The gradient is
specified symbolically in this method. It takes three arguments: ``inputs``,
``outputs`` and ``output_gradients``, all of which are lists of
:ref:`variable`\s, and those must be operated on using PyTensor's symbolic
language. The :meth:`L_op` method must return a list containing one
:ref:`variable` for each input. Each returned :ref:`variable` represents the
gradient with respect to that input computed based on the symbolic gradients
with respect to each output.
If the output is not differentiable with respect to an input then
this method should be defined to return a variable of type :class:`NullType`
for that input. Likewise, if you have not implemented the gradient
computation for some input, you may return a variable of type
:class:`NullType` for that input. Please refer to :meth:`L_op` for a more
detailed view.

:meth:`R_op`
^^^^^^^^^^^^

The :meth:`R_op` method is needed if you want :func:`pytensor.gradient.Rop` to
work with your :class:`Op`.
This method implements the application of the R-operator on the function
represented by your :class:`Op`. Let's assume that function is :math:`f`, with
input :math:`x`. Applying the R-operator means computing the Jacobian of
:math:`f` and right-multiplying it by :math:`v`, the evaluation point, namely:
:math:`\frac{\partial f}{\partial x} v`.
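For example, for a hypothetical :class:`Op` computing the elementwise square
:math:`f(x) = x^2`, the Jacobian is :math:`\operatorname{diag}(2x)`, so the
R-operator reduces to :math:`2 x v` elementwise. A minimal sketch, under those
assumptions:

.. code-block:: python

    def R_op(self, inputs, eval_points):
        (x,) = inputs
        (v,) = eval_points
        # None means there is no differentiable path through that input
        if v is None:
            return [None]
        # The Jacobian of x**2 is diag(2 * x), so J @ v is just 2 * x * v
        return [2 * x * v]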
Example: :class:`Op` definition
-------------------------------

.. testcode:: example

    import numpy as np

    from pytensor.graph.op import Op
    from pytensor.graph.basic import Apply, Variable
    from pytensor.graph.fg import FunctionGraph
    from pytensor.tensor import as_tensor_variable, TensorLike, TensorVariable


    class DoubleOp1(Op):
        __props__ = ()

        def make_node(self, x: TensorLike) -> Apply:
            # Convert (and require) x to be a TensorVariable
            x = as_tensor_variable(x)

            # Validate input type
            if not (x.type.ndim == 2 and x.type.dtype == "float64"):
                raise TypeError("x must be a float64 matrix")

            # Create an output variable of the same type as x
            z = x.type()
            # A TensorVariable's type includes shape and dtype, so this is equivalent to:
            # z = pytensor.tensor.TensorType(dtype=x.type.dtype, shape=x.type.shape)()
            # z = pytensor.tensor.tensor(dtype=x.type.dtype, shape=x.type.shape)

            return Apply(self, [x], [z])

        def perform(self, node: Apply, inputs: list[np.ndarray],
                    output_storage: list[list[np.ndarray | None]]) -> None:
            x = inputs[0]
            z = output_storage[0]
            # Numerical output based on numerical inputs (i.e., numpy arrays)
            z[0] = x * 2

        def infer_shape(self, fgraph: FunctionGraph, node: Apply,
                        input_shapes: list[list[Variable]]) -> list[list[Variable]]:
            # The output shape is the same as the input shape
            return input_shapes

        def L_op(self, inputs: list[TensorVariable], outputs: list[TensorVariable],
                 output_grads: list[TensorVariable]):
            # Symbolic expression for the gradient.
            # For this Op, the inputs and outputs aren't part of the expression.
            # output_grads[0] is a TensorVariable!
            return [output_grads[0] * 2]

        def R_op(self, inputs: list[TensorVariable],
                 eval_points: list[TensorVariable | None]) -> list[TensorVariable] | None:
            # R_op can receive None as eval_points.
            # That means there is no differentiable path through that input.
            # If this implies that you cannot compute some outputs,
            # return None for those.
            if eval_points[0] is None:
                return None
            # For this Op, the R_op is the same as the L_op
            outputs = [self(*inputs)]
            return self.L_op(inputs, outputs, eval_points)


    doubleOp1 = DoubleOp1()

At a high level, the code fragment declares a class (e.g., ``DoubleOp1``) and then
creates one instance of it (e.g., ``doubleOp1``).

As you'll see below, you can then pass an instantiated :ref:`variable`, such as
``x = tensor.matrix("x")``, to the instantiated :class:`Op` to define a new
:ref:`variable` that represents the output of applying the :class:`Op` to the
input variable.

Under the hood, :meth:`__call__` calls :meth:`make_node` and then returns the
output variable(s) of the :ref:`apply` node returned by that method.
The number and order of the inputs in the returned :ref:`apply` should match
those expected by :meth:`make_node`. PyTensor may decide to call
:meth:`make_node` itself later, to copy the graph or perform a generic rewrite.

All the ``inputs`` and ``outputs`` arguments to the returned :ref:`apply` must be
:ref:`variable`\s. A common and easy way to ensure inputs are variables is to run
them through ``as_tensor_variable``. This function leaves :class:`TensorVariable`
variables alone, raises an error for variables with an incompatible type, and
copies any ``numpy.ndarray`` into the storage for a :class:`TensorConstant`.

The :meth:`perform` method implements the :class:`Op`'s mathematical logic in
Python. The inputs (here ``x = inputs[0]``) are passed by value, and a single
output is stored as the first element of a single-element list (here
``z = output_storage[0]``). If ``doubleOp1`` had a second output, it would be
stored in ``output_storage[1][0]``.
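For instance, here is a sketch of :meth:`perform` for a hypothetical two-output
variant of this :class:`Op` (the second output is made up for illustration):

.. code-block:: python

    def perform(self, node, inputs, output_storage):
        (x,) = inputs
        # One single-element storage cell per output
        output_storage[0][0] = x * 2  # first output
        output_storage[1][0] = x + 1  # hypothetical second output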
In some execution modes, the output storage might contain the return value of
a previous call. That old value can be reused to avoid memory re-allocation,
but it must not influence the semantics of the :class:`Op` output.

You can try the new :class:`Op` as follows:

.. testcode:: example

    from pytensor import function
    from pytensor.tensor import matrix

    doubleOp1 = DoubleOp1()

    x = matrix("x")
    out = doubleOp1(x)
    assert out.type == x.type

    fn = function([x], out)
    x_np = np.random.normal(size=(5, 4))
    np.testing.assert_allclose(x_np * 2, fn(x_np))

It's also a good idea to test the :meth:`infer_shape` implementation. To do this
we can request a graph of the shape only:

.. testcode:: example

    out_shape = out.shape
    shape_fn = function([x], out_shape)
    assert tuple(shape_fn(x_np)) == x_np.shape

    # We can introspect the compiled function to confirm the Op is not evaluated
    shape_fn.dprint()

.. testoutput:: example

    MakeVector{dtype='int64'} [id A] 2
     ├─ Shape_i{0} [id B] 1
     │  └─ x [id C]
     └─ Shape_i{1} [id D] 0
        └─ x [id C]

Finally we should test the gradient implementation. For this we can use the
:func:`pytensor.gradient.verify_grad` utility, which compares the output of a
gradient function with a finite-difference approximation:

.. testcode:: example

    from pytensor.gradient import verify_grad

    rng = np.random.default_rng(42)
    test_x = rng.normal(size=(5, 4))

    # Raises if the gradient output is sufficiently different from the
    # finite difference approximation.
    verify_grad(doubleOp1, [test_x], rng=rng)

Example: :attr:`itypes` and :attr:`otypes` definition
-----------------------------------------------------

Since this :class:`Op` has a very strict type signature, we can use
:attr:`itypes` and :attr:`otypes` instead of :meth:`make_node`:

.. testcode:: example

    from pytensor.tensor import dmatrix

    class DoubleOp2(Op):
        __props__ = ()

        # inputs and output types must be float64 matrices
        itypes = [dmatrix]
        otypes = [dmatrix]

        def perform(self, node, inputs, output_storage):
            x = inputs[0]
            z = output_storage[0]
            z[0] = x * 2


    doubleOp2 = DoubleOp2()
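Because the declared :attr:`itypes` are enforced by the automatically generated
:meth:`make_node`, applying the :class:`Op` to an input of any other type should
raise a :class:`TypeError` (a small sketch, assuming the default behavior):

.. testcode:: example

    from pytensor.tensor import fmatrix

    out = doubleOp2(matrix("x64", dtype="float64"))  # matches the declared itypes

    try:
        doubleOp2(fmatrix("x32"))  # float32 does not match dmatrix
    except TypeError:
        pass  # expected: the generated make_node rejects the input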
Example: :attr:`__props__` definition
-------------------------------------

We can modify the previous piece of code in order to demonstrate the usage of the
:attr:`__props__` attribute. We create an :class:`Op` that takes a variable ``x``
and returns ``a*x+b``. We want to say that two such :class:`Op`\s are equal when
their values of ``a`` and ``b`` are equal.

.. testcode:: properties

    from pytensor.graph.op import Op
    from pytensor.graph.basic import Apply
    from pytensor.tensor import as_tensor_variable


    class AXPBOp(Op):
        """
        This creates an Op that takes x to a*x+b.
        """

        __props__ = ("a", "b")

        def __init__(self, a, b):
            self.a = a
            self.b = b
            super().__init__()

        def make_node(self, x):
            x = as_tensor_variable(x)
            return Apply(self, [x], [x.type()])

        def perform(self, node, inputs, output_storage):
            x = inputs[0]
            z = output_storage[0]
            z[0] = self.a * x + self.b

The use of :attr:`__props__` saves the user the trouble of implementing
:meth:`__eq__` and :meth:`__hash__` manually. It also generates default
:meth:`__repr__` and :meth:`__str__` methods that print the attribute names and
their values.

We can test this by running the following segment:

.. testcode:: properties

    import numpy as np

    from pytensor.tensor import matrix
    from pytensor import function

    mult4plus5op = AXPBOp(4, 5)
    another_mult4plus5op = AXPBOp(4, 5)
    mult2plus3op = AXPBOp(2, 3)

    assert mult4plus5op == another_mult4plus5op
    assert mult4plus5op != mult2plus3op

    x = matrix("x", dtype="float32")
    f = function([x], mult4plus5op(x))
    g = function([x], mult2plus3op(x))

    inp = np.random.normal(size=(5, 4)).astype("float32")

    np.testing.assert_allclose(4 * inp + 5, f(inp))
    np.testing.assert_allclose(2 * inp + 3, g(inp))

To demonstrate the use of equality, we will define the following graph:
``mult4plus5op(x) + another_mult4plus5op(x) + mult2plus3op(x)``,
and confirm that PyTensor infers it can reuse the first term in place of the
second term, ``another_mult4plus5op(x)``.

.. testcode:: properties

    from pytensor.graph import rewrite_graph

    graph = mult4plus5op(x) + another_mult4plus5op(x) + mult2plus3op(x)
    print("Before:")
    graph.dprint()

    print("\nAfter:")
    rewritten_graph = rewrite_graph(graph)
    rewritten_graph.dprint()

.. testoutput:: properties

    Before:
    Add [id A]
     ├─ Add [id B]
     │  ├─ AXPBOp{a=4, b=5} [id C]
     │  │  └─ x [id D]
     │  └─ AXPBOp{a=4, b=5} [id E]
     │     └─ x [id D]
     └─ AXPBOp{a=2, b=3} [id F]
        └─ x [id D]

    After:
    Add [id A]
     ├─ AXPBOp{a=4, b=5} [id B]
     │  └─ x [id C]
     ├─ AXPBOp{a=4, b=5} [id B]
     │  └─ ···
     └─ AXPBOp{a=2, b=3} [id D]
        └─ x [id C]

Note how, after rewriting, the same variable [id B] is used twice. Also, the
string representation of the :class:`Op` shows the values of its properties.

Example: More complex :class:`Op`
---------------------------------

As a final example, we will create a multi-output :class:`Op` that takes a matrix
and a vector and returns the matrix transposed and the sum of the vector.

Furthermore, this :class:`Op` will work with batched dimensions, meaning we can
pass in inputs with extra leading dimensions (e.g., a 3D tensor instead of a
matrix) and it will work as expected. To achieve this behavior we cannot use
:attr:`itypes` and :attr:`otypes`, as those encode a specific number of
dimensions. Instead we have to define the :meth:`make_node` method.

We need to be careful in the :meth:`L_op` method, as one of the output gradients
may be disconnected from the cost, in which case we should ignore its
contribution. If both outputs are disconnected, PyTensor will not bother calling
the :meth:`L_op` method, so we don't need to worry about that case.
.. testcode::

    import numpy as np
    import pytensor.tensor as pt

    from pytensor.graph.op import Op
    from pytensor.graph.basic import Apply
    from pytensor.gradient import DisconnectedType


    class TransposeAndSumOp(Op):
        __props__ = ()

        def make_node(self, x, y):
            # Convert to TensorVariables (and fail if not possible)
            x = pt.as_tensor_variable(x)
            y = pt.as_tensor_variable(y)

            # Validate inputs dimensions
            if x.type.ndim < 2:
                raise TypeError("x must be at least a matrix")
            if y.type.ndim < 1:
                raise TypeError("y must be at least a vector")

            # Create output variables
            out1_static_shape = (*x.type.shape[:-2], x.type.shape[-1], x.type.shape[-2])
            out1_dtype = x.type.dtype
            out1 = pt.tensor(dtype=out1_dtype, shape=out1_static_shape)

            out2_static_shape = y.type.shape[:-1]
            out2_dtype = "float64"  # hard-coded regardless of the input
            out2 = pt.tensor(dtype=out2_dtype, shape=out2_static_shape)

            return Apply(self, [x, y], [out1, out2])

        def perform(self, node, inputs, output_storage):
            x, y = inputs
            out_1, out_2 = output_storage
            out_1[0] = np.swapaxes(x, -1, -2)
            out_2[0] = y.sum(-1).astype("float64")

        def infer_shape(self, fgraph, node, input_shapes):
            x_shapes, y_shapes = input_shapes
            out1_shape = (*x_shapes[:-2], x_shapes[-1], x_shapes[-2])
            out2_shape = y_shapes[:-1]
            return [out1_shape, out2_shape]

        def L_op(self, inputs, outputs, output_grads):
            x, y = inputs
            out1_grad, out2_grad = output_grads

            if isinstance(out1_grad.type, DisconnectedType):
                x_grad = DisconnectedType()()
            else:
                # Transpose the last two dimensions of the output gradient
                x_grad = pt.swapaxes(out1_grad, -1, -2)

            if isinstance(out2_grad.type, DisconnectedType):
                y_grad = DisconnectedType()()
            else:
                # Broadcast the output gradient to the same shape as y
                y_grad = pt.broadcast_to(pt.expand_dims(out2_grad, -1), y.shape)

            return [x_grad, y_grad]

Let's test the :class:`Op` evaluation:

.. testcode::

    import numpy as np

    from pytensor import function

    transpose_and_sum_op = TransposeAndSumOp()

    x = pt.tensor("x", shape=(5, None, 3), dtype="float32")
    y = pt.matrix("y", shape=(2, 1), dtype="float32")

    x_np = np.random.normal(size=(5, 4, 3)).astype(np.float32)
    y_np = np.random.normal(size=(2, 1)).astype(np.float32)

    out1, out2 = transpose_and_sum_op(x, y)

    # Test the output types
    assert out1.type.shape == (5, 3, None)
    assert out1.type.dtype == "float32"
    assert out2.type.shape == (2,)
    assert out2.type.dtype == "float64"

    # Test the perform method
    f = function([x, y], [out1, out2])
    out1_np, out2_np = f(x_np, y_np)
    np.testing.assert_allclose(out1_np, x_np.swapaxes(-1, -2))
    np.testing.assert_allclose(out2_np, y_np.sum(-1))

And the shape inference:

.. testcode::

    out1_shape = out1.shape
    out2_shape = out2.shape
    shape_fn = function([x, y], [out1_shape, out2_shape])

    out1_shape_np, out2_shape_np = shape_fn(x_np, y_np)
    assert tuple(out1_shape_np) == out1_np.shape
    assert tuple(out2_shape_np) == out2_np.shape

    # We can introspect the compiled function to confirm the Op is not needed
    shape_fn.dprint()

.. testoutput::

    MakeVector{dtype='int64'} [id A] 1
     ├─ 5 [id B]
     ├─ 3 [id C]
     └─ Shape_i{1} [id D] 0
        └─ x [id E]
    DeepCopyOp [id F] 2
     └─ [2] [id G]
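Since :meth:`make_node` only enforces a minimum number of dimensions, the same
:class:`Op` instance also accepts non-batched inputs, for example a plain matrix
and a vector:

.. testcode::

    x2d = pt.matrix("x2d", dtype="float32")
    y1d = pt.vector("y1d", dtype="float32")

    o1, o2 = transpose_and_sum_op(x2d, y1d)
    assert o1.type.ndim == 2  # the transposed matrix
    assert o2.type.ndim == 0  # the sum collapses the vector to a scalar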
Finally, the gradient expression. Again, we can use the PyTensor
:func:`verify_grad <pytensor.gradient.verify_grad>` function to test the gradient
implementation. Due to the presence of multiple outputs we need to pass a
``Callable`` instead of the :class:`Op` instance. There are different cases we
want to test: when both, or just one, of the outputs is connected to the cost.

.. testcode::

    import warnings

    import numpy as np
    from pytensor.gradient import verify_grad

    transpose_and_sum_op = TransposeAndSumOp()

    def both_outs_connected(x, y):
        out1, out2 = transpose_and_sum_op(x, y)
        return out1.sum() + out2.sum()

    def only_out1_connected(x, y):
        out1, _ = transpose_and_sum_op(x, y)
        return out1.sum()

    def only_out2_connected(x, y):
        _, out2 = transpose_and_sum_op(x, y)
        return out2.sum()

    rng = np.random.default_rng(seed=37)
    x_np = rng.random((5, 4, 3)).astype(np.float32)
    y_np = rng.random((2, 1)).astype(np.float32)

    verify_grad(both_outs_connected, [x_np, y_np], rng=rng)

    # PyTensor will raise a warning about the disconnected gradient
    with warnings.catch_warnings():
        warnings.simplefilter("ignore")
        verify_grad(only_out1_connected, [x_np, y_np], rng=rng)
        verify_grad(only_out2_connected, [x_np, y_np], rng=rng)

We are filtering a warning about :class:`DisconnectedType` variables being
returned by the gradient method. PyTensor would like to know how the outputs of
the :class:`Op` are connected to the inputs, which could be done with
:meth:`connection_pattern`. This was omitted for brevity, since it's a rare
edge case.
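For reference, a minimal sketch of what that :meth:`connection_pattern` could look
like for this :class:`Op`: it returns one list per input, with one boolean per
output, indicating which outputs depend on which inputs (here, ``out1`` depends
only on ``x`` and ``out2`` only on ``y``):

.. code-block:: python

    def connection_pattern(self, node):
        # [input][output] -> is this output connected to this input?
        return [
            [True, False],  # x: connected to out1, not to out2
            [False, True],  # y: connected to out2, not to out1
        ]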
Developer testing utilities
---------------------------

PyTensor has some functionalities to test for a correct implementation of an
:class:`Op` and its many methods.

We have already seen some user-facing helpers, but there are also test classes
for :class:`Op` implementations that are added to the codebase, to be used with
``pytest``. Here we mention those that can be used to test the implementation of
:meth:`infer_shape`, :meth:`L_op`, and :meth:`R_op`.

Basic Tests
^^^^^^^^^^^

Basic tests are done by you just by using the :class:`Op` and checking that it
returns the right answer. If you detect an error, you must raise an exception.
You can use the ``assert`` keyword to automatically raise an
:class:`AssertionError`, or the utilities in :mod:`numpy.testing`.

.. testcode:: tests

    import numpy as np

    from pytensor import function
    from pytensor.tensor import matrix
    from tests.unittest_tools import InferShapeTester


    class TestDouble(InferShapeTester):
        def setup_method(self):
            super().setup_method()
            self.op_class = DoubleOp1
            self.op = DoubleOp1()

        def test_basic(self):
            rng = np.random.default_rng(377)

            x = matrix("x", dtype="float64")
            f = function([x], self.op(x))

            inp = np.asarray(rng.random((5, 4)), dtype="float64")
            out = f(inp)

            # Compare the result computed to the expected value.
            np.testing.assert_allclose(inp * 2, out)

Testing the :meth:`infer_shape`
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

When a class inherits from the :class:`InferShapeTester` class, it gets the
:meth:`InferShapeTester._compile_and_check` method that tests the
:meth:`infer_shape` method. It tests that the :class:`Op` gets rewritten out of
the graph if only the shape of the output is needed and not the output itself.
Additionally, it checks that the rewritten graph computes the correct shape, by
comparing it to the actual shape of the computed output.

:meth:`InferShapeTester._compile_and_check` compiles a PyTensor function. It
takes as parameters the lists of input and output PyTensor variables, as would
be provided to :func:`pytensor.function`, and a list of real values to pass to
the compiled function. It also takes the :class:`Op` class as a parameter in
order to verify that no instance of it appears in the shape-optimized graph.

If there is an error, the function raises an exception. If you want to see it
fail, you can implement an incorrect :meth:`infer_shape`.

When testing with input values with shapes that take the same value over
different dimensions (for instance, a square matrix, or a ``tensor3`` with
shape ``(n, n, n)``, or ``(m, n, m)``), it is not possible to detect if the
output shape was computed correctly, or if some shapes with the same value
have been mixed up. For instance, if the :meth:`infer_shape` uses the width of
a matrix instead of its height, then testing with only square matrices will
not detect the problem. To avoid this, the
:meth:`InferShapeTester._compile_and_check` method prints a warning in such a
case. If your :class:`Op` works only with such matrices, you can disable the
warning with the ``warn=False`` parameter.

.. testcode:: tests

    class TestDouble(InferShapeTester):

        # [...] as previous tests.

        def test_infer_shape(self):
            rng = np.random.default_rng(42)
            x = matrix("x", dtype="float64")
            self._compile_and_check(
                [x],  # pytensor.function inputs
                [self.op(x)],  # pytensor.function outputs
                # Non-square inputs
                [rng.random(size=(5, 4))],
                # Op that should be removed from the graph.
                self.op_class,
            )

Testing the gradient
^^^^^^^^^^^^^^^^^^^^

As shown above, the function :func:`verify_grad <pytensor.gradient.verify_grad>`
verifies the gradient of an :class:`Op` or PyTensor graph. It compares the
analytic (symbolically computed) gradient and the numeric gradient (computed
through the Finite Difference Method).

If there is an error, the function raises an exception. If you want to see it
fail, you can implement an incorrect gradient (for instance, by removing the
multiplication by 2).

.. testcode:: tests

    def test_grad(self):
        rng = np.random.default_rng(2024)
        verify_grad(
            self.op,
            [rng.random(size=(5, 7))],
            rng=rng,
        )

Testing the Rop
^^^^^^^^^^^^^^^

The class :class:`RopLopChecker` defines the methods
:meth:`RopLopChecker.check_mat_rop_lop`, :meth:`RopLopChecker.check_rop_lop` and
:meth:`RopLopChecker.check_nondiff_rop`. These allow you to test the
implementation of the :meth:`R_op` method of a particular :class:`Op`.

For instance, to verify the :meth:`R_op` method of the ``DoubleOp1``, you can
use this:

.. testcode:: tests

    from tests.test_rop import RopLopChecker


    class TestDoubleOpRop(RopLopChecker):
        def test_double_rop(self):
            self.check_rop_lop(DoubleOp1()(self.x), self.in_shape)

Running Your Tests
^^^^^^^^^^^^^^^^^^

To perform your tests, simply run ``pytest``.

Exercise
""""""""

Run the code of the ``DoubleOp1`` example above.

Modify and execute to compute: ``x * y``.

Modify and execute the example to return two outputs: ``x + y`` and ``|x - y|``.

You can omit the :meth:`R_op` functions. Try to implement the testing apparatus
described above.

:download:`Solution`

:func:`as_op`
-------------

:func:`as_op` is a Python decorator that converts a Python function into a
basic PyTensor :class:`Op` that will call the supplied function during
execution.

This isn't the recommended way to build an :class:`Op`, but allows for a quick
implementation.

It takes an optional :meth:`infer_shape` parameter that must have this
signature:

.. code-block:: none

    def infer_shape(fgraph, node, input_shapes):
        # ...
        return output_shapes

- :obj:`input_shapes` and :obj:`output_shapes` are lists of tuples that
  represent the shape of the corresponding inputs/outputs, and :obj:`fgraph`
  is a :class:`FunctionGraph`.

.. warning::

    Not providing an :meth:`infer_shape` prevents shape-related rewrites from
    working with this :class:`Op`. For example, ``your_op(inputs, ...).shape``
    will need the :class:`Op` to be executed just to get the shape.
.. note::

    As no :meth:`L_op` is defined, you won't be able to differentiate paths
    that include this :class:`Op`.

.. note::

    The decorator converts the Python function into a callable object that
    takes as inputs the PyTensor variables that were declared.

.. note::

    The Python function wrapped by the :func:`as_op` decorator needs to return
    a new data allocation; it must not return views or modify its inputs in
    place.

:func:`as_op` Example
^^^^^^^^^^^^^^^^^^^^^

.. testcode:: asop

    import numpy as np

    import pytensor
    import pytensor.tensor as pt
    from pytensor import function
    from pytensor.compile.ops import as_op


    def infer_shape_numpy_dot(fgraph, node, input_shapes):
        ashp, bshp = input_shapes
        return [ashp[:-1] + bshp[-1:]]


    @as_op(
        itypes=[pt.dmatrix, pt.dmatrix],
        otypes=[pt.dmatrix],
        infer_shape=infer_shape_numpy_dot,
    )
    def numpy_dot(a, b):
        return np.dot(a, b)

You can try it as follows:

.. testcode:: asop

    x = pt.matrix()
    y = pt.matrix()

    f = function([x, y], numpy_dot(x, y))

    inp1 = np.random.random_sample((5, 4))
    inp2 = np.random.random_sample((4, 7))
    out = f(inp1, inp2)

Final Note
----------

The section :ref:`Other Ops <other_ops>` includes more instructions for the
following specific cases:

- :ref:`scalar_ops`
- :ref:`sparse_ops`
- :ref:`openmp_ops`

For defining a C-based :class:`COp` see :ref:`creating_a_c_op`.
For defining implementations for other backends see
:ref:`creating_a_numba_jax_op`.

.. note::

    This is an introductory tutorial and as such it does not cover how to make
    an :class:`Op` that returns a view or modifies the values in its inputs.
    Thus, all :class:`Op`\s created with the instructions described here MUST
    return newly allocated memory or reuse the memory provided in the parameter
    ``output_storage`` of the :meth:`perform` method. See
    :ref:`views_and_inplace` for an explanation on how to do this.

    If your :class:`Op` returns a view or changes the value of its inputs
    without doing as prescribed in that page, PyTensor will run, but will
    return correct results for some graphs and wrong results for others.

    It is recommended that you run your tests in :class:`DebugMode`, since it
    can help verify whether or not your :class:`Op` behaves correctly in this
    regard.
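For instance, assuming ``x`` and ``out`` are symbolic variables defined as in
the examples above, one way to enable it is to pass the mode name when
compiling (a small sketch; see the :class:`DebugMode` documentation for its
configuration options):

.. code-block:: python

    from pytensor import function

    # Compiles extra self-checks into the function; every call verifies the
    # Op's outputs across execution strategies, at a large runtime cost.
    fn = function([x], out, mode="DebugMode")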