GraphBuilder#

yobx.xbuilder.GraphBuilder simplifies the programmatic construction and optimization of ONNX graphs. It is the primary tool used to convert a torch.fx.Graph into a onnx.ModelProto, but it can equally be used standalone to build or transform any ONNX graph from scratch.

Class Hierarchy#

GraphBuilder is composed of three cooperative base classes:

  • _BuilderRuntime — evaluates small constant sub-expressions (e.g. the [0, 0, -1] passed to a Reshape node) so the builder can resolve -1 to the correct symbolic formula and fold constants early.

  • _ShapeRuntime — handles value-as-shape tracking needed by operators such as Shape, Gather, Concat, and Slice when their outputs feed directly into a Reshape.

  • _InferenceRuntime — walks the graph node by node, dispatching each node to the matching per-operator handler in yobx.xshape.shape_type_compute so that shapes and types are tracked for every intermediate result.

Two helper classes round out the public API:

Building a graph from scratch#

The simplest workflow is:

  1. Construct a GraphBuilder with an opset version.

  2. Call make_tensor_input to declare each graph input.

  3. Call make_node (or the short-hand g.op.<OpType>(…) syntax) to add operators.

  4. Call make_tensor_output to declare each graph output.

  5. Call to_onnx to obtain a onnx.ModelProto.

<<<

import numpy as np
import onnx
from yobx.helpers.onnx_helper import pretty_onnx
from yobx.xbuilder import GraphBuilder

TFLOAT = onnx.TensorProto.FLOAT

# 1. create builder targeting opset 18
g = GraphBuilder(18, ir_version=10)

# 2. declare inputs
g.make_tensor_input("X", TFLOAT, ("batch", "seq", 64))
g.make_tensor_input("W", TFLOAT, (64, 32))

# 3. add a MatMul node via the short-hand op accessor
result = g.op.MatMul("X", "W")

# 4. declare the output and export
g.make_tensor_output(
    result, elem_type=TFLOAT, shape=("batch", "seq", 32), indexed=False
)
model = g.to_onnx()
print(f"nodes  : {len(model.graph.node)}")
print(f"opset  : {model.opset_import[0].version}")
print(f"output : {model.graph.output[0].name}")
print(pretty_onnx(model))

>>>

    nodes  : 1
    opset  : 18
    output : _onx_matmul_X
    opset: domain='' version=18
    input: name='X' type=dtype('float32') shape=['batch', 'seq', 64]
    input: name='W' type=dtype('float32') shape=[64, 32]
    MatMul(X, W) -> _onx_matmul_X
    output: name='_onx_matmul_X' type=dtype('float32') shape=['batch', 'seq', 32]

Loading an existing model#

Passing an existing onnx.ModelProto to the constructor loads it into the builder so its nodes and initializers can be inspected, modified, or re-optimized.

<<<

import onnx
import onnx.helper as oh
from yobx.xbuilder import GraphBuilder

TFLOAT = onnx.TensorProto.FLOAT

model = oh.make_model(
    oh.make_graph(
        [
            oh.make_node("Add", ["X", "Y"], ["T"]),
            oh.make_node("Relu", ["T"], ["Z"]),
        ],
        "add_relu",
        [
            oh.make_tensor_value_info("X", TFLOAT, ["batch", 4]),
            oh.make_tensor_value_info("Y", TFLOAT, ["batch", 4]),
        ],
        [oh.make_tensor_value_info("Z", TFLOAT, ["batch", 4])],
    ),
    opset_imports=[oh.make_opsetid("", 18)],
    ir_version=10,
)

g = GraphBuilder(model)
print("input  shapes:", {n: g.get_shape(n) for n in g.input_names})
print("nodes        :", [node.op_type for node in g.nodes])

>>>

    input  shapes: {'X': ('batch', 4), 'Y': ('batch', 4)}
    nodes        : ['Add', 'Relu']

Initializers#

Initializers (model weights and constants) are added with make_initializer. The builder deduplicates small integer arrays automatically: if the same value is added twice it returns the name of the first occurrence rather than creating a duplicate node.

<<<

import numpy as np
import onnx
from yobx.xbuilder import GraphBuilder

TFLOAT = onnx.TensorProto.FLOAT

g = GraphBuilder(18, ir_version=10)
g.make_tensor_input("X", TFLOAT, ("batch", 64))

# Add a weight matrix as an initializer
W = np.random.randn(64, 32).astype(np.float32)
w_name = g.make_initializer("W", W, source="example")

result = g.op.MatMul("X", w_name)
g.make_tensor_output(result, elem_type=TFLOAT, shape=("batch", 32), indexed=False)
model = g.to_onnx()
print("initializer name :", list(g.initializers_dict)[0])
print("initializer shape:", list(g.initializers_dict.values())[0].shape)

>>>

    initializer name : W
    initializer shape: (64, 32)

Shape and type tracking#

GraphBuilder inherits the full ShapeBuilder interface. Shapes and types are registered for every intermediate result as nodes are added, and are used during optimization and for populating value_info in the exported proto. See Expected API.

Dynamic shapes#

When some input dimensions are unknown at graph-construction time, they are represented as strings (e.g. "batch", "seq"). For graphs that are later exported for dynamic-shape inference with torch.export, the builder accepts a dynamic_shapes dictionary that maps input names to per-axis dimension objects (torch.export.Dim or WrapDim).

register_dynamic_objects_from_shape registers any string dimension names encountered in a shape so that they are tracked as symbolic dimensions.

<<<

import onnx
from yobx.xbuilder import GraphBuilder

TFLOAT = onnx.TensorProto.FLOAT

g = GraphBuilder(18, ir_version=10)
g.make_tensor_input("X", TFLOAT, ("batch", "seq", 64))
g.make_tensor_input("Y", TFLOAT, ("batch", "seq", 64))

# symbolic dimensions are tracked automatically once shapes are set
result = g.op.Add("X", "Y")
g.make_tensor_output(
    result, elem_type=TFLOAT, shape=("batch", "seq", 64), indexed=False
)
model = g.to_onnx()

out = model.graph.output[0]
dims = [
    d.dim_param if d.dim_param else d.dim_value for d in out.type.tensor_type.shape.dim
]
print("output shape:", dims)

>>>

    output shape: ['batch', 'seq', 64]

Optimizations#

to_onnx runs a sequence of optimization passes by default. The set of passes is controlled by OptimizationOptions.

Default passes (in order):

Pass

Effect

remove_unused

Remove nodes whose outputs are never consumed.

constant_folding

Evaluate operators such as Transpose, Cast, Reshape, Concat, Add, Mul, etc. when all inputs are constants and fold the result into an initializer.

remove_identity

Remove Identity nodes.

remove_duplicated_initializer

Merge identical constant initializers into a single tensor, removing redundant copies.

patterns

Apply user-supplied or built-in fusion patterns (e.g. "default" enables the default set of ONNX-to-ONNX rewrites).

order

Reorder nodes to reduce peak memory by moving each Shape / Size node immediately after the node that produces its input (controlled by OrderAlgorithm, default SHAPE).

<<<

import onnx
import onnx.helper as oh
from yobx.xbuilder import GraphBuilder, OptimizationOptions

TFLOAT = onnx.TensorProto.FLOAT

model = oh.make_model(
    oh.make_graph(
        [
            oh.make_node("Identity", ["X"], ["X2"]),
            oh.make_node("Relu", ["X2"], ["Z"]),
        ],
        "id_relu",
        [oh.make_tensor_value_info("X", TFLOAT, [None, 4])],
        [oh.make_tensor_value_info("Z", TFLOAT, [None, 4])],
    ),
    opset_imports=[oh.make_opsetid("", 18)],
    ir_version=10,
)

opts = OptimizationOptions(remove_identity=True)
g = GraphBuilder(model, optimization_options=opts)
optimized = g.to_onnx()
print("nodes before:", len(model.graph.node))
print("nodes after :", len(optimized.graph.node))

>>>

    nodes before: 2
    nodes after : 1

Optimization report#

Passing return_optimize_report=True to to_onnx makes the method return a (model, stats) tuple instead of just the model. stats is a list of dictionaries — one entry per optimization pass — that records how many nodes were added or removed and how long each pass took.

Key

Description

pattern

Name of the optimization pass (e.g. "remove_identity", "constant_folding", "TransposeTranspose" …).

added

Number of nodes added by this pass.

removed

Number of nodes removed by this pass.

time_in

Wall-clock time spent in this pass (seconds).

iteration

Iteration number (only for pattern-based passes).

match_index

Sequential index of the match within the iteration (pattern passes).

instances

Number of times the pattern was matched (pattern passes).

The list can be converted to a pandas.DataFrame for quick exploration:

<<<

import pandas
import onnx
import onnx.helper as oh
from yobx.xbuilder import GraphBuilder, OptimizationOptions

TFLOAT = onnx.TensorProto.FLOAT

model = oh.make_model(
    oh.make_graph(
        [
            oh.make_node("Identity", ["X"], ["X2"]),
            oh.make_node("Transpose", ["X2"], ["T"], perm=[1, 0]),
            oh.make_node("Transpose", ["T"], ["Z"], perm=[1, 0]),
        ],
        "demo",
        [oh.make_tensor_value_info("X", TFLOAT, [3, 4])],
        [oh.make_tensor_value_info("Z", TFLOAT, [3, 4])],
    ),
    opset_imports=[oh.make_opsetid("", 18)],
    ir_version=10,
)

opts = OptimizationOptions(patterns="default")
g = GraphBuilder(model, infer_shapes_options=True, optimization_options=opts)
optimized = g.to_onnx(return_optimize_report=True)

df = pandas.DataFrame(optimized.report.stats)
# keep only rows that have numeric added/removed counts
df["added"] = df["added"].fillna(0).astype(int)
df["removed"] = df["removed"].fillna(0).astype(int)
print(df[["pattern", "added", "removed", "time_in"]].to_string(index=False))
print(f"\nnodes before: {len(model.graph.node)}")
print(f"nodes after : {len(optimized.graph.node)}")

>>>

                                         pattern  added  removed  time_in
                        dynamic_dimension_naming      0        0 0.000043
                check_A-dynamic_dimension_naming      0        0 0.000028
                                 check_A-opt-sub      0        0 0.000025
                                 remove_identity      1        2 0.000106
                         check_remove_identity-0      0        0 0.000020
                                   remove_unused      0        0 0.000042
                           check_remove_unused-1      0        0 0.000018
                                constant_folding      0        0 0.000021
                apply_constant_folding_new_inits      0        0      NaN
                        check_constant_folding-2      0        0 0.000016
                                   remove_unused      0        0 0.000032
                           check_remove_unused-3      0        0 0.000017
                                        patterns      0        1 0.019755
                                check_pattern_00      0        0 0.000036
                 match_BatchNormalizationPattern      0        0 0.000019
         match_BatchNormalizationTrainingPattern      0        0 0.000013
                               match_CastPattern      0        0 0.000009
                           match_CastCastPattern      0        0 0.000009
                       match_ConcatGatherPattern      0        0 0.000009
                      match_ConcatReshapePattern      0        0 0.000013
                       match_ConvBiasNullPattern      0        0 0.000010
                            match_PadConvPattern      0        0 0.000008
                             match_ExpandPattern      0        0 0.000009
              match_ExpandUnsqueezeExpandPattern      0        0 0.000008
                               match_GeluPattern      0        0 0.000006
                           match_IdentityPattern      0        0 0.000078
                          match_LeakyReluPattern      0        0 0.003135
              match_MulUnsqueezeUnsqueezePattern      0        0 0.000017
                            match_ReshapePattern      0        0 0.000013
         match_ShapeBasedReshapeIsSqueezePattern      0        0 0.000011
             match_ShapeBasedStaticExpandPattern      0        0 0.000009
      match_ShapeBasedEditDistanceReshapePattern      0        0 0.000010
                 match_ShapeBasedIdentityPattern      0        0 0.000039
                 match_ShapedBasedReshapePattern      0        0 0.000008
             match_ShapeBasedSameChildrenPattern      0        0 0.000009
            match_ShapeBasedShapeShapeAddPattern      0        0 0.000008
                     match_ReshapeReshapePattern      0        0 0.000010
                       match_SameChildrenPattern      0        0 0.000019
              match_SameChildrenFromInputPattern      0        0 0.000019
        match_SoftmaxCrossEntropyLossCastPattern      0        0 0.009648
                         match_SqueezeAddPattern      0        0 0.000016
             match_SqueezeBinaryUnsqueezePattern      0        0 0.000011
                   match_SqueezeUnsqueezePattern      0        0 0.000009
                match_StaticConcatReshapePattern      0        0 0.000013
                  match_SwapExpandReshapePattern      0        0 0.000008
                match_SwapExpandUnsqueezePattern      0        0 0.000008
                          match_SwapUnaryPattern      0        0 0.000066
             match_SwapUnsqueezeTransposePattern      0        0 0.000028
                    match_TransposeGatherPattern      0        0 0.000011
          match_TransposeReshapeTransposePattern      0        0 0.000056
                 match_TransposeTransposePattern      0        0 0.000110
          match_UnsqueezeOrSqueezeReshapePattern      0        0 0.000013
                   match_UnsqueezeReshapePattern      0        0 0.000009
                 match_UnsqueezeUnsqueezePattern      0        0 0.000008
                  match_FunctionAttentionPattern      0        0 0.000011
               match_FunctionAttentionGQAPattern      0        0 0.000010
                         insert_and_remove_nodes      0        0 0.000501
                 apply_TransposeTransposePattern      1        2 0.000773
                               check_pattern_A10      0        0 0.000003
                               check_pattern_A20      0        0 0.000029
                         remove_duplicated_shape      0        0 0.000007
                               check_pattern_BD0      0        0 0.000015
                           remove_identity_nodes      0        0 0.000044
                               check_pattern_BI0      0        0 0.000015
                                   remove_unused      0        0 0.000032
                              check_pattern_BUS0      0        0 0.000013
                         build_graph_for_pattern      0        0 0.000031
                                     iteration_0      0        0 0.014834
                 match_BatchNormalizationPattern      0        0 0.000016
         match_BatchNormalizationTrainingPattern      0        0 0.000009
                               match_CastPattern      0        0 0.000008
                           match_CastCastPattern      0        0 0.000006
                       match_ConcatGatherPattern      0        0 0.000006
                      match_ConcatReshapePattern      0        0 0.000010
                       match_ConvBiasNullPattern      0        0 0.000008
                            match_PadConvPattern      0        0 0.000006
                             match_ExpandPattern      0        0 0.000008
              match_ExpandUnsqueezeExpandPattern      0        0 0.000005
                               match_GeluPattern      0        0 0.000002
                           match_IdentityPattern      0        0 0.000007
                          match_LeakyReluPattern      0        0 0.000021
              match_MulUnsqueezeUnsqueezePattern      0        0 0.000007
                            match_ReshapePattern      0        0 0.000007
         match_ShapeBasedReshapeIsSqueezePattern      0        0 0.000008
             match_ShapeBasedStaticExpandPattern      0        0 0.000006
      match_ShapeBasedEditDistanceReshapePattern      0        0 0.000007
                 match_ShapeBasedIdentityPattern      0        0 0.000006
                 match_ShapedBasedReshapePattern      0        0 0.000007
             match_ShapeBasedSameChildrenPattern      0        0 0.000007
            match_ShapeBasedShapeShapeAddPattern      0        0 0.000006
                     match_ReshapeReshapePattern      0        0 0.000007
                       match_SameChildrenPattern      0        0 0.000013
              match_SameChildrenFromInputPattern      0        0 0.000013
        match_SoftmaxCrossEntropyLossCastPattern      0        0 0.000019
                         match_SqueezeAddPattern      0        0 0.000007
             match_SqueezeBinaryUnsqueezePattern      0        0 0.000006
                   match_SqueezeUnsqueezePattern      0        0 0.000007
                match_StaticConcatReshapePattern      0        0 0.000006
                  match_SwapExpandReshapePattern      0        0 0.000005
                match_SwapExpandUnsqueezePattern      0        0 0.000005
                          match_SwapUnaryPattern      0        0 0.000006
             match_SwapUnsqueezeTransposePattern      0        0 0.000012
                    match_TransposeGatherPattern      0        0 0.000006
          match_TransposeReshapeTransposePattern      0        0 0.000006
                 match_TransposeTransposePattern      0        0 0.000005
          match_UnsqueezeOrSqueezeReshapePattern      0        0 0.000006
                   match_UnsqueezeReshapePattern      0        0 0.000005
                 match_UnsqueezeUnsqueezePattern      0        0 0.000005
                  match_FunctionAttentionPattern      0        0 0.000007
               match_FunctionAttentionGQAPattern      0        0 0.000007
                               check_pattern_A20      0        0 0.000022
                         remove_duplicated_shape      0        0 0.000005
                               check_pattern_BD0      0        0 0.000014
                           remove_identity_nodes      0        0 0.000043
                               check_pattern_BI0      0        0 0.000013
                                   remove_unused      0        0 0.000222
                              check_pattern_BUS0      0        0 0.000025
                         build_graph_for_pattern      0        0 0.000028
                                     iteration_1      0        0 0.000971
                 match_BatchNormalizationPattern      0        0 0.000011
         match_BatchNormalizationTrainingPattern      0        0 0.000007
         match_CastLayerNormalizationCastPattern      0        0 0.000009
                               match_CastPattern      0        0 0.000013
                     match_CastCastBinaryPattern      0        0 0.000011
                           match_CastCastPattern      0        0 0.000007
                         match_CastOpCastPattern      0        0 0.000009
                           match_ClipClipPattern      0        0 0.000014
                        match_ConcatEmptyPattern      0        0 0.000011
                       match_ConcatGatherPattern      0        0 0.000007
                      match_ConcatReshapePattern      0        0 0.000008
                   match_ConcatTwiceUnaryPattern      0        0 0.000009
              match_ConstantToInitializerPattern      0        0 0.000007
                       match_ConvBiasNullPattern      0        0 0.000007
                            match_PadConvPattern      0        0 0.000006
                            match_DropoutPattern      0        0 0.000007
                             match_ExpandPattern      0        0 0.000006
                    match_ExpandBroadcastPattern      0        0 0.000007
                         match_ExpandSwapPattern      0        0 0.000006
              match_ExpandUnsqueezeExpandPattern      0        0 0.000006
                       match_GathersSplitPattern      0        0 0.000007
                               match_GeluPattern      0        0 0.000003
                           match_IdentityPattern      0        0 0.000007
                 match_LayerNormalizationPattern      0        0 0.000007
            match_LayerNormalizationScalePattern      0        0 0.000006
                          match_LeakyReluPattern      0        0 0.000018
                            match_MaxReluPattern      0        0 0.000006
                    match_MulMulMulScalarPattern      0        0 0.000008
              match_MulUnsqueezeUnsqueezePattern      0        0 0.000006
                             match_NotNotPattern      0        0 0.000007
                           match_NotWherePattern      0        0 0.000006
                      match_ReduceArgTopKPattern      0        0 0.000008
                      match_ReduceReshapePattern      0        0 0.000008
                 match_ReduceSumNormalizePattern      0        0 0.000007
                            match_ReshapePattern      0        0 0.000007
               match_ReshapeMatMulReshapePattern      0        0 0.000013
                        match_Reshape2Of3Pattern      0        0 0.000007
               match_ReshapeReshapeBinaryPattern      0        0 0.000007
                      match_GemmTransposePattern      0        0 0.000006
                  match_MatMulReshape2Of3Pattern      0        0 0.000006
                       match_MulMulMatMulPattern      0        0 0.000006
         match_ShapeBasedReshapeIsSqueezePattern      0        0 0.000005
             match_ShapeBasedStaticExpandPattern      0        0 0.000005
             match_ShapeBasedConcatExpandPattern      0        0 0.000007
      match_ShapeBasedEditDistanceReshapePattern      0        0 0.000007
                 match_ShapeBasedIdentityPattern      0        0 0.000006
          match_ShapeBasedExpandBroadcastPattern      0        0 0.000008
    match_ShapeBasedExpandBroadcastMatMulPattern      0        0 0.000007
      match_ShapeBasedExpandCastWhereSwapPattern      0        0 0.000007
               match_ShapeBasedExpandSwapPattern      0        0 0.000006
              match_ShapeBasedMatMulToMulPattern      0        0 0.000007
                 match_ShapedBasedReshapePattern      0        0 0.000007
             match_ShapeBasedSameChildrenPattern      0        0 0.000006
            match_ShapeBasedShapeShapeAddPattern      0        0 0.000006
                     match_ReshapeReshapePattern      0        0 0.000005
                    match_RotaryEmbeddingPattern      0        0 0.000006
                       match_SameChildrenPattern      0        0 0.000013
              match_SameChildrenFromInputPattern      0        0 0.000013
                match_SequenceConstructAtPattern      0        0 0.000007
          match_SplitToSequenceSequenceAtPattern      0        0 0.000006
                         match_SliceSlicePattern      0        0 0.000006
                        match_SlicesSplitPattern      0        0 0.000007
        match_SoftmaxCrossEntropyLossCastPattern      0        0 0.000018
                        match_SplitConcatPattern      0        0 0.000006
                         match_SqueezeAddPattern      0        0 0.000005
             match_SqueezeBinaryUnsqueezePattern      0        0 0.000006
                   match_SqueezeUnsqueezePattern      0        0 0.000006
                match_StaticConcatReshapePattern      0        0 0.000005
                            match_Sub1MulPattern      0        0 0.000006
                  match_SwapExpandReshapePattern      0        0 0.000005
                match_SwapExpandUnsqueezePattern      0        0 0.000005
                 match_SwapRangeAddScalarPattern      0        0 0.000007
                          match_SwapUnaryPattern      0        0 0.000006
             match_SwapUnsqueezeTransposePattern      0        0 0.000006
                  match_SwitchOrderBinaryPattern      0        0 0.000012
            match_SwitchReshapeActivationPattern      0        0 0.000007
              match_TransposeEqualReshapePattern      0        0 0.000006
                    match_TransposeGatherPattern      0        0 0.000006
                    match_TransposeMatMulPattern      0        0 0.000007
             match_TransposeReshapeMatMulPattern      0        0 0.000007
          match_TransposeReshapeTransposePattern      0        0 0.000005
                 match_TransposeTransposePattern      0        0 0.000006
                     match_UnsqueezeEqualPattern      0        0 0.000007
          match_UnsqueezeOrSqueezeReshapePattern      0        0 0.000005
                   match_UnsqueezeReshapePattern      0        0 0.000005
                 match_UnsqueezeUnsqueezePattern      0        0 0.000005
                           match_WhereAddPattern      0        0 0.000007
                   match_RotaryConcatPartPattern      0        0 0.000007
                  match_FunctionAttentionPattern      0        0 0.000007
               match_FunctionAttentionGQAPattern      0        0 0.000007
                 match_FunctionCausalMaskPattern      0        0 0.000006
           match_FunctionCausalMaskMulAddPattern      0        0 0.000006
                match_FunctionCosSinCachePattern      0        0 0.000007
        match_FunctionHalfRotaryEmbeddingPattern      0        0 0.000006
                   match_RMSNormalizationPattern      0        0 0.000006
                match_RMSNormalizationMulPattern      0        0 0.000006
                               check_pattern_A20      0        0 0.000026
                         remove_duplicated_shape      0        0 0.000005
                               check_pattern_BD0      0        0 0.000014
                           remove_identity_nodes      0        0 0.000043
                               check_pattern_BI0      0        0 0.000013
                                   remove_unused      0        0 0.000029
                              check_pattern_BUS0      0        0 0.000011
                         build_graph_for_pattern      0        0 0.000024
                                     iteration_2      0        0 0.001395
                 match_BatchNormalizationPattern      0        0 0.000008
         match_BatchNormalizationTrainingPattern      0        0 0.000006
         match_CastLayerNormalizationCastPattern      0        0 0.000007
                               match_CastPattern      0        0 0.000006
                     match_CastCastBinaryPattern      0        0 0.000006
                           match_CastCastPattern      0        0 0.000005
                         match_CastOpCastPattern      0        0 0.000006
                           match_ClipClipPattern      0        0 0.000006
                        match_ConcatEmptyPattern      0        0 0.000006
                       match_ConcatGatherPattern      0        0 0.000006
                      match_ConcatReshapePattern      0        0 0.000007
                   match_ConcatTwiceUnaryPattern      0        0 0.000006
              match_ConstantToInitializerPattern      0        0 0.000005
                       match_ConvBiasNullPattern      0        0 0.000005
                            match_PadConvPattern      0        0 0.000005
                            match_DropoutPattern      0        0 0.000005
                             match_ExpandPattern      0        0 0.000005
                    match_ExpandBroadcastPattern      0        0 0.000006
                         match_ExpandSwapPattern      0        0 0.000005
              match_ExpandUnsqueezeExpandPattern      0        0 0.000005
                       match_GathersSplitPattern      0        0 0.000005
                               match_GeluPattern      0        0 0.000003
                           match_IdentityPattern      0        0 0.000006
                 match_LayerNormalizationPattern      0        0 0.000005
            match_LayerNormalizationScalePattern      0        0 0.000005
                          match_LeakyReluPattern      0        0 0.000014
                            match_MaxReluPattern      0        0 0.000006
                    match_MulMulMulScalarPattern      0        0 0.000005
              match_MulUnsqueezeUnsqueezePattern      0        0 0.000004
                             match_NotNotPattern      0        0 0.000005
                           match_NotWherePattern      0        0 0.000006
                      match_ReduceArgTopKPattern      0        0 0.000006
                      match_ReduceReshapePattern      0        0 0.000006
                 match_ReduceSumNormalizePattern      0        0 0.000006
                            match_ReshapePattern      0        0 0.000006
               match_ReshapeMatMulReshapePattern      0        0 0.000005
                        match_Reshape2Of3Pattern      0        0 0.000010
               match_ReshapeReshapeBinaryPattern      0        0 0.000005
                      match_GemmTransposePattern      0        0 0.000005
                  match_MatMulReshape2Of3Pattern      0        0 0.000006
                       match_MulMulMatMulPattern      0        0 0.000010
         match_ShapeBasedReshapeIsSqueezePattern      0        0 0.000005
             match_ShapeBasedStaticExpandPattern      0        0 0.000005
             match_ShapeBasedConcatExpandPattern      0        0 0.000006
      match_ShapeBasedEditDistanceReshapePattern      0        0 0.000006
                 match_ShapeBasedIdentityPattern      0        0 0.000005
          match_ShapeBasedExpandBroadcastPattern      0        0 0.000005
    match_ShapeBasedExpandBroadcastMatMulPattern      0        0 0.000005
      match_ShapeBasedExpandCastWhereSwapPattern      0        0 0.000005
               match_ShapeBasedExpandSwapPattern      0        0 0.000005
              match_ShapeBasedMatMulToMulPattern      0        0 0.000005
                 match_ShapedBasedReshapePattern      0        0 0.000006
             match_ShapeBasedSameChildrenPattern      0        0 0.000006
            match_ShapeBasedShapeShapeAddPattern      0        0 0.000004
                     match_ReshapeReshapePattern      0        0 0.000005
                    match_RotaryEmbeddingPattern      0        0 0.000005
                       match_SameChildrenPattern      0        0 0.000009
              match_SameChildrenFromInputPattern      0        0 0.000010
                match_SequenceConstructAtPattern      0        0 0.000005
          match_SplitToSequenceSequenceAtPattern      0        0 0.000005
                         match_SliceSlicePattern      0        0 0.000005
                        match_SlicesSplitPattern      0        0 0.000005
        match_SoftmaxCrossEntropyLossCastPattern      0        0 0.000016
                        match_SplitConcatPattern      0        0 0.000005
                         match_SqueezeAddPattern      0        0 0.000006
             match_SqueezeBinaryUnsqueezePattern      0        0 0.000005
                   match_SqueezeUnsqueezePattern      0        0 0.000006
                match_StaticConcatReshapePattern      0        0 0.000005
                            match_Sub1MulPattern      0        0 0.000005
                  match_SwapExpandReshapePattern      0        0 0.000005
                match_SwapExpandUnsqueezePattern      0        0 0.000004
                 match_SwapRangeAddScalarPattern      0        0 0.000005
                          match_SwapUnaryPattern      0        0 0.000005
             match_SwapUnsqueezeTransposePattern      0        0 0.000005
                  match_SwitchOrderBinaryPattern      0        0 0.000005
            match_SwitchReshapeActivationPattern      0        0 0.000005
              match_TransposeEqualReshapePattern      0        0 0.000004
                    match_TransposeGatherPattern      0        0 0.000005
                    match_TransposeMatMulPattern      0        0 0.000005
             match_TransposeReshapeMatMulPattern      0        0 0.000005
          match_TransposeReshapeTransposePattern      0        0 0.000005
                 match_TransposeTransposePattern      0        0 0.000004
                     match_UnsqueezeEqualPattern      0        0 0.000044
          match_UnsqueezeOrSqueezeReshapePattern      0        0 0.000013
                   match_UnsqueezeReshapePattern      0        0 0.000009
                 match_UnsqueezeUnsqueezePattern      0        0 0.000007
                           match_WhereAddPattern      0        0 0.000007
                   match_RotaryConcatPartPattern      0        0 0.000006
                  match_FunctionAttentionPattern      0        0 0.000008
               match_FunctionAttentionGQAPattern      0        0 0.000007
                 match_FunctionCausalMaskPattern      0        0 0.000006
           match_FunctionCausalMaskMulAddPattern      0        0 0.000005
                match_FunctionCosSinCachePattern      0        0 0.000005
        match_FunctionHalfRotaryEmbeddingPattern      0        0 0.000005
                   match_RMSNormalizationPattern      0        0 0.000005
                match_RMSNormalizationMulPattern      0        0 0.000005
                       match_AttentionGQAPattern      0        0 0.000006
                               check_pattern_A20      0        0 0.000023
                         remove_duplicated_shape      0        0 0.000004
                               check_pattern_BD0      0        0 0.000019
                           remove_identity_nodes      0        0 0.000038
                               check_pattern_BI0      0        0 0.000011
                                   remove_unused      0        0 0.000025
                              check_pattern_BUS0      0        0 0.000011
                         build_graph_for_pattern      0        0 0.000022
                                     iteration_3      0        0 0.001183
                 match_BatchNormalizationPattern      0        0 0.000007
         match_BatchNormalizationTrainingPattern      0        0 0.000005
         match_CastLayerNormalizationCastPattern      0        0 0.000006
                               match_CastPattern      0        0 0.000005
                     match_CastCastBinaryPattern      0        0 0.000005
                           match_CastCastPattern      0        0 0.000004
                         match_CastOpCastPattern      0        0 0.000006
                           match_ClipClipPattern      0        0 0.000004
                        match_ConcatEmptyPattern      0        0 0.000005
                       match_ConcatGatherPattern      0        0 0.000005
                      match_ConcatReshapePattern      0        0 0.000007
                   match_ConcatTwiceUnaryPattern      0        0 0.000005
              match_ConstantToInitializerPattern      0        0 0.000005
                       match_ConvBiasNullPattern      0        0 0.000005
                            match_PadConvPattern      0        0 0.000005
                            match_DropoutPattern      0        0 0.000004
                             match_ExpandPattern      0        0 0.000004
                    match_ExpandBroadcastPattern      0        0 0.000005
                         match_ExpandSwapPattern      0        0 0.000005
              match_ExpandUnsqueezeExpandPattern      0        0 0.000005
                       match_GathersSplitPattern      0        0 0.000004
                               match_GeluPattern      0        0 0.000003
                           match_IdentityPattern      0        0 0.000006
                 match_LayerNormalizationPattern      0        0 0.000005
            match_LayerNormalizationScalePattern      0        0 0.000004
                          match_LeakyReluPattern      0        0 0.000013
                            match_MaxReluPattern      0        0 0.000005
                    match_MulMulMulScalarPattern      0        0 0.000004
              match_MulUnsqueezeUnsqueezePattern      0        0 0.000004
                             match_NotNotPattern      0        0 0.000005
                           match_NotWherePattern      0        0 0.000005
                      match_ReduceArgTopKPattern      0        0 0.000005
                      match_ReduceReshapePattern      0        0 0.000005
                 match_ReduceSumNormalizePattern      0        0 0.000005
                            match_ReshapePattern      0        0 0.000005
               match_ReshapeMatMulReshapePattern      0        0 0.000010
                        match_Reshape2Of3Pattern      0        0 0.000005
               match_ReshapeReshapeBinaryPattern      0        0 0.000005
                          match_MatMulAddPattern      0        0 0.000006
                      match_GemmTransposePattern      0        0 0.000005
                  match_MatMulReshape2Of3Pattern      0        0 0.000005
                       match_MulMulMatMulPattern      0        0 0.000005
         match_ShapeBasedReshapeIsSqueezePattern      0        0 0.000004
             match_ShapeBasedStaticExpandPattern      0        0 0.000005
             match_ShapeBasedConcatExpandPattern      0        0 0.000005
      match_ShapeBasedEditDistanceReshapePattern      0        0 0.000005
                 match_ShapeBasedIdentityPattern      0        0 0.000005
          match_ShapeBasedExpandBroadcastPattern      0        0 0.000004
    match_ShapeBasedExpandBroadcastMatMulPattern      0        0 0.000004
      match_ShapeBasedExpandCastWhereSwapPattern      0        0 0.000005
               match_ShapeBasedExpandSwapPattern      0        0 0.000004
              match_ShapeBasedMatMulToMulPattern      0        0 0.000005
                 match_ShapedBasedReshapePattern      0        0 0.000006
             match_ShapeBasedSameChildrenPattern      0        0 0.000005
            match_ShapeBasedShapeShapeAddPattern      0        0 0.000004
                     match_ReshapeReshapePattern      0        0 0.000005
                    match_RotaryEmbeddingPattern      0        0 0.000004
                       match_SameChildrenPattern      0        0 0.000010
              match_SameChildrenFromInputPattern      0        0 0.000010
                match_SequenceConstructAtPattern      0        0 0.000006
          match_SplitToSequenceSequenceAtPattern      0        0 0.000006
                         match_SliceSlicePattern      0        0 0.000006
                        match_SlicesSplitPattern      0        0 0.000006
        match_SoftmaxCrossEntropyLossCastPattern      0        0 0.000012
                        match_SplitConcatPattern      0        0 0.000006
                         match_SqueezeAddPattern      0        0 0.000006
             match_SqueezeBinaryUnsqueezePattern      0        0 0.000004
                   match_SqueezeUnsqueezePattern      0        0 0.000005
                match_StaticConcatReshapePattern      0        0 0.000005
                            match_Sub1MulPattern      0        0 0.000004
                  match_SwapExpandReshapePattern      0        0 0.000004
                match_SwapExpandUnsqueezePattern      0        0 0.000004
                 match_SwapRangeAddScalarPattern      0        0 0.000005
                          match_SwapUnaryPattern      0        0 0.000005
             match_SwapUnsqueezeTransposePattern      0        0 0.000004
                  match_SwitchOrderBinaryPattern      0        0 0.000005
            match_SwitchReshapeActivationPattern      0        0 0.000005
              match_TransposeEqualReshapePattern      0        0 0.000004
                    match_TransposeGatherPattern      0        0 0.000004
                    match_TransposeMatMulPattern      0        0 0.000004
             match_TransposeReshapeMatMulPattern      0        0 0.000004
          match_TransposeReshapeTransposePattern      0        0 0.000004
                 match_TransposeTransposePattern      0        0 0.000004
                     match_UnsqueezeEqualPattern      0        0 0.000004
          match_UnsqueezeOrSqueezeReshapePattern      0        0 0.000005
                   match_UnsqueezeReshapePattern      0        0 0.000005
                 match_UnsqueezeUnsqueezePattern      0        0 0.000004
                           match_WhereAddPattern      0        0 0.000004
                   match_RotaryConcatPartPattern      0        0 0.000004
                  match_FunctionAttentionPattern      0        0 0.000005
               match_FunctionAttentionGQAPattern      0        0 0.000006
                 match_FunctionCausalMaskPattern      0        0 0.000004
           match_FunctionCausalMaskMulAddPattern      0        0 0.000005
                match_FunctionCosSinCachePattern      0        0 0.000009
        match_FunctionHalfRotaryEmbeddingPattern      0        0 0.000004
                   match_RMSNormalizationPattern      0        0 0.000004
                match_RMSNormalizationMulPattern      0        0 0.000004
                       match_AttentionGQAPattern      0        0 0.000004
                               check_pattern_A20      0        0 0.000023
                         remove_duplicated_shape      0        0 0.000004
                               check_pattern_BD0      0        0 0.000011
                           remove_identity_nodes      0        0 0.000035
                               check_pattern_BI0      0        0 0.000009
                                   remove_unused      0        0 0.000024
                              check_pattern_BUS0      0        0 0.000010
                         build_graph_for_pattern      0        0 0.000021
                                check_patterns-4      0        0 0.000024
                                   remove_unused      0        0 0.000022
                           check_remove_unused-5      0        0 0.000011
                                 remove_identity      0        0 0.000029
                         check_remove_identity-6      0        0 0.000010
                                constant_folding      0        0 0.000017
                apply_constant_folding_new_inits      0        0      NaN
                        check_constant_folding-7      0        0 0.000010
                                   remove_unused      0        0 0.000019
                           check_remove_unused-8      0        0 0.000010
                   remove_duplicated_initializer      0        0 0.000005
           check_remove_duplicated_initializer-9      0        0 0.000010
                                 remove_identity      0        0 0.000036
                        check_remove_identity-10      0        0 0.000010
                                   remove_unused      0        0 0.000016
                          check_remove_unused-11      0        0 0.000009
                                           order      0        0 0.000094
                                    check_orderA      0        0 0.000015
                                    check_orderL      0        0 0.000009
                                     shape_order      0        0 0.000041
                                           order      0        0      NaN
                                  check_order-12      0        0 0.000011
                                    optimization      0        2 0.020585
    
    nodes before: 3
    nodes after : 1

The report can be aggregated by pass name:

<<<

import pandas
import onnx
import onnx.helper as oh
from yobx.xbuilder import GraphBuilder, OptimizationOptions

TFLOAT = onnx.TensorProto.FLOAT

model = oh.make_model(
    oh.make_graph(
        [
            oh.make_node("Identity", ["X"], ["X2"]),
            oh.make_node("Transpose", ["X2"], ["T"], perm=[1, 0]),
            oh.make_node("Transpose", ["T"], ["Z"], perm=[1, 0]),
        ],
        "demo",
        [oh.make_tensor_value_info("X", TFLOAT, [3, 4])],
        [oh.make_tensor_value_info("Z", TFLOAT, [3, 4])],
    ),
    opset_imports=[oh.make_opsetid("", 18)],
    ir_version=10,
)

opts = OptimizationOptions(patterns="default")
g = GraphBuilder(model, infer_shapes_options=True, optimization_options=opts)
art = g.to_onnx(return_optimize_report=True)

df = pandas.DataFrame(art.report.stats)
for c in ["added", "removed"]:
    df[c] = df[c].fillna(0).astype(int)
agg = df.groupby("pattern")[["added", "removed", "time_in"]].sum()
agg = agg[(agg["added"] > 0) | (agg["removed"] > 0)].sort_values(
    "removed", ascending=False
)
print(agg.to_string())

>>>

                                     added  removed   time_in
    pattern                                                  
    apply_TransposeTransposePattern      1        2  0.000385
    optimization                         0        2  0.018082
    remove_identity                      1        2  0.000239
    patterns                             0        1  0.017217

Local functions#

A sub-graph can be exported as a reusable ONNX local function (a FunctionProto) by passing a FunctionOptions instance to to_onnx.

<<<

import onnx
from yobx.xbuilder import GraphBuilder, FunctionOptions

TFLOAT = onnx.TensorProto.FLOAT

g = GraphBuilder(18, ir_version=10, as_function=True)
g.make_tensor_input("X", TFLOAT, ("batch", 64))
r = g.op.Relu("X")
g.make_tensor_output(r, indexed=False)

func = g.to_onnx(
    function_options=FunctionOptions(
        export_as_function=True,
        name="MyRelu",
        domain="my.domain",
    ),
    inline=False,
)
proto = func.proto
print(type(proto).__name__)
print("function name  :", proto.name)
print("function domain:", proto.domain)

>>>

    FunctionProto
    function name  : MyRelu
    function domain: my.domain

Debugging GraphBuilder with Environment Variables#

GraphBuilder respects several environment variables that help narrow down construction or optimization problems:

Environment variable

Effect

ONNXSTOP=<name>

Raises an exception the moment result <name> is created.

ONNXSTOPSHAPE=<name>

Raises an exception the moment result <name> receives a shape.

ONNXSTOPTYPE=<name>

Raises an exception the moment result <name> receives a type.

ONNXSTOPOUTPUT=<name>

Raises an exception the moment a node produces output <name>.

ONNXSTOPVALUESHAPE=<name>

Prints extra information for shape-as-value tracking (e.g. inputs to Reshape).

ONNXCST=1

Prints which constant is being evaluated.

ONNXFUNC=1

Prints details when nodes from a local function domain are added.

ONNXSHAPECOMPUTE=1

Raises an exception when a shape is missing for a result that should have one.

NULLSHAPE=1

Raises an exception as soon as a null/empty shape is encountered.

ONNXDYNDIM=<name>

Prints a message every time dynamic dimension <name> is used.

PRINTNAME=<name>

Prints a message every time a node producing <name> is added.

In addition, get_debug_msg returns a detailed text dump of the builder’s internal state (known shapes, types, ranks, constants, and node list) which can be printed or logged whenever an assertion fails.

pretty_text returns a human-readable representation of the whole graph (inputs, initializers, nodes, outputs) and is useful for quick visual inspection:

<<<

import onnx
import onnx.helper as oh
from yobx.xbuilder import GraphBuilder

TFLOAT = onnx.TensorProto.FLOAT

model = oh.make_model(
    oh.make_graph(
        [
            oh.make_node("Add", ["X", "Y"], ["T"]),
            oh.make_node("Relu", ["T"], ["Z"]),
        ],
        "add_relu",
        [
            oh.make_tensor_value_info("X", TFLOAT, ["batch", 4]),
            oh.make_tensor_value_info("Y", TFLOAT, ["batch", 4]),
        ],
        [oh.make_tensor_value_info("Z", TFLOAT, ["batch", 4])],
    ),
    opset_imports=[oh.make_opsetid("", 18)],
    ir_version=10,
)

g = GraphBuilder(model)
print(g.pretty_text())

>>>

    
    dyn---: batch -> WrapSym(batch)
    dynrev: batch -> [('batch', SymInt(batch))]
    dynsrc: batch -> [{batch:('input_name', 'X'), batch:('axis', 0)}, {batch:('input_name', 'Y'), batch:('axis', 0)}, {batch:('input_name', 'Z'), batch:('axis', 0)}]
    opset: : 18
    input:: X                                                                       |T1: batch x 4
    input:: Y                                                                       |T1: batch x 4
    Add: X, Y -> T                                                                  |T1: batch x 4
    Relu: T -> Z                                                                    |T1: batch x 4
    output:: Z                                                                      |T1: batch x 4