onnx_extended.ortops.tutorial.cuda#

get_ort_ext_libs#

onnx_extended.ortops.tutorial.cuda.get_ort_ext_libs() → List[str]

Returns the list of libraries that implement simple new onnxruntime kernels for the CUDAExecutionProvider.
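The returned paths are typically handed to onnxruntime before a session is created. The sketch below shows one way to do that, assuming onnxruntime's `SessionOptions.register_custom_ops_library` is used for registration. The helper names `register_custom_op_libs` and `make_cuda_session` are hypothetical and not part of onnx_extended.

```python
from typing import Iterable


def register_custom_op_libs(session_options, lib_paths: Iterable[str]):
    """Register every shared library in lib_paths on session_options."""
    for path in lib_paths:
        # register_custom_ops_library takes a single library path per call.
        session_options.register_custom_ops_library(path)
    return session_options


def make_cuda_session(model_path: str):
    """Hypothetical helper: open model_path with the tutorial CUDA kernels loaded."""
    # Imported lazily so this module can be loaded on a machine
    # without onnxruntime or onnx_extended installed.
    from onnxruntime import InferenceSession, SessionOptions
    from onnx_extended.ortops.tutorial.cuda import get_ort_ext_libs

    options = register_custom_op_libs(SessionOptions(), get_ort_ext_libs())
    return InferenceSession(
        model_path,
        sess_options=options,
        providers=["CUDAExecutionProvider"],
    )
```

Calling `make_cuda_session` requires a CUDA-enabled build of both onnxruntime and onnx_extended; `register_custom_op_libs` on its own only needs an object exposing `register_custom_ops_library`.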

List of implemented kernels

<<<

from onnx_extended.ortops.tutorial.cuda import documentation

print("\n".join(documentation()))

>>>

onnx_extented.ortops.tutorial.cuda.CustomGemm#

Calls a CUDA library to compute the Gemm operation \alpha A B + \beta C.

Provider

CUDAExecutionProvider

Inputs

  • A (T): tensor of type T

  • B (T): tensor of type T

  • C (T): tensor of type T

  • D (T): tensor of type T

  • E (T): tensor of type T

Outputs

  • Z (T): \alpha A B + \beta C

Constraints

  • T: float, float16, bfloat16
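As a way to check results from the kernel, the formula \alpha A B + \beta C can be reproduced on the CPU. The following is a minimal pure-Python sketch, not the kernel's implementation. It assumes C already has the shape of A B (no broadcasting), ignores inputs D and E because their role is not described here, and uses default values of 1.0 for `alpha` and `beta` purely for illustration. The name `gemm_reference` is hypothetical.

```python
from typing import List

Matrix = List[List[float]]


def gemm_reference(
    a: Matrix, b: Matrix, c: Matrix, alpha: float = 1.0, beta: float = 1.0
) -> Matrix:
    """Return alpha * (a @ b) + beta * c for matrices stored as nested lists."""
    n_rows, n_inner, n_cols = len(a), len(b), len(b[0])
    if any(len(row) != n_inner for row in a):
        raise ValueError("columns of A must match rows of B")
    # Assumption: C has exactly the shape of A @ B (no broadcasting).
    if len(c) != n_rows or any(len(row) != n_cols for row in c):
        raise ValueError("C must have the same shape as A @ B")
    return [
        [
            alpha * sum(a[i][k] * b[k][j] for k in range(n_inner)) + beta * c[i][j]
            for j in range(n_cols)
        ]
        for i in range(n_rows)
    ]


a = [[1.0, 2.0], [3.0, 4.0]]
b = [[5.0, 6.0], [7.0, 8.0]]
c = [[1.0, 1.0], [1.0, 1.0]]
z = gemm_reference(a, b, c, alpha=2.0, beta=3.0)
print(z)  # [[41.0, 47.0], [89.0, 103.0]]
```

When comparing against float16 or bfloat16 outputs, a tolerance is needed because this reference accumulates in Python's double precision.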