onnx_extended.ortops.tutorial.cuda

get_ort_ext_libs

onnx_extended.ortops.tutorial.cuda.get_ort_ext_libs() List[str][source]

Returns the list of libraries implementing new simple onnxruntime kernels implemented for the CUDAExecutionProvider.

List of implemented kernels

<<<

from onnx_extended.ortops.tutorial.cuda import documentation

print("\n".join(documentation()))

>>>

onnx_extended.ortops.tutorial.cuda.CustomGemm

It calls CUDA library for Gemm \alpha A B + \beta C.

Provider

CUDAExecutionProvider

Inputs

  • A (T): tensor of type T

  • B (T): tensor of type T

  • C (T): tensor of type T

  • D (T): tensor of type T

  • E (T): tensor of type T

Outputs

  • Z (T): \alpha A B + \beta C

Constraints

  • T: float, float16, bfloat16