46 lines
728 B
ReStructuredText
46 lines
728 B
ReStructuredText
.. _apigemm:
|
|
|
|
flashinfer.gemm
|
|
===============
|
|
|
|
.. currentmodule:: flashinfer.gemm
|
|
|
|
This module provides a set of GEMM operations.
|
|
|
|
FP4 GEMM
|
|
--------
|
|
|
|
.. autosummary::
|
|
:toctree: ../generated
|
|
|
|
mm_fp4
|
|
|
|
FP8 GEMM
|
|
--------
|
|
|
|
.. autosummary::
|
|
:toctree: ../generated
|
|
|
|
bmm_fp8
|
|
gemm_fp8_nt_groupwise
|
|
group_gemm_fp8_nt_groupwise
|
|
group_deepgemm_fp8_nt_groupwise
|
|
batch_deepgemm_fp8_nt_groupwise
|
|
|
|
Mixed Precision GEMM (fp8 x fp4)
|
|
--------------------------------
|
|
|
|
.. autosummary::
|
|
:toctree: ../generated
|
|
|
|
group_gemm_mxfp4_nt_groupwise
|
|
|
|
Grouped GEMM (Ampere/Hopper)
|
|
----------------------------
|
|
|
|
.. autoclass:: SegmentGEMMWrapper
|
|
:members:
|
|
:exclude-members: forward
|
|
|
|
.. automethod:: __init__
|