sglang_v0.5.2/flashinfer_0.3.1/3rdparty/cutlass/test/unit/conv/device
hailin 06e45b5ff9 local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
..
CMakeLists.txt local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_dgrad_implicit_gemm_cf32nhwc_cf32nhwc_cf32nhwc_simt_f32_sm50.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_dgrad_implicit_gemm_cf32nhwc_cf32nhwc_cf32nhwc_simt_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_dgrad_implicit_gemm_f16nhwc_f16nhwc_f16nhwc_tensor_op_f16_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_dgrad_implicit_gemm_f16nhwc_f16nhwc_f32nhwc_tensor_op_f32_sm70.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_dgrad_implicit_gemm_f16nhwc_f16nhwc_f32nhwc_tensor_op_f32_sm75.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_dgrad_implicit_gemm_f16nhwc_f16nhwc_f32nhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_dgrad_implicit_gemm_f32nhwc_f32nhwc_f32nhwc_simt_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_dgrad_implicit_gemm_tf32nhwc_tf32nhwc_f32nhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_few_channels_f16nhwc_f16nhwc_f16nhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_fixed_channels_f16nhwc_f16nhwc_f16nhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_cf32nhwc_cf32nhwc_cf32nhwc_simt_f32_sm50.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_cf32nhwc_cf32nhwc_cf32nhwc_simt_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_f8nhwc_f8nhwc_f8nhwc_tensor_op_f16_sm89.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_f8nhwc_f8nhwc_f8nhwc_tensor_op_f32_sm89.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_f16nhwc_f16nhwc_f16nhwc_simt_f16_sm60.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_f16nhwc_f16nhwc_f16nhwc_tensor_op_f16_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_f16nhwc_f16nhwc_f16nhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_f16nhwc_f16nhwc_f32nhwc_tensor_op_f32_sm70.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_f16nhwc_f16nhwc_f32nhwc_tensor_op_f32_sm75.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_f16nhwc_f16nhwc_f32nhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_f32nhwc_f32nhwc_f32nhwc_simt_f32_sm50.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_f32nhwc_f32nhwc_f32nhwc_simt_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_qf32nhwc_qf32nhwc_qf32nhwc_simt_f32_sm50.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_s4ncxhwx_s4cxrskx_s4ncxhwx_tensor_op_s32_sm75.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_s4ncxhwx_s4cxrskx_s4ncxhwx_tensor_op_s32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_s4nhwc_s4nhwc_s32nhwc_tensor_op_s32_sm75.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_s4nhwc_s4nhwc_s32nhwc_tensor_op_s32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_s8ncxhwx_s8cxrskx_s8ncxhwx_tensor_op_s32_sm75.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_s8ncxhwx_s8cxrskx_s8ncxhwx_tensor_op_s32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_s8nhwc_s8nhwc_s32nhwc_tensor_op_s32_sm75.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_s8nhwc_s8nhwc_s32nhwc_tensor_op_s32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_implicit_gemm_tf32nhwc_tf32nhwc_f32nhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_with_broadcast_simt_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_with_broadcast_sm70.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_with_broadcast_sm75.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_fprop_with_reduction_sm75.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_problems.h local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_strided_dgrad_implicit_gemm_f16nhwc_f16nhwc_f32nhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_strided_dgrad_implicit_gemm_swizzling4_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_strided_dgrad_implicit_gemm_tf32nhwc_tf32nhwc_f32nhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_testbed.h local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_testbed_interleaved.h local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_wgrad_implicit_gemm_cf32nhwc_cf32nhwc_cf32nhwc_simt_f32_sm50.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_wgrad_implicit_gemm_cf32nhwc_cf32nhwc_cf32nhwc_simt_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_wgrad_implicit_gemm_f16nhwc_f16nhwc_f16nhwc_tensor_op_f16_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_wgrad_implicit_gemm_f16nhwc_f16nhwc_f32nhwc_tensor_op_f32_sm70.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_wgrad_implicit_gemm_f16nhwc_f16nhwc_f32nhwc_tensor_op_f32_sm75.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_wgrad_implicit_gemm_f16nhwc_f16nhwc_f32nhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_wgrad_implicit_gemm_f32nhwc_f32nhwc_f32nhwc_simt_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_wgrad_implicit_gemm_tf32nhwc_tf32nhwc_f32nhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_with_absmax_testbed.h local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_with_broadcast_testbed.h local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv2d_with_reduction_testbed.h local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_dgrad_implicit_gemm_f16ndhwc_f16ndhwc_f32ndhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_dgrad_implicit_gemm_f32ndhwc_f32ndhwc_f32ndhwc_simt_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_dgrad_implicit_gemm_tf32ndhwc_tf32ndhwc_f32ndhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_fprop_implicit_gemm_f16ndhwc_f16ndhwc_f32ndhwc_tensor_op_f32_sm75.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_fprop_implicit_gemm_f16ndhwc_f16ndhwc_f32ndhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_fprop_implicit_gemm_f32ndhwc_f32ndhwc_f32ndhwc_simt_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_fprop_implicit_gemm_tf32ndhwc_tf32ndhwc_f32ndhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_fprop_with_broadcast_simt_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_problems.h local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_testbed.h local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_wgrad_implicit_gemm_f16ndhwc_f16ndhwc_f32ndhwc_tensor_op_f32_sm75.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_wgrad_implicit_gemm_f16ndhwc_f16ndhwc_f32ndhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_wgrad_implicit_gemm_f32ndhwc_f32ndhwc_f32ndhwc_simt_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_wgrad_implicit_gemm_tf32ndhwc_tf32ndhwc_f32ndhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
conv3d_with_broadcast_testbed.h local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
deconv2d_implicit_gemm_f32nhwc_f32nhwc_f32nhwc_simt_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
deconv2d_with_broadcast_simt_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
deconv3d_implicit_gemm_f32ndhwc_f32ndhwc_f32ndhwc_simt_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
deconv3d_with_broadcast_simt_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
depthwise_conv2d_direct_conv_testbed.h local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
depthwise_conv2d_fprop_direct_conv_f16nhwc_f16nhwc_f16nhwc_simt_f16_sm60.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
depthwise_conv2d_fprop_direct_conv_fixed_stride_dilation_f16nhwc_f16nhwc_f16nhwc_simt_f16_sm60.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
depthwise_conv2d_fprop_implicit_gemm_f16nhwc_f16nhwc_f16nhwc_simt_f16_sm60.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00
group_conv2d_fprop_implicit_gemm_f16nhwc_f16nhwc_f16nhwc_tensor_op_f32_sm80.cu local source code for flashinfer_0.3.1 && torch vision_0.22.1 2025-09-20 12:33:58 +08:00