Fix CUDA build with contrib ops disabled by Copilot · Pull Request #28554 · microsoft/onnxruntime

Copilot · 2026-05-19T03:52:07Z

Description

The CUDA Attention kernel (core/providers/cuda/llm/attention.cc) depends on contrib_ops internals (flash attention, memory efficient attention, unfused attention helpers) but was compiled unconditionally. When building with --disable_contrib_ops, GetAttentionKernelOptions() is unavailable (guarded by #ifndef DISABLE_CONTRIB_OPS in cuda_kernel.h), causing a compile error.
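The failure mode can be reproduced in miniature. The sketch below is only an illustration: `KernelBase`, `AttentionLike`, `DescribeOptions`, and the returned strings are hypothetical stand-ins, not onnxruntime's classes or values; only the `DISABLE_CONTRIB_OPS` macro and the `GetAttentionKernelOptions` name come from this PR. A member declared under `#ifndef DISABLE_CONTRIB_OPS` does not exist when the macro is defined, so every caller has to be guarded (or excluded from the build) the same way.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for a base class whose helper is declared only when
// contrib ops are enabled (the PR describes such a guard in cuda_kernel.h).
struct KernelBase {
#ifndef DISABLE_CONTRIB_OPS
  // Placeholder return value, invented for this sketch.
  std::string GetAttentionKernelOptions() const { return "flash,efficient,unfused"; }
#endif
};

// Hypothetical kernel that depends on the guarded helper.
struct AttentionLike : KernelBase {
  std::string DescribeOptions() const {
#ifndef DISABLE_CONTRIB_OPS
    std::string opts = GetAttentionKernelOptions();
    assert(!opts.empty());  // sanity check on the placeholder value
    return opts;
#else
    // If the call above were compiled unconditionally, this configuration
    // would fail with "is not a member of" because the helper is gone.
    return "unavailable";
#endif
  }
};
```

Note that this sketch guards the call site, whereas the PR takes the coarser route of leaving the whole attention.cc translation unit out of the build.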

Changes:

cmake/onnxruntime_providers_cuda.cmake — Exclude attention.h/attention.cc from the CUDA provider source list when contrib ops are disabled
cuda_execution_provider.cc — Guard Attention kernel forward declarations and BuildKernelCreateInfo registrations (opset 23 and 24) with #ifndef DISABLE_CONTRIB_OPS

The CPU EP still provides the ONNX domain Attention kernel as fallback.
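The registration guard described above can be sketched as follows. This is a simplified, assumed model of a kernel table, not the real `RegisterCudaKernels`: `KernelInfo`, `MakeUnsqueeze`, `MakeAttention`, and `RegisterKernels` are hypothetical names chosen for illustration. The point is that both the declaration and the table entries sit behind the same `#ifndef DISABLE_CONTRIB_OPS`, so the table never references a symbol that was not compiled.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical, simplified stand-in for a kernel-creation record.
struct KernelInfo {
  std::string op;
  int since_version;
};

// Always-available kernel factory.
template <int Opset>
KernelInfo MakeUnsqueeze() { return {"Unsqueeze", Opset}; }

#ifndef DISABLE_CONTRIB_OPS
// Declared only when contrib ops are enabled, like the guarded
// Attention forward declarations in the PR.
template <int Opset>
KernelInfo MakeAttention() { return {"Attention", Opset}; }
#endif

using BuildFn = KernelInfo (*)();

std::vector<KernelInfo> RegisterKernels() {
  static const BuildFn table[] = {
      MakeUnsqueeze<23>,
#ifndef DISABLE_CONTRIB_OPS
      // These entries must share the guard of the declarations above.
      MakeAttention<23>,
      MakeAttention<24>,
#endif
  };
  std::vector<KernelInfo> out;
  for (BuildFn fn : table) out.push_back(fn());
  assert(!out.empty());  // the unguarded entry is always present
  return out;
}
```

In a default build this toy table yields three entries; compiling with `-DDISABLE_CONTRIB_OPS` leaves only the `Unsqueeze` entry, which mirrors how the CUDA provider would simply not offer Attention and leave it to another execution provider.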

Motivation and Context

Building onnxruntime with CUDA enabled and --disable_contrib_ops fails:

error C2039: 'GetAttentionKernelOptions': is not a member of 'onnxruntime::cuda::Attention<float>'

This is a valid build configuration (useful for reducing compile time) that should be supported.

The CUDA Attention kernel implementation (core/providers/cuda/llm/attention.cc) depends on contrib ops (flash attention, memory efficient attention, unfused attention helpers from contrib_ops/cuda/bert/). When DISABLE_CONTRIB_OPS is defined, these dependencies are unavailable, causing compilation failures.

Fix by:

1. Excluding attention.h/attention.cc from the CUDA provider build when contrib ops are disabled (cmake change).
2. Guarding the Attention kernel class declarations and registrations in cuda_execution_provider.cc with #ifndef DISABLE_CONTRIB_OPS.

The CPU EP still provides the standard ONNX domain Attention kernel as fallback when the CUDA implementation is unavailable.

Agent-Logs-Url: https://github.com/microsoft/onnxruntime/sessions/4bbef367-4e58-49e5-9bca-8d5a2c8ee872
Co-authored-by: tianleiwu <30328909+tianleiwu@users.noreply.github.com>

github-actions

You can commit the suggested changes from lintrunner.

github-actions · 2026-05-19T15:53:10Z

@@ -3083,9 +3089,11 @@ static Status RegisterCudaKernels(KernelRegistry& kernel_registry) {
      BuildKernelCreateInfo<ONNX_OPERATOR_VERSIONED_KERNEL_CLASS_NAME(kCudaExecutionProvider, kOnnxDomain, 23, 23, Unsqueeze)>,

      // Opset 24


Suggested change

// Opset 24

// Opset 24

github-actions · 2026-05-19T15:53:11Z

@@ -3005,9 +3009,11 @@ static Status RegisterCudaKernels(KernelRegistry& kernel_registry) {
      BuildKernelCreateInfo<ONNX_OPERATOR_TYPED_KERNEL_CLASS_NAME(kCudaExecutionProvider, kOnnxDomain, 22, BFloat16, Sin)>,

      // Opset 23


Suggested change

// Opset 23

// Opset 23

Initial plan

fabb553

Copilot AI assigned Copilot and tianleiwu May 19, 2026

Copilot started work on behalf of tianleiwu May 19, 2026 03:52

Copilot AI linked an issue May 19, 2026 that may be closed by this pull request

[Build] Cannot build onnxruntime with CUDA enabled and contrib ops disabled #28537

Open

Copilot AI changed the title ~~[WIP] Fix onnxruntime build with CUDA enabled and contrib ops disabled~~ Fix CUDA build with contrib ops disabled May 19, 2026

Copilot finished work on behalf of tianleiwu May 19, 2026 03:59

Copilot AI requested a review from tianleiwu May 19, 2026 03:59

github-actions Bot reviewed May 19, 2026

Copilot wants to merge 2 commits into main from copilot/fix-onnxruntime-build-cuda
