Preserving hardware memory during cuvid decoding, exporting/importing via dlpack. #2155

caffeinism · 2026-02-04T16:35:56Z

Hello? I'm a user with limited knowledge of libav, dlpack, and cython. However, recognizing this as a necessary feature, I drafted this with the help of an LLM.

Motivation

If an application decodes video, performs GPU operations, and then re-encodes it, PyAV currently incurs a significant amount of memcopy. (GPU (cuvid) -> CPU (PyAV) -> GPU (Torch, etc.) -> CPU (PyAV) -> GPU (nvenc)) However, if we could export frames decoded by cuvid to dlpack while keeping them on the GPU, we wouldn't need to move the frames to CPU memory.

I passed all existing tests, but with such extensive modifications, it seems difficult for a beginner like me to catch every single detail. However, since most changes involve adding features rather than modifying existing ones, I hope this PR serves as a good starting point.

Usage example

import av
from av.codec.hwaccel import HWAccel
import torch

hwaccel = HWAccel(
    device_type="cuda",
    device=0,
    allow_software_fallback=False,
    output_format="hw", # preserve hw memory
)

# decode using cuvid
with av.open(from_video_filename, "r", hwaccel=hwaccel) as c:
    frame = next(c.decode(video=0))
    y = torch.from_dlpack(frame.planes[0]) # device(type='cuda', index=0), torch.uint8, torch.Size([H, W])
    uv = torch.from_dlpack(frame.planes[1]) # device(type='cuda', index=0), torch.uint8, torch.Size([H/2, W/2])

f = av.VideoFrame.from_dlpack(((y*0.5).to(torch.uint8), uv)) # some operation

with av.open(to_video_filename, "w") as c:
    s = c.add_stream("h264_nvenc", rate=24) # encode using nvenc
    for it in s.encode(f):
        c.mux(it)
    for it in s.encode(None):
        c.mux(it)

caffeinism · 2026-02-04T18:47:06Z

@WyattBlue If I add tests, will it work fine even if it only runs on a CUDA machine? I don't think it will work in the GitHub workflow.

WyattBlue · 2026-02-04T18:48:14Z

You need to test the interface. For example, hw_format does not have an pyi interface, and writing a test would catch that fact.

WyattBlue · 2026-02-04T18:50:52Z

av/hwcontext.pxd‎ should be merged with include/avutil. *.pxd files should otherwise not be free radicals, i.e., they should have a corresponding real .py file.

caffeinism added 3 commits February 5, 2026 01:39

Impl __dlpack__, keep cuda memory

fda4962

Impl VideoFrame.from_dlpack

aaa90db

Impl minimal support device_id

56dd2dc

caffeinism force-pushed the dlpack branch from 5e0f429 to 56dd2dc Compare February 4, 2026 16:39

ruff / isort

9426057

caffeinism force-pushed the dlpack branch from 3ef7b26 to 9426057 Compare February 4, 2026 16:44

WyattBlue added the needs tests This PR needs a test label Feb 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preserving hardware memory during cuvid decoding, exporting/importing via dlpack. #2155

Preserving hardware memory during cuvid decoding, exporting/importing via dlpack. #2155

caffeinism commented Feb 4, 2026 •

edited

Loading

Uh oh!

caffeinism commented Feb 4, 2026

Uh oh!

WyattBlue commented Feb 4, 2026 •

edited

Loading

Uh oh!

WyattBlue commented Feb 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Preserving hardware memory during cuvid decoding, exporting/importing via dlpack. #2155

Are you sure you want to change the base?

Preserving hardware memory during cuvid decoding, exporting/importing via dlpack. #2155

Conversation

caffeinism commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Usage example

Uh oh!

caffeinism commented Feb 4, 2026

Uh oh!

WyattBlue commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

WyattBlue commented Feb 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

caffeinism commented Feb 4, 2026 •

edited

Loading

WyattBlue commented Feb 4, 2026 •

edited

Loading