'mm' should be able to interface with (i.e. read, write, and probe) standard multimedia formats, such as images, video, and audio.
It should use the types present in nx (i.e. tensors, nx.array) whenever possible. For example, reading an image should return just the raw pixel data, or a tuple of the raw data and relevant metadata.
There should also be stream writers for writing long streams, as well as iteration through frames. We should use libav when available, with fallbacks for whatever formats we can support without it (but this module won't be very useful when libav isn't present).
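To make the "raw data plus metadata" idea concrete, here is a minimal sketch in Python of a fallback reader for one format that needs no libav: binary PPM (P6). The name read_ppm, the metadata keys, and the use of a bytes object in place of an nx.array are all assumptions made for illustration, not part of the planned API.

```python
import io

def read_ppm(stream):
    """Parse a binary PPM (P6) image and return (pixels, metadata).

    Hypothetical helper: 'pixels' is a plain bytes object standing in for
    whatever nx type the real module would return.
    """
    def next_token():
        # Header tokens are whitespace-separated; '#' starts a comment line.
        token = b""
        while True:
            ch = stream.read(1)
            if not ch:
                raise ValueError("unexpected end of PPM header")
            if ch == b"#":
                stream.readline()  # discard the rest of the comment
                if token:
                    return token
                continue
            if ch.isspace():
                if token:
                    return token
                continue
            token += ch

    if next_token() != b"P6":
        raise ValueError("only binary PPM (P6) is handled in this sketch")
    width, height, maxval = (int(next_token()) for _ in range(3))
    if maxval != 255:
        raise ValueError("only 8-bit samples are handled in this sketch")

    size = width * height * 3
    pixels = stream.read(size)
    if len(pixels) != size:
        raise ValueError("truncated pixel data")
    metadata = {"width": width, "height": height, "channels": 3, "maxval": maxval}
    return pixels, metadata

# A 2x1 image: one red pixel followed by one blue pixel.
data = b"P6\n2 1\n255\n" + bytes([255, 0, 0, 0, 0, 255])
pixels, meta = read_ppm(io.BytesIO(data))
```

Returning a tuple keeps the pixel data usable on its own while still exposing the dimensions a caller needs to reshape it.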
Types:
mm.ImageStream - from a video, yields the frames of the video as an iterable
mm.AudioStream - from an audio or video source, yields chunks of the audio as an iterable
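The iterable behaviour described for these types could look roughly like the following Python sketch. It assumes fixed-size, uncompressed frames read from a file-like object; the class body, constructor parameters, and use of bytes for each frame are hypothetical, and a real implementation would decode through libav instead.

```python
import io

class ImageStream:
    """Hypothetical stand-in for mm.ImageStream: yields raw frames one at a time."""

    def __init__(self, source, width, height, channels=3):
        self.source = source
        self.width = width
        self.height = height
        self.channels = channels
        # Number of bytes in one uncompressed 8-bit frame.
        self.frame_size = width * height * channels

    def __iter__(self):
        return self

    def __next__(self):
        frame = self.source.read(self.frame_size)
        if len(frame) < self.frame_size:
            # End of stream; a trailing partial frame is dropped.
            raise StopIteration
        return frame

# Two 2x2 RGB frames packed back to back (12 bytes each).
raw = bytes(range(24))
frames = list(ImageStream(io.BytesIO(raw), width=2, height=2))
```

An audio counterpart would follow the same pattern, reading a fixed number of samples per chunk instead of one frame. Because frames are read lazily, a long video never has to be held in memory all at once.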