LlmSpeechSummarization with AllAudioTracksJob handler by eric-mccann-pro · Pull Request #421 · openmpf/openmpf-components

eric-mccann-pro · 2026-02-02T18:18:32Z

Issues:

Support feeding forward all tracks to one sub-job (for Python audio jobs only) openmpf#1996

Related PRs:

Please review our Contributor Guide.

This change is

…g BECAUSE it tries to talk to a vllm container TODO: parameterize the URL

…ither needed at build OR plumbing

…he LLM

…y WFM

… I have tested their functionality

…into feat/ff-all-audio-tracks

…enizers during docker build

…into feat/ff-all-audio-tracks

…tracks

jrobble

@jrobble reviewed 22 files and all commit messages, and made 11 comments.
Reviewable status: all files reviewed, 10 unresolved discussions (waiting on eric-mccann-pro).

python/LlmSpeechSummarization/README.md line 3 at r2 (raw file):

# Overview

The OpenMPF LLM Speech Summarization Component summarizes FeedForward video and audio tracks for speech detections.

Change to "feed-forward".

python/LlmSpeechSummarization/llm_speech_summarization_component/llm_speech_summarization_component.py line 197 at r2 (raw file):

    @staticmethod
    def _get_track_for_classifier(t, job: mpf.VideoJob|mpf.AudioJob, classifier_name, classifier, arg_factory):

t is not descriptive enough. Please change to track_cls. Same for the parameter in _get_classifier_track().

python/LlmSpeechSummarization/llm_speech_summarization_component/llm_speech_summarization_component.py line 281 at r2 (raw file):

        return results

    def _process_feed_forward_job(self, job, config, make_track, make_main_track):

Call make_track make_classifier_track for clarity.

Call make_main_track make_summary_track for clarity.

python/LlmSpeechSummarization/llm_speech_summarization_component/llm_speech_summarization_component.py line 316 at r2 (raw file):

            )

        main_detection_properties = {'TEXT': final_summary.summary}

Call this summary_detection_properties.

python/LlmSpeechSummarization/llm_speech_summarization_component/llm_speech_summarization_component.py line 333 at r2 (raw file):

                    make_track,
                    filter(
                        lambda c: c[1].confidence >= classifier_confidence_minimum,

classifier is more descriptive than c.

python/LlmSpeechSummarization/llm_speech_summarization_component/llm_speech_summarization_component.py line 341 at r2 (raw file):

        return results

def run_component_test(clientFactory = None,

In general, we try to keep the test code separate from the component code. Could this function be moved to test_llm_speech_summarization_component.py?

python/LlmSpeechSummarization/llm_speech_summarization_component/llm_speech_summarization_component.py line 367 at r2 (raw file):

if __name__ == '__main__':

Our other Python components don't have a main. Is this needed?

python/LlmSpeechSummarization/llm_speech_summarization_component/tests/test_llm_speech_summarization_component.py line 33 at r2 (raw file):

from llm_speech_summarization_component.llm_speech_summarization_component import run_component_test, _log_exception, logger

logger.setLevel(logging.DEBUG)

logger isn't used in this file. Please remove it.

python/LlmSpeechSummarization/llm_speech_summarization_component/tests/test_llm_speech_summarization_component.py line 535 at r2 (raw file):

    main_detection = result[0]
    classifier_detection = result[1]
    assert main_detection.detection_properties['TEXT'] == "The conversation is a multifaceted discussion centered on Major League Baseball, primarily revolving around the publication and content of a memoir titled 'Reminiscences of an Old Timer' by former player John (Dasher) Troy. The memoir serves as both a historical reflection on early professional baseball and a practical guide for aspiring players, emphasizing foundational skills, strategic decision-making, and the mental and physical demands of the game. Key themes include player positioning, batting and pitching techniques, base running, fielding mechanics, and the importance of experience, observation, and self-awareness. The discussion also highlights the legacy of early baseball players and teams, the evolution of the sport, and the enduring significance of traditional principles such as proper footwork and timing. While several fragments reference real estate, business operations, and promotional content in New York City—including venues in Harlem, Chelsea, and Manhattan—these appear to be incidental or transcribed artifacts and do not form a coherent narrative. The overwhelming focus remains on professional baseball gameplay, rules, player health, team discipline, and historical context, with consistent references to specific teams, players, stadiums, and equipment. The conversation reflects a deep engagement with the sport’s traditions, strategies, and cultural significance."

This same TEXT appears three times in this file. Refactor it into a common variable like this: https://github.com/openmpf/openmpf-components/blob/master/python/AzureTranslation/tests/test_acs_translation.py#L55

python/LlmSpeechSummarization/llm_speech_summarization_component/tests/test_llm_speech_summarization_component.py line 542 at r2 (raw file):

    try:
        raise _log_exception(mpf.DetectionError.OTHER_DETECTION_ERROR_TYPE, 'It worked')
        assert False

It's not possible to assert False here. This is dead code. Please remove it.

python/TransformerTagging/transformer_tagging_component/transformer_tagging_component.py line 258 at r2 (raw file):

        self.json = self._load_json(corpus_file_name)
        start = time.time()
        self.embed = model.encode(self.json["text"].tolist(), convert_to_tensor=True, show_progress_bar=False)

I will be landing a hotfix to develop in the next day or two. You will need to merge that into this before landing.

jrobble

@jrobble reviewed 3 files and all commit messages, and resolved 10 discussions.
Reviewable status: complete! all files reviewed, all discussions resolved (waiting on eric-mccann-pro and tstrass).

jrobble

@jrobble reviewed all commit messages.
Reviewable status: complete! all files reviewed, all discussions resolved (waiting on eric-mccann-pro and tstrass).

Changes addressed

jrobble

@jrobble reviewed 3 files and all commit messages.
Reviewable status: complete! all files reviewed, all discussions resolved (waiting on eric-mccann-pro and tstrass).

jrobble

@jrobble partially reviewed 4 files.
Reviewable status: 22 of 24 files reviewed, all discussions resolved (waiting on eric-mccann-pro and tstrass).

jrobble

@jrobble partially reviewed 3 files.
Reviewable status: all files reviewed (commit messages unreviewed), all discussions resolved (waiting on eric-mccann-pro and tstrass).

jrobble

@jrobble reviewed all commit messages.
Reviewable status: complete! all files reviewed, all discussions resolved (waiting on eric-mccann-pro and tstrass).

eric-mccann-pro added 30 commits December 12, 2025 12:52

Add QwenSpeechSummarization

2689205

Runs main() in container... does not run main during build for testin…

f01498a

…g BECAUSE it tries to talk to a vllm container TODO: parameterize the URL

Logger won't log. Deal with it later

98bc842

Fix format strings

98bb794

Add primary_topic and other_topics to output

e29c34a

Make sure we download the tokenizer giblets during docker build

0604c07

Mock an LLM generator's events stream. Run pytest if RUN_TESTS is true

97ae53f

Use releasable descriptor

f04d5b0

Readme

86a7ab4

Change default RUN_TEST to false

50cb5f7

Parameterize VLLM_MODEL and VLLM_URI at container scope, as they're e…

e006afd

…ither needed at build OR plumbing

+x

b0b1c15

Include served-model-name param in the entrypoint, not the CMD

198f3ec

Make sure tokenizer pull step has VLLM_MODEL defined in env if overriden

0908932

License blocks

c20c3d2

Make exception text less useless when there are no FF tracks

ebbecd7

Fix typo

5aea1b7

Fix another typo

68c8456

Fix default in descriptor

ae4f6f0

Make speaker id optional

de6f2d3

input_cleanup: be cool

cc151c6

again

16a367c

Change summary and print the final summary after it comes back from t…

7d231e5

…he LLM

Print number of results from component video track func when called b…

3c04189

…y WFM

Actually return results. duh

47ca541

Set an ImageLocation for video tracks

dbed34c

Define CLASSIFIERS_FILE and ENABLED_CLASSIFIERS in the json, now that…

bb5d333

… I have tested their functionality

Gate some of the output behind debug parameter

6948569

Provide Items of Interest instruction

82f37b6

Remove businesses from entities list

9e47148

eric-mccann-pro added 4 commits March 9, 2026 17:47

Use TEXT if no TRANSCRIPT and no TRANSLATION

d9dcd42

Merge remote-tracking branch 'origin/feat/qwen-speech-summarization' …

0ca5be6

…into feat/ff-all-audio-tracks

llmspeechsummarization/Dockerfile: download both qwen and gpt-oss tok…

e0e24a6

…enizers during docker build

Merge remote-tracking branch 'origin/feat/qwen-speech-summarization' …

c334de5

…into feat/ff-all-audio-tracks

tstrass changed the base branch from master to develop March 24, 2026 14:41

tstrass and others added 6 commits April 10, 2026 10:48

Merge remote-tracking branch 'origin/develop' into feat/ff-all-audio-…

7bb1bfc

…tracks

Use tokenizer model for tokenization in audio job

4bbb6e6

Refactor audio and video methods to reduce duplication

ca49e9a

Convert from pandas.Series to list.

f239c00

Patch transformer tagging component

7a31621

Merge branch 'master' into hf-merge/transformer-tagging-encode

80874cb

eric-mccann-pro marked this pull request as ready for review April 13, 2026 15:01

jrobble previously requested changes Apr 14, 2026

View reviewed changes

eric-mccann-pro assigned tstrass Apr 15, 2026

Address PR comments.

d2384f8

jrobble reviewed Apr 15, 2026

View reviewed changes

Merge branch 'develop' into feat/ff-all-audio-tracks

32b87b3

jrobble reviewed Apr 15, 2026

View reviewed changes

Move test data.

009c6a2

jrobble reviewed Apr 15, 2026

View reviewed changes

jrobble added 2 commits April 15, 2026 14:54

Use relative path for test data.

1cac898

Move tests dir.

ecfc4d3

jrobble reviewed Apr 15, 2026

View reviewed changes

Fix typo.

df29b3d

jrobble approved these changes Apr 15, 2026

View reviewed changes

jrobble merged commit 85c7ec3 into develop Apr 15, 2026
2 checks passed

jrobble deleted the feat/ff-all-audio-tracks branch April 15, 2026 21:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LlmSpeechSummarization with AllAudioTracksJob handler#421

LlmSpeechSummarization with AllAudioTracksJob handler#421
jrobble merged 213 commits intodevelopfrom
feat/ff-all-audio-tracks

eric-mccann-pro commented Feb 2, 2026 •

edited

Loading

Uh oh!

jrobble left a comment

Uh oh!

jrobble left a comment

Uh oh!

jrobble left a comment

Uh oh!

jrobble left a comment

Uh oh!

jrobble left a comment

Uh oh!

jrobble left a comment

Uh oh!

jrobble left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

eric-mccann-pro commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jrobble left a comment

Choose a reason for hiding this comment

Uh oh!

jrobble left a comment

Choose a reason for hiding this comment

Uh oh!

jrobble left a comment

Choose a reason for hiding this comment

Uh oh!

jrobble left a comment

Choose a reason for hiding this comment

Uh oh!

jrobble left a comment

Choose a reason for hiding this comment

Uh oh!

jrobble left a comment

Choose a reason for hiding this comment

Uh oh!

jrobble left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

eric-mccann-pro commented Feb 2, 2026 •

edited

Loading