Add JS Speech to text with sherpa-onnx as a more reliable alternative without sidecar by jlocala1 · Pull Request #282 · Deodat-Lawson/LaunchStack

jlocala1 · 2026-04-15T05:28:30Z

Replaces sidecar speech to text with sherpa-onnx that runs natively in node.js. Uses whisper model of same quality (upgrade from what I showed at the meeting with HuggingFace) but without python dependency. Requires ffmpeg and one time model download

…wson/PDR_AI_v2 into feature/speech-to-text

… for viewing audio, and also updated the transcription file. transcription now shows the audio playing with timestamps and allows the user to click on a time stamp or line of audio and have the audio clip jump to that spot.

vercel · 2026-04-15T05:28:34Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
launch-stack	Ready	Preview, Comment	Apr 15, 2026 5:28am
pdr-ai-v2	Ready	Preview, Comment	Apr 15, 2026 5:28am

Deodat-Lawson · 2026-04-16T20:47:07Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 635f12a09a

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-04-16T20:53:41Z

+    const { userId, videoUrl, category, title, preferredProvider } = validation.data;
+
+    const [user] = await db


Bind video upload to authenticated user

This handler trusts a caller-supplied userId from the request body and immediately uses it to load tenant context, but it never verifies that userId matches the authenticated session (or that a session exists). In environments where this route is reachable, an attacker can submit another user's ID and enqueue transcriptions/documents into the wrong company, which is a cross-tenant authorization issue.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-04-16T20:53:41Z

+    with yt_dlp.YoutubeDL(ydl_opts) as ydl:
+        info = ydl.extract_info(url, download=True)


Restrict sidecar download URL before yt-dlp fetch

/download-and-transcribe forwards unvalidated user input directly into yt_dlp.extract_info(). Because this endpoint accepts arbitrary URL strings, callers can force the sidecar to fetch non-approved/internal targets or very large resources, bypassing the app-layer hostname allowlist and turning this endpoint into an SSRF/resource-abuse vector.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-04-16T20:53:41Z

 volumes:
  postgres_data:
-  seaweedfs_data:
+  sidecar_models:


Keep SeaweedFS named volume declared

The top-level volumes section now declares sidecar_models but no longer declares seaweedfs_data, while the seaweedfs service still mounts seaweedfs_data:/data. This leaves the compose file internally inconsistent for the local-storage setup and can break docker compose validation/startup for that profile.

Useful? React with 👍 / 👎.

ezhu15 and others added 5 commits April 5, 2026 17:06

youtube processing w/ yt-dlp sidecar

033118c

Merge branch 'feature/speech-to-text' of https://github.com/Deodat-La…

5783b54

…wson/PDR_AI_v2 into feature/speech-to-text

fixed youtube upload sidecar

a8d29a8

add sherpa-onnx JS transcription as sidecar alternative

635f12a

jlocala1 requested a review from Deodat-Lawson April 15, 2026 05:29

chatgpt-codex-connector bot reviewed Apr 16, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add JS Speech to text with sherpa-onnx as a more reliable alternative without sidecar#282

Add JS Speech to text with sherpa-onnx as a more reliable alternative without sidecar#282
jlocala1 wants to merge 5 commits intomainfrom
feature/whisper-js-experiment

jlocala1 commented Apr 15, 2026

Uh oh!

vercel bot commented Apr 15, 2026

Uh oh!

Deodat-Lawson commented Apr 16, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Apr 16, 2026

Uh oh!

chatgpt-codex-connector bot Apr 16, 2026

Uh oh!

chatgpt-codex-connector bot Apr 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		const { userId, videoUrl, category, title, preferredProvider } = validation.data;

		const [user] = await db

		with yt_dlp.YoutubeDL(ydl_opts) as ydl:
		info = ydl.extract_info(url, download=True)

Conversation

jlocala1 commented Apr 15, 2026

Uh oh!

vercel bot commented Apr 15, 2026

Uh oh!

Deodat-Lawson commented Apr 16, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants