
Conversation

Contributor

@mkaic mkaic commented Dec 23, 2025

Description

Motivation: enable only the smaller Perception Encoder model sizes on serverless.

There is already an inference implementation and workflow block for Perception Encoder, but to the best of my knowledge it is disabled on serverless because the largest checkpoint we support is large enough that it could cause problems (correct me if I'm wrong here).

This PR makes it possible to enable only the two smaller Perception Encoder checkpoints on serverless via a new environment variable, PERCEPTION_ENCODER_DISALLOWED_VERSION_IDS, which is referenced in the pydantic validators of the PerceptionEncoderInferenceRequest class.
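The gating logic can be sketched roughly as follows. This is a minimal sketch, not the actual implementation: the field name `perception_encoder_version_id` and the comma-separated parsing of the environment variable are assumptions.

```python
import os

from pydantic import BaseModel, field_validator

# Parse the disallow-list from the environment once at import time.
# The comma-separated format is an assumption for this sketch.
DISALLOWED_VERSION_IDS = {
    v.strip()
    for v in os.environ.get("PERCEPTION_ENCODER_DISALLOWED_VERSION_IDS", "").split(",")
    if v.strip()
}


class PerceptionEncoderInferenceRequest(BaseModel):
    # Hypothetical field name; the real request class has more fields.
    perception_encoder_version_id: str

    @field_validator("perception_encoder_version_id")
    @classmethod
    def check_version_allowed(cls, value: str) -> str:
        # Reject any version ID the deployment has explicitly disabled,
        # so the user gets a clear error instead of an OOM-prone load.
        if value in DISALLOWED_VERSION_IDS:
            raise ValueError(
                f"Perception Encoder version '{value}' is disabled on this server."
            )
        return value
```

With this shape, setting the environment variable to `"PE-Core-G14-448"` makes requests for that checkpoint fail validation with a descriptive error, while requests for the smaller checkpoints pass through unchanged.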

By setting CORE_MODEL_PE_ENABLED to True and PERCEPTION_ENCODER_DISALLOWED_VERSION_IDS to "PE-Core-G14-448" in roboflow-infra/gcp/serverless-inference/appstack/chart/rf-svrls/values-staging.yaml, serverless will serve PE with the exception of the 9.1 GB G14-448 variant.
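For illustration, the relevant chart values would look something like the fragment below. The surrounding key structure (`env:`) is an assumption about the chart layout, not a copy of the real values file.

```yaml
# Hypothetical fragment of values-staging.yaml; key nesting is assumed.
env:
  CORE_MODEL_PE_ENABLED: "True"
  PERCEPTION_ENCODER_DISALLOWED_VERSION_IDS: "PE-Core-G14-448"
```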

Type of change

  • New feature (non-breaking change which adds functionality)

How has this change been tested, please provide a testcase or example of how you tested the change?

Tested in a workflow running on a locally hosted inference server. Verified that it works properly with 0, 1, and 2 different version IDs in the environment variable, and that the user sees the intended error message when trying to run a disallowed model variant.

Any specific deployment considerations

N/A

Docs

N/A
