[REFACTOR] Replace CostCalculationHelper with litellm.cost_per_token#1906
chandrasekharan-zipstack wants to merge 2 commits into main
Conversation
Move cost calculation from platform-service to sdk1's Audit class, using litellm's built-in cost_per_token() instead of a custom helper that fetched pricing data from an external URL. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Summary by CodeRabbit
Walkthrough

Server-side cost calculation and related utilities were removed; cost is now computed in the SDK using LiteLLM and sent in the usage payload. The platform controller accepts the cost value directly from the payload.
Sequence Diagram

```mermaid
sequenceDiagram
    participant SDK as SDK (Client)
    participant LiteLLM as LiteLLM
    participant PlatformService as Platform Service
    rect rgba(100, 150, 200, 0.5)
        Note over SDK,PlatformService: Old Flow (Server-side cost calc)
        SDK->>PlatformService: POST /usage (model, tokens)
        PlatformService->>PlatformService: Load pricing (cache/URL)
        PlatformService->>PlatformService: Calculate cost
        PlatformService-->>SDK: Response (includes computed cost)
    end
    rect rgba(150, 200, 100, 0.5)
        Note over SDK,PlatformService: New Flow (Client-side cost calc)
        SDK->>LiteLLM: cost_per_token(model, prompt_tokens, completion_tokens)
        LiteLLM-->>SDK: cost_in_dollars
        SDK->>PlatformService: POST /usage (model, tokens, cost_in_dollars)
        PlatformService->>PlatformService: Accept cost from payload
        PlatformService-->>SDK: Response
    end
```
| Filename | Overview |
|---|---|
| unstract/sdk1/src/unstract/sdk1/audit.py | Cost calculation moved from platform-service to SDK; litellm.cost_per_token called before provider-prefix stripping; embedding completion tokens correctly zeroed; exception fallback to 0.0 is safe. |
| platform-service/src/unstract/platform_service/controller/platform.py | CostCalculationHelper removed; cost_in_dollars now read directly from payload with a safe 0.0 default for backward compatibility with older SDK clients. |
| platform-service/src/unstract/platform_service/helper/cost_calculation.py | Deleted — external HTTP pricing fetch, file-storage TTL cache, and manual model-price lookup replaced by litellm's bundled pricing database. |
| platform-service/src/unstract/platform_service/env.py | MODEL_PRICES_URL, MODEL_PRICES_TTL_IN_DAYS, and MODEL_PRICES_FILE_PATH env vars removed; no remaining callers. |
| platform-service/src/unstract/platform_service/utils.py | Orphaned format_float_positional helper removed; no remaining callers after cost_calculation.py deletion. |
Sequence Diagram

```mermaid
sequenceDiagram
    participant SDK as sdk1 Audit
    participant LiteLLM as litellm.cost_per_token
    participant PS as platform-service /usage
    SDK->>LiteLLM: cost_per_token(model="azure/gpt-4o", prompt_tokens, completion_tokens)
    LiteLLM-->>SDK: (prompt_cost, completion_cost)
    Note over SDK: cost_in_dollars = prompt_cost + completion_cost<br/>display_model_name = "gpt-4o" (prefix stripped)
    SDK->>PS: POST /usage { model_name, cost_in_dollars, ... }
    PS-->>SDK: 200 OK
```
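The client-side flow above can be sketched as plain Python. This is a hypothetical illustration, not the SDK's actual code: `cost_fn` stands in for `litellm.cost_per_token` (which returns a `(prompt_cost, completion_cost)` tuple), and the helper and payload key names are assumptions mirroring the diagram.

```python
def build_usage_payload(model: str, prompt_tokens: int, completion_tokens: int,
                        cost_fn) -> dict:
    """Compute cost with the full provider-prefixed model name, then strip
    the prefix for display/storage (e.g. "azure/gpt-4o" -> "gpt-4o")."""
    try:
        prompt_cost, completion_cost = cost_fn(
            model=model,
            prompt_tokens=prompt_tokens,
            completion_tokens=completion_tokens,
        )
        cost_in_dollars = prompt_cost + completion_cost
    except Exception:
        # Unknown or custom models fall back to zero cost
        cost_in_dollars = 0.0
    display_model_name = model.split("/", 1)[-1]
    return {
        "model_name": display_model_name,
        "prompt_tokens": prompt_tokens,
        "completion_tokens": completion_tokens,
        "cost_in_dollars": cost_in_dollars,
    }

# Stand-in for litellm.cost_per_token with made-up per-token rates:
def fake_cost_fn(model, prompt_tokens, completion_tokens):
    return prompt_tokens * 2.5e-6, completion_tokens * 1e-5

payload = build_usage_payload("azure/gpt-4o", 1000, 200, fake_cost_fn)
```

Note the ordering: the cost lookup happens before the prefix is stripped, because LiteLLM's pricing table is keyed by the provider-prefixed name.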
This is a comment left during a code review.
Path: unstract/sdk1/src/unstract/sdk1/audit.py
Line: 104-107
Comment:
**Exception details not captured in cost-lookup failure log**
The bare `except Exception` silently swallows the error type and message. When `cost_per_token` fails for a new or custom model name, the only log entry is the model name — the root cause (e.g. `NotFoundError`, `BudgetExceededError`, malformed model string) is lost. Adding `exc_info=True` makes these failures much easier to diagnose without changing the fallback behaviour.
```suggestion
except Exception:
logger.debug(
"Cost lookup failed for model %s, defaulting to 0",
model_name,
exc_info=True,
)
```
How can I resolve this? If you propose a fix, please make it concise.

Reviews (2). Last reviewed commit: "[REFACTOR] Zero out completion_tokens fo..."
🧹 Nitpick comments (1)
platform-service/src/unstract/platform_service/controller/platform.py (1)
Line 219: Consider validating the `cost_in_dollars` value from the payload. The endpoint now trusts the client-provided `cost_in_dollars` value without validation. While the endpoint is protected by authentication, consider adding basic type validation to ensure data integrity:
- The value could be a non-numeric type (string, None beyond default, dict, etc.)
- The value could be negative, which doesn't make sense for costs
💡 Optional validation

```diff
- cost_in_dollars = payload.get("cost_in_dollars", 0.0)
+ cost_in_dollars = payload.get("cost_in_dollars", 0.0)
+ if not isinstance(cost_in_dollars, (int, float)) or cost_in_dollars < 0:
+     cost_in_dollars = 0.0
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@platform-service/src/unstract/platform_service/controller/platform.py` at line 219, Validate the payload's cost_in_dollars after the line where cost_in_dollars = payload.get("cost_in_dollars", 0.0): ensure it's a numeric value and non-negative by attempting to coerce to float (or checking isinstance int/float) and rejecting invalid input; if coercion fails or the value is < 0, return a 400-style validation error (or set a safe default and log) from the controller function that contains this variable so downstream logic only sees a validated non-negative float.
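A standalone version of that optional validation might look like the following; `sanitize_cost` is a hypothetical helper name for this sketch, not part of the platform-service code.

```python
def sanitize_cost(value) -> float:
    """Coerce a client-supplied cost to a non-negative float; fall back to 0.0."""
    # bool is a subclass of int, so reject it explicitly
    if isinstance(value, bool) or not isinstance(value, (int, float)):
        return 0.0
    return float(value) if value >= 0 else 0.0

# Mirrors: cost_in_dollars = payload.get("cost_in_dollars", 0.0)
payload = {"cost_in_dollars": "not-a-number"}
cost_in_dollars = sanitize_cost(payload.get("cost_in_dollars", 0.0))
```

Silently coercing to `0.0` (rather than returning a 400) keeps the endpoint backward-compatible with older SDK clients that never send the field.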
📒 Files selected for processing (5)
- platform-service/src/unstract/platform_service/controller/platform.py
- platform-service/src/unstract/platform_service/env.py
- platform-service/src/unstract/platform_service/helper/cost_calculation.py
- platform-service/src/unstract/platform_service/utils.py
- unstract/sdk1/src/unstract/sdk1/audit.py
💤 Files with no reviewable changes (3)
- platform-service/src/unstract/platform_service/utils.py
- platform-service/src/unstract/platform_service/env.py
- platform-service/src/unstract/platform_service/helper/cost_calculation.py
Explicitly set completion_tokens to 0 for embedding events before calling cost_per_token, making the assumption that embeddings have no completion tokens explicit rather than relying on the counter always being zero. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
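The commit above can be illustrated with a small helper. The function name and event-type string are assumptions for this sketch, not the SDK's actual identifiers.

```python
def normalize_token_counts(event_type: str, prompt_tokens: int,
                           completion_tokens: int) -> tuple[int, int]:
    """Make the 'embeddings have no completion tokens' assumption explicit
    instead of relying on the counter always being zero."""
    if event_type == "embedding":
        completion_tokens = 0
    return prompt_tokens, completion_tokens

# An embedding event keeps its prompt tokens but zeroes completions:
tokens = normalize_token_counts("embedding", 512, 7)
```

Zeroing before the cost lookup matters because `cost_per_token` would otherwise price stray completion tokens for a model that cannot produce any.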
Test Results Summary
- Runner Tests - Full Report
- SDK1 Tests - Full Report
🧹 Nitpick comments (1)
unstract/sdk1/src/unstract/sdk1/audit.py (1)
Lines 104-107: Consider logging the exception details for easier debugging. The exception is caught but the actual error message is not logged, making it harder to diagnose failures. For unknown models this is fine, but other failures (e.g., litellm API changes, unexpected types) would be harder to debug.
Proposed improvement

```diff
  except Exception:
      logger.debug(
-         "Cost lookup failed for model %s, defaulting to 0", model_name
+         "Cost lookup failed for model %s, defaulting to 0", model_name,
+         exc_info=True,
      )
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@unstract/sdk1/src/unstract/sdk1/audit.py` around lines 104 - 107, The except block in the cost lookup (in unstract.sdk1.audit.py, around the function performing model cost lookup) currently swallows all exceptions and only logs the model_name; modify the exception handler to include the actual exception details—either by capturing the exception as e and adding it to the log message (e.g., include str(e)) or by passing exc_info=True to logger.debug—so failures from litellm API changes or unexpected types are recorded alongside the existing "Cost lookup failed for model %s, defaulting to 0" message.
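The effect of `exc_info=True` can be demonstrated in isolation. The failure below is simulated with a `KeyError`, which stands in for whatever `cost_per_token` might actually raise; the logger setup is only there to capture output for inspection.

```python
import io
import logging

buf = io.StringIO()
logger = logging.getLogger("audit_demo")
logger.setLevel(logging.DEBUG)
logger.propagate = False
logger.addHandler(logging.StreamHandler(buf))

def cost_or_zero(model_name: str) -> float:
    try:
        raise KeyError(model_name)  # stand-in for a cost_per_token failure
    except Exception:
        # exc_info=True appends the full traceback to the log record,
        # preserving the root cause without changing the 0.0 fallback
        logger.debug("Cost lookup failed for model %s, defaulting to 0",
                     model_name, exc_info=True)
        return 0.0

result = cost_or_zero("custom/unknown-model")
```

The captured log now contains the exception type and traceback alongside the original message, so a `NotFoundError` versus a malformed model string is distinguishable from the logs alone.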
📒 Files selected for processing (1)
unstract/sdk1/src/unstract/sdk1/audit.py



What
Replaced `CostCalculationHelper` in platform-service with litellm's built-in `cost_per_token()`, moving cost calculation to sdk1's `Audit` class (caller-side).

Why
`CostCalculationHelper` fetched pricing data from an external URL, cached it in file storage with a TTL, and did manual price lookups — all of which litellm already handles natively with its bundled pricing database (2645+ models).
How
- `audit.py` (sdk1): Compute cost via `litellm.cost_per_token()` using the full model name (e.g. `azure/gpt-4o`) before stripping the provider prefix for DB storage. Send pre-computed `cost_in_dollars` in the payload to platform-service.
- `platform.py` (platform-service): Read `cost_in_dollars` directly from the payload instead of computing it. Removed `CostCalculationHelper` import, `provider` variable, and input token branching logic.
- `cost_calculation.py`: No longer needed — was only used by the `/usage` endpoint.
- `env.py`: Removed `MODEL_PRICES_URL`, `MODEL_PRICES_TTL_IN_DAYS`, `MODEL_PRICES_FILE_PATH` env vars.
- `utils.py`: Removed orphaned `format_float_positional` function.

Can this PR break any existing features? If yes, please list possible items. If no, please explain why. (PS: Admins do not merge the PR without this section filled)
- Model names stored for display remain prefix-stripped (`gpt-4o`, not `azure/gpt-4o`). API deployment responses and dashboard queries are unaffected.
- Unknown models fall back to `cost=0.0` (same behavior as before via the `except Exception` fallback).
- The `/usage` endpoint is backward-compatible: if `cost_in_dollars` is not in the payload (e.g. from an older SDK), it defaults to `0.0`.

Database Migrations
Env Config
- Removed `MODEL_PRICES_URL`, `MODEL_PRICES_TTL_IN_DAYS`, `MODEL_PRICES_FILE_PATH` (no longer needed)

Relevant Docs
Related Issues or PRs
Dependencies Versions
- `litellm` is already a transitive dependency of platform-service via `unstract-sdk1`.

Notes on Testing
- Verified the `/usage` endpoint accepts requests and stores cost correctly.
- Compared `litellm.cost_per_token()` against previous `CostCalculationHelper` output for `azure/gpt-4o` — values match.
- Tested with `gpt-4o`, `gpt-4o-mini`, `azure/gpt-4o`, `claude-sonnet-4-20250514`, and `text-embedding-3-small`; unknown models fall back to `cost=0.0`.

Screenshots
N/A
Checklist
I have read and understood the Contribution Guidelines.