Conversation

@squadgazzz (Contributor) commented Feb 9, 2026

Description

Replaces #4044. Once we started forwarding native price estimates from the orderbook to the autopilot, CoinGecko API usage went up: the estimator moved to the autopilot, which now handles all requests and also relies on CoinGecko.

This PR refactors native price estimation as described below, following the diagram created by @MartinquaXD:

[architecture diagram]

Changes

  • Split CachingNativePriceEstimator into three focused components: a passive Cache (shared data store), a
    CachingNativePriceEstimator (on-demand price fetching with caching), and a NativePriceUpdater (background maintenance worker).
  • The autopilot now creates a single shared Cache instance used by both the API-facing estimator and the auction
    competition estimator, eliminating duplicate price fetches for the same tokens.
  • Added an --api-native-price-estimators flag to the autopilot, allowing the API endpoint to use different native price estimator sources than the auction pipeline (falls back to --native-price-estimators if unset).
  • Replaced the implicit priority-based update system (high_priority + replace_high_priority) with explicit token tracking via set_tokens_to_update(), called while the autopilot builds its solvable orders cache.
  • Moved the cache + updater wiring out of the shared factory into each binary, so each service composes the shared building blocks as needed (see the sketch after this list):
    • Orderbook: creates its own Cache + CachingNativePriceEstimator.
    • Autopilot: creates two separate CachingNativePriceEstimators that share a single Cache instance. One estimator is wrapped with the NativePriceUpdater and used in the auction competition; the other serves the autopilot's API requests without a maintenance task.
  • Removed the unused --native-price-cache-max-update-size flag (dead code after the refactoring removed the truncation logic).
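
A minimal sketch of the shared-cache composition described above, using stub types; all names, fields, and signatures here are illustrative assumptions, not the exact API from this PR:

// Sketch only: stub types mirroring the components named in this PR.
use std::{
    collections::HashMap,
    sync::{Arc, Mutex},
    time::{Duration, Instant},
};

type Token = String;
type Price = f64;

/// Passive shared data store: holds prices, knows nothing about fetching.
#[derive(Clone, Default)]
struct Cache(Arc<Mutex<HashMap<Token, (Price, Instant)>>>);

impl Cache {
    fn get(&self, token: &Token, max_age: Duration) -> Option<Price> {
        let data = self.0.lock().unwrap();
        data.get(token)
            .filter(|(_, at)| at.elapsed() < max_age)
            .map(|(price, _)| *price)
    }

    fn insert(&self, token: Token, price: Price) {
        self.0.lock().unwrap().insert(token, (price, Instant::now()));
    }
}

/// On-demand estimator: consults the shared cache first, fetches on a miss.
#[derive(Clone)]
struct CachingNativePriceEstimator {
    cache: Cache,
    max_age: Duration,
}

impl CachingNativePriceEstimator {
    fn estimate(&self, token: &Token) -> Price {
        if let Some(price) = self.cache.get(token, self.max_age) {
            return price; // cache hit: no external API call
        }
        let price = 1.0; // stand-in for the real external fetch
        self.cache.insert(token.clone(), price);
        price
    }
}

fn main() {
    let max_age = Duration::from_secs(30);
    // One shared Cache instance...
    let cache = Cache::default();
    // ...used by the auction-competition estimator (which the real
    // NativePriceUpdater would keep fresh in the background)...
    let competition = CachingNativePriceEstimator { cache: cache.clone(), max_age };
    // ...and by the API-facing estimator, which has no maintenance task.
    let api = CachingNativePriceEstimator { cache, max_age };

    let weth = "WETH".to_string();
    competition.estimate(&weth); // populates the shared cache
    api.estimate(&weth); // served from the cache: no duplicate fetch
}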

How to test

Existing tests + staging and prod.

Follow-up tasks

  • The cache is unbounded. This needs to be revisited; bounding it with a simple LRU cache would probably be safer (see the sketch below).
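
A minimal sketch of that follow-up idea, assuming the lru crate (https://crates.io/crates/lru); the capacity and key/value types are illustrative:

use lru::LruCache;
use std::num::NonZeroUsize;

fn main() {
    // Evicts the least recently used entry once capacity is reached, so a
    // flood of unique tokens can no longer grow the cache without bound.
    let capacity = NonZeroUsize::new(2).unwrap();
    let mut prices: LruCache<String, f64> = LruCache::new(capacity);

    prices.put("WETH".into(), 1.0);
    prices.put("USDC".into(), 0.0003);
    prices.put("DAI".into(), 0.0003); // evicts "WETH", the oldest entry

    assert!(prices.get("WETH").is_none());
    assert!(prices.get("DAI").is_some());
}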

@squadgazzz (Contributor, Author) commented

/gemini review

@gemini-code-assist bot left a comment

Code Review

This pull request refactors the native price estimation by splitting CachingNativePriceEstimator into Cache, CachingNativePriceEstimator, and NativePriceUpdater. This change is intended to reduce API usage by sharing the cache in the autopilot service. The implementation is sound, but a critical validation check for the cache configuration was removed during this refactoring. This could lead to excessive API calls, negating the benefits of the cache. A specific comment has been added to address this, and the suggestion to panic on configuration error aligns with established repository rules for fail-fast behavior in critical background tasks.
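
A minimal illustration of the fail-fast pattern that review note refers to; the config fields and the exact check are hypothetical, not the PR's actual code:

// Hypothetical startup validation; panicking here surfaces a misconfiguration
// immediately instead of letting a background task hammer the price APIs.
use std::time::Duration;

struct CacheConfig {
    max_age: Duration,
    update_interval: Duration,
}

fn validate(config: &CacheConfig) {
    assert!(
        config.update_interval < config.max_age,
        "update interval must be shorter than the cache max age"
    );
}

fn main() {
    let config = CacheConfig {
        max_age: Duration::from_secs(300),
        update_interval: Duration::from_secs(60),
    };
    validate(&config); // swapped values would panic at startup
}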

@squadgazzz force-pushed the shared-native-price-cache-2 branch from 4fba623 to fc3b701 on February 10, 2026 10:05
@squadgazzz changed the base branch from main to fix-flaky-e2e-test on February 10, 2026 11:44
@squadgazzz marked this pull request as ready for review on February 10, 2026 12:26
@squadgazzz requested a review from a team as a code owner on February 10, 2026 12:26
@gemini-code-assist bot left a comment

Code Review

This pull request refactors the native price estimation by splitting the CachingNativePriceEstimator to improve architecture and reduce redundant API calls. However, it introduces a resource exhaustion vulnerability by removing the limit on background updates. An attacker could flood the orderbook with solvable orders for a large number of unique tokens, leading to excessive API usage and a potential Denial of Service. Additionally, there is a bug where malformed 0.0 prices can be returned from the cache. Finally, there is an opportunity to reduce code duplication by introducing a shared helper function for creating the CachingNativePriceEstimator.

Comment on lines 585 to 602
let outdated_entries = cache.outdated_tokens(max_age, Instant::now());

tracing::trace!(count = outdated_entries.len(), "outdated prices to fetch");

metrics
    .native_price_cache_outdated_entries
    .set(i64::try_from(outdated_entries.len()).unwrap_or(i64::MAX));

if outdated_entries.is_empty() {
    return;
}

let timeout = self.estimator.0.quote_timeout;
let mut stream =
    self.estimator
        .0
        .estimate_prices_and_update_cache(&outdated_entries, max_age, timeout);
while stream.next().await.is_some() {}
@gemini-code-assist bot commented

security-high

The removal of the limit on the number of tokens updated in a single maintenance cycle creates a potential resource exhaustion vulnerability. An attacker can flood the orderbook with solvable orders for a large number of unique tokens, forcing the autopilot's background task to attempt price fetches for all these tokens, leading to excessive API usage and a potential Denial of Service. Re-introduce a limit on the number of tokens updated per cycle and implement a priority-based update mechanism.

@squadgazzz (Contributor, Author) commented Feb 10, 2026

That would be a very expensive attack: creating thousands of orders with unique token pairs, where each sell token needs a sufficient balance.

@squadgazzz marked this pull request as ready for review on February 10, 2026 20:00
@gemini-code-assist bot left a comment

Code Review

The pull request successfully refactors the native price estimation system by introducing a shared Cache, a CachingNativePriceEstimator, and a NativePriceUpdater. This aligns with the stated goal of splitting the estimator into focused components and sharing the cache across different parts of the application (API and auction competition). The changes correctly implement the new architecture, including the removal of the implicit priority-based update system and the --native-price-cache-max-update-size flag. The relocation of configuration checks and updates to test cases are consistent with the refactoring. No critical issues or direct logic errors were found in this refactoring.

@squadgazzz force-pushed the shared-native-price-cache-2 branch from 34dcce5 to dad1fc5 on February 11, 2026 08:55
@jmg-duarte left a comment

Some small notes. Everything makes sense to me, with the small exception of Martin's comment, which I still don't fully grasp. Edit: Martin explained it on Slack, his idea makes perfect sense now.

Comment on lines 271 to 285
let cached = Self::get_ready_to_use_cached_price(
    *token,
    now,
    &mut cache,
    &self.0.max_age,
    create_missing_entries,
);
let label = if cached.is_some() { "hits" } else { "misses" };
CacheMetrics::get()
    .native_price_cache_access
    .with_label_values(&[label])
    .inc_by(1);
if let Some(result) = cached {
    results.insert(*token, result.result);
}
@jmg-duarte commented

Additionally, separate methods for the hits/misses metrics would make this much shorter and easier to read.

Suggested change
-let cached = Self::get_ready_to_use_cached_price(
-    *token,
-    now,
-    &mut cache,
-    &self.0.max_age,
-    create_missing_entries,
-);
-let label = if cached.is_some() { "hits" } else { "misses" };
-CacheMetrics::get()
-    .native_price_cache_access
-    .with_label_values(&[label])
-    .inc_by(1);
-if let Some(result) = cached {
-    results.insert(*token, result.result);
-}
+if let Some(cached) = Self::get_ready_to_use_cached_price(
+    *token,
+    now,
+    &mut cache,
+    &self.0.max_age,
+    create_missing_entries,
+) {
+    results.insert(*token, cached.result);
+    CacheMetrics::get()
+        .native_price_cache_access
+        .with_label_values(&["hits"])
+        .inc_by(1);
+} else {
+    CacheMetrics::get()
+        .native_price_cache_access
+        .with_label_values(&["misses"])
+        .inc_by(1);
+}

@squadgazzz (Author) replied

Doesn't your version add more lines of code and create code duplication in terms of incrementing the metric?

@jmg-duarte replied

The suggestion alone, yes, but in the comment I also suggest creating a method for the hits and the misses metrics.

Regardless, it's just a suggestion
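
A hypothetical shape of the helper described above; the stub CacheMetrics struct here is illustrative, not the PR's real type:

// Sketch under the assumption of the prometheus crate; only the metric name
// and label values match the snippets quoted in this review.
use prometheus::{IntCounterVec, Opts};

struct CacheMetrics {
    native_price_cache_access: IntCounterVec,
}

impl CacheMetrics {
    fn new() -> Self {
        let counter = IntCounterVec::new(
            Opts::new("native_price_cache_access", "cache hits/misses"),
            &["result"],
        )
        .unwrap();
        Self { native_price_cache_access: counter }
    }

    fn record_hit(&self) {
        self.native_price_cache_access
            .with_label_values(&["hits"])
            .inc();
    }

    fn record_miss(&self) {
        self.native_price_cache_access
            .with_label_values(&["misses"])
            .inc();
    }
}

fn main() {
    let metrics = CacheMetrics::new();
    // The call site then reduces to a single branch:
    let cached: Option<f64> = None;
    match cached {
        Some(_) => metrics.record_hit(),
        None => metrics.record_miss(),
    }
}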

Comment on lines 597 to 599
metrics
    .native_price_cache_background_updates
    .inc_by(outdated_entries.len() as u64);
@jmg-duarte commented

Should this metric only count attempts at updating native price?
Some results of native price estimation are not cacheable - would it be useful to track successes/failures?

@squadgazzz (Author) replied

We already have a separate metric for success/failures:

price_estimates: IntCounterVec,

@MartinquaXD left a comment

Looks alright to me. I believe my suggestion doesn't violate any invariants, but since I'm not sure, I'll already approve the current state.
I wanted to suggest testing it again in prod, but I saw you already did that. 👍

Comment on lines 227 to 229
 ) -> Option<CachedResult> {
-    Self::get_cached_price(token, now, cache, max_age, create_missing_entry)
-        .filter(|cached| cached.is_ready())
+    Self::get_cached_price(token, now, cache, max_age).filter(|cached| cached.is_ready())
 }
@MartinquaXD commented

This function stood out to me as being kind of pointless. There is only 1 case where we currently do not also require is_ready() and that instance can be avoided by changing how a cached entry gets updated.

I originally wanted to just comment on this but I realized that explaining the issue is more complicated than just offering a diff. It gets rid of ~27 lines by reducing the API surface.

diff --git a/crates/shared/src/price_estimation/native_price_cache.rs b/crates/shared/src/price_estimation/native_price_cache.rs
index 416ce8206..201d3dc3a 100644
--- a/crates/shared/src/price_estimation/native_price_cache.rs
+++ b/crates/shared/src/price_estimation/native_price_cache.rs
@@ -104,25 +104,30 @@ struct CachedResult {
 const ACCUMULATIVE_ERRORS_THRESHOLD: u32 = 5;
 
 impl CachedResult {
-    fn new(
-        result: CacheEntry,
-        updated_at: Instant,
-        requested_at: Instant,
-        current_accumulative_errors_count: u32,
-    ) -> Self {
-        let estimator_internal_errors_count =
-            matches!(result, Err(PriceEstimationError::EstimatorInternal(_)))
-                .then_some(current_accumulative_errors_count + 1)
-                .unwrap_or_default();
+    fn new(result: CacheEntry) -> Self {
+        let now = Instant::now();
+        let is_accumulating_error =
+            matches!(result, Err(PriceEstimationError::EstimatorInternal(_)));
 
         Self {
             result,
-            updated_at,
-            requested_at,
-            accumulative_errors_count: estimator_internal_errors_count,
+            updated_at: now,
+            requested_at: now,
+            accumulative_errors_count: u32::from(is_accumulating_error),
         }
     }
 
+    fn update(&mut self, result: CacheEntry) {
+        let now = Instant::now();
+        self.requested_at = now;
+        self.updated_at = now;
+        self.accumulative_errors_count = match result {
+            Err(PriceEstimationError::EstimatorInternal(_)) => self.accumulative_errors_count + 1,
+            _ => 0,
+        };
+        self.result = result;
+    }
+
     /// The result is not ready if the estimator has returned an internal error
     /// and consecutive errors are less than
     /// `ESTIMATOR_INTERNAL_ERRORS_THRESHOLD`.
@@ -170,12 +175,12 @@ impl Cache {
                 let updated_at = Self::random_updated_at(max_age, now, &mut rng);
                 Some((
                     token,
-                    CachedResult::new(
-                        Ok(from_normalized_price(price)?),
+                    CachedResult {
+                        result: Ok(from_normalized_price(price)?),
                         updated_at,
-                        now,
-                        Default::default(),
-                    ),
+                        requested_at: now,
+                        accumulative_errors_count: 0,
+                    },
                 ))
             })
             .collect::<HashMap<_, _>>();
@@ -186,7 +191,7 @@ impl Cache {
         }))
     }
 
-    pub fn max_age(&self) -> Duration {
+    fn max_age(&self) -> Duration {
         self.0.max_age
     }
 
@@ -199,14 +204,10 @@ impl Cache {
         now - Duration::from_secs(age)
     }
 
-    pub fn len(&self) -> usize {
+    fn len(&self) -> usize {
         self.0.data.lock().unwrap().len()
     }
 
-    pub fn is_empty(&self) -> bool {
-        self.0.data.lock().unwrap().is_empty()
-    }
-
     fn get_cached_price(
         token: Address,
         now: Instant,
@@ -216,20 +217,11 @@ impl Cache {
         let entry = cache.get_mut(&token)?;
         entry.requested_at = now;
         let is_recent = now.saturating_duration_since(entry.updated_at) < *max_age;
-        is_recent.then_some(entry.clone())
-    }
-
-    fn get_ready_to_use_cached_price(
-        token: Address,
-        now: Instant,
-        cache: &mut MutexGuard<HashMap<Address, CachedResult>>,
-        max_age: &Duration,
-    ) -> Option<CachedResult> {
-        Self::get_cached_price(token, now, cache, max_age).filter(|cached| cached.is_ready())
+        (is_recent && entry.is_ready()).then_some(entry.clone())
     }
 
     /// Only returns prices that are currently cached.
-    pub fn get_cached_prices(
+    fn get_cached_prices(
         &self,
         tokens: &[Address],
     ) -> HashMap<Address, Result<f64, PriceEstimationError>> {
@@ -237,8 +229,7 @@ impl Cache {
         let mut cache = self.0.data.lock().unwrap();
         let mut results = HashMap::default();
         for token in tokens {
-            let cached =
-                Self::get_ready_to_use_cached_price(*token, now, &mut cache, &self.0.max_age);
+            let cached = Self::get_cached_price(*token, now, &mut cache, &self.0.max_age);
             let label = if cached.is_some() { "hits" } else { "misses" };
             CacheMetrics::get()
                 .native_price_cache_access
@@ -251,8 +242,12 @@ impl Cache {
         results
     }
 
-    fn insert(&self, token: Address, result: CachedResult) {
-        self.0.data.lock().unwrap().insert(token, result);
+    fn insert(&self, token: Address, result: CacheEntry) {
+        let mut cache = self.0.data.lock().unwrap();
+        cache
+            .entry(token)
+            .and_modify(|value| value.update(result.clone()))
+            .or_insert_with(|| CachedResult::new(result));
     }
 }
 
@@ -314,19 +309,14 @@ impl CachingNativePriceEstimator {
         I::IntoIter: Send + 'a,
     {
         let estimates = tokens.into_iter().map(move |token| async move {
-            let current_accumulative_errors_count = {
-                // check if the price is cached by now
-                let now = Instant::now();
+            // check if the price is cached by now
+            let now = Instant::now();
+            {
                 let mut cache = self.0.cache.0.data.lock().unwrap();
-
-                match Cache::get_cached_price(token, now, &mut cache, &max_age) {
-                    Some(cached) if cached.is_ready() => {
-                        return (token, cached.result);
-                    }
-                    Some(cached) => cached.accumulative_errors_count,
-                    None => Default::default(),
+                if let Some(cached) = Cache::get_cached_price(token, now, &mut cache, &max_age) {
+                    return (token, cached.result);
                 }
-            };
+            }
 
             let approximation = self
                 .0
@@ -344,11 +334,7 @@ impl CachingNativePriceEstimator {
 
             // update price in cache
             if should_cache(&result) {
-                let now = Instant::now();
-                self.0.cache.insert(
-                    token,
-                    CachedResult::new(result.clone(), now, now, current_accumulative_errors_count),
-                );
+                self.0.cache.insert(token, result.clone());
             };
 
             (token, result)
@@ -362,20 +348,12 @@ impl CachingNativePriceEstimator {
         &self.0.cache
     }
 
-    /// Only returns prices that are currently cached.
-    fn get_cached_prices(
-        &self,
-        tokens: &[Address],
-    ) -> HashMap<Address, Result<f64, PriceEstimationError>> {
-        self.0.cache.get_cached_prices(tokens)
-    }
-
     pub async fn fetch_prices(
         &self,
         tokens: &[Address],
         timeout: Duration,
     ) -> HashMap<Address, NativePriceEstimateResult> {
-        let mut prices = self.get_cached_prices(tokens);
+        let mut prices = self.0.cache.get_cached_prices(tokens);
         if timeout.is_zero() {
             return prices;
         }
@@ -413,12 +391,7 @@ impl NativePriceEstimating for CachingNativePriceEstimator {
             let cached = {
                 let now = Instant::now();
                 let mut cache = self.0.cache.0.data.lock().unwrap();
-                Cache::get_ready_to_use_cached_price(
-                    token,
-                    now,
-                    &mut cache,
-                    &self.0.cache.0.max_age,
-                )
+                Cache::get_cached_price(token, now, &mut cache, &self.0.cache.0.max_age)
             };
 
             let label = if cached.is_some() { "hits" } else { "misses" };

@squadgazzz (Author) replied

Thanks! Applied the diff 🙌

@squadgazzz added the hotfix label (Labels PRs that should be applied into production right away) Feb 11, 2026
@squadgazzz enabled auto-merge February 11, 2026 18:52
@squadgazzz added this pull request to the merge queue Feb 11, 2026
Merged via the queue into main with commit 558b1bc Feb 11, 2026
19 checks passed
@squadgazzz deleted the shared-native-price-cache-2 branch February 11, 2026 19:15
@github-actions bot locked and limited conversation to collaborators Feb 11, 2026