[Flagging] Merge evaluations into develop by typotter · Pull Request #3183 · DataDog/dd-sdk-android

typotter · 2026-02-17T13:12:09Z

What does this PR do?

Motivation

We have implemented Evaluation Logging in the Flagging module. This provides comprehensive visibility into all feature flag evaluations, including defaults, errors, and successful matches. This goes beyond exposure logging by capturing aggregated metrics about evaluation frequency, error rates, and runtime default usage across all flags.

What inspired you to submit this pull request?

Merge the feature branch into develop

Additional Notes

Thank you to all the reviewers along the way!

Anything else we should know when reviewing?

This PR contains the following PRs, and a merge from main to catch up and the deletion of the batched event schema.

🥞 Evaluation Logging Stacked Pull Requests 🥞

-#3147

Review checklist (to be filled by reviewers)

Feature or bugfix MUST have appropriate tests (unit, integration, e2e)
Make sure you discussed the feature or bugfix with the maintaining team in an Issue
Make sure each commit and the PR mention the Issue number (cf the CONTRIBUTING doc)

[Flags] Evaluations subfeature Co-authored-by: typotter <tyler.potter@datadoghq.com>

[FFL-1720] Evaluation Logging: Event Schema & Data Models Co-authored-by: typotter <tyler.potter@datadoghq.com>

[Flags] FlagEvaluation schema Co-authored-by: typotter <tyler.potter@datadoghq.com>

Implements the core aggregation logic for evaluation logging (EVALLOG). Aggregates flag evaluations by key before flushing to reduce network overhead. Key components: - EvaluationEventsProcessor: Aggregates evaluations with time/size-based flushing - Time-based: Configurable interval (default 10s, range 1-60s) - Size-based: Auto-flush at 1000 unique aggregations - Shutdown: Final flush on processor.stop() - AggregationKey: Composite key for grouping evaluations - Groups by: flag, variant, allocation, targeting key, error code - EVALLOG.8: Omits variant/allocation for DEFAULT/ERROR reasons - AggregationStats: Tracks aggregated statistics per key - Count, first/last timestamps, last error message - Thread-safe with @volatile fields and synchronized blocks - EvaluationEventWriter: Interface for persisting FlagEvaluation events - Abstraction allows testing without storage implementation Test infrastructure: - FlagEvaluationAssert: Custom assertions for validation - FlagEvaluationForgeryFactory: Test data generator - EvaluationContextForgeryFactory: Context data generator Uses BatchedFlagEvaluations.FlagEvaluation from PR #1 schema. No runtime integration - isolated business logic only. EVALLOG compliance: 2, 3, 4, 5, 8, 10, 11, 13

- Replace destructuring with 4 entries to individual assignments - Use safe cast for nullable error message

- AggregationKeyTest: Tests aggregation key generation, equality, and grouping logic - AggregationStatsTest: Tests statistics tracking, error message updates, and thread safety - EvaluationEventsProcessorTest: Tests processor orchestration, flush triggers, and concurrency These tests cover: - Aggregation by error code with last message preservation - Thread-safe concurrent operations with high contention scenarios - All flush triggers (time, size, shutdown) - Field validation per evaluation logging spec

[FFL-1720] Evaluation Logging: Aggregation Engine & Test Utilities Co-authored-by: typotter <tyler.potter@datadoghq.com> Co-authored-by: 0xnm <nikita.ogorodnikov@datadoghq.com>

…r3-storage-network

…r4-integration

[FFL-1720] Evaluation Logging: Integration Wires the evaluation logging feature end-to-end by connecting the EvaluationsFeature to the flag evaluation flow. This is the final PR that enables evaluation logging in the Flags SDK.

[FFL-1720] Evaluation Logging: Storage & Network Infrastructure

typotter · 2026-02-17T13:23:28Z

/merge

gh-worker-devflow-routing-ef8351 · 2026-02-17T13:23:32Z

View all feedbacks in Devflow UI.

2026-02-17 13:23:32 UTC ℹ️ Start processing command /merge

2026-02-17 13:23:37 UTC ℹ️ MergeQueue: waiting for PR to be ready

This pull request is not mergeable according to GitHub. Common reasons include pending required checks, missing approvals, or merge conflicts — but it could also be blocked by other repository rules or settings.
It will be added to the queue as soon as checks pass and/or get approvals. View in MergeQueue UI.
Note: if you pushed new commits since the last approval, you may need additional approval.
You can remove it from the waiting list with /remove command.

2026-02-17 14:18:11 UTC ℹ️ MergeQueue: merge request added to the queue

The expected merge time in develop is approximately 1h (p90).

2026-02-17 15:29:15 UTC ℹ️ MergeQueue: This merge request was merged

codecov-commenter · 2026-02-17T13:52:52Z

Codecov Report

❌ Patch coverage is 78.81944% with 61 lines in your changes missing coverage. Please review.
✅ Project coverage is 71.28%. Comparing base (4711ad8) to head (68b597a).
⚠️ Report is 279 commits behind head on develop.

Files with missing lines	Patch %	Lines
...tadog/android/flags/internal/EvaluationsFeature.kt	18.87%	43 Missing ⚠️
...tadog/android/flags/internal/DatadogFlagsClient.kt	67.86%	6 Missing and 3 partials ⚠️
...in/kotlin/com/datadog/android/flags/FlagsClient.kt	33.33%	3 Missing and 1 partial ⚠️
...ndroid/flags/internal/EvaluationEventsProcessor.kt	95.16%	2 Missing and 1 partial ⚠️
...in/com/datadog/android/flags/FlagsConfiguration.kt	84.62%	2 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #3183      +/-   ##
===========================================
+ Coverage    71.21%   71.28%   +0.07%     
===========================================
  Files          922      929       +7     
  Lines        34173    34450     +277     
  Branches      5776     5817      +41     
===========================================
+ Hits         24334    24557     +223     
- Misses        8205     8255      +50     
- Partials      1634     1638       +4

Files with missing lines	Coverage Δ
.../kotlin/com/datadog/android/api/feature/Feature.kt	`100.00% <ø> (ø)`
...src/main/kotlin/com/datadog/android/flags/Flags.kt	`90.91% <100.00%> (+7.58%)`	⬆️
.../android/flags/internal/ExposureEventsProcessor.kt	`100.00% <100.00%> (ø)`
...com/datadog/android/flags/internal/FlagsFeature.kt	`85.07% <100.00%> (+1.49%)`	⬆️
...droid/flags/internal/aggregation/AggregationKey.kt	`100.00% <100.00%> (ø)`
...oid/flags/internal/aggregation/AggregationStats.kt	`100.00% <100.00%> (ø)`
...flags/internal/aggregation/EvaluationAggregator.kt	`100.00% <100.00%> (ø)`
...id/flags/internal/net/EvaluationsRequestFactory.kt	`100.00% <100.00%> (ø)`
...gs/internal/storage/EvaluationEventRecordWriter.kt	`100.00% <100.00%> (ø)`
...in/com/datadog/android/flags/FlagsConfiguration.kt	`89.74% <84.62%> (-2.85%)`	⬇️
... and 4 more

... and 32 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

plousada · 2026-02-18T11:31:11Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: a90ef9333f

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-02-18T11:38:45Z

            }
+
+            // Log evaluation events for errors
+            trackErrorResolution(resolution)


Log evaluation errors in resolve() path

trackErrorResolution is only invoked from resolveTracked, but callers using resolve(flagKey, defaultValue) take a different error branch that returns createErrorResolution without emitting an evaluation event. As a result, missing/type-mismatched flags are excluded from evaluation telemetry whenever apps use the detailed resolve() API, which underreports error rates and skews aggregated evaluation metrics.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-02-18T11:38:45Z

+            ddContext = DDContext(
+                service = service,
+                rumApplicationId = context[RUM_APPLICATION_ID] as? String,
+                rumViewName = context[RUM_VIEW_NAME] as? String


Read RUM view URL instead of view name

This code captures view_name from RUM context and later serializes it into context.dd.rum.view.url, so evaluation events send the human-readable view name in the URL field. In apps where view names differ from URLs (templated routes, custom names), backend grouping/filtering by page URL will be inaccurate; this should use the RUM view_url context key.

Useful? React with 👍 / 👎.

typotter and others added 30 commits January 26, 2026 23:31

evaluations sub feature

da64b8f

ordered FlagsConfig with flush interval clamping

067cfc8

tidy

7b4e7ad

refine docs

a82d92f

api

5dcc207

link generate API surface in flags module

4fe9ef8

api artifact

43480db

set feature name in Feature.kt

2dc7cfb

lint

a9d3f98

revert build change

bfce3c1

move EvalFeature instantiation to Flags.enable

67c8e19

internal companion

de1c6f4

fix API artifact

f0b3be8

Fix artifact

9f388ac

tidy FlagsFeature changes

6f6ada0

regions

2f91cc3

feat: Batched Flag Evaluation schema json file

e9b6f42

fix api spec

7dba2fe

api artifact

b85617d

Merge pull request #3159 from DataDog/typo/flags-evaluations-subfeature

94e4249

[Flags] Evaluations subfeature Co-authored-by: typotter <tyler.potter@datadoghq.com>

Merge pull request #3144 from DataDog/typo/FFL-1720-pr1-schema-models

4618637

[FFL-1720] Evaluation Logging: Event Schema & Data Models Co-authored-by: typotter <tyler.potter@datadoghq.com>

feat: ndjson schema for uploading flag evaluations

f8abf3f

update new schema

deec3a8

extract Identifier in flag evaluation schema

01653b1

api surface

22043fa

Merge pull request #3166 from DataDog/typo/flag-eval-schema

986afe7

[Flags] FlagEvaluation schema Co-authored-by: typotter <tyler.potter@datadoghq.com>

FFL-1720: Fix detekt issues in aggregation engine

1fe7789

- Replace destructuring with 4 entries to individual assignments - Use safe cast for nullable error message

safe calls

c1f825b

typotter and others added 10 commits February 13, 2026 08:10

tidy error tracking

3734ce1

move eval feature into flag client builder

5caa00c

revert name change

102267f

Merge pull request #3145 from DataDog/typo/FFL-1720-pr2-aggregation

adb81d5

[FFL-1720] Evaluation Logging: Aggregation Engine & Test Utilities Co-authored-by: typotter <tyler.potter@datadoghq.com> Co-authored-by: 0xnm <nikita.ogorodnikov@datadoghq.com>

Merge branch 'feature/flags-evaluations-logging' into typo/FFL-1720-p…

d836bce

…r3-storage-network

Merge branch 'typo/FFL-1720-pr3-storage-network' into typo/FFL-1720-p…

bcf08f9

…r4-integration

Merge pull request #3146 from DataDog/typo/FFL-1720-pr3-storage-network

d337d90

[FFL-1720] Evaluation Logging: Storage & Network Infrastructure

Merge branch 'develop' into typo/merge-evaluations-into-develop

97788fa

remove unused schema def

f826bee

typotter requested a review from a team as a code owner February 17, 2026 13:12

typotter mentioned this pull request Feb 17, 2026

Feature/flags evaluations logging #3182

Closed

3 tasks

Merge branch 'develop' into typo/merge-evaluations-into-develop

68b597a

typotter requested a review from 0xnm February 17, 2026 13:14

0xnm approved these changes Feb 17, 2026

View reviewed changes

gh-worker-dd-devflow-36fce6 Bot added the mergequeue-status: waiting label Feb 17, 2026

This comment has been minimized.

Sign in to view

gh-worker-dd-devflow-36fce6 Bot added mergequeue-status: queued mergequeue-status: in_progress and removed mergequeue-status: waiting mergequeue-status: queued labels Feb 17, 2026

gh-worker-dd-mergequeue-cf854d Bot merged commit a90ef93 into develop Feb 17, 2026
27 checks passed

gh-worker-dd-mergequeue-cf854d Bot deleted the typo/merge-evaluations-into-develop branch February 17, 2026 15:29

gh-worker-dd-devflow-36fce6 Bot added mergequeue-status: done and removed mergequeue-status: in_progress labels Feb 17, 2026

chatgpt-codex-connector Bot reviewed Feb 18, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Flagging] Merge evaluations into develop#3183

[Flagging] Merge evaluations into develop#3183
gh-worker-dd-mergequeue-cf854d[bot] merged 98 commits intodevelopfrom
typo/merge-evaluations-into-develop

typotter commented Feb 17, 2026 •

edited

Loading

Uh oh!

typotter commented Feb 17, 2026

Uh oh!

gh-worker-devflow-routing-ef8351 Bot commented Feb 17, 2026 •

edited

Loading

Uh oh!

This comment has been minimized.

codecov-commenter commented Feb 17, 2026 •

edited

Loading

Uh oh!

Uh oh!

plousada commented Feb 18, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Feb 18, 2026

Uh oh!

chatgpt-codex-connector Bot Feb 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

typotter commented Feb 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Motivation

Additional Notes

🥞 Evaluation Logging Stacked Pull Requests 🥞

Review checklist (to be filled by reviewers)

Uh oh!

typotter commented Feb 17, 2026

Uh oh!

gh-worker-devflow-routing-ef8351 Bot commented Feb 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment has been minimized.

codecov-commenter commented Feb 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

plousada commented Feb 18, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

typotter commented Feb 17, 2026 •

edited

Loading

gh-worker-devflow-routing-ef8351 Bot commented Feb 17, 2026 •

edited

Loading

codecov-commenter commented Feb 17, 2026 •

edited

Loading