feat(csharp/src/Drivers/Databricks): Add Activity-based logging to DatabricksStatement by msrathore-db · Pull Request #3617 · apache/arrow-adbc

msrathore-db · 2025-10-25T13:34:01Z

Summary

Adds comprehensive Activity-based logging to DatabricksStatement.cs for improved observability and debugging of Databricks ADBC operations.

Changes

Methods with Logging Added

SetStatementProperties: Logs configuration (Arrow native types, CloudFetch settings, async mode)
GetSchemaFromMetadata: Logs schema parsing decisions (Arrow vs Thrift, column count)
GetCatalogsAsync: Logs catalog queries with feature flags and row counts
GetSchemasAsync: Logs schema queries with catalog filtering
GetTablesAsync: Logs table queries with SPARK catalog handling
GetColumnsAsync: Logs column queries with search criteria
GetPrimaryKeysAsync: Logs primary key queries
GetCrossReferenceAsync: Logs foreign key queries
GetColumnsExtendedAsync: Logs DESC TABLE EXTENDED queries
SetOption: Logs configuration changes

What Gets Logged

Tags:

Statement configuration (CloudFetch settings, Arrow native types, batch size)
Feature flags (enable_multiple_catalog_support, pk_fk_enabled)
Query parameters (catalog, schema, table, column names)
Results (db.response.returned_rows)

Events:

statement.<operation>.start / statement.<operation>.complete
Decision points (calling_base_implementation, returning_empty_result, fallback_to_base)
Schema handling (using_arrow_schema, fallback_to_thrift)

Example Log Output

{
  "OperationName": "GetCatalogs",
  "Duration": "00:00:00.582",
  "TagObjects": {
    "statement.feature.enable_multiple_catalog_support": true,
    "db.response.returned_rows": 28
  },
  "Events": [
    { "Name": "statement.get_catalogs.start" },
    { "Name": "statement.get_catalogs.calling_base_implementation" },
    { "Name": "statement.get_catalogs.complete" }
  ]
}

Testing

Tested locally by enabling logging with:

properties["adbc.traces.exporter"] = "adbcfile";

Verified that all tags, events, and distributed tracing context are captured correctly in trace files for all implemented methods (GetCatalogs, GetSchemas, GetTables, GetColumns, and statement configuration).

PR generated by Cursor.

eric-wang-1990 · 2025-10-27T16:12:50Z

csharp/src/Drivers/Databricks/DatabricksStatement.cs

        protected override Schema GetSchemaFromMetadata(TGetResultSetMetadataResp metadata)
        {
+            // Log schema parsing decision
+            Activity.Current?.SetTag("statement.schema.has_arrow_schema", metadata.__isset.arrowSchema);


Does this output logs correctly? Why not using this.TraceActivity?
@birschick-bq What should be the recommended way for trace?

I was concerned that Activity.Current might not be set to the activity in your current block - i.e., it might have been set for a child call. But in thinking about it, I don't think that would be the case, as long as the child call disposed the Activity in their scope.

However, I think the best usage pattern for Activity.Current is where you don't start a new activity and you don't pass an existing activity as a parameter. That is, a situation in which you might be called from a method that may or may not have a current activity started.

So I think Activity.Current is used appropriately in the case shown in line 93, above.

Use this.TraceActivity when you want a new activity (line) with its own events and tags. Typically something "major" or structural in your code. If the new activity (this.TraceActivity) is a child call, the ParentId will be set to indicate it is a child activity of the parent activity.

csharp/src/Drivers/Databricks/DatabricksStatement.cs

Added logging to databricksStatement.cs

8d13ad0

msrathore-db requested a review from CurtHagenlocher as a code owner October 25, 2025 13:34

github-actions bot added this to the ADBC Libraries 21 milestone Oct 25, 2025

msrathore-db marked this pull request as draft October 25, 2025 13:41

Fix nullable reference warning in GetColumnsExtendedAsync

f3e967f

msrathore-db marked this pull request as ready for review October 25, 2025 13:47

eric-wang-1990 reviewed Oct 27, 2025

View reviewed changes

eric-wang-1990 reviewed Oct 31, 2025

View reviewed changes

csharp/src/Drivers/Databricks/DatabricksStatement.cs Outdated Show resolved Hide resolved

csharp/src/Drivers/Databricks/DatabricksStatement.cs Outdated Show resolved Hide resolved

csharp/src/Drivers/Databricks/DatabricksStatement.cs Outdated Show resolved Hide resolved

lidavidm modified the milestones: ADBC Libraries 21, ADBC Libraries 22 Nov 3, 2025

msrathore-db added 2 commits November 5, 2025 05:33

Merge branch 'main' into logging

b56102f

Removed duplicate logging in databricksStatement.cs

6c9c826

eric-wang-1990 approved these changes Nov 5, 2025

View reviewed changes

lidavidm modified the milestones: ADBC Libraries 22, ADBC Libraries 23 Jan 6, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(csharp/src/Drivers/Databricks): Add Activity-based logging to DatabricksStatement#3617

feat(csharp/src/Drivers/Databricks): Add Activity-based logging to DatabricksStatement#3617
msrathore-db wants to merge 4 commits intoapache:mainfrom
msrathore-db:logging

msrathore-db commented Oct 25, 2025 •

edited

Loading

Uh oh!

eric-wang-1990 Oct 27, 2025

Uh oh!

birschick-bq Oct 27, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

msrathore-db commented Oct 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Methods with Logging Added

What Gets Logged

Example Log Output

Testing

Uh oh!

eric-wang-1990 Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

birschick-bq Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

msrathore-db commented Oct 25, 2025 •

edited

Loading