
Fix Spark day time dimension truncation #141

Merged
nicosuave merged 4 commits into main from spark-date-trunc-day on May 13, 2026
Conversation

@nicosuave (Member) commented May 13, 2026

Fixes #140.

Spark SQL / HiveServer2 now uses DATE_TRUNC('DAY', column) for day-grain time dimensions instead of TRUNC(column, 'DAY'), which returns NULL in Spark. Databricks uses the same Spark-family path.
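As a minimal sketch of the dialect branching described above (the helper name and the non-Spark fallback form are assumptions, not the project's actual API):

```python
# Minimal sketch of the dialect-aware day truncation described above.
# The helper name and the non-Spark fallback form are assumptions,
# not the project's actual generator API.
def day_trunc_expr(dialect: str, column_expr: str) -> str:
    """Return a SQL expression truncating a timestamp column to day grain."""
    if dialect in {"spark", "databricks"}:
        # Spark-family: TRUNC(col, 'DAY') returns NULL, so emit DATE_TRUNC.
        return f"DATE_TRUNC('DAY', {column_expr})"
    # Fallback mirroring the grain-second argument order used elsewhere.
    return f"DATE_TRUNC({column_expr}, DAY)"

print(day_trunc_expr("spark", "event_ts"))       # DATE_TRUNC('DAY', event_ts)
print(day_trunc_expr("databricks", "event_ts"))  # DATE_TRUNC('DAY', event_ts)
```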

Added regression coverage for Spark-family SQL generation plus a gated Spark integration test that executes a day-grain timestamp dimension against the Docker Spark Thrift server. Also fixed the Spark integration test harness to omit a password by default for the no-auth local Spark container.
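A regression test along the lines described might look like this sketch (the stand-in generator function and test name are hypothetical; the real assertions run against the project's SQL generator in sidemantic/sql/generator.py):

```python
# Hedged sketch of the Spark-family regression coverage described above.
# day_trunc_sql is a stand-in mirroring the fixed branch, not the real API.
def day_trunc_sql(dialect: str, column: str) -> str:
    if dialect in {"spark", "databricks"}:
        return f"DATE_TRUNC('DAY', {column})"
    return f"DATE_TRUNC({column}, DAY)"

def test_spark_family_day_grain_uses_date_trunc():
    for dialect in ("spark", "databricks"):
        sql = day_trunc_sql(dialect, "created_at")
        # Spark's TRUNC(created_at, 'DAY') returns NULL, so it must not appear.
        assert sql == "DATE_TRUNC('DAY', created_at)"

test_spark_family_day_grain_uses_date_trunc()
```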

Local lint, format, full pytest, and the gated Spark integration/CLI e2e suites passed.

@nicosuave nicosuave marked this pull request as ready for review May 13, 2026 02:02

@chatgpt-codex-connector (Bot) left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 0a3fec2c93

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment thread: sidemantic/sql/generator.py (Outdated)

    return f"DATE_TRUNC({column_expr}, {granularity.upper()})"

    if self.dialect in {"spark", "databricks"}:
        return f"DATE_TRUNC('{granularity.upper()}', {column_expr})"

P2: Preserve Spark date results for date-level grains

For Spark/Databricks this now emits DATE_TRUNC for every granularity, so existing month/year time dimensions that previously used Spark's TRUNC(col, 'MONTH') path will start returning timestamps instead of dates: Spark documents date_trunc as returning a timestamp, while trunc returns a date. In Spark CLI/query results this changes previously unchanged date-level dimension values like 2026-05-01 to 2026-05-01 00:00:00, just to fix the unsupported day case. Consider keeping the old TRUNC path for the granularities it supports, or casting the new expression back to DATE where date-level output is expected.
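The cast-back suggestion could be sketched like this (a hypothetical helper, not the project's code; the set of date-level grains is an assumption):

```python
# Hypothetical sketch of the reviewer's suggestion: emit DATE_TRUNC for
# Spark-family dialects, but cast date-level grains back to DATE so results
# stay date-typed (2026-05-01 rather than 2026-05-01 00:00:00).
DATE_LEVEL_GRAINS = {"DAY", "WEEK", "MONTH", "QUARTER", "YEAR"}  # assumed set

def spark_trunc_expr(column_expr: str, granularity: str) -> str:
    grain = granularity.upper()
    expr = f"DATE_TRUNC('{grain}', {column_expr})"
    if grain in DATE_LEVEL_GRAINS:
        # Spark's date_trunc returns a timestamp; restore the DATE type
        # that the old TRUNC path produced for these grains.
        expr = f"CAST({expr} AS DATE)"
    return expr

print(spark_trunc_expr("order_ts", "day"))   # CAST(DATE_TRUNC('DAY', order_ts) AS DATE)
print(spark_trunc_expr("order_ts", "hour"))  # DATE_TRUNC('HOUR', order_ts)
```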

Useful? React with 👍 / 👎.

@nicosuave nicosuave merged commit 4c2e808 into main May 13, 2026
15 checks passed
@nicosuave nicosuave deleted the spark-date-trunc-day branch May 13, 2026 02:46


Development

Successfully merging this pull request may close these issues.

Spark SQL time dimensions use TRUNC(..., 'DAY'), which returns NULL
