ticdc: add scheduler configuration recommendations for table split mode by hongyunyan · Pull Request #22483 · pingcap/docs

hongyunyan · 2026-02-25T05:21:14Z

Update TOC and the ticdc integration guide to include ksqlDB and SQL Server. The TOC entry for the Confluent/Snowflake integration is expanded with nested links for Confluent Cloud, Snowflake, ksqlDB, and SQL Server, and the document title, H1, and summary in ticdc/integrate-confluent-using-ticdc.md are updated to reflect the broader integration coverage.

First-time contributors' checklist

I've signed the Contributor License Agreement, which is required for the repository owners to accept my contribution.

What is changed, added or deleted? (Required)

Which TiDB version(s) do your changes apply to? (Required)

What is the related PR or file link(s)?

This PR is translated from: ticdc: add the description for scheduler config setting docs-cn#21381
Other reference link(s):

Do your changes match any of the following descriptions?

Delete files
Change aliases
Need modification after applied to another branch
Might cause conflicts after applied to another branch

gemini-code-assist · 2026-02-25T05:21:34Z

Summary of Changes

Hello @hongyunyan, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request synchronizes the English documentation with recent updates made in the Chinese documentation, specifically by adding detailed configuration recommendations for TiCDC's table split mode within the new architecture. The changes aim to provide users with consistent and up-to-date guidance on optimizing TiCDC settings for table splitting, thereby improving replication efficiency and stability. Additionally, a new document has been added to comprehensively describe TiCDC's data synchronization capabilities.

Highlights

New Configuration Recommendations Section: Introduced a new section in the ticdc-architecture.md document titled 'Configuration recommendations for table split mode'.
Scheduler Parameter Documentation: Documented recommended usage and default values for scheduler.region-threshold, scheduler.region-count-per-span, and scheduler.write-key-threshold.
Critical Configuration Caution: Added a crucial caution regarding the relationship between scheduler.region-threshold and scheduler.region-count-per-span to prevent repeated scheduling and increased replication latency.
New Data Synchronization Capabilities Document: Added a new comprehensive document, ticdc-data-sync-capabilities.md, detailing TiCDC's data synchronization principles, consistency guarantees, and DDL/DML handling.
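
The caution about `scheduler.region-threshold` and `scheduler.region-count-per-span` can be sketched as a small pre-flight check. This is only an illustration, not part of TiCDC or of this PR: the helper name `check_scheduler_config` is hypothetical, the numeric values are placeholders rather than recommended or default settings, and the rule is taken from the changelog wording that `scheduler.region-threshold` should not be less than `scheduler.region-count-per-span`.

```python
def check_scheduler_config(scheduler: dict) -> list[str]:
    """Return warnings for a parsed [scheduler] section (hypothetical helper).

    Only checks the relationship called out in the PR summary:
    region-threshold should not be less than region-count-per-span.
    """
    warnings = []
    region_threshold = scheduler.get("region-threshold")
    region_count_per_span = scheduler.get("region-count-per-span")

    # Skip the check when either key is absent, because this sketch
    # does not assume any default values.
    if region_threshold is None or region_count_per_span is None:
        return warnings

    if region_threshold < region_count_per_span:
        warnings.append(
            "scheduler.region-threshold ({}) is less than "
            "scheduler.region-count-per-span ({}); per the PR summary this "
            "may cause repeated scheduling and higher replication "
            "latency".format(region_threshold, region_count_per_span)
        )
    return warnings


# Placeholder values for illustration only, not recommended settings.
risky = {"region-threshold": 100, "region-count-per-span": 1000}
safe = {"region-threshold": 10000, "region-count-per-span": 100}

for message in check_scheduler_config(risky):
    print(message)
```

Running such a check against a changefeed configuration before applying it would surface the misconfiguration early; the authoritative recommended values are in the new section of ticdc-architecture.md.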

Changelog

ticdc/ticdc-architecture.md
- Added a new section 'Configuration recommendations for table split mode'.
- Detailed recommended values and usage for scheduler.region-threshold, scheduler.region-count-per-span, and scheduler.write-key-threshold.
- Included a warning about scheduler.region-threshold being less than scheduler.region-count-per-span to avoid issues.
ticdc/ticdc-data-sync-capabilities.md
- New file added: 'TiCDC's Data Synchronization Capability'.
- Introduced background on TiCDC's role in real-time data synchronization.
- Described how TiCDC monitors Raft Log, generates logical operations, and provides consistency guarantees.
- Listed supported downstreams for data synchronization.
- Clarified synchronization support for DDL/DML, DQL/DCL, system tables, and temporary tables.
- Explained handling of index and foreign key constraints, including asynchronous DDL execution for indexes.
- Stated that TiCDC checks internal integrity but not downstream constraint compliance.

¹ Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

gemini-code-assist

Code Review

This pull request adds configuration recommendations for the table split mode and introduces a new document about TiCDC's data synchronization capabilities. The changes are valuable for keeping the English documentation aligned with the Chinese version. I have provided several suggestions, primarily for the new document, to enhance clarity, correct minor grammatical issues, and ensure adherence to the repository's documentation style guide. Key suggestions include fixing a typo, correcting a broken link, and rephrasing sentences for better readability and consistency.

gemini-code-assist · 2026-02-25T05:23:48Z