feat: Use {{field_name}} placeholders and move criteria to user promp by daniel5u · Pull Request #400 · MigoXLab/dingo

daniel5u · 2026-05-12T08:36:59Z

Replace format_map() with regex-based _replace_placeholders() to avoid ValueError when criteria contain JSON braces. Move rule-specific content from system prompt to user prompt for cleaner LLM judge instructions.
Make description optional in CustomLLMRuleArgs.

feat: v2.2.2

…lated runtime config

…script

Replace format_map() with regex-based _replace_placeholders() to avoid ValueError when criteria contain JSON braces. Move rule-specific content from system prompt to user prompt for cleaner LLM judge instructions. Make description optional in CustomLLMRuleArgs.

Resolve conflicts by keeping main's version of LLMCustomRule files which use {{field_name}} placeholder syntax and criteria in user prompt.

gemini-code-assist

Code Review

This pull request introduces a template-based approach for custom LLM rules, allowing dynamic placeholder substitution within criteria and moving rule-specific context to the user prompt. Key feedback includes a critical mismatch between the regex implementation (expecting double braces {{...}}) and the provided examples/tests (using single braces {...}). Other recommendations involve restoring the metric name in default labels for better tracking, ensuring the LLM receives the metric description for context, and moving the re import to the top of the file to follow standard Python conventions.

gemini-code-assist · 2026-05-12T08:38:54Z

+    @staticmethod
+    def _replace_placeholders(text: str, inputs: dict) -> str:
+        """Replace {{field_name}} placeholders, leaving other braces intact."""
+        import re


The import re statement should be moved to the top of the file to adhere to standard Python coding conventions and avoid repeated import overhead.

gemini-code-assist · 2026-05-12T08:38:54Z

-            f'- If the criteria do not specify labels, use "label": ["QUALITY_BAD.{custom_rule.metric}"] when status is true.\n'
-            "- If the criteria do not specify score semantics, use score 1 for pass/good and score 0 for fail/bad.\n"
-            "- If the criteria do not specify pass/good or fail/bad standard, return 1 for all inputs."
+            '- If no labels are specified, use "label": ["QUALITY_GOOD"] when status is false and "label": ["QUALITY_BAD"] when status is true.\n'


The default failure label has been changed from QUALITY_BAD.{metric} to QUALITY_BAD. This loses the specific metric context in the output. Additionally, this is inconsistent with other error labels in this class (e.g., lines 172 and 206) which still include the metric or exception name. It is recommended to maintain the metric name in the label for consistency and better downstream analysis.

Suggested change

'- If no labels are specified, use "label": ["QUALITY_GOOD"] when status is false and "label": ["QUALITY_BAD"] when status is true.\n'

f'- If no labels are specified, use "label": ["QUALITY_GOOD"] when status is false and "label": ["QUALITY_BAD.{custom_rule.metric}"] when status is true.\n'

gemini-code-assist · 2026-05-12T08:38:54Z

+        user_content = "\n".join(
+            self._replace_placeholders(criterion, inputs) for criterion in custom_rule.criteria
        )


The metric name and description are no longer passed to the LLM in the prompt. While the PR description mentions moving rule-specific content to the user prompt, these fields were removed entirely. Consider including them at the beginning of the user_content to provide the LLM with the necessary context for evaluation.

shijinpjlab and others added 15 commits April 2, 2026 16:08

Merge pull request MigoXLab#383 from MigoXLab/dev

a9629b1

feat: v2.2.2

docs: update wechat (MigoXLab#390)

1e33a80

docs: update wechat (MigoXLab#392)

c59198e

docs: update wechat (MigoXLab#395)

0c780ff

Isolate evaluator dynamic configs

2065d8d

feat(llm): add LLMCustomRule evaluator with structured config and iso…

8d58912

…lated runtime config

docs(examples): add LLMCustomRule metric docs and runnable .env demo …

40ae7e8

…script

review: modify according to AI review

1a3e537

modify: modify the PROMPT for llm_custom_rule

09a0ea5

Improve custom LLM rule response handling

80f8446

Merge branch 'dev' into main

644a743

fix: CI failure

6894694

Merge remote-tracking branch 'fork/main'

bd31395

Merge origin/dev, keeping improved LLMCustomRule from main

85c843e

Resolve conflicts by keeping main's version of LLMCustomRule files which use {{field_name}} placeholder syntax and criteria in user prompt.

gemini-code-assist Bot reviewed May 12, 2026

View reviewed changes

fix

d11d7a3

daniel5u closed this May 14, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Use {{field_name}} placeholders and move criteria to user promp#400

feat: Use {{field_name}} placeholders and move criteria to user promp#400
daniel5u wants to merge 16 commits into
MigoXLab:devfrom
daniel5u:main

daniel5u commented May 12, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist Bot May 12, 2026

Uh oh!

gemini-code-assist Bot May 12, 2026

Uh oh!

gemini-code-assist Bot May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	'- If no labels are specified, use "label": ["QUALITY_GOOD"] when status is false and "label": ["QUALITY_BAD"] when status is true.\n'
	f'- If no labels are specified, use "label": ["QUALITY_GOOD"] when status is false and "label": ["QUALITY_BAD.{custom_rule.metric}"] when status is true.\n'

Conversation

daniel5u commented May 12, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist Bot May 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 12, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants