feat: add MiniMax as first-class LLM provider (default: M2.7) #211
octo-patch wants to merge 4 commits into xming521:master from
Conversation
- Add LLMProvider enum and preset system (OpenAI, DeepSeek, MiniMax, Custom) that auto-fills base_url and model_name in MakeDatasetArgs
- Auto-detect MiniMax endpoints and disable response_format (unsupported)
- Clamp temperature to 0.01 for MiniMax (rejects zero)
- Support both api.minimax.io and api.minimaxi.com (China) endpoints
- Update config templates with llm_provider documentation
- Add 25 unit tests + 4 integration tests

Co-Authored-By: Octopus <liyuan851277048@icloud.com>
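The preset mechanism described in this commit can be sketched roughly as follows. This is a minimal sketch only: the enum values, the preset table name, and the MiniMax default base URL/model are taken from the review diagrams in this PR and may not match the merged code exactly.

```python
from enum import Enum


class LLMProvider(str, Enum):
    OPENAI = "openai"
    DEEPSEEK = "deepseek"
    MINIMAX = "minimax"
    CUSTOM = "custom"


# Illustrative preset table; values mirror the review diagrams, not the merged code.
LLM_PROVIDER_PRESETS = {
    LLMProvider.MINIMAX: {
        "base_url": "https://api.minimax.io/v1",
        "model_name": "MiniMax-M2.5",
    },
}


def apply_provider_presets(llm_provider, base_url, model_name):
    """Fill base_url/model_name from the provider preset when they are unset."""
    preset = LLM_PROVIDER_PRESETS.get(llm_provider)
    if preset is not None:
        if not base_url:
            base_url = preset["base_url"]
        if not model_name:
            model_name = preset["model_name"]
    return base_url, model_name
```

An explicitly set value wins over the preset; only empty fields are auto-filled.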
Reviewer's Guide
Adds MiniMax as a first-class online LLM provider for the data cleaning pipeline via provider presets in configuration, MiniMax-specific behavior in the online inference client, updated docs/templates, and a dedicated MiniMax test suite.
Sequence diagram for MiniMax-based online data cleaning call
sequenceDiagram
actor User
participant WeCloneCLI
participant MakeDatasetArgs
participant OnlineLLM
participant MiniMaxAPI
User->>WeCloneCLI: Run make-dataset
WeCloneCLI->>MakeDatasetArgs: Load settings.jsonc
MakeDatasetArgs->>MakeDatasetArgs: apply_provider_presets()
Note over MakeDatasetArgs: llm_provider = minimax
MakeDatasetArgs-->MakeDatasetArgs: base_url = https://api.minimax.io/v1
MakeDatasetArgs-->MakeDatasetArgs: model_name = MiniMax-M2.5
WeCloneCLI->>OnlineLLM: __init__(api_key, base_url, model_name, response_format)
OnlineLLM->>OnlineLLM: _is_no_response_format_provider(base_url)
OnlineLLM-->OnlineLLM: _supports_response_format = False
OnlineLLM-->OnlineLLM: response_format = ""
WeCloneCLI->>OnlineLLM: chat(prompt_text, system_prompt, temperature=0)
OnlineLLM->>OnlineLLM: clamp_temperature(temperature, base_url)
OnlineLLM-->OnlineLLM: temperature = 0.01
OnlineLLM->>MiniMaxAPI: POST /chat with model, messages, temperature=0.01
MiniMaxAPI-->>OnlineLLM: JSON response
OnlineLLM-->>WeCloneCLI: cleaned text
WeCloneCLI-->>User: Dataset with cleaned data
Class diagram for LLM provider presets and OnlineLLM behavior
classDiagram
class LLMProvider {
<<enumeration>>
OPENAI
DEEPSEEK
MINIMAX
CUSTOM
}
class MakeDatasetArgs {
+llm_provider LLMProvider
+base_url str
+llm_api_key str
+model_name str
+apply_provider_presets() MakeDatasetArgs
}
class LLMProviderPresets {
+OPENAI_base_url str
+OPENAI_model_name str
+DEEPSEEK_base_url str
+DEEPSEEK_model_name str
+MINIMAX_base_url str
+MINIMAX_model_name str
}
class OnlineLLM {
-_NO_RESPONSE_FORMAT_PROVIDERS set
+api_key str
+base_url str
+model_name str
+response_format str
+prompt_with_system bool
+__init__(api_key str, base_url str, model_name str, max_workers int, prompt_with_system bool, response_format str)
+chat(prompt_text str, system_prompt str, temperature float, max_tokens int) dict
+_is_no_response_format_provider(base_url str) bool
+clamp_temperature(temperature float, base_url str) float
}
LLMProviderPresets ..> LLMProvider : keys
MakeDatasetArgs --> LLMProvider : uses
MakeDatasetArgs ..> LLMProviderPresets : reads
OnlineLLM ..> LLMProviderPresets : behavior aligned_with
File-Level Changes
Tips and commands
Interacting with Sourcery
Customizing Your Experience
Access your dashboard to:
Getting Help
Hey - I've found 5 issues, and left some high level feedback:
- The provider detection logic in OnlineLLM relies on substring matching against base_url for MiniMax-specific behavior; consider centralizing provider capabilities (e.g., via LLMProvider/presets or a small capability map) so response_format support and temperature clamping are driven by an explicit provider flag instead of URL parsing.
- The _NO_RESPONSE_FORMAT_PROVIDERS constant is now also used to control temperature clamping, which is broader than its name suggests; renaming it, or introducing a separate constant for temperature constraints, would make the intent of these checks clearer and avoid coupling unrelated behaviors.
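The capability-map refactor suggested in this feedback might look like the sketch below. All names here are hypothetical: `ProviderCapabilities`, `CAPABILITIES`, and the host strings are illustrative, not the actual constants in online_infer.py.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class ProviderCapabilities:
    supports_response_format: bool = True
    # (min, max) clamp range, or None for no clamping.
    temperature_range: Optional[Tuple[float, float]] = None


# Illustrative capability map keyed by provider host substrings.
CAPABILITIES = {
    "api.minimax.io": ProviderCapabilities(False, (0.01, 1.0)),
    "api.minimaxi.com": ProviderCapabilities(False, (0.01, 1.0)),
}
DEFAULT_CAPS = ProviderCapabilities()


def capabilities_for(base_url: Optional[str]) -> ProviderCapabilities:
    """Resolve capabilities once, instead of scattering URL checks."""
    url = (base_url or "").lower()
    for host, caps in CAPABILITIES.items():
        if host in url:
            return caps
    return DEFAULT_CAPS


def clamp_temperature(temperature: float, base_url: Optional[str]) -> float:
    """Clamp both ends of the range for providers that declare one."""
    rng = capabilities_for(base_url).temperature_range
    if rng is None:
        return temperature
    lo, hi = rng
    return min(max(temperature, lo), hi)
```

With this shape, both response_format gating and temperature clamping read from one explicit table, and adding a provider means adding one map entry rather than another URL check.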
Prompt for AI Agents
Please address the comments from this code review:
## Overall Comments
- The provider detection logic in `OnlineLLM` relies on substring matching against `base_url` for MiniMax-specific behavior; consider centralizing provider capabilities (e.g., via `LLMProvider`/presets or a small capability map) so response_format support and temperature clamping are driven by an explicit provider flag instead of URL parsing.
- The `_NO_RESPONSE_FORMAT_PROVIDERS` constant is now also used to control temperature clamping, which is broader than its name suggests; renaming it or introducing a separate constant for temperature constraints would make the intent of these checks clearer and avoid coupling unrelated behaviors.
## Individual Comments
### Comment 1
<location path="weclone/utils/config_models.py" line_range="198-207" />
<code_context>
clean_batch_size: int = Field(10, description="Batch size for data cleaning")
vision_api: VisionApiConfig = Field(VisionApiConfig())
+ @model_validator(mode="after")
+ def apply_provider_presets(self):
+ """Apply provider presets when llm_provider is set."""
+ if self.llm_provider and self.llm_provider in LLM_PROVIDER_PRESETS:
+ preset = LLM_PROVIDER_PRESETS[self.llm_provider]
+ if not self.base_url:
+ self.base_url = preset["base_url"]
+ if not self.model_name:
+ self.model_name = preset["model_name"]
+ return self
+
</code_context>
<issue_to_address>
**suggestion:** Defaulting based on truthiness may unintentionally override explicitly provided but falsy values.
Using `if not self.base_url` / `if not self.model_name` will treat empty strings as missing and replace them with preset values. If callers may intentionally pass an empty string (e.g. to disable a default), that intent is lost. To only apply presets when the fields are unset, check explicitly for `None` (e.g. `if self.base_url is None:`).
Suggested implementation:
```python
@model_validator(mode="after")
def apply_provider_presets(self):
"""Apply provider presets when llm_provider is set."""
if self.llm_provider and self.llm_provider in LLM_PROVIDER_PRESETS:
preset = LLM_PROVIDER_PRESETS[self.llm_provider]
# Only apply defaults when the fields are truly unset (None),
# so that falsy-but-intentional values like "" are respected.
if self.base_url is None:
self.base_url = preset["base_url"]
if self.model_name is None:
self.model_name = preset["model_name"]
return self
```
None required, assuming `model_validator` and `LLM_PROVIDER_PRESETS` are already correctly imported/defined elsewhere in this file.
</issue_to_address>
### Comment 2
<location path="weclone/core/inference/online_infer.py" line_range="50-58" />
<code_context>
+ return any(host in base_url_lower for host in OnlineLLM._NO_RESPONSE_FORMAT_PROVIDERS)
+
+ @staticmethod
+ def clamp_temperature(temperature: float, base_url: str) -> float:
+ """Clamp temperature for providers that require it to be in (0.0, 1.0].
+
+ MiniMax API rejects temperature=0; use a small positive value instead.
+ """
+ if OnlineLLM._is_no_response_format_provider(base_url) and temperature <= 0:
+ return 0.01
+ return temperature
@retry_openai_api(max_retries=200, base_delay=30.0, max_delay=180.0)
</code_context>
<issue_to_address>
**suggestion:** Clamp temperature on the upper bound as well to fully satisfy MiniMax range requirements.
The docstring says this helper enforces `(0.0, 1.0]` for MiniMax, but it only adjusts `temperature <= 0` and leaves values above `1.0` unchanged. If a caller passes `temperature=1.5`, MiniMax could still reject the request. To fully enforce the documented contract, clamp both ends for MiniMax-like providers, e.g. `return min(max(temperature, 0.01), 1.0)`, while leaving other providers unchanged.
```suggestion
@staticmethod
def clamp_temperature(temperature: float, base_url: str) -> float:
"""Clamp temperature for providers that require it to be in (0.0, 1.0].
For MiniMax-like providers:
- MiniMax API rejects temperature=0; use a small positive value instead.
- Values above 1.0 are also rejected; clamp to 1.0.
Other providers are left unchanged.
"""
if OnlineLLM._is_no_response_format_provider(base_url):
return min(max(temperature, 0.01), 1.0)
return temperature
```
</issue_to_address>
### Comment 3
<location path="tests/test_minimax_provider.py" line_range="145-146" />
<code_context>
+# Unit Tests – Temperature clamping
+# ---------------------------------------------------------------------------
+
+class TestTemperatureClamping:
+ """Test OnlineLLM.clamp_temperature for MiniMax constraints."""
+
</code_context>
<issue_to_address>
**suggestion (testing):** Add a temperature clamping test for `base_url=None`/empty to document passthrough behavior
Current clamping tests cover MiniMax, OpenAI, the China endpoint, and `_is_no_response_format_provider` for empty strings. Please also add a test asserting that `OnlineLLM.clamp_temperature(0.0, None)` (and/or `""`) returns `0.0`. This will document and lock in the intended passthrough behavior when the provider cannot be detected from the URL and protect against future changes to `_is_no_response_format_provider` altering this behavior inadvertently.
```suggestion
class TestTemperatureClamping:
"""Test OnlineLLM.clamp_temperature for MiniMax constraints."""
def test_clamp_temperature_passthrough_when_base_url_none_or_empty(self):
"""If provider cannot be detected from base_url, temperature is passed through."""
# base_url is None – should be a no-op passthrough
assert OnlineLLM.clamp_temperature(0.0, None) == 0.0
# base_url is empty string – should also be a no-op passthrough
assert OnlineLLM.clamp_temperature(0.0, "") == 0.0
```
</issue_to_address>
### Comment 4
<location path="tests/test_minimax_provider.py" line_range="180" />
<code_context>
+class TestResponseFormatHandling:
+ """Test that response_format is disabled for MiniMax."""
+
+ @patch("weclone.core.inference.online_infer.OpenAI")
+ def test_minimax_disables_response_format(self, mock_openai_cls):
+ llm = OnlineLLM(
</code_context>
<issue_to_address>
**suggestion (testing):** Add a test that MiniMax overrides an explicitly provided `response_format`
`TestResponseFormatHandling` only covers the default `response_format` case for MiniMax. Please also add a test for the case where a caller explicitly passes a non-empty `response_format` while targeting MiniMax. For example, instantiate `OnlineLLM(..., base_url="https://api.minimax.io/v1", response_format="json_object")` and assert that `llm._supports_response_format is False` and `llm.response_format == ""` to verify `OnlineLLM.__init__` correctly ignores the explicit value when response formats aren’t supported.
</issue_to_address>
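Since Comment 4 describes the missing test only in prose, here is a minimal sketch against a stand-in class. The real `OnlineLLM` constructs an OpenAI client in `__init__`, so `FakeOnlineLLM` below is hypothetical and reproduces only the response_format gating under test.

```python
class FakeOnlineLLM:
    """Stand-in reproducing only the response_format gating described in the review."""

    _NO_RESPONSE_FORMAT_PROVIDERS = {"api.minimax.io", "api.minimaxi.com"}

    def __init__(self, base_url: str, response_format: str = ""):
        url = (base_url or "").lower()
        self._supports_response_format = not any(
            host in url for host in self._NO_RESPONSE_FORMAT_PROVIDERS
        )
        # An explicitly provided response_format is dropped when unsupported.
        self.response_format = response_format if self._supports_response_format else ""


def test_minimax_overrides_explicit_response_format():
    llm = FakeOnlineLLM("https://api.minimax.io/v1", response_format="json_object")
    assert llm._supports_response_format is False
    assert llm.response_format == ""
```

The same test written against the real class would simply patch the OpenAI constructor, as the existing `TestResponseFormatHandling` tests already do.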
### Comment 5
<location path="tests/test_minimax_provider.py" line_range="223-227" />
<code_context>
+ )
+ llm.chat("Hello")
+
+ call_kwargs = mock_client.chat.completions.create.call_args
+ # response_format should NOT be in the call params
+ assert "response_format" not in call_kwargs.kwargs and (
</code_context>
<issue_to_address>
**suggestion (testing):** Simplify and harden the assertion that `response_format` is omitted for MiniMax
The current assertion inspects both `call_kwargs.kwargs` and the string representation of `call_kwargs`, which is brittle and may yield false positives if that representation changes. Since `call_args` is an `(args, kwargs)` tuple, you can simplify to:
```python
_, kwargs = mock_client.chat.completions.create.call_args
assert "response_format" not in kwargs
```
This keeps the test focused on the actual keyword arguments passed to the client and avoids reliance on `unittest.mock` internals.
```suggestion
_, kwargs = mock_client.chat.completions.create.call_args
# response_format should NOT be in the call params
assert "response_format" not in kwargs
```
</issue_to_address>
- Use is None checks instead of truthiness for preset fields
- Clamp MiniMax temperature upper bound to 1.0 (was only lower)
- Add tests for None/empty base_url passthrough behavior
- Add test for explicit response_format override on MiniMax
- Simplify response_format assertion using call_args unpacking
- Update default model from MiniMax-M2.5 to MiniMax-M2.7
- Update all config templates, README docs, and tests
- Keep all previous models as alternatives
JiwaniZakir
left a comment
The template files (settings.template.jsonc, mllm.template.jsonc, tg.template.jsonc) leave "base_url": "https://xxx/v1" as an active, non-commented field directly below the new llm_provider comment block. Since the docs say setting llm_provider auto-fills base_url, a user who enables "llm_provider": "minimax" but forgets to remove the explicit base_url placeholder will likely get a broken config — it's unclear from the template whether an explicit base_url takes precedence or whether the placeholder value causes an error. The placeholder should either be commented out or replaced with a note like // "base_url": "" // optional: overrides provider default.
Additionally, _extract_json_from_text in tests/test_minimax_provider.py is a local re-implementation rather than an import of the actual production function. This means the tests won't catch regressions if the real extract_json_from_text in offline_infer.py is changed — the unit tests will keep passing against stale logic. The mock setup already goes to lengths to isolate heavy dependencies; it would be cleaner to either import the real function directly (mocking only the transitive heavy imports that trigger on module load) or at minimum add a comment explaining why the divergence is intentional.
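One way to import the real `extract_json_from_text` while keeping the test lightweight is to stub the heavy modules in `sys.modules` before the import. This is a sketch: `vllm` and `torch` are assumptions about which transitive imports are heavy, not a confirmed list.

```python
import sys
from unittest.mock import MagicMock

# Assumed-heavy transitive imports of offline_infer.py; adjust to what actually
# fails to import in the test environment.
HEAVY_MODULES = ("vllm", "torch")


def stub_heavy_modules():
    """Register mock stand-ins so importing the production module stays cheap.

    setdefault keeps any module that is genuinely installed, so this is safe
    to run in environments that do have the real dependencies.
    """
    for name in HEAVY_MODULES:
        sys.modules.setdefault(name, MagicMock())


stub_heavy_modules()
# With the heavy imports stubbed, the real function can now be imported:
# from weclone.core.inference.offline_infer import extract_json_from_text
```

This keeps the tests exercising the production logic, so a change to the real function breaks the test suite instead of passing silently against a stale copy.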
Summary
Changes
Available Models
Testing