feat: add MiniMax as first-class LLM provider (M3 default, chat + embedding) by octo-patch · Pull Request #356 · volcengine/MineContext

octo-patch · 2026-03-21T09:28:24Z

Summary

Add MiniMax as a fully supported LLM provider alongside OpenAI and Doubao, covering both chat completion (via OpenAI-compatible API) and embedding generation (via MiniMax native API). The default chat model is MiniMax-M3 (512K context window, 128K max output, image input).

Changes

Backend (opencontext/llm/llm_client.py):

Add MINIMAX enum to LLMProvider
Implement _minimax_embedding() / _minimax_embedding_async() for MiniMax native embedding API (embo-01 model, non-OpenAI-compatible format using texts/type/vectors)
Fix _request_embedding() to avoid NameError on undefined response variable in the MiniMax code path
Fix _request_embedding_async() to route MiniMax to native API instead of falling through to incompatible OpenAI SDK
Add MiniMax embedding validation in validate()
Add MiniMax error codes (invalid_api_key, insufficient_balance) to _extract_error_summary()

Frontend (settings/constants.tsx, settings.tsx):

Add MiniMax to ModelTypeList with M3 (default), M2.7 and M2.7-highspeed chat models
Add embo-01 embedding model and api.minimax.io base URL presets
Add MiniMax provider icon (minimax.svg)
Wire up API key link, base URL routing, and form rendering

Documentation (README.md, README_zh.md):

List MiniMax as supported provider in Quick Start and Backend Architecture sections
Add minimax option in config.yaml examples

Tests (22 unit + 3 integration):

Provider enum, client init, chat completion, streaming, thinking mode
Default model is now MiniMax-M3; M2.7-highspeed continues to be covered
Embedding: native API format, auth headers, error handling, dimension truncation
Validation: chat and embedding success/failure paths
Error handling: MiniMax-specific error code extraction
Integration tests with real MiniMax API (requires MINIMAX_API_KEY)

Test Plan

All 22 unit tests pass with M3 as default
2/3 integration tests pass (embedding test may hit MiniMax rate limits)
Frontend builds successfully with new provider
Manual verification with MiniMax API key in the Settings UI

MiniMax Models

Model	Context	Max Output	Use Case
MiniMax-M3 (default)	512K tokens	128K tokens	Chat / VLM (image input)
MiniMax-M2.7	192K tokens	—	Previous-gen chat
MiniMax-M2.7-highspeed	192K tokens	—	Previous-gen low-latency chat
embo-01	—	—	Embedding (1536 dims)

Add MiniMax (https://www.minimaxi.com/) as a fully supported LLM provider alongside OpenAI and Doubao, covering both chat completion and embedding generation. Backend changes (opencontext/llm/llm_client.py): - Add MINIMAX enum to LLMProvider - Implement _minimax_embedding() for native MiniMax embedding API (non-OpenAI-compatible: uses texts/type/vectors format) - Implement _minimax_embedding_async() for async embedding support - Fix _request_embedding() to properly handle MiniMax path (avoid NameError on undefined response variable) - Fix _request_embedding_async() to route MiniMax to native API instead of falling through to incompatible OpenAI SDK - Add MiniMax embedding validation in validate() - Add MiniMax error codes (invalid_api_key, insufficient_balance) to _extract_error_summary() Frontend changes: - Add MiniMax to ModelTypeList enum with M2.7/M2.7-highspeed models - Add embo-01 embedding model and api.minimax.io base URL presets - Add MiniMax provider icon (minimax.svg) - Wire up API key link, base URL routing, and form rendering - Add MiniMax default model in form initialValues Documentation: - Update README.md and README_zh.md to list MiniMax as supported - Add minimax option in config.yaml examples Tests (22 unit + 3 integration): - TestLLMProviderEnum: enum value validation - TestMiniMaxChatClient: init, completion, streaming, thinking - TestMiniMaxEmbeddingClient: native API format, auth, errors, truncation - TestMiniMaxValidation: chat and embedding validation paths - TestMiniMaxErrorHandling: MiniMax-specific error extraction - TestMiniMaxIntegration: real API tests (requires MINIMAX_API_KEY)

- Add MiniMax-M3 to ModelInfoList (placed first as default) - Keep MiniMax-M2.7 and MiniMax-M2.7-highspeed as alternatives - Update unit and integration tests to use M3 as the default MiniMax-M3 is the latest MiniMax model: 512K context window, 128K max output, and supports image input via OpenAI-compatible API.

PR Bot and others added 2 commits March 21, 2026 17:27

octo-patch changed the title ~~feat: add MiniMax as first-class LLM provider (chat + embedding)~~ feat: add MiniMax as first-class LLM provider (M3 default, chat + embedding) Jun 5, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add MiniMax as first-class LLM provider (M3 default, chat + embedding)#356

feat: add MiniMax as first-class LLM provider (M3 default, chat + embedding)#356
octo-patch wants to merge 2 commits into
volcengine:mainfrom
octo-patch:feature/add-minimax-provider

octo-patch commented Mar 21, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

octo-patch commented Mar 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Test Plan

MiniMax Models

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

octo-patch commented Mar 21, 2026 •

edited

Loading