Model Configuration

WiseMindAI's AI features require initial model configuration. There are two ways to set this up:

Initial Setup: When you first launch WiseMindAI, it will guide you through installing a local model and completing basic workspace settings.
Third-party Keys: You can also connect your own existing third-party API keys, such as those for ChatGPT, DeepSeek, Gemini, etc.

Open the 「Settings」 menu in the bottom-left corner of WiseMindAI and navigate to «Model Settings» to configure different models based on usage scenarios.

All subsequent model settings can be modified here at any time.

Configure by Need

Local Models: Used for managing local services like Ollama or LM Studio. Ideal for users who prefer offline usage or prioritize data privacy.
General LLMs: Used for core features such as AI chat, document summarization, knowledge base summaries, knowledge cards, mind maps, and plugins. See: Apply Large Language Model Key.
Embedding Models: Converts documents, web pages, notes, etc., into searchable knowledge indexes; a critical prerequisite for knowledge base Q&A. See: Apply Embedding Model Key.
OCR Text Recognition: Extracts text from images, screenshots, and scanned materials to facilitate subsequent summarization and Q&A. See: Apply OCR Service Key.
Audio/Video Transcription: Converts audio or video content into text for summarization, outline generation, and knowledge card creation. See: Apply Audio/Video Transcribe Service Key.
Image Generation Models: Generates infographics, posters, and other images based on concepts or prompts. See: Apply Image Generation Model Key.
Live Transcription Models: Convert continuous microphone audio into real-time transcripts for meetings, classes, and interviews. See: Live Transcription.

Audio/video transcription and Live Transcription are separate categories. The former processes existing audio or video files, while the latter processes continuous microphone input. Some providers activate and bill these services separately, so make sure you configure the correct product.

Test online models directly

After configuring an online model, select Test on its model card. WiseMindAI sends a minimal request with a built-in sample to verify the key, model name, and service endpoint.

Direct testing is currently available for:

Model category	Test result
General LLM	Returns a short test response
Embedding Model	Displays the generated vector dimensions
Image Generation	Displays a generated test image
Image Text Recognition	Displays text recognized from a built-in image
Audio Model	Displays a transcription of built-in audio
Live Transcription	Displays recognized text, time to first segment, and total duration

You do not need to set a model as active before testing it. A successful test does not change the active model or create production data.

Cost notice

An online test makes a real provider request and may incur a small fee. Fees are charged by the provider. Local models do not display this online testing option.

Understand test errors

If a test fails, use the message on the page to check:

Incomplete configuration: One or more required fields are empty.
Authentication failed: The API Key, Secret Key, or another credential is incorrect.
Cannot connect: The network is unavailable or the endpoint is incorrect.
Model not found: The model name is incorrect or the account lacks permission.
Insufficient quota or too many requests: Add credit, wait for quota recovery, or retry later.
Request timed out: The provider is responding slowly; retry later.
No valid result: The service responded without usable text, an image, or a vector.

If the test succeeds but the feature still does not work, confirm that the feature uses the same model and that the current material meets its processing requirements.

Usage Recommendations

If you are new to WiseMindAI, start by keeping the default local model installed, then add a General LLM key as needed. This ensures core features like document summarization and knowledge base Q&A remain stable.

If you already have API keys from platforms such as OpenAI, Gemini, DeepSeek, Zhipu AI, Alibaba Cloud Bailian, or OpenRouter, enter them directly in the corresponding service provider section within WiseMindAI. If you use an OpenAI-compatible interface, select Custom OpenAI-Compatible Service.

After configuration, test the model before processing a long document, generating an image, or starting a recording. This catches key, model name, endpoint, and quota issues early.

Troubleshooting Tips

If document summarization works but document chat or knowledge base Q&A performs poorly, prioritize checking if your Embedding model is configured and whether documents have been analyzed.

If vectorization fails for a document, check the error message and logs. Newer versions record more detailed failure information to help diagnose issues related to model configuration, network connectivity, document format, or content size.

If Live Transcription receives no text, run the microphone test and the Live Transcription model test separately. The microphone test confirms that the device receives sound, while the model test confirms that the provider can recognize it. Both must pass.

Model Configuration ​

Configure by Need ​

Test online models directly ​

Understand test errors ​

Usage Recommendations ​

Troubleshooting Tips ​