Discourse AI - Large Language Model (LLM) settings page

Discourse · 2024-08-02T21:28:38.753Z

This guide covers the LLM settings page which is part of the Discourse AI plugin.

Required user level: Administrator

The dedicated settings page is designed to have everything related to Large Language Models (LLMs) used for Discourse AI features in one place.

Depending on the Discourse AI feature enabled, an LLM might be needed. Please check each Discourse AI feature to know if an LLM is a pre-requisite.

CleanShot 2024-08-02 at 15.06.50@2x2532×1686 284 KB

CleanShot 2024-08-02 at 15.07.09@2x1550×1592 176 KB

Features

Add new models, with prepopulated information
Add custom models not mentioned
Configure LLM settings
Allow specific LLM use for AI Bot
- See the AI Bot username
Enable vision support (model dependent)
Test
Save settings

Adding LLM connections

Go to Admin → Plugins → AI
Go to the LLMs tab
Add a new connection, pick your model
Add in the API key (depending on the model, you might have more fields to input manually) and save
(Optional) Test your connection to make sure it’s working

Supported LLMs

You can always add a custom option if you don’t see your model listed. Supported models are continually added.

Grok-2
Deepseek-R1
Nova Pro
Nova Lite
Nova Micro
o3-pro
o3
o3-mini
GPT-4.1 (including: nano,mini)
GPT-4o
GPT-4o mini
OpenAI o1 Preview
OpenAI o1 mini Preview
Claude Sonnet 3.7
Claude Sonnet 3.5
Claude Haiku 3.5
Gemini Pro 1.5
Gemini Flash 1.5
Gemini Flash 2.0
Llama 3.1
Llama 3.3
Mistral large
Pixtral large
Qwen 2.5 Coder

Additionally, hosted customers can use the following pre-configured LLMs in the settings page. These are open-weights LLMs hosted by Discourse, ready for use to power AI features.

CDCK Hosted Large LLM: Llama 3.3
CDCK Hosted Small LLM: Qwen 2.5
CDCK Hosted Vision LLM: Qwen 2-VL

Configurations fields

You will only see the fields relevant to your selected LLM provider. Please double-check any of the pre-populated fields with the appropriate provider, such as Model name

Name to display
Model name
Service hosting the model
URL of the service hosting the model
API Key of the service hosting the model
AWS Bedrock Access key ID
AWS Bedrock Region
Optional OpenAI Organization ID
Tokenizer
Number of tokens for the prompt
Enable responses API (Open AI only, be sure to set URL to https://api.openai.com/v1/responses)

Technical FAQ

What is tokenizer?

The tokenizer translates strings into tokens, which is what a model uses to understand the input.

What number should I use for Number of tokens for the prompt ?

A good rule of thumb is 50% of the model context window, which is the sum of how many tokens you send and how many tokens they generate. If the prompt gets too big, the request will fail. That number is used to trim the prompt and prevent that from happening

Caveats

Sometimes you may not see the model you wanted to use listed. While you can add them manually, we will support popular models as they come out.

Last edited by @sam 2025-06-11T08:35:39Z

Check document
Perform check on document:

qianping_chen · 2024-09-30T17:16:20.703Z

It’s too difficult, I don’t know how to do it at all. I hope to update specific tutorials on various AIs, such as Google login settings.

sam · 2024-10-01T05:40:38.236Z

We improved the UI a lot in the past week, can you try it out again?

hameedacpa · 2025-02-24T17:08:28.363Z

When Gemini 2.0 will be supported ?

sam · 2025-02-24T21:58:17.667Z

Been supported for quite a while.

Joe_F · 2025-03-11T13:21:50.920Z

I seem to have an issue where I cannot Select a LLM even though I have the CDCK hosted ones configured…

is this normal?

sam · 2025-03-12T00:17:15.189Z

A lot to unwrap here, which llm are you trying to choose for what?

The CDCK LLMs are only available for very specific features, to see which you need to head to /admin/whats-new on your instance and click “only show experimental features”, you will need to enable them to unlock the CDCK LLM on specific features.

Any LLM you define outside of CDCK LLMs is available to all features.

AquaL1te · 2025-03-12T09:13:55.439Z

Is there also a topic that provides a general rundown of the best cost/quality balance? Or even which LLM can be used for free for a small community and basic functionality? I can dive into the details and play around. But I’m a bit short in terms of time.

For example, I only care about spam detection and a profanity filter. I had this for free, but those plugins are deprecated or soon to be. It would be nice if I can retain this functionality without having to pay for an LLM.

Saif · 2025-03-12T19:20:39.840Z

AquaL1te:

Is there also a topic that provides a general rundown of the best cost/quality balance? Or even which LLM can be used for free for a small community and basic functionality? I can dive into the details and play around. But I’m a bit short in terms of time.

We do have this topic, that might be what you are looking for.

What LLM to use for Discourse AI? Site Management

It’s important to understand the needs of you as the community admin and your members when choosing a Large Language Model (LLM) to power Discourse AI features. Several factors may influence your decisions. Performance for use-case. Are you looking for the best-performing model? Performance can vary depending on task, the assessment is based on the model’s ability to generate correct and relevant responses. Context length. The context window is the amount of text a model can “see” and con…

AquaL1te · 2025-03-25T09:36:43.066Z

Done! It was indeed pretty easy. But maybe for a non techie it may still be a bit hard to setup. For example, the model name was automatically set in the settings, but wasn’t the correct one. Luckily I recognized the model name in a curl example for Claude on the API page and then it worked

Estimated costs are maybe 30 euro cents per month for spam control (I don’t have a huge forum). So that’s manageable! I’ve set a limit of 5 euros in the API console, just in case.

Saif · 2025-03-25T16:16:56.403Z

AquaL1te:

For example, the model name was automatically set in the settings, but wasn’t the correct one.

Which one did you pick for Claude? What was the incorrect name shown, and what did you correct it to?

AquaL1te · 2025-03-26T09:31:18.541Z

I use Claude 3.5, the model ID is by default claude-3-5-haiku, but I had to change it to claude-3-5-haiku-20241022, otherwise I got an error.

Saif · 2025-03-26T15:49:43.722Z

AquaL1te:

I use Claude 3.5, the model ID is by default claude-3-5-haiku, but I had to change it to claude-3-5-haiku-20241022, otherwise I got an error.

Good to note, yeah sometimes there might be a disconnect. The auto-populated info should act as guidance, which tends to work most of the time, but does fall short in certain cases such as yours (given all the different models and provider configs)

I have updated the OP of this guide

jrgong · 2025-04-11T11:20:22.232Z

Discourse:

Gemini Flash 2.0

This model is not listed on 3.4.2 - are those pre-configs only available on 3.5 and I have to add them manually?

Edit: Also what option do I choose for “Tokenizer” when using Grok 3 models?

Screenshot 2025-04-11 at 13.50.14786×376 41.5 KB

Falco · 2025-04-11T17:15:01.692Z

jrgong:

This model is not listed on 3.4.2 - are those pre-configs only available on 3.5 and I have to add them manually?

Pre-configs are simply templates, you can get the same end result by using the “Manual configuration”.

jrgong:

Edit: Also what option do I choose for “Tokenizer” when using Grok 3 models?

I’ve found that the Gemini tokenizer is pretty close the the Grok one, so try that.

CraigW · 2025-07-24T22:52:31.089Z

Is there a way to use IBM WatsonX through the current configuration management, or would this require additional development work by the Discourse staff?

Falco · 2025-07-24T23:15:40.919Z

Does IBM WatsonX expose an OpenAI compatible API by any chance?

CraigW · 2025-07-25T18:22:13.104Z

Great question. A quick poke around the docs didn’t tell me much, but the fact that this repository exists suggests that it is not directly compatible: GitHub - aseelert/watsonx-openai-api: Watsonx Openai compatible API