Free Models

Paid models often offer stronger performance and capabilities, but several free models also work with ServerAssistantAI. Many of them are open-source and make a good starting point for servers.

Embedding Models

Embedding models are used to convert text data into numerical representations called embeddings. These embeddings capture the semantic meaning and relationships between different pieces of text. When the documents/ directory is updated, the content is sent to the embedding API. The resulting embeddings are saved to the cache/ directory, allowing the AI to find relevant context efficiently without reprocessing or making new API requests for each query.
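The caching behavior described above can be sketched roughly as follows. This is a minimal illustration, not ServerAssistantAI's actual implementation: `fake_embed` is a hypothetical stand-in for a real embedding API call, and the cache layout (one JSON file per content hash) is an assumption made for the example.

```python
import hashlib
import json
import math
from pathlib import Path


def fake_embed(text, dims=8):
    """Hypothetical stand-in for an embedding API call (illustration only)."""
    vec = [0.0] * dims
    for word in text.lower().split():
        # Hash each word into one of `dims` buckets and count occurrences.
        bucket = int(hashlib.sha256(word.encode("utf-8")).hexdigest(), 16) % dims
        vec[bucket] += 1.0
    return vec


def cached_embedding(text, cache_dir, embed=fake_embed):
    """Return the embedding for `text`, calling `embed` only on a cache miss."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    # Assumed layout: the cache file name is the SHA-256 of the content.
    key = hashlib.sha256(text.encode("utf-8")).hexdigest()
    path = cache_dir / f"{key}.json"
    if path.exists():
        return json.loads(path.read_text())
    vector = embed(text)
    path.write_text(json.dumps(vector))
    return vector


def cosine_similarity(a, b):
    """Cosine similarity between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0
```

Keying the cache on a hash of the content means an unchanged document is never sent to the API twice, while an edited document produces a new key and is embedded again.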

| Provider | Model Name | MTEB | Embedding Dimensions |
| --- | --- | --- | --- |
| | text-embedding-004 | 66.31 | 768 |
| | embed-english-v3.0 | 64.47 | 1024 |
| | embed-multilingual-v3.0 | 64.01 | 1024 |

Large Language Models

Large Language Models (LLMs) are powerful AI models that can understand and generate human-like text based on the input they receive. In ServerAssistantAI, when a user asks a question, the system retrieves the most relevant context from the cached embeddings. This context, along with the user's question, is sent to the LLM to generate an accurate, context-aware response.
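As a rough sketch of that retrieve-then-ask flow, the example below ranks cached chunks by cosine similarity to the question and assembles a prompt. The function names, the prompt wording, and the in-memory `(text, vector)` pairs are all assumptions made for illustration; they are not ServerAssistantAI's actual API or prompt format.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k_chunks(question_vec, cached, k=2):
    """Return the text of the k cached chunks most similar to the question.

    `cached` is a list of (chunk_text, vector) pairs (hypothetical structure).
    """
    ranked = sorted(
        cached,
        key=lambda item: cosine_similarity(question_vec, item[1]),
        reverse=True,
    )
    return [text for text, _ in ranked[:k]]


def build_prompt(question, context_chunks):
    """Combine retrieved context and the user's question into one prompt."""
    context = "\n\n".join(context_chunks)
    return (
        "Use the context below to answer the question.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )
```

The resulting string is what would be sent to the chosen LLM; only the top-ranked chunks are included, which keeps the prompt short and focused on the question.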

| Provider | Model Name | ELO | Speed (tokens per second) | Service uses responses to improve model? |
| --- | --- | --- | --- | --- |
| | gpt-4o | 1442 | N/A | No |
| | gemini-2.5-flash | 1409 | 253.8 | Yes |
| | gpt-4.1 | 1408 | N/A | No |
| | deepseek-r1-distill-llama-70b | 1393 | 400 | No |
| | gpt-4.1-mini | 1371 | N/A | No |
| | gemini-2.0-flash-001 | 1365 | N/A | Yes |
| | gemma-3-27b-it | 1363 | 54.4 | Yes |
| | gemini-1.5-pro | 1350 | 57.2 | Yes |
| | qwen/qwen3-32b | 1346 | 662 | No |
| | command-a-03-2025 | 1345 | 67.4 | No |
| | llama-3.3-70b-versatile | 1316 | 394 | No |
| | gemini-1.5-flash | 1284 | 133.3 | Yes |
| | gpt-4o-mini | 1273 | N/A | No |
| | command-r-plus-08-2024 | 1278 | 47.7 | No |
| | llama3-70b-8192 | 1275 | 300+ | No |
| | command-r-plus | 1264 | 43.16 | No |
| | command-r | 1230 | 94.52 | No |
| | meta-llama/Meta-Llama-3-8B-Instruct | 1224 | N/A | No |
| | 01-ai/Yi-1.5-34B-Chat | 1217 | N/A | No |
| | mistralai/Mixtral-8x7B-Instruct-v0.1 | 1200 | N/A | No |
| | google/gemma-1.1-7b-it | 1179 | N/A | No |
| | mistralai/Mistral-7B-Instruct-v0.3 | 1113 | N/A | No |

Model results were updated on 8/5/2025 using the following sources: Source 1, Source 2, Source 3, Source 4.
