Paid Models

The pricing for different providers and models varies depending on the specific model and usage requirements. Here's an overview of different providers and models for both the large language models and embeddings:

Embedding Models

Embedding models are used to convert text data into numerical representations called embeddings. These embeddings capture the semantic meaning and relationships between different pieces of text, allowing the AI to find the most relevant context from the document.txt file when answering a question.

PlatformModel NameMTEBEmbedding DimensionsAPI Price (per 1K tokens)

64.59

3072

Usage: $0.00013

62.26

1536

Usage: $0.00002

60.99

1536

Usage: $0.0001

Large Language Models

Large Language Models (LLMs) are powerful AI models that can understand and generate human-like text based on the input they receive. In ServerAssistantAI, LLMs process the user's question along with the relevant context provided by the embedding model to generate accurate and context-aware responses.

PlatformModel NameELOSpeed (tokens per second)API Price (per 1K tokens)

1287

49.97 TPS

Input: $0.005

Output: $0.015

1268

57.2 TPS

Input: $0.0035 Output: $0.0105

1252

38.90 TPS

Input: $0.03

Output: $0.06

1232

133.3 TPS

Input: $0.00035 Output: $0.00105

1246

25.51 TPS

Input: $0.015

Output: $0.075

1199

81.21 TPS

Input: $0.003

Output: $0.015

1181

214.95 TPS

Input: $0.00025

Output: $0.00125

1113

53.07 TPS

Input: $0.0005

Output: $0.0015

Pricing and model results were updated on 5/17/2024 using the following sources: | Source 1 | Source 2 | Source 3 | Source 4 | Source 5 | Source 6 | Source 7 |

Last updated