Paid Models

The pricing for different providers and models varies depending on the specific model and usage requirements. Here's an overview of different providers and models for both the large language models and embeddings:

Embedding Models

Embedding models are used to convert text data into numerical representations called embeddings. These embeddings capture the semantic meaning and relationships between different pieces of text. When the documents/ directory is updated, the content is sent to the embedding API. The resulting embeddings are saved to the cache/ directory, allowing the AI to find relevant context efficiently without reprocessing or making new API requests for each query.

Platform

Model Name

MTEB

Embedding Dimensions

API Price (per 1K tokens)

OpenAI

text-embedding-3-large

64.59

3072

Usage: $0.00013

OpenAI

text-embedding-3-small

62.26

1536

Usage: $0.00002

OpenAI

text-embedding-ada-002

60.99

1536

Usage: $0.0001

Large Language Models

Large Language Models (LLMs) are powerful AI models that can understand and generate human-like text based on the input they receive. In ServerAssistantAI, when a user asks a question, the system retrieves relevant cached context from the embedding API results. This context, along with the user's question, is sent to the LLM to generate accurate and context-aware responses.

Platform

Model Name

ELO

Speed (tokens per second)

API Price (per 1K tokens)

OpenAI

gpt-4o

1442

49.97 TPS

Input: $0.0025

Output: $0.0125

Anthropic