Loading...
Loading...
Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board. Hermes 3 70B is a competitive, if not superior finetune of the [Llama-3.1 70B foundation model](/models/meta-llama/llama-3.1-70b-instruct), focused on aligning LLMs to the user, with powerful steering capabilities and control given to the end user. The Hermes 3 series builds and expands on the Hermes 2 set of capabilities, including more powerful and reliable function calling and structured output capabilities, generalist assistant capabilities, and improved code generation skills.
Quality Index
10.6
349th of 444
Top 80%
Price/1M
$0.30
335th cheapest
At median
Top 50%
Speed
41 tok/s
Top 53%
TTFT
0.32s
Context Window
131K
145th largest
Top 63%
Input
$0.30
per 1M tokens
Output
$0.30
per 1M tokens
Blended
$0.30
per 1M tokens
Cheaper than 50% of models. Median price is $0.30/1M tokens.
Daily
$0.30
Monthly
$9.00
41
tokens/sec
Faster than 47% of models
0.32
seconds
Faster than 58% of models
0.32
seconds
Faster than 59% of models
Market Median
45 tok/s
8% slower
Median TTFT
0.42s
23% faster
Throughput/Dollar
138
tok/s per $/1M
Speed Comparison
Context Window
131K
tokens
Larger than 37% of models