Fleet Operations
Discover
Live operational intelligence across every free NVIDIA NIM endpoint, ranked by throughput, reliability, and congestion and refreshed continuously.
- Endpoints: 46
- Providers: 8
- Degraded: 21
- Last probe: just now
Fleet Degraded: 25 of 46 endpoints operational · 21 degraded
25 · 11 · 10
LIVE
Throughput: 35.2 tok/s (▼ 6.0%)
P95 Latency: 3,316 ms (▲ 34%)
Reliability: 88.0% (▼ 2.5 pt)
Congestion: 23% (▲ 2.5 pt)
Model Registry
46 endpoints
Fleet Performance · 12h
836 ms
▲ 2.7% · 12h · avg 930 · min 395 · max 2230
Provider Intelligence
8 providers · 46 models · 21 degraded
Provider · M · Congestion · Uptime · Deg · Route · Recovery
A Alibaba · 1 · 6% · 100.00% · 0 · 0/1
C Google · 4 · 32% · 79.17% · 1 · 0/4
D Other · 10 · 26% · 89.83% · 6 · 0/10
D NVIDIA · 15 · 17% · 91.11% · 6 · 3/15
D Mistral · 4 · 9% · 94.17% · 2 · 0/4
D Microsoft · 1 · 100% · 35.00% · 1 · 0/1
D Meta · 9 · 18% · 94.26% · 3 · 0/9
D DeepSeek · 2 · 61% · 55.00% · 2 · 0/2
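The per-provider figures can be cross-checked against the fleet headline. The following is a minimal sketch, not the dashboard's own logic: the `ProviderRow` type and both helper functions are hypothetical names, and reading `M` as the model count and `Deg` as the degraded count is an assumption based on the column totals.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProviderRow:
    name: str
    models: int        # "M" column (assumed: models per provider)
    uptime_pct: float  # "Uptime" column
    degraded: int      # "Deg" column (assumed: degraded endpoints)


# Values copied from the Provider Intelligence rows.
ROWS = [
    ProviderRow("Alibaba", 1, 100.00, 0),
    ProviderRow("Google", 4, 79.17, 1),
    ProviderRow("Other", 10, 89.83, 6),
    ProviderRow("NVIDIA", 15, 91.11, 6),
    ProviderRow("Mistral", 4, 94.17, 2),
    ProviderRow("Microsoft", 1, 35.00, 1),
    ProviderRow("Meta", 9, 94.26, 3),
    ProviderRow("DeepSeek", 2, 55.00, 2),
]


def fleet_totals(rows):
    """Return (total endpoints, degraded, operational) summed over providers."""
    total = sum(r.models for r in rows)
    degraded = sum(r.degraded for r in rows)
    return total, degraded, total - degraded


def weighted_uptime(rows):
    """Uptime averaged over providers, weighted by model count."""
    return sum(r.uptime_pct * r.models for r in rows) / sum(r.models for r in rows)


total, degraded, operational = fleet_totals(ROWS)
print(total, degraded, operational)
print(round(weighted_uptime(ROWS), 1))
```

The totals come out to 46 endpoints, 21 degraded, and 25 operational, which agrees with the fleet status line. The model-weighted uptime lands near the 88.0% Reliability figure, though the page does not state whether that figure is computed this way.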
Overall Rankings
46 ranked · throughput · reliability · congestion
Best Overall: mistral-small-4-119b-2603 · Mistral · 746 pts
Runner Up: nemotron-3-nano-30b-a3b · NVIDIA · 628 pts
Third: step-3.7-flash · Other · 627 pts
1 · 746 pts · A · mistral-small-4-119b-2603 · Mistral
2 · 628 pts · A · nemotron-3-nano-30b-a3b · NVIDIA
3 · 627 pts · A · step-3.7-flash · Other
4 · 596 pts · A · nemotron-3-nano-omni-30b-a3b-reasoning · NVIDIA
5 · 595 pts · A · nemotron-mini-4b-instruct · NVIDIA
6 · 589 pts · A · gpt-oss-20b · Other
7 · 579 pts · A · diffusiongemma-26b-a4b-it · Google
8 · 544 pts · A · llama-4-maverick-17b-128e-instruct · Meta
9 · 538 pts · A · nvidia-nemotron-nano-9b-v2 · NVIDIA
10 · 529 pts · A · nemotron-3-content-safety · NVIDIA
11 · 525 pts · A · riva-translate-4b-instruct-v1.1 · NVIDIA
12 · 519 pts · A · qwen3-next-80b-a3b-instruct · Alibaba
13 · 501 pts · A · llama-3.1-nemotron-safety-guard-8b-v3 · NVIDIA
14 · 497 pts · A · sarvam-m · Other
15 · 494 pts · A · kimi-k2.6 · Other
16 · 493 pts · A · llama-3.1-8b-instruct · Meta
17 · 492 pts · A · llama-3.3-nemotron-super-49b-v1 · NVIDIA
18 · 489 pts · A · solar-10.7b-instruct · Other
19 · 489 pts · A · llama-3.2-3b-instruct · Meta
20 · 488 pts · A · nemotron-3.5-content-safety · NVIDIA
21 · 480 pts · A · stockmark-2-100b-instruct · Other
22 · 479 pts · A · ministral-14b-instruct-2512 · Mistral
23 · 477 pts · A · gpt-oss-120b · Other
24 · 475 pts · A · llama-3.2-1b-instruct · Meta
25 · 470 pts · A · llama-3.1-nemoguard-8b-content-safety · NVIDIA
26 · 470 pts · A · gemma-3n-e4b-it · Google
27 · 468 pts · A · llama-3.2-90b-vision-instruct · Meta
28 · 465 pts · A · gemma-3n-e2b-it · Google
29 · 464 pts · A · llama-3.2-11b-vision-instruct · Meta
30 · 457 pts · A · step-3.5-flash · Other
31 · 455 pts · A · llama-3.1-nemotron-nano-vl-8b-v1 · NVIDIA
32 · 449 pts · A · llama-guard-4-12b · Meta
33 · 445 pts · A · mistral-large-3-675b-instruct-2512 · Mistral
34 · 434 pts · A · mistral-medium-3.5-128b · Mistral
35 · 432 pts · A · nemotron-3-super-120b-a12b · NVIDIA
36 · 394 pts · A · llama-3.1-70b-instruct · Meta
37 · 366 pts · A · llama-3.3-nemotron-super-49b-v1.5 · NVIDIA
38 · 315 pts · A · deepseek-v4-pro · DeepSeek
39 · 249 pts · C · llama-3.3-70b-instruct · Meta
40 · 224 pts · C · llama-3.1-nemotron-nano-8b-v1 · NVIDIA
41 · 174 pts · D · glm-5.1 · Other
42 · 174 pts · D · nemotron-3-ultra-550b-a55b · NVIDIA
43 · 144 pts · D · deepseek-v4-flash · DeepSeek
44 · 120 pts · D · minimax-m2.7 · Other
45 · 105 pts · D · phi-4-mini-instruct · Microsoft
46 · 51 pts · D · gemma-4-31b-it · Google
Trend Analysis
46 models tracked
Rising: 2
- nemotron-mini-4b-instruct · 100%
- nemotron-3-nano-omni-30b-a3b-reasoning · 100%
Improving: 17
- glm-5.1 · 58%
- nemotron-3-super-120b-a12b · 95%
- llama-3.3-nemotron-super-49b-v1.5 · 73%
- llama-3.3-nemotron-super-49b-v1 · 97%
- llama-3.1-nemotron-nano-8b-v1 · 43%
Stable: 27
- solar-10.7b-instruct · 100%
- stockmark-2-100b-instruct · 100%
- step-3.7-flash · 100%
- step-3.5-flash · 100%
- sarvam-m · 100%
Declining: 0
All clear
Reliability Matrix
by routing confidence
solar-10.7b-instruct · 100% · High
stockmark-2-100b-instruct · 100% · Moderate
step-3.7-flash · 100% · Moderate
step-3.5-flash · 100% · Moderate
sarvam-m · 100% · High
qwen3-next-80b-a3b-instruct · 100% · Moderate
gpt-oss-20b · 100% · High
gpt-oss-120b · 100% · Moderate
riva-translate-4b-instruct-v1.1 · 100% · Moderate
nvidia-nemotron-nano-9b-v2 · 100% · Moderate
nemotron-mini-4b-instruct · 100% · Moderate
nemotron-3.5-content-safety · 100% · Moderate
nemotron-3-nano-omni-30b-a3b-reasoning · 100% · Moderate
nemotron-3-nano-30b-a3b · 100% · High
nemotron-3-content-safety · 100% · Moderate
llama-3.1-nemotron-safety-guard-8b-v3 · 100% · High
llama-3.1-nemotron-nano-vl-8b-v1 · 100% · Moderate
llama-3.1-nemoguard-8b-content-safety · 100% · Moderate
kimi-k2.6 · 100% · Moderate
mistral-small-4-119b-2603 · 100% · High
llama-4-maverick-17b-128e-instruct · 100% · Moderate
gemma-3n-e4b-it · 100% · Moderate
gemma-3n-e2b-it · 100% · Moderate
diffusiongemma-26b-a4b-it · 100% · High
ministral-14b-instruct-2512 · 98% · Moderate
llama-guard-4-12b · 98% · Moderate
llama-3.2-90b-vision-instruct · 98% · Moderate
llama-3.2-3b-instruct · 98% · Moderate
llama-3.2-11b-vision-instruct · 98% · Moderate
llama-3.1-8b-instruct · 98% · Moderate
llama-3.3-nemotron-super-49b-v1 · 97% · Moderate
nemotron-3-super-120b-a12b · 95% · Moderate
mistral-medium-3.5-128b · 90% · Moderate
mistral-large-3-675b-instruct-2512 · 88% · Moderate
llama-3.2-1b-instruct · 88% · Moderate
llama-3.1-70b-instruct · 85% · Moderate
llama-3.3-70b-instruct · 83% · Avoid
llama-3.3-nemotron-super-49b-v1.5 · 73% · Avoid
deepseek-v4-pro · 62% · Avoid
glm-5.1 · 58% · Avoid
nemotron-3-ultra-550b-a55b · 58% · Avoid
deepseek-v4-flash · 48% · Avoid
llama-3.1-nemotron-nano-8b-v1 · 43% · Avoid
minimax-m2.7 · 40% · Avoid
phi-4-mini-instruct · 35% · Avoid
gemma-4-31b-it · 17% · Avoid
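The routing-confidence labels suggest a simple way to shortlist failover targets. The sketch below is illustrative only: `Entry`, `TIER_ORDER`, and `failover_candidates` are hypothetical names, the sample is a small subset of the matrix, and the ordering rule (High before Moderate, then by reliability, never Avoid) is an assumption rather than the dashboard's documented routing policy.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    model: str
    reliability_pct: int
    confidence: str  # "High", "Moderate", or "Avoid"


# Assumed preference order; "Avoid" is deliberately absent so it is never routed to.
TIER_ORDER = {"High": 0, "Moderate": 1}

# A few rows copied from the Reliability Matrix.
SAMPLE = [
    Entry("mistral-small-4-119b-2603", 100, "High"),
    Entry("gpt-oss-20b", 100, "High"),
    Entry("llama-3.1-8b-instruct", 98, "Moderate"),
    Entry("mistral-large-3-675b-instruct-2512", 88, "Moderate"),
    Entry("llama-3.3-70b-instruct", 83, "Avoid"),
    Entry("gemma-4-31b-it", 17, "Avoid"),
]


def failover_candidates(entries, failing_model):
    """Return usable entries, best first, excluding the failing model itself."""
    usable = [
        e for e in entries
        if e.confidence in TIER_ORDER and e.model != failing_model
    ]
    # Sort by confidence tier, then by reliability descending.
    return sorted(usable, key=lambda e: (TIER_ORDER[e.confidence], -e.reliability_pct))


for entry in failover_candidates(SAMPLE, "llama-3.3-70b-instruct"):
    print(entry.model, entry.reliability_pct, entry.confidence)
```

A real router would probably also weigh congestion and model capability, neither of which this matrix exposes per model.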
Incident Timeline
21 active
Congestion spike detected, consider failover (10 endpoints)
- glm-5.1 · Other
- nemotron-3-ultra-550b-a55b · NVIDIA
- llama-3.3-nemotron-super-49b-v1.5 · NVIDIA
- llama-3.1-nemotron-nano-8b-v1 · NVIDIA
- minimax-m2.7 · Other
- phi-4-mini-instruct · Microsoft
- llama-3.3-70b-instruct · Meta
- gemma-4-31b-it · Google
- deepseek-v4-pro · DeepSeek
- deepseek-v4-flash · DeepSeek
Throughput degradation on primary endpoint (11 endpoints)
- step-3.7-flash · Other
- step-3.5-flash · Other
- gpt-oss-120b · Other
- nvidia-nemotron-nano-9b-v2 · NVIDIA
- nemotron-3-super-120b-a12b · NVIDIA
- llama-3.1-nemotron-nano-vl-8b-v1 · NVIDIA
- kimi-k2.6 · Other
- mistral-medium-3.5-128b · Mistral
- mistral-large-3-675b-instruct-2512 · Mistral
- llama-3.2-1b-instruct · Meta
- llama-3.1-70b-instruct · Meta
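The active incidents can be tallied by type and by provider to see where degradation concentrates. This is a minimal sketch: the `tally` helper and the short kind labels are hypothetical, and the (model, provider) pairs are copied from the timeline above.

```python
from collections import Counter

# Short labels for the two incident messages in the timeline (hypothetical names).
CONGESTION = "congestion spike"
THROUGHPUT = "throughput degradation"

CONGESTION_HITS = [
    ("glm-5.1", "Other"),
    ("nemotron-3-ultra-550b-a55b", "NVIDIA"),
    ("llama-3.3-nemotron-super-49b-v1.5", "NVIDIA"),
    ("llama-3.1-nemotron-nano-8b-v1", "NVIDIA"),
    ("minimax-m2.7", "Other"),
    ("phi-4-mini-instruct", "Microsoft"),
    ("llama-3.3-70b-instruct", "Meta"),
    ("gemma-4-31b-it", "Google"),
    ("deepseek-v4-pro", "DeepSeek"),
    ("deepseek-v4-flash", "DeepSeek"),
]

THROUGHPUT_HITS = [
    ("step-3.7-flash", "Other"),
    ("step-3.5-flash", "Other"),
    ("gpt-oss-120b", "Other"),
    ("nvidia-nemotron-nano-9b-v2", "NVIDIA"),
    ("nemotron-3-super-120b-a12b", "NVIDIA"),
    ("llama-3.1-nemotron-nano-vl-8b-v1", "NVIDIA"),
    ("kimi-k2.6", "Other"),
    ("mistral-medium-3.5-128b", "Mistral"),
    ("mistral-large-3-675b-instruct-2512", "Mistral"),
    ("llama-3.2-1b-instruct", "Meta"),
    ("llama-3.1-70b-instruct", "Meta"),
]

INCIDENTS = (
    [(CONGESTION, m, p) for m, p in CONGESTION_HITS]
    + [(THROUGHPUT, m, p) for m, p in THROUGHPUT_HITS]
)


def tally(incidents):
    """Count incidents per kind and per provider."""
    by_kind = Counter(kind for kind, _model, _provider in incidents)
    by_provider = Counter(provider for _kind, _model, provider in incidents)
    return by_kind, by_provider


by_kind, by_provider = tally(INCIDENTS)
print(dict(by_kind))
for provider, count in by_provider.most_common():
    print(provider, count)
```

The per-provider counts match the degraded counts in the Provider Intelligence section (6 each for Other and NVIDIA, 3 for Meta, 2 each for Mistral and DeepSeek, 1 each for Google and Microsoft), and the two incident types split 10 and 11 across the 21 active incidents.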