Nemotron3 Nano 4B
NVIDIA Nemotron — ✓ ✓ ✓ ✓ ✓ — — ✓ —
Run
Run Nemotron3 Nano 4B
Nemotron3 Nano 4B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine llama.cpp Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
FunctionGemma
Google Gemma3 — ✓ ✓ ✓ ✓ ✓ — — ✓ —
Run
Run FunctionGemma
FunctionGemma Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine llama.cpp Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Cosmos Reason 1 7B
NVIDIA Cosmos Reason VLM ✓ ✓ ✓ ✓ ✓ ✓ — — —
Run
Run Cosmos Reason 1 7B
Cosmos Reason 1 7B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Gemma 3 270M
Google Gemma3 — ✓ ✓ ✓ ✓ ✓ ✓ ✓ — —
Run
Run Gemma 3 270M
Gemma 3 270M Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container Ollama Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Gemma 4 E2B
Google Gemma4 VLM ✓ ✓ ✓ ✓ ✓ ✓ — ✓ —
Run
Run Gemma 4 E2B
Gemma 4 E2B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container llama.cpp Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
GPT OSS 20B
OpenAI GPT OSS — ✓ ✓ ✓ — — ✓ — — —
Run
Run GPT OSS 20B
GPT OSS 20B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Llama 3.2 3B
Meta Llama 3 — ✓ ✓ ✓ ✓ ✓ ✓ ✓ — —
Run
Run Llama 3.2 3B
Llama 3.2 3B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container Ollama Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
MiniMax M2.7 New
MiniMax M2.7 — ✓ — — — — — — ✓ —
Run
Run MiniMax M2.7
MiniMax M2.7 Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine llama.cpp Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Ministral 3 3B Instruct
Mistral AI Ministral 3 VLM ✓ ✓ ✓ ✓ ✓ ✓ ✓ — —
Run
Run Ministral 3 3B Instruct
Ministral 3 3B Instruct Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container Ollama CLI
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Nemotron3 Nano 30B-A3B
NVIDIA Nemotron — ✓ ✓ ✓ ✓ ✓ ✓ ✓ — —
Run
Run Nemotron3 Nano 30B-A3B
Nemotron3 Nano 30B-A3B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container Ollama CLI
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Qwen3 4B
Alibaba Qwen3 — ✓ ✓ ✓ ✓ ✓ ✓ — — —
Run
Run Qwen3 4B
Qwen3 4B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Qwen3.5 35B-A3B (MoE)
Alibaba Qwen3.5 — ✓ ✓ ✓ — — ✓ — — —
Run
Run Qwen3.5 35B-A3B (MoE)
Qwen3.5 35B-A3B (MoE) Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Qwen3.6 35B-A3B (MoE) New
Alibaba Qwen3.6 — ✓ ✓ ✓ — — ✓ — — —
Run
Run Qwen3.6 35B-A3B (MoE)
Qwen3.6 35B-A3B (MoE) Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Gemma 3 1B
Google Gemma3 — ✓ ✓ ✓ ✓ ✓ ✓ ✓ — —
Run
Run Gemma 3 1B
Gemma 3 1B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container Ollama Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Gemma 4 E4B
Google Gemma4 VLM ✓ ✓ ✓ ✓ — ✓ — ✓ —
Run
Run Gemma 4 E4B
Gemma 4 E4B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container llama.cpp Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
GPT OSS 120B
OpenAI GPT OSS — ✓ ✓ — — — ✓ — — —
Run
Run GPT OSS 120B
GPT OSS 120B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Llama 3.1 8B
Meta Llama 3 — ✓ ✓ ✓ ✓ ✓ ✓ ✓ — —
Run
Run Llama 3.1 8B
Llama 3.1 8B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container Ollama Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Ministral 3 8B Instruct
Mistral AI Ministral 3 VLM ✓ ✓ ✓ ✓ ✓ ✓ ✓ — —
Run
Run Ministral 3 8B Instruct
Ministral 3 8B Instruct Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container Ollama CLI
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Nemotron Nano 9B v2
NVIDIA Nemotron — ✓ ✓ — — — ✓ — — —
Run
Run Nemotron Nano 9B v2
Nemotron Nano 9B v2 Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Qwen3.5 27B
Alibaba Qwen3.5 — ✓ ✓ ✓ — — ✓ — — —
Run
Run Qwen3.5 27B
Qwen3.5 27B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Qwen3.6 27B New
Alibaba Qwen3.6 — ✓ ✓ — — — ✓ — — —
Run
Run Qwen3.6 27B
Qwen3.6 27B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Qwen3 8B
Alibaba Qwen3 — ✓ ✓ ✓ ✓ — ✓ — — —
Run
Run Qwen3 8B
Qwen3 8B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Gemma 3 4B
Google Gemma3 VLM ✓ ✓ ✓ ✓ ✓ ✓ ✓ — —
Run
Run Gemma 3 4B
Gemma 3 4B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container Ollama Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Gemma 4 26B-A4B
Google Gemma4 VLM ✓ ✓ ✓ — — ✓ — ✓ —
Run
Run Gemma 4 26B-A4B
Gemma 4 26B-A4B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container llama.cpp Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Cosmos Reason 2 2B
NVIDIA Cosmos Reason VLM ✓ ✓ ✓ ✓ ✓ ✓ — ✓ — Llama 3.1 70B
Meta Llama 3 — ✓ ✓ ✓ ✓ ✓ ✓ ✓ — —
Run
Run Llama 3.1 70B
Llama 3.1 70B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container Ollama Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Ministral 3 14B Instruct
Mistral AI Ministral 3 VLM ✓ ✓ ✓ ✓ ✓ ✓ ✓ — —
Run
Run Ministral 3 14B Instruct
Ministral 3 14B Instruct Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container Ollama CLI
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Nemotron Nano 12B VL
NVIDIA Nemotron VLM ✓ ✓ — — — ✓ — — —
Run
Run Nemotron Nano 12B VL
Nemotron Nano 12B VL Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Qwen3 30B-A3B (MoE)
Alibaba Qwen3 — ✓ ✓ ✓ — — ✓ — — —
Run
Run Qwen3 30B-A3B (MoE)
Qwen3 30B-A3B (MoE) Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Qwen3.5 9B
Alibaba Qwen3.5 VLM ✓ ✓ ✓ ✓ — ✓ — — —
Run
Run Qwen3.5 9B
Qwen3.5 9B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Gemma 3 12B
Google Gemma3 VLM ✓ ✓ ✓ ✓ ✓ ✓ ✓ — —
Run
Run Gemma 3 12B
Gemma 3 12B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container Ollama Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Cosmos Reason 2 8B
NVIDIA Cosmos Reason VLM ✓ ✓ ✓ ✓ ✓ ✓ — ✓ — Gemma 4 31B
Google Gemma4 VLM ✓ ✓ ✓ — — ✓ — ✓ —
Run
Run Gemma 4 31B
Gemma 4 31B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container llama.cpp Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Ministral 3 3B Reasoning
Mistral AI Ministral 3 VLM ✓ ✓ ✓ ✓ ✓ ✓ — — —
Run
Run Ministral 3 3B Reasoning
Ministral 3 3B Reasoning Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Qwen3 32B
Alibaba Qwen3 — ✓ ✓ — — — ✓ — — —
Run
Run Qwen3 32B
Qwen3 32B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Nemotron 3 Nano Omni New
NVIDIA Nemotron VLM ✓ ✓ ✓ — — ✓ ✓ ✓ —
Run
Run Nemotron 3 Nano Omni
Nemotron 3 Nano Omni Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container llama.cpp Container Ollama Local
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Qwen3.5 4B
Alibaba Qwen3.5 VLM ✓ ✓ ✓ ✓ ✓ ✓ — — —
Run
Run Qwen3.5 4B
Qwen3.5 4B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Gemma 3 27B
Google Gemma3 VLM ✓ ✓ ✓ ✓ ✓ ✓ ✓ — —
Run
Run Gemma 3 27B
Gemma 3 27B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container Ollama Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Ministral 3 8B Reasoning
Mistral AI Ministral 3 VLM ✓ ✓ ✓ ✓ — ✓ — — —
Run
Run Ministral 3 8B Reasoning
Ministral 3 8B Reasoning Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Qwen3.5 0.8B
Alibaba Qwen3.5 VLM ✓ ✓ ✓ ✓ ✓ ✓ — — —
Run
Run Qwen3.5 0.8B
Qwen3.5 0.8B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Qwen3 VL 4B
Alibaba Qwen3 VLM ✓ ✓ ✓ ✓ ✓ ✓ — — —
Run
Run Qwen3 VL 4B
Qwen3 VL 4B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Ministral 3 14B Reasoning
Mistral AI Ministral 3 VLM ✓ ✓ ✓ — — ✓ — — —
Run
Run Ministral 3 14B Reasoning
Ministral 3 14B Reasoning Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details
Qwen3 VL 8B
Alibaba Qwen3 VLM ✓ ✓ ✓ ✓ — ✓ — — —
Run
Run Qwen3 VL 8B
Qwen3 VL 8B Quick Start Runner
Jetson T5000 module Jetson AGX Thor Developer Kit
Jetson T4000 module
Jetson AGX Orin 64GB module Jetson AGX Orin 64GB Developer Kit
Jetson Orin NX 16GB module
Jetson Orin Nano 8GB module Jetson Orin Nano 8GB Developer Kit
Inference Engine vLLM Container
Copy serve command Loading command...
Commands are auto-generated based on your configuration settings.
Advanced configuration Show Advanced vLLM Configuration
Configure vLLM server parameters. Leave empty to use defaults.
Reset to defaults
Details