Gemma 4 on Jetson
Run Google Gemma 4 models on Jetson with vLLM or llama.cpp. Covers E2B, E4B, 26B-A4B, and 31B on Orin and Thor, including reasoning, tool calling, and runtime selection.
Step-by-step guides to deploy, experiment, and run cutting-edge AI models on NVIDIA Jetson devices.
100-minute hands-on workshop experiencing Jetson Thor's Physical AI capabilities. Learn to deploy AI microservices, run Vision Language Models, and build conversational AI pipelines.
Master inference optimization on Jetson Thor with vLLM. Learn to deploy production-grade LLM serving, quantization strategies (FP16 β FP8 β FP4), and advanced optimizations like speculative decoding.
Everything you need to get started with NVIDIA Jetson at a hackathon. Setup tips, project ideas, and resources to help your team build an impressive AI project.
Try a different search term