nvidia-dynamo

Here are 3 public repositories matching this topic...

aws-samples / sample-genai-on-eks-starter-kit

A comprehensive toolkit for deploying production-ready Generative AI infrastructure on Amazon EKS. Includes pre-configured components for: 🚀 AI Gateway (LiteLLM) 🤖 LLM Serving (vLLM, SGLang, Ollama) 📊 Vector Databases, 🔍 Embedding Models (TEI) 📈 Observability (Langfuse, Phoenix) etc. Fast-track your GenAI deployment with Kubernetes

kubernetes aws terraform ai-agents amazon-eks ai-platform vector-database ai-engineering generative-ai llmops llm-serving gpu-inference vllm llm-inference langfuse ai-gateway agentic-ai mcp-server nvidia-dynamo

Updated May 26, 2026
JavaScript

rutujaingole / Optimizing-LLM-Inference-using-NVIDIA-Dynamo-and-TorchDynamo

Star

The goal of the project is to benchmark and optimize BERT inference using different backends—PyTorch eager mode, TorchDynamo (Inductor backend), and NVIDIA Triton Inference Server. We use GLUE SST-2 samples for evaluation and compare performance through profiling, kernel timing, and latency analysis.

machine-learning machine-learning-algorithms pytorch high-performance-computing profiling bert nvidia-gpu hpml torchdynamo llm-inference nvidia-dynamo

Updated May 10, 2025
Jupyter Notebook

developertogo / velo-sentinel

Star

Production-grade Java 25 Virtual Thread inference gateway bridging NVIDIA Triton → Dynamo with Earliest Deadline First (EDF) priority queuing, adaptive batching, and async shadow validation.

redis distributed-systems grpc priority-queues load-balancing model-serving triton-inference-server virtual-threads inference-gateway semantic-caching nvidia-dynamo disaggregated-serving

Updated May 9, 2026
Java

Improve this page

Add a description, image, and links to the nvidia-dynamo topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the nvidia-dynamo topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly