News

NVIDIA Technical Blog
developer. nvidia. com > blog > run-diffusiongemma-on-nvidia-for-developer-ready-high-throughput-text-generation

Run Diffusion Gemma on NVIDIA for Developer-Ready, High-Throughput Text Generation

1+ hour, 58+ min ago  (409+ words) | NVIDIA Technical Blog NVIDIA Developer Run Diffusion Gemma on NVIDIA for Developer-Ready, High-Throughput Text Generation - Diffusion Gemma, developed by Google Deep Mind and optimized for NVIDIA hardware, generates tokens in parallel using diffusion-based denoising, enabling much faster and more scalable…...

Symbols: nasdaq:nvda
NVIDIA Technical Blog
developer. nvidia. com > blog > delivering-lifecycle-control-for-ai-infrastructure-at-scale-with-nvidia-dgx-spark-enterprise-manageability

Delivering Lifecycle Control for AI Infrastructure at Scale with NVIDIA DGX Spark Enterprise Manageability

1+ day, 6+ min ago  (692+ words) As AI infrastructure scales, enterprise expectations for operational maturity are increasing. Organizations expect these systems to be provisionable, observable, secure, and manageable at scale'the same standard applied to all critical infrastructure. The moment an AI system moves from development into…...

Symbols: nasdaq:smci
NVIDIA Technical Blog
developer. nvidia. com > blog > model-quantization-turn-fp8-checkpoints-into-high-performance-inference-engines-with-nvidia-tensorrt

Model Quantization: Turn FP8 Checkpoints into High-Performance Inference Engines with NVIDIA Tensor RT

23+ hour, 50+ min ago  (726+ words) Converting a quantized checkpoint into an NVIDIA Tensor RT engine bridges the gap between model optimization and production deployment, enabling faster inference, higher throughput, and more efficient GPU utilization at scale. This post picks up where we left off, walking…...

Symbols: nasdaq:crwv
NVIDIA Technical Blog
developer. nvidia. com > blog

Evaluate Clinical ASR Models Faster with Agent Skills and NVIDIA Nemotron Speech

4+ day, 19+ hour ago  (515+ words) Training a speech AI model to correctly recognize or synthesize clinical terminology is surprisingly difficult. Drug names like Acetaminophen, Amlodipine…...

Symbols: btc-usd,nasdaq:npce
NVIDIA Technical Blog
developer. nvidia. com > blog > train-models-faster-with-jax-and-maxtext-using-nvfp4-on-nvidia-blackwell

Train Models Faster with JAX and Max Text Using NVFP4 on NVIDIA Blackwell

2+ day ago  (751+ words) Pre-training frontier LLMs comes down to throughput. When training spans trillions of tokens across thousands of accelerators, every percentage point of step time can add up to days of training and substantial compute costs. Numerical precision is one of the…...

Symbols: nasdaq:crwv
NVIDIA Technical Blog
developer. nvidia. com > blog > nvidia-nemotron-3-ultra-powers-faster-more-efficient-reasoning-for-long-running-agents

NVIDIA Nemotron 3 Ultra Powers Faster, More Efficient Reasoning for Long-Running Agents

6+ day, 5+ hour ago  (1100+ words) Single-turn chatbots are evolving into long-running agents that can reason, maintain context, use tools, and run efficiently across many turns to complete complex workflows. However, these multi-agent workflows cause token counts to grow quickly. Agents plan, call tools, invoke sub-agents,…...

Symbols: nasdaq:nvda
NVIDIA Technical Blog
developer. nvidia. com > blog > build-personal-ai-agents-on-windows-pcs-with-new-tools-from-microsoft-and-nvidia

Build Personal AI Agents on Windows PCs with New Tools from Microsoft and NVIDIA

1+ week, 23+ hour ago  (907+ words) Turnkey agent sandboxing on native Windows is now available, plus 2x faster agentic inference, new agent apps, and more AI agents are changing how you interact with your PC. Creators, developers, and AI enthusiasts are already using these agents extensively to…...

Symbols: nasdaq:nvda
Google News
developer. nvidia. com > blog > deploy-self-evolving-agents-for-faster-more-secure-research-with-a-hermes-agent-and-nvidia-nemoclaw

Deploy Self-Evolving Agents for Faster, More Secure Research with a Hermes Agent and NVIDIA Nemo Claw

1+ week, 1+ day ago  (745+ words) AI agents are a powerful tool for synthesizing data to accelerate research, summarize information, and help teams make decisions faster. But combining internal data with public sources poses security challenges." This post shares an open source example using Hermes Agent…...

Symbols: nasdaq:veea
NVIDIA Technical Blog
developer. nvidia. com > blog > deploy-agentic-ready-ai-at-the-edge-with-memory-efficiency-in-nvidia-jetpack-7-2

Deploy Agentic-Ready AI at the Edge with Memory Efficiency in NVIDIA Jet Pack 7. 2

1+ week, 1+ day ago  (658+ words) Release features include one-command deployment of NVIDIA Nemo Claw, NVIDIA agent skills for Jetson, official Yocto Project support, Super Mode on Jetson AGX Orin 32 GB As AI agents move from the digital world to the physical environment, they can readily…...

Symbols: nasdaq:nvda
Google News
developer. nvidia. com > blog > run-local-ai-agents-with-faster-models-and-multi-node-clustering-on-nvidia-dgx-spark

Run Local AI Agents with Faster Models and Multi-Node Clustering on NVIDIA DGX Spark

1+ week, 1+ day ago  (570+ words) The rise of autonomous, long-running AI agents has introduced a new class of compute demand, namely tasks that maintain large context windows, spawn concurrent subagents, and iterate continuously without cloud dependency. Security and privacy concerns are also accelerating the shift…...

Symbols: nasdaq:nvda