How to Prune and Distill Llama-3.1 8B to an NVIDIA Llama-3.1-Minitron 4B Model | 8 | 110 | October 4, 2024
Just Released: NVIDIA NeMo Curator Improvements for Accelerating Data Curation | 1 | 6 | October 4, 2024
Achieve Innovative Hyperconverged Networking with NVIDIA Spectrum Ethernet and Microsoft Azure Stack HCI | 3 | 335 | October 3, 2024
Event: Community Over Code | 1 | 6 | October 3, 2024
AI Investigates Antarctica's Disappearing Moss to Uncover Climate Change Clues | 1 | 8 | October 3, 2024
Event: NVIDIA cuOpt at INFORMS 2024 | 1 | 8 | October 3, 2024
New Reward Model Helps Improve LLM Alignment with Human Preferences | 1 | 6 | October 3, 2024
Managing AI Inference Pipelines on Kubernetes with NVIDIA NIM Operator | 3 | 35 | October 2, 2024
Webinar: Accelerating Python with GPUs | 1 | 70 | October 2, 2024
Building LLM-Powered Production Systems with NVIDIA NIM and Outerbounds | 1 | 10 | October 2, 2024
AI Uses Zero-Shot Learning to Find Existing Drugs for Treating Rare Diseases | 1 | 5 | October 2, 2024
Building Real-time Dermatology Classification with NVIDIA Clara AGX | 2 | 493 | October 2, 2024
Accelerating LLMs with llama.cpp on NVIDIA RTX Systems | 1 | 4 | October 2, 2024
Revolutionizing Cloud Gaming and Graphics Rendering with NVIDIA GDN | 1 | 9 | October 1, 2024
Evolving AI-Powered Game Development with Retrieval-Augmented Generation | 1 | 5 | October 1, 2024
Evaluating Medical RAG with NVIDIA AI Endpoints and Ragas | 1 | 8 | October 1, 2024
Improve Reinforcement Learning from Human Feedback with Leaderboard-Topping Reward Model | 1 | 5 | September 30, 2024
Advancing Quantum Algorithm Design with GPTs | 1 | 7 | September 30, 2024
NVIDIA GB200 NVL72 Delivers Trillion-Parameter LLM Training and Real-Time Inference | 14 | 1781 | September 27, 2024
AI Chatbot Delivers Multilingual Support to African Farmers | 1 | 6 | September 30, 2024
Just Released: NVIDIA HPC SDK v24.9 | 1 | 2 | September 30, 2024
Low Latency Inference Chapter 2: Blackwell is Coming. NVIDIA GH200 NVL32 with NVLink Switch Gives Signs of Big Leap in Time to First Token Performance | 1 | 5 | September 27, 2024
Harnessing Data with AI to Boost Zero Trust Cyber Defense | 1 | 3 | September 26, 2024
Spotlight: Montai Builds a Multimodal AI Platform for Drug Discovery Using NVIDIA NIM Microservices | 1 | 8 | September 26, 2024
Advancing the Accuracy-Efficiency Frontier with Llama-3.1-Nemotron-51B | 2 | 15 | September 26, 2024
Build a Digital Human Interface for AI Apps with an NVIDIA NIM Agent Blueprint | 1 | 5 | September 25, 2024
Deploying Accelerated Llama 3.2 from the Edge to the Cloud | 1 | 16 | September 25, 2024
Train Generative AI Models for Drug Discovery with NVIDIA BioNeMo Framework | 1 | 313 | September 25, 2024
How AI and Robotics are Driving Agricultural Productivity and Sustainability | 1 | 4 | September 25, 2024
Accelerating HPC Applications with NVIDIA Nsight Compute Roofline Analysis | 2 | 339 | September 25, 2024