Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Profiling Deep Learning Applications with NVIDIA Tools Extension (NVTX)

Nvidia via YouTube

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Discover how to leverage the NVIDIA Tools Extension Library (NVTX) to enhance deep learning application profiling in this 24-minute webinar from Nvidia. Explore techniques for annotating code to enrich profiler timelines, gaining insights into network performance optimization. Learn about the Nsight product family, NVTX syntax, and best practices for memory pinning and asynchronous data copy. Dive into practical examples of backward propagation with and without Automatic Mixed Precision (AMP), and understand how to identify the right performance knobs to tune your deep learning applications effectively.

Syllabus

Intro
A TYPICAL SCENARIO
HOW TO SPEED-UP NETWORK
WHAT'S GOING ON?
NSIGHT PRODUCT FAMILY
NSIGHT SYSTEMS Overview
NVIDIA TOOLS EXTENSION (NVTX)
ADDING NVTX START AND END OPERATIONS
NSIGHT SYSTEMS PROFILE
SIMPLE TRAINING PROGRAM
NVTX SYNTAX Backward propagation with or without using AMP
2ND PROFILING SESSION
FINDING THE RIGHT KNOB
BEST PRACTICES (4)
MEMORY PINNING
ASYNCHRONOUS DATA COPY

Taught by

NVIDIA Developer

Reviews

Start your review of Profiling Deep Learning Applications with NVIDIA Tools Extension (NVTX)

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.