facebook pixel
@nvidia
Check out HALP (Hardware-Aware Latency Pruning), a new method designed to adapt convolutional neural networks (CNNs) and #transformer-based architectures for real-time performance. HALP optimizes pre-trained models to maximize compute utilization. In testing with NVIDIA DRIVE Orin™ on the road, it consistently outperformed alternative approaches. 00:00:00 - Introducing Hardware-Aware Latency Pruning (HALP) 00:00:29 - Common Model Optimization 00:00:59 - DNN Pruning 00:01:21 - Hardware Aware Latency Pruning 00:01:31 - Classification Tasks 00:01:37 - 3D Object Detection 00:02:04 - HALP with Transformers 00:03:09 - To learn more, visit our GitHub and project pages GitHub: nvda.ws/3rlM7mo Product page: nvda.ws/46961je Watch the full series here: nvda.ws/3LsSgnH Learn more about DRIVE Labs: nvda.ws/36r5c6t Follow us on social: Twitter: nvda.ws/3LRdkSs LinkedIn: nvda.ws/3wI4kue #NVIDIADRIVE

 5.9k

 152

 5.9k

Credits
    Tags, Events, and Projects
    • transformer
    • nvidiadrive