Courses from 1000+ universities
Two years after its first major layoff round, Coursera announces another, impacting 10% of its workforce.
600 Free Google Certifications
Graphic Design
Data Analysis
Digital Marketing
El rol de la digitalización en la transición energética
First Step Korean
Supporting Successful Learning in Primary School
Organize and share your learning with Class Central Lists.
View our Lists Showcase
Dive into advanced techniques for handling long-context scenarios in Large Language Models, exploring efficient methods and architectural innovations for extended sequence processing.
Dive into advanced techniques for handling extended context in Large Language Models, exploring efficiency strategies and implementation methods for enhanced AI processing capabilities.
Explore neural architecture search techniques, including hardware-aware approaches and joint optimization of hardware and neural architectures. Learn applications and advancements in efficient deep learning design.
Explore advanced neural network quantization techniques, including post-training quantization, quantization-aware training, binary/ternary quantization, and mixed-precision quantization.
Explore neural network quantization, including numeric data types, K-means-based quantization, and linear quantization for efficient deep learning on resource-constrained devices.
Explore neural network pruning techniques, including sensitivity scans, automatic pruning, and the lottery ticket hypothesis. Learn to fine-tune sparse networks and understand system support for sparsity.
Explore advanced neural architecture search techniques for efficient deep learning on resource-constrained devices. Gain insights into model optimization for mobile and IoT applications.
Explore Atomique, a novel quantum compiler for reconfigurable neutral atom arrays, enhancing scalability and efficiency in quantum computing through strategic atom movements and optimized gate scheduling.
Explore activation-aware weight quantization for compressing and accelerating large language models. Learn about AWQ's innovative approach to LLM optimization.
Explore efficient GPU-based sparse convolution with TorchSparse++, an advanced framework for training and inference in deep learning applications.
Explore innovative techniques for deploying LLMs in streaming applications, focusing on the "attention sink" phenomenon and efficient KV cache management.
Explore efficient machine learning techniques for deploying neural networks on resource-constrained devices. Gain hands-on experience with mobile AI and quantum machine learning.
Explore efficient machine learning techniques for deploying neural networks on resource-constrained devices, covering model compression, pruning, quantization, and more.
Explore knowledge distillation techniques for efficient machine learning, covering theory, applications, and practical implementation strategies.
Get personalized course recommendations, track subjects and courses with reminders, and more.