tinyML Talks: A Practical Guide to Neural Network Quantization
“A Practical Guide to Neural Network Quantization“
Marios Fournarakis
Deep Learning Researcher
Qualcomm AI Research, Amsterdam
Neural network quantization is an effective way of reducing the power requirements and latency of neural network inference while maintaining high accuracy. The success of quantization has led to a large volume of literature and competing methods in recent years, and Qualcomm has been at the forefront of this research. This talk aims to cut through the noise and introduce a practical guide for quantizing neural networks inspired by our research and expertise at Qualcomm. We will begin with an introduction to quantization and fixed-point accelerators for neural network inference. We will then consider implementation pipelines for quantizing neural networks with near floating-point accuracy for popular neural networks and benchmarks. Finally, you will leave this talk with a set of diagnostic and debugging tools to address common neural network quantization issu
14 views
18
5
2 years ago 00:10:23 18
How TinyML Gives us Spider-Man Powers | Emelie Eldracher | TEDxMIT
2 years ago 00:51:33 2
#100 Embedded Machine Learning on Edge Devices(with Daniel Situnayake)
3 years ago 00:59:44 9
tinyML Talks: Advanced Anomaly Detection Made Easy
3 years ago 01:01:09 3
tinyML Talks: Energy-Efficiency and Security for TinyML and EdgeAI: A Cross-Layer Approach
3 years ago 01:04:11 4
tinyML Talks Pakistan: FFConv: An FPGA-based Accelerator for Fast Convolution Layers in...
3 years ago 00:53:51 2
tinyML Talks: Oculi is putting the human eye in A.I.
3 years ago 00:28:21 2
tinyML Asia 2021 Justin Kao: A lightweight face detection method working with Himax Ultra-Low...
3 years ago 00:23:01 1
tinyML Asia 2021 Haochen Xie: An approach to dynamically integrate heterogenous AI components...
3 years ago 00:29:39 1
tinyML Asia 2021 Joshua Chang: Sensor Fusion using Machine Learning: Smart Forehead Temperature...
3 years ago 01:01:24 1
tinyML Talks: The Multilingual Spoken Words Corpus, a Massive Keyword Spotting Dataset
3 years ago 00:34:03 9
tinyML Talks Toronto Part 1: Evolutionary Needs of TinyML
3 years ago 00:17:35 1
tinyML Talks Toronto Part 2: tinyMLedu: widening access to tinyML education and resources
3 years ago 00:27:20 54
tinyML Talks Toronto Part 3: tinyML4STEM: using tinyML for Neuroscience in K12
3 years ago 01:06:51 1
tinyML Talks India: Single Lead ECG Classification On Wearable and Implantable Devices
3 years ago 00:26:53 4
tinyML Asia 2021 Yihong Wu: Lightweight visual localization with deep learning
3 years ago 00:56:12 6
tinyML Talks: CFU Playground: Customize Your ML Processor for Your Specific TinyML Model
3 years ago 00:49:23 5
tinyML Asia 2021 Chanwoo Kim: A review of on-device fully neural end-to-end speech recognition...
3 years ago 01:10:29 3
tinyML Talks: The Value of Edge AI for Industrial Applications: onsemi and SensiML IIoT Solutions
3 years ago 00:53:29 1
Pete Warden — Practical Applications of TinyML
3 years ago 01:00:43 2
tinyML Talks: AutoML + TinyML with Edge Impulse’s EON Tuner
3 years ago 00:56:06 10
tinyML Talks Morocco: Enabling Ultra-low Power Always-On Computer Vision at Qualcomm
3 years ago 01:01:21 3
tinyML Talks: Verification of ML-based AI systems and its applicability in Edge ML
3 years ago 01:01:20 14
tinyML Talks: A Practical Guide to Neural Network Quantization
3 years ago 00:16:49 4
EMEA 2021 tiny Talks: Building Heterogeneous TinyML Pipelines