Temporal Knowledge Distillation for On-device Audio Classification

Choi, Kwanghee; Kersner, Martin; Morton, Jacob; Chang, Buru

Computer Science > Sound

arXiv:2110.14131 (cs)

[Submitted on 27 Oct 2021 (v1), last revised 5 Feb 2022 (this version, v2)]

Title:Temporal Knowledge Distillation for On-device Audio Classification

Authors:Kwanghee Choi, Martin Kersner, Jacob Morton, Buru Chang

View PDF

Abstract:Improving the performance of on-device audio classification models remains a challenge given the computational limits of the mobile environment. Many studies leverage knowledge distillation to boost predictive performance by transferring the knowledge from large models to on-device models. However, most lack a mechanism to distill the essence of the temporal information, which is crucial to audio classification tasks, or similar architecture is often required. In this paper, we propose a new knowledge distillation method designed to incorporate the temporal knowledge embedded in attention weights of large transformer-based models into on-device models. Our distillation method is applicable to various types of architectures, including the non-attention-based architectures such as CNNs or RNNs, while retaining the original network architecture during inference. Through extensive experiments on both an audio event detection dataset and a noisy keyword spotting dataset, we show that our proposed method improves the predictive performance across diverse on-device architectures.

Comments:	ICASSP 2022
Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2110.14131 [cs.SD]
	(or arXiv:2110.14131v2 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2110.14131

Submission history

From: Buru Chang [view email]
[v1] Wed, 27 Oct 2021 02:29:54 UTC (77 KB)
[v2] Sat, 5 Feb 2022 15:44:59 UTC (79 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.SD

< prev | next >

new | recent | 2021-10

Change to browse by:

cs
cs.LG
eess
eess.AS

References & Citations

DBLP - CS Bibliography

listing | bibtex

Martin Kersner

export BibTeX citation

Computer Science > Sound

Title:Temporal Knowledge Distillation for On-device Audio Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Temporal Knowledge Distillation for On-device Audio Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators