WHAT YOU HEAR IS WHAT YOU SEE: AUDIO QUALITY FROM IMAGE QUALITY METRICS

Research output: Contribution to conferenceConference Paperpeer-review

Abstract

In this study, we investigate the feasibility of utilizing state-of-the-art perceptual image metrics for evaluating audio signals by representing them as spectrograms. The encouraging outcome of the proposed approach is based on the similarity between the neural mechanisms in the auditory and visual pathways. Furthermore, we customise one of the metrics which has a psychoacoustically plausible architecture to account for the peculiarities of sound signals. We evaluate the effectiveness of our proposed metric and several baseline metrics using a music dataset, with promising results in terms of the correlation between the metrics and the perceived quality of audio as rated by human evaluators.
Original languageEnglish
Pages367-370
Number of pages4
Publication statusPublished - 7 Sept 2023
Event26th International Conference on Digital Audio Effects, DAFx 2023 - Copenhagen, Denmark
Duration: 4 Sept 20237 Sept 2023
https://dafx23.create.aau.dk/

Conference

Conference26th International Conference on Digital Audio Effects, DAFx 2023
Country/TerritoryDenmark
CityCopenhagen
Period4/09/237/09/23
Internet address

Bibliographical note

Publisher Copyright:
© 2023 Tashi Namgyal et al.

Fingerprint

Dive into the research topics of 'WHAT YOU HEAR IS WHAT YOU SEE: AUDIO QUALITY FROM IMAGE QUALITY METRICS'. Together they form a unique fingerprint.

Cite this