Abstract
Deep learning is now playing an important role in enhancing the performance of conventional hybrid video codecs. These learning-based methods typically require diverse and representative training material for optimization in order to achieve model generalization and optimal coding performance. However, existing datasets either offer limited content variability or come with restricted licensing terms constraining their use to research purposes only. To address these issues, we propose a new training dataset, named BVI-AOM, which contains 956 uncompressed sequences at various resolutions from 270p to 2160p, covering a wide range of content and texture types. The dataset comes with more flexible licensing terms and offers competitive performance when used as a training set for optimizing deep video coding tools. The experimental results demonstrate that when used as a training set to optimize two popular network architectures for two different coding tools, the proposed dataset leads to additional bitrate savings of up to 0.29 and 2.98 percentage points in terms of PSNR-Y and VMAF, respectively, compared to an existing training dataset, BVI-DVC, which has been widely used for deep video coding. The BVI-AOM dataset is available at https://github.com/fan-aaron-zhang/bvi-aom.
Original language | English |
---|---|
Title of host publication | 2024 IEEE International Conference on Visual Communications and Image Processing |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Number of pages | 5 |
ISBN (Electronic) | 9798331529543 |
ISBN (Print) | 9798331529550 |
DOIs | |
Publication status | Published - 27 Jan 2025 |
Event | 2024 IEEE international Conference on Visual Communications and Image Processing - International Conference Center, Waseda University, Tokyo, Japan Duration: 8 Dec 2024 → 11 Dec 2024 https://www.vcip2024.org/ |
Publication series
Name | IEEE Visual Communications and Image Processing (VCIP) |
---|---|
Publisher | IEEE |
ISSN (Print) | 1018-8770 |
ISSN (Electronic) | 2642-9357 |
Conference
Conference | 2024 IEEE international Conference on Visual Communications and Image Processing |
---|---|
Abbreviated title | VCIP 2024 |
Country/Territory | Japan |
City | Tokyo |
Period | 8/12/24 → 11/12/24 |
Internet address |
Bibliographical note
Publisher Copyright:© 2024 IEEE.
Keywords
- Deep video compression
- BVI-AOM
- training dataset
- neural network based video coding
Fingerprint
Dive into the research topics of 'BVI-AOM: A New Training Dataset for Deep Video Compression Optimization'. Together they form a unique fingerprint.Equipment
-
HPC (High Performance Computing) and HTC (High Throughput Computing) Facilities
Alam, S. R. (Manager), Williams, D. A. G. (Manager), Eccleston, P. E. (Manager) & Greene, D. (Manager)
Facility/equipment: Facility
-
Research Data Storage Facility (RDSF)
Alam, S. R. (Manager), Williams, D. A. G. (Manager) & Eccleston, P. E. (Manager)
IT ServicesFacility/equipment: Facility