Skip navigation
Please use this identifier to cite or link to this item: https://libeldoc.bsuir.by/handle/123456789/54283
Title: Improving Spatial Resolution of First-order Ambisonics Using Sparse MDCT Representation
Authors: Likhachov, D.
Petrovsky, N.
Azarov, E.
Keywords: материалы конференций;spatial audio;ambisonics;upmixing;spatial resolution;sparse representation;FFT;MDCT
Issue Date: 2023
Publisher: BSU
Citation: Likhachov, D. Improving Spatial Resolution of First-order Ambisonics Using Sparse MDCT Representation / D. Likhachov, N. Petrovsky, E. Azarov // Pattern Recognition and Information Processing (PRIP'2023) = Распознавание образов и обработка информации (2023) : Proceedings of the 16th International Conference, October 17–19, 2023, Minsk, Belarus / United Institute of Informatics Problems of the National Academy of Sciences of Belarus. – Minsk, 2023. – P. 122–125.
Abstract: The paper presents a method for improving spatial resolution of first-order ambisonic audio. The method is based on time/frequency decomposition of the audio with subsequent extraction of a directed plane wave from each frequency component. The method develops the basic ideas of high angular resolution planewave expansion (HARPEX) and directional audio coding (DirAC) taking advantage of real valued sparse decomposition. Real-valued frequency components as opposed to complex-valued introduce simpler and more stable direction of arrival estimates, while sparse decomposition introduces an accurate and unified approach to describing sounds of different nature from transient to tonal sounds.
URI: https://libeldoc.bsuir.by/handle/123456789/54283
Appears in Collections:Pattern Recognition and Information Processing (PRIP'2023) = Распознавание образов и обработка информации (2023)

Files in This Item:
File Description SizeFormat 
Likhachov_Improving.pdf422.22 kBAdobe PDFView/Open
Show full item record Google Scholar

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.