Source author record

Ondřej Mokrý

Ondřej Mokrý appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

eess.AS Sound eess.SP math.OC

Catalog footprint

What is connected

7 works

4 topics

4 close collaborators

preprint · 2023 · arXiv

Algorithms for audio inpainting based on probabilistic nonnegative matrix factorization

Audio inpainting, i.e., the task of restoring missing or occluded audio signal samples, usually relies on sparse representations or autoregressive modeling. In this paper, we propose to structure the spectrogram with nonnegative matrix factorization (NMF) in a probabilistic framework. First, we treat the missing samples as latent variables, and derive two expectation-maximization algorithms for estimating the parameters of the model, depending on whether we formulate the problem in the time- or time-frequency domain. Then, we treat the missing samples as parameters, and we address this novel problem by deriving an alternating minimization scheme. We assess the potential of these algorithms for the task of restoring short- to middle-length gaps in music signals. Experiments reveal great convergence properties of the proposed methods, as well as competitive performance when compared to state-of-the-art audio inpainting techniques.

preprint · 2023 · arXiv

Multiple Hankel matrix rank minimization for audio inpainting

Sasaki et al. (2018) presented an efficient audio declipping algorithm based on the properties of Hankel-structured matrices constructed from time-domain signal blocks. We adapt their approach to the audio inpainting problem, where samples are missing from the signal. We analyze the algorithm and provide modifications, some of which lead to improved performance. Overall, the new algorithms perform reasonably well for speech signals, but they are not competitive in the case of music signals.
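As a rough illustration of the structure this abstract refers to, the sketch below builds a Hankel matrix from a time-domain block and compares its numerical rank for a pure sinusoid and for white noise. It is not the algorithm of Sasaki et al. or of the paper; the helper name `hankel_from_block`, the block length, the number of rows and the rank tolerance are all assumptions chosen for the example.

```python
import numpy as np

def hankel_from_block(block, rows):
    """Return the rows x cols Hankel matrix with H[i, j] = block[i + j].

    Hypothetical helper for illustration; cols = len(block) - rows + 1.
    """
    block = np.asarray(block, dtype=float)
    cols = block.size - rows + 1
    if rows < 1 or cols < 1:
        raise ValueError("rows must be between 1 and len(block)")
    # Index grid i + j selects the anti-diagonal-constant structure.
    idx = np.arange(rows)[:, None] + np.arange(cols)[None, :]
    return block[idx]

n = np.arange(64)
sinusoid = np.cos(2 * np.pi * 0.05 * n + 0.3)   # one real sinusoid
rng = np.random.default_rng(0)
noise = rng.standard_normal(64)                 # unstructured signal

H_sin = hankel_from_block(sinusoid, rows=16)
H_noise = hankel_from_block(noise, rows=16)

# A real sinusoid is a sum of two complex exponentials, so its Hankel
# matrix has rank 2; a noise block gives a full-rank matrix.
rank_sin = np.linalg.matrix_rank(H_sin, tol=1e-8)
rank_noise = np.linalg.matrix_rank(H_noise, tol=1e-8)
print(rank_sin, rank_noise)
```

The low rank of the sinusoidal block is what makes rank minimization a plausible prior for filling in missing samples of signals that are locally close to a sum of a few sinusoids.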

preprint · 2021 · arXiv

Audio Dequantization Using (Co)Sparse (Non)Convex Methods

The paper deals with the hitherto neglected topic of audio dequantization. It reviews the state-of-the-art sparsity-based approaches and proposes several new methods. Convex as well as non-convex approaches are included, and all the presented formulations come in both the synthesis and analysis variants. In the experiments the methods are evaluated using the signal-to-distortion ratio (SDR) and PEMO-Q, a perceptually motivated metric.
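To make the evaluation setting concrete, here is a minimal sketch of a uniform quantizer and of one common definition of the signal-to-distortion ratio, SDR = 10 log10(||x||^2 / ||x - y||^2). The quantizer design (mid-riser, signal range [-1, 1)), the function names and the test tone are assumptions for illustration and are not taken from the paper; PEMO-Q is not covered.

```python
import numpy as np

def quantize(x, bits):
    """Mid-riser uniform quantizer for signals in [-1, 1) (illustrative)."""
    delta = 2.0 ** (1 - bits)                 # 2**bits levels across [-1, 1]
    q = delta * (np.floor(x / delta) + 0.5)   # centre of the cell containing x
    return np.clip(q, -1 + delta / 2, 1 - delta / 2)

def sdr_db(reference, estimate):
    """SDR in dB: 10*log10(||reference||^2 / ||reference - estimate||^2)."""
    reference = np.asarray(reference, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    return 10 * np.log10(np.sum(reference**2) / np.sum((reference - estimate)**2))

# Assumed test signal: 0.25 s of a 440 Hz tone sampled at 16 kHz.
t = np.arange(4000) / 16000.0
x = 0.9 * np.sin(2 * np.pi * 440.0 * t)

sdr_4 = sdr_db(x, quantize(x, 4))   # coarse quantization
sdr_8 = sdr_db(x, quantize(x, 8))   # finer quantization
print(f"4 bits: {sdr_4:.1f} dB, 8 bits: {sdr_8:.1f} dB")
```

A dequantization method would be judged by how far it raises the SDR of its output above the SDR of the quantized signal itself.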

preprint · 2020 · arXiv

A Proper version of Synthesis-based Sparse Audio Declipper

Methods based on sparse representation have found great use in the recovery of audio signals degraded by clipping. The state of the art in declipping has been achieved by the SPADE algorithm by Kitić et al. (LVA/ICA 2015). Our recent study (LVA/ICA 2018) has shown that although the original S-SPADE can be improved such that it converges significantly faster than A-SPADE, its restoration quality is significantly worse. In the present paper, we propose a new version of S-SPADE. Experiments show that the novel version of S-SPADE outperforms its old version in terms of restoration quality, and that it is comparable with A-SPADE while being slightly faster.

preprint · 2020 · arXiv

Flexible framework for audio reconstruction

The paper presents a unified, flexible framework for the tasks of audio inpainting, declipping, and dequantization. The concept is further extended to cover analogous degradation models in a transformed domain, e.g. quantization of the signal's time-frequency coefficients. The task of reconstructing an audio signal from degraded observations in two different domains is formulated as an inverse problem, and several algorithmic solutions are developed. The viability of the presented concept is demonstrated on an example where audio reconstruction from partial and quantized observations of both the time-domain signal and its time-frequency coefficients is carried out.
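One way to read the "unified" aspect in the time domain is that each degradation confines every sample to an interval, so the signals consistent with the observations form a box and projecting onto it is an elementwise clip. The sketch below follows that reading; it is an assumption-laden illustration and not the paper's algorithm, and the function names, the clipping level `theta` and the step `delta` are hypothetical.

```python
import numpy as np

def inpainting_bounds(observed, mask):
    """mask is True where a sample is reliable; missing samples are free."""
    lo = np.where(mask, observed, -np.inf)
    hi = np.where(mask, observed, np.inf)
    return lo, hi

def declipping_bounds(clipped, theta):
    """Samples at +/-theta are treated as clipped, all others as reliable."""
    lo = np.where(clipped >= theta, theta,
                  np.where(clipped <= -theta, -np.inf, clipped))
    hi = np.where(clipped <= -theta, -theta,
                  np.where(clipped >= theta, np.inf, clipped))
    return lo, hi

def dequantization_bounds(quantized, delta):
    """Each original sample lies within half a step of its quantized value."""
    return quantized - delta / 2, quantized + delta / 2

def project(z, bounds):
    """Elementwise projection of z onto the box given by (lo, hi)."""
    lo, hi = bounds
    return np.clip(z, lo, hi)

x = np.array([0.2, 0.9, -0.7, 0.1, -0.95])      # assumed clean signal

mask = np.array([True, False, False, True, True])
b_inp = inpainting_bounds(np.where(mask, x, 0.0), mask)

theta = 0.6
b_clip = declipping_bounds(np.clip(x, -theta, theta), theta)

delta = 0.5
b_quant = dequantization_bounds(delta * np.round(x / delta), delta)

# An arbitrary candidate is pulled back into each consistency set.
candidate = np.zeros_like(x)
print(project(candidate, b_inp))
print(project(candidate, b_clip))
print(project(candidate, b_quant))
```

In a full reconstruction method, a projection of this kind would be combined with a signal prior such as sparsity of the time-frequency coefficients; that part is omitted here.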

preprint · 2020 · arXiv

S-SPADE Done Right: Detailed Study of the Sparse Audio Declipper Algorithms

This technical report shows and discusses in detail how Sparse Audio Declipper (SPADE) algorithms are derived from the signal model using the ADMM approach. The analysis version (A-SPADE) of Kitić et al. (LVA/ICA 2015) is derived and justified. The synthesis version (S-SPADE) of the same research team is shown to solve a different optimization task than intended. This issue is corrected in this report, leading to the new S-SPADE algorithm, which is in line with A-SPADE.

preprint · 2019 · arXiv

Introducing SPAIN (SParse Audio INpainter)

A novel sparsity-based algorithm for audio inpainting is proposed. It is an adaptation of the SPADE algorithm by Kitić et al., originally developed for audio declipping, to the task of audio inpainting. The new SPAIN (SParse Audio INpainter) comes in synthesis and analysis variants. Experiments show that both A-SPAIN and S-SPAIN outperform other sparsity-based inpainting algorithms. Moreover, A-SPAIN performs on a par with the state-of-the-art method based on linear prediction in terms of the SNR, and, for larger gaps, SPAIN is even slightly better in terms of the PEMO-Q psychoacoustic criterion.
