Source author record

Alexander Kozlov

Alexander Kozlov appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computation and Language Computer Vision eess.AS eess.IV Machine Learning nucl-ex physics.atom-ph Sound

Catalog footprint

What is connected

4works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2020arXiv

Deep Speaker Embeddings for Far-Field Speaker Recognition on Short Utterances

Speaker recognition systems based on deep speaker embeddings have achieved significant performance in controlled conditions according to the results obtained for early NIST SRE (Speaker Recognition Evaluation) datasets. From the practical point of view, taking into account the increased interest in virtual assistants (such as Amazon Alexa, Google Home, AppleSiri, etc.), speaker verification on short utterances in uncontrolled noisy environment conditions is one of the most challenging and highly demanded tasks. This paper presents approaches aimed to achieve two goals: a) improve the quality of far-field speaker verification systems in the presence of environmental noise, reverberation and b) reduce the system qualitydegradation for short utterances. For these purposes, we considered deep neural network architectures based on TDNN (TimeDelay Neural Network) and ResNet (Residual Neural Network) blocks. We experimented with state-of-the-art embedding extractors and their training procedures. Obtained results confirm that ResNet architectures outperform the standard x-vector approach in terms of speaker verification quality for both long-duration and short-duration utterances. We also investigate the impact of speech activity detector, different scoring models, adaptation and score normalization techniques. The experimental results are presented for publicly available data and verification protocols for the VoxCeleb1, VoxCeleb2, and VOiCES datasets.

preprint2020arXiv

Neural Network Compression Framework for fast model inference

In this work we present a new framework for neural networks compression with fine-tuning, which we called Neural Network Compression Framework (NNCF). It leverages recent advances of various network compression methods and implements some of them, such as sparsity, quantization, and binarization. These methods allow getting more hardware-friendly models which can be efficiently run on general-purpose hardware computation units (CPU, GPU) or special Deep Learning accelerators. We show that the developed methods can be successfully applied to a wide range of models to accelerate the inference time while keeping the original accuracy. The framework can be used within the training samples, which are supplied with it, or as a standalone package that can be seamlessly integrated into the existing training code with minimal adaptations. Currently, a PyTorch version of NNCF is available as a part of OpenVINO Training Extensions at https://github.com/openvinotoolkit/nncf.

preprint2014arXiv

Optical atomic clocks with suppressed black body radiation shift

We study a wide range of neutral atoms and ions suitable for ultra-precise atomic optical clocks with naturally suppressed black body radiation shift of clock transition frequency. Calculations show that scalar polarizabilities of clock states cancel each other for at least one order of magnitude for considered systems. Results for calculations of frequencies, quadrupole moments of the states, clock transition amplitudes and natural widths of upper clock states are presented.

preprint2001arXiv

Coherent π^0 threshold production from the deuteron at Q^2 = 0.1 GeV^2/c^2

First data on coherent threshold π^0 electroproduction from the deuteron taken by the A1 Collaboration at the Mainz Microtron MAMI are presented. At a four-momentum transfer of q^2=-0.1 GeV^2/c^2 the full solid angle was covered up to a center-of-mass energy of 4 MeV above threshold. By means of a Rosenbluth separation the longitudinal threshold s wave multipole and an upper limit for the transverse threshold s wave multipole could be extracted and compared to predictions of Heavy Baryon Chiral Perturbation Theory.