Binary Speaker Embedding

The popular i-vector model represents speakers as low-dimensional continuous vectors (i-vectors), and is therefore a form of continuous speaker embedding. In this paper, we investigate binary speaker embedding, which transforms i-vectors into binary vectors (codes) using a hash function. We start from locality-sensitive hashing (LSH), a simple binarization approach in which binary codes are derived from a set of random hash functions. A potential problem with LSH is that the randomly sampled hash functions may be suboptimal. We therefore propose an improved Hamming distance learning approach, in which the hash function is learned by variable-sized block training that projects each dimension of the original i-vectors to variable-sized binary codes independently. Our experiments show that binary speaker embedding can deliver competitive or even better results on both speaker verification and identification tasks, while significantly reducing memory usage and computation cost.
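As a rough illustration of the LSH starting point described in the abstract, the sketch below binarizes real-valued vectors with random hyperplanes (one bit per hyperplane, set by the sign of the projection) and compares codes by Hamming distance. The abstract does not say which LSH family the authors use, so sign-random-projection is an assumption here, and the function names, toy dimensions, and noise level are all hypothetical. It is not the learned variable-sized block method proposed in the paper.

```python
import random


def make_hyperplanes(dim, n_bits, seed=0):
    """Sample n_bits random Gaussian hyperplanes (the random hash functions)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]


def binarize(vector, hyperplanes):
    """Map a real-valued vector to a binary code: bit k is 1 when the
    projection onto hyperplane k is non-negative, else 0."""
    return [
        1 if sum(w * x for w, x in zip(plane, vector)) >= 0 else 0
        for plane in hyperplanes
    ]


def hamming(code_a, code_b):
    """Count the positions where two equal-length binary codes differ."""
    return sum(a != b for a, b in zip(code_a, code_b))


if __name__ == "__main__":
    # Toy stand-ins for i-vectors; sizes and noise level are illustrative only.
    rng = random.Random(42)
    dim, n_bits = 50, 256
    speaker = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    same_speaker = [x + rng.gauss(0.0, 0.1) for x in speaker]  # small perturbation
    other_speaker = [rng.gauss(0.0, 1.0) for _ in range(dim)]  # unrelated vector

    planes = make_hyperplanes(dim, n_bits)
    code = binarize(speaker, planes)
    print("same speaker: ", hamming(code, binarize(same_speaker, planes)))
    print("other speaker:", hamming(code, binarize(other_speaker, planes)))
```

With this family of hash functions, the expected fraction of differing bits grows with the angle between the two input vectors, so a smaller Hamming distance suggests a higher cosine similarity. Each code in this toy setup needs only 256 bits of storage, and comparison reduces to counting differing bits, which is in line with the memory and computation savings the abstract mentions.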

Authors: Lantian Li, Dong Wang, Chao Xing, Kaimin Yu, Thomas Fang Zheng
Topics: Machine Learning, Sound

preprint / 2016
