Theoretical vulnerabilities in map speaker adaptation

Tetsushi Ohki, Akira Otsuka
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.2042-2046, March 2017.
[ Paper ]

Abstract

We analyze the theoretical vulnerability of maximum a posteriori(MAP) speaker adaptation, which is widely used in practical speaker recognition systems. First, we proved that there exist a set of feature vectors, what are called wolves, which can impersonate almost all the registered speakers with probability asymptotically close to 1 with at most two trials. Second, our experiment shows that the wolves with appropriate parameters achieved 0.99 of successful impersonation rate on Spear speaker recognition toolkit with ATR speech database.

Updated: