Pretrained Models

Speaker Recognition

ResNet18 trained on VoxCeleb

Download: Baidu, Google Drive

I followed the ideas in the VoxCeleb2 paper (arXiv:1806.05622) to train this model. The differences between this model and the one in the paper are:

| | Res18 in this repo | Res34 in paper |
|---|---|---|
| Trained on | VoxCeleb2 | VoxCeleb2 |
| Input spec size | 224x224 | 512x300 |
| Eval on | Random 9500+ pair samples from VoxCeleb1 | Original VoxCeleb1 test set |
| Metric | Accuracy: 0.932656 ± 0.005187 | EER: 0.0504 |
| Framework | MXNet Gluon | MatConvNet |
| ROC | img1 | |
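
The two metric values above are not directly comparable: this repo reports pair-classification accuracy, while the paper reports equal error rate (EER). As a rough illustration of how the two relate, here is a minimal sketch that computes both from the same verification pairs. It assumes each pair has a similarity score (higher means "same speaker") and a binary label. The function names `equal_error_rate` and `accuracy_at` are hypothetical and are not part of this repo's evaluation code.

```python
import numpy as np


def equal_error_rate(scores, labels):
    """Return (eer, threshold) for a set of verification pairs.

    scores: one similarity per pair, higher means "same speaker" (assumption).
    labels: 1 for same-speaker pairs, 0 for different-speaker pairs.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=int)
    n_pos = labels.sum()
    n_neg = len(labels) - n_pos

    best = None
    # Try every observed score as a decision threshold.
    for t in np.unique(scores):
        accept = scores >= t
        far = np.sum(accept & (labels == 0)) / n_neg   # false accept rate
        frr = np.sum(~accept & (labels == 1)) / n_pos  # false reject rate
        gap = abs(far - frr)
        # Keep the threshold where the two error rates are closest.
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return float(best[1]), float(best[2])


def accuracy_at(scores, labels, threshold):
    """Fraction of pairs classified correctly at a fixed threshold."""
    pred = (np.asarray(scores, dtype=float) >= threshold).astype(int)
    return float(np.mean(pred == np.asarray(labels, dtype=int)))


if __name__ == "__main__":
    # Toy data for illustration only, not results from this model.
    scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2]
    labels = [1, 1, 0, 1, 0, 0]
    eer, thr = equal_error_rate(scores, labels)
    print("EER:", eer, "threshold:", thr)
    print("accuracy:", accuracy_at(scores, labels, thr))
```

When the same-speaker and different-speaker pairs are balanced, accuracy at the EER threshold is approximately 1 - EER, which gives a rough way to read the two columns side by side. The evaluation sets differ, though, so the comparison remains approximate.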