M. Senoussaoui, P. Kenny, N. Dehak, and P. Dumouchel, An i-vector extractor suitable for speaker recognition with both microphone and telephone speech, p.6, 2010.

L. van der Maaten and G. Hinton, Visualizing data using t-SNE, Journal of Machine Learning Research, vol.9, pp.2579-2605, 2008.

J. H. Hansen, Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition, Speech Communication, vol.20, issue.1-2, pp.151-173, 1996.
DOI : 10.1016/S0167-6393(96)00050-7

URL : http://crss.utdallas.edu/Publications/Hansen1996a.pdf

F. Kelly, A. Drygajlo, and N. Harte, Speaker verification in score-ageing-quality classification space, Computer Speech & Language, vol.27, issue.5, pp.1068-1084, 2013.
DOI : 10.1016/j.csl.2012.12.005

URL : http://www.mee.tcd.ie/%7Esigmedia/pmwiki/uploads/Main.Publications/kelly_sv_score_age_quality.pdf

N. Dehak, Z. N. Karam, D. A. Reynolds, R. Dehak, W. M. Campbell et al., A channel-blind system for speaker verification, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4536-4539, 2011.
DOI : 10.1109/ICASSP.2011.5947363

G. Liu, Y. Lei, and J. H. Hansen, Robust feature front-end for speaker identification, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012.
DOI : 10.1109/ICASSP.2012.6288853

S. J. Prince and J. H. Elder, Probabilistic Linear Discriminant Analysis for Inferences About Identity, 2007 IEEE 11th International Conference on Computer Vision, pp.1-8, 2007.
DOI : 10.1109/ICCV.2007.4409052

P. Kenny, Bayesian speaker verification with heavy-tailed priors, p.14, 2010.

A. O. Hatch, S. S. Kajarekar, and A. Stolcke, Within-class covariance normalization for SVM-based speaker recognition, 2006.

F. Richardson, D. Reynolds, and N. Dehak, Deep Neural Network Approaches to Speaker and Language Recognition, IEEE Signal Processing Letters, vol.22, issue.10, pp.1671-1675, 2015.
DOI : 10.1109/LSP.2015.2420092

M. Senoussaoui, P. Kenny, T. Stafylakis, and P. Dumouchel, A Study of the Cosine Distance-Based Mean Shift for Telephone Speech Diarization, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.1, pp.217-227, 2014.
DOI : 10.1109/TASLP.2013.2285474

J. Goldberger, G. E. Hinton, S. T. Roweis, and R. Salakhutdinov, Neighbourhood components analysis, Advances in Neural Information Processing Systems, pp.513-520, 2004.

O. Rippel, M. Paluri, P. Dollar, and L. Bourdev, Metric learning with adaptive density discrimination, International Conference on Learning Representations (ICLR), 2015. URL http

P. Kenny, G. Boulianne, and P. Dumouchel, Eigenvoice modeling with sparse training data, IEEE Transactions on Speech and Audio Processing, vol.13, issue.3, pp.345-354, 2005.
DOI : 10.1109/TSA.2004.840940

G. Liu and J. H. Hansen, An Investigation into Back-end Advancements for Speaker Recognition in Multi-Session and Noisy Enrollment Scenarios, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.12, pp.1978-1992, 2014.
DOI : 10.1109/TASLP.2014.2352154

N. Dehak, P. A. Torres-Carrasquillo, D. A. Reynolds, and R. Dehak, Language recognition via i-vectors and dimensionality reduction, pp.857-860, 2011.

M. H. Bahari, M. McLaren, H. Van hamme, and D. A. van Leeuwen, Speaker age estimation using i-vectors, Engineering Applications of Artificial Intelligence, vol.34, pp.99-108, 2014.
DOI : 10.1016/j.engappai.2014.05.003

S. O. Sadjadi, S. Ganapathy, and J. Pelecanos, The IBM 2016 Speaker Recognition System, Odyssey 2016, pp.174-180, 2016.
DOI : 10.21437/Odyssey.2016-25

URL : http://arxiv.org/pdf/1602.07291

S. Ioffe, Probabilistic Linear Discriminant Analysis, European Conference on Computer Vision (ECCV), pp.531-542, 2006.

N. Lack, Non-Parametric Discriminant Analysis, Medizinische Informationsverarbeitung und Epidemiologie im Dienste der Gesundheit, pp.320-322, 1988.
DOI : 10.1007/978-3-642-83520-9_60

Z. Liang, Y. Li, and S. Xia, Adaptive weighted learning for linear regression problems via Kullback–Leibler divergence, Pattern Recognition, vol.46, issue.4, pp.1209-1219, 2013.
DOI : 10.1016/j.patcog.2012.10.017

W. Yang, K. Wang, and W. Zuo, Fast neighborhood component analysis, Neurocomputing, vol.83, pp.31-37, 2012.
DOI : 10.1016/j.neucom.2011.10.021

K. Q. Weinberger and L. K. Saul, Distance metric learning for large margin nearest neighbor classification, Journal of Machine Learning Research, vol.10, pp.207-244, 2009.

L. van der Maaten and K. Weinberger, Stochastic triplet embedding, 2012 IEEE International Workshop on Machine Learning for Signal Processing, pp.1-6, 2012.
DOI : 10.1109/MLSP.2012.6349720

D. Chen, X. Cao, L. Wang, F. Wen, and J. Sun, Bayesian Face Revisited: A Joint Formulation, European Conference on Computer Vision, pp.566-579, 2012.
DOI : 10.1007/978-3-642-33712-3_41

URL : http://research.microsoft.com/en-us/um/people/jiansun/papers/ECCV12_BayesianFace.pdf

H. O. Song, Y. Xiang, S. Jegelka, and S. Savarese, Deep Metric Learning via Lifted Structured Feature Embedding, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.4004-4012, 2016.
DOI : 10.1109/CVPR.2016.434

URL : http://dspace.mit.edu/bitstream/1721.1/113397/1/Jegelka_Deep%20metric.pdf

M. A. Bautista, A. Sanakoyeu, E. Tikhoncheva, and B. Ommer, CliqueCNN: Deep unsupervised exemplar learning, Advances In Neural Information Processing Systems, pp.3846-3854, 2016.

C. Li, X. Ma, B. Jiang, X. Li, X. Zhang et al., Deep Speaker: an end-to-end neural speaker embedding system, arXiv preprint arXiv:1705.02304, 2017.

L. Bottou, Large-scale machine learning with stochastic gradient descent, 19th International Conference on Computational Statistics, pp.177-186, 2010.
DOI : 10.1007/978-3-7908-2604-3_16

URL : http://leon.bottou.org/publications/pdf/compstat-2010.pdf

A. Shrivastava, A. Gupta, and R. Girshick, Training Region-Based Object Detectors with Online Hard Example Mining, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.761-769, 2016.
DOI : 10.1109/CVPR.2016.89

URL : http://arxiv.org/pdf/1604.03540

W. Yang, L. Jin, D. Tao, Z. Xie, and Z. Feng, DropSample: A new training method to enhance deep convolutional neural networks for large-scale unconstrained handwritten Chinese character recognition, Pattern Recognition, vol.58, pp.190-203, 2016.
DOI : 10.1016/j.patcog.2016.04.007

URL : http://arxiv.org/pdf/1505.05354

O. Canévet and F. Fleuret, Large Scale Hard Sample Mining with Monte Carlo Tree Search, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.5128-5137, 2016.
DOI : 10.1109/CVPR.2016.554

C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval, 2008.
DOI : 10.1017/CBO9780511809071

G. Montavon, G. Orr, and K.-R. Müller, Neural Networks: Tricks of the Trade, 2012.
DOI : 10.1007/978-3-642-35289-8

Z. Allen-Zhu, Y. Yuan, and K. Sridharan, Exploiting the structure: Stochastic gradient methods using raw clusters, Advances in Neural Information Processing Systems (NIPS), 2016.

L. Shen, Z. Lin, and Q. Huang, Relay Backpropagation for Effective Learning of Deep Convolutional Neural Networks, European Conference on Computer Vision (ECCV), pp.467-482, 2016.

URL : http://arxiv.org/pdf/1512.05830

R. Hadsell, S. Chopra, and Y. LeCun, Dimensionality Reduction by Learning an Invariant Mapping, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.1735-1742, 2006.
DOI : 10.1109/CVPR.2006.100

URL : http://www.cs.nyu.edu/~raia/docs/cvpr06.pdf

D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek et al., The Kaldi speech recognition toolkit, IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, no. EPFL-CONF-192584, 2011.

J. V. Davis, B. Kulis, P. Jain, S. Sra, and I. S. Dhillon, Information-theoretic metric learning, Proceedings of the 24th international conference on Machine learning, ICML '07, pp.209-216, 2007.
DOI : 10.1145/1273496.1273523

URL : http://www.cs.utexas.edu/users/jdavis/papers/jdavis_nips06_itml.pdf

C. Huang, C. C. Loy, and X. Tang, Local similarity-aware deep feature embedding, Advances in Neural Information Processing Systems, pp.1262-1270, 2016.