Publications and Awards

PUBLICATIONS (Journals)

[J6] A. Avila, D. O’Shaughnessy, T. Falk, Automatic Speaker Verification from Affective Speech Using Gaussian Mixture Model Based Estimation of Neutral Speech Characteristics, J. Speech Communicationn, vol. 132, p. 21-31, 2021.

[J5] A. Avila, J. Alam, F. Prado, D. O’Shaughnessy, T. Falk, On the Use of Blind Channel Response Estimation and a Residual Neural Network to Detect Physical Access Attacks to Speaker Verification Systems, J. Computer Speech & Language, vol. 66, 2021.

[J4] A. Avila, D. O’Shaughnessy, T. Falk, Non-intrusive Speech Quality Prediction Based on the Blind Estimation of Clean Speech and the i-vector Framework, J. Quality and User Experience, vol. 5, no 1, p. 1-15, 2020.

[J3] A. Avila, J. Alam, D. O’Shaughnessy, T. Falk, On the Use of the I-vector Speech Representation for Instrumental Quality Measurement, J. Quality and User Experience, 2020, vol. 5, no 1, pp. 1-14, DOI: https://doi.org/10.1007/s41233-020-00036-z.

[J2] B. Sadou, A. Lahoulou, T. Bouden, A. Avila, T. Falk, Z. Akhtar, Free-Reference Image Quality Assessment Framework using Metrics Fusion and Dimensionality Reduction, Signal & Image Processing, 2019, Vol. 10, No. 5, pp. 1-14, DOI: 10.5121/sipij.2019.10501.

[J1] A. Avila, Z. Akhtar, J. Santos, D. O’Shaughnessy, T. Falk, Feature Pooling for Spontaneous Speech-Based Emotion Recognition in-the-wild, IEEE Transaction on Affective Computing, 2018, pp. 1-12, DOI: 10.1109/TAFFC.2018.2858255.

PUBLICATIONS (Conferences)

[C17] A.Avila, etal. Low-bit Shift Network for End-to-End Spoken Language Understanding, Interspeech 2022, DOI: 10.21437/Interspeech.2022-760.

[C16] N.Potdar, A.Avila,C.XING,etal. A Streaming End-to-End Framework For Spoken Language Understanding, IJCAI 2021, pp. 3906-3914, DOI: 10.24963/ijcai.2021/538.

[C15] Y.Cao, N.Potdar, and A.Avila. Sequential End-to-End Intent and Slot Label Classification and Localization, Interspeech 2021.

[C14] A. Avila, H. Gamper, C. Reddy, R. Cutler, I. Tashev, J. Gehrke, Non-intrusive speech quality assessment using neural networks, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 631-635, DOI: 10.1109/ICASSP.2019.8683175.

[C13] A. Avila, J. Alam, D. O’Shaughnessy, T. Falk, Blind Channel Response Estimation for Replay Attack Detection, Interspeech 2019, pp. 2893-2897, DOI: 10.21437/Interspeech.2019-2956.

[C12] A. Avila, S. Kshirsagar, A. Tiwari, D. Lafond, D. O’Shaughnessy, and T. Falk, Speech-Based Stress and Emotion Classification Based on Modulation Spectral Features and Convolutional Neural Networks, 27th European Signal Processing Conference (EUSIPCO) 2019, pp. 1-5, DOI: 10.23919/EUSIPCO.2019.8903014.

[C11] B. Sadou, A. Lahoulou, T. Bouden, A. Avila, T. Falk, Z. Akhtar, Blind Image Quality Assessment using SVD based Dominant Eigenvectors for Feature Selection, SIPRO 2019.

[C10] A. Avila, J. Alam, D. O’Shaughnessy, T. Falk, Intrusive Quality Measurement of Noisy and Enhanced Speech based on i-Vector Similarity, QoMEX 2019, pp. 1-5, DOI: 10.1109/QoMEX.2019.8743285. *Nominated for Best Paper Award*

[C9] A. Avila, J. Alam, D. O’Shaughnessy, T. Falk, Investigating Speech Enhancement and Perceptual Quality for Speech Emotion Recognition, Interspeech 2018, pp. 3663-3667, DOI: 10.21437/Interspeech.2018-2350..

[C8] A. Avila, J. Monteiro, D. O’Shaughnessy, T. Falk, Speech Emotion Recognition on Mobile Devices Using a New Modulation Spectrum Pooling and Deep Neural Networks, ISSPIT 2017, 2017 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT), pp. 360-365, DOI: 10.1109/ISSPIT.2017.8388669.

[C7] A. Avila, B. Cauchi, S. Goetze, S. Doclo, T. Falk, Performance Comparison of Intrusive and Non-instrusive Instrumental Quality Measures for Enhanced Speech, IWAENC 2016, 2016 IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), pp. 1-5, DOI: 10.1109/IWAENC.2016.7602907.

[C6] A. Avila, M. Santos, F. Fraga, and T. Falk, The Effect of Speech Rate on Automatic Speaker Verification: a Comparative Analysis of GMM-UBM and I-vector Based Methods, 12th Audio Engineering Conference (AES-Brazil), May 2014.

[C5] A. Avila, M. Paja, F. Fraga, D. O’Shaughnessy, and T. Falk, Improving the Performance of Far-Field Speaker Verification Using Multi-Condition Training: The Case of GMM-UBM and i-vector Systems, Interspeech’2014.

[C4] A. Avila, M. Santos, F. Fraga, and T. Falk, Investigating the use of Modulation Spectral Features within an Ivector Framework for Far-Field Automatic Speaker Verification, International. Telecommunications Symposium, 2014.

[C3] A. Avila, F. Prado, G. Kobayashi, E. Rocha, Performance Comparison of Overdetermined Multilateration Algorithms for Estimating Aircraft Position. In: Workshop on Distance Geometry and Applications (DGA), 2013, Manaus.

[C2] A. Avila, M. Paja, F. Fraga, Proposta de um Sistema de Diálogo Automático Baseado em Algoritmos de Aprendizado Por Reforço. In: Proceedings of the 10th AES Brazil Conference. Rio de Janeiro: Audio Engineering Society, 2012. v. 1. p. 75-78.

[C1] A. Avila, M. Paja, F. Fraga, Integracão de Sistemas de Reconhecimento, Tradução e Síntese Automática da Fala para Facilitar a Comunicação de Turistas. In: The 14th LAC AES Conference. Montevideo: Audio Engineering Society, 2011. v. 1, p. 1-4.

HONORS AND AWARDS

Merit Scholarship Program for Foreign Student (2nd rank), Canada, 2015
Science Without Borders, Brazil, 2015
Emerging Leaders in the Americas Program, Canada, 2014
Centre for Advanced Systems and Technologies in Communications, Canada, 2013
Development Industrial Technological Scholarship, Brazil, 2011