Medizinische Physik

Publications

Journal papers

  • de Tailléz, T., Kollmeier, B., Meyer, B.T. (2018). "Machine learning for decoding listeners' attention from EEG evoked by continuous speech," European Journal of Neuroscience. https://doi.org/10.1111/ejn.13790
  • Spille, C., Kollmeier, B., Meyer, B.T. (2018). "Predicting Speech Intelligibility with Deep Neural Networks," Computer Speech and Language 48, pp. 51-66. https://doi.org/10.1016/j.csl.2017.10.004
  • Spille, C., Kollmeier, B., Meyer, B.T. (2017). "Combining binaural and cortical features for robust speech recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (4), pp. 756-767. https://doi.org/10.1109/TASLP.2017.2661712
  • Castro Martínez, A.M., Mallidi, S.H., Meyer, B.T. (2017). "On the Relevance of Auditory-Based Gabor Features for Deep Learning in Automatic Speech Recognition," Computer Speech and Language 45, pp. 21-38. http://dx.doi.org/10.1016/j.csl.2017.02.006 [ pdf ]
  • Kollmeier, B., Schädler, M.R., Warzybok, A., Meyer, B.T., Brand, T. (2016). "Sentence recognition prediction for hearing-impaired listeners in stationary and fluctuation noise with FADE: Empowering the Attenuation and Distortion concept by Plomp with a quantitative processing model," Trends in Hearing, Sep 7;20. doi:10.1177/2331216516655795.
  • Xiong, F., Meyer, B.T., Moritz, N., Rehr, R., Anemueller, J., Gerkmann, T., Doclo, S., Goetze, S. (2015). "Front-End Technologies for Robust ASR in Reverberant Environments - Spectral Enhancement-based Dereverberation and Auditory Modulation Filterbank Features," EURASIP Journal on Advances in Signal Processing, 2015: 70. doi:10.1186/s13634-015-0256-4. [ url ]
  • Schädler, M.R., Meyer, B.T., and Kollmeier, B. (2012). "Spectro-temporal modulation subspace-spanning filter bank features for robust automatic speech recognition", J. Acoust. Soc. Am. Volume 131, Issue 5, pp. 4134-4151. [pdf - see copyright notice below (1)]
  • Meyer, B.T., Brand, T., Kollmeier, B. (2011). "Effect of speech-intrinsic variations on human and automatic recognition of spoken phonemes", J. Acoust. Soc. Am. 129, pp. 388-403. [url | pdf - see copyright notice below (1)]
  • Meyer, B.T., Jürgens, T., Wesker, T., Brand, T., Kollmeier, B. (2010). "Human phoneme recognition as a function of speech-intrinsic variabilities", J. Acoust. Soc. Am. 128 (5), pp. 3126–3141 [url | pdf - see copyright notice below (1)]
  • Meyer, B.T. and Kollmeier, B. (2010). "Robustness of spectro-temporal features against intrinsic and extrinsic variations in automatic speech recognition", Speech Communication 53 (5) (Special issue on Statistical and Perceptual Audition), pp. 753-767. dx.doi.org/10.1016/j.specom.2010.07.002 [url]

 

Peer-reviewed conference proceedings

  • Huber, R., Spille, C., Meyer, B.T. (2017). "Single-ended prediction of listening effort based on automatic speech recognition," in Proc. Interspeech. [ pdf ]
  • Spille, C. and Meyer, B.T. (2017). "Listening in the dips: Comparing relevant features for speech recognition in humans and machines," Proc. Interspeech. [ pdf ]
  • Meyer, B.T., Mallidi, S.H., Kayser, H., Hermansky, H. (2017). "Predicting error rates for unknown data in automatic speech recognition," in Proc. ICASSP. [ pdf ]
  • Xiong, F., Goetze, S., Meyer, B.T. (2017). "Combination strategy based on relative performance monitoring for multi-stream reverberant speech recognition," in Proc. ICASSP. [ pdf ]
  • Xiong, F., Goetze, S., Meyer, B.T. (2017). "On DNN posterior probability combination in multi-stream speech recognition for reverberant environments," in Proc. ICASSP. [ pdf ]
  • Xiong, F., Meyer, B.T., Cauchi, B., Jukic, A., Doclo, S., Goetze, S. (2017). "Performance Comparison of Real-Time Single-Channel Speech Dereverberation Algorithms," in Proc. Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA), San Francisco, CA.
  • Meyer, B.T., Mallidi, S.H., Castro Martínez, A.M., Paya-Vaya, G., Kayser, H., Hermansky, H. (2016). "Performance monitoring for automatic speech recognition in noisy multi-channel environments," IEEE Workshop on Spoken Language Technology. [ pdf ]
  • Spille, C., Kayser, H., Hermansky, H., Meyer, B.T. (2016). "Assessing speech quality in speech-aware hearing aids based on phoneme posteriorgrams," in Proc. Interspeech. [ pdf ]
  • Exter, M., Meyer, B.T. (2016). "DNN-based automatic speech recognition as a model for human phoneme perception," in Proc. Interspeech. [ pdf ]
  • Frye, M., Micheli, C., Schepers, I.M., Schalk, G., Rieger, J.W., Meyer, B.T. (2016). "Neural responses to speech-specific modulations derived from a spectro-temporal filter bank," in Proc. Interspeech. [ pdf ]
  • Eichenauer, A., Dietz, M., Meyer, B.T., Jürgens, T. (2016). "Introducing temporal rate coding for speech in cochlear implants: A microscopic evaluation in humans and models," in Proc. Interspeech. [ pdf ]
  • Xiong, F., Goetze, S., Meyer, B.T. (2015). "Joint Estimation of Reverberation Time and Direct-To-Reverberation Ratio from Speech Using Auditory-Inspired Features," ACE Challenge Workshop, satellite event of IEEE-WASPAA. 
  • Meyer, B.T., Kollmeier, B., and Ooster, J. (2015). "Autonomous measurement of speech intelligibility utilizing automatic speech recognition," in Proc. Interspeech. [ pdf ]
  • Kayser, H., Spille, C., Marquardt, D., Meyer, B.T. (2015). "Improving automatic speech recognition in spatially-aware hearing aids," in Proc. Interspeech. [ pdf ]
  • Xiong, F., Meyer, B., Goetze, S. (2015). "A Study on Joint Beamforming and Spectral Enhancement for Robust Speech Recognition in Reverberant Environments," Proc. 40th International Conference on Acoustics, Speech, and Signal Processing (ICASSP). [ pdf ]
  • Spille, C., Meyer, B.T. (2014). "Identifying the human-machine differences in complex binaural scenes: What can be learned from our auditory system," in Proc. Interspeech, pp. 626-631. [ pdf ]
  • Castro Martínez, A.M., Moritz, N., Meyer, B.T. (2014). "Should deep neural nets have ears? The role of auditory features in deep learning approaches," in Proc. Interspeech, pp. 2435-2439. [ pdf ]
  • Xiong, F., Moritz, N., Rehr, R., Anemüller, J., Meyer, B.T., Gerkmann, T., Doclo, S., Goetze, S. (2014). "Robust ASR in reverberant environments using temporal cepstrum smoothing for speech enhancement and an amplitude modulation filterbank for feature extraction," in Proc. REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge. [ pdf ]
  • Xiong, F., Goetze, S., Meyer, B.T. (2014). "Estimating room acoustic parameters for speech recognizer adaptation and combination in reverberant environments," Proc. 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 5522-5526. [ pdf ]
  • Meyer, B.T. (2013). "What's the difference? Comparing humans and machines on the Aurora2 speech database," in Proc. Interspeech, pp. 2634-2638. [ pdf ]
  • Spille, C., Dietz, M., Hohmann, V., Meyer, B.T. (2013). "Using binaural processing for automatic speech recognition in multi-talker scenes," Proc. 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 7805-7809. [ pdf ]
  • Xiong, F., Goetze, S., Meyer, B.T. (2013). "Blind estimation of reverberation time based on spectro-temporal modulation filtering," Proc. 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 443-447. [ pdf ]
  • Chang, S., Meyer, B.T., Morgan, N. (2013). "Spectro-temporal features for noise-robust speech recognition using power-law nonlinearity and power-bias subtraction," Proc. 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 7063-7067. [ pdf ]
  • Moritz, N., Schädler, M.R., Adiloglu, K., Meyer, B.T., Jürgens, T., Gerkmann, T., Goetze, S. (2013). "Noise robust distant automatic speech recognition utilizing NMF based source separation and auditory feature extraction," Workshop on Machine Listening in Multisource Environments (CHiME 2013). [ pdf ]
  • Meyer, B.T., Spille, C., Kollmeier, B., and Morgan, N. (2012). "Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition," in Proc. Interspeech. [ pdf ]
  • Kollmeier, B., Schädler, M.R., Meyer, A., Anemüller, J., and Meyer, B.T. (2012). "Do we need STRFs for cocktail parties? - On the relevance of physiologically motivated features for human speech perception derived from automatic speech recognition", in Proc. International Symposium on Hearing (ISH), Cambridge, UK.
  • Lei, H., Meyer, B.T., and Mirghafori, N. (2012). "Spectro-temporal Gabor features for speaker recognition," in Proc. ICASSP, pp. 4241-4244. [ pdf ]
  • Meyer, B.T. (2011). "Improving automatic speech recognition by learning from human errors," in Proc. 162nd Meeting Acoustical Society of America. Selected as one of the highlights of the ASA meeting. POMA Volume 14, pp. 060001. [ url ]
  • Meyer, B.T. (2011). "Extraction of Spectro-Temporal Speech Cues for Robust Automatic Speech Recognition," in Proc. 42nd International Conference of the Audio Engineering Society (AES), pp. 108-116.
  • Meyer, B.T., Ravuri, S., Schädler, M.R., and Morgan, N. (2011). "Comparing different flavors of spectro-temporal features for ASR", in Proc. Interspeech, pp. 1269-1272. [ pdf ]
  • Meyer, B.T. and Kollmeier, B. (2010). "Learning from human errors: Prediction of phoneme confusions based on modified ASR training", in Proc. Interspeech. [ pdf ]
  • Meyer, B. and Kollmeier, B. (2009). "Complementarity of MFCC, PLP and Gabor features in the presence of speech-intrinsic variabilities," in Proc. Interspeech. [ pdf ]
  • Meyer, B.T. and Kollmeier, B. (2008). "Optimization and Evaluation of Gabor feature sets for ASR," in Proc. Interspeech. [ pdf ]
  • Garcia Lecumberri, M.L., Cooke, M., Cutugno, F., Giurgiu, M., Meyer, B.T., Scharenborg, O., van Dommelen, W., and Volin, A. (2008). "The non-native consonant challenge for European languages," in Proc. Interspeech.
  • Meyer, B.T., Wächter, M., Brand, T., and Kollmeier, B. (2007). "Phoneme confusions in human and automatic speech recognition," in Proc. Interspeech, Antwerpen, Belgium, pp. 1485-1488. [ pdf ]
  • Kollmeier, B., Meyer, B.T., Jürgens, T., Beutelmann, R., Meyer, R., and Brand, T. (2007). "Speech reception in noise: How much do we understand?," in Proceedings of the International Symposium on Auditory and Audiological Research (ISAAR), Helsingør, Denmark.
  • Meyer, B.T., Wesker, T., Brand, T., Mertins, A., and Kollmeier, B. (2006). "A human-machine comparison in speech recognition based on a logatome corpus," in Workshop on Speech Recognition and Intrinsic Variation, Toulouse, France. [ pdf ]
  • Wesker, T., Meyer, B., Wagener, K., Anemüller, J., Mertins, A., and Kollmeier, B. (2005). "Oldenburg Logatome Speech Corpus (OLLO) for speech recognition experiments with humans and machines," in Proceedings of Interspeech, Lisbon, Portugal, pp. 1273-1276. [ pdf ]

 

Book Chapters

  • Spille, C., Meyer, B.T., Dietz, M., Hohmann, V. (2013). Chapter "Binaural scene analysis with multi-dimensional statistical filters," in "The Technology of Binaural Listening" (Ed. Blauert, J.), Springer, Berlin.
  • Kollmeier, B., Brand, T., and Meyer, B. (2008). Chapter "Perception of speech and sound," in "Springer Handbook of Speech Processing," pp. 61-82, Springer, Berlin.

 

Theses

  • Meyer, B., "Human and automatic speech recognition in the presence of speech-intrinsic variations," Ph.D. thesis, Carl-von-Ossietzky Universität, Oldenburg, 2009. [ url ]
  • Meyer, B., "Robust Speech Recognition based on Spectro-Temporal Features," diploma thesis, Carl-von-Ossietzky Universität, Oldenburg, 2004. [ pdf ]

 

Other publications

  • Kollmeier, B., Schädler, M.R., Warzybok, A., Meyer, B.T., Brand, T. (2015). "Individual speech recognition in noise, the audiogram & more: Using automatic speech recognition (ASR) as a modelling tool and consistency check across audiological measures," abstract for the International Symposium on Auditory and Audiological Research (ISAAR).
  • Meyer, B. (2011). "Human and automatic speech recognition in the presence of speech-intrinsic variations", Summary of Ph.D. thesis, Zeitschrift für Audiologie 50 (2), pp. 77-78.
  • Schädler, M. R., Meyer, B. and Kollmeier, B. (2011). "Robuste Spracherkennung mit spektro-temporalen Filterbankmerkmalen", Fortschritte der Akustik - Tagungsband der DAGA.
  • Meyer, B. and Kollmeier, B., "Einfluss intrinsischer Sprachvariation auf automatische Spracherkenner – Vergleich spektraler und spektro-temporaler Merkmale", in Proc. DAGA, Berlin, 2010.
  • Meyer, B., Brand, T., and Kollmeier, B., "Phonemverwechslungen bei menschlicher und automatischer Spracherkennung," in Proceedings of DAGA, Stuttgart, Germany, 2007, pp. 79-80. [ pdf ]
  • Meyer, B. and Kleinschmidt, M., "Robust Speech Recognition Based on Localized Spectro-Temporal Features," in Proceedings of the Elektronische Sprach- und Signalverarbeitung (ESSV), Karlsruhe, 2003. [ pdf ]
  • Wesker, T., Meyer, B., Brand, T., Wagener, K., and Kollmeier, B., "OLLO - Ein Logatom-Sprachkorpus für Sprachverständlichkeitsmessungen und Erkennungsexperimente mit Menschen und Maschinen," in 9. Jahrestagung der Deutschen Gesellschaft für Audiologie, Zeitschrift für Audiologie, Suppl. IX, 2006.

 

(1) Copyright Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America.