STG Publications

Speech Technology Group, Selected Recent Scientific Publications


2016

  • Multi-domain Spoken Dialogue Systems using Domain-Independent Parameterisation
    A. Papangelis and Y. Stylianou
    Proc. DADA 2016, Riva del Garda, Italy (September 2016)

  • Multi-Stream Spectral Representation for Statistical Parametric Speech Synthesis
    K. Yanagisawa, R. Maia and Y. Stylianou
    Proc. ICASSP 2016, Shanghai, China (March 2016)

  • Speaker Adaptive Training in Deep Neural Networks using Speaker Dependent Bottle Neck Features
    R. Doddipatla
    Proc. ICASSP 2016, Shanghai, China (March 2016)

  • Iterative Estimation of Phase using Complex Cepstrum Representation
    R. Maia and Y. Stylianou
    Proc. ICASSP 2016, Shanghai, China (March 2016)

  • Initial Investigation of Speech Synthesis Based on Complex-Valued Neural Networks
    Q. Hu, K. Richmond, J. Yamagishi, K. Subramanian and Y. Stylianou
    Proc. ICASSP 2016, Shanghai, China (March 2016)

  • Voice Activity Detection: Merging Source and Filter-based Information
    T. Drugman, Y. Stylianou, Y. Kida and M. Akamine
    IEEE Signal Processing Letters, Vol. 23, No. 2 (February 2016)

2015

  • Expressive Visual Text-To-Speech as an Assistive Technology for Individuals with Autism Spectrum Conditions
    B. Stenger, S. A. Cassidy, L. Van Dongen, K. Yanagisawa, R. Anderson, V. Wan, S. Baron-Cohen and R. Cipolla
    Accepted in Computer Vision and Image Understanding, Special Issue on Assistive Computer Vision and Robotics

  • A Fast Algorithm for Improved Intelligibility of Speech-in-Noise Based on Frequency and Time Domain Energy Reallocation
    T.C. Zorila and Y. Stylianou
    Proc. Interspeech 2015, Dresden, Germany (September 2015)

  • A Maximum Likelihood Approach to Detect Moments of Maximum Excitation and its Application to High Quality Speech Parameterization
    R. Maia, Y. Stylianou and M. Akamine
    Proc. Interspeech 2015, Dresden, Germany (September 2015)

  • Fast and Accurate Phase Unwrapping
    T. Drugman and Y. Stylianou
    Proc. Interspeech 2015, Dresden, Germany (September 2015)

  • Fusion of Multiple Parametrization for DNN-Based Sinusoidal Speech Synthesis with Multi-Task Learning
    Q. Hu, Z. Wu, K. Richmond, J. Yamagishi, Y. Stylianou and R. Maia
    Proc. Interspeech 2015, Dresden, Germany (September 2015)

  • Intelligibility Enhancement of Casual Speech for Reverberant Environments Inspired by Clear Speech Properties
    M. Koutsogiannaki, P. Petkov and Y. Stylianou
    Proc. Interspeech 2015, Dresden, Germany (September 2015)

  • Towards a Linear Dynamical Model Based Speech Synthesizer
    V. Tsiaras, R. Maia, V. Diakoloukas, Y. Stylianou and V. Digalakis
    Proc. Interspeech 2015, Dresden, Germany (September 2015)

  • Learning Domain-Independent Dialogue Policies via Ontology Parameterisation
    Z. Wang and Y. Stylianou
    Proc. SIGDIAL 2015, Prague, Czech Republic (September 2015)

  • Speaker and Expression Factorization for Audiobook Data: Expressiveness and Transplantation
    L. Chen, N. Braunschweiler and M. Gales
    IEEE Trans. Audio, Speech and Language Processing, Vol. 23 (April 2015)

  • Methods for Applying Dynamic Sinusoidal Models to Statistical Parametric Speech Synthesis
    Q. Hu, Y. Stylianou, R. Maia, K. Richmond and J. Yamagishi
    Proc. ICASSP 2015, Brisbane, Australia (April 2015)

  • Robust Excitation-based Features for Automatic Speech Recognition
    T. Drugman, Y. Stylianou, L. Chen, X. Chen and M. Gales
    Proc. ICASSP 2015, Brisbane, Australia (April 2015)

  • Improved Face-to-Face Communication Using Noise Reduction and Speech Intelligibility Enhancement
    A. Griffin, T.-C. Zorila and Y. Stylianou
    Proc. ICASSP 2015, Brisbane, Australia (April 2015)

2014

  • Enhancing the Intelligibility of Statistically Generated Synthetic Speech by Means of Noise-Independent Modifications
    D. Erro, T. Zorila and Y. Stylianou
    IEEE Transactions on Audio, Speech and Language Processing, Vol. 22, Issue 12, pp. 2101-2111 (December 2014)

  • Fast Inter-Harmonic Reconstruction for Spectral Envelope Estimation in High-Pitch Voices
    T. Drugman and Y. Stylianou
    IEEE Signal Processing Letters, Vol. 21, Issue 11 (November 2014)

  • Maximum Voiced Frequency Estimation: Exploiting Amplitude and Phase Spectra
    T. Drugman and Y. Stylianou
    IEEE Signal Processing Letters, Vol. 21, Issue 10 (October 2014)

  • Noise-robust TTS Speaker Adaptation with Statistics Smoothing
    K. Yanagisawa, L. Chen and M. Gales
    Proc. Interspeech 2014, Singapore (September 2014)

  • Speech Intonation for TTS: Study on Evaluation Methodology
    J. Latorre, K. Yanagisawa, V. Wan, B. Kolluru and M. Gales
    Proc. Interspeech 2014, Singapore (September 2014)

  • Voice Expression Conversion Using HMM-TTS Models
    J. Latorre, V. Wan and K. Yanagisawa
    Proc. Interspeech 2014, Singapore (September 2014)

  • Generating Multiple-Accent Pronunciations for TTS using Joint Sequence Model Interpolation
    B. Kolluru, V. Wan, J. Latorre, K. Yanagisawa and M. Gales
    Proc. Interspeech 2014, Singapore (September 2014)

  • Enabling Controllability for Continuous Expression Space
    L. Chen and N. Braunschweiler
    Proc. Interspeech 2014, Singapore (September 2014)

  • An Investigation of the Application of Dynamic Sinusoidal Models to Statistical Parametric Speech Synthesis
    Q. Hu, Y. Stylianou, R. Maia, K. Richmond, J. Yamagishi and J. Latorre
    Proc. Interspeech 2014, Singapore (September 2014)

  • Analysis of Emotional Speech using an Adaptive Sinusoidal Model
    G. Kafentzis, T. Yakoumaki, A. Mouchtaris and Y. Stylianou
    Proc. EUSIPCO 2014, Lisbon, Portugal (September 2014)

  • On the Impact of Excitation and Spectral Parameters for Expressive Statistical Parametric Speech Synthesis
    R. Maia and M. Akamine
    Computer Speech & Language, Vol. 28, Issue 5 (September 2014)

  • Cluster Adaptive Training of Average Voice Models
    V. Wan, J. Latorre, K. Yanagisawa, M. Gales and Y. Stylianou
    Proc. ICASSP 2014, Florence, Italy (May 2014)

  • Linear Dynamical Models in Speech Synthesis
    V. Tsiaras, R. Maia, V. Diakoloukas, Y. Stylianou and V. Digalakis
    Proc. ICASSP 2014, Florence, Italy (May 2014)

  • Speaker Dependent Expression Predictor From Text: Expressiveness and Transplantation
    L. Chen, N. Braunschweiler and M. Gales
    Proc. ICASSP 2014, Florence, Italy (May 2014)

  • Complex Cepstrum Factorization for Statistical Parametric Synthesis
    R. Maia and Y. Stylianou
    Proc. ICASSP 2014, Florence, Italy (May 2014)

  • A Fixed Dimension and Perceptually Based Dynamic Sinusoidal Model of Speech
    Q. Hu, Y. Stylianou, K. Richmond, R. Maia, J. Yamagishi and J. Latorre
    Proc. ICASSP 2014, Florence, Italy (May 2014)

  • Real Time Speech-in-Noise Intelligibility Enhancement Based on Spectral Shaping and Dynamic Range Compression
    V. Tsiaras, T. C. Zorila, Y. Stylianou and M. Akamine
    Proc. ICASSP 2014, Show and Tell, Florence, Italy (May 2014)

  • Building HMM-TTS Voices on Diverse Data
    V. Wan, J. Latorre, K. Yanagisawa, N. Braunschweiler, L. Chen, M. Gales and M. Akamine
    IEEE Journal of Selected Topics in Signal Processing, Vol. 8, Issue 2, pp. 296-306 (April 2014)

  • Integrated Expression Prediction and Speech Synthesis from Text
    L. Chen, M. Gales, N. Braunschweiler, M. Akamine and K. Knill
    IEEE Journal of Selected Topics in Signal Processing, Vol. 8, Issue 2, pp. 323-335 (April 2014)

  • Intelligibility Enhancement of HMM-generated Speech in Additive Noise by Modifying Mel Cepstral Coefficients to Increase the Glimpse Proportion
    C. Valentini-Botinhao, J. Yamagishi, S. King and R. Maia
    Computer Speech & Language, Vol. 28, Issue 2 (March 2014)

  • Development of Xpressive Talk™ generating expressive speech and facial images
    R. Morinaka, Y. Nasu, M. Tamura, V. Wan, K. Yanagisawa, B. Stenger, M. Morita, T. Kagoshima, M. Akamine
    Proc. Acoustical Society of Japan 2014 Spring Meeting, Tokyo, Japan (March 2014)

2013

  • Automatic Detection of Inhalation Breath Pauses for Improved Pause Modelling in HMM-TTS
    N. Braunschweiler and L. Chen
    Proc. 8th Speech Synthesis Workshop (SSW8), Barcelona, Spain (August 2013)

  • Noise Robustness in HMM-TTS Speaker Adaptation
    K. Yanagisawa, V. Wan, J. Latorre, M. Gales and S. King
    Proc. 8th Speech Synthesis Workshop (SSW8), Barcelona, Spain (August 2013)

  • An Experimental Comparison of Multiple Vocoder Types
    Q. Hu, K. Richmond, J. Yamagishi and J. Latorre
    Proc. 8th Speech Synthesis Workshop (SSW8), Barcelona, Spain (August 2013)

  • Unsupervised Speaker and Expression Factorization for Multi-Speaker Expressive Synthesis of E-Books
    L. Chen and N. Braunschweiler
    Proc. Interspeech 2013, Lyon, France (August 2013)

  • Minimum Mean Squared Error Based Warped Complex Cepstrum Analysis for Statistical Parametric Speech Synthesis
    R. Maia, M. Gales, Y. Stylianou and M. Akamine
    Proc. Interspeech 2013, Lyon, France (August 2013)

  • Photo-Realistic Expressive Text to Talking Head Synthesis
    V. Wan, R. Anderson, A. Blokland, N. Braunschweiler, L. Chen, B. Kolluru, J. Latorre, R. Maia, B. Stenger, K. Yanagisawa, Y. Stylianou, M. Akamine, M. Gales and R. Cipolla
    Proc. Interspeech 2013, Lyon, France (August 2013)

  • An Expressive Text-Driven 3D Talking Head
    R. Anderson, B. Stenger, V. Wan and R. Cipolla
    Proc. SIGGRAPH 2013, Anaheim, California, USA (July 2013)

  • Expressive Visual Text-to-Speech Using Active Appearance Models
    R. Anderson, B. Stenger, V. Wan and R. Cipolla
    Proc. IEEE International Conference on Computer Vision and Pattern Recognition, Portland, Oregon, USA (June 2013)

  • Complex Cepstrum for Statistical Parametric Speech Synthesis
    R. Maia, M. Akamine and M. Gales
    Speech Communication, Vol. 55, Issue 5 (June 2013)

  • Integrated Automatic Expression Prediction and Speech Synthesis From Text
    L. Chen, M. Gales, N. Braunschweiler, M. Akamine and K. Knill
    Proc. ICASSP 2013, Vancouver, Canada (May 2013)

  • Training a Supra-Segmental Parametric F0 Model Without Interpolating F0
    J. Latorre, M. Gales, K. Knill and M. Akamine
    Proc. ICASSP 2013, Vancouver, Canada (May 2013)

  • Complex Cepstrum Analysis Based on the Minimum Mean Squared Error
    R. Maia, M. Akamine and M. Gales
    Proc. ICASSP 2013, Vancouver, Canada (May 2013)

2012

  • Crowdsourced Assessment of Speech Synthesis
    S. Buchholz, J. Latorre and K. Yanagisawa
    In "Crowdsourcing for Speech Processing: Applications to Data Collection, Transcription and Assessment", Wiley & Sons (September 2012)

  • Analysis of the Importance of Short-term Speech Parameterizations for Emotional Statistical Parametric Speech Synthesis
    R. Maia and M. Akamine
    Proc. Interspeech 2012, Portland, Oregon, USA (September 2012)

  • C2H: a Computational Model to Manage Phonetic Contrast Along the H&H Continuum in Speech Production
    M. Nicolao, J. Latorre and R. Moore
    Proc. Interspeech 2012, Portland, Oregon, USA (September 2012)

  • Combining Multiple High Quality Corpora for Improving HMM-TTS
    V. Wan, J. Latorre, M. Gales, L. Chen, K. Chin, K. Knill and M. Akamine
    Proc. Interspeech 2012, Portland, Oregon, USA (September 2012)

  • Exploring Rich Expressive Information from Audio Book Data Using Cluster Adaptive Training
    L. Chen, M. Gales, V. Wan and J. Latorre
    Proc. Interspeech 2012, Portland, Oregon, USA (September 2012)

  • Noise Compensation for Subspace Gaussian Mixture Models
    L. Lu, K. Chin, A. Ghoshal and S. Renals
    Proc. Interspeech 2012, Portland, Oregon, USA (September 2012)

  • Speech Factorization for HMM-TTS Based on Cluster Adaptive Training
    J. Latorre, V. Wan, M. Gales, L. Chen, K. Chin and K. Knill
    Proc. Interspeech 2012, Portland, Oregon, USA (September 2012)

  • Statistical Parametric Speech Synthesis Based on Speaker and Language Factorization
    H. Zen, N. Braunschweiler, S. Buchholz, M. Gales, K. Knill and S. Krstulovic
    IEEE Transactions on Audio, Speech and Language Processing (August 2012)

  • Cepstral Analysis Based on the Glimpse Proportion Measure for Improving the Intelligibility of HMM-Based Synthetic Speech in Noise
    C. Valentini-Botinhao, R. Maia, J. Yamagishi, S. King and H. Zen
    Proc. ICASSP 2012, Kyoto, Japan (March 2012)

  • Complex Cepstrum as Phase Information in Statistical Parametric Speech Synthesis
    R. Maia, M. Akamine and M. Gales
    Proc. ICASSP 2012, Kyoto, Japan (March 2012)

  • Unsupervised Clustering of Emotion and Voice Styles for Expressive TTS
    F. Eyben, N. Braunschweiler, S. Buchholz, V. Wan, M. Gales, J. Latorre and K. Knill
    Proc. ICASSP 2012, Kyoto, Japan (March 2012)

  • Product of Experts for Statistical Parametric Speech Synthesis
    H Zen, M Gales, Y Nankaku, K Tokuda
    IEEE Transactions on Audio, Speech and Language Processing, Vol. 20, No. 3, pp. 1558-7916 (March 2012)

2011

  • The Effect of Using Normalized Models in Statistical Speech Synthesis
    M Shannon, H Zen, B Byrne
    Proc. Interspeech 2011, Florence, Italy (August 2011)

  • Gaussian Process Experts for Voice Conversion
    N Pilkington, H Zen, M Gales
    Proc. Interspeech 2011, Florence, Italy (August 2011)

  • Multipulse Sequences for Residual Signal Modeling
    R Maia, H Zen, K Knill, M Gales, S Buchholz
    Proc. Interspeech 2011, Florence, Italy (August 2011)

  • Crowdsourcing Preference Tests, and How to Detect Cheating
    S Buchholz, J Latorre
    Proc. Interspeech 2011, Florence, Italy (August 2011)

  • Integrated Online Speaker Clustering and Adaptation
    C Breslin, K Chin, M Gales, K Knill
    Proc. Interspeech 2011, Florence, Italy (August 2011)

  • Automatic Sentence Selection from Speech Corpora Including Diverse Speech for Improved HMM-TTS Synthesis Quality
    N Braunschweiler, S Buchholz
    Proc. Interspeech 2011, Florence, Italy (August 2011)

  • Joint Uncertainty Decoding with Predictive Methods for Noise Robust Speech Recognition
    H Xu, M Gales, K Chin
    IEEE Transactions on Audio, Speech and Language Processing, Vol. 19, No. 6, pp. 1665-1676 (August 2011)

  • Context Adaptive Training with Factorised Decision Trees for HMM-Based Statistical Parametric Speech Synthesis
    K Yu, H Zen, F Mairesse, S Young
    Speech Communication, Elsevier Publications, Vol. 53, Issue 6, pp. 914-923 (July 2011)

  • Decision Tree-Based Context Clustering Based on Cross Validation and Hierarchical Priors
    H Zen, M Gales
    Proc. ICASSP 2011, Prague, Czech Republic (May 2011)

  • Continuous F0 in the Source-Excitation Generation for HMM-Based TTS: do we need Voiced/Unvoiced Classification?
    J Latorre, M Gales, S Buchholz, K Knill, M Tamura, Y Ohtani, M Akamine
    Proc. ICASSP 2011, Prague, Czech Republic (May 2011)

  • Rapid Joint Speaker and Noise Compensation for Robust Speech Recognition
    K Chin, H Xu, M Gales, C Breslin, K Knill
    Proc. ICASSP 2011, Prague, Czech Republic (May 2011)

  • Constrained Discriminative Mapping Transforms for Unsupervised Speaker Adaptation
    L Chen, M Gales, K Chin
    Proc. ICASSP 2011, Prague, Czech Republic (May 2011)

  • Development of US English Text-to-Speech Synthesizer using HMM-based Speech Synthesis
    M Tamura, S Krstulovic, T Morinaka, R Tokuda, H Zen, M Morita, T Kagoshima, M Akamine
    Proc. Spring Meeting of Acoustical Society of Japan, Tokyo, Japan (March 2011)

2010

  • An Open Source HMM-based Text-to-Speech System for Brazilian Portuguese
    I Couto, N Neto, V Tadaiesky, A Klautau, R Maia
    Proc. 7th International Telecommunications Symposium (ITS 2010), Manaus, Brazil (September 2010)

  • Synthesis of Emotional Speech
    M Schröder, F Burkhardt, S Krstulovic
    Published in "Blueprint for Affective Computing: A Sourcebook" (Section 5.2), Oxford University Press (September 2010)

  • Speaker and Language Adaptive Training For HMM-based Polyglot Speech Synthesis
    H Zen
    Proc. Interspeech 2010, Makuhari, Japan (September 2010)

  • Context Adaptive Training with Factorized Decision Trees for HMM-based Speech Synthesis
    K Yu, H Zen, F Mairesse, S Young
    Proc. Interspeech 2010, Makuhari, Japan (September 2010)

  • A Comparison of Pronunciation Modelling Approaches for HMM TTS
    G Webster, S Krstulovic, K Knill
    Proc. Interspeech 2010, Makuhari, Japan (September 2010)

  • An Implementation of Decision Tree-based Context Clustering on Graphics Processing Units
    N Pilkington, H Zen
    Proc. Interspeech 2010, Makuhari, Japan (September 2010)

  • Training a Parametric-based F0 Model with the Minimum Generation Error Criterion
    J Latorre, M Gales, S Buchholz
    Proc. Interspeech 2010, Makuhari, Japan (September 2010)

  • Prior Information for Rapid Speaker Adaptation
    C Breslin, H Xu, K Chin, M Gales, K Knill
    Proc. Interspeech 2010, Makuhari, Japan (September 2010)

  • Lightly Supervised Recognition for Automatic Alignment of Large Coherent Speech Recordings
    N Braunschweiler, M Gales, S Buchholz
    Proc. Interspeech 2010, Makuhari, Japan (September 2010)

  • HMM-Based Polyglot Speech Synthesis by Speaker and Language Adaptive Training
    H Zen, J Latorre
    Proc. 7th ISCA Speech Synthesis Workshop (SSW7), Kyoto, Japan (September 2010)

  • Statistical Parametric Speech Synthesis Based on the Joint Estimation of Acoustic and Excitation Model Parameters
    R Maia, H Zen, M Gales
    Proc. 7th ISCA Speech Synthesis Workshop (SSW7), Kyoto, Japan (September 2010)

  • Text-to-Speech Synthesis to Improve TV Accessibility
    K Knill
    IEEE Speech and Language Processing Technical Committee Newsletter (July 2010)

  • Annotating the Enron Corpus with Number Senses
    S Moore, A Korhonen, S Buchholz
    Proc. 7th International Conference on Language Resources and Evaluation (LREC), Valletta, Malta (May 2010)

  • Automatic feature selection from a large number of features for phone duration prediction
    G. Webster, S. Buchholz, J. Latorre
    Proc. Speech Prosody, Chicago, USA (May 2010)

  • Usages of an external duration model for HMM-based speech synthesis
    J. Latorre, S. Buchholz, M. Akamine
    Proc. Speech Prosody, Chicago, USA (May 2010)

  • Statistical Parametric Speech Synthesis Based on Product of Experts
    H. Zen, M. Gales, Y. Nankaku, K. Tokuda
    Proc. ICASSP 2010, Dallas, USA (March 2010)

2009

  • Improving Joint Uncertainty Decoding by Predictive Methods for Noise Robust Speech Recognition
    H. Xu, M. Gales, K. Chin
    Proc. ASRU 2009, Merano, Italy (December 2009)

  • Comparison of Estimation Techniques in Joint Uncertainty Decoding for Noise Robust Speech Recognition
    H. Xu, K. Chin
    Proc. Interspeech 2009, Brighton (September 2009)

  • Compression Techniques Applied to Multiple Speech Recognition Systems
    C. Breslin, M. Stuttle, K. Knill
    Proc. Interspeech 2009, Brighton (September 2009)

  • Context Dependent Additive Log F0 Model for HMM-based Speech Synthesis
    H. Zen, N. Braunschweiler
    Proc. Interspeech 2009, Brighton (September 2009)

  • Improved Language Modelling Using Bag of Word Pairs
    L. Chen, K. Chin, K. Knill
    Proc. Interspeech 2009, Brighton (September 2009)

  • Joint Uncertainty Decoding with the Second Order Approximation for Noise Robust Speech Recognition
    H. Xu, K. Chin
    Proc. ICASSP 2009, Taipei (April 2009)

2008

  • An Evaluation of Non-standard Features for Grapheme-to-Phoneme Conversion
    G. Webster, N. Braunschweiler
    Proc. Interspeech 2008, Brisbane (September 2008)

  • Improving Japanese Language Models Using POS Information
    L. Chen, H. Nagae, M. Stuttle
    Proc. Interspeech 2008, Brisbane (September 2008)

  • Sentence-based Emotional Classification for Text-to-Speech
    E. Spyropoulou, S. Buchholz, S. Teufel
    Proc. CAFFEi 2008, (August 2008)

  • Comparing QMT1 and HMMs for the synthesis of American English Prosody
    S. Krstulovic, J. Latorre, S. Buchholz
    Proc. Speech Prosody 2008, Campinas (May 2008)

  • Efficient Language Model Look-Ahead Probabilities Generation Using Lower Order LM Look-Ahead Information
    L. Chen, K. Chin
    Proc. ICASSP 2008, Las Vegas (April 2008)

2007

  • Sentence Level Intelligibility Evaluation for Mandarin Text-to-Speech Systems Using Semantically Unpredictable Sentences
    J. Li, D. Sityaev, J. Hao
    Proc. Interspeech 2007, Antwerp (September 2007)

  • The Toshiba Entry for the 2007 Blizzard Challenge
    S. Buchholz, N. Braunschweiler, M. Morita, G. Webster
    Proc. The Blizzard Challenge 2007 Workshop, Bonn (August 2007)

  • How (not) to select your voice corpus: random selection vs phonologically balanced
    T. Lambert, N. Braunschweiler, S. Buchholz
    Proc. Sixth ISCA Workshop on Speech Synthesis, Bonn (August 2007)

  • Some aspects of prosody of friendly formal and friendly informal speaking styles
    D. Sityaev, G. Webster, N. Braunschweiler, S. Buchholz, K. Knill
    Proc. ICPhS XVI, Saarbruecken (August 2007)

2006

  • Comparison of the ITU-T P.85 Standard to Other Methods for the Evaluation of Text-to-Speech Systems
    D. Sityaev, K. Knill, T. Burrows
    Proc. Interspeech 2006, Pittsburgh (September 2006)

  • CoNLL-X Shared Task on Multilingual Dependency Parsing
    S. Buchholz, E. Marsi
    Proc. CoNLL-X, New York (June 2006)

  • Quality Control of Treebanks: Documenting, Converting and Patching
    S. Buchholz, D. Green
    Proc. LREC, Genoa (May 2006)

  • Adaptation of Prosodic Phrasing Models
    P. Bell, T. Burrows, P. Taylor
    Proc. Speech Prosody, Dresden (May 2006)

  • Analysis and Modelling of Question Intonation in American English
    D. Sityaev, T. Burrows, P. Jackson and K. Knill
    Proc. Speech Prosody, Dresden (May 2006)

  • The Prosodizer — Automatic Prosodic Annotations of Speech Synthesis Databases
    N. Braunschweiler
    Proc. Speech Prosody, Dresden (May 2006)

  • Robust Endpoint Detection for Speech Recognition Based on Discriminative Feature Extraction
    K. Yamamoto, F. Jabloun, K. Reinhard, A. Kawamura
    Proc. ICASSP 2006, (May 2006)

  • Investigating Prosodic Modifications for Polyglot Text-to-Speech Synthesis
    P. Olaszi, T. Burrows, K. Knill
    Proc. Multiling 2006, Stellenbosch (April 2006)

2005

  • A Study on Endpoint Detection for Speech Recognition Based on Discriminative Feature Extraction
    K. Yamamoto, F. Jabloun, K. Reinhard, A. Kawamura
    Proc. Information Processing Society of Japan Audio Language Information Processing, Tokyo (December 2005)

  • A Comparison of Methods for Speaker-Dependent Pronunciation Tuning for Text-to-Speech Synthesis
    G. Webster, T. Burrows, K. Knill
    Proc. Interspeech 2005, Lisbon (September 2005)

  • Combining Models of Prosodic Phrasing and Pausing
    T. Burrows, P. Jackson, K. Knill, D. Sityaev
    Proc. Interspeech 2005, Lisbon (September 2005)

  • Influence of Syntax on Prosodic Boundary Prediction
    T. Ingulfsen, T. Burrows, S. Buchholz
    Proc. Interspeech 2005, Lisbon (September 2005)

  • Intonational Sequence of Tuscan Italian
    J. Bishop, M. Peake, D. Sityaev
    Proc. Interspeech 2005, Lisbon (September 2005)

2004

  • Improving Letter-to-Pronunciation Accuracy with Automatic Morphologically Based Stress Prediction
    G. Webster
    Proc. Interspeech 2004, Jeju (October 2004)