Best Papers

Best Student Paper Award Shortlist

Each year ISCA selects up to three best student papers from INTERSPEECH to honor with a Best Student Paper Award. The winners are announced during the closing ceremony of the conference.
This year's shortlist of papers is below. The final decisions will be made based on the papers themselves and their presentations at the conference.

Yasmina Benabderrahmane (INRS‐EMT Telecommunications)

    with Sid-Ahmed Selouani (Universite de Moncton), and Douglas O’Shaughnessy (INRS‐EMT Telecommunications) for the paper titled "Oriented PCA Method for Blind Speech Separation of Convolutive Mixtures"     
  • Mon-Ses3-O3 Source Separation, 16:20-16:40, Monday

Joost van Doremalen (Radboud University Nijmegen)

    with Catia Cucchiarini (Radboud University Nijmegen), and Helmer Strik (Radboud University Nijmegen) for the paper titled "Using Non-Native Error Patterns to Improve Pronunciation Verification"     
  • Mon-Ses3-P3 Language Learning, TTS, and Other Applications, 16:00-18:00, Monday

Camille Guinaudeau (INRIA/IRISA)

    with Guillaume Gravier (IRISA/CNRS) , and Pascale Sébillot (IRISA/INSA) for the paper titled "Improving ASR-Based Topic Segmentation of TV Programs with Confidence Measures and Semantic Relations"     
  • Tue-Ses3-O2 Language Processing, 16:00-16:20, Tuesday

Jeffrey J. Holliday (Ohio State University)

    with Mary E. Beckman (Ohio State University), and Chanelle Mays (Ohio State University) for the paper titled "Did you say susi or shushi? Measuring the Emergence of Robust Fricative Contrasts in English- and Japanese-Acquiring Children"     
  • Wed-Ses1-P3 First and Second Language Acquisition, 10:00-12:00, Wednesday

Vahid Khanagha (INRIA Bordeaux Sud-Ouest)

    with Khalid Daoudi (INRIA Bordeaux Sud-Ouest), Oriol Pont (INRIA Bordeaux Sud-Ouest), and Hussein Yahia (INRIA Bordeaux Sud-Ouest) for the paper titled "A Novel Text-Independent Phonetic Segmentation Algorithm Based on the Microcanonical Multiscale Formalism"     
  • Tue-Ses3-O3 Speech and Audio Segmentation, 16:20-16:40, Tuesday

Marcel Kockmann (Brno University of Technology / SVOX)

    with Lukáš Burget (Brno University of Technology), Ondřej Glembek (Brno University of Technology), Luciana Ferrer (SRI International), and Jan “Honza” Černocký (Brno University of Technology) for the paper titled "Prosodic Speaker Verification Using Subspace Multinomial Models with Intersession Compensation"     
  • Tue-Ses2-O2 Speaker Characterization and Recognition II, 13:50-14:10, Tuesday

Christine Kühnel (Technische Universitat Berlin)

    with Benjamin Weiss (Technische Universitat Berlin), and Sebastian Möller (Technische Universitat Berlin) for the paper titled "Parameters Describing Multimodal Interaction ? Definitions and Three Usage Scenarios"     
  • Wed-Ses1-P4 Spoken Language Resources, Systems and Evaluation I, 10:00-12:00, Wednesday

Brian Langner (Carnegie Mellon University)

    with Stephan Vogel (Carnegie Mellon University), and Alan W Black (Carnegie Mellon University) for the paper titled "Evaluating a Dialog Language Generation System: Comparing the MOUNTAIN System to Other NLG Approaches"     
  • Tue-Ses2-O4 Speech Synthesis IV: Miscellaneous Topics, 13:50-14:10, Tuesday

Zihan Liu (Northwestern Polytechnical University)

    with Lei Xie (Northwestern Polytechnical University), and Wei Feng (City University of Hong Kong) for the paper titled "Maximum Lexical Cohesion for Fine-Grained News Story Segmentation"     
  • Tue-Ses2-P4 SLP Systems, 13:30-15:30, Tuesday

Christophe Mertens (Universite Libre de Bruxelles)

    with Francis Grenez (Universite Libre de Bruxelles), Lise Crevier-Buchman (Hopital Europeen Georges Pompidou), and Jean Schoentgen (National Fund for Scientific Research) for the paper titled "Reliable Tracking Based on Speech Sample Salience of Vocal Cycle Length Perturbations"     
  • Thu-Ses1-O2 Physiology and Pathology of Spoken Language, 10:00-10:20, Thursday

Okko Johannes Räsänen (Aalto University School of Science and Technology)

    for the paper titled "Fully Unsupervised Word Learning from Continuous Speech Using Transitional Probabilities of Atomic Acoustic Events"     
  • Thu-Ses2-O4 Modeling First Language Acquisition, 14:30-14:50, Thursday

Miaomiao Wen (The University of Tokyo)

    with Miaomiao Wang (The University of Tokyo), Keikichi Hirose (The University of Tokyo), and Nobuaki Minematsu (The University of Tokyo) for the paper titled "Improving Mandarin Segmental Duration Prediction with Automatically Extracted Syntax Features"     
  • Wed-Ses2-P3 Voice Conversion and Speech Synthesis, 13:30-15:30, Wednesday

Simon Wiesler (RWTH Aachen University)

    with Georg Heigold (RWTH Aachen University), Markus Nußbaum-Thom (RWTH Aachen University), Ralf Schlüter (RWTH Aachen University), and Hermann Ney (RWTH Aachen University) for the paper titled "A Discriminative Splitting Criterion for Phonetic Decision Trees"     
  • Mon-Ses2-O1 ASR: Acoustic Models I, 13:30-13:50, Monday

Lan-Ying Yeh (National Chiao Tung University)

    with Tai-Shih Chi (National Chiao Tung University) for the paper titled "Spectro-Temporal Modulations for Robust Speech Emotion Recognition"
  • Tue-Ses1-O4 Emotional Speech, 10:40-11:00, Tuesday
This page was last updated on 27-September-2010 3:00 UTC.top


Best Student Paper Award Winners

Joost van Doremalen (Radboud University Nijmegen)

Jeffrey J. Holliday (Ohio State University)

Christophe Mertens (Universite Libre de Bruxelles)

This page was last updated on 15-October-2010 3:00 UTC.top


Best Papers Invited to SPECOM/CSL

This year, the Technical Program Committee of INTERSPEECH 2010 selected the best papers in each area. The authors have been invited to submit expanded versions to SPECOM (Speech Communication) and CSL (Computer Speech and Language).

Detecting Categorical Perception in Continuous Discrimination Data

    Paul Boersma (University of Amsterdam)
    Katerina Chladkova (University of Amsterdam)
  • Mon-Ses2-O3: Speech Perception I: Factors Influencing Perception, 13:30-13:50, Monday

Locally-Weighted Regression for Estimating the Forward Kinematics of a Geometric Vocal Tract Model

    Adam Lammert (University of Southern California)
    Louis Goldstein (University of Southern California)
    Khalil Iskarous (Haskins Laboratories)
  • Tue-Ses3-P4: Speech Production II: Vocal Tract Modeling and Imaging, 16:00-18:00, Tuesday

Fully Unsupervised Word Learning from Continuous Speech Using Transition Probabilities of Atomic Acoustic Events

    Okko Johannes Räsänen (Aalto University School of Science and Technology)
  • Thu-Ses2-O4: Modeling first language acquisition, 14:30-14:50, Thursday

F0 Declination in English and Mandarin Broadcast News Speech

    Jiahong Yuan (University of Pennsylvania)
    Mark Liberman (University of Pennsylvania)
  • Mon-Ses2-O4: Prosody: Models, 14:10-14:30, Monday

Automatic Classification of Married Couples’ Behavior using Audio Features

    Matthew Black (University of Southern California)
    Athanasios Katsamanis (University of Southern California)
    Chi-Chun Lee (University of Southern California)
    Adam Lammert (University of Southern California)
    Brian Baucom (University of Southern California)
    Andrew Christensen (University of California)
    Panayiotis Georgiou (University of Southern California)
    Shrikanth Narayanan (University of Southern California)
  • Wed-Ses2-O4: Paralanguage & Cognition, 14:50-15:10, Wednesday

A Factorial Sparse Coder Model for Single Channel Source Separation

    Robert Peharz (Graz University of Technology)
    Michael Stark (Graz University of Technology)
    Franz Pernkopf (Graz University of Technology)
    Yannis Stylianou (University of Crete)
  • Mon-Ses3-O3: Source separation, 16:00-16:20, Monday

A Corpus-Based Approach to Speech Enhancement from Nonstationary Noise

    Ming Ji (Queen's University Belfast)
    Ramji Srinivasan (Queen's University Belfast)
    Danny Crookes (Queen's University Belfast)
  • Tue-Ses2-O3: Single-channel speech enhancement, 14:50-15:10, Tuesday

Simple and Efficient Speaker Comparison using Approximate KL Divergence

    William Campbell (MIT Lincoln Laboratory)
    Zahi Karam (MIT Lincoln Laboratory)
  • Mon-Ses3-O2: Speaker characterization and recognition I, 16:00-16:20, Monday

Context Adaptive Training with Factorized Decision Trees for HMM-Based Speech Synthesis

    Kai Yu (Cambridge University)
    Heiga Zen (Toshiba Research Europe Ltd.)
    Francois Mairesse (Cambridge University)
    Steve Young (Cambridge University)
  • Mon-Ses3-O4: Speech Synthesis II: HMM-based Speech Synthesis, 16:20-16:40, Monday

Boosting Systems for LVCSR

    George Saon (IBM T.J. Watson Research Center)
    Hagen Soltau (IBM T.J. Watson Research Center)
  • Tue-Ses3-O1: ASR: Acoustic Models II, 16:00-16:20, Tuesday

A MMSE Estimator in Mel-Cepstral Domain for Robust Large Vocabulary Automatic Speech Recognition using Uncertainty Propagation

    Ramón Fernández Astudillo (Technical University Berlin, Germany)
    Reinhold Orglmeister (Technical University Berlin, Germany)
  • Tue-Ses1-O1: Robust ASR, 10:20-10:40, Tuesday

Evaluation of a Silent Speech Interface Based on Magnetic Sensing

    Robin Hofe (University of Sheffield)
    Stephen R. Ell (University of Hull)
    Michael J. Fagan (University of Hull)
    James M. Gilbert (University of Hull)
    Phil D. Green (University of Sheffield)
    Roger K. Moore (University of Sheffield)
    Sergey I. Rybchenko (University of Hull)
  • Mon-Ses2-P3: Special-purpose speech applications, 13:30-15:30, Monday

Improving ASR-Based Topic Segmentation of TV Programs with Confidence Measures and Semantic Relations

    Camille Guinaudeau (INRIA/IRISA)
    Guillaume Gravier (IRISA/CNRS)
    Pascale Sébillot (IRISA/INSA)
  • Tue-Ses3-O2: Language Processing, 16:00-16:20, Tuesday

Direct Construction of Compact Context-Dependency Transducers from Data

    David Rybach (RWTH Aachen University)
    Michael Riley (Google Inc.)
  • Mon-Ses2-P2: ASR: Search, Decoding and Confidence Measures I, 13:30-15:30, Monday

Improved Topic Classification and Keyword Discovery Using an HMM-Based Speech Recognizer Trained without Supervision

    Man-Hung Siu (Raytheon BBN Technologies)
    Herbert Gish (Raytheon BBN Technologies)
    Arthur Chan (Raytheon BBN Technologies)
    William Belfield (Raytheon BBN Technologies)
  • Thu-Ses2-O1: New Paradigms in ASR II, 13:30-13:50, Thursday

Language Model Cross Adaptation for LVCSR System Combination

    Xunying Liu (Cambridge University)
    Mark Gales (Cambridge University)
    Phil Woodland (Cambridge University)
  • Mon-Ses3-O1: Systems for LVCSR, 16:20-16:40, Monday

Natural Belief-Critic: a Reinforcement Algorithm for Parameter Estimation in Statistical Spoken Dialogue Systems

    Filip Jurčíček (Cambridge University)
    Blaise Thomson (Cambridge University)
    Simon Keizer (Cambridge University)
    François Mairesse (Cambridge University)
    Milica Gašić (Cambridge University)
    Kai Yu (Cambridge University)
    Steve Young (Cambridge University)
  • Mon-Ses2-O2: Spoken dialogue systems I, 14:30-14:50, Monday

Phrase Alignment Confidence for Statistical Machine Translation

    Sankaranarayanan Ananthakrishnan (BBN Technologies)
    Rohit Prasad (BBN Technologies)
    Prem Natarajan (BBN Technologies)
  • Thu-Ses2-O2: Spoken Language Understanding and Spoken Language Translation II, 14:50-15:10, Thursday

Say It As You Mean It – Analyzing Free User Comments in the VOICE Awards Corpus

    Florian Gödde (Technische Universitat Berlin)
    Sebastian Möller (Technische Universitat Berlin)
  • Wed-Ses2-O2: Spoken language resources, systems and evaluation II, 14:30-14:50, Wednesday
This page was last updated on 12-October-2010 0:00 UTC.top