Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods / Edition 1

Hardcover (Print)
Buy New
Buy New from
Used and New from Other Sellers
Used and New from Other Sellers
from $40.07
Usually ships in 1-2 business days
(Save 72%)
Other sellers (Hardcover)
  • All (8) from $40.07   
  • New (5) from $40.07   
  • Used (3) from $64.96   


This book discusses large margin and kernel methods for speech and speaker recognition

Speech and Speaker Recognition: Large Margin and Kernel Methods is a collation of research in the recent advances in large margin and kernel methods, as applied to the field of speech and speaker recognition. It presents theoretical and practical foundations of these methods, from support vector machines to large margin methods for structured learning. It also provides examples of large margin based acoustic modelling for continuous speech recognizers, where the grounds for practical large margin sequence learning are set. Large margin methods for discriminative language modelling and text independent speaker verification are also addressed in this book.

Key Features:

  • Provides an up-to-date snapshot of the current state of research in this field
  • Covers important aspects of extending the binary support vector machine to speech and speaker recognition applications
  • Discusses large margin and kernel method algorithms for sequence prediction required for acoustic modeling
  • Reviews past and present work on discriminative training of language models, and describes different large margin algorithms for the application of part-of-speech tagging
  • Surveys recent work on the use of kernel approaches to text-independent speaker verification, and introduces the main concepts and algorithms
  • Surveys recent work on kernel approaches to learning a similarity matrix from data

This book will be of interest to researchers, practitioners, engineers, and scientists in speech processing and machine learning fields.

Read More Show Less

Product Details

  • ISBN-13: 9780470696835
  • Publisher: Wiley
  • Publication date: 2/24/2009
  • Edition number: 1
  • Pages: 268
  • Product dimensions: 6.80 (w) x 9.80 (h) x 0.80 (d)

Meet the Author

Dr Joseph Keshet, IDIAP, Switzerland

Dr Keshet  received his B.Sc. and M.Sc. in electrical engineering from the Tel-Aviv University, Tel-Aviv, Israel, in 1994 and 2002, respectively. He got his Ph.D. from the Hebrew University of Jerusalem, Israel in 2007. From 1994 to 2002, he was with the Israeli Defense Forces (Intelligence Corps), where he was in charge of advanced research activities in the fields of speech coding. Since 2007, he is a research scientist in speech recognition at IDIAP Research Institute, Martigny, Switzerland.

Dr Samy Bengio, Google, California, US

Dr Bengio received his M.Sc. and Ph.D. degrees in Computer Science from University of Montreal in 1989 and 1993 respectively. Between 1999 and 2006, he was a senior researcher in statistical machine learning at IDIAP Research Institute, where he supervised PhD students and postdoctoral fellows working on many areas of machine learning. He is the author/co-author of more than 160 international publications, including 30 journal papers. He has organized several international workshops (such as the MLMI series) and been in the organization committee of several well known conferences (such as NIPS). Since early 2007, he is a research scientist in machine learning at Google, in Mountain View, California.

Read More Show Less

Table of Contents

List of Contributors.


I Foundations.

1 Introduction (Samy Bengio and Joseph Keshet).

1.1 The Traditional Approach to Speech Processing.

1.2 Potential Problems of the Probabilistic Approach.

1.3 Support Vector Machines for Binary Classification.

1.4 Outline.


2 Theory and Practice of Support Vector Machines Optimization (Shai Shalev-Shwartz and Nathan Srebo).

2.1 Introduction.

2.2 SVM and L2-regularized Linear Prediction.

2.3 Optimization Accuracy From a Machine Learning Perspective.

2.4 Stochastic Gradient Descent.

2.5 Dual Decomposition Methods.

2.6 Summary.


3 From Binary Classification to Categorial Prediction (Koby Crammer).

3.1 Multi-category Problems.

3.2 Hypothesis Class.

3.3 Loss Functions.

3.4 Hinge Loss Functions.

3.5 A Generalized Perceptron Algorithm.

3.6 A Generalized Passive–Aggressive Algorithm.

3.7 A Batch Formulation.

3.8 Concluding Remarks.

3.9 Appendix. Derivations of the Duals of the Passive–Aggressive Algorithm and the Batch Formulation.


II Acoustic Modeling.

4 A Large Margin Algorithm for Forced Alignment (Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer and Dan Chazan).

4.1 Introduction.

4.2 Problem Setting.

4.3 Cost and Risk.

4.4 A Large Margin Approach for Forced Alignment.

4.5 An Iterative Algorithm.

4.6 Efficient Evaluation of the Alignment Function.

4.7 Base Alignment Functions.

4.8 Experimental Results.

4.9 Discussion.


5 A Kernel Wrapper for Phoneme Sequence Recognition (Joseph Keshet and Dan Chazan).

5.1 Introduction.

5.2 Problem Setting.

5.3 Frame-based Phoneme Classifier.

5.4 Kernel-based Iterative Algorithm for Phoneme Recognition.

5.5 Nonlinear Feature Functions.

5.6 Preliminary Experimental Results.

5.7 Discussion: Canwe Hope for Better Results?


6 Augmented Statistical Models: Using Dynamic Kernels for Acoustic Models (Mark J. F. Gales).

6.1 Introduction.

6.2 Temporal Correlation Modeling.

6.3 Dynamic Kernels.

6.4 Augmented Statistical Models.

6.5 Experimental Results.

6.6 Conclusions.



7 Large Margin Training of Continuous Density Hidden Markov Models (Fei Sha and Lawrence K. Saul).

7.1 Introduction.

7.2 Background.

7.3 Large Margin Training.

7.4 Experimental Results.

7.5 Conclusion.


III Language Modeling.

8 A Survey of Discriminative Language Modeling Approaches for Large Vocabulary Continuous Speech Recognition (Brian Roark).

8.1 Introduction.

8.2 General Framework.

8.3 Further Developments.

8.4 Summary and Discussion.


9 Large Margin Methods for Part-of-Speech Tagging (Yasemin Altun).

9.1 Introduction.

9.2 Modeling Sequence Labeling.

9.3 Sequence Boosting.

9.4 Hidden Markov Support Vector Machines.

9.5 Experiments.

9.6 Discussion.


10 A Proposal for a Kernel Based Algorithm for Large Vocabulary Continuous Speech Recognition (Joseph Keshet).

10.1 Introduction.

10.2 Segment Models and Hidden Markov Models.

10.3 Kernel Based Model.

10.4 Large Margin Training.

10.5 Implementation Details.

10.6 Discussion.



IV Applications.

11 Discriminative Keyword Spotting (David Grangier, Joseph Keshet and Samy Bengio).

11.1 Introduction.

11.2 Previous Work.

11.3 Discriminative Keyword Spotting.

11.4 Experiments and Results.

11.5 Conclusions.



12 Kernel-based Text-independent Speaker Verification (Johnny Mariéthoz, Samy Bengio and Yves Grandvalet).

12.1 Introduction.

12.2 Generative Approaches.

12.3 Discriminative Approaches.

12.4 Benchmarking Methodology.

12.5 Kernels for Speaker Verification.

12.6 Parameter Sharing.

12.7 Is the Margin Useful for This Problem?

12.8 Comparing all Methods.

12.9 Conclusion.


13 Spectral Clustering for Speech Separation (Francis R. Bach and Michael I. Jordan).

13.1 Introduction.

13.2 Spectral Clustering and Normalized Cuts.

13.3 Cost Functions for Learning the Similarity Matrix.

13.4 Algorithms for Learning the Similarity Matrix.

13.5 Speech Separation as Spectrogram Segmentation.

13.6 Spectral Clustering for Speech Separation.

13.7 Conclusions.

References .


Read More Show Less

Customer Reviews

Be the first to write a review
( 0 )
Rating Distribution

5 Star


4 Star


3 Star


2 Star


1 Star


Your Rating:

Your Name: Create a Pen Name or

Barnes & Review Rules

Our reader reviews allow you to share your comments on titles you liked, or didn't, with others. By submitting an online review, you are representing to Barnes & that all information contained in your review is original and accurate in all respects, and that the submission of such content by you and the posting of such content by Barnes & does not and will not violate the rights of any third party. Please follow the rules below to help ensure that your review can be posted.

Reviews by Our Customers Under the Age of 13

We highly value and respect everyone's opinion concerning the titles we offer. However, we cannot allow persons under the age of 13 to have accounts at or to post customer reviews. Please see our Terms of Use for more details.

What to exclude from your review:

Please do not write about reviews, commentary, or information posted on the product page. If you see any errors in the information on the product page, please send us an email.

Reviews should not contain any of the following:

  • - HTML tags, profanity, obscenities, vulgarities, or comments that defame anyone
  • - Time-sensitive information such as tour dates, signings, lectures, etc.
  • - Single-word reviews. Other people will read your review to discover why you liked or didn't like the title. Be descriptive.
  • - Comments focusing on the author or that may ruin the ending for others
  • - Phone numbers, addresses, URLs
  • - Pricing and availability information or alternative ordering information
  • - Advertisements or commercial solicitation


  • - By submitting a review, you grant to Barnes & and its sublicensees the royalty-free, perpetual, irrevocable right and license to use the review in accordance with the Barnes & Terms of Use.
  • - Barnes & reserves the right not to post any review -- particularly those that do not follow the terms and conditions of these Rules. Barnes & also reserves the right to remove any review at any time without notice.
  • - See Terms of Use for other conditions and disclaimers.
Search for Products You'd Like to Recommend

Recommend other products that relate to your review. Just search for them below and share!

Create a Pen Name

Your Pen Name is your unique identity on It will appear on the reviews you write and other website activities. Your Pen Name cannot be edited, changed or deleted once submitted.

Your Pen Name can be any combination of alphanumeric characters (plus - and _), and must be at least two characters long.

Continue Anonymously

    If you find inappropriate content, please report it to Barnes & Noble
    Why is this product inappropriate?
    Comments (optional)