Applied Intelligence, Volume 37, Issue 3, October 2012, Pages 377-389

A novel split-and-merge algorithm for hierarchical clustering of Gaussian mixture models (Article)

  • (a) Faculty of Technical Sciences, University of Novi Sad, Novi Sad, Serbia
  • (b) Mathematical Institute, Serbian Academy of Sciences and Arts, Belgrade, Serbia
  • (c) Alfanum Speech Technologies, Novi Sad, Serbia

Abstract

The paper presents a novel split-and-merge algorithm for hierarchical clustering of Gaussian mixture models, which tends to improve on the locally optimal solution determined by the initial constellation. It is initialized with locally optimal parameters obtained from a baseline approach similar to k-means, and it tends to approach the global optimum of the target clustering function more closely by iteratively splitting and merging the clusters of Gaussian components produced by the baseline algorithm. The algorithm is further improved by introducing model selection in order to obtain the best possible trade-off between recognition accuracy and computational load in a Gaussian selection task applied within an actual recognition system. The proposed method is tested both on artificial data and in the framework of Gaussian selection performed within a real continuous speech recognition system, and in both cases an improvement over the baseline method has been observed. © 2011 Springer Science+Business Media, LLC.
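The abstract does not state the distance measure, the centroid update, or the split and merge criteria, so the following Python sketch is only an illustration of the general idea under stated assumptions: diagonal-covariance components, symmetric Kullback-Leibler divergence as the distance, moment-matched cluster centroids, merging the two closest clusters, and splitting the cluster with the largest distortion. All function names (`kl_diag`, `baseline`, `split_and_merge`, and so on) are hypothetical and are not taken from the paper.

```python
import numpy as np

def kl_diag(m0, v0, m1, v1):
    """KL(N0 || N1) for diagonal Gaussians; v0 and v1 are variance vectors."""
    return 0.5 * np.sum(np.log(v1 / v0) + (v0 + (m0 - m1) ** 2) / v1 - 1.0, axis=-1)

def sym_kl(m0, v0, m1, v1):
    """Symmetric KL divergence (assumed distance, not confirmed by the source)."""
    return kl_diag(m0, v0, m1, v1) + kl_diag(m1, v1, m0, v0)

def centroid(means, vars_):
    """Moment-matched Gaussian representing equally weighted members."""
    m = means.mean(axis=0)
    v = (vars_ + (means - m) ** 2).mean(axis=0)
    return m, v

def assign(means, vars_, cm, cv):
    """Nearest-centroid labels and the total clustering cost."""
    d = sym_kl(means[:, None], vars_[:, None], cm[None], cv[None])  # shape (n, K)
    return d.argmin(axis=1), d.min(axis=1).sum()

def baseline(means, vars_, cm, cv, iters=20):
    """k-means-like baseline; updates cm and cv in place."""
    for _ in range(iters):
        labels, _ = assign(means, vars_, cm, cv)
        for k in range(len(cm)):
            if np.any(labels == k):
                cm[k], cv[k] = centroid(means[labels == k], vars_[labels == k])
    labels, cost = assign(means, vars_, cm, cv)
    return labels, cm, cv, cost

def split_and_merge(means, vars_, cm, cv, rounds=10):
    """Refine the baseline result; a move is kept only if the cost decreases."""
    assert len(cm) >= 3, "this sketch needs at least three clusters"
    labels, cm, cv, cost = baseline(means, vars_, cm.copy(), cv.copy())
    for _ in range(rounds):
        # Merge candidates: the two closest centroids.
        pair = sym_kl(cm[:, None], cv[:, None], cm[None], cv[None])
        np.fill_diagonal(pair, np.inf)
        i, j = np.unravel_index(pair.argmin(), pair.shape)
        # Split candidate: the remaining cluster with the largest distortion.
        dist = sym_kl(means, vars_, cm[labels], cv[labels])
        per = np.array([dist[labels == k].sum() for k in range(len(cm))])
        per[[i, j]] = -np.inf
        s = per.argmax()
        members = np.flatnonzero(labels == s)
        if len(members) < 2:
            break
        new_cm, new_cv = cm.copy(), cv.copy()
        merged = (labels == i) | (labels == j)
        if merged.any():
            new_cm[i], new_cv[i] = centroid(means[merged], vars_[merged])
        # Reuse slot j as a new centroid seeded by the member farthest from s.
        far = members[dist[members].argmax()]
        new_cm[j], new_cv[j] = means[far], vars_[far]
        cand = baseline(means, vars_, new_cm, new_cv)
        if cand[3] >= cost:
            break
        labels, cm, cv, cost = cand
    return labels, cm, cv, cost

# Synthetic demo: three groups of components and a deliberately poor start
# (two initial centroids in the first group, none in the third).
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [5.0, 5.0], [10.0, 0.0]])
means = np.repeat(centers, 20, axis=0) + 0.3 * rng.standard_normal((60, 2))
vars_ = rng.uniform(0.5, 1.5, size=(60, 2))
init_m, init_v = means[[0, 1, 20]], vars_[[0, 1, 20]]

_, _, _, base_cost = baseline(means, vars_, init_m.copy(), init_v.copy())
labels, _, _, sm_cost = split_and_merge(means, vars_, init_m, init_v)
print("baseline cost:", base_cost, "split-and-merge cost:", sm_cost)
```

Because a split-and-merge move is accepted only when it lowers the cost, the refined cost in this sketch can never exceed the baseline cost. The model selection step mentioned in the abstract is not covered here.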

Author keywords

Continuous speech recognition; Gaussian mixtures; Hierarchical clustering; Split-and-merge operation

Indexed keywords

Engineering controlled terms: Continuous speech recognition; Economic and social effects; Gaussian distribution; Iterative methods
Engineering uncontrolled terms: Gaussian Mixture Model; Gaussian mixtures; Hierarchical clustering; Local optimal solution; Recognition accuracy; Recognition systems; Split-and-merge operations; Splitting and merging
Engineering main heading: Clustering algorithms

Funding details

Funding sponsor | Funding number | Acronym
Ministarstvo Prosvete, Nauke i Tehnološkog Razvoja | TR 32035 | MPNTR

    Acknowledgements: This research has been supported by the Serbian Ministry of Education and Science and was carried out as part of the research project “Development of Dialogue Systems for Serbian and Other South Slavic Languages” (id TR 32035).

  • ISSN: 0924669X
  • CODEN: APITE
  • Source Type: Journal
  • Original language: English
  • DOI: 10.1007/s10489-011-0333-9
  • Document Type: Article
  • Publisher: Kluwer Academic Publishers

  Popović, B.; Faculty of Technical Sciences, University of Novi Sad, Serbia
© Copyright 2019 Elsevier B.V., All rights reserved.

Cited by 10 documents

Do-Duc, H. , Chau-Thanh, D. , Tran-Thai, S.
A New Algorithm for Speech Feature Extraction Using Polynomial Chirplet Transform
(2024) Circuits, Systems, and Signal Processing
Zhou, Y. , Song, Y. , Zhang, Q.
A clustering-based simplification of massive automobile-bodies point cloud for lightweight design
(2022) International Journal of Vehicle Design
Delić, V. , Borovac, B. , Gnjatović, M.
Toward more expressive speech communication in human-robot interaction
(2018) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)