

The paper presents a novel split-and-merge algorithm for hierarchical clustering of Gaussian mixture models, which tends to improve on the local optimal solution determined by the initial constellation. It is initialized by local optimal parameters obtained by using a baseline approach similar to k-means, and it tends to approach more closely to the global optimum of the target clustering function, by iteratively splitting and merging the clusters of Gaussian components obtained as the output of the baseline algorithm. The algorithm is further improved by introducing model selection in order to obtain the best possible trade-off between recognition accuracy and computational load in a Gaussian selection task applied within an actual recognition system. The proposed method is tested both on artificial data and in the framework of Gaussian selection performed within a real continuous speech recognition system, and in both cases an improvement over the baseline method has been observed.. © 2011 Springer Science+Business Media, LLC.
| Engineering controlled terms: | Continuous speech recognitionEconomic and social effectsGaussian distributionIterative methods |
|---|---|
| Engineering uncontrolled terms | Gaussian Mixture ModelGaussian mixturesHier-archical clusteringLocal optimal solutionRecognition accuracyRecognition systemsSplit-and-merge operationsSplitting and merging |
| Engineering main heading: | Clustering algorithms |
| Funding sponsor | Funding number | Acronym |
|---|---|---|
| Ministarstvo Prosvete, Nauke i Tehnološkog Razvoja | TR 32035 | MPNTR |
Acknowledgements This research work has been supported by the Serbian Ministry of Education and Science, and it has been realized as a part of “Development of Dialogue Systems for Serbian and Other South Slavic Languages” research project (id TR 32035).
Popovíc, B.; Faculty of Technical Sciences, University of Novi Sad, Serbia;
© Copyright 2019 Elsevier B.V., All rights reserved.