外文翻译--基于仿生模式识别的非特定人连续语音识别系统
《外文翻译--基于仿生模式识别的非特定人连续语音识别系统》由会员分享,可在线阅读,更多相关《外文翻译--基于仿生模式识别的非特定人连续语音识别系统(9页珍藏版)》请在毕设资料网上搜索。
1、第 1 页 中文 3276 字 附录 英文原文 : Chinese Journal of Electronics Vo1.15,No.3,July 2006 A Speaker-Independent Continuous Speech Recognition System Using Biomimetic Pattern Recognition WANG Shoujue and QIN Hong (Laboratory of Artificial Neural Networks, Institute ol Semiconductors, Chinese Academy Sciences, B
2、eijing 100083, China) AbstractIn speaker-independent speech recognition,the disadvantage of the most diffused technology(HMMs, or Hidden Markov models)is not only the need of many more training samples, but also long train time requirement. This Paper describes the use of Biomimetic pattern recognit
3、ion(BPR)in recognizing some mandarin continuous speech in a speaker-independent Manner. A speech database was developed for the course of study The vocabulary of the database consists of 15 Chinese dishs names, the length of each name is 4 Chinese words Neural networks(NNs)based on Multi-weight neur
4、on(MWN) model are used to train and recognize the speech sounds The number of MWN was investigated to achieve the optimal performance of the NNs-based BPR.This system, which is based on BPR and can carry out real time recognition reaches a recognition rate of 98.14 for the first option and 99.81 for
5、 the first two options to the Persons from different provinces of China speaking common Chinese speech Experiments were also carried on to evaluate Continuous density hidden Markov models(CDHMM ),Dynamic time warping(DTW)and BPR for speech recognition The Experiment results show that BPR outperforms
6、 CDHMM and DTW especially in the cases of samples of a finite size Key wordsBiomimetic pattern recognition, Speech recogniton,Hidden Markov models(HMMs),Dynamic time warping(DTW) I Introduction The main goal of Automatic speech recognition(ASR)is to produce a system which will recognize accurately n
7、ormal human speech from any speaker The recognition system may be classified as speaker-dependent or speaker-independent The speaker dependence requires that the system be personally trained with the speech of the person that will be involved with its operation in order to achieve a high recognition
8、 rate For applications on the public facilities, on the other hand, the system must be capable of recognizing the speech uttered by many different people, with different gender, age, accent,etc.,the speaker independence has many more applications, primarily in the general area of public facilities T
9、he most diffused technology in speaker-independent speech recognition is Hidden Markov Models, the disadvantage of it is not only the need of many more training samples, but also long train time requirement Since Biomimetic pattern recognition(BPR) was first proposed by Wang Shoujue, it has already
10、been applied to object recognition, face identification and face recognition etc.,and achieved much better performance With some adaptations, such modeling techniques could be easily used within speech recognition 第 2 页 too In this paper, a real-time mandarin speech recognition system based on BPR i
11、s proposed, which outperforms HMMs especially in the cases of samples of a finite size The system is a small vocabulary speaker independent continuous speech recognition one. The whole system is implemented on the PC under windows98 2000 XP environment with CASSANN-II neurocomputer.It supports stand
12、ard 16-bit sound card II Introduction of Biomimetic Pattern Recognition and Multi Weights Neuron Networks 1 Biomimetic pattern recognition Traditional Pattern Recognition aims at getting the optimal classification of different classes of sample in the feature space However, the BPR intends to find t
13、he optimal coverage of the samples of the same type. It is from the Principle of HomologyContinuity, that is to say, if there are two samples of the same class, the difference between them must be gradually changed So a gradual change sequence must be exists between the two samples. In BPR theory th
14、e construction of the sample subspace of each type of samples depends only on the type itself More detailedly, the construction of the subspace of a certain type of samples depends on analyzing the relations between the trained types of samples and utilizing the methods of “coverage of objects with
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中设计图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- 外文 翻译 基于 仿生 模式识别 非特 连续 语音 识别 辨认 系统
