Gaussian Mixture Model for speech recognition

Hi all! I'm implementing a tool for speech recognition (command based).
My training data consist of 21 recordings (7 different commands, 3 utterances each). So far I have done:
  • the pre-processing phase (silence removal and end-point detection)
  • the feature extraction phase (MFCC calculation).
So, for every utterance in my training set, I have an MFCC matrix with 12 columns (one per MFCC) and as many rows as the number of frames the signal was divided into.
For the recognition phase, I was thinking of using the gmdistribution class.
I read this article:
% model = gmdistribution.fit(MFCCtraindata,M);
What is the MFCCtraindata parameter?
Is it the MFCC matrix associated with each utterance?
For each command I have 3 utterances, so I have 3 different MFCC matrices.
How can I create a single GMM per command if fitting each utterance separately would give me 3 different GMMs?
Any kind of help will be appreciated.
Thank you!!
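
One possible reading of the question, sketched under stated assumptions: the first argument of gmdistribution.fit is a matrix with one observation per row, so the frames of all three utterances of a command could be stacked row-wise into one matrix, and a single GMM fitted per command. The sketch below uses fitgmdist, the newer equivalent of gmdistribution.fit in the Statistics and Machine Learning Toolbox. The random matrices standing in for real MFCC features, the value M = 2, the diagonal covariances, and the regularization value are illustrative assumptions and do not come from the original post.

```matlab
% Sketch only: random matrices stand in for real MFCC features.
rng(0);                       % reproducible synthetic data
numCommands   = 7;            % from the post: 7 commands
numUtterances = 3;            % from the post: 3 utterances per command
numCoeffs     = 12;           % from the post: 12 MFCCs per frame
M             = 2;            % number of mixture components (assumed)

models = cell(numCommands, 1);
for c = 1:numCommands
    % Hypothetical stand-in for the three MFCC matrices of command c.
    % Each is (frames x 12); the frame count may differ per utterance.
    utterances = cell(numUtterances, 1);
    for u = 1:numUtterances
        numFrames = 80 + 10*u;
        utterances{u} = randn(numFrames, numCoeffs) + c;  % mean shifted by c
    end

    % Stack the frames of all utterances row-wise: one matrix per command.
    MFCCtraindata = vertcat(utterances{:});

    % Fit one GMM per command. Diagonal covariances and a small
    % regularization term are assumptions that suit small training sets.
    models{c} = fitgmdist(MFCCtraindata, M, ...
        'CovarianceType', 'diagonal', ...
        'RegularizationValue', 1e-3, ...
        'Options', statset('MaxIter', 500));
end

% Recognition: score a test utterance against every command model.
testMFCC = randn(100, numCoeffs) + 4;   % synthetic utterance near command 4
negLogLik = zeros(numCommands, 1);
for c = 1:numCommands
    % Second output of posterior is the negative log-likelihood of the data.
    [~, negLogLik(c)] = posterior(models{c}, testMFCC);
end
[~, recognized] = min(negLogLik);       % most likely command index
disp(recognized);
```

With real data, utterances{u} would hold the MFCC matrix of each recorded utterance, and the recognized command would be the one whose model gives the lowest negative log-likelihood for the test utterance. With only three utterances per command, a small M is probably the safer choice.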

Answers (5)

Castalia on 8 Mar 2013
Could anybody give me some advice, please?

Rania Ziedan on 22 Oct 2015
I really need help with the same issue. If you solved it, could you help me? Thanks in advance.

MUZITIANXINJIE on 26 Jun 2016
Yes, I want that too, but no one has helped me! I really need to use deep learning to classify voices for recognition. Thanks for your help.

yasir riaz on 21 Dec 2016
Please help.

hanieh rafiee on 19 Feb 2017
Hi. Did you receive an answer to your question? Could you help me, please?
