Automatic speech-to-text conversion
Automate labeling and tagging of speech recordings, assess the performance of DSP pipelines for voice and speech enhancement, run text analytics on voice recordings, and more. This entry enables you
- 8.3K (All time)
- 13 (Last 30 days)
- 4.4 / 5
- Community
21 Dec 2025
Automatic text-to-speech synthesis
Convert text into human-like speech in a variety of voices and languages. This entry enables you to synthesize strings into sampled speech recordings available as MATLAB vectors using a single
- 2.2K (All time)
- 5 (Last 30 days)
- 4.0 / 5
- Community
21 Dec 2025
text-to-speech, speech synthesis, tts, let Matlab speak
TTS text to speech. TTS(TXT) synthesizes speech from the string TXT and speaks it. The audio format is mono, 16-bit, 16 kHz by default. WAV = TTS(TXT) does not vocalize but outputs to the
- 13.5K (All time)
- 3 (Last 30 days)
- 4.7 / 5
- Community
26 Dec 2007
Silence removal in speech signals
A simple method for silence removal in speech streams
This is a simple method for silence removal and segmentation of audio streams that contain speech. The method is based on two simple audio features (signal energy and spectral centroid). As long as
- 9.8K (All time)
- 2 (Last 30 days)
- 4.2 / 5
- Community
18 Mar 2014
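The two features named in this entry, short-time signal energy and spectral centroid, can be illustrated with a small sketch. This is not the submission's code: it is written in plain Python rather than MATLAB, the function names `frame_energy` and `spectral_centroid` are hypothetical, and the naive DFT is chosen for readability rather than speed.

```python
import cmath
import math


def frame_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def spectral_centroid(frame, fs):
    """Magnitude-weighted mean frequency in Hz over bins 0..N/2 (naive DFT)."""
    n = len(frame)
    weighted = 0.0
    total = 0.0
    for k in range(n // 2 + 1):
        # DFT coefficient for bin k, computed directly from the definition.
        xk = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        mag = abs(xk)
        weighted += (k * fs / n) * mag
        total += mag
    # An all-zero frame has no defined centroid; return 0.0 by convention.
    return weighted / total if total > 0 else 0.0


# Illustrative frames (assumed values): a 1 kHz tone and digital silence.
fs = 8000
n = 64
tone = [math.sin(2 * math.pi * 1000 * t / fs) for t in range(n)]
silence = [0.0] * n
```

The listing does not state how the method thresholds these features, so that step is left out here.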
Converts text to speech.
Any text is spoken. Get started: 1. Add the text2speech folder to your MATLAB path. 2. Test your new function: tts('This is a test.') Examples: Casual chat. tts('Hi - how are you?'); tts({'Hello. How
- 11.1K (All time)
- 3 (Last 30 days)
- 4.3 / 5
- Community
25 Jan 2011
Pronounces a number (from 1 to 999 only) that the user inputs at run time, with an animation.
- 326 (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
7 Aug 2013
Speech recognition using MFCC and LPC
This program implements basic speech recognition for 6 symbols using MFCC and LPC.
- 13K (All time)
- 3 (Last 30 days)
- 4.6 / 5
- Community
26 Apr 2012
Superimpose multiple semitransparent images with individual colormaps on the current axis.
- 1.2K (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
3 Mar 2012
The exercise tries to separate the main properties of the speech excitation function from those of the vocal tract.
The goal of this MATLAB Exercise is to try to separate out the main properties (primarily pitch and intensity) of the speech excitation function (as estimated using LPC analysis) from the properties
- 1.6K (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
3 Jun 2015
Speech compression using Linear Predictive Coding
A lossy speech compression algorithm.
LPC is the oldest and the most basic of modern speech coders. It's a lossy scheme. Playback quality isn't preserved in the process, but it can be used in low bit-rate systems.
- 16.4K (All time)
- 3 (Last 30 days)
- 4.3 / 5
- Community
29 Jan 2007
speech signal framing
This program divides the speech signal into a number of frames (about 240) and displays a specified frame, selected by its number, together with the original speech signal.
- 1.7K (All time)
- 1 (Last 30 days)
- 4.3 / 5
- Community
24 Jan 2013
This exercise shows how the method of linear predictive coding (LPC) models a speech frame.
This MATLAB exercise computes the log magnitude of the STFT of a specified frame of speech. Then, using the same frame of speech, the exercise computes LPC log spectral matches to the speech frame
- 1.4K (All time)
- 3 (Last 30 days)
- 5.0 / 5
- Community
2 Jun 2015
Shows the (flat spectrum) nature of the LPC error signal for a typical speech frame (voiced speech)
Speech processing designates a team consisting of Prof. Lawrence Rabiner (Rutgers University and University of California, Santa Barbara), Prof. Ronald Schafer (Stanford University), Kirty Vedula and
- 1.3K (All time)
- 2 (Last 30 days)
- 5.0 / 5
- Community
2 Jun 2015
Plots of Harmonic Product Spectrum (HPS) and log HPS of a running sequence of frames.
- 290 (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
11 Sep 2015
Isolated Words Speech Recognition
Basic program to recognize a word based on the analysis of its energy.
- 5.8K (All time)
- 2 (Last 30 days)
- 4.5 / 5
- Community
28 Aug 2009
Student Competition : Code Generation Training
Files associated with the Student Competition : Code Generation Training
- 1.5K (All time)
- 6 (Last 30 days)
- 5.0 / 5
- Community
16 Oct 2020
- 496 (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
24 Jun 2020
Record your own speech file to use for other exercises.
A MATLAB exercise that uses the file read command to read in an existing speech file, the record function to record a speech signal, and the file save command to save the results in a designated file
- 1.7K (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
3 Jun 2015
- 1.8K (All time)
- 2 (Last 30 days)
- 5.0 / 5
- Community
24 Dec 2016
A set of speech files used for the speech processing exercises.
A set of speech files used for the speech processing exercises. This folder MUST be in the same folder as all the other exercises.
- 9.6K (All time)
- 10 (Last 30 days)
- 4.8 / 5
- Community
29 Jan 2014
This exercise computes the frequency response of a p-tube model of a human vocal tract.
- 977 (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
23 Jun 2015
This exercise computes the short-time average magnitude difference function (AMDF) of a speech frame
This MATLAB exercise calculates and displays the AMDF of a frame of speech from a designated speech file and implements a pitch detection algorithm based on using the AMDF on a frame-by-frame basis
- 2.1K (All time)
- 1 (Last 30 days)
- 4.8 / 5
- Community
23 Jun 2015
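As a rough illustration of the idea in this entry, the sketch below computes the AMDF of one frame and takes the lag with the smallest value as the pitch period. It is not the exercise's MATLAB code: it is plain Python, the names `amdf` and `pitch_from_amdf` and the 60 to 400 Hz search range are assumptions, and it handles a single frame rather than frame-by-frame tracking.

```python
import math


def amdf(frame, max_lag):
    """Average magnitude difference function for lags 1..max_lag."""
    n = len(frame)
    values = []
    for lag in range(1, max_lag + 1):
        diffs = [abs(frame[t] - frame[t + lag]) for t in range(n - lag)]
        values.append(sum(diffs) / len(diffs))
    return values


def pitch_from_amdf(frame, fs, f_min=60.0, f_max=400.0):
    """Estimate pitch (Hz) as fs divided by the lag that minimizes the AMDF.

    The search range f_min..f_max is an assumed, typical speech range.
    """
    lag_min = int(fs / f_max)
    lag_max = int(fs / f_min)
    values = amdf(frame, lag_max)
    # values[lag - 1] holds the AMDF at that lag.
    best_lag = min(range(lag_min, lag_max + 1), key=lambda lag: values[lag - 1])
    return fs / best_lag


# Synthetic "voiced" frame (assumed): 100 Hz fundamental plus one harmonic.
fs = 8000
frame = [
    math.sin(2 * math.pi * 100 * t / fs) + 0.5 * math.sin(2 * math.pi * 200 * t / fs)
    for t in range(400)
]
pitch = pitch_from_amdf(frame, fs)
```

The AMDF dips toward zero at lags equal to the pitch period, which is why the minimum, not the maximum, is searched.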
This MATLAB exercise implements a phase vocoder.
This MATLAB exercise implements a phase vocoder with the capability of speed-up or slow-down of a speech or audio signal by a factor, r, which varies from r = 0.25 (slow-down by factor of 4) to r =
- 2.3K (All time)
- 2 (Last 30 days)
- 5.0 / 5
- Community
23 Jun 2015
- 703 (All time)
- 5 (Last 30 days)
- 5.0 / 5
- Community
10 Dec 2015
G.723.1 Speech Coder and Decoder
Matlab implementation of ITU-T G.723.1 speech coder and decoder
This package implements the ITU-T G.723.1 speech coder and decoder in Matlab. The goal of the package is to provide a well-documented and modular program that was designed to facilitate
- 7.6K (All time)
- 2 (Last 30 days)
- 5.0 / 5
- Community
8 Dec 2020
speech processing tool
Speech analysis and parameter extraction. Short-term analysis, frames and windows. Time-domain analysis: energy, zero-crossings, statistic parameters, autocorrelation. Frequency-domain analysis: spectra
- 9K (All time)
- 2 (Last 30 days)
- 3.6 / 5
- Community
31 Mar 2016
Directly Extract Part Of Speech (POS) Information from MeCab
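The functions in this repository let you extract the direct output of the MeCab tokenizer. They are wrapper functions for directly reading the part-of-speech classifications of the MeCab morphological analyzer.
Speech) are narrowed down to 15 types. When selecting words by part-of-speech information, it seemed a real waste not to be able to use the fine-grained part-of-speech information (69 types) that MeCab provides, so I wrote a simple function. tokenizedDocumentJP.m: a function that performs morphological analysis. It can be used in the same way as calling the regular `tokenizedDocument`. However, the function `normalizeWords` cannot be used on the output `tokenizedDocument` object to obtain base forms. Also, in the morphological analysis options `mecabOptions`, the `LemmaExtractor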
- 78 (All time)
- 4 (Last 30 days)
- 5.0 / 5
- Community
7 Jul 2020
Devices are controlled with speech using the parallel port.
The purpose of speech is communication. The area of speech processing is just developing, and shows tremendous potential for widespread use in the future. In this project we have
- 10.9K (All time)
- 1 (Last 30 days)
- 4.5 / 5
- Community
11 Sep 2007
Speech emotion recognition based on frequency parameters
Speech emotion recognition based on frequency parameters; the emotions are happy, sad, laugh, and anger.
Speech emotion recognition based on frequency parameters; the emotions are happy, sad, laugh, and anger. For questions, contact www.jitectechnolgies.in, WhatsApp +91 9994444414
- 553 (All time)
- 3 (Last 30 days)
- 5.0 / 5
- Community
19 Dec 2018
Pick & Place application by integrating Matlab & ROS
ROS-Industrial Consortium and advanced functions such as image recognition with deep learning, inverse kinematics, trajectory planning, and speech recognition provided by MATLAB.
- 467 (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
15 Jul 2019
- 1.6K (All time)
- 2 (Last 30 days)
- 4.0 / 5
- Community
14 Jul 2014
A program to generate speech-spectrum-shaped noise for audiological applications
This is a program to generate power-spectrum-matched noise. It is primarily intended for audiologists to generate their own speech-shaped noises based on an existing speech corpus (Multiple speech
- 805 (All time)
- 4 (Last 30 days)
- 4.3 / 5
- Community
1 Mar 2016
Companion material for the book "Introduction to Audio Analysis, A MATLAB approach"
- 13.7K (All time)
- 8 (Last 30 days)
- 4.5 / 5
- Community
18 Mar 2014
Audio Toolbox Interface for SpeechBrain and Torchaudio Libraries
Deep Learning models supporting Audio Toolbox AI-powered functions for speech and audio signal processing
The Audio Toolbox Interface for SpeechBrain and Torchaudio Libraries enables the use of a collection of AI-powered speech processing functions in Audio Toolbox™ for automatic speech recognition (ASR
- 1.3K (All time)
- 149 (Last 30 days)
- -- / 5
- MathWorks
26 Jan 2026
- 8.3K (All time)
- 2 (Last 30 days)
- 5.0 / 5
- Community
25 Apr 2018
Waveform Similarity and Overlap Add (WSOLA) for Speech and Audio
Implements the WSOLA method of Verhelst and Roelands for high-quality time-scaled speech
Speech processing designates a team consisting of Prof. Lawrence Rabiner (Rutgers University and University of California, Santa Barbara), Prof. Ronald Schafer (Stanford University), Kirty Vedula and
- 2K (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
23 Jun 2015
Builds an LPC vocoder, i.e., performs LPC analysis and synthesis on a speech file
Speech processing designates a team consisting of Prof. Lawrence Rabiner (Rutgers University and University of California, Santa Barbara), Prof. Ronald Schafer (Stanford University), Kirty Vedula and
- 3.4K (All time)
- 1 (Last 30 days)
- 4.5 / 5
- Community
3 Jun 2015
Text to Speech Mandarin Demo (Can mix Chinese with English)
This uses Microsoft .NET speech synthesis, with Mandarin Female voice Hanhan. Can use this in your MATLAB programming environment directly.
command to call Microsoft .NET; you can type "help System.Speech" to get more details. You should not need any toolbox to do this. In case you could not get it running, please view the included MP4 video, see
- 48 (All time)
- 2 (Last 30 days)
- 5.0 / 5
- Community
22 Jan 2021
High quality speech spectrogram plot generation routine
- 4.5K (All time)
- 4 (Last 30 days)
- 4.3 / 5
- Community
1 Dec 2010
Speak the recognized character
The function enables you to recognize a character and speak it using the Microsoft Speech API
The function uses the morphological hit-and-miss transform to recognize characters and uses the Microsoft Speech API for text-to-speech conversion. The character size of the integers present in
- 3.8K (All time)
- 2 (Last 30 days)
- 4.0 / 5
- Community
25 Feb 2008
Speech in noise mixing, signal to noise ratio
A simple function to mix a speech signal with a desired noise at desired signal-to-noise ratios
This was developed by G Nike Gnanateja for mixing speech signals with noise at different signal-to-noise ratios. This function mixes the speech and noise signals in terms of the RMS signal to noise
- 1.7K (All time)
- 3 (Last 30 days)
- 4.6 / 5
- Community
7 Feb 2017
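Mixing at a target RMS signal-to-noise ratio, as this entry describes, comes down to one gain on the noise. The sketch below shows the arithmetic in plain Python; it is not the submission's MATLAB function, and the names `rms` and `mix_at_snr`, as well as the choice to truncate the noise to the speech length, are assumptions made for illustration.

```python
import math


def rms(signal):
    """Root-mean-square amplitude."""
    return math.sqrt(sum(x * x for x in signal) / len(signal))


def mix_at_snr(speech, noise, snr_db):
    """Scale noise so that 20*log10(rms(speech)/rms(noise)) equals snr_db, then add.

    Returns the mixture and the scaled noise. Truncating the noise to the
    speech length is an assumption of this sketch.
    """
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[: len(speech)]
    noise_rms = rms(noise)
    if noise_rms == 0:
        raise ValueError("noise is silent; cannot set an SNR")
    gain = rms(speech) / (noise_rms * 10 ** (snr_db / 20))
    scaled = [gain * x for x in noise]
    mixed = [s + v for s, v in zip(speech, scaled)]
    return mixed, scaled


# Stand-in signals (assumed): a sine as "speech", a deterministic pattern as "noise".
speech = [math.sin(2 * math.pi * 5 * t / 100) for t in range(100)]
noise = [((t * 37) % 11 - 5) / 5 for t in range(100)]
mixed, scaled_noise = mix_at_snr(speech, noise, snr_db=10.0)
achieved_snr_db = 20 * math.log10(rms(speech) / rms(scaled_noise))
```

Scaling the noise rather than the speech keeps the speech level fixed across SNR conditions; either choice gives the same ratio.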
Wiener filter for Noise Reduction and speech enhancement
Wiener Noise Suppressor based on Decision-Directed method with TSNR and HRNR algorithms.
techniques, including TSNR, introduce harmonic distortion in the enhanced speech. To overcome this problem, a method called harmonic regeneration noise reduction (HRNR) is implemented in order to refine
- 14K (All time)
- 8 (Last 30 days)
- 4.2 / 5
- Community
23 Oct 2020
The spectral subtraction method for enhancement of noisy speech.
The spectral subtraction method for enhancement of noisy speech signals, proposed by Boll (1979). The method implements spectral averaging and residual noise reduction proposed in the paper. Note that the
- 13.6K (All time)
- 6 (Last 30 days)
- 4.6 / 5
- Community
18 May 2005
A GUI for Speech Analysis using LPC
This GUI is used to analyze the speech signal in a selected region of 256 samples. All calculations assume a sampling rate of 8 kHz. First 3 formants of the selected block of samples are
- 20.8K (All time)
- 2 (Last 30 days)
- 4.3 / 5
- Community
25 Oct 2005
First Coherence-Based Dual Microphone Speech Enhancement Algorithm (IEEE TASL 2012)
Applicable to two microphone (Endfire setting) systems such as hearing aids, headsets, ...
MATLAB implementation of the following paper: Nima Yousefian, Philipos C. Loizou: A Dual-Microphone Speech Enhancement Algorithm Based on the Coherence Function. IEEE Transactions on Audio, Speech
- 1K (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
12 Nov 2013
Second Coherence-Based Dual Microphone Speech Enhancement Algorithm (IEEE TASL 2013)
Applicable to dual microphone systems such as hearing aids, headsets, ...
MATLAB implementation of the following paper: Nima Yousefian, Philipos C. Loizou: A Dual-Microphone Algorithm That Can Cope With Competing-Talker Scenarios. IEEE Transactions on Audio, Speech &
- 973 (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
12 Nov 2013
M-QAM modulation and demodulation
QAM modulation and demodulation technique, with a speech example
You can run these files via the my_speech.m file, passing a value such as my_speech(64, 'qam') in the MATLAB command window.
- 24K (All time)
- 4 (Last 30 days)
- 4.1 / 5
- Community
27 Apr 2007
Text2Speech for Matlab using unofficial Google service
Function to make Matlab speak via unofficial Google text2speech.
This is a command line demo on how to use the ActiveX VideoLAN.VLCPlugin.2 in combination with the unofficial Google text to speech (TTS) engine to generate speech from text. You need to install the
- 1.5K (All time)
- 1 (Last 30 days)
- 4.0 / 5
- Community
4 Nov 2011
speech recognition using correlation
This is a very simple and small code for speech recognition. The input voice signal is correlated with already stored voice signals. If the two voice signals nearly match, then allowing
- 9.3K (All time)
- 3 (Last 30 days)
- 3.7 / 5
- Community
17 Apr 2010
Implementation of LPC Vocoder
This MATLAB exercise builds an LPC vocoder, i.e., performs LPC analysis and synthesis on a speech file, resulting in a synthetic speech approximation to the original speech. The LPC analysis uses a
- 1.1K (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
11 Sep 2015
VUS-Voiced/Unvoiced/Silence_Training
This exercise utilizes four programs to train a Bayesian classifier and classify frames of signals.
This MATLAB exercise utilizes a set of four MATLAB programs to both train a Bayesian classifier (using a designated training set of 11 speech files embedded within a background of low level noise and
- 1.2K (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
14 Jul 2015
n-PSK modulation and demodulation
Keep all files in a folder and run any of the test my speech files, with any value of n, in power of
A demonstration of n-PSK modulation and demodulation with my own speech, for different values of n, sampling frequency, and carrier; n = 2, 4, 8, 16, 32, ...
- 8K (All time)
- 2 (Last 30 days)
- 4.0 / 5
- Community
27 Apr 2007
Text To Speech function, via Win32 Speech API (SAPI).
tts(textString) is a very simple function, which "reads" the textString string. The tts function calls the Microsoft(r) Win32 Speech API (SAPI).
- 4.5K (All time)
- 2 (Last 30 days)
- 4.9 / 5
- Community
3 May 2004
- 339 (All time)
- 1 (Last 30 days)
- 4.0 / 5
- Community
10 Mar 2014
- 877 (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
12 Jun 2003
- 99 (All time)
- 1 (Last 30 days)
- 4.0 / 5
- Community
23 Jul 2019
Phase Spectrum Compensation (PSC) framework for speech enhancement.
Basic implementation of the Phase Spectrum Compensation (PSC) [1] method for single channel speech enhancement is included, along with a demo that illustrates its usage. References: [1] A.P. Stark
- 2.9K (All time)
- 2 (Last 30 days)
- 5.0 / 5
- Community
21 Mar 2011
Code for webinar "Deep Learning for Signals and Sound"
These are the two scripts used in the webinar to train the music generation network and the denoiser network.
- 1.1K (All time)
- 4 (Last 30 days)
- 4.6 / 5
- Community
21 Dec 2023
Ephraim's MMSE STSA speech enhancement method with the decision-directed method.
- 6.5K (All time)
- 1 (Last 30 days)
- 5.0 / 5
- Community
27 Feb 2006
- 586 (All time)
- 2 (Last 30 days)
- 5.0 / 5
- Community
5 Nov 2012