441 results

Automatic speech-to-text conversion

Automate labeling and tagging of speech recordings, assess the performance of DSP pipelines for voice and speech enhancement, run text analytics on voice recordings, and more. This entry enables you

Automatic text-to-speech synthesis

Convert text into human-like speech in a variety of voices and languages. This entry enables you to synthesize strings into sampled speech recordings available as MATLAB vectors using a single

text-to-speech

Version 1.0.0.0

by Siyi Deng

text-to-speech, speech synthesis, tts, let Matlab speak

TTS: text to speech. TTS(TXT) synthesizes speech from the string TXT and speaks it. The audio format is mono, 16-bit, 16 kHz by default. WAV = TTS(TXT) does not vocalize but outputs to the

A simple method for silence removal in speech streams

This is a simple method for silence removal and segmentation of audio streams that contain speech. The method is based on two simple audio features (signal energy and spectral centroid). As long as

Converts text to speech.

Any text is spoken. Get started ... 1. Add the text2speech folder to your Matlab path. 2. Test your new function: tts('This is a test.') Examples: Casual chat. tts('Hi - how are you?'); tts({'Hello. How

Pronounces a number (from 1 to 999 only) that the user inputs at run time, with an animation.

This program implements basic speech recognition for 6 symbols using MFCC and LPC.

Superimpose multiple semitransparent images with individual colormaps on the current axis.

The exercise tries to separate the main properties of the speech excitation function from those of the vocal tract.

The goal of this MATLAB Exercise is to try to separate out the main properties (primarily pitch and intensity) of the speech excitation function (as estimated using LPC analysis) from the properties

A lossy speech compression algorithm.

LPC is the oldest and the most basic of modern speech coders. It's a lossy scheme: playback quality isn't preserved in the process, but it can be used in low bit-rate systems.

speech signal framing

In this program, we divide the speech signal into a number of frames (about 240) and display the specified frame, selected by its number, along with the original speech signal.

lpc frame

Version 1.4.0.0

by Speech Processing

This exercise shows how the method of linear predictive coding (LPC) models a speech frame.

This MATLAB exercise computes the log magnitude of the STFT of a specified frame of speech. Then, using the same frame of speech, the exercise computes LPC log spectral matches to the speech frame

LPC Error

Version 1.4.0.0

by Speech Processing

Shows the (flat spectrum) nature of the LPC error signal for a typical speech frame (voiced speech)

Speech processing designates a team consisting of Prof. Lawrence Rabiner (Rutgers University and University of California, Santa Barbara), Prof. Ronald Schafer (Stanford University), Kirty Vedula and

Plots of Harmonic Product Spectrum (HPS) and log HPS of a running sequence of frames.

Record your own speech file to use for other exercises.

A MATLAB exercise that uses the file read command to read in an existing speech file, the record function to record a speech signal, and the file save command to save the results in a designated file

Basic program to recognize a word based on the analysis of its energy.

Pick & Place application by integrating Matlab & ROS

ROS-Industrial Consortium and advanced functions such as image recognition with deep learning, inverse kinematics, trajectory planning, and speech recognition provided by MATLAB.

Files associated with the Student Competition : Code Generation Training

This demo shows a simple deep learning application for audio data.

A set of speech files used for the speech processing exercises.

A set of speech files used for the speech processing exercises. This folder MUST be in the same folder as all the other exercises.

This exercise utilizes four programs to train a Bayesian classifier and classify frames of signals.

This MATLAB exercise utilizes a set of four MATLAB programs to both train a Bayesian classifier (using a designated training set of 11 speech files embedded within a background of low level noise and

This exercise computes the frequency response of a p-tube model of a human vocal tract.

AMDF

Version 1.4.0.0

by Speech Processing

This exercise computes the short-time average magnitude difference function (AMDF) of a speech frame

This MATLAB exercise calculates and displays the AMDF of a frame of speech from a designated speech file and implements a pitch detection algorithm based on using the AMDF on a frame-by-frame basis

This MATLAB exercise implements a phase vocoder.

This MATLAB exercise implements a phase vocoder with the capability of speed-up or slow-down of a speech or audio signal by a factor, r, which varies from r = 0.25 (slow-down by factor of 4) to r =

Matlab implementation of ITU-T G.723.1 speech coder and decoder

This package implements the ITU-T G.723.1 speech coder and decoder in Matlab. The goal of the package is to provide a well-documented and modular program that was designed to facilitate

speech processing tool

Speech analysis and parameter extraction. Short-term analysis: frames and windows. Time-domain analysis: energy, zero-crossings, statistical parameters, autocorrelation. Frequency-domain analysis: spectra

The functions in this repository let you extract the direct output of the MeCab tokenizer. They are wrapper functions for directly reading the part-of-speech classification of the MeCab morphological analyzer.

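Speech) are narrowed down to 15 types. When filtering words by part-of-speech information, it seemed a real waste not to be able to use the fine-grained part-of-speech information (69 types) that MeCab provides, so I wrote a simple function. tokenizedDocumentJP.m: a function that performs morphological analysis. It can be used in the same way as calling the regular `tokenizedDocument`. However, you cannot use the `normalizeWords` function on the output `tokenizedDocument` object to obtain base forms. Also, the `LemmaExtractor` of the morphological analysis option `mecabOptions`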

Devices are controlled with speech using the parallel port.

The purpose of speech is communication. The field of speech processing is still developing and shows tremendous potential for widespread use in the future. In this project we have

Speech emotion recognition based on frequency parameters; the emotions are happy, sad, laugh, and anger.

Speech emotion recognition based on frequency parameters; the emotions are happy, sad, laugh, and anger. For any questions, contact www.jitectechnolgies.in or WhatsApp +91 9994444414.

word recognition matlab

This is a program to develop speech-spectrum-shaped noise for audiological applications.

This is a program to generate power-spectrum-matched noise. It is primarily intended for audiologists to generate their own speech-shaped noises based on an existing speech corpus (multiple speech

Companion material for the book "Introduction to Audio Analysis, A MATLAB approach"

Implements the WSOLA method of Verhelst and Roelands for high-quality time-scaled speech.

Uses Microsoft .NET speech synthesis with the Mandarin female voice Hanhan. You can use it directly in your MATLAB programming environment.

command to call Microsoft .NET; you can type "help System.Speech" to get more details. You should not need any toolbox to do this. In case you cannot get it running, please view the included MP4 video, see

LPC Vocoder

Version 1.4.0.0

by Speech Processing

Builds an LPC vocoder, i.e., performs LPC analysis and synthesis on a speech file

A sample script that classifies abnormal sounds using MATLAB's deep learning features.

High quality speech spectrogram plot generation routine

The function recognizes a character and speaks it using the Microsoft Speech API.

The function uses the morphological hit-and-miss transform to recognize characters and uses the Microsoft Speech API for text-to-speech conversion. The character size of the integers present in

A simple function to mix a speech signal with a desired noise at desired signal-to-noise ratios.

This was developed by G Nike Gnanateja for mixing speech signals with noise at different signal-to-noise ratios. This function mixes the speech and noise signals in terms of the RMS signal to noise

Wiener Noise Suppressor based on Decision-Directed method with TSNR and HRNR algorithms.

techniques, including TSNR, introduce harmonic distortion in the enhanced speech. To overcome this problem, a method called harmonic regeneration noise reduction (HRNR) is implemented in order to refine

The spectral Subtraction Method for enhancement of noisy speech.

The spectral Subtraction Method for enhancement of noisy speech signals proposed by Boll 79. The method implements spectral averaging and residual noise reduction proposed in the paper. Note that the

A GUI for Speech Analysis using LPC

This GUI is used to analyze the speech signal at the selected region of 256 samples. All calculations assume a sampling rate of 8 kHz. The first 3 formants of the selected block of samples are

Applicable to two microphone (Endfire setting) systems such as hearing aids, headsets, ...

MATLAB implementation of the following paper: Nima Yousefian, Philipos C. Loizou: A Dual-Microphone Speech Enhancement Algorithm Based on the Coherence Function. IEEE Transactions on Audio, Speech

Applicable to dual microphone systems such as hearing aids, headsets, ...

MATLAB implementation of the following paper: Nima Yousefian, Philipos C. Loizou: A Dual-Microphone Algorithm That Can Cope With Competing-Talker Scenarios. IEEE Transactions on Audio, Speech &

Function to make Matlab speak via unofficial Google text-to-speech.

This is a command line demo on how to use the ActiveX VideoLAN.VLCPlugin.2 in combination with the unofficial Google text to speech (TTS) engine to generate speech from text. You need to install the

QAM modulation and demodulation technique, with a speech example.

You can run these files through the my_speech.m file, passing values such as my_speech(64, 'qam') in the MATLAB command window.

Deep Learning models supporting Audio Toolbox AI-powered functions for speech and audio signal processing

The Audio Toolbox Interface for SpeechBrain and Torchaudio Libraries enables the use of a collection of AI-powered speech processing functions in Audio Toolbox™ for automatic speech recognition (ASR

speech recognition using correlation

This is a very simple, small program for speech recognition. The input voice signal is correlated with already stored voice signals. If the two voice signals nearly match, then allowing

LPC_Vocoder

Version 1.0.0.0

by Lawrence Rabiner

Implementation of LPC Vocoder

This MATLAB exercise builds an LPC vocoder, i.e., performs LPC analysis and synthesis on a speech file, resulting in a synthetic speech approximation to the original speech. The LPC analysis uses a

Keep all files in a folder and run any of the test my speech files, with any value of n, in power of

A demonstration of n-PSK modulation and demodulation with my own speech, with different values of n, sampling frequency, and carrier; n = 2, 4, 8, 16, 32, ...

Text To Speech function, via Win32 Speech API (SAPI).

tts(textString) is a very simple function, which "reads" the textString string. The tts function calls the Microsoft(r) Win32 Speech API (SAPI).

SPEECH COMPARER

Version 1.0.0.0

by SHARAD VERMA

Compares two different speeches

Compares 2 different speeches and analyses them.

Text to speech

Version 1.0.1

by Rahul punk

test simple text to speech "output sound"

text to speech

nspread

Version 1.0.0.0

by Rua Haszard Morris

Reads .nsp format audio files as generated by Multispeech software.

Phase Spectrum Compensation (PSC) framework for speech enhancement.

Basic implementation of the Phase Spectrum Compensation (PSC) [1] method for single channel speech enhancement is included, along with a demo that illustrates its usage. References: [1] A.P. Stark

MMSE STSA

Version 1.0.0.0

by Esfandiar Zavarehei

Ephraim's MMSE STSA Speech Enhancement method with decision directed method.

These are the two scripts used in the webinar to train the music generation network and the denoiser network.

spectrogram

Version 1.0.0.0

by koc

shows spectrogram of the speech

shows spectrogram of the speech with colormap (jet).