mfcc Questions

2

Last month, a user called @jojek gave me the following advice in a comment: I can bet that given enough data, CNN on Mel energies will outperform MFCCs. You should try it. It makes more sense to...

3

Solved

I'm trying to extract MFCC features from audio (.wav file) and I have tried python_speech_features and librosa, but they are giving completely different results: audio, sr = librosa.load(file, s...
Charleton asked 2/3, 2020 at 15:55

5

Here is my code so far for extracting MFCC features from an audio file (.WAV): from python_speech_features import mfcc import scipy.io.wavfile as wav (rate,sig) = wav.read("AudioFile.wav")...
Lisbethlisbon asked 19/4, 2017 at 21:15

3

I'm using the librosa library to convert music segments into mel-spectrograms to use as inputs for my neural network, as shown in the docs here. How is this different from MFCCs, if at all? Are th...
Funk asked 25/12, 2018 at 20:22
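
For readers comparing the two representations: MFCCs are conventionally obtained by taking the logarithm of the mel-band energies and applying a discrete cosine transform (DCT-II), keeping only the first few coefficients. The sketch below is a minimal pure-Python illustration of that relationship for a single frame. The function name `mfcc_from_mel_energies` and the input values are hypothetical, and real libraries apply their own scaling, normalisation and liftering, so the numbers will not match any particular library's output.

```python
import math

def dct_ii(values):
    """Unnormalised DCT-II of a sequence of floats."""
    n = len(values)
    return [
        sum(v * math.cos(math.pi * k * (2 * i + 1) / (2 * n))
            for i, v in enumerate(values))
        for k in range(n)
    ]

def mfcc_from_mel_energies(mel_energies, n_mfcc=13, floor=1e-10):
    """Log-compress one frame of mel-band energies, then apply a DCT.

    `floor` (an assumption of this sketch) avoids log(0) for empty bands.
    """
    log_mel = [math.log(max(e, floor)) for e in mel_energies]
    return dct_ii(log_mel)[:n_mfcc]

# Hypothetical frame of 26 mel-band energies (illustrative values only).
frame = [1.0 + 0.5 * math.sin(0.3 * i) for i in range(26)]
coeffs = mfcc_from_mel_energies(frame)
print(len(coeffs))
```

In other words, a mel-spectrogram keeps every mel band per frame, while MFCCs are a compressed, decorrelated summary of the same log-mel values.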

4

Solved

I'm a bit stuck understanding MFCCs. From what I have read the mel filter banks should be a series of triangles that get wider and their peaks are at the same place. Like this... However when I...
Ustkamenogorsk asked 22/10, 2016 at 21:7
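
As a point of reference for the triangle picture described above, here is a minimal pure-Python sketch of a mel filter bank. It assumes the common 2595 * log10(1 + f / 700) mel formula and unit-height triangles evaluated at the FFT bin frequencies; the function name `mel_filterbank` and the parameter values are hypothetical. Some implementations additionally scale each triangle by its bandwidth, in which case the peaks get lower as the filters get wider.

```python
import math

def hz_to_mel(hz):
    return 2595.0 * math.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Return (bank, edges): n_filters triangles over n_fft // 2 + 1 FFT bins."""
    n_bins = n_fft // 2 + 1
    low, high = hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0)
    # n_filters + 2 edge points, equally spaced on the mel scale.
    mel_points = [low + (high - low) * i / (n_filters + 1)
                  for i in range(n_filters + 2)]
    edges = [mel_to_hz(m) for m in mel_points]
    bin_freqs = [k * sample_rate / n_fft for k in range(n_bins)]
    bank = []
    for m in range(1, n_filters + 1):
        left, centre, right = edges[m - 1], edges[m], edges[m + 1]
        row = []
        for f in bin_freqs:
            rising = (f - left) / (centre - left)
            falling = (right - f) / (right - centre)
            # Triangle: the smaller of the two slopes, clipped at zero.
            row.append(max(0.0, min(rising, falling)))
        bank.append(row)
    return bank, edges

bank, edges = mel_filterbank(n_filters=10, n_fft=512, sample_rate=16000)
widths = [edges[i + 2] - edges[i] for i in range(10)]
print([round(w) for w in widths])  # widths in Hz grow with filter index
```

Because the edge points are evenly spaced in mel but not in Hz, each triangle is wider than the one before it when plotted on a linear frequency axis.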

1

As you might notice, I am really new to Python and sound processing. I (hopefully) extracted FFT data from a wave file using Python and the logfbank and mfcc functions. (The logfbank seems to give t...
Hiawatha asked 8/7, 2019 at 9:20

1

Solved

I'm trying to extract MFCC features from an audio file with 13 MFCCs with the below code: import librosa as l x, sr = l.load('/home/user/Data/Audio/Tracks/Dev/FS_P01_dev_001.wav', sr = 8000) n_ff...
Termagant asked 8/7, 2019 at 7:6

2

Solved

I am extracting the MFCC features using two different libraries: the python_speech_features lib and the BOB lib. However, the output of the two is different, and even the shapes are not the same. Is ...
Raphael asked 31/8, 2018 at 9:14

1

Solved

from librosa.feature import mfcc from librosa.core import load def extract_mfcc(sound): data, frame = load(sound) return mfcc(data, frame) mfcc = extract_mfcc("sound.wav") I would like to ge...

1

Solved

I came across this nice tutorial https://github.com/manashmndl/DeadSimpleSpeechRecognizer where the data is trained based on samples separated by folders and all MFCCs are calculated at once. I am...
Gonsalez asked 30/1, 2018 at 4:17

1

Solved

Using librosa, I created mfcc for my audio file as follows: import librosa y, sr = librosa.load('myfile.wav') print y print sr mfcc=librosa.feature.mfcc(y=y, sr=sr) I also have a text file that ...
Generative asked 22/1, 2018 at 19:7

1

Solved

My main goal is to feed MFCC features to an ANN. However, I am stuck at the data preprocessing step, and my question has two parts. BACKGROUND: I have an audio file. I have a txt file that has the ...
Bracelet asked 19/1, 2018 at 3:2

1

Solved

I am trying to extract MFCC vectors from the audio signal as input into a recurrent neural network. However, I am having trouble figuring out how to obtain the raw audio frames in Swift using Core ...
Augusto asked 1/12, 2017 at 22:38

2

Solved

I searched everywhere and I couldn't figure out how to extract MFCC features using TarsosDSP on Android. I know how to get the FFT out of a file. Any help?
Tantra asked 5/11, 2016 at 8:6

1

I'm looking for methods that work in practice for determining some kind of acoustical similarity between different songs. Most of the methods I've seen so far (MFCC etc.) actually seem to aim at f...
Nineteenth asked 19/1, 2012 at 15:42

2

Is there any implementation of MFCC available in C/C++? Any source code or libraries? I've already found http://code.google.com/p/libmfcc/ which seems to be good.
Quinquepartite asked 29/10, 2012 at 15:21

1

Solved

Using the librosa library, I generated the MFCC features of a 1319-second audio file as a 20 x 56829 matrix. The 20 here represents the number of MFCC features (which I can adjust manually). But I don'...
Mangle asked 22/6, 2016 at 8:51
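
The second dimension of such a matrix is the number of analysis frames. As a rough check, and assuming librosa's defaults of a 22050 Hz sample rate on load and a hop length of 512 samples with centred frames (the excerpt does not show the actual call, so these settings are assumptions), the frame count is about 1 + n_samples // hop_length. The helper name `expected_frames` is hypothetical.

```python
def expected_frames(duration_s, sample_rate=22050, hop_length=512):
    """Approximate frame count for centred framing: 1 + n_samples // hop_length."""
    n_samples = int(duration_s * sample_rate)
    return 1 + n_samples // hop_length

print(expected_frames(1319))     # 56805
print(expected_frames(1319.55))  # 56829
```

Under those assumptions, 56829 frames corresponds to a file of roughly 1319.55 seconds, so each column covers one hop of about 23 ms rather than one second of audio.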

1

Some background first: I want to plot the Mel-Frequency Cepstral Coefficients of various songs and compare them. I calculate MFCCs throughout a song and then average them to get one array of 13 c...
Allveta asked 13/1, 2015 at 19:53

1

Solved

For my final year project I am trying to identify dog/bark/bird sounds in real time (by recording sound clips). I am using MFCC as the audio features. Initially I have extracted altogether 12 MFCC vec...

1

Solved

I'm trying to do binary LSTM classification using theano. I have gone through the example code however I want to build my own. I have a small set of "Hello" & "Goodbye" recordings that I am u...
Glovsky asked 7/1, 2016 at 17:47

2

I have extracted 13 MFCC features of two utterances. The feature set for the first utterance is of size 11*13 and the other is 18*13. So how can I compare the two feature sets to find the similarity between these tw...
Thorr asked 20/9, 2014 at 14:16
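
One common way to compare feature sequences of different lengths (here 11 and 18 frames of 13 coefficients each) is dynamic time warping (DTW), which aligns the frames before accumulating a distance. The sketch below is a minimal pure-Python version using Euclidean distance between frames. The function name `dtw_distance` and the toy data are hypothetical, and this is one option among several rather than the answer given to the question.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def dtw_distance(seq_a, seq_b):
    """Accumulated DTW cost between two sequences of feature vectors."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    # cost[i][j]: best cost of aligning the first i frames of seq_a
    # with the first j frames of seq_b.
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = euclidean(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # repeat a frame of seq_b
                                 cost[i][j - 1],      # repeat a frame of seq_a
                                 cost[i - 1][j - 1])  # advance both
    return cost[n][m]

# Toy stand-ins for the 11x13 and 18x13 MFCC matrices (illustrative values).
utt_a = [[math.sin(0.5 * t + c) for c in range(13)] for t in range(11)]
utt_b = [[math.sin(0.3 * t + c) for c in range(13)] for t in range(18)]
print(dtw_distance(utt_a, utt_b))
```

A smaller cost means the two utterances are more alike under this alignment; a sequence compared with a time-stretched copy of itself scores zero.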

1

I am trying to learn HMM GMM implementation and created a simple model to detect certain sounds (animal calls etc.). I am trying to train an HMM (Hidden Markov Model) network with GMM (Gaussian...
Barrington asked 31/10, 2014 at 15:45

1

I'm currently trying to create and train a neural network to perform simple speech classification using MFCCs. At the moment, I'm using 26 coefficients for each sample, and a total of 5 different ...
Taste asked 8/2, 2014 at 10:53

2

I am implementing the MFCC algorithm in Java. There is sample code in Matlab here: http://www.ee.columbia.edu/~dpwe/muscontent/practical/mfcc.m. However, I have some problems with mel filter banking...
Mazonson asked 20/5, 2011 at 20:55

1

I am implementing the MFCC algorithm in Java. There is sample code for triangular filters and MFCC in Java. Here is the link: MFCC Java. However, I should follow this code written in Matlab: MFCC Mat...
Serpasil asked 2/6, 2011 at 9:52

© 2022 - 2024 — McMap. All rights reserved.