sound-parameters-extractor

# Sound parameters extractor

![XO code style](https://github.com/sindresorhus/xo)

This package helps you to get the acoustics parameters for a given sound
and provides some tools to add others parameters.

Feel free to contribute or discuss my choices.

The MFCC code comes from : vails-systems implementation.
The package was broken for me so I decided to fix it and provide some new tools.

* Installation
* Usage
* Basic usage
* Advanced usage
* Framer
* MFCC
* Parameters
* Zero Crossing Rate
* Spectral roll off point
* Spectral centroid
* Deltas
* FFT Modulus
* Remarkable energy rate
* Parameters from file
* Array to raw

Installation

npm install --save sound-parameters-extractor

Usage

$3

Read the wav file and then write the MFCC in a binary format
(usable by Alize)

``javascript const { getParamsFromFile, arrayToRaw } = require('sound-parameters-extractor');

const config = { fftSize: 32, bankCount: 24, lowFrequency: 1, highFrequency: 8000, // samplerate/2 here sampleRate: 16000 };

getParamsFromFile('sound.wav', config, 16) .then(params => { console.log(params); arrayToRaw(params.mfcc, 'sound.raw'); });`

`$3`


#### Framer
Divides the signal in frames, the end of the signal will be filled with 0.

`javascript const {framer} = require('sound-parameters-extractor');

const windowSize = 4; const overlap = '50%';

const signal = new Array(64).fill(0).map((val, index) => index); const framedSignal = framer(signal, windowSize, overlap); console.log(framedSignal); //[[0, 1, 2, 3], [2, 3, 4, 5], [4, 5, 6, 7], ...]`

#### MFCC Computes the MFCC for a given signal

`javascript const fft = require('fft-js'); // is dependency const {framer, mfcc} = require('sound-parameters-extractor');

const config = { fftSize: 32, bankCount: 24, lowFrequency: 1, highFrequency: 8000, // samplerate/2 here sampleRate: 16000 }; const windowSize = config.fftSize * 2; const overlap = '50%'; const mfccSize = 12;

const signal = new Array(1024).fill(0).map((val, index) => index); const framedSignal = framer(signal, windowSize, overlap);

//mfccSize is optionnal default 12 const mfccMatrix = mfcc.construct(config, mfccSize); const mfccSignal = framedSignal.map(window => { const phasors = fft.fft(window); return mfccMatrix(fft.util.fftMag(phasors)); });

console.log(mfccSignal); // mfccSignal contains the mel-frenquencies`

#### Parameters ##### Zero Crossing Rate Computes the number of times the signal cross 0. Must be computed on the signal. ######zeroCrossingRate(window)Formal application of the formula. ######zeroCrossingRateClipping(window, threshold)Allows you to use a threshold for better noise resistance. This method gives the same result but has better performance than the formal one.

##### Spectral roll off point ######spectralRollOffPoint(frame, sampleRate, cutoff, hz = false)Computes the spectral roll-off point on a given frame. Is computed on the modulus of the fft (don't forget to delete the first half of the FFT). If hz is true the return will be in hertz, if not it's the index in the vector.

##### Spectral centroid ######spectralCentroid(frame)Computes the spectral centroid on a given frame. It's computed on the modulus of the fft (don't forget to delete the first half of the FFT).

###### spectralCentroidSRF(frame, sampleRate)This method uses the spectral roll off point with a cutoff of 50%, this is the equivalent of the spectral centroid, currently the spectral centroid method have some problems with ALIZE

##### Deltas ######deltaFrameAllSignal(acousticVectors, overlap)UsesdeltaFrameto compute the derivative, used for MFCC. Implementation of derivative formula from Practical Cryptography Use it on the deltaParameters to have the delta delta.

###### deltaCustomAllSignal(acousticVectors) and deltaDeltaCustomAllSignal(acousticVectors)Use a Taylor decomposition to estimate the first and second derivative. The delta delta are computed on the acoustic vector (and not the deltas) to minimize the approximation.

##### FFT Modulus ######modulusFFT(frame, removeHalf)Computes the modulus of the FFT for a given frame. You may want to delete the first half of the FFT before computing its modulus. IfremoveHalf is true, the second half of the fft will be removed before computing the fft.

##### Remarkable energy rate ######remarkableEnergyRate(arrayDecoded, framedSound)Computed the RER on the signal

#### Parameters from file This is a simple wrapper from file reading to parameters, you can have a look at it to see how to get sound parameters. Please have a look at Basic usage on how to use it. This method returns an object containing : * The sound extracted by node-wav (key:arrayDecoded) ; * The framed sound (key:framedSound) ; * MFCC (key :mfcc) ; * FFT (key :fft) ; * Zero crossing rate (key :zcr) ; * Spectral Centroid (key :sc) ; * Spectral Centroid computed via spectral roll of method (key :sc2) ; * Spectral Roll-Off Point (key :srf) ; * Remarkable energy rate (key :rer). #### Array to Raw ######arrayToRaw(array, outputName, outputPath = '')Write the given vectors to a binary file (RAW) this can be used by Alize (e.g. i-vectors).arrayis two dimensionaloutputName is the name and extension of the file (eg : 'file.raw')outputPath is optional if not provided file will be write in process.cwd() if provided the directories will be created if they are not existing (eg : ./not/existing/yet`)

# Sound parameters extractor

![XO code style](https://github.com/sindresorhus/xo)

This package helps you to get the acoustics parameters for a given sound
and provides some tools to add others parameters.

Feel free to contribute or discuss my choices.

The MFCC code comes from : vails-systems implementation.
The package was broken for me so I decided to fix it and provide some new tools.

Installation

npm install --save sound-parameters-extractor

Usage

$3

Read the wav file and then write the MFCC in a binary format
(usable by Alize)

``javascript const { getParamsFromFile, arrayToRaw } = require('sound-parameters-extractor');

const config = { fftSize: 32, bankCount: 24, lowFrequency: 1, highFrequency: 8000, // samplerate/2 here sampleRate: 16000 };

getParamsFromFile('sound.wav', config, 16) .then(params => { console.log(params); arrayToRaw(params.mfcc, 'sound.raw'); });`

`$3`


#### Framer
Divides the signal in frames, the end of the signal will be filled with 0.

`javascript const {framer} = require('sound-parameters-extractor');

const windowSize = 4; const overlap = '50%';

#### MFCC Computes the MFCC for a given signal

`javascript const fft = require('fft-js'); // is dependency const {framer, mfcc} = require('sound-parameters-extractor');

const signal = new Array(1024).fill(0).map((val, index) => index); const framedSignal = framer(signal, windowSize, overlap);

console.log(mfccSignal); // mfccSignal contains the mel-frenquencies`

##### Spectral centroid ######spectralCentroid(frame)Computes the spectral centroid on a given frame. It's computed on the modulus of the fft (don't forget to delete the first half of the FFT).

##### Remarkable energy rate ######remarkableEnergyRate(arrayDecoded, framedSound)Computed the RER on the signal

sound-parameters-extractor

Contents

Installation

Usage

$3

`$3`

sound-parameters-extractor

Contents

Installation

Usage

$3

`$3`