Decode audio files using FFmpeg and WebAssembly
```bash
npm install @martel/audio-file-decoder
```

The following audio file formats are supported:
* MP3
* WAV
* FLAC
* AAC/M4A
* OGG
The WebAudio API exposes `decodeAudioData` as a means to access raw samples from audio files in a faster than realtime manner. However, it only supports decoding entire audio files, which can take huge amounts of memory. For example, a 10 minute audio file with a sample rate of 44100 Hz, floating point samples, and stereo channels will occupy 44100 Hz * 600 seconds * 4 bytes * 2 channels = ~212 MB of memory when decoded.

The WebCodecs proposal is planning to address this oversight (see here for more info), but until it is adopted by browsers this library can be used as a more memory-friendly alternative to WebAudio's current implementation.
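The memory estimate above follows directly from the PCM layout: sample rate × duration × bytes per sample × channel count. A small sketch of the arithmetic (the helper function is illustrative, not part of the library's API):

```typescript
// Estimate the decoded size in bytes of a PCM stream stored as 32-bit floats.
// Illustrative helper; not part of the audio-file-decoder API.
function decodedSizeBytes(sampleRate: number, seconds: number, channels: number): number {
  const BYTES_PER_FLOAT32_SAMPLE = 4;
  return sampleRate * seconds * BYTES_PER_FLOAT32_SAMPLE * channels;
}

// 10 minutes of 44100 Hz stereo audio:
const bytes = decodedSizeBytes(44100, 600, 2); // 211,680,000 bytes
console.log((bytes / 1e6).toFixed(0) + ' MB'); // prints "212 MB"
```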
Note that `decodeAudioData` will automatically resample to the sample rate of its `AudioContext`. This library will also be slower than `decodeAudioData`: Chromium's implementation also uses FFmpeg for decoding, but is able to run natively with threading and native optimizations enabled, while this library has them disabled for WebAssembly compatibility.

```bash
npm install --save audio-file-decoder
```
An example of synchronous audio file decoding in ES6:
```ts
import { getAudioDecoder } from 'audio-file-decoder';
import DecodeAudioWasm from 'audio-file-decoder/decode-audio.wasm'; // path to wasm asset

// either a File object or an ArrayBuffer representing the audio file
const fileOrArrayBuffer = ...;

getAudioDecoder(DecodeAudioWasm, fileOrArrayBuffer)
  .then(decoder => {
    const sampleRate = decoder.sampleRate;     // the sample rate of the audio file (e.g. 44100)
    const channelCount = decoder.channelCount; // the number of channels in the audio file (e.g. 2 if stereo)
    const encoding = decoder.encoding;         // the encoding of the audio file as a string (e.g. pcm_s16le)
    const duration = decoder.duration;         // the duration of the audio file in seconds (e.g. 5.43)

    // samples are returned as a Float32Array
    let samples;

    // decode entire audio file
    samples = decoder.decodeAudioData();

    // decode from 5.5 seconds to the end of the file
    samples = decoder.decodeAudioData(5.5, -1);

    // decode from 30 seconds for a duration of 60 seconds
    samples = decoder.decodeAudioData(30, 60);

    // decode with options
    const options: DecodeAudioOptions = {
      multiChannel: true,
    };
    samples = decoder.decodeAudioData(0, -1, options);

    // ALWAYS dispose once finished to free resources
    decoder.dispose();
  });
```
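Because `decodeAudioData` accepts a start offset and a duration, a long file can be decoded in fixed-size chunks instead of all at once, keeping peak memory bounded. A minimal sketch under that assumption (the chunking helper is illustrative, not part of the library):

```typescript
// Split [0, duration) into consecutive [start, length] pairs of at most
// chunkSeconds each. Illustrative helper; not part of the audio-file-decoder API.
function chunkRanges(duration: number, chunkSeconds: number): Array<[number, number]> {
  const ranges: Array<[number, number]> = [];
  for (let start = 0; start < duration; start += chunkSeconds) {
    ranges.push([start, Math.min(chunkSeconds, duration - start)]);
  }
  return ranges;
}

// Usage with a decoder obtained from getAudioDecoder (sketch):
// for (const [start, length] of chunkRanges(decoder.duration, 60)) {
//   const samples = decoder.decodeAudioData(start, length);
//   processSamples(samples); // e.g. draw a waveform segment, then let it be GC'd
// }
// decoder.dispose();
```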
An example of asynchronous audio file decoding in ES6:
```ts
import { getAudioDecoderWorker } from 'audio-file-decoder';
import DecodeAudioWasm from 'audio-file-decoder/decode-audio.wasm'; // path to wasm asset

// either a File object or an ArrayBuffer representing the audio file
const fileOrArrayBuffer = ...;

let audioDecoder;

getAudioDecoderWorker(DecodeAudioWasm, fileOrArrayBuffer)
  .then(decoder => {
    const sampleRate = decoder.sampleRate;     // the sample rate of the audio file (e.g. 44100)
    const channelCount = decoder.channelCount; // the number of channels in the audio file (e.g. 2 if stereo)
    const encoding = decoder.encoding;         // the encoding of the audio file as a string (e.g. pcm_s16le)
    const duration = decoder.duration;         // the duration of the audio file in seconds (e.g. 5.43)

    audioDecoder = decoder;

    const options: DecodeAudioOptions = {
      multiChannel: false,
    };

    // decode from 15 seconds for a duration of 45 seconds, with options
    return decoder.getAudioData(15, 45, options);
  })
  .then(samples => {
    // samples are returned as a Float32Array
    console.log(samples);

    // ALWAYS dispose once finished to free resources
    audioDecoder.dispose();
  });
```
You can pass additional options when decoding audio data. Currently supported options are listed below:
```ts
interface DecodeAudioOptions {
  // whether to decode multiple channels. defaults to false.
  // if set to true, the resulting array will contain samples interleaved from each channel.
  // - using the channel count, samples can be accessed using samples[sample * channelCount + channel]
  // if set to false, the resulting array will contain downmixed samples averaged from each channel.
  // - samples can be accessed using samples[sample]
  multiChannel?: boolean;
}
```
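With `multiChannel: true`, the indexing rule above can be used to split the interleaved `Float32Array` back into one planar array per channel, e.g. before copying channels into a WebAudio `AudioBuffer`. A sketch (the helper name is illustrative, not part of the library):

```typescript
// Split an interleaved Float32Array (as returned with multiChannel: true)
// into one planar Float32Array per channel. Assumes samples.length is a
// multiple of channelCount. Illustrative helper; not part of the library API.
function deinterleave(samples: Float32Array, channelCount: number): Float32Array[] {
  const frameCount = samples.length / channelCount;
  const channels: Float32Array[] = [];
  for (let channel = 0; channel < channelCount; channel++) {
    const planar = new Float32Array(frameCount);
    for (let frame = 0; frame < frameCount; frame++) {
      // samples[frame * channelCount + channel] is the rule from the docs above
      planar[frame] = samples[frame * channelCount + channel];
    }
    channels.push(planar);
  }
  return channels;
}

// Two stereo frames [L0, R0, L1, R1] -> left [L0, L1], right [R0, R1]
const [left, right] = deinterleave(new Float32Array([0.5, -0.5, 0.25, -0.25]), 2);
```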
The `getAudioDecoder` and `getAudioDecoderWorker` factory functions expect either a relative path (from your app's origin) to the wasm file provided by the library, or an inlined version of that wasm file. You'll need to include this wasm file as an asset in your application, either by using a plugin/loader if you use a module bundler (e.g. file-loader for webpack) or by copying the file over in your build process.

If you are using a module bundler with the appropriate plugins/loaders, you can simply import the required wasm asset like below:
```ts
import { getAudioDecoder, getAudioDecoderWorker } from 'audio-file-decoder';
import DecodeAudioWasm from 'audio-file-decoder/decode-audio.wasm';

// passing the path or inlined wasm to getAudioDecoder
getAudioDecoder(DecodeAudioWasm, myAudioFile);

// passing the path or inlined wasm to getAudioDecoderWorker
getAudioDecoderWorker(DecodeAudioWasm, myAudioFile);
```

If you aren't using a module bundler, then you need to make sure your build process copies the asset over. The wasm file is located at:
```bash
/node_modules/audio-file-decoder/decode-audio.wasm
```

For example, a typical application using this library should include it as an asset like in the example file structure below:
```bash
app/
  dist/
    index.html
    index.js
    decode-audio.wasm
```

Make sure to then manually pass in the correct relative path (again, from your app's origin) when using `getAudioDecoder` or `getAudioDecoderWorker`.

## Building on M1 Macs
Before you begin, ensure you have Homebrew installed on your Mac. If not, you can install it by following the instructions at brew.sh.
Once Homebrew is installed, run the following commands to install the necessary build tools:
```bash
brew update
brew install cmake autoconf automake libtool pkg-config
```
If you're using an M1 Mac, you may need to take a few additional steps to get the project building correctly:
1. Ensure you have the latest version of Emscripten:
```bash
cd ~/code/audio-file-decoder/emsdk
./emsdk install latest
./emsdk activate latest
```

2. If sourcing `emsdk_env.sh` doesn't work for you, you can set up the Emscripten environment manually. From the emsdk directory, run the following commands:
```bash
export EMSDK="$PWD"
export EM_CONFIG="$HOME/.emscripten"
export EMSDK_NODE="$EMSDK/node/$(node --version)_64bit/bin/node"
export PATH="$EMSDK:$EMSDK/upstream/emscripten:$EMSDK_NODE:$PATH"
```
After setting these environment variables, generate the Emscripten configuration file:
```bash
emcc --generate-config
```

3. Set the `EM_LLVM_ROOT` environment variable:
```bash
export EM_LLVM_ROOT="$HOME/code/audio-file-decoder/emsdk/upstream/bin"
```

4. Set `BINARYEN_ROOT` in the Emscripten configuration. Edit the file `~/.emscripten` and add:
```python
BINARYEN_ROOT = '/Users/your_username/code/audio-file-decoder/emsdk/upstream'
```

Replace `your_username` with your actual username.

5. The Makefile has been updated to use ECMAScript 2021 for the Closure Compiler. If you're using an older version of the project, you may need to update the Makefile manually. In the `COMMON_CCFLAG` variable, add:
```makefile
--closure-args="--language_in=ECMASCRIPT_2021" \
```
And in the worker target rule, update to:
```makefile
EMCC_CLOSURE_ARGS="--language_in=ECMASCRIPT_2021" $(CC) $(CCFLAG_WORKER) -o $@ $? $(LDFLAG)
```

After making these changes, you should be able to build the project successfully on your M1 Mac.

Note: You may want to add the environment variable exports to your shell configuration file (e.g. `~/.zshrc` or `~/.bash_profile`) for persistence across terminal sessions.

## Building
The build steps below have been tested on Ubuntu 20.04.1 LTS. First clone the repo, then navigate to the repo directory and run the following commands:
```bash
# install necessary build tools
sudo apt-get update -qq
sudo apt-get install -y autoconf automake build-essential cmake git pkg-config wget libtool

# grab the emscripten sdk which is needed to compile ffmpeg
# built with emsdk 3.0.0 (upgrade at your own risk!)
git clone https://github.com/emscripten-core/emsdk.git
./emsdk/emsdk install 3.0.0
./emsdk/emsdk activate 3.0.0

# set emscripten environment variables
# this needs to be invoked when you start a new terminal
source ./emsdk/emsdk_env.sh

# install npm deps, sync/download ffmpeg + deps, then build ffmpeg
# will only need to be run once unless you plan on making changes to how ffmpeg/dependencies are compiled
npm install && npm run sync && npm run build-deps

# build the wasm module and the library
# basic workflow when making changes to the wasm module/js library
npm run build-wasm && npm run build
```
`Commands for the WebAssembly module, which can be useful if modifying or extending the C++ wrapper around FFmpeg:
```bash
# build the WebAssembly module - output is located at src/wasm
npm run build-wasm

# removes the wasm output
npm run clean-wasm
```
`Commands for FFmpeg and dependencies, which can be useful if modifying the compilation of FFmpeg and its dependencies:
```bash
# downloads FFmpeg and its dependencies - output is located at deps/src
npm run sync

# removes FFmpeg and its dependencies
npm run unsync

# builds FFmpeg and its dependencies - output is located at deps/dist/ffmpeg
npm run build-deps

# cleans the FFmpeg dist output
npm run clean-deps
```