Wake word detection for the browser using onnxruntime-web and OpenWakeWord models (hey_jarvis, hey_mycroft, alexa, etc.).
```bash
npm install openwakeword-wasm-browser
```

Inspired by Miro Hristov’s Deep Core Labs write-up, this package brings the same browser-only wake-word pipeline into a reusable npm module. A full React sandbox lives in this repo as well: openwakeword_wasm_react_demo.
Small browser-first wrapper around the OpenWakeWord models using onnxruntime-web. It exposes a WakeWordEngine class you can drop into a React app to listen for wake words like hey_jarvis directly in Chrome, no native layer required.
Agents should read AGENTS.md to get details and onboarding instructions.
```bash
npm install file:../openwakeword_wasm
```

or, after publishing:

```bash
npm install openwakeword-wasm-browser
```
Make sure the ONNX model files in `models/` are hosted somewhere the browser can fetch them (for CRA/Vite you can copy the folder into `public/openwakeword/models`). If you self-host the ORT wasm files, pass `ortWasmPath` (e.g. `/openwakeword/ort/`).
```jsx
import { useEffect, useMemo, useState } from 'react';
import WakeWordEngine from 'openwakeword-wasm-browser';

export default function WakeWordDemo() {
  const [detected, setDetected] = useState(null);
  const engine = useMemo(() => new WakeWordEngine({
    baseAssetUrl: '/openwakeword/models', // where you host the .onnx files
    keywords: ['hey_jarvis'], // or any of: alexa, hey_mycroft, hey_rhasspy, timer, weather
    detectionThreshold: 0.5,
    cooldownMs: 2000
  }), []);

  useEffect(() => {
    let unsub;
    engine.load().then(() => {
      unsub = engine.on('detect', ({ keyword, score }) => {
        setDetected(`${keyword} (${score.toFixed(2)})`);
      });
      engine.start(); // prompts for mic
    });
    return () => { unsub?.(); engine.stop(); };
  }, [engine]);

  return (
    <div>
      <p>Listening for hey_jarvis…</p>
      <p>Detected: {detected}</p>
    </div>
  );
}
```
```js
import WakeWordEngine from 'openwakeword-wasm-browser';

const engine = new WakeWordEngine({
baseAssetUrl: '/openwakeword/models',
ortWasmPath: '/openwakeword/ort/',
keywords: ['hey_jarvis', 'alexa'],
detectionThreshold: 0.55,
});
await engine.load();
engine.on('speech-start', () => status.textContent = 'Speech detected');
engine.on('speech-end', () => status.textContent = 'Silence');
engine.on('detect', ({ keyword }) => playTone(keyword));
await engine.start({ deviceId: preferredMicId, gain: 1.3 });
document.querySelector('#stop').addEventListener('click', () => engine.stop());
document.querySelector('#keyword').addEventListener('change', (evt) => {
engine.setActiveKeywords([evt.target.value]);
});
```

## API reference
- `await engine.load()` downloads ONNX models (mel, embedding, VAD, keyword heads) and infers keyword window sizes.
- `await engine.start({ deviceId?, gain? })` starts microphone streaming and posts 1280-sample chunks through the AudioWorklet.
- `await engine.stop()` tears down the graph, stops tracks, and clears cooldowns.
- `engine.setGain(value)` updates the GainNode while running.
- `await engine.runWav(arrayBuffer)` runs the entire pipeline offline and returns the highest score seen.
- `engine.setActiveKeywords(name[])` gates which keywords are allowed to emit `detect`.

### Events
- `ready` fires once models finish loading.
- `detect` surfaces `{ keyword, score, at }` when score > threshold, VAD hangover is active, and cooldown is clear.
- `speech-start` / `speech-end` mirror the VAD state transitions.
- `error` emits any pipeline failures (getUserMedia, onnxruntime, decoding issues).
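The three conditions gating a `detect` event can be sketched as plain logic. This is not the package's internal code; the function name and shape are hypothetical, illustrating only the rule described above (score above threshold, VAD active, cooldown elapsed):

```javascript
// Hypothetical sketch of the detect gate, not the engine's actual internals.
function shouldEmitDetect({ score, vadActive, lastDetectAt, now },
                          { threshold = 0.5, cooldownMs = 2000 } = {}) {
  // Cooldown is clear if we never detected, or enough time has passed.
  const cooldownClear = lastDetectAt === null || now - lastDetectAt >= cooldownMs;
  return score > threshold && vadActive && cooldownClear;
}
```

For example, a score of 0.8 during speech with no prior detection passes, while the same score 500 ms after a detection is suppressed by the default 2000 ms cooldown.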
Example with Vite/CRA:
```
public/
openwakeword/
models/
melspectrogram.onnx
embedding_model.onnx
silero_vad.onnx
hey_jarvis_v0.1.onnx
...
ort/
ort-wasm.wasm
ort-wasm-simd.wasm
```
Then instantiate with `baseAssetUrl: '/openwakeword/models'` and `ortWasmPath: '/openwakeword/ort'` if you host the wasm yourself. If `ortWasmPath` is omitted, onnxruntime-web uses its default CDN.
- The engine runs at 16 kHz with 80 ms frames, mirroring the reference demo in main.js.
- VAD hangover is tuned to 12 frames to keep speech open long enough for the wake word score to peak.
- Cooldown (`cooldownMs`) prevents multiple detections per utterance; lower it if you want rapid-fire triggers.
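The numbers in these notes are consistent with each other; a quick sanity check of the arithmetic (constant names here are illustrative, not exported by the package):

```javascript
// 16 kHz audio in 80 ms frames yields the 1280-sample chunks the AudioWorklet posts.
const sampleRate = 16000;   // Hz
const frameMs = 80;         // frame duration in milliseconds
const samplesPerFrame = (sampleRate * frameMs) / 1000; // 1280 samples

// A 12-frame VAD hangover keeps the speech window open for almost a second,
// long enough for the keyword score to peak after the utterance ends.
const hangoverFrames = 12;
const hangoverMs = hangoverFrames * frameMs;           // 960 ms
```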
- npm pack (or npm publish) includes src/, models/, and README.md via the files list so consumers get the engine and bundled assets.
- Ship the ONNX assets alongside the package or document the public hosting location (baseAssetUrl). The React demo copies them into public/openwakeword/models.
- Consider running `engine.runWav()` against `hey_jarvis_11-2.wav` before publishing to verify the scoring path still peaks near 1.0.