Demucs music stem separator rewritten in Rust – runs in the browser

Q: What can Demucs music stem separator rewritten in Rust – runs in the browser do?

browser-native audio stem separation with onnx inference, real-time audio buffer streaming and windowing, onnx model weight loading and caching, multi-stem parallel inference orchestration, audio format conversion and resampling, stem output export to audio files, progress reporting and cancellation for long-running inference, error handling and graceful degradation for inference failures

FrameworkFree

Hi HN! I reimplemented HTDemucs v4 (Meta's music source separation model) in Rust, using Burn. It splits any song into individual stems — drums, bass, vocals, guitar, piano — with no Python runtime or server involved.Try it now: https://nikhilunni.github.io/demucs-rs/ (needs

Open Source

/ 100

8 capabilities

Capabilities8 decomposed

browser-native audio stem separation with onnx inference

Medium confidence

Executes the Demucs neural network model (vocals, drums, bass, other) directly in the browser using ONNX Runtime WebAssembly, eliminating server-side processing. The Rust codebase compiles to WebAssembly via wasm-bindgen, exposing a JavaScript API that loads pre-trained model weights and runs inference on client-side audio buffers without network latency or privacy concerns.

Solves for

separate music into individual stems (vocals, drums, bass, other) without uploading to a serverbuild a web app that processes audio locally without backend infrastructureintegrate stem separation into a browser-based music production workflowreduce latency and privacy risks by keeping audio processing on-device

Best for

web developers building music production tools

teams building privacy-first audio applications

indie music producers wanting client-side stem separation

Requires

modern browser with WebAssembly support (Chrome 57+, Firefox 52+, Safari 11+)

sufficient client RAM (minimum 2GB for typical songs)

ONNX model files pre-converted and hosted (Demucs v3 or v4 models)

Limitations

ONNX Runtime WebAssembly has higher memory overhead than native inference; typical 3-5 minute songs require 2-4GB RAM during processing

inference speed depends on client CPU/GPU; no GPU acceleration in most browsers (WebGPU still experimental)

model weights must be downloaded to client (typically 200-500MB for full Demucs model); no streaming model loading

What makes it unique

Rewrite of Demucs (originally Python/PyTorch) into Rust compiled to WebAssembly, enabling full stem separation inference in browsers without server dependency. Uses ONNX Runtime WebAssembly for cross-platform model execution, avoiding the need to bundle PyTorch or maintain Python backend infrastructure.

vs alternatives

Faster and more private than cloud-based stem separation services (Splitter.ai, Lalal.ai) because processing happens locally; more accessible than native Demucs because no Python/GPU setup required; smaller bundle than full PyTorch-to-WASM ports because ONNX Runtime is optimized for inference-only workloads.

real-time audio buffer streaming and windowing

Medium confidence

Handles chunked audio input processing by managing sliding windows of audio frames, buffering partial chunks, and coordinating inference timing to avoid gaps or overlaps in stem output. The Rust implementation uses ring buffers or deque structures to queue incoming audio data and emit inference-ready chunks at the model's required sample rate and frame size, with overlap-add reconstruction for seamless stem reconstruction.

Solves for

process audio in real-time chunks rather than requiring the entire file upfrontstream audio from a file or microphone input into the stem separatoravoid memory spikes by processing audio in fixed-size windowsreconstruct seamless output stems from overlapping inference windows

Best for

developers building real-time audio processing pipelines

applications that need to process long audio files without loading entirely into memory

live music production tools that process microphone input

Requires

audio source providing samples at consistent sample rate

buffer size matching model's expected input dimensions (typically 16384 or 32768 samples)

resampler library if input sample rate differs from model's expected rate

Limitations

overlap-add reconstruction introduces latency proportional to window size; typical 2-5 second delay before first stem output

requires careful synchronization between input buffering and inference scheduling; race conditions can cause audio artifacts

no built-in handling for variable sample rates; input must be resampled to model's expected rate (typically 44.1kHz or 48kHz) before buffering

What makes it unique

Implements overlap-add windowing in Rust with zero-copy buffer management, allowing seamless reconstruction of stems from overlapping inference windows without intermediate allocations. Uses WASM memory views to avoid copying audio data between JavaScript and Rust boundaries.

vs alternatives

More memory-efficient than loading entire audio files before processing because windowing processes fixed-size chunks; lower latency than naive chunking because overlap-add prevents discontinuities at chunk boundaries.

onnx model weight loading and caching

Medium confidence

Loads pre-trained Demucs model weights from ONNX format files and caches them in browser memory or IndexedDB to avoid re-downloading on subsequent uses. The implementation handles model initialization, weight tensor mapping to the inference graph, and optional persistent storage using browser APIs, with fallback to re-download if cache is unavailable.

Solves for

avoid re-downloading large model files (200-500MB) on every page loadpersist model weights across browser sessions using IndexedDBinitialize the inference engine with pre-trained weights on first loadsupport multiple model variants (Demucs v3, v4, different architectures) with separate caches

Best for

web apps where users return multiple times and want fast startup

applications with bandwidth constraints or slow network connections

teams building music production tools with offline-first requirements

Requires

ONNX model files hosted on CDN or same-origin server

browser support for IndexedDB (all modern browsers)

sufficient disk quota in browser storage (varies by device)

Limitations

IndexedDB storage quota varies by browser (typically 50MB-1GB); large models may exceed quota on mobile devices

no built-in versioning; updating model weights requires cache invalidation logic in application code

model loading blocks inference; no lazy loading of model layers

What makes it unique

Implements dual-layer caching (in-memory + IndexedDB) for ONNX models in Rust/WASM, with automatic fallback to re-download if cache is stale or unavailable. Uses WASM memory views to avoid copying model weights between storage and inference engine.

vs alternatives

Faster repeat loads than cloud-based services because models are cached locally; more efficient than naive re-download on every page load because IndexedDB persists across sessions; avoids server-side model serving costs.

multi-stem parallel inference orchestration

Medium confidence

Coordinates inference across multiple output stems (vocals, drums, bass, other) by running the Demucs model once per stem or using a multi-output model variant that produces all stems in a single forward pass. The Rust implementation manages tensor allocation, inference scheduling, and output collection to ensure all stems are computed and synchronized before returning results to the caller.

Solves for

extract all four stems (vocals, drums, bass, other) from a single audio inputrun inference efficiently by batching stem computation or using multi-output modelsensure all stems are synchronized and have matching sample countsreturn stems in a structured format (e.g., object with vocal, drum, bass, other keys)

Best for

music production applications that need all stems simultaneously

batch processing pipelines that separate multiple songs

applications where stem synchronization is critical (e.g., mixing tools)

Requires

ONNX model supporting multi-stem output or separate models per stem

sufficient RAM to hold input audio + all output stems simultaneously

inference engine capable of running model multiple times or supporting batched inference

Limitations

inference time scales linearly with number of stems if using single-output model; no parallelization across stems in browser

memory usage multiplies with number of stems; storing all four stems in memory simultaneously requires 4x the audio buffer size

no built-in stem selection; must compute all stems even if only one is needed

What makes it unique

Orchestrates inference across multiple stems using ONNX Runtime's graph execution, potentially leveraging multi-output model variants to compute all stems in a single forward pass rather than sequential inference. Manages tensor lifecycle and memory to minimize allocations across stem computations.

vs alternatives

More efficient than running separate models per stem because a single multi-output model reduces redundant computation; faster than sequential single-stem inference because overlapping computation can be parallelized on multi-core CPUs.

audio format conversion and resampling

Medium confidence

Converts input audio from various formats (MP3, WAV, WebM, OGG) to raw PCM buffers at the model's expected sample rate, handling codec decoding and sample rate conversion transparently. The implementation uses browser Web Audio API for decoding and Rust-based resampling (e.g., sinc interpolation or linear interpolation) to match the model's input requirements without requiring external libraries.

Solves for

accept audio files in multiple formats without requiring users to pre-convertautomatically resample audio to match the model's expected sample rate (e.g., 44.1kHz)handle mono and stereo audio, converting to the model's expected channel countnormalize audio levels to prevent clipping or silent output

Best for

web apps that accept user-uploaded audio files in arbitrary formats

music production tools that need to handle diverse audio sources

applications where audio preprocessing should be transparent to users

Requires

Web Audio API support in browser (all modern browsers)

audio codec support for input format (browser-dependent; MP3 and WAV widely supported)

resampling library or algorithm (can be implemented in Rust or delegated to Web Audio API)

Limitations

resampling quality depends on algorithm; linear interpolation introduces aliasing artifacts; sinc interpolation is slower but higher quality

Web Audio API decoding is asynchronous and blocks the main thread; large files may cause UI freezing

no support for lossless formats like FLAC in all browsers; MP3 decoding may introduce artifacts

What makes it unique

Implements resampling in Rust/WASM to avoid JavaScript overhead and enable high-quality sinc interpolation without external dependencies. Uses Web Audio API for codec decoding (browser-native, no transcoding overhead) and delegates resampling to Rust for performance and quality control.

vs alternatives

More efficient than JavaScript-based resampling libraries because Rust/WASM is faster; avoids server-side transcoding because Web Audio API handles decoding; supports more formats than naive implementations because it leverages browser codec support.

stem output export to audio files

Medium confidence

Encodes separated stems from raw PCM buffers into downloadable audio files (WAV, MP3, or other formats) with metadata (sample rate, bit depth, channel count). The implementation uses browser APIs or Rust-based encoders to convert Float32Array buffers to file formats, handling byte ordering, header generation, and optional compression.

Solves for

export separated stems as downloadable audio files for use in DAWs or other toolssave stems in multiple formats (WAV for lossless, MP3 for smaller file size)preserve audio quality and metadata during exportbatch export multiple stems with consistent naming and formatting

Best for

music production applications where users need to download stems

batch processing pipelines that generate stem files

applications integrating with DAWs or other audio software

Requires

browser File API and Blob support for file generation

audio encoder library (can be Rust-based or JavaScript-based)

sufficient disk space for generated files

Limitations

WAV encoding is lossless but produces large files (typically 50-100MB per stem for 3-5 minute songs)

MP3 encoding requires external library or browser API; quality depends on bitrate (typically 128-320kbps)

no built-in metadata tagging (ID3 for MP3, RIFF INFO for WAV); requires additional library for full metadata support

What makes it unique

Implements WAV encoding directly in Rust/WASM to avoid JavaScript overhead and external encoder dependencies. Generates valid WAV headers with correct RIFF structure and PCM format specifications, enabling direct file download without server-side encoding.

vs alternatives

Faster than JavaScript-based WAV encoding because Rust is compiled; avoids server-side encoding costs and latency; produces valid WAV files without external libraries or APIs.

progress reporting and cancellation for long-running inference

Medium confidence

Exposes callbacks or event emitters that report inference progress (e.g., percentage complete, current stem being processed) and allow users to cancel ongoing inference. The implementation divides inference into checkpoints, emits progress events after each checkpoint, and checks for cancellation signals before proceeding to the next step.

Solves for

show users progress during long inference operations (3-10 minutes for typical songs)allow users to cancel inference if they change their mind or want to process a different fileprovide feedback that the application is still responsive during inferenceestimate time remaining based on progress rate

Best for

web applications with long-running inference that need user feedback

music production tools where users may want to cancel and retry

applications with strict UX requirements for responsiveness

Requires

inference engine that supports checkpoint-based execution or can be interrupted between steps

callback or event emitter mechanism to communicate progress to JavaScript caller

cancellation token or flag that inference loop checks regularly

Limitations

progress granularity depends on inference checkpoints; may report progress in large jumps rather than smoothly

cancellation is not instantaneous; must wait for current inference step to complete before stopping

no built-in time estimation; requires tracking historical inference times to estimate remaining time

What makes it unique

Implements checkpoint-based progress reporting in Rust/WASM by dividing inference into discrete steps and emitting progress events via JavaScript callbacks. Uses atomic flags for cancellation signaling to avoid race conditions between WASM and JavaScript threads.

vs alternatives

More responsive than blocking inference because progress is reported incrementally; allows cancellation without restarting the entire process; provides better UX than silent inference by keeping users informed.

error handling and graceful degradation for inference failures

Medium confidence

Catches inference errors (e.g., out-of-memory, invalid model, corrupted audio) and returns meaningful error messages to the caller, with optional fallback strategies (e.g., reduce audio quality, use smaller model variant). The implementation includes validation of input audio, model state checks, and error propagation through the JavaScript API.

Solves for

provide clear error messages when inference fails (e.g., 'Out of memory, try a shorter audio file')gracefully handle edge cases like very long audio files or unusual sample ratessuggest recovery strategies to users (e.g., reduce audio quality or use a smaller model)prevent silent failures or cryptic WASM errors from reaching users

Best for

production web applications where error handling is critical

applications with diverse user inputs and device capabilities

teams that need to support users with limited device resources

Requires

input validation logic for audio buffers and model state

error types and messages defined in Rust and exposed to JavaScript

optional fallback models or strategies pre-loaded

Limitations

error messages are only as good as the validation logic; some errors may not be caught until inference time

fallback strategies (e.g., smaller models) must be pre-implemented; no automatic model selection

out-of-memory errors may crash the browser tab; no recovery from OOM without reloading

What makes it unique

Implements comprehensive error handling in Rust with custom error types that map to JavaScript exceptions, providing structured error information (code, message, recovery suggestions) rather than opaque WASM panics. Validates input audio and model state before inference to catch errors early.

vs alternatives

More informative than raw WASM errors because custom error types provide context; better UX than silent failures because errors are reported with recovery suggestions; more robust than naive implementations because validation catches edge cases early.

Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.

Related Artifactssharing capabilities

Artifacts that share capabilities with Demucs music stem separator rewritten in Rust – runs in the browser, ranked by overlap. Discovered automatically through the match graph.

Model50

wav2vec2-large-xlsr-53-russian

automatic-speech-recognition model by undefined. 45,90,191 downloads.

streaming and chunked audio processing for real-time transcriptionbatch audio processing with dynamic padding and mixed-precision inference

2 shared capabilities

Model48

distil-large-v3

automatic-speech-recognition model by undefined. 13,05,832 downloads.

onnx-export-and-cross-platform-inference

1 shared capability

Template42

ruvector-onnx-embeddings-wasm

Portable WASM embedding generation with SIMD and parallel workers - run text embeddings in browsers, Cloudflare Workers, Deno, and Node.js

onnx model loading and runtime initialization

1 shared capability

Model46

RMBG-1.4

image-segmentation model by undefined. 10,16,325 downloads.

onnx-based cross-platform inference without pytorch dependency

1 shared capability

Model59

Kokoro TTS

Lightweight 82M parameter open-source TTS with high-quality output.

onnx-based audio waveform generation from phoneme embeddings

1 shared capability

Model23

OpenAI: GPT-4o Audio

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs...

real-time-audio-streaming-inference

1 shared capability

Best For

✓web developers building music production tools
✓teams building privacy-first audio applications
✓indie music producers wanting client-side stem separation
✓developers prototyping audio ML features without server costs
✓developers building real-time audio processing pipelines
✓applications that need to process long audio files without loading entirely into memory
✓live music production tools that process microphone input
✓web apps where users return multiple times and want fast startup

Known Limitations

⚠ONNX Runtime WebAssembly has higher memory overhead than native inference; typical 3-5 minute songs require 2-4GB RAM during processing
⚠inference speed depends on client CPU/GPU; no GPU acceleration in most browsers (WebGPU still experimental)
⚠model weights must be downloaded to client (typically 200-500MB for full Demucs model); no streaming model loading
⚠browser tab may become unresponsive during long inference; no built-in progress reporting or cancellation
⚠limited to browsers with WebAssembly support (IE11 not supported)
⚠overlap-add reconstruction introduces latency proportional to window size; typical 2-5 second delay before first stem output

Requirements

modern browser with WebAssembly support (Chrome 57+, Firefox 52+, Safari 11+)sufficient client RAM (minimum 2GB for typical songs)ONNX model files pre-converted and hosted (Demucs v3 or v4 models)JavaScript runtime to load wasm module and call inference APIaudio source providing samples at consistent sample ratebuffer size matching model's expected input dimensions (typically 16384 or 32768 samples)resampler library if input sample rate differs from model's expected rateONNX model files hosted on CDN or same-origin server

Input / Output

Accepts: audio/wav, audio/mp3, audio/webm, raw PCM audio buffers (Float32Array), streaming audio buffers (Float32Array chunks), audio file paths or Blob objects, ONNX model files (.onnx format), model metadata (input/output shapes, expected sample rate), single audio buffer (Float32Array) containing mixed audio, audio/mpeg (MP3), audio/wav (WAV), audio/ogg, audio/flac (limited browser support), Blob or File objects from file input, raw PCM audio buffers (Float32Array) for each stem, metadata (sample rate, bit depth, channel count), callback function or event listener for progress updates, cancellation token or abort signal, audio buffers and model state, error handling configuration (e.g., fallback strategy)

Produces: audio/wav (separate stem files for vocals, drums, bass, other), raw PCM audio buffers (Float32Array per stem), streaming stem buffers (Float32Array chunks per stem), complete stem audio files after processing finishes, loaded model state in WASM memory, cached model weights in IndexedDB, structured object with separate Float32Array buffers for each stem (vocals, drums, bass, other), or four separate audio files in WAV/MP3 format, raw PCM audio buffers (Float32Array) at model's expected sample rate, mono or stereo depending on model requirements, audio/wav files (WAV format with PCM encoding), audio/mpeg files (MP3 format with lossy compression), Blob objects ready for download or further processing, progress events with percentage complete and current step, cancellation confirmation or error if cancelled, error objects with message, code, and optional recovery suggestions, fallback results if recovery strategy is applied

UnfragileRank

Adoption36%(30% weight)

Quality16%(20% weight)

Ecosystem36%(15% weight)

Match Graph25%(30% weight)

Freshness75%(5% weight)

UnfragileRank is computed from adoption signals, documentation quality, ecosystem connectivity, match graph feedback, and freshness. No artifact can pay for a higher rank.

Type: Framework

8 capabilities

Visit Demucs music stem separator rewritten in Rust – runs in the browser→

About

Show HN: Demucs music stem separator rewritten in Rust – runs in the browser

Alternatives to Demucs music stem separator rewritten in Rust – runs in the browser

GitHub Copilot70Extension

Your AI pair programmer

Compare →

Supabase69Platform

Search the Supabase docs for up-to-date guidance and troubleshoot errors quickly. Manage organizations, projects, databases, and Edge Functions, including migrations, SQL, logs, advisors, keys, and type generation, in one flow. Create and manage development branches to iterate safely, confirm costs

Compare →

langchain63Framework

Typescript bindings for langchain

Compare →

ChatGPT62Extension

GPT-4,Key-free,Free of charge,免Key,免魔法,免注册,免费

Compare →

Are you the builder of Demucs music stem separator rewritten in Rust – runs in the browser?

Claim this artifact to get a verified badge, access match analytics, see which intents users search for, and manage your listing.

Claim this artifact →Verification via email

Get the weekly brief

New tools, rising stars, and what's actually worth your time. No spam.

Data Sources

hackernews

Looking for something else?

Search →

Capabilities8 decomposed

browser-native audio stem separation with onnx inference

Medium confidence

Solves for

Best for

web developers building music production tools

teams building privacy-first audio applications

indie music producers wanting client-side stem separation

Requires

modern browser with WebAssembly support (Chrome 57+, Firefox 52+, Safari 11+)

sufficient client RAM (minimum 2GB for typical songs)

ONNX model files pre-converted and hosted (Demucs v3 or v4 models)

Limitations

ONNX Runtime WebAssembly has higher memory overhead than native inference; typical 3-5 minute songs require 2-4GB RAM during processing

inference speed depends on client CPU/GPU; no GPU acceleration in most browsers (WebGPU still experimental)

model weights must be downloaded to client (typically 200-500MB for full Demucs model); no streaming model loading

What makes it unique

vs alternatives

real-time audio buffer streaming and windowing

Medium confidence

Solves for

Best for

developers building real-time audio processing pipelines

applications that need to process long audio files without loading entirely into memory

live music production tools that process microphone input

Requires

audio source providing samples at consistent sample rate

buffer size matching model's expected input dimensions (typically 16384 or 32768 samples)

resampler library if input sample rate differs from model's expected rate

Limitations

overlap-add reconstruction introduces latency proportional to window size; typical 2-5 second delay before first stem output

requires careful synchronization between input buffering and inference scheduling; race conditions can cause audio artifacts

no built-in handling for variable sample rates; input must be resampled to model's expected rate (typically 44.1kHz or 48kHz) before buffering

What makes it unique

vs alternatives

onnx model weight loading and caching

Medium confidence

Solves for

Best for

web apps where users return multiple times and want fast startup

applications with bandwidth constraints or slow network connections

teams building music production tools with offline-first requirements

Requires

ONNX model files hosted on CDN or same-origin server

browser support for IndexedDB (all modern browsers)

sufficient disk quota in browser storage (varies by device)

Limitations

IndexedDB storage quota varies by browser (typically 50MB-1GB); large models may exceed quota on mobile devices

no built-in versioning; updating model weights requires cache invalidation logic in application code

model loading blocks inference; no lazy loading of model layers

What makes it unique

vs alternatives

multi-stem parallel inference orchestration

Medium confidence

Solves for

Best for

music production applications that need all stems simultaneously

batch processing pipelines that separate multiple songs

applications where stem synchronization is critical (e.g., mixing tools)

Requires

ONNX model supporting multi-stem output or separate models per stem

sufficient RAM to hold input audio + all output stems simultaneously

inference engine capable of running model multiple times or supporting batched inference

Limitations

inference time scales linearly with number of stems if using single-output model; no parallelization across stems in browser

memory usage multiplies with number of stems; storing all four stems in memory simultaneously requires 4x the audio buffer size

no built-in stem selection; must compute all stems even if only one is needed

What makes it unique

vs alternatives

audio format conversion and resampling

Medium confidence

Solves for

Best for

web apps that accept user-uploaded audio files in arbitrary formats

music production tools that need to handle diverse audio sources

applications where audio preprocessing should be transparent to users

Requires

Web Audio API support in browser (all modern browsers)

audio codec support for input format (browser-dependent; MP3 and WAV widely supported)

resampling library or algorithm (can be implemented in Rust or delegated to Web Audio API)

Limitations

resampling quality depends on algorithm; linear interpolation introduces aliasing artifacts; sinc interpolation is slower but higher quality

Web Audio API decoding is asynchronous and blocks the main thread; large files may cause UI freezing

no support for lossless formats like FLAC in all browsers; MP3 decoding may introduce artifacts

What makes it unique

vs alternatives

stem output export to audio files

Medium confidence

Solves for

Best for

music production applications where users need to download stems

batch processing pipelines that generate stem files

applications integrating with DAWs or other audio software

Requires

browser File API and Blob support for file generation

audio encoder library (can be Rust-based or JavaScript-based)

sufficient disk space for generated files

Limitations

WAV encoding is lossless but produces large files (typically 50-100MB per stem for 3-5 minute songs)

MP3 encoding requires external library or browser API; quality depends on bitrate (typically 128-320kbps)

no built-in metadata tagging (ID3 for MP3, RIFF INFO for WAV); requires additional library for full metadata support

What makes it unique

vs alternatives

Faster than JavaScript-based WAV encoding because Rust is compiled; avoids server-side encoding costs and latency; produces valid WAV files without external libraries or APIs.

progress reporting and cancellation for long-running inference

Medium confidence

Solves for

Best for

web applications with long-running inference that need user feedback

music production tools where users may want to cancel and retry

applications with strict UX requirements for responsiveness

Requires

inference engine that supports checkpoint-based execution or can be interrupted between steps

callback or event emitter mechanism to communicate progress to JavaScript caller

cancellation token or flag that inference loop checks regularly

Limitations

progress granularity depends on inference checkpoints; may report progress in large jumps rather than smoothly

cancellation is not instantaneous; must wait for current inference step to complete before stopping

no built-in time estimation; requires tracking historical inference times to estimate remaining time

What makes it unique

vs alternatives

error handling and graceful degradation for inference failures

Medium confidence

Solves for

Best for

production web applications where error handling is critical

applications with diverse user inputs and device capabilities

teams that need to support users with limited device resources

Requires

input validation logic for audio buffers and model state

error types and messages defined in Rust and exposed to JavaScript

optional fallback models or strategies pre-loaded

Limitations

error messages are only as good as the validation logic; some errors may not be caught until inference time

fallback strategies (e.g., smaller models) must be pre-implemented; no automatic model selection

out-of-memory errors may crash the browser tab; no recovery from OOM without reloading

What makes it unique

vs alternatives

Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.

Alternatives to Demucs music stem separator rewritten in Rust – runs in the browser

GitHub Copilot70Extension

Your AI pair programmer

Compare →

Supabase69Platform

Compare →

langchain63Framework

Typescript bindings for langchain

Compare →

ChatGPT62Extension

GPT-4,Key-free,Free of charge,免Key,免魔法,免注册,免费

Compare →

Demucs music stem separator rewritten in Rust – runs in the browser

Capabilities8 decomposed

browser-native audio stem separation with onnx inference

real-time audio buffer streaming and windowing

onnx model weight loading and caching

multi-stem parallel inference orchestration

audio format conversion and resampling

stem output export to audio files

progress reporting and cancellation for long-running inference

error handling and graceful degradation for inference failures

Related Artifactssharing capabilities

wav2vec2-large-xlsr-53-russian

distil-large-v3

ruvector-onnx-embeddings-wasm

RMBG-1.4

Kokoro TTS

OpenAI: GPT-4o Audio

Best For

Known Limitations

Requirements

Input / Output

UnfragileRank

About

Categories

Alternatives to Demucs music stem separator rewritten in Rust – runs in the browser

Are you the builder of Demucs music stem separator rewritten in Rust – runs in the browser?

Get the weekly brief

Data Sources

Demucs music stem separator rewritten in Rust – runs in the browser

Capabilities8 decomposed

browser-native audio stem separation with onnx inference

real-time audio buffer streaming and windowing

onnx model weight loading and caching

multi-stem parallel inference orchestration

audio format conversion and resampling

stem output export to audio files

progress reporting and cancellation for long-running inference

error handling and graceful degradation for inference failures

Related Artifactssharing capabilities

wav2vec2-large-xlsr-53-russian

distil-large-v3

ruvector-onnx-embeddings-wasm

RMBG-1.4

Kokoro TTS

OpenAI: GPT-4o Audio

Best For

Known Limitations

Requirements

Input / Output

UnfragileRank

About

Categories

Alternatives to Demucs music stem separator rewritten in Rust – runs in the browser

Are you the builder of Demucs music stem separator rewritten in Rust – runs in the browser?

Get the weekly brief

Data Sources