Capability
Low-Latency Real-Time Audio/Video Communication
20 artifacts provide this capability.
Top Matches
via “low-latency streaming voice activity detection with frame buffering”
automatic-speech-recognition model. 2,346,228 downloads.
Unique: Implements frame-buffered streaming inference with configurable temporal smoothing windows. This enables real-time predictions on unbounded audio streams, with accuracy maintained through learned temporal context aggregation rather than simple energy-based windowing.
vs others: Lower latency than batch-processing approaches and more accurate than simple energy or spectral thresholding. Audio is processed as it arrives, so the full recording is not required upfront.
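
The frame buffering and temporal smoothing described above can be illustrated with a minimal Python sketch. It is not the listed model's actual API: the `StreamingVAD` class, its parameters, and the `toy_score` function are hypothetical names introduced for illustration. In particular, `toy_score` is a crude amplitude check standing in for the model's learned per-frame score, and the moving average stands in for its learned temporal context aggregation.

```python
from collections import deque


class StreamingVAD:
    """Hypothetical frame-buffered detector with a moving-average smoothing window."""

    def __init__(self, score_frame, frame_size=512, smooth_frames=5, threshold=0.5):
        self.score_frame = score_frame      # callable: list of samples -> score in [0, 1]
        self.frame_size = frame_size        # samples per inference frame
        self.threshold = threshold          # smoothed score at or above this means speech
        self._pending = []                  # samples that do not yet fill a frame
        self._scores = deque(maxlen=smooth_frames)  # most recent per-frame scores

    def push(self, chunk):
        """Accept a chunk of any length; return one decision per completed frame."""
        self._pending.extend(chunk)
        decisions = []
        while len(self._pending) >= self.frame_size:
            frame = self._pending[:self.frame_size]
            del self._pending[:self.frame_size]
            self._scores.append(self.score_frame(frame))
            # Average over the recent window so one noisy frame cannot flip the output.
            smoothed = sum(self._scores) / len(self._scores)
            decisions.append(smoothed >= self.threshold)
        return decisions


def toy_score(frame):
    """Placeholder scorer (assumption): 1.0 if mean absolute amplitude exceeds 0.1."""
    return 1.0 if sum(abs(s) for s in frame) / len(frame) > 0.1 else 0.0


vad = StreamingVAD(toy_score, frame_size=4, smooth_frames=2, threshold=0.5)
for chunk in ([0.0] * 3, [0.0] * 5, [1.0] * 4, [0.0] * 4, [0.0] * 4):
    print(vad.push(chunk))
```

Because chunks are buffered until a full frame is available, the caller can feed audio in whatever sizes the capture device delivers. Latency is bounded by one frame plus the smoothing window, rather than by the length of the recording.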