Real Time Video Stream Processing From Smart Glasses

1

segformer-b2-finetuned-ade-512-512Fine-tune42/100

via “real-time-video-segmentation-with-frame-buffering”

image-segmentation model by undefined. 63,104 downloads.

Unique: Implements frame buffering and adaptive processing to maintain consistent throughput under variable load, with optional temporal smoothing to reduce flickering. Supports multiple input sources (files, cameras, RTSP) with automatic frame rate detection and metrics tracking.

vs others: Handles real-time video processing with configurable latency-throughput tradeoffs, compared to naive frame-by-frame processing that causes variable latency and dropped frames. Temporal smoothing reduces flickering compared to independent frame segmentation.

2

Smart glasses that tell me when to stop pouringRepository30/100

via “real-time video stream processing from smart glasses”

I've been experimenting with a more proactive AI interface for the physical world.This project is a drink-making assistant for smart glasses. It looks at the ingredients, selects a recipe, shows the steps, and guides me in real time based on what it sees. The behavior I wanted most was simple:

Unique: Direct integration with Rokid smart glasses hardware APIs for native video capture, bypassing generic USB/HDMI capture methods that add latency and reduce frame quality. Implements hardware-level frame synchronization to ensure consistent timestamps across video and sensor data.

vs others: Achieves lower latency than generic webcam capture libraries (OpenCV, ffmpeg) because it uses native Rokid device APIs rather than OS-level video abstractions, reducing frame buffering overhead by ~30-50ms

3

FacePoke_CLONE-THIS-REPO-TO-USE-ITWeb App23/100

via “real-time facial expression manipulation via webcam”

FacePoke_CLONE-THIS-REPO-TO-USE-IT — AI demo on HuggingFace

Unique: Operates as a browser-native HuggingFace Space with direct WebRTC webcam integration, avoiding server-side video upload overhead; uses client-side canvas rendering for low-latency feedback loop between detection and visualization

vs others: Faster feedback than cloud-based face editing services because processing happens in-browser with no network round-trip per frame; simpler deployment than self-hosted solutions since it runs entirely on HuggingFace infrastructure

4

Myelin FoundryProduct

via “real-time video stream processing”

5

Chooch AI VisionProduct

via “real-time-video-stream-analysis”

6

Webcam WondersProduct

via “real-time facial beauty enhancement”

7

SiwaluProduct

via “real-time camera feed breed detection”

Unique: Processes live camera streams with temporal smoothing and frame skipping to deliver real-time breed identification at 15-30 FPS, suggesting architecture with frame buffering, inference queueing, and exponential moving average filtering for stable predictions

vs others: More responsive user experience than batch-processing competitors, but with higher computational cost and battery drain compared to single-image identification

8

SkyglassProduct

via “real-time visual effects processing during video capture”

9

VeesualProduct

via “real-time-try-on-video-streaming”

10

SignapseProduct

via “real-time sign language video-to-text translation”

Unique: Uses skeletal pose estimation (likely MediaPipe or similar hand-tracking models) combined with temporal sequence modeling to recognize sign language as a continuous gesture stream rather than discrete static hand shapes, enabling context-aware translation of signs that depend on movement trajectory and speed.

vs others: Eliminates dependency on specialized hardware or wearables (unlike glove-based systems) and works with standard webcams, making it more accessible to end users than proprietary sign language input devices.

Top Matches

Also Known As

Company