Capability
Audio Preprocessing And Feature Extraction
12 artifacts provide this capability.
Want a personalized recommendation?
Find the best match →Top Matches
via “mel-spectrogram-feature-extraction-with-augmentation”
automatic-speech-recognition model by undefined. 22,16,403 downloads.
Unique: Applies SpecAugment (time and frequency masking) during training to improve robustness to acoustic variability without requiring additional training data. Uses learnable mel-frequency scaling to adapt to different audio characteristics.
vs others: More robust than raw waveform or MFCC features for neural models; faster to compute than constant-Q transform; standard representation enabling transfer learning from pre-trained models.