I published a tutorial where I build a preprocessing pipeline for audio data

I often get a ton of questions from programmers and data scientists about audio data preprocessing:

– How can I extract spectrograms?

– How can I normalise the signal?

– What if I have files of different lengths?

To answer these questions and more, I published a tutorial where you can learn how to build an audio preprocessing pipeline for AI applications. The pipeline batch preprocesses audio files applying Short-Time Fourier Transform, zero-padding, normalisation all in one go!

This video is a new installment of the series “Generating sound with neural nets”, where you can learn to generate sound using Variational AutoEncoders.

Here’s the video:

https://www.youtube.com/watch?v=O04v3cgHNeM&list=PL-wATfeyAMNpEyENTc-tVH5tfLGKtSWPp&index=12

submitted by /u/diabulusInMusica
[visit reddit] [comments]

Leave a Reply Cancel reply