Programmers and data scientists often ask me about audio data preprocessing:
– How can I extract spectrograms?
– How can I normalise the signal?
– What if I have files of different lengths?
To answer these questions and more, I published a tutorial where you can learn how to build an audio preprocessing pipeline for AI applications. The pipeline preprocesses audio files in batches, applying the Short-Time Fourier Transform, zero-padding, and normalisation all in one go!
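To give a rough idea of what those three steps look like in code, here is a minimal NumPy-only sketch. It is not the code from the video: the function names (`pad_to_length`, `log_spectrogram`, `min_max_normalise`, `preprocess`), the frame size, the hop length, and the choice of min-max scaling on a decibel spectrogram are all my assumptions for illustration. It also assumes each clip is padded to at least `frame_size` samples.

```python
import numpy as np


def pad_to_length(signal, num_samples):
    """Right-pad with zeros (or truncate) so every clip has num_samples samples."""
    signal = signal[:num_samples]
    return np.pad(signal, (0, num_samples - len(signal)), mode="constant")


def log_spectrogram(signal, frame_size=512, hop_length=256):
    """Magnitude STFT in decibels, shape (frame_size // 2 + 1, num_frames)."""
    window = np.hanning(frame_size)
    num_frames = 1 + (len(signal) - frame_size) // hop_length
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([
        signal[i * hop_length: i * hop_length + frame_size] * window
        for i in range(num_frames)
    ])
    magnitude = np.abs(np.fft.rfft(frames, axis=1)).T
    # Small offset avoids log(0) in silent (zero-padded) regions.
    return 20 * np.log10(magnitude + 1e-10)


def min_max_normalise(array, low=0.0, high=1.0):
    """Linearly rescale array values into [low, high]."""
    span = array.max() - array.min()
    if span == 0:
        return np.full_like(array, low)
    return (array - array.min()) / span * (high - low) + low


def preprocess(signal, num_samples, frame_size=512, hop_length=256):
    """Pad, transform, and normalise one clip.

    Also returns the original (min, max) so the scaling can be undone later.
    """
    padded = pad_to_length(np.asarray(signal, dtype=float), num_samples)
    spec = log_spectrogram(padded, frame_size, hop_length)
    return min_max_normalise(spec), (spec.min(), spec.max())


# Two sine-wave clips of different lengths end up with the same shape.
short_clip = np.sin(2 * np.pi * 440 * np.arange(1000) / 22050)
long_clip = np.sin(2 * np.pi * 440 * np.arange(1800) / 22050)
batch = [preprocess(clip, num_samples=2048)[0] for clip in (short_clip, long_clip)]
print([spec.shape for spec in batch])  # [(257, 7), (257, 7)]
```

Keeping the original minimum and maximum alongside each spectrogram is a design choice that matters for generative models: without them, a generated spectrogram cannot be mapped back to decibels before converting it to audio.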
This video is a new installment of the series “Generating sound with neural nets”, where you can learn to generate sound using Variational AutoEncoders.
Here’s the video: