python librosa package - How can I extract audio from spectrum

Tags:

In case of vocal separation using Librosa, the vocal and background music can be plotted separately but I want to extract the audio from vocal part and the spectrum of vocal part is located in a variable named 'S_foreground' (please visit the above link for demonstration). How can I get the foreground (vocal) audio?

459

asked Feb 11 '18 09:02

Saha Reno

2 Answers

You may have noticed that S_foreground comes from S_full which comes from a function called magphase. According to the document about this function, it can

Separate a complex-valued spectrogram D into its magnitude (S) and phase (P) components, so that D = S * P.

Since the actual parameter taken by magphase in

S_full, phase = librosa.magphase(librosa.stft(y))

is stft(y), which is the Short-Time Fourier Transform of y, the initial ndarray, I reckon what you need to do is to calculate a new D:

D_foreground = S_foreground * phase

And throw it to the Inverse stft function (librosa.istft):

y_foreground = librosa.istft(D_foreground)

After that, you can use the output function:

librosa.output.write_wav(output_file_path, y_foreground, sr)

To be honest, I am not familiar with these theoretical things (my poor output quality using this method might be a proof), but above is my guess on how you should export your audio. It turns out that the fidelity is very poor (at least in my case), so you might want to try some other software out if you really care about the audio quality.

135

answered Oct 24 '22 05:10

Alioth

the answer of @Alioth is working except:

librosa.output.write_wav(output_file_path, y_foreground, sr)

which the output method in librosa is deprecated, so the alternative solution could be the soundfile:

import soundfile as sf
sf.write('your_output_path.wav', y_foreground, sr)

answered Oct 24 '22 05:10

Ali Tavana

Related questions
                            
                                Drop if all entries in a spark dataframe's specific column is null
                            
                                How to automatically detect columns that contain datetime in a pandas dataframe
                            
                                Why do pandas and dask perform better when importing from CSV compared to HDF5?
                            
                                Python numpy equivalent of R rep and rep_len functions
                            
                                Cython compilation error "Not allowed in a constant expression"
                            
                                How to import models from one app to another app in Django?
                            
                                Python Dictionary: "in" vs "get"
                            
                                how to set the position of a tkinter window without setting the dimensions
                            
                                Passing extra arguments to scrapy.Request()
                            
                                Django DRF - What's the use of serializers?
                            
                                Conversion of image type int16 to uint8
                            
                                Unable to install nltk using pip
                            
                                Convert image to array for CNN
                            
                                Run process as admin with subprocess.run in python
                            
                                IPython Console in Spyder(Anaconda) is truncating output
                            
                                Standardization/Normalization test data in Python
                            
                                how to get covariance matrix in tensorflow?
                            
                                What's the meaning of cv2.videoCapture.release()?
                            
                                Python scikit-learn to JSON
                            
                                oauth2client.clientsecrets.InvalidClientSecretsError: Missing property "redirect_uris" in a client type of "web"

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

python librosa package - How can I extract audio from spectrum

Tags:

python

librosa

Saha Reno

People also ask

2 Answers

Alioth

Ali Tavana

Recent Activity

Donate For Us