Real-time audio signal processing using Python

I have been trying to do real-time audio signal processing using the PyAudio module in Python. As a simple test, I read audio data from the microphone and play it back through headphones. I tried the following code (both Python and Cython versions). It works, but the playback stalls and is not smooth enough. How can I improve the code so that it runs smoothly? My PC has an i7 CPU and 8 GB of RAM.

Python Version

import pyaudio
import numpy as np

RATE    = 16000
CHUNK   = 256

p               =   pyaudio.PyAudio()

player = p.open(format=pyaudio.paInt16, channels=1, rate=RATE, output=True,
                frames_per_buffer=CHUNK)
stream = p.open(format=pyaudio.paInt16, channels=1, rate=RATE, input=True,
                frames_per_buffer=CHUNK)

for i in range(int(20*RATE/CHUNK)):  # 20 seconds' worth of chunks
    # np.frombuffer replaces the deprecated binary mode of np.fromstring
    player.write(np.frombuffer(stream.read(CHUNK), dtype=np.int16))
stream.stop_stream()
stream.close()
p.terminate()

Cython Version

import pyaudio
import numpy as np

cdef int RATE   = 16000
cdef int CHUNK  = 1024
cdef int i      
p               =   pyaudio.PyAudio()

player = p.open(format=pyaudio.paInt16, channels=1, rate=RATE, output=True, frames_per_buffer=CHUNK)
stream = p.open(format=pyaudio.paInt16, channels=1, rate=RATE, input=True, frames_per_buffer=CHUNK)

for i in range(500):  # 500 chunks of 1024 frames at 16 kHz is 32 seconds
    # np.frombuffer replaces the deprecated binary mode of np.fromstring
    player.write(np.frombuffer(stream.read(CHUNK), dtype=np.int16))
stream.stop_stream()
stream.close()
p.terminate()

380

asked Sep 24 '17 02:09

Sajil C K

1 Answer

The code below takes the default input device and plays what it records on the default output device.

import time

import numpy as np
import pyaudio

p = pyaudio.PyAudio()

CHANNELS = 2
RATE = 44100

def callback(in_data, frame_count, time_info, flag):
    # use NumPy to convert the raw bytes to an array for processing
    # audio_data = np.frombuffer(in_data, dtype=np.float32)
    return in_data, pyaudio.paContinue

stream = p.open(format=pyaudio.paFloat32,
                channels=CHANNELS,
                rate=RATE,
                output=True,
                input=True,
                stream_callback=callback)

stream.start_stream()

# the callback runs in a separate thread, so just wait 20 seconds, then stop
time.sleep(20)
stream.stop_stream()
print("Stream is stopped")

stream.close()

p.terminate()

This runs for 20 seconds and then stops. The callback function is where you process the signal, for example by first converting the input to an array: audio_data = np.frombuffer(in_data, dtype=np.float32)

The return in_data line is where you send data back to the output device, so return your post-processed data (as bytes) there instead of the unmodified input.
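As a rough sketch of what that processing step could look like, the snippet below scales the volume inside the callback. The names apply_gain and make_callback and the gain value are made up for illustration and are not part of PyAudio. It assumes interleaved float32 samples, matching the paFloat32 format used above.

```python
import numpy as np


def apply_gain(in_data, channels, gain):
    """Scale interleaved float32 audio bytes by `gain` and return bytes.

    Hypothetical helper for illustration; swap in your own processing.
    """
    # frombuffer returns a read-only view; the multiplication makes a copy.
    samples = np.frombuffer(in_data, dtype=np.float32)
    frames = samples.reshape(-1, channels)  # one row per frame
    # Keep the result inside the valid float32 audio range [-1.0, 1.0].
    processed = np.clip(frames * gain, -1.0, 1.0)
    return processed.astype(np.float32).tobytes()


def make_callback(continue_flag, channels=2, gain=0.5):
    """Build a PyAudio-style callback; pass pyaudio.paContinue as the flag."""
    def callback(in_data, frame_count, time_info, status):
        return apply_gain(in_data, channels, gain), continue_flag
    return callback
```

With this in place, the stream would be opened with stream_callback=make_callback(pyaudio.paContinue, CHANNELS). Keeping the processing in a plain function makes it possible to check it on test data without any audio hardware.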

Note that the chunk size (the frames_per_buffer argument of open) has a default value of 1024, as noted in the PyAudio docs: http://people.csail.mit.edu/hubert/pyaudio/docs/#pyaudio.PyAudio.open
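To get a feel for what a given chunk size means in time, here is a small arithmetic sketch. The helper names are hypothetical; the values are the chunk sizes and sample rates that appear in the question and in the code above.

```python
def chunk_duration_ms(frames_per_buffer, rate):
    """Length of one buffer in milliseconds."""
    return 1000.0 * frames_per_buffer / rate


def chunks_for(seconds, frames_per_buffer, rate):
    """Number of whole buffers needed to cover `seconds` of audio."""
    return int(seconds * rate / frames_per_buffer)


for frames, rate in [(256, 16000), (1024, 16000), (1024, 44100)]:
    ms = chunk_duration_ms(frames, rate)
    print(f"{frames} frames at {rate} Hz = {ms:.1f} ms per buffer")

# Loop counts for a blocking read/write loop at 16 kHz:
print(chunks_for(20, 256, 16000))   # 1250 iterations for 20 seconds
print(chunks_for(10, 1024, 16000))  # 156 iterations for about 10 seconds
```

A smaller buffer lowers latency but leaves less time per iteration to finish the read, the processing, and the write before the next buffer is due.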

172

answered Sep 16 '22 15:09

kckaiwei

Tags: python, signals, real-time, cython, audio
