I want to learn audio programming [closed]

Tags:

At my high school we can take a class where we basically learn about a subject on our own for a semester. I was thinking that I want to learn about "sound programming," but I realized that I have no idea what that entails. I'm interested in learning about, for example, how a synthesizer works and how sound works in computer science. I really want to focus on the low-level code part, not so much the composition part. Is this a feasible subject? Are there any good tutorials out there for somebody completely new to this? I know C++ and am using Windows. The first answer in this is something that interests me (although it's over my head).

961

asked Jan 26 '11 05:01

ahota

Video Answer

2 Answers

"Sound programming" is a very broad field. First of all, it is definitely a feasible subject, but since you need to cram stuff into a single semester you will need to limit your scope. I can see that you're looking for a place to start, so here are some ideas to get you thinking.

Since you have mentioned both "how sound works in computer science" and "synthesizers", it's worth pointing out the difference between analogue sound, sampled sound and synthesized sound, as they are different concepts. I'll explain them briefly here.

Analogue sound is sound as we humans typically interpret it -- vibrations of air sensed by the human ear. You can think of sound as a one-dimensional signal, where the independent variable is time and the dependent variable is amplitude of vibration. Analogue sound is continuous both in the time and amplitude domain. Older sound recording methods (e.g. magnetic tape) used an analogue sound representation. Analogue sound is not frequently used with computers (computers aren't good with storing continuous-domain data), but understanding analogue signals is important nevertheless. Expect to see plenty of math (e.g. complex numbers, Fourier transforms) if you go down this path.

Sampled sound is the sound representation that lends itself well to processing with a computer. People are most familiar with sampled sound through CDs and other musical recordings. An analogue signal is sampled at some frequency (e.g. 44.1KHz for CD recording). So a sampled sound signal is discrete in the time domain. If the signal is quantized then it will be discrete in the amplitude domain as well. Formats like MP3 are sampled formats. There's lots of things to study in this field if you're interested, such as restoration (removing static, etc) and compression (again, codecs MP3, Ogg Vorbis). It's a lot of fun because there's lots to experiment with and code.

Both analogue and sampled sound dig deeply into a field called Digital Signal Processing. Google around for that to get a feel of what it's like. It's often taught as a course at universities, so if you're really keen you can have a look at some lecture slides or even try some of the earlier, simpler projects.

Synthesized sound is a representation that is suited for reproduction of a music track, where the instruments playing the track are known beforehand. Think of it as sheet music for the computer. Somebody has to write the sheet music -- you can't just record it like analogue or sampled sound. This makes synthesized sound a completely different representation to analogue sound and sampled sound Also, the computer needs to know what the instruments are (e.g. piano) so that it can play (synthesize) the track. If it doesn't know the instrument, it either gives up or picks a close match (e.g. replaces the piano with electric keyboard). I have never worked with synthesizers before so I can't comment on the learning curve for them.

So, based on what I wrote -- pick a direction that interests you more, Google around and then refine your question.

EDIT

A good book to read is this. You can probably look around related titles in Amazon and find something newer, but it's been a while since I did my audio processing shopping.

And if you have half an hour to spare, then have a look at this video tutorial. It covers sound, image and video processing -- they're actually closely related fields.

102

answered Oct 11 '22 06:10

mpenkov

Consider working through the book "Who Is Fourier?: A Mathematical Adventure". You could adapt the examples to make small programming assignments that demonstrate the basic concepts. After you're done you should be able to use the fft to make a spectrogram of your voice as you pronounce the vowels a,e,i,o,u -- identifying the fundamental frequency and the formants of each vowel.

I recommend learning Python and the modules NumPy, SciPy, and matplotlib (there's a ton there, so beyond the basic tutorials, just learn as you go). The iPython shell has the option "-pylab -p scipy" to automatically import the most common tools into your namespace. You can record and play audio using PyAudio. There's also Pygame, which expands on SDL (Simple DirectMedia layer), and pyglet, which uses OpenAL (the OpenGL of audio; it does 3D audio and effects).

As to C/C++, there's IT++, SPUC, and FFTW for signal processing, and SDL/SDL_mixer and OpenAL/ALmixer for interfacing with hardware and audio files.

answered Oct 11 '22 07:10

Eryk Sun

Related questions
                            
                                frame rate vs sample rate
                            
                                Python change pitch of wav file [closed]
                            
                                Trim audio files with Sox in milliseconds
                            
                                where to start with audio synthesis on iPhone
                            
                                How can I detect whether a WAV file has a 44 or 46-byte header?
                            
                                Join two WAV files from Java?
                            
                                AVAudioPlayer with external URL to *.m4p
                            
                                How to find the fundamental frequency of a guitar string sound?
                            
                                midi keyboard not working on all platforms
                            
                                How to mix / overlay two mp3 audio file into one mp3 file (not concatenate)
                            
                                Playing 2 musics through 2 different sound cards at same time
                            
                                How to make my application be considered as a communication program in Windows
                            
                                Changing Speed of Audio Using the Web Audio API Without Changing Pitch
                            
                                Perceptual similarity between two audio sequences
                            
                                Resources for audio DSP beginners? [closed]
                            
                                Music Analysis and Visualization
                            
                                Playing Sound In Hidden Tag
                            
                                Does "16bit integer PCM data" mean it's signed or unsigned?
                            
                                Ffmpeg to duplicate an audio stream and encode this new stream
                            
                                How to detect iphone is on silent mode

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

I want to learn audio programming [closed]

Tags:

signal-processing

audio

synthesis