Python library to modify MP3 audio without transcoding

Tags:

I am looking for some general advice about the mp3 format before I start a small project to make sure I am not on a wild-goose chase.

My understanding of the internals of the mp3 format is minimal. Ideally, I am looking for a library that would abstract those details away. I would prefer to use Python (but could be convinced otherwise).

I would like to modify a set of mp3 files in a fairly simple way. I am not so much interested in the ID3 tags but in the audio itself. I want to be able to delete sections (e.g. drop 10 seconds from the 3rd minute), and insert sections (e.g. add credits to the end.)

My understanding is that the mp3 format is lossy, and so decoding it to (for example) PCM format, making the modifications, and then encoding it again to MP3 will lower the audio quality. (I would love to hear that I am wrong.)

I conjecture that if I stay in mp3 format, there will be some sort of minimum frame or packet-size to deal with, so the granularity of the operations may be coarser. I can live with that, as long as I get an accuracy of within a couple of seconds.

I have looked at PyMedia, but it requires me to migrate to PCM to process the data. Similarly, LAME wants to help me encode, but not access the data in place. I have seen several other libraries that only deal with the ID3 tags.

Can anyone recommend a Python MP3 library? Alternatively, can you disabuse me of my assumption that going to PCM and back is bad and avoidable?

670

asked Nov 22 '08 02:11

Oddthinking

2 Answers

If you want to do things low-level, use pymad. It turns MP3s into a buffer of sample data.

If you want something a little higher-level, use the Echo Nest Remix API (disclosure: I wrote part of it for my dayjob). It includes a few examples. If you look at the cowbell example (i.e., MoreCowbell.dj), you'll see a fork of pymad that gives you a NumPy array instead of a buffer. That datatype makes it easier to slice out sections and do math on them.

answered Oct 29 '22 01:10

iconoplast

I got three quality answers, and I thank you all (and upvoted you all) for them. I haven't chosen any as the accepted answer, because each addressed one aspect, so I wanted to write a summary.

Do you need to work in MP3?

Transcoding to PCM and back to MP3 is unlikely to result in a drop in quality.
Don't optimise audio-quality prematurely; test it with a simple prototype and listen to it.

Working in MP3

Wikipedia has a summary of the MP3 File Format.
MP3 frames are short (1152 samples, or just a few milliseconds) allowing for moderate precision at that level.
However, Wikipedia warns that "Frames are not independent items ("byte reservoir") and therefore cannot be extracted on arbitrary frame boundaries."
Existing libraries are unlikely to be of assistance, if I really want to avoid decoding.

Working in PCM

There are several libraries at this level:

LAME (latest release: October 2017)
PyMedia (latest release: February 2006)
PyMad (Linux only? Decoder only? Latest release: January 2007)

Working at a higher level

Echo Nest Remix API (Mac or Linux only, at the moment) is an API to a web-service that supports quite sophisticated operations (e.g. finding the locations of music beats and tempo, etc.)
mp3DirectCut (Windows only) is a GUI that apparently performs the operations I want, but as an app. It is not open-source. (I tried to run it, got an Access Denied installer error, and didn't follow up. A GUI isn't suitably for me, as I want to repeatedly run these operations on a changing library of files.)

My plan is now to start out in PyMedia, using PCM. Thank you all for your assistance.

answered Oct 29 '22 00:10

3 revs, 3 users 95%

Related questions
                            
                                alembic create_table, check if table exists
                            
                                Why use packed *args/**kwargs instead of passing list/dict?
                            
                                Building custom Caffe layer in python
                            
                                aws - "Unable to import module 'process': /var/task/numpy/core/multiarray.so: invalid ELF header"
                            
                                Append a 1d array to a 2d array in Numpy Python
                            
                                pytest dynamically generate test method
                            
                                How to configure Airflow dag to run at specific time on daily basis?
                            
                                Custom cluster colors of SciPy dendrogram in Python (link_color_func?)
                            
                                Django CSRF cookie not set correctly
                            
                                numpy difference between flat and ravel()
                            
                                Why does list(next(iter(())) for _ in range(1)) == []?
                            
                                Multivariable/Multiple Linear Regression in Scikit Learn?
                            
                                TensorFlow version 1.0.0-rc2 on Windows: "OpKernel ('op: "BestSplits" device_type: "CPU"') for unknown op: BestSplits" with test code
                            
                                How to export Estimator model with export_savedmodel function
                            
                                Is `setup.cfg` deprecated?
                            
                                How to pass rgb color values to python's matplotlib eventplot?
                            
                                ValueWarning: No frequency information was provided, so inferred frequency MS will be used
                            
                                Shared python generator
                            
                                What is the non deprecated version of open "U" mode
                            
                                How do you design data models for Bigtable/Datastore (GAE)?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python library to modify MP3 audio without transcoding

Tags:

python

mp3

codec

Oddthinking

People also ask

2 Answers

iconoplast

3 revs, 3 users 95%

Recent Activity

Donate For Us