Precise seeking with ffmpeg

Tags:

Let's say I have an audio file being decoded with ffmpeg. The source format is something like AAC where the audio is split into packets. When seeking to a particular time, it is clear that the time will not fall, most of the time, on the packet border but somewhere within the packet duration. Do I have to seek within packet myself or av_seek_frame does it all by itself and sets up decoding so that the next decoded frame should start at the position I've requested?

If I use the function av_seek_frame with the flag AVSEEK_FLAG_BACKWARD, I assume that the next packet returned by av_read_frame will be the packet containing the time position I am seeking to. Is that right?

If I decode this packet with avcodec_decode_audio4, will the frame returned contain the audio data at the start time of the packet begining or from the time I've passed to av_seek_frame? In the latter case how can I find out the frame/packet timestamp so as to estimate the number of samples to skip in the decoded frame? The PTS after seek is zero and DTS looks useless either.

Is it possible to seek with precision to a particular time using ffmpeg?

854

asked Aug 06 '15 11:08

Taras Galchenko

1 Answers

There is no frame-exact or audio-sample-exact seeking in ffmpeg, that's an application-level problem. The reason is quite simple: libavformat does the seeking, and it doesn't know what's inside the packets that individual demuxers return. It just has a blob of data with timestamp X and duration Y. It doesn't know anything about that data, you'd have to decode the data to do anything meaningful with it, which is libavcodec, not libvformat.

So, to answer your questions: av_seek_frame seeks to packet boundaries, AVSEEK_FLAG_BACKWARD means the packet will be strictly before the given ts; for audio, that means that the packet will most likely contain your timestamp. However, this is not always the case, because some demuxers seek based on an index, and not each packet may have an index entry. You may have to call av_read_frame() several times before you get to the packet that contains your specified timestamp after the seek.

Other than you calling avcodec_flush(), libavcodec doesn't know anything about seeking, so the output of the next call to avcodec_decode_audio4 will start at the start of the input packet. For sample-specific seeking, applications have to chop off leading samples themselves.

answered Oct 09 '22 10:10

Ronald S. Bultje

Related questions
                            
                                How to use constant memory for beginners (Cuda C)
                            
                                PeekMessage triggers WndProc callback
                            
                                Pass a C array to a Rust function
                            
                                Undefined reference to memcpy in ARM-NONE-EABI link chain
                            
                                implicit declaration of function ‘usleep’
                            
                                How to implement a language interpreter without regular expressions?
                            
                                What are the next step to improve malloc() algorithm? [closed]
                            
                                Typecasting int pointer to float pointer
                            
                                C structure syntax
                            
                                Beginner's confusion about x86 stack
                            
                                Sending files from client to server using sockets in C
                            
                                Why we flush a stream but not a buffer?
                            
                                Defining a function inside the input of another function in C
                            
                                Performance implications of a large number of mutexes
                            
                                How to make new line when using echo to write a file in C
                            
                                memcpy for copying a fixed length buffer into a structure
                            
                                Random number range, exclude 0
                            
                                Multiple threads accessing one variable
                            
                                Extraction motion vectors from H.264 bitstream [closed]
                            
                                Comparison with NaN using AVX

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Precise seeking with ffmpeg

Tags:

c

ffmpeg

audio

Taras Galchenko

People also ask

1 Answers

Ronald S. Bultje

Recent Activity

Donate For Us