 

Flink Tumble Window Trigger time

I am using Flink to aggregate data from Kafka topics. I am using a tumbling window of 1 hour, with the time characteristic set to event time. I am also using AscendingTimestampExtractor to assign timestamps and watermarks to the input, based on a particular field value in the JSON.

env.setStreamTimeCharacteristic(TimeCharacteristic.EventTime);

We can assume that the JSON field used for watermarking holds a time value close to that of the system clock.

If I deploy the job at 3:25 pm, when can I expect the aggregated output from Flink? Will it be at 4:00 pm, 5:00 pm, ... or will the windows be 3:25 pm - 4:25 pm, 4:25 pm - 5:25 pm, ...?

Madhu Kumar Tallapalli asked Oct 13 '25 01:10



1 Answer

Flink's windows are aligned with the epoch (rather than with the first event), so if the job is deployed at 3:25, the first window will cover the interval from 3:00:00 to 3:59:59.999. Its result will be produced soon after 4:00, once the watermark passes the end of that window, and later results will follow at roughly 5:00, 6:00, and so on.
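To make the alignment concrete, here is a minimal sketch in plain Java (no Flink dependency) of the arithmetic involved. The class and method names are hypothetical, the date is chosen only for illustration, and the firing rule reflects my understanding of the default event-time trigger combined with AscendingTimestampExtractor (watermark trailing the highest seen timestamp by 1 ms); treat it as an approximation, not Flink's actual code.

```java
import java.time.Duration;
import java.time.Instant;

public class WindowAlignment {

    // Start of the tumbling window that contains timestampMs,
    // with window boundaries aligned to the Unix epoch (UTC).
    public static long windowStart(long timestampMs, long sizeMs) {
        return timestampMs - Math.floorMod(timestampMs, sizeMs);
    }

    // Assumption: with ascending timestamps, the watermark trails the
    // highest timestamp seen so far by one millisecond.
    public static long ascendingWatermark(long maxSeenTimestampMs) {
        return maxSeenTimestampMs - 1;
    }

    // Assumption: an event-time window fires once the watermark reaches
    // the window's last included millisecond (end - 1).
    public static boolean windowFires(long windowEndMs, long watermarkMs) {
        return watermarkMs >= windowEndMs - 1;
    }

    public static void main(String[] args) {
        long size = Duration.ofHours(1).toMillis();

        // First event seen right after deploying at 3:25 pm (UTC here).
        long firstEvent = Instant.parse("2025-10-13T15:25:00Z").toEpochMilli();

        long start = windowStart(firstEvent, size);
        long end = start + size; // exclusive upper bound

        System.out.println("window start: " + Instant.ofEpochMilli(start));
        System.out.println("window end:   " + Instant.ofEpochMilli(end));

        // An event at 3:59:59.999 does not close the window yet...
        System.out.println(windowFires(end, ascendingWatermark(end - 1)));
        // ...but the first event at or after 4:00:00.000 does.
        System.out.println(windowFires(end, ascendingWatermark(end)));
    }
}
```

In other words, the 3:00 to 4:00 window is emitted only after an event stamped 4:00:00.000 or later has arrived and the next watermark has been generated, so the exact output time depends on the incoming data rather than on the wall clock.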

Note that you can pass an offset as a second argument to TumblingEventTimeWindows.of, so if you wanted the windows to end at 10 minutes past the hour instead, you could do this by specifying

.window(TumblingEventTimeWindows.of(Time.hours(1), Time.minutes(10)))
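As a rough illustration of what the 10-minute offset does to the window boundaries, the following plain-Java sketch shifts the epoch-aligned grid by the offset before rounding down. The class name, method name, and sample timestamps are hypothetical and only mirror the idea; this is not Flink's implementation.

```java
import java.time.Duration;
import java.time.Instant;

public class OffsetWindows {

    // Start of the tumbling window containing timestampMs when the
    // epoch-aligned window grid is shifted by offsetMs.
    public static long windowStart(long timestampMs, long offsetMs, long sizeMs) {
        return timestampMs - Math.floorMod(timestampMs - offsetMs, sizeMs);
    }

    public static void main(String[] args) {
        long size = Duration.ofHours(1).toMillis();
        long offset = Duration.ofMinutes(10).toMillis();

        // An event at 3:25 pm falls into the window starting at 3:10 pm.
        long lateEvent = Instant.parse("2025-10-13T15:25:00Z").toEpochMilli();
        long lateStart = windowStart(lateEvent, offset, size);
        System.out.println(Instant.ofEpochMilli(lateStart)
                + " .. " + Instant.ofEpochMilli(lateStart + size));

        // An event at 3:05 pm still belongs to the window starting at 2:10 pm.
        long earlyEvent = Instant.parse("2025-10-13T15:05:00Z").toEpochMilli();
        long earlyStart = windowStart(earlyEvent, offset, size);
        System.out.println(Instant.ofEpochMilli(earlyStart)
                + " .. " + Instant.ofEpochMilli(earlyStart + size));
    }
}
```

With this offset, a job deployed at 3:25 would produce its first result shortly after 4:10 rather than shortly after 4:00.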
David Anderson answered Oct 14 '25 16:10
