Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Back pressure in Kafka

I have a situation in Kafka where the producer publishes the messages at a very higher rate than the consumer consumption rate. I have to implement the back pressure implementation in kafka for further consumption and processing.

Please let me know how can I implement in spark and also in normal java api.

like image 240
sirigiri sai kumar Avatar asked Apr 05 '18 02:04

sirigiri sai kumar


2 Answers

Kafka acts as the regulator here. You produce at whatever rate you want to into Kafka, scaling the brokers out to accommodate the ingest rate. You then consume as you want to; Kafka persists the data and tracks the offset of the consumers as they work their way through the data they read.

like image 182
Robin Moffatt Avatar answered Oct 16 '22 11:10

Robin Moffatt


You can disable auto-commit by enable.auto.commit=false on consumer and commit only when consumer operation is finished. That way consumer would be slow, but Kafka knows how many messages consumer processed, also configuring poll interval with max.poll.interval.ms and messages to be consumed in each poll with max.poll.records you should be good.

like image 30
donm Avatar answered Oct 16 '22 10:10

donm