Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to Process a kafka KStream and write to database directly instead of sending it another topic

I don't want to write processed KStream to another topic, I directly want to write enriched KStream to database. How should I proceed?

like image 593
Megha Avatar asked Oct 02 '17 11:10

Megha


1 Answers

You can implement a custom Processor that opens a DB connection and apply it via KStream#process(). Cf. https://docs.confluent.io/current/streams/developer-guide/dsl-api.html#applying-processors-and-transformers-processor-api-integration

Note, you will need to do sync writes into your DB to guard against data loss.

Thus, not writing back to a topic has multiple disadvantages:

  • reduced throughput because of sync writes
  • you cannot use exactly-once semantics
  • coupling your application with the database (if DB goes down, your app goes down, too, as it can't write its results anymore)

Therefore, it's recommended to write the results back into a topic and use Connect API to get the data into your database.

like image 148
Matthias J. Sax Avatar answered Nov 15 '22 07:11

Matthias J. Sax