I have a Kafka Streams application waiting for records to be published on the topic user_activity. It will receive JSON data, and depending on the value associated with a key I want to push that stream into different topics.
This is my Streams app code:
KStream<String, String> source_user_activity = builder.stream("user_activity");

source_user_activity.flatMapValues(new ValueMapper<String, Iterable<String>>() {
    @Override
    public Iterable<String> apply(String value) {
        System.out.println("value: " + value);
        ArrayList<String> keywords = new ArrayList<String>();
        try {
            JSONObject send = new JSONObject();
            JSONObject received = new JSONObject(value);
            send.put("current_date", getCurrentDate().toString());
            send.put("activity_time", received.get("CreationTime"));
            send.put("user_id", received.get("UserId"));
            send.put("operation_type", received.get("Operation"));
            send.put("app_name", received.get("Workload"));
            keywords.add(send.toString());
            // apply regex to value and for each match add it to keywords
        } catch (Exception e) {
            // TODO: handle exception
            System.err.println("Unable to convert to json");
            e.printStackTrace();
        }
        return keywords;
    }
}).to("user_activity_by_date");
In this code, I want to check the operation type and, depending on it, push the records to the relevant topic.
How can I achieve this?
EDIT:
I have updated my code to this:
final StreamsBuilder builder = new StreamsBuilder();

KStream<String, String> source_o365_user_activity = builder.stream("o365_user_activity");

KStream<String, String>[] branches = source_o365_user_activity.branch(
    (key, value) -> (value.contains("Operation\":\"SharingSet") && value.contains("ItemType\":\"File")),
    (key, value) -> (value.contains("Operation\":\"AddedToSecureLink") && value.contains("ItemType\":\"File")),
    (key, value) -> true
);

branches[0].to("o365_sharing_set_by_date");
branches[1].to("o365_added_to_secure_link_by_date");
branches[2].to("o365_user_activity_by_date");
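Note: in newer Kafka Streams releases (2.8+), KStream#branch is deprecated in favor of split(), which returns named branches instead of an array. A minimal sketch of the same routing with that API (the Named prefix and the use of Branched.withConsumer are my choices, not from the original code):

import org.apache.kafka.streams.StreamsBuilder;
import org.apache.kafka.streams.kstream.Branched;
import org.apache.kafka.streams.kstream.KStream;
import org.apache.kafka.streams.kstream.Named;

StreamsBuilder builder = new StreamsBuilder();
KStream<String, String> source = builder.stream("o365_user_activity");

// Each branch is terminated with a consumer that writes matching records
// to its target topic; records matching no predicate hit the default branch.
source.split(Named.as("o365-"))
    .branch(
        (key, value) -> value.contains("Operation\":\"SharingSet") && value.contains("ItemType\":\"File"),
        Branched.withConsumer(ks -> ks.to("o365_sharing_set_by_date")))
    .branch(
        (key, value) -> value.contains("Operation\":\"AddedToSecureLink") && value.contains("ItemType\":\"File"),
        Branched.withConsumer(ks -> ks.to("o365_added_to_secure_link_by_date")))
    .defaultBranch(Branched.withConsumer(ks -> ks.to("o365_user_activity_by_date")));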
You can use the branch method to split your stream. It takes a list of predicates and routes each record to the stream of the first predicate it matches.
The code below is taken from kafka-streams-examples:
KStream<String, OrderValue>[] forks = ordersWithTotals.branch(
    (id, orderValue) -> orderValue.getValue() >= FRAUD_LIMIT,
    (id, orderValue) -> orderValue.getValue() < FRAUD_LIMIT);

forks[0].mapValues(
        orderValue -> new OrderValidation(orderValue.getOrder().getId(), FRAUD_CHECK, FAIL))
    .to(ORDER_VALIDATIONS.name(), Produced
        .with(ORDER_VALIDATIONS.keySerde(), ORDER_VALIDATIONS.valueSerde()));

forks[1].mapValues(
        orderValue -> new OrderValidation(orderValue.getOrder().getId(), FRAUD_CHECK, PASS))
    .to(ORDER_VALIDATIONS.name(), Produced
        .with(ORDER_VALIDATIONS.keySerde(), ORDER_VALIDATIONS.valueSerde()));
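On a side note, matching substrings of the raw JSON (value.contains(...)) is brittle; a predicate can parse the value and compare the actual fields instead. A sketch using the org.json parsing already present in the question (the isOperation helper is hypothetical, a name of my own):

import org.apache.kafka.streams.kstream.KStream;
import org.apache.kafka.streams.kstream.Predicate;
import org.json.JSONObject;

// Hypothetical helper: parse each record once and compare the Operation and
// ItemType fields directly instead of substring-matching the raw JSON string.
static Predicate<String, String> isOperation(String operation) {
    return (key, value) -> {
        try {
            JSONObject json = new JSONObject(value);
            return operation.equals(json.optString("Operation"))
                    && "File".equals(json.optString("ItemType"));
        } catch (Exception e) {
            return false; // malformed records fall through to the catch-all branch
        }
    };
}

// Used with the branch() call from the edited code above:
KStream<String, String>[] branches = source_o365_user_activity.branch(
    isOperation("SharingSet"),
    isOperation("AddedToSecureLink"),
    (key, value) -> true);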