How to output one data stream to different outputs depending on the data?

Tags:

In Apache Flink I have a stream of tuples. Let's assume a really simple Tuple1<String>. The tuple can have an arbitrary value in it's value field (e.g. 'P1', 'P2', etc.). The set of possible values is finite but I don't know the full set beforehand (so there could be a 'P362'). I want to write that tuple to a certain output location depending on the value inside of the tuple. So e.g. I would like to have the following file structure:

/output/P1
/output/P2

In the documentation I only found possibilities to write to locations that I know beforehand (e.g. stream.writeCsv("/output/somewhere")), but no way of letting the contents of the data decide where the data is actually ending up.

I read about output splitting in the documentation but this doesn't seem to provide a way to redirect the output to different destinations the way I would like to have it (or I just don't understand how this would work).

Can this be done with the Flink API, if so, how? If not, is there maybe a third party library that can do it or would I have to build such a thing on my own?

865

asked Oct 29 '15 12:10

Jan Thomä

1 Answers

You can implement a custom sink. Inherit from one of both:

org.apache.flink.streaming.api.functions.sink.SinkFunction
org.apache.flink.streaming.api.functions.sink.RichSinkFunction

In your program use:

stream.addSink(SinkFunction<T> sinkFunction);

instead of stream.writeCsv("/output/somewhere").

191

answered Nov 15 '22 18:11

Matthias J. Sax

Related questions
                            
                                Is there a Many to Many Collection in Java using Generics (Domain Model, not Persistence Layer)?
                            
                                What would be different in Java if Enum declaration didn't have the recursive part
                            
                                Can't see my own application methods in Java VisualVM
                            
                                Specify foreign key constraint name when using Map and @ElementCollection with Hibernate
                            
                                Is it possible to use multiple ehcache.xml (in different projects, same war)?
                            
                                Is java.io.BufferedOutputStream safe to use?
                            
                                Zoom JPanel in Java Swing
                            
                                Get skype contact list and status in android
                            
                                Coldfusion 10 slower when using Java 1.7 compared to 1.6
                            
                                Android Keystore getEntry() and generateKeyPair() throw Exceptions sometimes
                            
                                Incompatible types inferred type does not conform to equality constraint(s)
                            
                                How to set enum attribute by value instead of name in Android Layout?
                            
                                Did we always have to register to download the Java 5 JDK, or is this new Oracle fun?
                            
                                How to avoid making defensive copies of ByteBuffer?
                            
                                Hibernate, JDBC and Java performance on medium and big result set
                            
                                Implementing Bitcoin and java.util.Currency
                            
                                JAXB Marshalling a variable list of elements with the same name
                            
                                Yarn MapReduce Job Issue - AM Container launch error in Hadoop 2.3.0
                            
                                Which is the best way to create file and write to it in Java [closed]
                            
                                How to make a ArrayList using Javassist

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to output one data stream to different outputs depending on the data?

Tags:

java

apache-flink

flink-streaming

Jan Thomä

People also ask

1 Answers

Matthias J. Sax

Recent Activity

Donate For Us