I have a single source of data items and I want to share that Flux with multiple downstream streams.
It is very similar to the example in the reference guide, but I feel that example cheats by calling .connect() manually. Specifically, I do not know how many downstream subscribers there will be, and I have no control point from which to call .connect() "at the end".
Consumers should be able to subscribe without immediately triggering the pulling of data; then, at some point in the future when the data is actually needed, they will pull as necessary.
Additionally, the source is sensitive to consumption, so it cannot be re-fetched. On top of that, it is going to be very big, so buffering and replaying it is not an option.
Ideally, on top of all that, the whole thing happens in one thread, so no concurrency or waiting.
(Giving a very small wait time for subscribers to join is not desirable)
I was able to achieve nearly the desired effect for Monos (single end result values):
```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.ListIterator;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Function;
import java.util.function.Predicate;

import org.assertj.core.api.SoftAssertions;
import org.junit.Test;

import reactor.core.publisher.ConnectableFlux;
import reactor.core.publisher.Flux;
import reactor.core.publisher.Mono;
import reactor.util.function.Tuple2;
import reactor.util.function.Tuples;

import static org.assertj.core.api.Assertions.assertThat;
import static org.assertj.core.api.Assumptions.assumeThat;

public class CoConsumptionTest {

    @Test
    public void convenientCoConsumption() {
        // List used just for the example:
        List<Tuple2<String, String>> source = Arrays.asList(
                Tuples.of("a", "1"), Tuples.of("b", "1"), Tuples.of("c", "1"),
                Tuples.of("a", "2"), Tuples.of("b", "2"), Tuples.of("c", "2"),
                Tuples.of("a", "3"), Tuples.of("b", "3"), Tuples.of("c", "3")
        );

        // Source which is sensitive to consumption
        AtomicInteger consumedCount = new AtomicInteger(0);
        Iterator<Tuple2<String, String>> statefulIterator = new Iterator<Tuple2<String, String>>() {
            private ListIterator<Tuple2<String, String>> sourceIterator = source.listIterator();

            @Override
            public boolean hasNext() {
                return sourceIterator.hasNext();
            }

            @Override
            public Tuple2<String, String> next() {
                Tuple2<String, String> e = sourceIterator.next();
                consumedCount.incrementAndGet();
                System.out.println("Audit: " + e);
                return e;
            }
        };

        // Logic in the service:
        Flux<Tuple2<String, String>> f = Flux.fromIterable(() -> statefulIterator);
        ConnectableFlux<Tuple2<String, String>> co = f.publish();
        Function<Predicate<Tuple2<String, String>>, Mono<Tuple2<String, String>>> findOne =
                (highlySelectivePredicate) ->
                        co.filter(highlySelectivePredicate)
                          .next()          // gives us a Mono
                          .toProcessor()   // makes it eagerly subscribe and demand from the upstream, so it won't miss emissions
                          .doOnSubscribe(s -> co.connect()); // when an actual user consumer subscribes

        // Subscribing (outside the service)
        assumeThat(consumedCount).hasValue(0);
        Mono<Tuple2<String, String>> a2 = findOne.apply(select("a", "2"));
        Mono<Tuple2<String, String>> b1 = findOne.apply(select("b", "1"));
        Mono<Tuple2<String, String>> c1 = findOne.apply(select("c", "1"));
        assertThat(consumedCount).hasValue(0);

        // Data is needed
        SoftAssertions softly = new SoftAssertions();
        assertThat(a2.block()).isEqualTo(Tuples.of("a", "2"));
        softly.assertThat(consumedCount).hasValue(4);
        assertThat(b1.block()).isEqualTo(Tuples.of("b", "1"));
        softly.assertThat(consumedCount).hasValue(4);
        assertThat(c1.block()).isEqualTo(Tuples.of("c", "1"));
        softly.assertThat(consumedCount).hasValue(4);
        softly.assertAll();
    }

    private static Predicate<Tuple2<String, String>> select(String t1, String t2) {
        return e -> e.getT1().equals(t1) && e.getT2().equals(t2);
    }
}
```
Question: I want to know how to achieve this for Flux results, i.e. for multiple values after the filtering is applied, not just the first/next. (Still demanding only as much as necessary)
(I tried naively replacing .toProcessor() with .publish().autoConnect(0), but did not succeed.)
Edit 1: While buffering of the source is not allowed, the filters that come as parameters are expected to be highly selective, so buffering after the filtering is okay.
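Given that allowance, one way the Flux case might be sketched is to cache() only the post-filter elements, keep the cache eagerly subscribed so it cannot miss emissions, and connect the source lazily on the first real subscription. The findAll helper and the toy string source below are hypothetical stand-ins, not a confirmed solution:

```java
import java.util.List;
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.function.Function;
import java.util.function.Predicate;

import reactor.core.publisher.ConnectableFlux;
import reactor.core.publisher.Flux;

public class FindAllSketch {

    // Returns both filtered result lists after consuming the source once.
    public static List<List<String>> run() {
        Flux<String> source = Flux.just("a1", "b1", "a2", "b2")
                .doOnNext(e -> System.out.println("Audit: " + e));
        ConnectableFlux<String> co = source.publish();
        AtomicBoolean connected = new AtomicBoolean();

        // Hypothetical helper: buffers only the (few) elements that survive
        // the highly selective filter, never the whole source.
        Function<Predicate<String>, Flux<String>> findAll = p -> {
            Flux<String> filtered = co.filter(p).cache();
            // Subscribe the cache eagerly so it cannot miss emissions...
            filtered.subscribe();
            // ...but connect the source only when a real consumer shows up.
            return filtered.doOnSubscribe(s -> {
                if (connected.compareAndSet(false, true)) {
                    co.connect();
                }
            });
        };

        Flux<String> as = findAll.apply(e -> e.startsWith("a"));
        Flux<String> bs = findAll.apply(e -> e.startsWith("b"));

        List<String> aList = as.collectList().block(); // triggers the single consumption
        List<String> bList = bs.collectList().block(); // replayed from the small cache
        return List.of(aList, bList);
    }

    public static void main(String[] args) {
        System.out.println(run()); // [[a1, a2], [b1, b2]]
    }
}
```

The AtomicBoolean guard avoids calling co.connect() more than once, since a second connect() on an already-terminated publish() would resubscribe the sensitive source.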
Edit 2: Coming back to this after a while, I tried my posted example on a newer version of Reactor and it actually works:
io.projectreactor:reactor-bom:Californium-SR8 (io.projectreactor:reactor-core:3.2.9.RELEASE)
I don't like giving a "non-answer" style answer, but I think at least one of your requirements has to give here. From your question, the requirements seem to be:

- the number of downstream subscribers is not known in advance;
- the source cannot be re-fetched;
- the (very big) source cannot be buffered or replayed;
- no waiting for subscribers to join.

Take the case where one subscriber requests data from a Flux, the first few elements in that Flux are consumed, and then eventually another subscriber shows up at an arbitrary point in the future that wants that same data. With the above requirements, that's impossible - you'll either have to go and get the data again, or have it saved somewhere, and you've ruled both those options out.
However, if you're prepared to relax those requirements a bit, then there are a few potential options:
- If you can work out the number of subscribers you'll end up with somehow, then you can use autoConnect(n) to automatically connect to a ConnectableFlux after that number of subscriptions has been made.
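A minimal sketch of autoConnect(n), with a toy Flux.range standing in for the real single-use source:

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;

import reactor.core.publisher.Flux;

public class AutoConnectExample {

    public static List<Integer> firstSubscriberValues() {
        Flux<Integer> source = Flux.range(1, 3)
                .doOnNext(i -> System.out.println("Audit: " + i));

        // Nothing is pulled from the source until the 2nd subscriber arrives.
        Flux<Integer> shared = source.publish().autoConnect(2);

        List<Integer> first = new CopyOnWriteArrayList<>();
        shared.subscribe(first::add);          // no emission yet
        System.out.println("after 1st subscriber: " + first); // []

        List<Integer> second = new CopyOnWriteArrayList<>();
        shared.subscribe(second::add);         // 2nd subscription triggers connect()
        return first;                          // both lists now hold [1, 2, 3]
    }

    public static void main(String[] args) {
        System.out.println(firstSubscriberValues()); // [1, 2, 3]
    }
}
```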
- If you can allow elements to be dropped, then you can just call share() on the original Flux to get it to auto-connect on the first subscription; future subscribers will then have previous elements dropped. This is perhaps one of the more promising strategies, since you say: "no concurrency or waiting. (Giving a very small wait time for subscribers to join is not desirable)"
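A small sketch of that dropping behaviour, using a toy ticking source (the timings are illustrative only and assume modest scheduler jitter):

```java
import java.time.Duration;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;

import reactor.core.publisher.Flux;

public class ShareExample {

    public static List<List<Long>> run() throws InterruptedException {
        // A finite ticking source; share() auto-connects on the 1st subscription.
        Flux<Long> shared = Flux.interval(Duration.ofMillis(50)).take(6).share();

        List<Long> early = new CopyOnWriteArrayList<>();
        shared.subscribe(early::add);     // connects; consumption starts now

        Thread.sleep(130);                // let roughly the first two ticks pass

        List<Long> late = new CopyOnWriteArrayList<>();
        shared.subscribe(late::add);      // earlier ticks are gone for this subscriber

        Thread.sleep(400);                // wait for the remaining ticks
        return List.of(early, late);
    }

    public static void main(String[] args) throws InterruptedException {
        List<List<Long>> lists = run();
        System.out.println("early: " + lists.get(0)); // e.g. [0, 1, 2, 3, 4, 5]
        System.out.println("late:  " + lists.get(1)); // e.g. [2, 3, 4, 5] - ticks 0 and 1 were dropped
    }
}
```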
- You can turn the Flux into a hot source that caches all emitted elements for a certain time period, e.g. with cache(Duration). This means that you can, at the cost of some amount of memory (but without buffering the whole stream), give subscribers a small window in which they can subscribe and still receive all the data.
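Assuming the time-based cache(Duration) variant is what's meant here, a sketch with a toy source; the second read within the TTL replays from memory rather than re-consuming the source:

```java
import java.time.Duration;
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;

import reactor.core.publisher.Flux;

public class CacheTtlExample {

    public static int consumedAfterTwoReads() {
        AtomicInteger consumed = new AtomicInteger();
        Flux<Integer> cached = Flux.range(1, 3)
                .doOnNext(i -> consumed.incrementAndGet())
                .cache(Duration.ofSeconds(5)); // keep elements around for 5s

        List<Integer> first = cached.collectList().block();  // pulls the source once
        List<Integer> second = cached.collectList().block(); // replayed from the cache
        System.out.println(first + " / " + second); // [1, 2, 3] / [1, 2, 3]
        return consumed.get(); // 3 - the source was consumed exactly once
    }

    public static void main(String[] args) {
        System.out.println("consumed: " + consumedAfterTwoReads());
    }
}
```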
- Similarly to the above, you can use another variant of the cache() method to cache only a known number of elements. If you know you can safely fit n elements into memory, but no more, then this could give you the maximum time possible for subscribers to safely connect.
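A sketch of the bounded-history variant, cache(n), again with a toy source; a late subscriber only sees the cached tail:

```java
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;

import reactor.core.publisher.Flux;

public class CacheHistoryExample {

    public static List<Integer> lateSubscriberView() {
        AtomicInteger consumed = new AtomicInteger();
        Flux<Integer> cached = Flux.range(1, 5)
                .doOnNext(i -> consumed.incrementAndGet())
                .cache(2); // keep only the last 2 elements in memory

        // The first subscriber connects the source and sees everything, live.
        System.out.println(cached.collectList().block()); // [1, 2, 3, 4, 5]

        // A late subscriber only gets the cached tail - earlier data is gone.
        List<Integer> late = cached.collectList().block();
        System.out.println("consumed: " + consumed.get()); // 5 - source consumed once
        return late;
    }

    public static void main(String[] args) {
        System.out.println(lateSubscriberView()); // [4, 5]
    }
}
```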