Java 8 Stream API - Selecting only values after Collectors.groupingBy(..)

Tags: java, java-stream, grouping

Say I have a collection of Student objects, each consisting of a name (String), an age (int) and a city (String).
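The question does not show the Student class itself. Here is a minimal sketch of what it might look like, with getter names matching those used in the snippets in this thread (getName, getAge, getCity). The class layout and the sample data are assumptions made purely for illustration:

```java
import java.util.Arrays;
import java.util.List;

public class StudentSample {
    // Hypothetical Student type; the question only names the three fields.
    public static final class Student {
        private final String name;
        private final int age;
        private final String city;

        public Student(String name, int age, String city) {
            this.name = name;
            this.age = age;
            this.city = city;
        }

        public String getName() { return name; }
        public int getAge() { return age; }
        public String getCity() { return city; }
    }

    // Made-up sample data, not taken from the question.
    public static List<Student> sampleStudents() {
        return Arrays.asList(
            new Student("Alice", 21, "Paris"),
            new Student("Bob", 25, "Paris"),
            new Student("Carol", 19, "Berlin"),
            new Student("Dave", 30, "Berlin"),
            new Student("Eve", 22, "Madrid"));
    }

    public static void main(String[] args) {
        for (Student s : sampleStudents()) {
            System.out.println(s.getName() + ", " + s.getAge() + ", " + s.getCity());
        }
    }
}
```

With this data, the SQL-like query described in the question would be expected to yield 25 for Paris, 30 for Berlin and 22 for Madrid.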

I am trying to use Java's Stream API to achieve the following SQL-like behavior:

SELECT MAX(age)
FROM Students
GROUP BY city

Now, I found two different ways to do so:

final List<Integer> variation1 =
            students.stream()
                    .collect(Collectors.groupingBy(Student::getCity, Collectors.maxBy((s1, s2) -> s1.getAge() - s2.getAge())))
                    .values()
                    .stream()
                    .filter(Optional::isPresent)
                    .map(Optional::get)
                    .map(Student::getAge)
                    .collect(Collectors.toList());

And the other one:

final Collection<Integer> variation2 =
            students.stream()
                    .collect(Collectors.groupingBy(Student::getCity,
                            Collectors.collectingAndThen(Collectors.maxBy((s1, s2) -> s1.getAge() - s2.getAge()),
                                    optional -> optional.get().getAge())))
                    .values();

In both variations, one has to call .values() ... and filter out the empty groups returned from the collector.

Is there any other way to achieve this required behavior?

These methods remind me of OVER ... PARTITION BY statements in SQL...

Thanks

Edit: All the answers below were really interesting, but unfortunately this is not what I was looking for, since what I am trying to get is just the values. I don't need the keys, just the values.

asked Feb 29 '16 22:02

Ghost93

3 Answers

The second approach calls get() on an Optional. This is usually a bad idea, as you don't know whether the Optional will be empty (use the orElse(), orElseGet() or orElseThrow() methods instead). While you might argue that in this case there will always be a value, since you generate the values from the student list itself, this is something to keep in mind.

Based on that, you might turn variation 2 into the following (assuming static imports for the Collectors methods and Comparator.naturalOrder):

final Collection<Integer> variation2 =
     students.stream()
             .collect(collectingAndThen(groupingBy(Student::getCity,
                                                   collectingAndThen(
                                                      mapping(Student::getAge, maxBy(naturalOrder())),
                                                      Optional::get)), 
                                        Map::values));

That really starts to become difficult to read, though, so I'd probably use variation 1:

final List<Integer> variation1 =
        students.stream()
            .collect(groupingBy(Student::getCity,
                                mapping(Student::getAge, maxBy(naturalOrder()))))
            .values()
            .stream()
            .map(Optional::get)
            .collect(toList());
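The advice above about avoiding a bare get() can be applied to variation 1 by swapping in orElseThrow with an explicit exception. The following is only a sketch: the Student class, the sample data and the exception message are assumptions made for illustration, and the supplier form of orElseThrow is used so that it also compiles on Java 8.

```java
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;

public class MaxAgeOrElseThrow {
    // Hypothetical Student type with only the getters needed here.
    public static final class Student {
        private final String city;
        private final int age;
        public Student(String city, int age) { this.city = city; this.age = age; }
        public String getCity() { return city; }
        public int getAge() { return age; }
    }

    public static List<Integer> maxAges(List<Student> students) {
        return students.stream()
            .collect(Collectors.groupingBy(Student::getCity,
                Collectors.mapping(Student::getAge,
                    Collectors.maxBy(Comparator.<Integer>naturalOrder()))))
            .values()
            .stream()
            // Fail with a descriptive error instead of a bare get();
            // groupingBy never creates empty groups, so this should not trigger.
            .map(max -> max.orElseThrow(
                () -> new IllegalStateException("group without students")))
            .collect(Collectors.toList());
    }

    // Made-up sample data.
    public static List<Student> sample() {
        return Arrays.asList(
            new Student("Paris", 21), new Student("Paris", 25),
            new Student("Berlin", 19), new Student("Berlin", 30),
            new Student("Madrid", 22));
    }

    public static void main(String[] args) {
        // The order of the values is unspecified because the groups live in a HashMap.
        System.out.println(maxAges(sample()));
    }
}
```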

answered Sep 28 '22 09:09

Alexis C.

Do not always stick with groupingBy. Sometimes toMap is the thing you need:

Collection<Integer> result = students.stream()
    .collect(Collectors.toMap(Student::getCity, Student::getAge, Integer::max))
    .values();

Here you just create a Map whose keys are cities and whose values are ages. When several students share the same city, the merge function is applied, which here simply keeps the larger age. It's faster and cleaner.
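To make the role of the merge function concrete, here is a small self-contained sketch. The Student class and the sample data are assumptions, and the four-argument toMap overload with TreeMap::new is used here only to get a predictable key order when printing. The three-argument form from the answer works the same way with an unordered map.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

public class ToMapMaxAge {
    // Hypothetical Student type with only the getters needed here.
    public static final class Student {
        private final String city;
        private final int age;
        public Student(String city, int age) { this.city = city; this.age = age; }
        public String getCity() { return city; }
        public int getAge() { return age; }
    }

    public static Map<String, Integer> maxAgeByCity(List<Student> students) {
        return students.stream()
            .collect(Collectors.toMap(
                Student::getCity,  // key: the city
                Student::getAge,   // value: the age of a student in that city
                Integer::max,      // merge: on a repeated city, keep the larger age
                TreeMap::new));    // sorted map, chosen only for readable output
    }

    // Made-up sample data.
    public static List<Student> sample() {
        return Arrays.asList(
            new Student("Paris", 21), new Student("Paris", 25),
            new Student("Berlin", 19), new Student("Berlin", 30),
            new Student("Madrid", 22));
    }

    public static void main(String[] args) {
        Map<String, Integer> byCity = maxAgeByCity(sample());
        System.out.println(byCity);          // {Berlin=30, Madrid=22, Paris=25}
        System.out.println(byCity.values()); // [30, 22, 25]
    }
}
```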

answered Sep 28 '22 09:09

Tagir Valeev

In addition to Tagir’s great answer using toMap instead of groupingBy, here is a short solution if you want to stick with groupingBy:

Collection<Integer> result = students.stream()
    .collect(Collectors.groupingBy(Student::getCity,
                 Collectors.reducing(-1, Student::getAge, Integer::max)))
    .values();

Note that this three-argument reducing collector already performs a mapping operation, so we don’t need to nest it with a mapping collector. Furthermore, providing an identity value avoids dealing with Optional. Since ages are never negative, -1 is a sufficient identity value, and since a group will always have at least one element, the identity value will never show up as a result.

Still, I think Tagir’s toMap based solution is preferable in this scenario.
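As a rough check of the note on the three-argument reducing collector, the sketch below compares it with the mapping plus maxBy formulation and confirms that the identity value -1 does not leak into the result. The Student class and the sample data are assumptions made for illustration.

```java
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.Optional;
import java.util.stream.Collectors;

public class ReducingVersusMapping {
    // Hypothetical Student type with only the getters needed here.
    public static final class Student {
        private final String city;
        private final int age;
        public Student(String city, int age) { this.city = city; this.age = age; }
        public String getCity() { return city; }
        public int getAge() { return age; }
    }

    // Three-argument reducing: identity, mapper and operator in one collector.
    public static Map<String, Integer> viaReducing(List<Student> students) {
        return students.stream()
            .collect(Collectors.groupingBy(Student::getCity,
                Collectors.reducing(-1, Student::getAge, Integer::max)));
    }

    // Same grouping with mapping + maxBy, which has to unwrap an Optional.
    public static Map<String, Integer> viaMapping(List<Student> students) {
        return students.stream()
            .collect(Collectors.groupingBy(Student::getCity,
                Collectors.collectingAndThen(
                    Collectors.mapping(Student::getAge,
                        Collectors.maxBy(Comparator.<Integer>naturalOrder())),
                    Optional::get)));
    }

    // Made-up sample data.
    public static List<Student> sample() {
        return Arrays.asList(
            new Student("Paris", 21), new Student("Paris", 25),
            new Student("Berlin", 19), new Student("Berlin", 30),
            new Student("Madrid", 22));
    }

    public static void main(String[] args) {
        Map<String, Integer> reduced = viaReducing(sample());
        System.out.println(reduced.equals(viaMapping(sample()))); // true
        System.out.println(reduced.containsValue(-1));            // false
    }
}
```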

The groupingBy based solution becomes more interesting when you want to get the actual students having the maximum age, e.g.

Collection<Student> result = students.stream().collect(
   Collectors.groupingBy(Student::getCity, Collectors.reducing(null, BinaryOperator.maxBy(
     Comparator.nullsFirst(Comparator.comparingInt(Student::getAge)))))
).values();

Well, actually, even this can be expressed using the toMap collector:

Collection<Student> result = students.stream().collect(
    Collectors.toMap(Student::getCity, Function.identity(),
        BinaryOperator.maxBy(Comparator.comparingInt(Student::getAge)))
).values();

You can express almost everything with both collectors, but groupingBy has the advantage on its side when you want to perform a mutable reduction on the values.
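One example of such a mutable reduction is collecting all names per city into a list, where groupingBy with a downstream collector reads naturally. This is a sketch under assumptions: the Student class and the sample data are made up, and TreeMap::new is passed as the map factory only to get a predictable key order when printing.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

public class GroupNamesByCity {
    // Hypothetical Student type with only the getters needed here.
    public static final class Student {
        private final String name;
        private final String city;
        public Student(String name, String city) { this.name = name; this.city = city; }
        public String getName() { return name; }
        public String getCity() { return city; }
    }

    public static Map<String, List<String>> namesByCity(List<Student> students) {
        return students.stream()
            .collect(Collectors.groupingBy(
                Student::getCity,
                TreeMap::new, // sorted map, chosen only for readable output
                // Downstream mutable reduction: accumulate names into one list per city.
                Collectors.mapping(Student::getName, Collectors.toList())));
    }

    // Made-up sample data.
    public static List<Student> sample() {
        return Arrays.asList(
            new Student("Alice", "Paris"), new Student("Bob", "Paris"),
            new Student("Carol", "Berlin"), new Student("Dave", "Berlin"),
            new Student("Eve", "Madrid"));
    }

    public static void main(String[] args) {
        // {Berlin=[Carol, Dave], Madrid=[Eve], Paris=[Alice, Bob]}
        System.out.println(namesByCity(sample()));
    }
}
```

Doing the same with toMap would require creating a list per element and merging lists pairwise, which is why groupingBy tends to be the better fit for this kind of accumulation.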

answered Sep 28 '22 08:09

Holger
