Find k most occurring elements in an integer array

Tags:

java

arrays

algorithm

Given an array A with possible duplicate entries, find the k entries that occur most frequently.

My approach:

Create a min-heap holding the k most frequent elements seen so far, ordered by frequency, so the root is the least frequent of those k. Also create a HashMap that tracks the count of every element and whether or not it is currently in the heap.

When reading a new integer:

- Check whether it is in the HashMap. If it is, increment its count there; otherwise add it with a count of 1.
- If it is also in the heap, increment its count there as well and re-heapify.
- If it is not in the heap, compare its count with the count of the root element; if it is larger, remove the root and insert this element, then re-heapify.

At the end, return the contents of the min-heap as the desired output.

class Wrapper {
    boolean inHeap; // true while this element is one of the entries in the heap
    int count;      // number of occurrences seen so far
}

This would take O(n + k) space and O(n log k) time. Is there a better way to do this in terms of space and/or time complexity?
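For reference, here is a minimal Java sketch of a simplified two-pass variant of the approach above. It is not the exact streaming scheme described: it counts everything first and only then feeds the (element, count) pairs through a min-heap capped at k entries, which avoids updating counts inside the heap. The class name `TopKHeap` and method `topK` are made up for illustration.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.PriorityQueue;

class TopKHeap {
    // Returns the k most frequent values, least frequent first.
    // Ties between equal counts are broken arbitrarily.
    static List<Integer> topK(int[] a, int k) {
        // Pass 1: count occurrences of every value.
        Map<Integer, Integer> counts = new HashMap<>();
        for (int x : a) {
            counts.merge(x, 1, Integer::sum);
        }

        // Pass 2: min-heap keyed on count, so the root is always the
        // weakest of the current candidates.
        PriorityQueue<Map.Entry<Integer, Integer>> heap =
                new PriorityQueue<>((p, q) -> Integer.compare(p.getValue(), q.getValue()));
        for (Map.Entry<Integer, Integer> e : counts.entrySet()) {
            heap.offer(e);
            if (heap.size() > k) {
                heap.poll(); // evict the least frequent candidate
            }
        }

        // Drain the heap; entries come out in increasing order of count.
        List<Integer> result = new ArrayList<>();
        while (!heap.isEmpty()) {
            result.add(heap.poll().getKey());
        }
        return result;
    }
}
```

With m distinct values this runs in O(n + m log k) time and O(m + k) space. For example, `TopKHeap.topK(new int[]{1, 1, 1, 2, 2, 3}, 2)` returns `[2, 1]`.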


asked Jun 04 '14 01:06

m0nish

1 Answer

We can say the space complexity of your approach is O(n): the HashMap holds at most n entries and the heap at most k ≤ n, so you can never use more than O(2n) = O(n) memory.

Skip the heap and just create the HashMap.

After you've created the HashMap, you can iterate through it and put all the (element, count) pairs in an array.

Then you can run a selection algorithm such as quickselect on the array, comparing by count, to get the k-th most frequent element, and the first k elements from there (the extension to extract the first k elements via quickselect is fairly trivial, or you can just iterate through the array again to get them).

Then you sort the k elements, if required.

The expected running time would be O(n), or O(n + k log k) if sorting is required.

The space complexity would be O(n).
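The steps above could be sketched in Java roughly as follows. This is a minimal illustration under my own assumptions: the names `TopKQuickselect`, `topK`, `select` and `partition` are made up, the pivot is chosen at random, and the result comes back unsorted (sort the k entries afterwards if order matters).

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ThreadLocalRandom;

class TopKQuickselect {
    // Returns the k most frequent values in no particular order.
    // Ties between equal counts are broken arbitrarily.
    static List<Integer> topK(int[] a, int k) {
        Map<Integer, Integer> counts = new HashMap<>();
        for (int x : a) {
            counts.merge(x, 1, Integer::sum);
        }

        // Copy the map into an array of {value, count} pairs.
        int[][] items = new int[counts.size()][];
        int i = 0;
        for (Map.Entry<Integer, Integer> e : counts.entrySet()) {
            items[i++] = new int[] {e.getKey(), e.getValue()};
        }

        k = Math.min(k, items.length);
        List<Integer> result = new ArrayList<>();
        if (k <= 0) {
            return result;
        }
        select(items, k - 1);
        for (int j = 0; j < k; j++) {
            result.add(items[j][0]);
        }
        return result;
    }

    // Quickselect by descending count: on return, items[target] is in its
    // final position and every entry before it has a count at least as high.
    private static void select(int[][] items, int target) {
        int lo = 0, hi = items.length - 1;
        while (lo < hi) {
            int p = partition(items, lo, hi);
            if (p == target) {
                return;
            } else if (p < target) {
                lo = p + 1;
            } else {
                hi = p - 1;
            }
        }
    }

    // Lomuto partition with a random pivot; higher counts go to the left.
    private static int partition(int[][] items, int lo, int hi) {
        swap(items, ThreadLocalRandom.current().nextInt(lo, hi + 1), hi);
        int pivot = items[hi][1];
        int store = lo;
        for (int j = lo; j < hi; j++) {
            if (items[j][1] > pivot) {
                swap(items, store++, j);
            }
        }
        swap(items, store, hi);
        return store;
    }

    private static void swap(int[][] items, int x, int y) {
        int[] t = items[x];
        items[x] = items[y];
        items[y] = t;
    }
}
```

One caveat with this sketch: a two-way partition like the one here slows down when many elements share the same count (for example, when almost every value occurs once), so a three-way partition would be a sturdier choice if that case matters.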


answered Nov 04 '22 01:11

Bernhard Barker
