Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Remove sequential duplicates from Ruby array

Tags:

ruby

Let's say I have the following array and I would like to get rid of contiguous duplicates:

arr = [1,1,1,4,4,4,3,3,3,3,5,5,5,1,1,1]

I would like to get the following:

=> [1,4,3,5,1]

It would be great if there's something simpler and more efficient than my solutions (or variants thereof):

(arr + [nil]).each_cons(2).collect { |i| i[0] != i[1] ? i[0] : nil }.compact

or

(arr + [nil]).each_cons(2).each_with_object([]) { 
   |i, memo| memo << i[0] unless i[0] == i[1] 
 }

EDIT: It looks like @ArupRakshit's solution below is very simple. I'm still looking for better efficiency than my solution.

EDIT:

I'll be benchmarking responses as they come:

require 'fruity'
arr = 10000.times.collect { [rand(5)] * (rand(4) + 2) }.flatten

compare do
  abdo { (arr + [nil]).each_cons(2).collect { 
    |i| i[0] != i[1] ? i[0] : nil }.compact 
  }
  abdo2 { 
          (arr + [nil]).each_cons(2).each_with_object([]) { 
           |i, memo| memo << i[0] unless i[0] == i[1] 
          }
  }
  arup { arr.chunk(&:to_i).map(&:first) }
  arupv2 { arr.join.squeeze.chars.map(&:to_i) }
  agis {
    i = 1
    a = [arr.first]

    while i < arr.size
      a << arr[i] if arr[i] != arr[i-1]
      i += 1
     end
    a
  }
  arupv3 { arr.each_with_object([]) { |el, a| a << el if a.last != el } }
end

Benchmark results:

agis is faster than arupv3 by 39.99999999999999% ± 10.0%
arupv3 is faster than abdo2 by 1.9x ± 0.1
abdo2 is faster than abdo by 10.000000000000009% ± 10.0%
abdo is faster than arup by 30.000000000000004% ± 10.0%
arup is faster than arupv2 by 30.000000000000004% ± 10.0%

If we use:

arr = 10000.times.collect { rand(4) + 1 } # less likelihood of repetition

We get:

agis is faster than arupv3 by 19.999999999999996% ± 10.0%
arupv3 is faster than abdo2 by 1.9x ± 0.1
abdo2 is similar to abdo
abdo is faster than arupv2 by 2.1x ± 0.1
arupv2 is similar to arup
like image 751
Abdo Avatar asked Feb 12 '14 14:02

Abdo


1 Answers

Do as below using Enumerable#chunk :

arr = [1,1,1,4,4,4,3,3,3,3,5,5,5,1,1,1]
arr.chunk { |e| e }.map(&:first)
# => [1, 4, 3, 5, 1]
# if you have only **Fixnum**, something magic
arr.chunk(&:to_i).map(&:first)
# => [1, 4, 3, 5, 1]

UPDATE

as per @abdo's comment, here is another choice :

arr.join.squeeze.chars.map(&:to_i)
# => [1, 4, 3, 5, 1]

another choice

arr.each_with_object([]) { |el, a| a << el if a.last != el }
like image 134
Arup Rakshit Avatar answered Sep 28 '22 14:09

Arup Rakshit