How to remove duplicated records\observations WITHOUT sorting in SAS？

Tags:

I wonder if there is a way to unduplicate records WITHOUT sorting?Sometimes, I want to keep original order and just want to remove duplicated records.

Is it possible?

BTW, below are what I know regarding unduplicating records, which does sorting in the end..

proc sql;
   create table yourdata_nodupe as
   select distinct *
   From abc;
quit;

proc sort data=YOURDATA nodupkey;    
    by var1 var2 var3 var4 var5;    
run;

611

asked Apr 18 '11 03:04

mj023119

1 Answers

You could use a hash object to keep track of which values have been seen as you pass through the data set. Only output when you encounter a key that hasn't been observed yet. This outputs in the order the data was observed in the input data set.

Here is an example using the input data set "sashelp.cars". The original data was in alphabetical order by Make so you can see that the output data set "nodupes" maintains that same order.

data nodupes (drop=rc);;
  length Make $13.;

  declare hash found_keys();
    found_keys.definekey('Make');
    found_keys.definedone();

  do while (not done);
    set sashelp.cars end=done;
    rc=found_keys.check();
    if rc^=0 then do;      
      rc=found_keys.add(); 
      output;              
    end;
  end;
  stop;
run;

proc print data=nodupes;run;

answered Sep 22 '22 19:09

cmjohns

Related questions
                            
                                Java - Sort Strings like Windows Explorer
                            
                                Can an array be grouped more efficiently than sorted?
                            
                                Add values of keys and sort it by occurrence of the keys in a list of dictionaries in Python
                            
                                How to count neighboring numbers in an array using Javascript?
                            
                                strcmp for python or how to sort substrings efficiently (without copy) when building a suffix array
                            
                                Does STL sort use swap or binary copy?
                            
                                Sorting an array of filenames containing strings with numbers
                            
                                JavaScript : Sorting an array
                            
                                Defining < for STL sort algorithm - operator overload, functor or standalone function?
                            
                                JQGrid Sorting - how to trigger onSortCol event
                            
                                Why use two different algorithm for sorting arrays?
                            
                                Sorting a pair of vectors
                            
                                Access array key using uasort in PHP
                            
                                fast way to get index of top-k elements of every column in a pandas dataframe
                            
                                How to sort only few values inside a list in Python
                            
                                Sort array of objects by array of IDs [duplicate]
                            
                                How is order of properties maintained for sort in mongodb?
                            
                                Sorting with Java 8 by Field given as Input
                            
                                How do I write a sort worse than O(n!)
                            
                                Chaining of ordering predicates (e.g. for std::sort)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to remove duplicated records\observations WITHOUT sorting in SAS？

Tags:

sorting

duplicates

sas

mj023119

People also ask

1 Answers

cmjohns

Recent Activity

Donate For Us