Query to detect duplicate rows

I read data from an XML file into a DataSet using C#, and now I want to identify duplicate (completely identical) rows in it. I tried the following grouping, and it works:

var d= from r1 in table.AsEnumerable()
       group r1 by new
       {
            t0 = r1[0],
            t1 = r1[1],
            t2 = r1[2],
            t3 = r1[3],
            t4 = r1[4],
            t5 = r1[5],
            t6 = r1[6],
            t7 = r1[7],
            t8 = r1[8],
       }
       into grp
       where grp.Count() > 1
       select grp;

But the number of data columns can differ, so I cannot use a static grouping like the one above. Do I have to generate the grouping key dynamically?

I don't want to delete the duplicates, I just want to find them!

asked Jan 28 '13 06:01

srcnaks

1 Answer

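var rows = table.AsEnumerable();
// One row per distinct combination of column values, whatever the column count.
var unique = rows.Distinct(DataRowComparer.Default);
// Default reference comparison is intentional: this yields only the extra copies.
// Passing DataRowComparer.Default here would return an empty sequence.
var duplicates = rows.Except(unique);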

answered Sep 18 '22 11:09

abatishchev
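
A follow-up sketch, not part of the answer above: the Distinct/Except approach returns only the extra copies of each duplicated row, while the query in the question returns whole groups. If you want the groups, one option is to group by the row itself and let DataRowComparer.Default compare all column values, so no column list is hard-coded. The names DuplicateRows and FindGroups are made up for illustration, and on .NET Framework this assumes a reference to System.Data.DataSetExtensions.

```csharp
using System;
using System.Collections.Generic;
using System.Data;
using System.Linq;

public static class DuplicateRows
{
    // Hypothetical helper: returns one group per set of rows whose values
    // match in every column, keeping only groups with more than one row.
    public static List<IGrouping<DataRow, DataRow>> FindGroups(DataTable table)
    {
        return table.AsEnumerable()
            .GroupBy(row => row, DataRowComparer.Default) // value comparison across all columns
            .Where(group => group.Count() > 1)
            .ToList();
    }

    public static void Main()
    {
        // Small in-memory table standing in for the data loaded from XML.
        var table = new DataTable();
        table.Columns.Add("a", typeof(int));
        table.Columns.Add("b", typeof(string));
        table.Rows.Add(1, "x");
        table.Rows.Add(2, "y");
        table.Rows.Add(1, "x");

        foreach (var group in FindGroups(table))
        {
            // group.Key is the first row seen with these values.
            Console.WriteLine($"{group.Count()} copies of: {string.Join(", ", group.Key.ItemArray)}");
        }
        // prints "2 copies of: 1, x"
    }
}
```

Nothing is removed from the table; each group simply holds references to the original rows, the first occurrence included.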

Tags: c#, .net, duplicates, linq
