
How to get COUNT DISTINCT in translated SQL with EF Core

Tags:

c#

ef-core-2.1

I want EF Core to translate .Select(x=>x.property).Distinct().Count() into something like

SELECT COUNT(DISTINCT property)

Let's take an example. Say I have a DB table with PersonID (long), VisitStart (datetime2) and VisitEnd (datetime2). If I want to get the number of distinct days each person has visited, I could write SQL like

SELECT COUNT(DISTINCT CONVERT(date, VisitStart)) FROM myTable GROUP BY PersonID

But with EF Core, this query

MyTable
    .GroupBy(x=>x.PersonID)
    .Select(x=> new 
    {
        Count = x.Select(y=>y.VisitStart.Date).Distinct().Count()
    })

which gives the right results, translates into this SQL

SELECT [x].[PersonID], [x].[VisitStart], [x].[VisitEnd]
FROM [myTable] as [x]
ORDER BY [x].[PersonID]

There is no GROUP BY, DISTINCT or COUNT anywhere, so the grouping must be done in memory. That is not ideal for a table with millions of records, all of which potentially have to be pulled from the DB.

Does anyone know how to get EF Core to translate .Select(...).Distinct().Count() into SELECT COUNT(DISTINCT ...)?

asked Jun 28 '19 08:06

smok

1 Answer

I wanted to share an idea I had for solving my own issue with count distinct.

Another way of doing a count distinct inside a group by is to nest the group by calls (assuming your data can be aggregated in stages). Group first by the outer key together with the value you want to count distinctly, then group that result by the outer key alone; the number of rows in each outer group is then the distinct count.

Here is an example of what I used; it seems to work.

Apologies for the cryptic acronyms; I use them to keep my JSON as small as possible.

// First level: one row per (day, tn) pair, with per-day totals.
var myData = _context.ActivityItems
    .GroupBy(a => new { ndt = EF.Property<DateTime>(a, "dt").Date, ntn = a.tn })
    .Select(g => new
    {
        g.Key.ndt,
        g.Key.ntn,
        dpv = g.Sum(o => o.pv),
        dlv = g.Sum(o => o.lv),
        cnt = g.Count(),
    })
    // Second level: group the per-day rows by tn alone.
    .GroupBy(a => new { ntn = a.ntn })
    .Select(g => new
    {
        g.Key.ntn,
        sd = g.Min(o => o.ndt),
        ld = g.Max(o => o.ndt),
        pSum = g.Sum(o => o.dpv),
        pMin = g.Min(o => o.dpv),
        pMax = g.Max(o => o.dpv),
        pAvg = g.Average(o => o.dpv),
        lSum = g.Sum(o => o.dlv),
        lMin = g.Min(o => o.dlv),
        lMax = g.Max(o => o.dlv),
        lAvg = g.Average(o => o.dlv),
        n10s = g.Sum(o => o.cnt), // total number of underlying rows
        ndays = g.Count()         // one row per day, so this is the distinct day count
    });
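
As a minimal sketch, the same nested grouping applied to the table from the question could look like the code below. The Visit class, the DistinctDayCounter helper and the sample data are hypothetical names made up for illustration. The sample runs against an in-memory list through AsQueryable(), so it only shows that the two query shapes return the same counts. Whether EF Core turns either shape into a single SQL statement depends on the version and provider, so check the generated SQL before relying on it.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical entity mirroring the table described in the question.
public sealed class Visit
{
    public long PersonID { get; set; }
    public DateTime VisitStart { get; set; }
    public DateTime VisitEnd { get; set; }
}

public static class DistinctDayCounter
{
    // Shape from the question: Distinct().Count() inside a single GroupBy.
    public static Dictionary<long, int> CountDirect(IQueryable<Visit> visits) =>
        visits
            .GroupBy(v => v.PersonID)
            .Select(g => new
            {
                PersonID = g.Key,
                Days = g.Select(v => v.VisitStart.Date).Distinct().Count()
            })
            .ToDictionary(x => x.PersonID, x => x.Days);

    // Nested shape: group by (person, day) first so each pair appears once,
    // then group by person and count the remaining rows.
    public static Dictionary<long, int> CountNested(IQueryable<Visit> visits) =>
        visits
            .GroupBy(v => new { v.PersonID, Day = v.VisitStart.Date })
            .Select(g => new { g.Key.PersonID, g.Key.Day })
            .GroupBy(x => x.PersonID)
            .Select(g => new { PersonID = g.Key, Days = g.Count() })
            .ToDictionary(x => x.PersonID, x => x.Days);
}

public static class Program
{
    public static void Main()
    {
        // Made-up sample data: person 1 visits twice on June 1 and once on June 2,
        // person 2 visits once.
        var visits = new List<Visit>
        {
            new Visit { PersonID = 1, VisitStart = new DateTime(2019, 6, 1, 9, 0, 0),  VisitEnd = new DateTime(2019, 6, 1, 10, 0, 0) },
            new Visit { PersonID = 1, VisitStart = new DateTime(2019, 6, 1, 15, 0, 0), VisitEnd = new DateTime(2019, 6, 1, 16, 0, 0) },
            new Visit { PersonID = 1, VisitStart = new DateTime(2019, 6, 2, 9, 0, 0),  VisitEnd = new DateTime(2019, 6, 2, 10, 0, 0) },
            new Visit { PersonID = 2, VisitStart = new DateTime(2019, 6, 1, 9, 0, 0),  VisitEnd = new DateTime(2019, 6, 1, 10, 0, 0) },
        }.AsQueryable();

        var nested = DistinctDayCounter.CountNested(visits);

        // Prints "1: 2" and then "2: 1".
        foreach (var pair in nested.OrderBy(p => p.Key))
        {
            Console.WriteLine($"{pair.Key}: {pair.Value}");
        }
    }
}
```

Both helpers take an IQueryable<Visit>, so the same expressions could be pointed at a DbSet<Visit> to compare the SQL each one produces.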

answered Sep 28 '22 11:09

Gareth
