I have a data.table <code>dt</code>: <pre class="prettyprint"><code>library(data.table) dt = data.table(a=LETTERS[c(1,1:3)],b=4:7) a b 1: A 4 2: A 5 3: B 6 4: C 7 </code></pre> The result of <code>dt[, .N, by=a]</code> is <pre class="prettyprint"><code> a N 1: A 2 2: B 1 3: C 1 </code></pre> I know the <code>by=a</code> or <code>by="a"</code> means grouped by <code>a</code> column and the <code>N</code> column is the sum of duplicated times of <code>a</code>. However, I don't use <code>nrow()</code> but I get the result. The <code>.N</code> is not just the column name? I can't find the document by <code>??".N"</code> in R. I tried to use <code>.K</code>, but it doesn't work. What does <code>.N</code> means?

Think of <code>.N</code> as a variable for the number of instances. For example: <pre class="prettyprint"><code>dt <- data.table(a = LETTERS[c(1,1:3)], b = 4:7) dt[.N] # returns the last row # a b # 1: C 7 </code></pre> Your example returns a new variable with the number of rows per case: <pre class="prettyprint"><code>dt[, new_var := .N, by = a] dt # a b new_var # 1: A 4 2 # 2 'A's # 2: A 5 2 # 3: B 6 1 # 1 'B' # 4: C 7 1 # 1 'C' </code></pre> For a list of all special symbols of data.table, see also https://www.rdocumentation.org/packages/data.table/versions/1.10.0/topics/special-symbols

What does ".N" means in data table in r?

Tags:

I have a data.table dt:

library(data.table)
dt = data.table(a=LETTERS[c(1,1:3)],b=4:7)

   a b
1: A 4
2: A 5
3: B 6
4: C 7

The result of dt[, .N, by=a] is

   a N
1: A 2
2: B 1
3: C 1

I know the by=a or by="a" means grouped by a column and the N column is the sum of duplicated times of a. However, I don't use nrow() but I get the result. The .N is not just the column name? I can't find the document by ??".N" in R. I tried to use .K, but it doesn't work. What does .N means?

476

asked Oct 13 '15 12:10

Eric Chang

1 Answers

Think of .N as a variable for the number of instances. For example:

dt <- data.table(a = LETTERS[c(1,1:3)], b = 4:7)

dt[.N] # returns the last row
#    a b
# 1: C 7

Your example returns a new variable with the number of rows per case:

dt[, new_var := .N, by = a]
dt
#    a b new_var
# 1: A 4       2 # 2 'A's
# 2: A 5       2
# 3: B 6       1 # 1 'B'
# 4: C 7       1 # 1 'C'

For a list of all special symbols of data.table, see also https://www.rdocumentation.org/packages/data.table/versions/1.10.0/topics/special-symbols

191

answered Sep 19 '22 16:09

David

Related questions
                            
                                How to group result by array column in Postgres?
                            
                                How to identify date from a string in Java
                            
                                How do I remove the first item of an array in twig?
                            
                                boolean values in Spring application.properties file?
                            
                                Debugging website on local IIS without administrative privileges
                            
                                Rows returned by pyodbc are not JSON serializable
                            
                                The ec2 instance can't access internet in a public subnet without a elastic ip address?
                            
                                Excel vba add code to sheet module programmatically
                            
                                How to create dataset similar to cifar-10 [closed]
                            
                                How to make a loading screen in three.js?
                            
                                Which view should be used for new Material Design Bottom Navigation? [duplicate]
                            
                                Can't import the maven project in IntelliJ Idea 2016.1.1

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With