Is there a reason to prefer '&&' over '&' in 'if' statements, other than short-circuiting?

Tags: r, vectorization, conventions

Yes, I know there have been a number of questions (see this one, for example) regarding the use of & vs. && in R, but I have not found one that specifically answers my question.

As I understand the differences,

& does element-wise, vectorised comparison, much like the arithmetic operators. It hence returns a logical vector that has length > 1 if either argument has length > 1.
&& compares the first elements of both vectors and always returns a result of length 1. Moreover, it does short-circuiting: cond1 && cond2 && cond3 && ... only evaluates cond2 if cond1 is TRUE, and so forth. This allows for things like if(exists("is.R") && is.function(is.R) && is.R()) and particularly means that using && is strictly necessary in some cases.
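To make the two behaviours concrete, here is a minimal sketch; the variable names are made up for illustration. It shows & working element by element, and && skipping its right-hand side once the left-hand side is FALSE, whereas & evaluates both sides regardless.

```r
# Element-wise AND: one result per position.
x <- c(TRUE, FALSE, TRUE)
y <- c(TRUE, TRUE, FALSE)
elementwise <- x & y
print(elementwise)

# Short-circuit AND: the right-hand side is skipped when the left is FALSE.
rhs_ran_short <- FALSE
short <- FALSE && { rhs_ran_short <- TRUE; TRUE }

# The single-ampersand form always evaluates both sides.
rhs_ran_long <- FALSE
long <- FALSE & { rhs_ran_long <- TRUE; TRUE }

print(c(short = short, long = long))
print(c(rhs_ran_short = rhs_ran_short, rhs_ran_long = rhs_ran_long))
```

Both expressions come out FALSE; the only difference is whether the braces on the right were ever run.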

Moreover, if issues the warning

the condition has length > 1 and only the first element will be used

if its condition has more than one element.

Judging from these preliminaries, I'd consider it safer to prefer & to && in all if statements where short-circuiting isn't required.

If something went wrong during calculations and I accidentally have a vector in one of &'s arguments, I get a warning, which is good. If not, everything is fine as well.

If, on the other hand, I used &&, and something went wrong in my calculations and one of &&'s arguments is a vector, I don't get a warning. This is bad. If, for some reason, I really want to compare the first elements of two vectors, I'd argue that it's much cleaner to do so explicitly rather than implicitly.
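Spelling out the first-element comparison could look like the sketch below, where a and b are hypothetical inputs. As far as I am aware, newer R releases have also tightened the behaviour described here: since R 4.2.0 a condition of length > 1 in if is an error rather than a warning, and since R 4.3.0 && itself raises an error when an argument is longer than one, so the silent case applies to older versions. The code avoids depending on either behaviour by indexing explicitly.

```r
a <- c(TRUE, FALSE)
b <- c(TRUE, TRUE)

# State the intent explicitly instead of relying on implicit truncation.
first_both <- a[1] && b[1]

if (first_both) {
  msg <- "first elements are both TRUE"
} else {
  msg <- "at least one first element is FALSE"
}
print(msg)
```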

Note that this is contrary to what seems to be the consensus among R programmers and to what the R docs recommend. (1)

Hence my question: Are there any reasons except short-circuiting that make && preferable to & in if statements?

(1) Citing help("&&"):

'&' and '&&' indicate logical AND and '|' and '||' indicate logical OR. The shorter form performs elementwise comparisons in much the same way as arithmetic operators. The longer form evaluates left to right examining only the first element of each vector. Evaluation proceeds only until the result is determined. The longer form is appropriate for programming control-flow and typically preferred in 'if' clauses.

asked Apr 27 '15 14:04

jhin

2 Answers

Short answer: Yes, the different symbol makes the meaning more clear to the reader.

Thanks for this interesting question! If I can summarize, it seems to be a follow-up specifically about this section of my answer to the question you linked:

... you want to use the long forms only when you are certain the vectors are length one. You should be absolutely certain your vectors are only length 1, such as in cases where they are functions that return only length 1 booleans. You want to use the short forms if the vectors are length possibly >1. So if you're not absolutely sure, you should either check first, or use the short form and then use all and any to reduce it to length one for use in control flow statements, like if.

I hear your question (given comments) this way: But & and && will do the same thing if the inputs are length one, so other than short-circuiting, why prefer &&? Perhaps & should be preferred because if they're not length one, if will give me a warning, helping me be even more certain that the inputs are length one.

First, I agree with the comment by @James that you may be "overstating the value of getting a warning"; if it's not length one, the safer thing will be to handle this appropriately, not to just plow ahead. You could make a case that && should throw an error if they're not length one, and perhaps a good case; I don't know the reason why it does what it does. But without going back in time, the best we can do now is to check that the inputs are indeed appropriate for your use.
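One way to do that check is sketched below. The helper names assert_flag and both_true are hypothetical, not part of base R; the idea is simply to fail loudly on anything that is not a single TRUE or FALSE before && ever sees it.

```r
# Hypothetical helper: stop early unless x is a single, non-missing logical.
assert_flag <- function(x, name = "x") {
  if (!is.logical(x) || length(x) != 1L || is.na(x)) {
    stop(sprintf("`%s` must be a single TRUE or FALSE", name), call. = FALSE)
  }
  invisible(x)
}

# Hypothetical wrapper: validate both inputs, then use the scalar operator.
both_true <- function(p, q) {
  assert_flag(p, "p")
  assert_flag(q, "q")
  p && q
}

print(both_true(TRUE, FALSE))

# A vector input is rejected instead of being silently truncated.
res <- tryCatch(
  both_true(c(TRUE, FALSE), TRUE),
  error = function(e) conditionMessage(e)
)
print(res)
```

Because the length test comes before is.na(x) in the || chain, a longer vector never reaches the last test, so the check itself does not depend on how || treats vectors.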

Given, then, that you have checked to make sure your inputs are appropriate, I would still recommend && because it semantically reminds me as the reader that I should be making sure the inputs are scalars (length one). I'm so used to thinking vector-ally that this reminder is helpful to me. It follows the principle that different operations should have different symbols, and for me, an operation that is meant for use on scalars is different enough from a vectorized operation that it warrants a different symbol.

(Not to start a flame war (I hope), but this is also why I prefer <- to =; one for assigning variables, one for setting parameters to functions. Although deep down this is the same thing, it's different enough in practice to make the different symbols helpful to me as a reader.)

answered Oct 22 '22 00:10

Aaron left Stack Overflow

No, using && does not offer any advantages other than short-circuiting.

However, short-circuiting is very much preferable for control flow, so much so that it should be the default. if statements should not take vectorised arguments; that's what ifelse is for. If you are passing a logical vector into if, you would typically reduce it to a single logical value using any or all before the evaluation.
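As a small illustration of that split (the data and labels are made up), ifelse handles the per-element case while any and all collapse a vector to the single value that if expects:

```r
x <- c(-1, 2, 5)

# Vectorised choice: one result per element, using the element-wise &.
labels <- ifelse(x > 0 & x < 4, "in range", "out of range")
print(labels)

# Control flow: reduce the vector to one value first.
if (any(x < 0)) {
  status <- "has negatives"
} else {
  status <- "all non-negative"
}
print(status)

# all() gives the complementary "every element" reduction.
all_positive <- all(x > 0)
print(all_positive)
```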

The major advantages of short-circuiting are in avoiding lengthy or failure-prone steps (e.g. internet connections, though these should be dealt with through try):

# Avoiding lengthy calculations: & evaluates both sides, so the sleep runs
system.time(if (FALSE & {Sys.sleep(2); TRUE}) print("Hello"))
#>    user  system elapsed 
#>    0.00    0.00    1.99 
# && stops at the first FALSE, so the sleep is skipped
system.time(if (FALSE && {Sys.sleep(2); TRUE}) print("Hello"))
#>    user  system elapsed 
#>       0       0       0 

# Avoiding errors: & evaluates the failing step, && never reaches it
if (FALSE & {stop("Connection Failed"); TRUE}) print("Success") else print("Condition not met")
#> Error: Connection Failed
if (FALSE && {stop("Connection Failed"); TRUE}) print("Success") else print("Condition not met")
#> [1] "Condition not met"

It is clear that in order to take advantage of these features, you would have to know in advance which steps take the longest or are prone to errors and construct the logical statement appropriately.
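A rough sketch of that ordering point, using a hypothetical slow_check() that counts how often it is called as a stand-in for the slow or error-prone step:

```r
# Hypothetical stand-in for a slow or failure-prone step; it counts its calls.
calls <- 0L
slow_check <- function() {
  calls <<- calls + 1L
  Sys.sleep(0.1)
  TRUE
}

cheap_flag <- FALSE

# Cheap test first: slow_check() is never reached.
if (cheap_flag && slow_check()) print("both hold")
calls_cheap_first <- calls

# Slow test first: slow_check() runs even though the overall result is FALSE.
if (slow_check() && cheap_flag) print("both hold")
calls_slow_first <- calls

print(c(cheap_first = calls_cheap_first, slow_first = calls_slow_first))
```

The outcome of the condition is the same either way; only the placement of the cheap test decides whether the costly call happens at all.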

answered Oct 21 '22 22:10

James
