What is your favorite R debugging trick? [duplicate]

Tags: r, r-faq, debugging

I get an error when using an R function that I wrote:

Warning messages:
1: glm.fit: algorithm did not converge
2: glm.fit: algorithm did not converge

What I have done:

1. Stepped through the function.
2. Added print statements to find the line where the error occurs. This points to two functions that should not use glm.fit: window() and save().

My general approaches include adding print and stop commands, and stepping through a function line by line until I can locate the exception.

However, it is not clear to me using those techniques where this error comes from in the code. I am not even certain which functions within the code depend on glm.fit. How do I go about diagnosing this problem?

394

asked Dec 14 '10 18:12

David LeBauer

2 Answers

I'd say that debugging is an art form, so there's no clear silver bullet. There are good strategies for debugging in any language, and they apply here too (e.g. read this nice article). For instance, the first thing is to reproduce the problem; if you can't do that, you need to get more information (e.g. with logging). Once you can reproduce it, you need to reduce it down to the source.

Rather than a "trick", I would say that I have a favorite debugging routine:

1. When an error occurs, the first thing that I usually do is look at the stack trace by calling traceback(): that shows you where the error occurred, which is especially useful if you have several nested functions.
2. Next I will set options(error=recover); from then on, an error immediately switches into browser mode at the point where it occurs, so you can browse the workspace from there.
3. If I still don't have enough information, I usually use the debug() function and step through the function line by line.
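
A rough sketch of that routine follows. The functions `inner_step()` and `outer_step()` are made up for illustration, and the interactive calls are left as comments because they only make sense at the console. One addition that is not part of the routine above: the message in the question is a warning rather than an error, so setting `options(warn = 2)`, which turns warnings into errors, may be needed before `traceback()` or `recover` have anything to show.

```r
# Hypothetical functions standing in for your own code.
inner_step <- function(x) {
  as.numeric(x)   # warns "NAs introduced by coercion" for non-numeric text
}
outer_step <- function(x) inner_step(x) + 1

# Assumption: the problem is a warning, not an error. warn = 2 converts
# warnings into errors so the error-based tools below can see them.
old_opts <- options(warn = 2)
res <- try(outer_step("abc"), silent = TRUE)
options(old_opts)                       # restore the previous setting
is_error <- inherits(res, "try-error")  # the warning now surfaces as an error

# 1. At the console, right after an uncaught error:
#      traceback()
# 2. To enter the browser wherever the next error occurs:
#      options(error = recover)
# 3. To step through one function line by line:
debug(inner_step)                  # flag the function for stepping
flagged <- isdebugged(inner_step)  # confirm the flag is set
undebug(inner_step)                # clear it again when finished
```

Restoring the old options right after the call keeps the stricter warning behaviour from leaking into the rest of the session.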

The best new trick in R 2.10 (when working with script files) is to use the findLineNum() and setBreakpoint() functions.
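
For what it's worth, here is a minimal sketch of how those two functions might be used. The script contents and the function name `scale_and_shift()` are invented for the example, and the breakpoint calls are commented out because they open the browser, which is only useful in an interactive session.

```r
# Write a small hypothetical script to a temporary file.
script <- tempfile(fileext = ".R")
writeLines(c(
  "scale_and_shift <- function(x) {",
  "  y <- x * 2",
  "  z <- y + 1",
  "  z",
  "}"
), script)

# keep.source = TRUE records file and line references for the sourced code;
# without them the line lookup below has nothing to match against.
source(script, keep.source = TRUE)

# Ask which function (if any) contains line 3 of the script.
locs <- findLineNum(script, 3)

# At the console you could then set a breakpoint on that line and run the
# function, which would stop in the browser at line 3:
#   setBreakpoint(script, 3)
#   scale_and_shift(2)
#   setBreakpoint(script, 3, clear = TRUE)   # remove the breakpoint afterwards
```

Both functions rely on source references, so code that was sourced or installed without them will probably not be matched.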

As a final comment: depending upon the error, it is also very helpful to set try() or tryCatch() statements around external function calls (especially when dealing with S4 classes). That will sometimes provide even more information, and it also gives you more control over how errors are handled at run time.
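
To illustrate the idea, here is a small sketch. `fit_step()` and `analysis()` are hypothetical stand-ins for your own functions, and the warning text merely imitates the one in the question. The use of `withCallingHandlers()` is my own addition: unlike `tryCatch()`, it lets the computation continue after the warning has been recorded.

```r
# Hypothetical functions; fit_step() imitates a model fit that warns.
fit_step <- function() {
  warning("algorithm did not converge")
  "fit done"
}
analysis <- function() fit_step()

# tryCatch() stops at the first warning and returns the handler's value.
outcome <- tryCatch(
  analysis(),
  warning = function(w) paste("caught:", conditionMessage(w))
)

# withCallingHandlers() records the warning and the call stack at the point
# where it was raised, then lets analysis() run to completion.
seen <- character()
calls <- NULL
result <- withCallingHandlers(
  analysis(),
  warning = function(w) {
    seen <<- c(seen, conditionMessage(w))
    calls <<- sys.calls()            # the stack shows who raised the warning
    invokeRestart("muffleWarning")   # resume instead of aborting
  }
)
```

Printing `calls` afterwards lists the chain of function calls that was active when the warning was raised, which is one way to find out which of your functions ends up calling the code that warns.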

These related questions have a lot of suggestions:

- Debugging tools for the R language
- Debugging lapply/sapply calls
- Getting the state of variables after an error occurs in R
- R script line numbers at error?

117

answered Sep 21 '22 02:09

Shane

The best walkthrough I've seen so far is:

http://www.biostat.jhsph.edu/%7Erpeng/docs/R-debug-tools.pdf

Anybody agree/disagree?

answered Sep 23 '22 02:09

Christopher DuBois
