Should LOC counting include tests and comments?

Tags:

While LOC (# lines of code) is a problematic measurement of a code's complexity, it is the most popular one, and when used very carefully, can provide a rough estimate of at least relative complexities of code bases (i.e. if one program is 10KLOC and another is 100KLOC, written in the same language, by teams of roughly the same competence, the second program is almost certainly much more complex).

When counting lines of code, do you prefer to count comments in ? What about tests?

I've seen various approaches to this. Tools like cloc and sloccount allow to either include or exclude comments. Other people consider comments part of the code and its complexity.

The same dilemma exists for unit tests, that can sometimes reach the size of the tested code itself, and even exceed it.

I've seen approaches all over the spectrum, from counting only "operational" non-comment non-blank lines, to "XXX lines of tested, commented code", which is more like running "wc -l on all code files in the project".

What is your personal preference, and why?

588

asked Nov 08 '08 09:11

Eli Bendersky

1 Answers

A wise man once told me 'you get what you measure' when it comes to managing programmers.

If you rate them in their LOC output amazingly you tend to get a lot of lines of code.

If you rate them on the number of bugs they close out, amazingly you get a lot of bugs fixed.

If you rate them on features added, you get a lot of features.

If you rate them on cyclomatic complexity you get ridiculously simple functions.

Since one of the major problems with code bases these days is how quickly they grow and how hard they are to change once they've grown, I tend to shy away from using LOC as a metric at all, because it drives the wrong fundamental behavior.

That said, if you have to use it, count sans comments and tests and require a consistent coding style.

But if you really want a measure of 'code size' just tar.gz the code base. It tends to serve as a better rough estimate of 'content' than counting lines which is susceptible to different programming styles.

answered Oct 12 '22 00:10

Edward Kmett

Related questions
                            
                                Why Pearson correlation is different between Tensorflow and Scipy
                            
                                Keras custom RMSLE metric
                            
                                How do I use ELB's HealthyHostCount for monitoring in CloudWatch?
                            
                                Is there a python version for the JVM based metrics library
                            
                                How to get the K most distant points, given their coordinates?
                            
                                How can you get the height metric of a string in PostScript?
                            
                                Are there any good tools to collect Objective-C metrics?
                            
                                Apache Beam Counter/Metrics not available in Flink WebUI
                            
                                is there a way with spaCy's NER to calculate metrics per entity type?
                            
                                Custom Metrics for Actuator Prometheus
                            
                                How do I make Hudson/Jenkins fail if Sonar thresholds are breached?
                            
                                How do I convert between a measure of similarity and a measure of difference (distance)?
                            
                                Collecting Application Metrics in Java (optionally .Net)
                            
                                Disable plugins on Eclipse startup
                            
                                Storm UI: Difference between Execute and Process Latencies
                            
                                sklearn metrics.log_loss is positive vs. scoring 'neg_log_loss' is negative
                            
                                Kubernetes 1.11 could not find heapster for metrics
                            
                                PHPUnit and C.R.A.P index
                            
                                Specific software metrics for Clojure programs
                            
                                Difference between Codahale metrics and Dropwizard metrics

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Should LOC counting include tests and comments?

Tags:

code-metrics

metrics

Eli Bendersky

People also ask

1 Answers

Edward Kmett

Recent Activity

Donate For Us