Why doesn't tail recursion result in better performance in this code?

Tags: performance, benchmarking, scala

I was creating a faster string splitter method. First, I wrote a non-tail recursive version returning List. Next, a tail recursive one using ListBuffer and then calling toList (+= and toList are O(1)). I fully expected the tail recursive version to be faster, but that is not the case.

Can anyone explain why?

Original version:

def split(s: String, c: Char, i: Int = 0): List[String] = if (i < 0) Nil else {
  val p = s indexOf (c, i)
  if (p < 0) s.substring(i) :: Nil else s.substring(i, p) :: split(s, c, p + 1)
}

Tail recursive one:

import scala.annotation.tailrec
import scala.collection.mutable.ListBuffer
def split(s: String, c: Char): Seq[String] = {
  val buffer = ListBuffer.empty[String]
  @tailrec def recurse(i: Int): Seq[String] =  {
    val p = s indexOf (c, i)
    if (p < 0) {
      buffer += s.substring(i)
      buffer.toList
    } else {
      buffer += s.substring(i, p)
      recurse(p + 1)
    }
  }
  recurse(0)
}

This was benchmarked with code here, with results here, by #scala's jyxent.

asked Jul 28 '11 20:07

Daniel C. Sobral

2 Answers

You're simply doing more work in the second case. In the first case you might overflow the stack, but every operation is very simple, and :: is about as small a wrapper as you can get: all it has to do is create the wrapper and point it at the head of the other list. In the second case, you not only create an extra collection up front and form a closure around s and buffer for the nested method to use, but you also use the heavier-weight ListBuffer, which has to check on each += whether it has already been copied out to a list, and which takes different code paths depending on whether it is empty (in order to make the O(1) append work).
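The difference between the two building blocks can be seen in a small sketch. The object and method names here (ConsVsBuffer, consSharesTail, snapshotSurvivesAppend) are made up for illustration, and the sketch only shows observable behaviour, not ListBuffer's internals: prepending with :: reuses the existing list as its tail, while a ListBuffer has to keep a list it already handed out via toList unaffected by later appends.

```scala
import scala.collection.mutable.ListBuffer

// Hypothetical demo object; names are illustrative only.
object ConsVsBuffer {
  // `::` allocates one cell whose tail is the existing list (no copying).
  def consSharesTail: Boolean = {
    val tail = List("b", "c")
    val whole = "a" :: tail
    whole.tail eq tail // reference equality: the very same list object
  }

  // A list obtained from toList must not change when the buffer grows,
  // so the buffer has extra bookkeeping to do on later appends.
  def snapshotSurvivesAppend: (List[String], List[String]) = {
    val buffer = ListBuffer("a")
    val snapshot = buffer.toList
    buffer += "b"
    (snapshot, buffer.toList)
  }

  def main(args: Array[String]): Unit = {
    println(consSharesTail)
    println(snapshotSurvivesAppend)
  }
}
```

This does not measure anything; it only illustrates why += has more to keep track of than ::.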

175

answered Nov 15 '22 10:11

Rex Kerr

You expect the tail recursive version to be faster due to the tail call optimization and I think this is right, if you compare apples to apples:

def split3(s: String, c: Char): Seq[String] = {
  @tailrec def recurse(i: Int, acc: List[String] = Nil): Seq[String] =  {
    val p = s indexOf (c, i)
    if (p < 0) {
      s.substring(i) :: acc
    } else {
      recurse(p + 1, s.substring(i, p) :: acc)
    }
  }
  recurse(0) // would need to reverse
}

I timed this split3 to be faster, although to produce the same result as the other versions it would still need to reverse the list.

It does seem ListBuffer introduces inefficiencies that the tail recursion optimization cannot make up for.
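For reference, here is a minimal sketch of the accumulator version with the reverse applied, so that its output matches the other versions. The names SplitReversed and split3Reversed are hypothetical, and this variant was not part of the timing mentioned above, so no claim is made about its speed.

```scala
import scala.annotation.tailrec

// Hypothetical wrapper object; not part of the benchmarked code.
object SplitReversed {
  def split3Reversed(s: String, c: Char): List[String] = {
    @tailrec def recurse(i: Int, acc: List[String]): List[String] = {
      val p = s.indexOf(c, i)
      if (p < 0) {
        // Pieces were prepended, so acc is back to front: reverse once at the end.
        (s.substring(i) :: acc).reverse
      } else {
        recurse(p + 1, s.substring(i, p) :: acc)
      }
    }
    recurse(0, Nil)
  }
}
```

The reverse is one extra O(n) pass over the result, which is the cost the next version tries to avoid.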

Edit: thinking about avoiding the reverse...

def split3(s: String, c: Char): Seq[String] = {
  @tailrec def recurse(i: Int, acc: List[String] = Nil): Seq[String] =  {
    val p = s lastIndexOf (c, i)
    if (p < 0) {
      s.substring(0, i + 1) :: acc
    } else {
      recurse(p - 1, s.substring(p + 1, i + 1) :: acc)
    }
  }
  recurse(s.length - 1)
}

This has the tail call optimization and avoids ListBuffer.
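A quick way to check that scanning from the right produces the same pieces as the original is to compare the two on a few edge cases. This is only a sketch: the names SplitCheck, splitRec and splitBackward are made up here, splitRec is the question's version without its `i < 0` guard, and the sample inputs are arbitrary.

```scala
import scala.annotation.tailrec

// Hypothetical test harness; names are illustrative only.
object SplitCheck {
  // Non-tail-recursive version, adapted from the question.
  def splitRec(s: String, c: Char, i: Int = 0): List[String] = {
    val p = s.indexOf(c, i)
    if (p < 0) s.substring(i) :: Nil
    else s.substring(i, p) :: splitRec(s, c, p + 1)
  }

  // Right-to-left tail-recursive version from this answer.
  def splitBackward(s: String, c: Char): List[String] = {
    @tailrec def recurse(i: Int, acc: List[String]): List[String] = {
      val p = s.lastIndexOf(c, i)
      if (p < 0) s.substring(0, i + 1) :: acc
      else recurse(p - 1, s.substring(p + 1, i + 1) :: acc)
    }
    recurse(s.length - 1, Nil)
  }

  def main(args: Array[String]): Unit = {
    // Empty input, leading/trailing delimiters and empty fields.
    val samples = List("a,b,c", "", ",", "a,", ",a", "no delimiter", "a,,b")
    for (sample <- samples)
      assert(splitRec(sample, ',') == splitBackward(sample, ','), s"mismatch for '$sample'")
  }
}
```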

answered Nov 15 '22 10:11

huynhjl
