
What happens when you use string interpolation in ruby?

Tags:

ruby

I thought that Ruby just calls the to_s method, but I can't explain how this works:

class Fake
  def to_s
    self
  end
end

"#{Fake.new}"

By that logic, this should raise "stack level too deep" because of infinite recursion. But it works fine and seems to call #to_s from Object.

=> "#<Fake:0x137029f8>"

But why?

ADDED:

class Fake
  def to_s
    Fake2.new
  end
end

class Fake2
  def to_s
    "Fake2#to_s"
  end
end

This code works differently in two cases:

puts "#{Fake.new}" => "#<Fake:0x137d5ac4>"

But:

puts Fake.new.to_s => "Fake2#to_s"

I think that's abnormal. Can somebody explain where in the Ruby interpreter this happens internally?


asked Aug 25 '14 15:08

abonec

1 Answer

Short version

Ruby does call to_s, but it checks that to_s returns a string. If it doesn't, Ruby falls back to the default implementation of to_s instead. Calling to_s recursively wouldn't be a good idea (there is no guarantee of termination): you could crash the VM, and Ruby code shouldn't be able to crash the whole VM.

You get different output from Fake.new.to_s because that call returns a Fake2 instance, and printing the result calls to_s a second time, this time on the Fake2 object.
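A minimal sketch of both behaviours, reusing the Fake and Fake2 classes from the question. The variable names are only for illustration, and the exact address in the default representation will differ on every run:

```ruby
class Fake2
  def to_s
    "Fake2#to_s"
  end
end

class Fake
  # Deliberately returns a non-String, as in the question.
  def to_s
    Fake2.new
  end
end

fake = Fake.new

# Interpolation calls Fake#to_s, sees a non-String result, and falls back
# to the default representation of the original Fake object.
interpolated = "#{fake}"

# An explicit call simply returns whatever Fake#to_s returns: a Fake2 instance.
explicit = fake.to_s

puts interpolated     # something like #<Fake:0x...>
puts explicit.class   # Fake2
puts explicit         # puts converts the Fake2 with its own to_s
```

The last line is the "second" to_s call: it is sent to the Fake2 object, which does return a string.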

Long version

To answer "what happens when ruby does x", a good place to start is to look at what instructions get generated for the VM (this is all MRI specific). For your example:

puts RubyVM::InstructionSequence.compile('"#{Foo.new}"').disasm

outputs

0000 trace            1                                               (   1)
0002 getinlinecache   9, <is:0>
0005 getconstant      :Foo
0007 setinlinecache   <is:0>
0009 opt_send_simple  <callinfo!mid:new, argc:0, ARGS_SKIP>
0011 tostring         
0012 concatstrings    1
0014 leave

There's some messing around with the inline cache, and you'll always get trace and leave, but in a nutshell this says:

1. Get the constant Foo.
2. Call its new method.
3. Execute the tostring instruction.
4. Execute the concatstrings instruction with the result of the tostring instruction (the last value on the stack). If you do this with multiple #{} sequences, you can see it building up all the individual strings and then calling concatstrings once, consuming all of those strings.
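To see the multiple-interpolation case for yourself, something like the following should work on MRI. It is only a sketch: Bar is a hypothetical second constant (nothing is executed, so neither constant needs to exist), and the instruction names and operands differ between Ruby versions, so newer releases may show other instructions in place of a single tostring.

```ruby
# RubyVM is specific to MRI (CRuby). compile only compiles the source,
# so the undefined constants Foo and Bar are never looked up.
source = '"#{Foo.new} and #{Bar.new}"'
disasm = RubyVM::InstructionSequence.compile(source).disasm

puts disasm

# Pick out just the lines that deal with string conversion and concatenation.
puts disasm.lines.grep(/tostring|concatstrings/)
```

Each interpolated expression gets its own string-conversion step, and a single concatstrings at the end joins all the pieces, including the literal " and " in the middle.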

The instructions in this dump are defined in insns.def: this maps these instructions to their implementation. You can see that tostring just calls rb_obj_as_string.

If you search for rb_obj_as_string through the ruby codebase (I find http://rxr.whitequark.org useful for this) you can see it's defined here as

VALUE
rb_obj_as_string(VALUE obj)
{
    VALUE str;

    if (RB_TYPE_P(obj, T_STRING)) {
        return obj;
    }
    str = rb_funcall(obj, id_to_s, 0);
    if (!RB_TYPE_P(str, T_STRING))
        return rb_any_to_s(obj);
    if (OBJ_TAINTED(obj)) OBJ_TAINT(str);
    return str;
}

In brief, if we already have a string then return that. If not, call the object's to_s method. Then (and this is what is crucial for your question) it checks the type of the result. If it's not a string, it returns the result of rb_any_to_s instead, which is the function that implements the default to_s.
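The same logic can be modelled roughly in plain Ruby. This is an illustration rather than what MRI actually runs: obj_as_string is a hypothetical helper name, Kernel#to_s stands in for rb_any_to_s, and the taint propagation is left out.

```ruby
# A rough Ruby model of the C function rb_obj_as_string.
def obj_as_string(obj)
  # Already a String: return it untouched.
  return obj if obj.is_a?(String)

  str = obj.to_s
  # to_s misbehaved: fall back to the default Kernel#to_s of the ORIGINAL
  # object instead of calling to_s again on the result.
  return Kernel.instance_method(:to_s).bind(obj).call unless str.is_a?(String)

  str
end

class Fake
  def to_s
    self
  end
end

fake = Fake.new
puts obj_as_string(42)                   # 42
puts obj_as_string(fake)                 # something like #<Fake:0x...>
puts obj_as_string(fake) == "#{fake}"    # true
```

Because the fallback is applied to the original object and never to the non-string result, there is no recursion to run away, which is why the first example in the question terminates.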


answered Nov 15 '22 20:11

Frederick Cheung
