Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Why are binaries built with gccgo smaller (among other differences?)

Tags:

go

gccgo

I've been experimenting with gc and gccgo, and I've encountered some odd behaviour.

Using a program I once wrote to test some theorem, I got these results: (I removed unnecessary information for readablitity)

$ time go build -compiler gc -o checkprog_gc checkprog.go (x 3)
go build <...>    0.13s user 0.02s system 100% cpu 0.149 total
go build <...>    0.13s user 0.01s system 99%  cpu 0.148 total
go build <...>    0.14s user 0.03s system 100% cpu 0.162 total
 --> average:     0.13s user 0.02s system 100% cpu 0.153 total


$ time go build -compiler gccgo -o checkprog_gccgo checkprog.go (x 3)
go build <...>    0.10s user 0.03s system 96% cpu 0.135 total
go build <...>    0.12s user 0.01s system 96% cpu 0.131 total
go build <...>    0.10s user 0.01s system 92% cpu 0.123 total
 --> average:     0.11s user 0.02s system 95% cpu 0.130 total


$ strip -s -o checkprog_gc_stripped checkprog_gc
$ strip -s -o checkprog_gccgo_stripped checkprog_gccgo

$ ls -l
 1834504 checkprog_gc*
 1336992 checkprog_gc_stripped*
   35072 checkprog_gccgo*
   24192 checkprog_gccgo_stripped*

$ time ./checkprog_gc
./checkprog_gc  6.68s user 0.01s system 100% cpu 6.674 total
./checkprog_gc  6.75s user 0.01s system 100% cpu 6.741 total
./checkprog_gc  6.66s user 0.00s system 100% cpu 6.643 total
 --> average:   6.70s user 0.01s system 100% cpu 6.686 total

$ time ./checkprog_gccgo
./checkprog_gccgo  10.95s user 0.02s system 100% cpu 10.949 total
./checkprog_gccgo  10.98s user 0.01s system 100% cpu 10.964 total
./checkprog_gccgo  10.94s user 0.01s system 100% cpu 10.929 total
 --> average       10.96s user 0.01s system 100% cpu 10.947 total

I can see the following patterns:

  1. Binaries built with gccgo are radically smaller in size (and stripping doesn't help to change this difference)
  2. Binaries built with gc are faster to execute
  3. It takes a bit more time to build with gccgo than with gc

I also tested some other go programs (while not that extensively) and all of them exhibit the same behavior.

This seems to contradict what this answer states:

In short: gccgo: more optimization, more processors.

I'd think that more optimization means faster binaries, while needing more time to compile...

What's the reason these three patterns?

like image 455
Evert Heylen Avatar asked Nov 21 '14 17:11

Evert Heylen


People also ask

Is Gccgo faster?

Compared to gc, gccgo is slower to compile code but supports more powerful optimizations, so a CPU-bound program built by gccgo will usually run faster.

What is Gccgo?

The gccgo compiler is a new frontend for GCC, the widely used GNU compiler. Although the frontend itself is under a BSD-style license, gccgo is normally used as part of GCC and is then covered by the GNU General Public License (the license covers gccgo itself as part of GCC; it does not cover code generated by gccgo).

How does the Go compiler work?

Go is a compiled language. This means we must run our source code files through a compiler, which reads source code and generates a binary, or executable, file that is used to run the program. Examples of other popular compiled languages include C, C++, and Swift.


1 Answers

There are a bunch of differences--bradfitz talked about some of them in a May 2014 talk:

  • gccgo can produce a binary that dynamically links in libgo, which makes the output smaller but means the relevant library to be installed on the target machine. Go binaries without cgo don't have that requirement.
  • gccgo does more low-level optimizations 'cause it can use gcc's code generator and optimizer. Writing some data-compression code, gccgo ran it noticeably faster than gc. Those same optimizations make the compiler slower: it's doing more work.
  • gccgo supports the target processors that gcc does, so it's the only way to get on some architectures like SPARC, ARMv8 (64-bit) or POWER. (Canonical uses it to compile their Juju service orchestration tool for arm64 and ppc64.)
  • gccgo and gc both support ARMv7 (32-bit), but according to bradfitz's talk gc does not generate the most efficient ARM code.
  • There are certain optimizations only gc has.
    • A big one is escape analysis, in which the compiler determines that some variables will never "escape" the function where they're allocated and therefore can be stack-allocated. (So, surprisingly, new(T) may not heap-allocate if its return value doesn't escape.) This reduces how often garbage collection needs to run.
    • Another is that .s assembler files in the standard library are only linked in by gc, so some stuff like Intel hardware CRC32C isn't used by gccgo by default (you'd have to provide an implementation specifically for gccgo).
  • gc implements new language features first and has generally been a minor Go version or two ahead of the latest gccgo .
like image 73
7 revs Avatar answered Oct 12 '22 01:10

7 revs