Well, I decided to do a very quick and dirty grepping of statistics with the fol...

fmela · on June 23, 2015

I am surprised to see lea so high on the list. Back when I last wrote x86 assembly code (early 2000s), it was considered slow, but perhaps that has changed.

e12e · on June 23, 2015

I came across:

http://www.realworldtech.com/haswell-cpu/4/

And older, about amd64 in general (x86/32bit is quite different from 64bit):

"lea -0x30(%edx,%esi,8),%esi

Compute the address %edx+8*%esi-48, but don’t refer to the contents of memory. Instead, store the address itself into register %esi. This is the “load effective address” instruction: its binary coding is short, it doesn’t tie up the integer unit, and it doesn’t set the flags."

http://people.freebsd.org/~lstewart/references/amd64.pdf

[ed: in general the purpose of lea is to get the address of things in c/pascal arrays (pointer to array+offset) -- as I understand it. I don't know if it's used for other tricks by optimizing (c) compilers much -- but taking the address of an item in an array (say a character in a C string) sounds like it would be fairly common. If lea is fast(er) on amd64 using the dedicated operand would make sense.]

fmela · on June 23, 2015

Yeah, despite it's name, lea is an arithmetic instruction; it doesn't reference memory (although it was designed with computing memory addresses in mind). You can do some neat arithmetic tricks with lea, because it computes register1 + register2<<shift + offset.

astrange · on June 23, 2015

'lea' is a fast way to multiply by 3, for instance. It's also faster than two adds.

e12e · on June 23, 2015

I found the (top) operands that deal with push/pop (in some form) most interesting:

  9899354 callq
  3879175 pop
  2824699 push
  1311476 retq

I suppose many of the callq goes into the kernel/libc, so the coresponding ret/retq wouldn't be in the binaries?

Interesting that pop ~ 2x push. Perhaps this is due to many off the callq's leading to something interesting being left on the stack?

SamReidHughes · on June 23, 2015

The callq/retq ratio means most functions have... apparently, about 8 calls to others, after inlining. It has nothing to do with libc, per se, there's no reason in general to expect a 1-to-1 call/ret ratio.

kens · on June 23, 2015

I've been wondering what were the worst decisions in assigning single-byte opcodes; can you determine that from your data? I'm guessing that the decimal/ASCII adjust operations (DAA/DAS/AAA/AAS) have extremely low use considering they take up 4/256 of the opcode space.

jcranmer · on June 23, 2015

Some of the 1-byte opcodes are specific subsets of instructions (e.g., 04 is ADD AL, Ib). Keeping in mind that many single-byte opcodes don't actually exist in 64-bit code (e.g., PUSH DS or DAS), the ones that do exist that aren't used: ins, outs, lods, in, stc, cli, wait, pushf, popf, lahf

cbd1984 · on June 23, 2015

Interesting how you know so much shell but still uselessly use cat.

bensummers · on June 23, 2015

It reads better than the alternative, which would be to bury the input filename in the middle of that second line.

cbd1984 · on June 23, 2015

    < file od

for example works just as well.