The feedback we received indicates that you used the backspace key twice during coding. We expect higher precision than that.Also, we&#x27;d like to remind you that you can only re-apply after our cool-off period, which is 25 years.

(I am the post author).
I wrote the original version of this post using x86, but leetcode won&#x27;t accept it. So I had to improvise...

This is pretty damn cool, but unfortunately you failed the interview as you accepted the challenge to use x86 assembler, but solved the problem using a different programming language from the one we asked you to use. We&#x27;ll keep your resume on file, and if there are any openings in the future we encourage you to apply for those.

For those who don&#x27;t know, that&#x27;s why big and little endian were called that, because the debate was so frivolous. It&#x27;s a reference to the book Gulliver&#x27;s Travels by Jonathan Swift in which an island folk was split about from which end you should crack a boiled egg. (I&#x27;m a big endian for example).

except now that everything depends on the internet, and words that go over networks are big endian, it seems insane to throw away millions and millions of cpu cycles every year converting them to little endian to be processed by our little endian cpus. sure, it&#x27;s a single cpu instruction, but between every computer in the world, almost all of them being little endian arm or intel, that&#x27;s billions and billions and billions of instructions wasted.

AT&amp;T really is annoying but it feels like the big vs little endian debate. Fairly easy to convert between the two as well.

I immediately stopped reading the minute I read that in the text. I can&#x27;t take anything they say seriously after reading that.

&gt;I will be using x64 assembly with the AT&amp;T syntax as it is objectively superior than the Intel syntax.Them&#x27;s fighting words.

&gt; I will be using x64 assembly with the AT&amp;T syntax as it is objectively superior than the Intel syntax.This made me laugh because it must be a reference to this: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=33652023" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=33652023</a>&gt; I contend that the AT&amp;T syntax is harmful and bad, and should never be used, for any reason, under any circumstances, by anyone.

However, when writing in assembly one must pay attention that at least RBX, RBP and R12 through R15 must be preserved by any functionOnly if you&#x27;re calling external code that assumes that. The power of Asm largely comes from not needing to follow arbitrary conventions in your own code. The boundaries where you interface to external code are the only constraints.

&gt; The variables on the stack are the most efficient after registersWhy are variables on the stack more efficient than other memory accesses?

I know none of the calling conventions in any detail anymore and just used the registers in alphabetical order. Totally expected that this would violate something.

The variables on the stack are the most efficient after registers, so you are right that a local variable should be kept into a register if possible, otherwise in the stack, and only then in other places (e.g. if it is too large for the stack).However, when writing in assembly one must pay attention that at least RBX, RBP and R12 through R15 must be preserved by any function (on Windows also RDI and RSI must be preserved).So in your code you should not use RBX, but a volatile register, e.g. RDX or RCX. If you would insist on using RBX, it would have to be saved and restored.

No. Why do you ask?EDIT: As Wikipedia describes it as a stack-oriented language, because of my comment about putting everything onto the stack?

Assume RAX points to the root node and nodes just contain two child pointers and everything is aligned and whatever.<pre><code> :invert
 cmp rax, 0
 jnz swap:
 ret
 :swap
 push rax
 mov rax, [rax]
 call invert
 mov rbx,[rsp]
 xchg rax, [rbx+8]
 mov [rbx], rax
 call invert
 pop rax
 ret
</code></pre>
The last time I used an assembler was before x86-64 was invented, I am not even sure I ever used one in protected mode. But that seems a totally reasonable whiteboard interview question. Written in notepad, might not assemble. Might even be totally incorrect and I am posting it so that the internet generates the warning and error messages.EDIT: After reading the article now, that seems rather inefficient to me, to use local variables on the stack for everything. And why is the function returning a node if it is mutating the tree in place?

The tweet this is based on is a joke. To invert a Merkle tree would mean to invert cause and effect. I’m pretty sure the tweet author is implying they want you to find a hash key collision for each node. Hope you have a couple spare universes in your pockets because this is gonna take a while.

This doesn&#x27;t actually invert a Merkle tree though, since you have to recompute the hashes (except the leaf hashes) when you invert a Merkle tree. Gonna be a no-hire evaluation from me dawg.

I&#x27;m not well versed in assembly, so learning assembly first would be the hard part!I&#x27;m with GP, it&#x27;s fun seeing how solutions differ between languages as a way to peek into other language communities I don&#x27;t spend as much time in.

The hard bit of solving them is usually the algorithm though - when you know that you can code it in anything.

I love seeing people solve leetcode challenges in asm, are there any more blogposts like this?

Oooh. So that&#x27;s why people invented C.

Inverting a binary tree is easy; express the tree as a matrix (laplacian), invert the matrix, then convert that back to a tree. What the canonical question is asking is not inversion.Since too many people were memorizing inversion, I switched to asking how to evert a binary tree. This leads naturally into a discussion of the 1:1 relationship between complex numbers and cohomology sets, I figure if somebody can get that right, they can be a junior programmer on the team.

I don&#x27;t get why this one is the meme. Just because it&#x27;s recursion? Because it&#x27;s (nearly) pointless? There are so many other algorithms I find more difficult&#x2F;more tedious.

Is there a practical reason to do this in a real-world program?

&quot;Flip&quot; or &quot;mirror&quot; is probably a better term. It seems the goal is to swap left and right: <a href="https:&#x2F;&#x2F;leetcode.com&#x2F;problems&#x2F;invert-binary-tree&#x2F;" rel="nofollow">https:&#x2F;&#x2F;leetcode.com&#x2F;problems&#x2F;invert-binary-tree&#x2F;</a>

What does it mean to &quot;invert&quot; a binary tree?

You&#x27;re alluding to using the Morris traversal algorithm which can traverse a binary tree in O(1) space, but Morris traversal is actually much much slower than using a stack, especially as is used by this algorithm. Doing a Morris traversal requires at a minimum twice the number of operations as using a stack, and due to its cache unfriendly nature will in practice be closer to 4x slower.You typically only use Morris traversal on exceptionally large trees, and by large I mean when working with data that lives on a disk. It&#x27;s definitely the exception, not the norm.

That solution is terrible, with a bad algorithm that requires O(tree_height) space (the optimal one involves temporarily using left&#x2F;right pointers as a parent pointer so that you only need constant space) and lacking any sort of assembly optimization, being worse than what a compiler would produce (e.g. it&#x27;s a real mystery how the author managed to decide that local_right should be spilled on the stack).Definitely not what you want to submit to someone testing your programming skills.

I was thinking of this from the perspective of CPU pipeline pressure, but in reality it seems prosessors are indeed smart enough to avoid burdoning the ALUs execution with these kinds of special cases.Read more here <a href="https:&#x2F;&#x2F;stackoverflow.com&#x2F;questions&#x2F;17981447&#x2F;microarchitectural-zeroing-of-a-register-via-the-register-renamer-performance-v&#x2F;18027854#18027854" rel="nofollow">https:&#x2F;&#x2F;stackoverflow.com&#x2F;questions&#x2F;17981447&#x2F;microarchitectu...</a>&gt; [...] these zeroing instructions extremely efficient, with a throughput of four zeroing instructons per clock cycle.Also, the xor instruction takes up the smallest amount of .text space (right?).

Yes, it’s 1 cycle but it’s longer to decode and occupies more of l1i cache. It’s not all about execution cycles.

You can use the 32bit xor to reset the register. Also TEST REG,REG might be better for checking if it’s zero.

For learning assembly, usually you learn the syntax for your assembler. The rest of it (the majority of it) is then learning what instructions are available on your platform.I liked <a href="http:&#x2F;&#x2F;rayseyfarth.com&#x2F;asm&#x2F;" rel="nofollow">http:&#x2F;&#x2F;rayseyfarth.com&#x2F;asm&#x2F;</a> as an intro to both. I&#x27;d already had a class on computer architecture that did assembly before that, though.Once you get going with that, you can download and read the Intel or AMD programmers manuals. Of course, this assumes x86_64.

So anyone have good recommendations on learning assembly? I followed the link to the instructions ABI but can’t really read on my device.

Wait, so this whole mystical inverting is just swapping left and right children of all nodes?