
TLDR explains what a piece of code does



September 22, 2022


FWIW, when I'm doing a code review, these are the exact kind of comments that I would tell a committer to remove.

That is, it's like it generates these kinds of comments:

    // initializes the variable x and sets it to 5
    let x = 5;

    // adds 2 to the variable x and sets that to a new variable y
    let y = x + 2;
That is, IMO the whole purpose of comments should be to tell you things that aren't readily apparent just by looking at the code, e.g. "this looks wonky but we had to do it specifically to work around a bug in library X".

Perhaps it could be useful for people learning to program, but otherwise people should learn to read code as code, not "translate" it into a verbose English sentence in their head.


I don’t think the intent of this tool is to generate comments which you’d then embed into the code it describes. I think it’s meant to explain, in plain language, what the actual behavior is (for whatever confidence level you might assign to “actual” and “is”).

To your point about the utility of code comments describing the behavior this way, I agree it’s probably much more valuable for beginners. In fact when I’ve mentored early programmers, I sometimes ask them to write out essentially prose like this in comments before writing a single line of executable code.

Now, I’m far from a beginner. I’ve been considered a senior engineer long enough that friends discourage me from disclosing the amount of time, for fear of age discrimination. I can absolutely see the potential of this tool as part of my IDE. I’m on vacation now, but when I return to work I plan to take it for a spin as an aid for refactoring areas of code which clearly work as intended (well, for the most part) but the actual behavior and intent is much less clear.

Here’s why I think it’ll be valuable for refactoring: it can help limit the amount of mental context switching necessary to build a mental model of what the code does. I often find myself trying to produce prose much like this for my own reference, but I end up losing context as fast as I acquire it as I follow references into their respective rabbit holes. Having the tool do that for me can help me stay in a single area of focus. It could also be a useful reference for adding and improving type definitions, maybe even regression tests.

The best part is that it doesn’t, from what I’ve seen, do anything besides populate ephemeral annotations. It doesn’t try to write code or automate anything other than producing a narrative. Like at least one other commenter, I’m skeptical about the reliability of that. But unlike that commenter, I’m willing to take the risk… probably because I’ve learned to be skeptical of my own reliability performing the same task. If my instinct is right that I can use this tool the way I hope, I’ll still scrutinize it for accuracy. But that’s potentially much better than having only one imperfect, meat-based computer doing the work.


To be clear, I'm not really arguing that the intent is to embed comments. But I'm just going by the "validateSignature" example they give in the video. All that does is generate a 1-1 mapping of each code line to an English comment. TBH I would expect a senior developer to just be faster reading the code.

Perhaps there are some other examples that explain more of a "structural understanding" of the code, but I'm skeptical without more evidence.


> TBH I would expect a senior developer to just be faster reading the code.

TBH, I would also expect this tool to be a lot more affordable than a senior developer for language X.

Plus, chances are good that your senior developer for language X will barely scrape by as an extremely junior developer for language Y that you also use in some capacity, and that there is a problem which involves stuff written in both X and Y. Also, many languages have horrid gotchas, where innocuous-looking code does something quite unexpected. Like the C++ classic log->debug("Timeout: %d", config["timeout"]). If some tool were to actually tell you "Writes a debug log with the current value of timeout, if it exists, or otherwise sets it to 0", well, that would be pretty useful for someone who has only superficial knowledge of C++.


> TBH I would expect a senior developer to just be faster reading the code.

That specific code? Sure. Real world code that’s gotten more convoluted over a decade of maintenance?

I have a dozen pages of notes I’ve taken on the apparent behavior of a single function, all of its call sites, all of the known possible kinds of states it might encounter, all of the categories of implications those states might carry and the kinds of downstream effects its return value might have. It’s not even a particularly large function as far as those go. The notes aren’t even complete, if I had to guess they’re 1/3 there. To be complete they’ll span not just several modules but cross package and language boundaries and even repo migrations. Not because the function itself is that complex (although it’s much more complex than I’d prefer), but because the universe of its inputs and usage is enormous and the history which produced it is long and just as convoluted. And, importantly, because the space of very similar behaviors and functions I’ve discovered has grown each time I peel a layer of the onion off.

Now when I go back to work, given I get to continue untangling this thing, having a tool which helps even explain this universe would be a godsend for actually acting on it in a reasonably safe way. Especially if it does so with consistent language. I could literally shove it into a database and query it, or do all kinds of other analysis. I can’t do that if I’m spending all of my energy trying to just describe the thing in my own words with incomplete understanding. I mean, I can. The documentation I’m describing literally began with me wrapping up the previous day’s notes with “humans built this, a human can understand it, model the damn thing”. But on what timeline? And is it a good use of my time to do it when a machine could probably do it more quickly and hopefully just as reliably at scale?

I’d be happy to share the evidence if my instinct proves out.


Yeah, I find the signature example kind of unimpressive because it can just pick up the meaning of the code from the function name and variables, which an experienced human can do just as easily. If it could get "validates a signature with secret key" when all the variable names are obfuscated it might be useful in some code bases.


Why could you not create a comment block above the code section in question and fold it? All IDEs nowadays support folding, including comment-block folding. To see what a section of code does, you unfold the comment block above it, read it, then fold it back.


But if it can be generated on the fly, why persist it to the code at all?


I completely agree with your point here. The extreme pathological case is when you see things like this

   i += 2 // Increments i by 1
Now you're screwed. Added bonus - the commit message introducing this change is "Yay! Fixed all the tests!!!!"


The fortune command was kind enough to provide me with this yesterday:

"When code and comments disagree, both are probably wrong", Norm Schryer.




pls no, stop


Another way to look at it is that it doesn't (AFAIK) have external context, so there are severe limits on what it could possibly infer from some code. Like you say, if something unusual is being done because of a library, or business rules, or something, "AI" cannot take this into account. There may be some cases where something non-obvious can be distilled out of the code, though I agree that the stuff you can infer without context is mostly self-evident from the code anyway.

Edit: just thinking, the canonical example would be something like the fast inverse square root from Quake. Is it going to summarize, or is it going to tell you

  i = 0x5f3759df - (i >> 1);

  // Shift i right by one and subtract it from 0x5f...
(It would be cool if it does work here - though even if it does, when someone makes up a new thing like this, it couldn't possibly comprehend why it's being done)


Fortunately for me, at my current job I don't see a lot of code that makes me say "what the hell is this even trying to do?" The most useful comments are the ones explaining limitations and why this implementation was chosen, or business rules that wouldn't be apparent in the code.


Summarized by the common advice "comments should explain the why, not the what/how".


Seeing as its knowledge base is built from similar code pulled from the internet, I'd find it very likely to get the reference and interpret that as the Quake fast inverse square root. Same with general context/library stuff - though obviously that's conditional on the quality of its training data.

e.g. it would probably "understand" common devops/organization/OS/library stuff even if it's not part of the language it's presumably reading - just because other users probably left comments on those lines similarly - but it's not gonna necessarily understand your application-specific business logic beyond what's actually happening. Would need some very specific examples though (by definition lol). Even dumb stuff like interpreting "make the div spin around" from some CSS/JS would probably work, as someone somewhere probably coded that similarly.


GPT3 knows this is the fast inverse square root. Don't know about codex though, guess it does too.


I'm always amazed by what AI can do, but never amazed by the actual output (amazed within the context that this is computer generated). Not just Copilot but any tool with too much intelligence of its own (DALL-E, Midjourney, etc.) feels like this, because it reminds me of a person with a great talent for compositing stuff who doesn't know what they are doing.

It's the AI-generated-papers-getting-published-in-prestigious-journals situation all over again. At a glance it looks amazing, but the machine definitely doesn't have any kind of intelligence and the output is actually worthless.

This AI stuff works really well when it does something very specific that is quickly inspectable by a human - for example, generating interpolated frames in videos, extending a pattern, or detecting anomalies.

The moment it strays away from human control it fails amazingly well.


You are grossly over-simplifying what this is doing. Nowhere does it say anything as simple as setting x to y. In nearly every case it takes the context of the variables into account and states the meaning of the function calls, not the meaning of the syntax.


Yes, the examples I gave are gross over-simplifications. But look at the attached video. It literally just gives a 1-1 mapping of each code line to an English sentence, and that code is pretty trivial to understand in any case. I mean

    $data = file_get_contents(filename: 'php://input');

    "We're getting the raw request body"
I still argue that developers should be able to read the raw code like that faster than turning it into an English sentence. I'm not saying the text generation isn't extremely impressive, I just don't think it's that useful.


Ironically, your example would be a pretty great case for someone like me, who occasionally uses PHP for fun but doesn't really use-use it.

    $data = file_get_contents(filename: 'php://input');
I would definitely have to google what this means, like why is it opening a file and how is 'php://input' a file?

    "We're getting the raw request body"
Is actually a good explanation.

So, someone new to a project, and maybe unfamiliar with the tech, might find this useful.


> I just don't think it's that useful.

Even for experienced developers, it is useful when navigating a new codebase (where you don't know what each API call does) or even a new programming language (where a particular construct may be unfamiliar, like Rust's if let) - well, supposing the tool is accurate (incorrect results may make the tool useless or harmful).

But if you're discounting the usefulness for beginners I think this is a mistake. The experienced devs of tomorrow are the beginners of today.


The tweet and video don’t seem to imply this _should_ be a comment.

I have been “learning to program” for 20+ years and would absolutely find this useful as a quick way to get basic information about a chunk of code I’m unfamiliar with.

Not that learning to read code isn’t important, just not always necessarily worth the time (:


My Rule of comments:

1. Write comment first, code later.

2. Tell the intent of code, not the instructions to achieve it


Agree. I find, what you usually want to understand is not the "what" or "how" but the "why", and that is quite a bit harder to automate than translating syntax into natural language statements.


Yeah, the 'why' often literally doesn't exist as information in the code anywhere, e.g. business rules, domain-specific knowledge, etc. If you're extremely lucky it might exist as 'why comments' :)

I do wonder if ML could be applied if you actually also train it on your particular (large) application, where it might be able to 'cross-reference' domain knowledge present in commented code to other similar code. (But then you might want to just remove the pseudo-duplication anyway.)


I caused a bit of an "incident" at a web dev company I worked at many many years ago by removing commit access from one of the "technical managers", whose hobby was removing every single comment he could find.

"But the code shouldn't need comments," he'd complain, "it should be obvious what it does otherwise it's just bad code!"

Yes, you dobber, it *is* obvious from the code, the *how* is obvious, but the why and the what might not be. The comments explain what it's doing stuff to and why, and in particular why you'd want that and why it's important. Disk space is free, so in a big long comment just write up what that particular bit of business logic is intended to achieve and what it expects as inputs and outputs.



I bet they also pulled out the "but the code will change and then the comments become invalid, or even *gasp* misleading"?


I just don't trust it, I've worked with GPT-3 before and it sure does a real good job of sounding convincing, but if you don't understand the code there's no way to know if what it's saying is accurate, or whether it's just regurgitating random nonsense that sounds plausible.

It knows how to create sentences that sound like something a human would write, and it's even good at understanding context. But that's it: it has no actual intelligence, it doesn't actually understand the code, and most importantly, it's not able to say "Sorry chief, I don't actually know what this is doing, look it up yourself."


The Underhanded C Contest is a great practical demonstration that even "biological intelligences" have a difficult time reading and summarizing code. I wouldn't trust this thing further than I do comments, but I could see it being equally as useful.


There's an example right in this example: the line it translates as "we're getting the raw request body" doesn't work on multipart/form-data.

I can easily imagine the reason you're looking at an unfamiliar function in an unfamiliar language (hence needing such a line-by-line translation) is that there's some sort of bug and that edge case is exactly why. The tool would mislead you into thinking it's one of the other lines, because of how simple its translation is.


> and it sure does a real good job of sounding convincing, but if you don't understand the code there's no way to know if what it's saying is accurate

This is also the story of the past decade of "progress" in machine translation between natural languages.


This is also the story of the past several millennia of "progress" in human translation between natural languages.

You trust a human translator because their ability has been partially verified by an authority (language certificate) and many people have employed their services with few complaints. Similarly, you trust a translation program because it was partially verified (hidden test cases) and many people use it with an acceptable amount of complaints (given its convenience).


False equivalence. It may not be any easier for you to verify a human translator's output, but that's a far cry from being structurally incentivized to produce output which appears fluent in the target language at the expense of being confidently wrong.


> it has no actual intelligence

This is a prime example of the moving goalposts of what intelligence "actually" is - in previous eras, we would undoubtedly have considered understanding context, putting together syntactically correct sentences, and extracting the essence from texts to be "intelligent".


Whether this thing is worthy of the label of "intelligent" or not is fairly uninteresting. What matters for something like this is its accuracy and if it can be trusted - that is what I think OP is getting at.


I mean, I feel the OP shouldn't say something contentious like that, then.


Have you ever read "A Canticle for Leibowitz"? A peripheral bit in the story has a monk develop a mathematical system to determine what word would come next in a manuscript whose edge has been lost. Walter M. Miller, writing that story in 1959, does not portray such a system as having or being perceived to have "actual intelligence", because he can easily imagine that a complex system could appear to work in that way without intelligence.

The goalpost was never in that spot.


Does it do all that, or does it just pretend to understand context and extract the essence from texts? It looks as if it does because it follows the form you'd expect an answer to have if the person is intelligent. But when you look more closely, it often falls apart.

It reminds me of people who use "big words" without actually understanding them. If they don't overdo it or really miss the meaning of a term, they can seem much more educated than they are.


> understanding context

You're asserting here that it understands context, but you haven't provided any argument in support of that assertion.

I think you'll also need to define what you mean by "understanding" (because that term is loaded with anthropocentric connotations) and clearly state what "context" you think the model has.


How come general developer audiences aren't more acquainted with GPT-3 (and Codex in particular) capabilities? People in the twitter thread all seem completely mind blown over an app that basically just passes your code to an existing API and prints the result.

I don't want to sound negative of course, and I expect many of these apps to keep coming up, until Codex stops being free (if they put it on the same pricing as the text DaVinci model, of which Codex is a fine-tuned version, it will cost roughly a cent per query). I'm just wondering how the information about this type of app reaches most people way before the information about "the existence of Codex" reaches them.

For all the publicity around Codex recently (and especially on HN), it still seems like the general IT audience is completely unaware of the (IMHO) most important thing going on in the field.

And to anyone saying "all these examples are cherrypicked, Codex is stupid", I urge you to try Copilot and look at its output with a ~2019 perspective. I find it hard to believe that anything but amazement is a proper reaction. And still, more people are aware of the recent BTC price than of this.

Source: have been playing with the Codex API for the better part of every day for the last few weeks. Built an app that generates SQL for a custom schema, and have been using it in my daily work to boost my productivity as a data scientist/engineer/analyst a lot.


MS has been trying to get AI into intellisense for years now and I always turn it off.

The lack of control over it just makes it annoying. In many ways it's faster to just type out the algorithm than it is to lay the algorithm out and spend the time trying to understand what's there so I can successfully convert the code to what I need.

Then there's the lack of stability. Yesterday it did something different from what it's doing today, so I can't even use muscle memory to interact with it anymore.

Intellisense has _always_ had that annoyance factor of getting in your way sometimes, forcing you to write code in a certain way to minimize that. All this just makes it more annoying and I don't believe anyone who claims it truly makes them more productive.


FWIW I’ve been using Copilot now for a while and I have to do very little laying out; usually just from a name and context it will give me 80% of what I want, and then it’s much quicker for me to just edit it into the correct form if need be. My productivity has increased very heavily because of the amount of rote boilerplate I can now just completely obviate.

I think you should be careful to realize that though it may not fit for you intellisense is very helpful for a lot of people and that it may be your tastes as to what you find annoying that do not generalize. I for one don’t even notice the things you’re saying bother you because the mental overhead to me is very little. Just quick glance, tab to auto complete if it’s useful otherwise keep typing.


What you're describing in terms of boilerplate can be done via snippets.

Think of it like parenting.

I can either convince the little tyke to clean up his room or do it myself in half the time. Only with babysitting I may take the time to teach them something; with things like Copilot I have no such motivation.

I don't even like autocompleting brackets and the like. It flat isn't consistent enough for me. It's ok-ish when writing fresh code, but completely gets in the way when editing code to the point that it's easier and faster for me to just disable it.


What I am talking about has nothing to do with Intellisense or your workflow. What I am saying is that if someone in 2019 told you that there is a "thing" able to take a very complex sentence and, with high accuracy (and awareness of the database details), generate 50 lines of SQL using CTEs, complex JOINs, subqueries, string formatting, date manipulation, etc., you would have been amazed. That thing now exists, and it didn't exist before. It is a complete phase shift and cannot simply be viewed as an incremental improvement. This is a whole different beast.

Using this beast as intellisense is just one application (called "Copilot") and it has all these annoyance factors sometimes. But I am not talking about that.

To me, this is as if we found a way to transform iron into gold with low energy usage, and people are complaining that gold is not that useful. And most chemists haven't even heard the news. I'm constantly amazed by this, every single day, as I read threads like this one.


Thank god, finally found someone with the same issue. This is incredibly frustrating.

I'm also wondering why the news coverage of this is so abysmal. In the Netherlands there is very little if any awareness of this, in my bubble at least. (We all seem to very aware of every goddamn bowel movement of every soccer player.)

Even my government seems to have only very recently become aware that using data and generally working in a structured manner is preferable to just winging everything all the time. Some departments are even starting to use basic statistics, which some even call AI. Nobody is quite sure what anything means or how to make sense of it, and all these high-level decision makers studied history, administration, or something legal. Absolutely nobody with a clue about anything beyond '80s tech - if even that. It's downright disturbing to see this immense gap, and we are supposed to be somewhat advanced. But I digress..


I can only speak for myself, but it just hasn't been around long enough for me to properly trust any AI-driven tool to give me correct output for anything important.

I'll admit I haven't played with Copilot yet (since I don't think my employer would be happy for me to send off proprietary code to third-party servers, so I've effectively self-banned myself from using it at work*), but I'd feel that for anything non-trivial like your example of complex SQL queries I'd be reluctant to use the generated output without extra scrutiny (essentially a very fine-toothed code review, which is exhausting).

My opinion will probably change as the tools become more mature, but for now I'm treating them as toys primarily which limits the excitement.

Something like TLDR is less risky as it's not producing code, just summarising it, but I'd still feel wary to trust it since it's such a new field. Maybe this speaks more to my own paranoia than anything else!

EDIT: *and on this topic while I'm here: I'm actually a bit confused (and honestly... jealous?) on the topic of privacy for these kinds of external models. Is everyone who's using Copilot and tools like this working at non-Bigcos? Or just ignoring that it's sending off your source code to a third party server? Or am I missing something here?

It'd be against the rules to use external pastebins or other online tools that send off private source code to a server, so I'm kind of shocked how many devs are talking about how they use AI tools like this at work... is this just a case of "ask for forgiveness, not permission"?


I’ve heard AI researchers describe this phenomenon before. As soon as something is discovered or invented it immediately becomes trivial and boring. The goalposts shift, and now they have to find the next amazing thing that will suffer the same fate.


I think starlink is a bigger deal by far.


> Intellisense has _always_ had that annoyance factor of getting in your way sometimes, forcing you to write code in a certain way to minimize that. All this just makes it more annoying and I don't believe anyone who claims it truly makes them more productive.

I have the same issues with these tools, but the one situation I can imagine it being really useful is people who are good at reading and understanding code, but are slow typists. Or more particularly, people who have to think about typing, no matter what the speed is (though I think they're usually the slower ones). I believe it's only once you can type without thinking about typing, and have done it for a while, that these tools become an annoyance because you've gotten used to not interrupting your thoughts on the problem at hand.


I have no use for it, and don't expect to ever have a use for it. 95% accurate, 99% accurate, and 99.9% accurate are all awful in this context.

It's something run repeatedly, so small chances will occur. Among its failure states is being very, very wrong in ways that are hard for a skilled human to detect without more work than writing from scratch.

And no one in the space is discussing ways to eliminate categories of bugs, only ways to reduce the frequency. Most of those solutions have the side effect of making the less frequent bugs harder to detect. On balance, that's worse.

And, less importantly, it's only useful for writing boring code that should probably be generalized to an API. Sure, I write plenty of that, but it's not an exciting area to follow in my spare time.


> 95% accurate, 99% accurate, and 99.9% accurate are all awful in this context.

Really? What exactly is the bar then? I'd say most professionals I know hover somewhere between 95% and 99%.


Assume it's per invocation, and each invocation generates a few dozen line function. How many such functions do you write when you get in the groove? If you multiply it out, you'll probably end up expecting a few bugs a day at 95%, similar to most humans might write.

Except you're pretty used to the sorts of bugs you write, and the AI isn't you. So these bugs will be harder to find.

So why is this better than writing by hand? Most of the hard work of programming is figuring out specs and debugging, not banging out well understood and specced implementations.


What is the denominator? Lines of code? Functions?


> It's something run repeatedly, so small chances will occur. Among its failure states is being very, very wrong in ways that are hard for a skilled human to detect without more work than writing from scratch.

Also makes for a great plot summary for the original Jurassic Park


I was aware of and use Copilot, but I didn’t realize it was built on top of Codex. And I wasn’t even aware Codex existed until you commented.

I read hn pretty regularly but unless you’re really excited about the AI space a lot of this news washes over you and you mostly ignore it.


This is what amazes me, since it seems like such big news, and people in the field are just not aware of it. Just for reference of what I am talking about, here is a piece of code that was generated, without any cherry picking at all (you just have to trust me on this, sorry) by allowing Codex to be aware of the database with some smart prompting (this is on a DB with music store data):

Q: Best selling artist per country


Needless to say, this query works and returns the data I wanted. Whether this is useful or not is up for discussion. But I cannot understand how it's not amazing.


What happens if an Invoice has more than one Line? Looks like it will count the Invoice total for each line instead of each line's amount.

If each invoice has 1 line, why does a line table exist?


so it was smart enough to know that the total for the invoice could be safely used as the total sales for the artist because an invoice will only ever contain a single line item? no double counting because of multiple line items or misattribution because of different artists. that's a pretty crazy level of reasoning.


I'd love to try Codex if I could run it on a local GPU and finetune it for my own code. I'd even push to use it at work. But as we're writing in a niche language and our code is heavily problem domain dependent, I don't feel like making my workflow vulnerable to an external supplier, even aside the IP concerns.

Call me when I can download and finetune the weights, like I can with Stable Diffusion.


The problem is that they don't understand the practical ways it can be used. Even tech savvy people don’t yet get it. Even my CEO, a kind of technical person, did not understand the full potential until I explained some use cases.

In one scenario, I took a slow-running, long MySQL query and rewrote it with Codex in 2 mins.

But I think people have started to realize the potential now.

Pitch: My app brings GPT-3 and Codex to all applications in MacOS. Many business professionals are using it.


The question is - can this actually explain the code which really needs explanation - or can it only explain code that should be easy and straight forward to read anyway?

And does having this reduce the amount of discomfort that badly readable code creates, and thus make you less inclined to take care that the code is, and stays, easily readable?


Why not just type the code into DALL-E 2 and have it paint a picture of what the code does?


A little open to interpretation, and the fingers will be fucked up.


So it translates the code into COBOL. That's awesome.


... until it gets to

    i  = 0x5f3759df - ( i >> 1 );


TBF, that is actually a situation where the big pattern-matching trained system would probably easily find and regurgitate the correct answer, just from prior exposure to a very distinctive bit of code.


What would its answer be based upon? Seeing that "very distinctive bit of code" doesn't provide the "TLDR"-style answer ...


> what would its answer be based upon?

I'm assuming that if you throw enough training-data at it, it will have seen the same "equation" (or at least the constant) right after an explanatory code-comment.


If you Google "0x5f3759df", the first result is the Fast Inverse Square Root Wikipedia article.

Other similar situations would probably confuse it, but 0x5f3759df is pretty famous at this point.



4. What the fuck?


And returns the approximate inverse square root.


// what the fuck?


The kinds of comments that are useful are less about what the code already tells you is happening, and more things like:

/* we don't use the actual price but the discounted price, as per email from Manager Bob on 2022-09-16 */

subtotal += price * customer_discount_factor;


/* note there's a 2ms delay while relays settle; this is subtracted from sample time, so timeout is not what you might expect */



Can we get this for legal documents? Maybe with the ability to spot things that might be loopholes?


About license agreements, there was a program called EULAlyzer (

edit: Comment modified since I thought the submitted link was about the tldr pages rather than another tool. For something like the tldr pages but legal-wise, there's (

Not only do I find the tool not useful, as it just states the obvious; my personal opinion is that the code should already be very near what the tool gives. The code should be clear enough not to need such a tool. If you need it, you have a very different problem, my friend.


Yep. Not only does it generate useless comments; for me it is actually easier to read the code itself than the generated comments in this case. I don't know either the language or the framework they use, and still it is completely readable.


Garbage. It generates a line-by-line translation of code into English, rather than a concise summary, so we end up needing a TLDRTLDR.

Actual human documentation would read something like:

    > Return true if the X-HELPSCOUT-SIGNATURE request header matches the
    > base-64 encoded SHA1 hash of the raw request data.




TLDR, but you actually end up reading even more than the original. I could be wrong and this might actually work to condense a big function, but if that is true, why showcase such an example?


It’s helpful if you can read English but the code is difficult to understand. Most explanations of code are more verbose than the code they’re explaining because code is usually pretty terse compared to natural language.

You can think of “too long” as referring to the time it might take someone to reason out a particularly terse, dense line of code versus the actual length of the code.


Something like this could be helpful if the stumbling block is the syntax. If the output consistently looks like the example, though, it's not going to be much help explaining the longer tail of straightforward code that simply implements hard-to-understand logic.

I can see this functionality being useful to explain dense, ungooglable code, like regex, or maybe APL. That said, I couldn't really trust current-generation ML to actually produce a correct explanation instead of being confidently and wildly wrong.


In this particular example, at least for me, it is easier to read the original code, even though I don't even know the language they use. And I'm pretty sure this is the case for most developers with non-zero experience.


Yes, but there's quite a good chance the translation is wrong, so you'll probably need to read the code anyway.