""Tasks that seemed straightforward often took days rather than hours, with Devin getting stuck in technical dead-ends or producing overly complex, unusable solutions," the researchers explain in

Miguel Afonso Caetano@tldr.nettime.org · 11 days ago

"Tasks that seemed straightforward often took days rather than hours, with Devin getting stuck in technical dead-ends or producing overly complex, unusable solutions," the researchers explain in

Riskable@programming.dev · 11 days ago

Devin must’ve been trained on enterprise code.

Ioannis Konstantoulas@mathstodon.xyz · 11 days ago

@remixtures@tldr.nettime.org So they are really going to take devs’ jobs!

mindbleach@sh.itjust.works · 11 days ago

If you mean to shit on how it works now, and tell idiot managers not to throw money at it, shout this from the rooftops.

If you mean to suggest it’s going to stay this dumb forever then you haven’t been paying attention.

“It rarely worked” acknowledges that it does sometimes work. That’s where quite a lot of money and effort will be expended. We’ve invented a program that writes other programs. You can just describe code into existence, based on high-level goals, in plain English. Even if human code forever remains the better option - it is now an option.

This whole evolutionary branch might still get snipped. LLMs hallucinate by design. We’ve proven you can get spooky results out of a neural network, a Library Genesis torrent, and a roomful of GPUs. We’re seeing the local maximum for this shape of network. The global maximum damn well ought to code better than us, for the same reasons and to the same degree a calculator can multiply better than us.

Mirko Tavosanis@mastodon.uno · 11 days ago

@remixtures@tldr.nettime.org not surprising at all!