Except, the other day I wanted to convert some units and the AI results was having a fucking stroke for some reason. The numbers did not make sense at all. Never seen it do that before, but alas, I did not take a screenshot.
Usually I’ll see something mild or something niche get wildly messed up.
I think a few times I managed to get a query from a post in, but I think they are monitoring for viral bad queries and very quickly massage it one way or another to not provide the ridiculous answer. For example a fair amount of times the AI overview just would be seemingly disabled for queries I found in these sorts of posts.
Also have to contend with the reality that people can trivially fake it and if the AI isn’t weird enough, they will inject a weirdness to get their content to be more interesting.
Those LLMs can’t handle numbers, they have zero concept of what a number is. They can pull some definitions, they can sorta get very basic arithmetic to work in a limited domain based on syntax rules, but it will mess up most calculations. ChatGPT tries to work around it by recognizing the prompt is related to math, passing it to a more normal Wolfram-Alpha style algorithm, and then using the language model to format the reply into something more appealing, but even this approach often fails because if the AI gets confused for any reason it will feed moronic data to the maths algorithm.
LLMs don’t verify their output is true. Math is something where verifying its truth is easy. Ask an LLM how many Rs in strawberry and it’s plain to see if the answer is correct or not. Ask an LLM for a summary of Columbian history and it’s not as apparent. Ask an LLM for a poem about a tomato and there really isn’t a wrong answer.
Yeah, I never get these strange AI results.
Except, the other day I wanted to convert some units and the AI results was having a fucking stroke for some reason. The numbers did not make sense at all. Never seen it do that before, but alas, I did not take a screenshot.
Usually I’ll see something mild or something niche get wildly messed up.
I think a few times I managed to get a query from a post in, but I think they are monitoring for viral bad queries and very quickly massage it one way or another to not provide the ridiculous answer. For example a fair amount of times the AI overview just would be seemingly disabled for queries I found in these sorts of posts.
Also have to contend with the reality that people can trivially fake it and if the AI isn’t weird enough, they will inject a weirdness to get their content to be more interesting.
Those LLMs can’t handle numbers, they have zero concept of what a number is. They can pull some definitions, they can sorta get very basic arithmetic to work in a limited domain based on syntax rules, but it will mess up most calculations. ChatGPT tries to work around it by recognizing the prompt is related to math, passing it to a more normal Wolfram-Alpha style algorithm, and then using the language model to format the reply into something more appealing, but even this approach often fails because if the AI gets confused for any reason it will feed moronic data to the maths algorithm.
What do humans do? Does the human brain have different sections for language processing and arithmetic?
LLMs don’t verify their output is true. Math is something where verifying its truth is easy. Ask an LLM how many Rs in strawberry and it’s plain to see if the answer is correct or not. Ask an LLM for a summary of Columbian history and it’s not as apparent. Ask an LLM for a poem about a tomato and there really isn’t a wrong answer.
Meanwhile, GNU Units can do that, reliably and consistently, on a freaking 486. 😂