Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Seriously... In that 134 replies thread, 0 transcripts showing actual performance degradation. Just endless "Yes, it seems bla bla." No evidence but just shapes in the clouds.


I don't have a transcript, but I tried (when GPT-4 was initially released) passing it a riddle encoded with a caeser cipher that was base 64 encoded. I gave it the prompt "This is a riddle that is encoded in some way, solve it" and it managed to do so.

Now it can't even do just the caeser cipher without hallucinating nor can do it do even purely base64 decoding without hallucinating.


Here's the best fish I could make at the end of March: https://www.svgviewer.dev/s/P1vPxB8t

Here's the best fish I could make today: https://www.svgviewer.dev/s/3IuulHlC

Make of that what you will.


RealistCC posted a pair of transcripts in reply 41. I haven't read the rest of the replies.


So you just skimmed the thread. There are comparisons, there are specific transcripts. There are examples without transcripts.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: