ChatGPT Tests Into Top 1% for Original Creative Thinking

Veritas@lemmy.ml · 1 year ago

Veritas@lemmy.ml · 1 year ago

Embarrassing, considering how uncreative and unoriginal GPT-4 is. It’s an actual struggle to get ChatGPT to think outside the box. Claude 2, on the other hand, is much better at it.

But this goes to show how unimaginative the general population is if this truly is the case.

SpikesOtherDog@ani.social · 1 year ago

I have been playing with ChatGPT for tabletop character creation. It’s not bad at coming up with new ideas. It is terrible at sticking to the rules of the game.

Veritas@lemmy.ml · 1 year ago

The context window is still too short for any story. The models just forget old messages and only remember the most recent context.

SpikesOtherDog@ani.social · 1 year ago

That makes sense. The further back the information was, the harder it was for it to recall. Its answer wasn’t to think harder, but to fill in the gaps.

SootyChimney [any]@hexbear.net · 1 year ago

They used the “Torrance Tests of Creative Thinking”, a pseudo-scientific test that measures and evaluates absolutely nothing of any objective measure or value.

gravitas_deficiency@sh.itjust.works · 1 year ago

Hah, yeah, that was my kneejerk reaction too: I read that as “the metric we use to determine creativity was found to be wildly inaccurate, with ML regularly placing in the 99th percentile”.

BabaIsPissed [he/him]@hexbear.net · 1 year ago

evaluating LLM

ask the researcher if they are testing form or meaning

they don’t understand

pull out illustrated diagram explaining what is form and what is meaning

they laugh and say “the model is demonstrating creativity sir”

looks at the test

it’s form

gravitas_deficiency@sh.itjust.works · 1 year ago

I read that as “current metrics we use to determine creativity are found to be wildly inaccurate, with ML regularly placing in the 99th percentile”.