upvote
+1, same experience, switched model as I've read the news thinking "let's try".

But it spent lots and lots of time thinking more than 4.5, did you had the same impression.

reply
I didn't compare to that level, just had it create a plan first then implemented it.
reply