by kingstnap 11 hours ago

by kingstnap 11 hours ago

Btw, if anyone is wondering: GPT 5.5 produces the same garbage as 5.4 mini at 4 times the cost.

Fable manages to make a reasonable game, at a cost of 40 cents.

  X X O O O
  O O X X X
  X X X O O
  X O O X O
  X O X X O

by ubanholzer 11 hours ago

Nice idea. I just asked Haiku to do the same in Claude Chat on iOS: it created an interactive React game, implemented the rules, and let it play. Clever move for $1 input and $5 output, Anthropic!

by a_c 9 hours ago

While LLMs are bad at games, they are perfectly capable of writing an RL agent to train on the game itself.
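For a sense of how small such an agent can be, here is a rough sketch of tabular Q-learning trained by self-play. The game in this thread isn't specified, so this assumes a stand-in: single-pile Nim, where players alternate removing 1 to 3 stones and whoever takes the last stone wins. The names `train` and `greedy_move` and all hyperparameters are illustrative choices, not anything from the thread.

```python
import random

ACTIONS = (1, 2, 3)  # stones a player may remove per turn (assumed rules)


def legal(n):
    """Moves available with n stones left."""
    return [a for a in ACTIONS if a <= n]


def train(max_pile=10, episodes=5000, alpha=0.5, epsilon=0.3, seed=0):
    """Self-play Q-learning; Q values are from the mover's point of view."""
    rng = random.Random(seed)
    q = {(n, a): 0.0 for n in range(1, max_pile + 1) for a in legal(n)}
    for _ in range(episodes):
        n = rng.randint(1, max_pile)  # random start so every pile size gets visited
        while n > 0:
            moves = legal(n)
            if rng.random() < epsilon:
                a = rng.choice(moves)  # explore
            else:
                a = max(moves, key=lambda m: q[(n, m)])  # exploit
            rest = n - a
            if rest == 0:
                target = 1.0  # taking the last stone wins
            else:
                # The opponent moves next, so their best value counts against us.
                target = -max(q[(rest, m)] for m in legal(rest))
            q[(n, a)] += alpha * (target - q[(n, a)])
            n = rest
    return q


def greedy_move(q, n):
    """Best known move with n stones left."""
    return max(legal(n), key=lambda m: q[(n, m)])


q = train()
```

After training, `greedy_move(q, n)` should leave the opponent on a multiple of 4 whenever that is possible, which is the known winning strategy for this variant. A real board game would need a richer state encoding than a single integer.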

by asimovDev 11 hours ago

When I'm extremely bored, I think I'll make two models play chess against each other. I bet there's already a chess benchmark / LLM tournament somewhere.

by rusticpenn 11 hours ago

Models are bad at chess. I'm experimenting with a middleman that helps models play chess: https://abhay-ai.github.io/R_Daneel_AI/
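One way to picture a middleman, purely as an assumption and not a description of how the linked project works: a thin layer that checks each proposed move against the list of legal moves and re-prompts with feedback when the model suggests something illegal. In this sketch `mediate`, `propose`, and `scripted_model` are hypothetical names, and the legal moves are passed in as plain strings rather than computed by a chess engine.

```python
from typing import Callable, Iterable, Optional


def mediate(
    propose: Callable[[str], str],
    legal_moves: Iterable[str],
    max_retries: int = 3,
) -> Optional[str]:
    """Ask `propose` for a move until it returns a legal one or retries run out."""
    legal = set(legal_moves)
    feedback = ""  # empty on the first attempt
    for _ in range(max_retries):
        move = propose(feedback).strip()
        if move in legal:
            return move
        # Tell the model what went wrong and what it may choose from.
        feedback = f"'{move}' is not legal. Choose one of: {', '.join(sorted(legal))}"
    return None  # caller decides what to do, e.g. forfeit or pick a fallback move


# Stand-in for a model call: replies with an illegal move first, then a legal one.
replies = iter(["Ke9", "e4"])


def scripted_model(feedback: str) -> str:
    return next(replies)


chosen = mediate(scripted_model, ["e4", "d4", "Nf3"])
```

In a real setup `propose` would wrap an LLM call and `legal_moves` would come from a chess library's move generator for the current position.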

by fuglede_ 7 hours ago

In fact, you don't even need an LLM tournament when you can have tom7's Elo World tournament: https://www.youtube.com/watch?v=DpXy041BIlA

by cbg0 8 hours ago

[flagged]