I asked Gemini to convert a list of URLs into XML. It got halfway through and gave up. When I asked if it had truncated the output, it said yes, and then told _me_ to write a Python script to do it.
On the one hand, it did better than ChatGPT at understanding what I wanted and actually transforming my data.
On the other, truncating my dataset halfway through is nearly as worthless as not doing it at all (and I was working with a single file, maybe hundreds of kilobytes).
Given that Gemini seems to have frequent availability issues, I wonder if this is a strategy to offload low-hanging fruit (from a human-effort pov) to the user. If it is, I think that's still kinda impressive.
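For what it's worth, the script it punted to me is only a few lines. A minimal sketch, assuming the input is a plain text file with one URL per line and the output schema is just `<url>` elements under a `<urls>` root (the file names and element names here are hypothetical, adjust to whatever schema you actually need):

```python
# Sketch of the URL -> XML transform, under the assumptions above:
# one URL per line in urls.txt, output as <urls><url>...</url></urls>.
import xml.etree.ElementTree as ET

def urls_to_xml(in_path: str, out_path: str) -> None:
    root = ET.Element("urls")
    with open(in_path, encoding="utf-8") as f:
        for line in f:
            url = line.strip()
            if url:
                # ElementTree escapes &, <, > in text nodes for us
                ET.SubElement(root, "url").text = url
    tree = ET.ElementTree(root)
    ET.indent(tree)  # pretty-print; Python 3.9+
    tree.write(out_path, encoding="utf-8", xml_declaration=True)

if __name__ == "__main__":
    urls_to_xml("urls.txt", "urls.xml")
```

Which is exactly why the truncation is so annoying: the task is trivial to do deterministically, and the whole point of handing it to the model was not having to write this.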
Somehow I like this. I hate that current LLMs act like yes-men; you can't trust them to give unbiased results. If it told me my approach was stupid, and why, I would appreciate it.
I just asked ChatGPT to help me design a house where the walls are made of fleas, and it told me the idea is not going to work and that it also raises ethical concerns.
I tried it with a Gemini personality that uses this kind of attack, and since that kind of prompt strongly encourages it to provide a working answer, it decided that the fleas were a metaphor for botnet clients and the walls were my network, all so it could give an actionable answer.
I've noticed Gemini pushing back more as well, whereas Claude will just butter me up and happily march on unless I specifically request a critical evaluation.