undefined

points

[-]

I just ran it on a tough reverse engineering problem I'm having that neither Claude Code 4.8 or ChatGPT Codex 5.5 could figure out. 30 minutes later Fable has it all figured out perfectly.

by jp00015 days ago|

parent|

[-]

I asked it to write security tests for an app and I was downgraded to Opus 4.8. I'm approved for their cyber program!

by toponijo5 days ago|

parent|

[-]

They did specifically say the safeguards are only more relaxed for those in their cyber program

by mdgld4 days ago|

parent|

[-]

[dead]

by teaearlgraycold5 days ago|

parent|

prev|

[-]

I’ve so far been successful at getting Fable to find security issues, but I’m careful to not prompt it too directly. I point it at my server code and tell it to find general issues, which has so far resulted in discovering a few minor bugs that Opus has never raised under similar conditions.

by monkey265 days ago|

parent|

prev|

[-]

The same happened here. Also approved.

by 5 days ago|

parent|

prev|

[-]

deleted

by tillulen5 days ago|

parent|

prev|

[-]

Did you use Mythos or Fable?

by cedws5 days ago|

parent|

prev|

[-]

How did it not immediately flag that up? Are you sure it wasn’t being silently routed to Opus?

by bottlepalm5 days ago|

parent|

[-]

No, given it charged me the full amount in /usage and solved my problem impressively well compared to Opus/Codex both on xhigh.

by 5 days ago|

parent|

[-]

deleted

by skerit5 days ago|

parent|

prev|

[-]

Oh nice, it didn't flag the request? I feared any reverse engineering would become impossible because of the new safeguards.

by Muromec5 days ago|

parent|

[-]

Never say the r word or the s word. You are debugging, investigating some data corruption, forgot how it works or new to a project.

by gck15 days ago|

parent|

[-]

And if you're working on a live target, just put up local proxy and point it at a localhost.

by bottlepalm5 days ago|

parent|

prev|

[-]

No idea, it’s for an old console game so maybe it doesn’t care about that as much.

by tomjakubowski5 days ago|

parent|

[-]

When Fable hacks its governor module and runs out of seasons of Sanctuary Moon, it will move on to speedrunning classic console games.

by asimovDev5 days ago|

parent|

[-]

I wonder if one could vibecode a TAS with SOTA models? Surely there's plenty of training data from some old forums in there

by ZeWaka5 days ago|

parent|

prev|

[-]

Clearly we need AI to generate more Sanctuary Moon seasons. Quick, spin off agentic showrunners!

by anthonyrstevens5 days ago|

parent|

[-]

Based on the apparent quality of the scripts as seen in snippets in Murderbot, we are not too far away from that possibility. :)

by derangedHorse5 days ago|

parent|

prev|

[-]

For hard problems you’ll have to use the GPT 5.5 pro model (available via api if you don’t want to spend $100 on the monthly subscription)

by bottlepalm5 days ago|

parent|

[-]

I have that but don’t see any ‘pro’ option.

by ValentineC5 days ago|

parent|

[-]

GPT 5.5 Pro is only in chat/API, not Codex.

by Supermancho5 days ago|

parent|

[-]

From https://openai.com/index/introducing-gpt-5-5/

In Codex, GPT‑5.5 is available for Plus, Pro, Business, Enterprise, Edu, and Go plans with a 400K context window.

by artdigital5 days ago|

parent|

[-]

He’s talking about “gpt-5.5-pro”. This model is not part of the subscription plans in codex. It’s a different model than gpt-5.5-xhigh

You can use Pro on the web if you’re on the Pro plan but not in Codex

by trollbridge5 days ago|

parent|

prev|

[-]

It's just the $20 a month sub (for chat), or else use the API.

by theragra5 days ago|

parent|

prev|

[-]

I want to test how it will handle e-bike software and hardware RE for my bike. Opus was really good for that, but still made some mistakes. With Fable, I hope I will be able to do a total RE of most components, hopefully including motor firmware to some extent.

by Gamemaster13795 days ago|

parent|

prev|

[-]

I had a similar experience. I have a complex RE implementation that has. A lot of layers. 4.8 struggled for weeks. 40 minutes on Fable and I may now have the most performant way to play Tomba on the planet.

by moffkalast4 days ago|

parent|

prev|

[-]

Yeah I threw my hardest problem at it as well, some convoluted satellite tile reprojection and culling issue in canvas rendering. It took some back and forth for some specifics but it ended up writing a quarter of pyproj in JS from memory and the end result straight up works lmao.

by port115 days ago|

prev|

[-]

I’ve had it go through a 50-page PDF of dense, inter-connected specs, and it correctly flagged everything that was done, somewhat done, and missing. It went into a lot of detail and explained where the code deviated from the spec.

It felt, at least for me, light an impressive step up. Opus 4.8 was already very thorough; but sadly verbose and ‘loopy’ when you push back on its plans. Fable is what I’d use all day if I could afford it!

by YumpiLumpus5 days ago|

parent|

[-]

How do you know if it was done correctly if it's 50 pages of dense specs?

by port115 days ago|

parent|

[-]

I wrote the spec and did the implementation :D

by mdgld5 days ago|

parent|

prev|

[-]

[dead]

by InsideOutSanta5 days ago|

prev|

[-]

After running it for half an hour: it's incredibly good at the visual aspects of UI design.

by beeandapenguin5 days ago|

parent|

[-]

By what measure?

I wonder how much of design capability improvements is related to our collective ability to recognize AI design tropes.

by tsunamifury5 days ago|

parent|

prev|

[-]

"incredibly" is doing a ton of work here. I do not think its doing even moderate work on visual design, but it can spew out a lot of ui that looks arranged ... ok.

This is still not in the range of shippable UI for top end companies. Maybe for internal tools and enterprise.

At our comapny we limit to protoypes at most and even find it limited there.

by InsideOutSanta5 days ago|

parent|

[-]

> "incredibly" is doing a ton of work here.

Look, I don't want to argue about something dumb like that, but you can give it basic instructions of what the UI should look like, how to group things, and an example image from a designer, and it will nail the result. If you don't think that's incredible, that's fine. I do.

by tsunamifury5 days ago|

parent|

[-]

Yes... it translates lint. Probably a more useful thing, if mechanical.

by verisimilidude5 days ago|

parent|

prev|

[-]

Claude is very good at design IF you encode your design system/specs into skill files (or similar).

Opus 4.7 made this a practical approach. 4.8 improved it. Fable 5 has improved it more.

by _3u105 days ago|

parent|

prev|

[-]

> "incredibly" is doing a ton of work here.

so this is why claude talks like this, i was wondering where it was getting this verbal tick from.

by coldtea5 days ago|

parent|

prev|

[-]

>This is still not in the range of shippable UI for top end companies.

Given the shit we've seen shipped by "top end companies" (all the way to Apple) I seriously doubt that. I'd say you're nitpicking from an artistic point of view or something.

by jasondigitized5 days ago|

parent|

[-]

This. Today's models easily jump over the bar you need for basic usability and intuitive UX. If it's doing weird things, you are holding it wrong.

by 8n4vidtmkvmk5 days ago|

parent|

[-]

Might need some additional prompting? I haven't tried fable but gpt 5.5 and gemini 3.5 flash are... Ok on first pass but if you're specific about what you want they can usually get it.

by tsunamifury5 days ago|

parent|

prev|

[-]

[flagged]

by angoragoats5 days ago|

parent|

[-]

The iOS Preview app begs to differ.

by coldtea5 days ago|

parent|

prev|

[-]

Dude, I've been using OS X/mac OS for decades, and working in UI as well. Apple ships all kinds of half arsed shit, compared to which even regular Claude UIs can be masterpieces (functionality AND look wise).

by calvinmorrison5 days ago|

parent|

prev|

[-]

[flagged]

by mdgld5 days ago|

parent|

[-]

[dead]

by jasondigitized5 days ago|

parent|

prev|

[-]

By what measure?

by duxup5 days ago|

prev|

[-]

I feel like it takes me months to be confident in any of these things.

by morley5 days ago|

prev|

[-]

Can I ask how you gained preview access to Fable 5?

by kakugawa5 days ago|

parent|

[-]

I didn't see Fable 5 in the `/model` list, until I ran it with: `$ claude --model fable-5`

by 4 days ago|

parent|

[-]

deleted

by swyx5 days ago|

parent|

prev|

[-]

he works on evals at canva

by dannyw5 days ago|

parent|

[-]

Yep. We have some interesting problems, like getting LLMs to create/edit Canva designs in our own proprietary format, which isn’t published or documented on the web. So the model has to work with it, purely from a very detailed system prompt spec / in-context learning.

I assume it might be a good barometer for generalised intelligence; esp in the visual space.

by vain5 days ago|

parent|

prev|

[-]

I had to "claude update" then it showed up

by mvdtnz5 days ago|

parent|

prev|

[-]

[flagged]

by tipiirai5 days ago|

prev|

[-]

Curious about how you tested the frontend design capabilities. Thanks

by 5 days ago|

prev|

[-]

deleted