It isn't this simple for liquid oxygen and methane mixtures, and there's a great deal of disagreement between industry and regulators over what the right percentage of TNT equivalence is. Naturally, industry thinks the percentage is low, and regulators are skeptical, so there's a government-run test campaign going on as we speak to collect data for proper modeling.
This serves as a basis of comparison for this deflagration. If we are considering specifically the appearance of the late fireball, the heat output is the relevant figure of merit.
Assuming about 10-15% of the total bomb energy remained in the heat of the late fireball (with the rest spent on the blast wave, peak thermal radiation and neutron/gamma radiation), the fireball of this rocket deflagration could have exceeded the late fireball from the bomb. But this assumes the tanks were fully filled, which we do not know yet.
It's still a big boom, but not anywhere close to what world occur with optional mixing.
Everyone can be glad though that no hypergolics are involved at least!
The energy of the detonation wave in rocket explosions is typically 1-2% of the energy in the fuel, at least that is the ballpark of what people use for estimating the effects of mishaps.
We also do not know if the tanks were fully filled -- it the past, rocket companies have called 10 second static fire tests a "full duration static fire test." We will probably find out later what it actually was meant to be.
The question was whether during this test the stage was loaded with the same amount of fuel as for an actual flight, or only a small fraction of that.
You fire the rocket as if it’s going to space, but you keep it on the pad. (From the engine’s perspective, it did a full launch.)
If clamped down, it’s a full-duration static fire. If clamps release, it goes to space. Basically, if the engine can’t tell (apart from atmosphere, which is a big apart) it isn’t going to space, it’s a FDSF. It’s a whole-engine show. If you’re running parts through a full duty cycle, that can be done in a lab (or on a stand).
Some reports say that this means "running all seven BE-4 engines at full thrust for up to 38 seconds".
In flight the engines fire for 190 seconds.
So what the full duration means, and whether they fill the tanks with just enough fuel for the firing, or with a larger amount to help the clamps to hold the stage down, all this we will probably only find out from the investigation, if the results are ever published.
(on that note it's also amazing that these exploding bolts are so reliable, I can imagine even a single one not releasing would cause... Issues)
The heat from combustion of this amount would be about 3.4 kt, which is roughly the same as the heat in the late fireball of the Trinity test.
The mushroom cloud from the New Glenn explosion was also substantial: https://photos.app.goo.gl/a7uPVjsB5n453SJA7