What would you hope to achieve by making this case lazy? If you wanted these to run in parallel on a multi-GPU system, you would use the appropriate parallel interface.
.abs().max().item()
of something that can be identified as definitionally zero. The parallel interface, which is async, is probably what you're looking for.