Their solution basically just amounts of "Ethically sourced Styles" which still has all the red tape that a normal text2image model has because majority of the data is still unapproved for use in an AI model.
Businesses didn't want to get wrapped up in a pesudolegal model that really has no better legality than base SD.
Music conglomerates have money and their lawsuits will probably settle the issue.(unless they settle) That will be applied for all copyrighted works, regardless of the medium.
I believe going against the big guys is the reason why the big ones don't yet have music generation LLMs.