No way Google is going to bake the ads into training data. Their entire business is built on auctioning off each ad slot in realtime.
That would be an intentional poisoning of the models with biased or outright untruthful data.
I believe that many people would be unwilling to use such models.