upvote
It is if more training data results in better performance. In which case, GP will continue to use the language that is likely to have the most training data available.
reply
> It is if more training data results in better performance.

Sure. But given the relation with translation systems, it seems far more likely that there are diminishing returns to larger volumes of training data.

reply