This will always be an issue as long as people can fork the code, so one might say we need a license that prevents a module from being used in ML training, better yet, we need a way for a commented line or something that'd pork the training pipeline if found in the source training, and removing it would violate the license.