Originally Posted by
Anjok
That's a good question and it's something I would have to experiment with. I think one way I can potentially make this happen is to train the AI on a dataset consisting of only full mixes paired with their official TV track counterparts. To have a model as effective as the ones I've shared, it would have to consist of at least 200 pairs as well. The GUI that's being developed will have a drop-down of models to choose from, so you'll be able to toggle between a karaoke model and a full vocal removal model.