Anjok, has the AI been used on a spoken part of a song?
Does it struggle with plain talking as opposed to singing, given the amount of reverb that some singers use?
Just speaking from my own experiences here, I have found it handles speaking exceptionally well in general.
I've heard some fantastic results on rap/hip-hop tracks etc. The flip side is that when the instrumentation is bare, every tiny little missed detail stands out. It's easier to scrub in spectral editing since there's not much sound on the spectrum to dig through ... but it all sticks out.
But really, since it's trained on voice, so much depends on how well it recognizes a specific type of voice sound, and how much it mistakes certain instrumentation for voice. That's why having multiple models could potentially prove very useful. Reverb kinda fits into that category as well. This model is trained on different music than the primary AI over in the other thread. The primary one can't handle reverb nearly as well as this one.
I agree with this 100%. I just got my new PC parts today, so I will be ready to start doing some tests this week! Regarding rkeanes' question, I put a dataset together consisting of trailer music with and without movie dialogue to see how well it learns to separate spoken word. I found a YouTube channel that has official instrumentals for trailers, so I'm hoping it pans out! I will probably include rap in that dataset as well, because the dataset isn't quite big enough yet.
I will be doing A LOT of experiments. This will take some time!
Totally looking forward to the results!
I just completed a split conversion you can all check out.
It's for Red Hot Chili Peppers - Higher Ground
The AI in this thread was more effective at converting the 'verses' with the reverb,
and the AI 2.0 from the other thread was more effective at the end part.
Add some hours of spectral editing and a couple of other cuts, and here are the results!
I included both conversions and my completed mix.
https://mega.nz/folder/IR8UxArT#b14akmLvfbkptUVrKpQYXw
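For anyone wanting to try the same kind of split conversion, here is a rough sketch of the splicing step only: take one model's output up to a chosen point, the other model's output after it, and blend the two over a short linear crossfade so the join doesn't click. This is not the workflow used for the mix above, just an illustration. The function name `splice_with_crossfade` and its parameters are made up for this example, and it assumes both renders are already decoded into equal-length lists of mono samples.

```python
def splice_with_crossfade(first, second, split, fade):
    """Return `first` before the split point and `second` after it.

    The two renders are blended linearly over `fade` samples centred
    on `split`. All names here are hypothetical, for illustration only.
    """
    if len(first) != len(second):
        raise ValueError("both renders must have the same length")

    # Crossfade window, clamped to the bounds of the audio.
    start = max(0, split - fade // 2)
    end = min(len(first), start + fade)

    out = []
    for i in range(len(first)):
        if i < start:
            out.append(first[i])       # entirely the first render
        elif i >= end:
            out.append(second[i])      # entirely the second render
        else:
            # Weight of the second render rises linearly across the window.
            w = (i - start + 1) / (end - start + 1)
            out.append((1 - w) * first[i] + w * second[i])
    return out


# Toy usage: ten samples per render, join at sample 5 with a 4-sample fade.
verse_render = [1.0] * 10   # stand-in for the model that handled the verses
ending_render = [0.0] * 10  # stand-in for the model that handled the ending
mixed = splice_with_crossfade(verse_render, ending_render, split=5, fade=4)
```

In practice the split point would be picked by ear, and a longer fade (tens of milliseconds) tends to hide the switch better than a hard cut.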
**UPDATE**
I've made a lot of traction on the GUI and should have it released by the end of the first week of June, along with an updated model that so far looks like the best one I've made. It's on par with the one used to create the instrumentals in the other thread!
Last edited by Anjok; 01-06-2020 at 21:29.