What does that mean automatic? We had speech to text before Resolve and other programs.
Former user
wrote on 6/14/2023, 8:34 PM
When this was discussed earlier, it sounded like they planned on bringing the speech to text from Vegas365(Subscription version) to normal Vegas's but also they were planning on charging for it, so most likely you pay for Vegas, then you pay a monthly subscription just for the subtitling.
Anyone opposed to subscription models aren't going to like that approach. I guess if they added in Transcription editing where you rough edit an interview(as am example) via text, you search for where the subject says something important such as "Mary had a little lamb" , highlight text drag to timeline, and that section of audio/video is added to timeline.
At least you would get a whole new way of editing videos with that subscription cost, but I think speech to text should not cost extra, is not an additional cost with Resolve studio, Capcut and Premiere. Problem is they are cutting the fat from Vegas to make it leaner to sell it cheaper, I don't see them absorbing the server cost. The other method that Resolve and Premiere use is to utilize large language models that could become part of the Vegas Deep Learning models, so the processing is done locally on a users GPU or CPU and Magix doesn't have to pay for 3rd party server speech to text services.
I thought Premiere was doing it locally with (not so great) Mozilla libraries. No idea about Resolve as it's recent. Having done local speech processing on CPU it is very resource intensive but if you have processing power to spare it's viable.
There hasn't been an update about pay-as-you-go plans to add online services like cloud computing and storage to non-365 VEGAS, so no idea if that's going to happen.
I'm not talking about voice synthesis, I'm talking about automatic subtitles. For example, I import a video with a voice, and the subtitles appear directly on the screen, with a choice to customize the font, etc., just like on Premiere Pro.
Je pensais que Premiere le faisait localement avec (pas si génial) les bibliothèques Mozilla. Aucune idée de Resolve car c'est récent. Après avoir effectué un traitement local de la parole sur le processeur, cela nécessite beaucoup de ressources, mais si vous avez de la puissance de traitement, c'est viable.
Il n'y a pas eu de mise à jour sur les plans de paiement à l'utilisation pour ajouter des services en ligne comme le cloud computing et le stockage à non-365 VEGAS, donc aucune idée si cela va se produire.
Just curious ... if it works so well for you in Premiere Pro, why not continue using that NLE for that purpose? Each NLE has its own advantages and disadvantages and no NLE is the be-all-and-end-all of NLEs. Clearly, what you are seeking is not going to happen with Vegas Pro in the next few weeks or months.
I don't use première pro, I just think it's a shame that vegas don't do anything to update, and are behind in a lot of things compared to their competitors, if they carry on like this they're going to lose us all that's all.
Oui, maintenant c'est necessaire d'avoir 365 pour utilizer tout les services en ligne.
Former user
wrote on 6/15/2023, 6:51 PM
I thought Premiere was doing it locally with (not so great) Mozilla libraries. No idea about Resolve as it's recent. Having done local speech processing on CPU it is very resource intensive but if you have processing power to spare it's viable.
Premiere started out with Cloud processing the speech like Vegas is doing now, they moved to doing it locally, download the language models you need. I have looked at task manager when Premiere is transcribing, not using much CPU or GPU, yet it's reasonably fast, but it's not so great for accuracy, Being able to separate speakers is nice.
Resolve acts more like the GPU based whisper transcribers, using most of your available GPU and can be very fast, seen transcribe processing rates up to 42x, but also as low as 6x. The good thing about these built in transcribers is the editing via text(transcript) that's obviously a feature you don't get with standalone whisper transcribers and something Vegas could add.