Vegas Pro Build 300 Text-based Editing (Beta) Feedback

SilentNight wrote on 4/10/2024, 12:00 PM

Loving the performance in build 300 so far. Preview playback is much smoother and using VSTs is much more bearable now. I'm really hoping it fixes the crashing too but only time will tell. I was also very excited for the Text-based Editing feature but unfortunately the way it works right now currently seems useless for the videos I make which are long studio voiceovers with many takes stitched together.

Currently each take shows up individually in the Transcript (Beta) window. So my feedback for this beta is to add the ability to transcribe tracks. This way I could transcribe the whole voiceover track which would make these long videos much easier to work on.

Comments

Steve_Rhoden wrote on 4/10/2024, 2:13 PM

Good suggestion and feedback !

VEGASDerek wrote on 4/10/2024, 2:38 PM

I'm pinning this to the top of the forum page. Please provide any feedback about this feature that you can as you use it. We want to get a good read on how people are using this so we can improve it for the general release in VEGAS Pro 22.

10Year_User wrote on 4/10/2024, 9:25 PM

Here I leave my comments, I hope you take them into account.

 1) Please leave the option to save the favorite language to be transcribed.  Most people only speak one language and choosing it manually is tedious.

 2) Please add a button or script to remove all silences.  If there are 100 silences, going manually makes no sense.

 3) Add the option for the words to appear at the same time they are mentioned.  Capcut transcribes sentences but have a option for the words appear in order as the word is mentioned.

 4) Add more animations for the texts, one or 2 pop or gamer type animations would be very useful

 Thank!

set wrote on 4/11/2024, 2:52 AM

I'm leaving the screenshot report here from one user in Facebook group:

https://www.facebook.com/groups/13702281564/posts/10161614911126565/

Setiawan Kartawidjaja
Bandung, West Java, Indonesia (UTC+7 Time Area)

Personal FB | Personal IG | Personal YT Channel
Chungs Video FB | Chungs Video IG | Chungs Video YT Channel
Personal Portfolios YouTube Playlist
Pond5 page: My Stock Footage of Bandung city

 

System 5-2021:
Processor: Intel(R) Core(TM) i7-10700 CPU @ 2.90GHz   2.90 GHz
Video Card1: Intel UHD Graphics 630 (Driver 31.0.101.2127 (Feb 1 2024 Release date))
Video Card2: NVIDIA GeForce RTX 3060 Ti 8GB GDDR6 (Driver Version 551.23 Studio Driver (Jan 24 2024 Release Date))
RAM: 32.0 GB
OS: Windows 10 Pro Version 22H2 OS Build 19045.3693
Drive OS: SSD 240GB
Drive Working: NVMe 1TB
Drive Storage: 4TB+2TB

 

System 2-2018:
ASUS ROG Strix Hero II GL504GM Gaming Laptop
Processor: Intel(R) Core(TM) i7 8750H CPU @2.20GHz 2.21 GHz
Video Card 1: Intel(R) UHD Graphics 630 (Driver 31.0.101.2111)
Video Card 2: NVIDIA GeForce GTX 1060 6GB GDDR5 VRAM (Driver Version 537.58)
RAM: 16GB
OS: Win11 Home 64-bit Version 22H2 OS Build 22621.2428
Storage: M.2 NVMe PCIe 256GB SSD & 2.5" 5400rpm 1TB SSHD

 

* I don't work for VEGAS Creative Software Team. I'm just Voluntary Moderator in this forum.

RogerS wrote on 4/11/2024, 4:00 AM

The second screenshot seems to be about text to speech, not text-based editing.

Custom PC (2022) Intel i5-13600K with UHD 770 iGPU with latest driver, MSI z690 Tomahawk motherboard, 64GB Corsair DDR5 5200 ram, NVIDIA 2080 Super (8GB) with latest studio driver, 2TB Hynix P41 SSD, Windows 11 Pro 64 bit

Dell XPS 15 laptop (2017) 32GB ram, NVIDIA 1050 (4GB) with latest studio driver, Intel i7-7700HQ with Intel 630 iGPU (latest available driver), dual internal SSD (1TB; 1TB), Windows 10 64 bit

VEGAS Pro 19.651
VEGAS Pro 20.411
VEGAS Pro 21.208

Try the
VEGAS 4K "sample project" benchmark: https://forms.gle/ypyrrbUghEiaf2aC7
VEGAS Pro 20 "Ad" benchmark: https://forms.gle/eErJTR87K2bbJc4Q7

gabgilson wrote on 4/11/2024, 4:14 AM

Hmm, puzzled. I use 'speech to text' lots to auto generate subtitles (e.g. for social clips ready for facebook). In the new version, it seems very hard to do this. Previously I could select the clips I wanted transcribed, choose 'speech to text', click Analyse and it would generate subtitles for just the portion of those clips used on the EDL.

Now even if I select just the clips I want, it gives me a long list of all (some? doesn't look like all, but lots of clips anyway) the clips on my EDL. I scroll through and find just the two clips I want to transcribe, click analyse and it takes ages to analyse the entire clip (not just the portion showing in the EDL). Then I go to 'subtitles beta' and click generate, and it puts subtitles that don't exactly match the edited clips on the EDL. They are close, but they start and stop too late in the clip (as in the words at the start are missing), as if it's trying to sync to the audio but picking the wrong in and out point. I always had to manually correct the auto-subtitles for layout, but now there's a load to correct to add back the missing words.

I think it also ignored half the formatting options I selected - so the font is correct, but they're not on a double line and line length isn't what I chose.

Not sure this is a step forward for what I need to do. is there are a way to go back to the non beta version? Thanks.

I'm on 365 pro subscription.

studiolynga wrote on 4/11/2024, 9:03 AM

I also just want to generate subtitles from speech, like in previous versions. Is there some option to select only what is on the dialog or voice over track?

SilentNight wrote on 4/11/2024, 2:37 PM

I'm pinning this to the top of the forum page. Please provide any feedback about this feature that you can as you use it. We want to get a good read on how people are using this so we can improve it for the general release in VEGAS Pro 22.

If that's the case I'll elaborate a bit more on what I'd want from a "track transcribe" feature.
1. If it works in a similar way to the current AI transcribe then having the tracks in a separate list from the media items would be very helpful.
2. Ideally it could be even easier in a right click menu for example: Select Track(s) to Transcribe -> Right Click -> Speech to Text.
3. Adding a right click menu option for this on media items in the timeline would also be very helpful.
4. The ability to transcribe multiple tracks/media items at once through this right click feature would also be very helpful.

I look forward to future developments on this feature. The act of finding the part of an hours long video where I said something would be made a lot easier with this.

ccscotty wrote on 4/20/2024, 7:00 PM

Trying out the transcription to text based editing beta feature.

It shows silent/pause areas, but you can hide them.

You can click on the Transcript window and it adjusts the timeline marker.

Suggestions/bugs:

  • I'd prefer if transcript generation were a local process instead of cloud based. Privacy considerations and I'd wonder if a local process would be faster while still as accurate or even better depending.
  • The interface isn't clear you need to do the transcript first because it lets you access the editing option but it's a blank screen.
  • Having auto-ripple on clears out all of the text in the transcript window after deleting anything, so it appears to be broken.
  • Some sections of silence are including spoken words. For example in the screenshot the word "6" is attached to the silent portion rather than before or after.
  • You can't do the reverse of clicking on the timeline to select text in the transcript editing window.
  • It would make sense to have this transcript window part of the left panel tabs.
  • I don't see a natural way to play the audio while using the text editor to confirm it is selecting what it needs to do.
  • Deleting individual pieces or groups of text might not be positioned properly so some way of dealing with that from an interface standpoint would be nice.
  • I'd also like it to work on timeline audio rather than source clips.

I like this idea! I could eventually see it faster to use than timeline editing for certain types of videos.

j.razz wrote on 4/21/2024, 2:47 PM

I would like to see in addition to removing all silence over x amount, the ability to leave a buffer of silence on either side of the cut/split.

Here's a use case. The speaker has a pause as they are parsing out how they say what they want to say to conclude a sentence. They then resume and move along with the dialogue. There still needs to be a small amount of silence left to make the dialogue flow naturally.

10Year_User wrote on 4/29/2024, 5:44 PM

Suggestion

There should be an option to transcribe all clips in the timeline.  In my case I usually add two hours of material and if I could delete all the silences with one click it would be extremely useful.  Currently I have only been able to do it by nesting the entire timeline and analyzing the entire project.

Sergey-Kulikov wrote on 5/5/2024, 1:05 PM

Why "Text to speech" and "Speech to text" not working - "The service is unavailable"?

Reyfox wrote on 5/5/2024, 1:38 PM

@Sergey-Kulikov good question. It isn't working here either.

I'll check back later this evening.

studiolynga wrote on 5/7/2024, 3:06 AM

I wish we could have the old "speech to text" function back as a separate function. It worked very well for the way I work with subtitles. The only way for me to use this new function now is to have just one audio file in the project and to make sure there are no edits done to it. It then takes a very long time to analyze the file. When I choose to have the subtitle shown on two lines it still delivers one long line. The subtitles are not "timed" to the speech as they were in the previous version. All the "text plates" come in a long row without any space between them. So now It takes much more time for me to finish the subtitling.

 

 

iEmby wrote on 5/14/2024, 8:53 AM

I think a search option should be given in Text Based Editing where user can search any word or line split out it from text. 

beccause some times there can be long interview or podcast videos. 

PROCESSOR
     

Operating System: Windows 11 Pro 64-bit (Always Updated)
System Manufacturer: ASUS
12th Gen Intel(R) Core(TM) i7-12700 (20 CPUs), ~2.1GHz - 4.90GHz
Memory: 32GB RAM
Page File: 11134MB used, 7934MB Available
DirectX Version: DirectX 12

-----------------------------------------------

MOTHERBOARD

 

ASUS PRIME H610-CS D4
Intel® H610 (LGA 1700)
Ready for 12th Gen Intel® Processors
Micro-ATX Motherboard with DDR4
Realtek 1 Gb Ethernet
PCH Heatsink
PCIe 4.0 | M.2 slot (32Gbps) 
HDMI® | D-Sub | USB 3.2 Gen 1 ports
SATA 6 Gbps | COM header
LPT header | TPM header
Luminous Anti-Moisture Coating
5X Protection III
(Multiple Hardware Safeguards
For all-round protection)

-----------------------------------------------
EXTERNAL GRAPHIC CARD

-----------------------------------------------

INTERNAL GRAPHIC CARD (iGPU)

------------------------------------------------

LED - MONITOR

Monitor Name: Generic PnP Monitor
Monitor Model: HP 22es
Monitor Id: HWP331B
Native Mode: 1920 x 1080(p) (60.000Hz)
Output Type: HDMI

-----------------------------------------------

STORAGE DRIVE

Drive: C:
Free Space: 182.3 GB
Total Space: 253.9 GB
File System: NTFS
Model: WD Blue SN570 1TB (NVMe)

---------------O----------------

My System Info (PDF File).

https://drive.google.com/open?id=1-eoLmuXzshTRH_8RunAYAuNocKpiLoiV&usp=drive_fs

 

Also Check

VEGAS Scripts Collection By Me

https://www.vegascreativesoftware.info/us/forum/vegas-pro-scripts-collections-share-your-script-here--145667/

VEGAS Pro Vault Blogger Created By Me.

https://vegasprofreetools.blogspot.com

My YouTube Channel Dedicated to Only VEGAS Pro Tutorials

http://www.youtube.com/@editroom1580

iEmby wrote on 5/14/2024, 8:54 AM

If possible new language request. as other Indian languages are already there, please add one more which is Punjabi (Gurmukhi) language.

PROCESSOR
     

Operating System: Windows 11 Pro 64-bit (Always Updated)
System Manufacturer: ASUS
12th Gen Intel(R) Core(TM) i7-12700 (20 CPUs), ~2.1GHz - 4.90GHz
Memory: 32GB RAM
Page File: 11134MB used, 7934MB Available
DirectX Version: DirectX 12

-----------------------------------------------

MOTHERBOARD

 

ASUS PRIME H610-CS D4
Intel® H610 (LGA 1700)
Ready for 12th Gen Intel® Processors
Micro-ATX Motherboard with DDR4
Realtek 1 Gb Ethernet
PCH Heatsink
PCIe 4.0 | M.2 slot (32Gbps) 
HDMI® | D-Sub | USB 3.2 Gen 1 ports
SATA 6 Gbps | COM header
LPT header | TPM header
Luminous Anti-Moisture Coating
5X Protection III
(Multiple Hardware Safeguards
For all-round protection)

-----------------------------------------------
EXTERNAL GRAPHIC CARD

-----------------------------------------------

INTERNAL GRAPHIC CARD (iGPU)

------------------------------------------------

LED - MONITOR

Monitor Name: Generic PnP Monitor
Monitor Model: HP 22es
Monitor Id: HWP331B
Native Mode: 1920 x 1080(p) (60.000Hz)
Output Type: HDMI

-----------------------------------------------

STORAGE DRIVE

Drive: C:
Free Space: 182.3 GB
Total Space: 253.9 GB
File System: NTFS
Model: WD Blue SN570 1TB (NVMe)

---------------O----------------

My System Info (PDF File).

https://drive.google.com/open?id=1-eoLmuXzshTRH_8RunAYAuNocKpiLoiV&usp=drive_fs

 

Also Check

VEGAS Scripts Collection By Me

https://www.vegascreativesoftware.info/us/forum/vegas-pro-scripts-collections-share-your-script-here--145667/

VEGAS Pro Vault Blogger Created By Me.

https://vegasprofreetools.blogspot.com

My YouTube Channel Dedicated to Only VEGAS Pro Tutorials

http://www.youtube.com/@editroom1580

Roger Bansemer wrote on 5/17/2024, 7:57 AM

Are there any new youtube videos out on these features? Some of those that were making tutorial videos on youtube just haven't uploaded much in a long time.

fr0sty wrote on 5/17/2024, 5:46 PM

This is a beta feature, so there aren't any tutorials out for it just yet.

Systems:

Desktop

AMD Ryzen 7 1800x 8 core 16 thread at stock speed

64GB 3000mhz DDR4

Geforce RTX 3090

Windows 10

Laptop:

ASUS Zenbook Pro Duo 32GB (9980HK CPU, RTX 2060 GPU, dual 4K touch screens, main one OLED HDR)