Australian voices quality from cloud text-to-speech (subscription)?

Acon wrote on 12/17/2023, 8:28 PM

G'day there,

I'm based in Australia and now use Vegas Pro 18 (perpetual license). I'm considering signing up with the Smart Subscription to get the cloud text-to-speech feature.

From some review videos I found on YouTube, I saw an Australian English option on the list.

For those who are subscribing to Vegas Pro, can you tell me how many Australian male & female voices are there to choose from that category? And how good are these voices?

Thanks heaps.

Regards,

Acon

Comments

Dexcon wrote on 12/17/2023, 9:03 PM

Male = 6

Female = 8

No speech styles for any of the voices but then only a few English language voices have speech styles (e.g. angry, cheerful, sad, etc), but there are pitch and speed sliders so that those paramenters can be adjusted.

I've only just today started to use one of the male Oz voices for a project soon to be uploaded to YouTube. Like many automated voices that I've heard over the years, the tone of the voices is a rather 'flat emotionless read' with a tendency at times to oh.ver.ee.nun.cee.ate a little. This type of narration/commentary would be completely out of place for something exciting or emotional like on Foxtel yesterday when Nathan Lyon got his 500th test wicket (test cricket AUS v PAK).

Last changed by Dexcon on 12/17/2023, 9:21 PM, changed a total of 1 times.

Cameras: Sony FDR-AX100E; GoPro Hero 11 Black Creator Edition

Installed: Vegas Pro 15, 16, 17, 18, 19, 20, 21 & 22, HitFilm Pro 2021.3, DaVinci Resolve Studio 19.0.3, BCC 2025, Mocha Pro 2025.0, NBFX TotalFX 7, Neat NR, DVD Architect 6.0, MAGIX Travel Maps, Sound Forge Pro 16, SpectraLayers Pro 11, iZotope RX11 Advanced and many other iZ plugins, Vegasaur 4.0

Windows 11

Dell Alienware Aurora 11:

10th Gen Intel i9 10900KF - 10 cores (20 threads) - 3.7 to 5.3 GHz

NVIDIA GeForce RTX 2080 SUPER 8GB GDDR6 - liquid cooled

64GB RAM - Dual Channel HyperX FURY DDR4 XMP at 3200MHz

C drive: 2TB Samsung 990 PCIe 4.0 NVMe M.2 PCIe SSD

D: drive: 4TB Samsung 870 SATA SSD (used for media for editing current projects)

E: drive: 2TB Samsung 870 SATA SSD

F: drive: 6TB WD 7200 rpm Black HDD 3.5"

Dell Ultrasharp 32" 4K Color Calibrated Monitor

 

LAPTOP:

Dell Inspiron 5310 EVO 13.3"

i5-11320H CPU

C Drive: 1TB Corsair Gen4 NVMe M.2 2230 SSD (upgraded from the original 500 GB SSD)

Monitor is 2560 x 1600 @ 60 Hz

Dexcon wrote on 12/18/2023, 5:05 AM

@Acon  ... I've just uploaded the video:

The Australian male voice is at the beginning of the video and towards the end from 03:47.

Cameras: Sony FDR-AX100E; GoPro Hero 11 Black Creator Edition

Installed: Vegas Pro 15, 16, 17, 18, 19, 20, 21 & 22, HitFilm Pro 2021.3, DaVinci Resolve Studio 19.0.3, BCC 2025, Mocha Pro 2025.0, NBFX TotalFX 7, Neat NR, DVD Architect 6.0, MAGIX Travel Maps, Sound Forge Pro 16, SpectraLayers Pro 11, iZotope RX11 Advanced and many other iZ plugins, Vegasaur 4.0

Windows 11

Dell Alienware Aurora 11:

10th Gen Intel i9 10900KF - 10 cores (20 threads) - 3.7 to 5.3 GHz

NVIDIA GeForce RTX 2080 SUPER 8GB GDDR6 - liquid cooled

64GB RAM - Dual Channel HyperX FURY DDR4 XMP at 3200MHz

C drive: 2TB Samsung 990 PCIe 4.0 NVMe M.2 PCIe SSD

D: drive: 4TB Samsung 870 SATA SSD (used for media for editing current projects)

E: drive: 2TB Samsung 870 SATA SSD

F: drive: 6TB WD 7200 rpm Black HDD 3.5"

Dell Ultrasharp 32" 4K Color Calibrated Monitor

 

LAPTOP:

Dell Inspiron 5310 EVO 13.3"

i5-11320H CPU

C Drive: 1TB Corsair Gen4 NVMe M.2 2230 SSD (upgraded from the original 500 GB SSD)

Monitor is 2560 x 1600 @ 60 Hz

Acon wrote on 12/18/2023, 12:53 PM

Hi @Dexcon,

Thanks a lot for sharing all these. Very helpful.

The Australian male voice sounds exactly the same as William from https://ttsfree.com.

https://i.imgur.com/EYbXJhH.png

Not too impressive to be honest. William is the only free text-to-speech voice that I can find on the internet. You mentioned there are 6 male Australian voices. How are the other 5?

Regards,

Acon

Dexcon wrote on 12/18/2023, 5:15 PM

@Acon  ... yes, the voice is indeed William.

The other male Australian voices seem to be in a similar age group as William but with different timbres. The 'read' for all of them is rather similar - a part of the sentence sounds good but then the next clause can sound a bit too robotic maybe even a bit staccato (i.e. over enunciated).

I was disappointed the first time I tested this, and without knowing the voice's source, my wife commented while passing by that it sounded like a mechanical voice or something like that.

Interestingly, the Australian female voices have better reads and are thus much more life-like.

Nonetheless, some audio editing is necessary after importing the voice on to Vegas Pro's timeline to close the overly long gaps where there is a comma or a full stop in the text.

Cameras: Sony FDR-AX100E; GoPro Hero 11 Black Creator Edition

Installed: Vegas Pro 15, 16, 17, 18, 19, 20, 21 & 22, HitFilm Pro 2021.3, DaVinci Resolve Studio 19.0.3, BCC 2025, Mocha Pro 2025.0, NBFX TotalFX 7, Neat NR, DVD Architect 6.0, MAGIX Travel Maps, Sound Forge Pro 16, SpectraLayers Pro 11, iZotope RX11 Advanced and many other iZ plugins, Vegasaur 4.0

Windows 11

Dell Alienware Aurora 11:

10th Gen Intel i9 10900KF - 10 cores (20 threads) - 3.7 to 5.3 GHz

NVIDIA GeForce RTX 2080 SUPER 8GB GDDR6 - liquid cooled

64GB RAM - Dual Channel HyperX FURY DDR4 XMP at 3200MHz

C drive: 2TB Samsung 990 PCIe 4.0 NVMe M.2 PCIe SSD

D: drive: 4TB Samsung 870 SATA SSD (used for media for editing current projects)

E: drive: 2TB Samsung 870 SATA SSD

F: drive: 6TB WD 7200 rpm Black HDD 3.5"

Dell Ultrasharp 32" 4K Color Calibrated Monitor

 

LAPTOP:

Dell Inspiron 5310 EVO 13.3"

i5-11320H CPU

C Drive: 1TB Corsair Gen4 NVMe M.2 2230 SSD (upgraded from the original 500 GB SSD)

Monitor is 2560 x 1600 @ 60 Hz

Acon wrote on 12/18/2023, 5:28 PM

Thanks @Dexcon for your insight. Now I need to think about if I really need to upgrade to the Smart Subscription+ plan. Anything else good besides the text-to-speech feature that you enjoy? Like the stock footage and music? Also how fast and stable is the current version 21 compared to previous ones (I got 18)?

RogerS wrote on 12/18/2023, 5:50 PM

Stock footage (and music and sound Fx) is very worthwhile.

21 is a big improvement over 18 for media decoding support, addition of VST3 audio Fx support, a much better color grading panel and more.

As VEGAS is undergoing major changes I'd recommend staying up to date with it.

Dexcon wrote on 12/18/2023, 6:08 PM

@Acon

I haven't used very much stock footage other than aerial drone shots of Copenhagen and some places in Norway, but have found some good theme music and sound effects. But this is very much a personal choice thing depending on the project that is being worked on. Vegas Content is, I believe, sourced from Storyblocks so you might want to explore their website - https://www.storyblocks.com/ - to see what's on offer. I read on the forum recently that not absolutely everything on their website is available via Vegas Content but that most of it is.

Thus far, I have found Vegas Pro build 208 to be very stable and fast - but then I have recently upgraded my desktop's spinning disks to SSDs including most recently the C drive to a better M.2 just 4 weeks ago. So 'how fast' is very much dependent on the computer's specifications that Vegas Pro is running on.

There have been so many improvements and new features in Vegas Pro since VP18 that it is hard to know where to start. Have a look through the new version announcements under the News section of the forum as they list new features etc. Though you've probably already explored it, Vegas Pro 21's product webpage has information about the features that come with the various versions of Vegas Pro 21.

I see on that webpage that NewBlueFX's Titler Pro 7 (worth $299 USD alone) is a bonus extra with VP21 until 9 January - so that is a really good inducement to buy or subscribe before then. NewBlueFX have a trial version of Titler Pro 7 that you can test should you want to.

Why not download the trial version of Vegas Pro 21 to test it out for yourself and explore its features. With the trial version, its the basic Vegas Pro only without access to Vegas Content or the add-on programs. Also, rendering in a trial version is limited to a project that is less than 2 minutes in total length. But its a great way to see for yourself if an upgrade to VP21 or subscription is right for you.

Cameras: Sony FDR-AX100E; GoPro Hero 11 Black Creator Edition

Installed: Vegas Pro 15, 16, 17, 18, 19, 20, 21 & 22, HitFilm Pro 2021.3, DaVinci Resolve Studio 19.0.3, BCC 2025, Mocha Pro 2025.0, NBFX TotalFX 7, Neat NR, DVD Architect 6.0, MAGIX Travel Maps, Sound Forge Pro 16, SpectraLayers Pro 11, iZotope RX11 Advanced and many other iZ plugins, Vegasaur 4.0

Windows 11

Dell Alienware Aurora 11:

10th Gen Intel i9 10900KF - 10 cores (20 threads) - 3.7 to 5.3 GHz

NVIDIA GeForce RTX 2080 SUPER 8GB GDDR6 - liquid cooled

64GB RAM - Dual Channel HyperX FURY DDR4 XMP at 3200MHz

C drive: 2TB Samsung 990 PCIe 4.0 NVMe M.2 PCIe SSD

D: drive: 4TB Samsung 870 SATA SSD (used for media for editing current projects)

E: drive: 2TB Samsung 870 SATA SSD

F: drive: 6TB WD 7200 rpm Black HDD 3.5"

Dell Ultrasharp 32" 4K Color Calibrated Monitor

 

LAPTOP:

Dell Inspiron 5310 EVO 13.3"

i5-11320H CPU

C Drive: 1TB Corsair Gen4 NVMe M.2 2230 SSD (upgraded from the original 500 GB SSD)

Monitor is 2560 x 1600 @ 60 Hz

Acon wrote on 12/18/2023, 6:37 PM

Thanks @RogerS and @Dexcon for your replies. Now I'm tempted to upgrade :). I've used Vegas for about 25 years since version 3 or 4 so will definitely keep using it. It's a shame that they don't offer a BFCM or Xmas discount on the smart subscription plan or I wouldn't be hesitating for so long.

 

 

Former user wrote on 12/18/2023, 8:17 PM
The Australian male voice sounds exactly the same as William from https://ttsfree.com.

https://i.imgur.com/EYbXJhH.png

Not too impressive to be honest. William is the only free text-to-speech voice that I can find on the internet.

One thing you could try is using your own voice or Vegas TTS then use Capcut Voice changer to replace the voice with a Character voice which will still keep some of the qualities of the original such as accent. It has it's own TTS Australian voice but has a weird cadence, they all seem to have news reader type voices which isn't very natural.

This is the voice changer on the Vegas TTS that Dexcon shared, plus a couple of examples of it's own TTS.

Acon wrote on 12/18/2023, 8:25 PM

Hi @Former user, thanks for sharing. The first female voice on your video sounds really OZ.😂

You mentioned CapCut and I happen to use it from time to time, but I didn't find a voice changer option with OZ accent in my software. Where did you find it?

ps. I'm using the Chinese version of CapCut called 剪映.

 

 

Former user wrote on 12/18/2023, 8:41 PM

The voice changer characters take on the accent of the original voice, so in the example above because the Vegas TTS has an Australian accent the voice changed character also has an Australian accent. If the Vegas TTS used an American accent the voice changed version would also have an American accent.

Unfortunately the Capcut characters are very limited