Let me get this out of the way first: the only reason I'm asking this is a shortcoming in the software I'm using. So if this question is a weird headscratcher, you can blame VASST Voice Assistant for lacking common-sense functionality. You see, VASST does not let you specify a volume threshold for when audio should be ducked. It is very literal: it ducks whenever there is a clip on the timeline and only stops ducking when there is no clip there at all. See here:
https://i.imgur.com/8B7NRtJ.png
My voice track is the bottom-most one. You can see that I am not speaking, so the track's level sits between -50 dB and -inf.
https://i.imgur.com/jh0v8Dt.png
Well, VASST doesn't care. Because there's still a clip on the timeline, even though no audio is playing in it, it always ducks by the maximum amount.
https://i.imgur.com/HbImmGV.png
What I want it to look like is this (or similar), but there are 20 hours of audio here and it's just not feasible to do that by hand. So, to avoid spending more money on plugins, my question is this: in Vegas 15, is there a way to automatically split a clip wherever the audio drops to a certain volume threshold or below, and then delete the resulting quiet pieces? If not, do you know of any other way to make this work that doesn't involve doing it all by hand?
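
To make the ask more concrete, here is roughly the logic I have in mind, as a plain Python sketch. This is not Vegas scripting and does not touch VASST; the function name `loud_regions`, its parameters, and the defaults (a -50 dB threshold, half a second of minimum silence, 50 ms analysis windows) are all my own assumptions for illustration. It takes normalized samples (floats in -1.0 to 1.0) and returns the time spans that should stay as clips; everything outside those spans is what I'd want split off and deleted.

```python
def loud_regions(samples, sample_rate, threshold_db=-50.0,
                 min_silence_s=0.5, window_s=0.05):
    """Return (start_s, end_s) spans whose windowed peak exceeds threshold_db.

    Hypothetical helper: gaps quieter than the threshold are only treated
    as silence if they last at least min_silence_s, so short pauses between
    words do not chop the clip into tiny pieces.
    """
    threshold = 10 ** (threshold_db / 20)          # dBFS -> linear amplitude
    window = max(1, round(sample_rate * window_s))  # samples per analysis window
    regions = []
    start = None          # start time of the loud region being built
    last_loud_end = None  # end time of the most recent loud window

    for i in range(0, len(samples), window):
        chunk = samples[i:i + window]
        peak = max(abs(s) for s in chunk)
        t0 = i / sample_rate
        t1 = min(i + window, len(samples)) / sample_rate
        if peak > threshold:
            if start is None:
                start = t0
            elif t0 - last_loud_end >= min_silence_s:
                # The quiet gap was long enough: close the region, open a new one.
                regions.append((start, last_loud_end))
                start = t0
            last_loud_end = t1

    if start is not None:
        regions.append((start, last_loud_end))
    return regions
```

With something like this, each returned span would become one clip on the voice track, and the ducking tool would then only see clips where I'm actually talking.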