[Resource] RVC voice modules

Metalhead33
Posts: 311
Joined: Feb 26, '24

Post by Metalhead33 »

rusty_shackleford wrote: April 4th, 2024, 15:48
This reminds me of how people are unaware pixel art for consoles was designed around the limitations of the medium which is why it looks like shit without scanlines.
Not just the CRT scanlines, but also the NTSC distortion of the signal, which blended in the dithered pixels.

rusty_shackleford wrote: April 4th, 2024, 15:48
Yes, they knew the audio recording devices had limitations, and they were making voices that sounded best within that limitation, not something to be cleaned up later.
Except when we're training an AI voice clone, we aren't making it for a fake 1930s movie - we're making it for a mod for a contemporary video game, where high-quality audio is expected.

rusty_shackleford
Site Admin
Posts: 10344
Joined: Feb 2, '23
Gender: Watermelon

Post by rusty_shackleford »

Metalhead33 wrote: April 4th, 2024, 16:03
Except when we're training an AI voice clone, we aren't making it for a fake 1930s movie - we're making it for a mod for a contemporary video game, where high-quality audio is expected.
But it still sounds wrong.
Metalhead33
Posts: 311
Joined: Feb 26, '24

Post by Metalhead33 »

rusty_shackleford wrote: April 4th, 2024, 16:05
Metalhead33 wrote: April 4th, 2024, 16:03
Except when we're training an AI voice clone, we aren't making it for a fake 1930s movie - we're making it for a mod for a contemporary video game, where high-quality audio is expected.
But it still sounds wrong.
Would you rather have a mixture of voices (in the same game or same mod), where some have heavy static (and zero bass) while others sound ultra-HD?
Metalhead33
Posts: 311
Joined: Feb 26, '24

Post by Metalhead33 »

orinEsque wrote: April 4th, 2024, 13:53
You definitely need more epochs. The conversions are too tinny.
I am re-training my modules at 500 epochs.

I wonder how many epochs would be needed to approach ElevenLabs' quality?
Metalhead33 wrote: April 5th, 2024, 05:10
I forgot to unsubscribe from ElevenLabs, so they charged my credit card $5 yesterday. So I decided to use up all the monthly credits as soon as possible, and then unsubscribe.

So, here is some stuff from my fictional world's lore:

https://waysofdarkness.miraheze.org/wik ... War_of_718


https://waysofdarkness.miraheze.org/wik ... Revolution
"The Rape War" is NOT a reference to @Hubumba


Synopsis of my future book, which I am co-writing with someone


The aforementioned co-writer's OC


One of the villains, who initially pretends to be an ally


Yes, I used the Stronghold Crusader narrator voices and Stronghold Crusader soundtrack.
Last edited by Metalhead33 on April 5th, 2024, 05:19, edited 1 time in total.
orinEsque
Posts: 1589
Joined: Oct 9, '23
Location: Narnia
Gender: Potato

Post by orinEsque »

Metalhead33 wrote: April 5th, 2024, 05:17
orinEsque wrote: April 4th, 2024, 13:53
You definitely need more epochs. The conversions are too tinny.
I am re-training my modules at 500 epochs.

I wonder how many epochs would be needed to approach ElevenLabs' quality?
Probably a lot. And more varied input in some cases. Sometimes, when you don't have enough samples, you can train an ElevenLabs voice on a sample, have it read a book chapter or something, and use the output as further input for RVC.
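
A minimal sketch of that last step, assuming the original clips and the ElevenLabs-generated clips sit in two separate folders. The folder layout, the prefixes, and the `merge_datasets` helper are hypothetical and not part of RVC or ElevenLabs; the idea is only to pool both sets of audio into one folder that can be used as the training dataset.

```python
from pathlib import Path
import shutil

# Audio extensions to pick up; this list is an assumption, adjust as needed.
AUDIO_EXTS = {".wav", ".flac", ".mp3"}


def merge_datasets(original_dir, synthetic_dir, out_dir):
    """Copy original and TTS-generated clips into one training folder.

    Files are prefixed with "orig_" or "synth_" so that clips with the
    same name in both folders do not overwrite each other.
    Returns the list of file names written to out_dir.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    copied = []
    for tag, src in (("orig", Path(original_dir)), ("synth", Path(synthetic_dir))):
        for clip in sorted(src.iterdir()):
            # Skip subfolders and anything that is not an audio file.
            if clip.is_file() and clip.suffix.lower() in AUDIO_EXTS:
                dest = out / f"{tag}_{clip.name}"
                shutil.copy2(clip, dest)
                copied.append(dest.name)
    return copied
```

Calling something like `merge_datasets("clips_original", "clips_generated", "dataset_combined")` would then leave one folder to use as the training input. Keeping the prefixes also makes it easy to drop the generated clips again if they turn out to hurt the result.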
Metalhead33
Posts: 311
Joined: Feb 26, '24

Post by Metalhead33 »

orinEsque wrote: April 5th, 2024, 08:37
Probably a lot. And more varied input in some cases. Sometimes, when you don't have enough samples, you can train an ElevenLabs voice on a sample, have it read a book chapter or something, and use the output as further input for RVC.
If only I knew this earlier.

(I generated a ton of conversations between Baldur's Gate 1-2 characters for the lulz using ElevenLabs. I still have them saved on my HDD.)
orinEsque
Posts: 1589
Joined: Oct 9, '23
Location: Narnia
Gender: Potato

Post by orinEsque »

Metalhead33 wrote: April 5th, 2024, 08:44
orinEsque wrote: April 5th, 2024, 08:37
Probably a lot. And more varied input in some cases. Sometimes, when you don't have enough samples, you can train an ElevenLabs voice on a sample, have it read a book chapter or something, and use the output as further input for RVC.
If only I knew this earlier.

(I generated a ton of conversations between Baldur's Gate 1-2 characters for the lulz using ElevenLabs. I still have them saved on my HDD.)
Ah, but I think you can continue training an already trained model.
Metalhead33
Posts: 311
Joined: Feb 26, '24

Post by Metalhead33 »

orinEsque wrote: April 5th, 2024, 10:39
Ah, but I think you can continue training an already trained model.
And that is exactly what I am doing.
Turns out it's taking longer than I expected. I can guarantee that I will post the 500-epoch retrained versions this week, just not today.

EDIT: @orinEsque It is done.
Last edited by Metalhead33 on April 10th, 2024, 06:58, edited 1 time in total.