I just found this.
This is huge!
As a german, I use thorsten medium as he simply made the best dataset.
Mixing english with german, speaking numbers, single letters, pausing without a “.” but just a linebreak, all those can be essential.
And… it is nearly perfect! And all local!
This is crazy!
eSpeak can finally go to rest!
To those late to the party, you can sample voices here so that you’re not in a crapshoot: https://rhasspy.github.io/piper-samples/
Nice, thanks!