Posting here, as this has more to do with the human voice itself than electronics.

I am planning to build a simple human voice synth, but I couldn’t find much on the internet. The overall plan is to generate a signal, pass it through about 10-ish bandpass filters and adjust each filter’s gain to create speech.

For my source signal, I found this, which seems to be the sound generated by the larynx before passing through the throat and mouth. From what I read online, a relaxation oscillator or a sawtooth wave seems to be a close approximation of it.

One of the things I am struggling to find is the frequency components corresponding to certain phonetics. Though I am pretty sure it is either because I can’t find the right keywords or because SEO ruined the internet.

US2121142A is the patent for Voder, the first human voice synthesizer by bell labs. It has a similar structure to what I’ve been modeling in my head. Should I just use the frequency values here for my bandpass filters or should I use something else?

  • frongt@lemmy.zip
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    3
    ·
    edit-2
    1 month ago

    No need to call, I’m sure there are plenty of papers published if you use Google scholar or similar. Or thesis projects on github, etc.