Posting here, as this has more to do with the human voice itself than electronics.
I am planning to build a simple human voice synth, but I couldn’t find much on the internet. The overall plan is to generate a signal, pass it through about 10-ish bandpass filters and adjust each filter’s gain to create speech.
For my source signal, I found this, which seems to be the sound generated by the larynx before passing through the throat and mouth. From what I read online, a relaxation oscillator or a sawtooth wave seems to be a close approximation of it.
One of the things I am struggling to find is the frequency components corresponding to certain phonetics. Though I am pretty sure it is either because I can’t find the right keywords or because SEO ruined the internet.
US2121142A is the patent for Voder, the first human voice synthesizer by bell labs. It has a similar structure to what I’ve been modeling in my head. Should I just use the frequency values here for my bandpass filters or should I use something else?


There are, but there’s no harm in calling or e-mailing: It could be the start of a collaboration.