Vocal Tract Shape Estimation

M. B. Gülmezoğlu, A. Barkana, Anadolu Üniversitesi, Türkiye.

Abstract

The previous works done by other researchers to determine the cross-sectional areas of the vocal tract assumes that the glottal wave shape consists of only an impulse for each vibration of the vocal cords.

At the beginning of this work, it was expected that if the actual sawtooth input signal is taken as the glottal wave input, obtained cross-sectional areas of the vocal tract can be closer to actual ones. Input signals from the glottis and the output signals Irom the lips are measured and digitized simultaneously with the sampling frequency of 10kHz.

In this work, the transfer function of the vocal tract is obtained by using least-squares technique. The coefficients of the denominator polynomial of the transfer function are used to determine the cross-sectional areas of the vocal tract.

At first, the previous ·works whicb assume an impulse wave shape for the input is also repeated to find the cross-sectional areas of the vocal tract. It is observed that the obtained areas are similar to previous ones.

In the second step, the cross-sectional areas obtained by taking the actual input signal as the glottal wave input. The results of this work are compared with the work which takes the impulse signal as the glottal wave input.

The work with actual input signal seems to yield the position of the constriction point of the tongue better than the results of the previous work.

Özet

Diğer araştırmacılar tarafından, ses yolunun enine kesit alanlarını belirlemek için yapılan önceki çalışmalar, glottal dalga şeklinin, sadece ses tellerinin her titreşimi için bir dürtüden oluştuğunu varsayar.

Bu çalışmanın başlangıcında, gerçek testere dişi giriş sinyali glottal dalga girişi olarak alındığında, vokal kanalın kesit alanlarının fiili olanlara daha yakın olabileceği beklenmektedir. Glottisten gelen giriş sinyalleri ve dudaklardan çıkan sinyaller, 10kHz örnekleme frekansıyla aynı anda ölçülür ve dijitalleştirilir.

Bu çalışmada, en küçük kareler tekniği kullanılarak ses yolunun transfer fonksiyonu elde edilmiştir. Transfer fonksiyonunun paydas polinomunun katsayıları, ses yolunun kesit alanlarını belirlemek için kullanılır.

İlk başta, girdi için bir dürtü dalgası şeklini alan önceki çalışmalar, ses yolunun kesit alanlarını bulmak için de tekrarlanır. Elde edilen alanların öncekine benzer olduğu görülmektedir.

İkinci aşamada, gerçek giriş sinyali glottal dalga girişi olarak alınarak elde edilen kesitsel alanlar. Bu çalışmanın sonuçları, impuls sinyalini glottal dalga girişi olarak alan çalışmayla karşılaştırılır.

Gerçek giriş sinyaliyle yapılan çalışma, dilin daralma noktasının konumunu önceki çalışmaların sonuçlarından daha iyi veriyor gibi görünüyor.

Tam Metin

Bildirinin tamamını görüntülemek için aşağıdaki dosyayı indiriniz.

PreviousIsolated Speech Recognition System NextComparison of Human and Lovebird Speech

Last updated 6 years ago

Was this helpful?