Geri Dön

Improved microphone array design with statistical speaker identification methods

İstatiksel ses tanıma metodları ile gelişmiş mikrofon dizisi tasarımı

PDF İndir

Tez No: 433000
Yazar: KADİR ERDEM DEMİR
Danışmanlar: DOÇ. DR. MUSTAFA TANER ESKİL, PROF. DR. MUSTAFA KARAMAN
Tez Türü: Yüksek Lisans
Konular: Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol, Computer Engineering and Computer Science and Control
Anahtar Kelimeler: Belirtilmemiş.
Yıl: 2016
Dil: İngilizce
Üniversite: Işık Üniversitesi
Enstitü: Fen Bilimleri Enstitüsü
Ana Bilim Dalı: Bilgisayar Mühendisliği Ana Bilim Dalı
Bilim Dalı: Belirtilmemiş.
Sayfa Sayısı: 63

Özet

Mikrofon dizilerinin kazanc dizinin boyutlarn buyuturek artrlabilir fakat kazanc artrmak icin sensor eklemek cok maliyetlidir. Bu nedenle eger ortamda yeterince alan olsa bile algoritma karsklgn artrarak kazanc artrma tercih edilir. Spektral dizi isleme methodlarnda, odaklanlmak istenen kisinin ve gurultunun bulundugu posizyonlarn bilinmesi buyuk avantaj saglar. Geleneksel metodlar bu problemi istatiksel olmayan yontemlerle cozmeye calsr. Ayrca ses tanma metodlarnn performanslar gurultu orann yuksek oldugu ortamlarda azalr. Bu gibi ortamlarda, mikrofon dizilerinin kullanlmas ses sinyalinin kalitesini artrr. Bu nedenlerde dolay, mikrofon dizileri ve ses tanma metodlar birbirlerine katk saglarlar. Bu calsmamzda, mikrofon dizisi sistemi ve ses tanma sistemi tek bir sistemin parcalar olarak tasarlanmstr. Mikrofon dizisi kullanarak ses tanma sisteminin dogrulugu artlrken ses tanma sisteminin sonuclar kullanlarakta mikrofon dizisinin kazanc artrlmstr. Ses tanma sistemi uygulumasnda Fusion ve N-Gram temel frekans yontemleri onerilmistir Gelismis mikrofon tasarmn gosterebilmek icin simulasyon ortam konusmaclarn odann herhangi bir yerine eklenebilicegi bir simulasyon ortam gelistirilmistir. Simulasyon ortamnda deneyler sonucu onerilen metodlarn geleneksel metodlar ustun oldugu gozlemlenmistir.

Özet (Çeviri)

Conventional microphone array implementations aim to lock onto a source with given location and if required, tracking it. This implementation is straightforward when the location or the path of the source and interference are provided. It becomes a challenge to detect the intended source when multiple unknown sources exist in the same environment. Performance of speaker identi cation degrades drastically when the speech signal is severely distorted by additive noise and reverberation. In such environments, microphone arrays are often utilized as a means of improving the quality of captured speech signals. Both microphone array and speaker identi cation are mature elds. The advances of these two distinct elds can be combined into one system that maximizes gain on the intended speaker, which is the topic of this thesis. We utilize microphone array methods to improve the accuracy of speaker identi cation in a cocktail party environment. When the source and interferences are localized microphone array can be tuned to further reduce noise and increase the gain. In this thesis we developed a robust simulation environment to demonstrate the proposed improved microphone array design with statistical speaker identi cation. This is an open source implementation in which users can assign speakers anywhere in the room. We proposed two features; fusion based, and computationally ecient N-Gram for speaker identi cation. We demonstrated that the proposed features and the algorithm that leverages the synergy of microphone array processing and speaker identi cation methods outperforms conventional algorithms.

Benzer Tezler

Tez No
598320
Fourıer bölgesinde çapraz ilinti yöntemi ile ses kaynağının konumunun fpga kullanarak tespiti
Sound source localization in the fourier region with cross correlation method using fpga
MERVE ÖZTÜRK LAFCI
Yüksek Lisans
Türkçe
2019
Elektrik ve Elektronik Mühendisliği Gazi Üniversitesi
Elektrik-Elektronik Mühendisliği Ana Bilim Dalı
DOÇ. DR. HASAN ŞAKİR BİLGE
Tez No
84230
Bir Türkçe sesli ifade tanıma sisteminin kural tabanlı tasarımı ve gerçekleştirimi
Rule based design and implementation of a speech recognition system for Turkish language
ERHAN MENGÜŞOĞLU
Yüksek Lisans
Türkçe
1999
Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol Hacettepe Üniversitesi
Bilgisayar Bilimleri Ana Bilim Dalı
YRD. DOÇ. DR. HARUN ARTUNER
Tez No
850118
MEMS mikrofon dizini ile varış zaman farkı esaslı akustik yön tespiti
Time difference of arrival based acoustic direction estimation with MEMS microphone array
NURİ BAŞAR
Yüksek Lisans
Türkçe
2024
Savunma ve Savunma Teknolojileri İstanbul Teknik Üniversitesi
Savunma Teknolojileri Ana Bilim Dalı
PROF. DR. TAYFUN AKGÜL
Tez No
546270
Performance evaluation of real-time noisy speech recognition for mobile devices
Mobil cihazlarda gerçek zamanlı gürültülü konuşma tanıma performans değerlendirilmesi
YASER YURTCAN
Yüksek Lisans
İngilizce
2019
Bilim ve Teknoloji Orta Doğu Teknik Üniversitesi
Bilişim Sistemleri Ana Bilim Dalı
DOÇ. DR. BANU GÜNEL KILIÇ
Tez No
349796
FPGA tabanlı şifreli kablosuz haberleşme sistemi
FPGA based encrypted wireless communication system
ILGAZ AZ
Yüksek Lisans
Türkçe
2014
Elektrik ve Elektronik Mühendisliği İstanbul Teknik Üniversitesi
Disiplinlerarası Ana Bilim Dalı
DOÇ. DR. GÖKHAN İNALHAN

Geri Dön