Noise robust speech recognition using parallel model compensation and voice activity detection methods

PMT ve ses aktivitesi tespiti metodları kullanarak gürültüye dayanıklı konuşma tanıma

PDF İndir

Tez No: 382389
Yazar: SERHAT HIZLISOY
Danışmanlar: DOÇ. DR. ZEKERİYA TÜFEKCİ
Tez Türü: Yüksek Lisans
Konular: Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol, Computer Engineering and Computer Science and Control
Anahtar Kelimeler: Ses Aktivitesi Tespiti, Gürültü Tahmini, Paralel Model Tazmini, Voice Activity Detection, Noise Estimation, Parallel Model Compensation
Yıl: 2015
Dil: İngilizce
Üniversite: Çukurova Üniversitesi
Enstitü: Fen Bilimleri Enstitüsü
Ana Bilim Dalı: Bilgisayar Mühendisliği Ana Bilim Dalı
Bilim Dalı: Belirtilmemiş.
Sayfa Sayısı: 88

Özet

Bu çalışmanın amacı bilinmeyen gürültülü ortamlarda konuşma tanıma performansını artırmaktır. Gürültülü ortamlarda konuşma tanıma sisteminin tanıma oranını artırmak için birçok yöntem önerilmiştir. Gürültülü konuşma ile başa çıkmak için en etkili tekniklerden biri Parelel Model Tazmin (PMT) yöntemidir. PMT konuşma tanıma sisteminin verilen gürültülü ortamda eğitilmesi durumunda sistemin eğitim ve test koşulu arasında uyumsuzluk olmadığında en iyi sonuç vereceği varsayımına dayanır. Bundan dolayı PMT gürültülü ortam için konuşma tanıma sistem parametrelerini tahmin edip gürültü modeli ile temiz konuşma modelini kullanarak eğitim ve test koşulu arasındaki uyumsuzluğu en aza indirmeye çalışır, konuşma tanıma modelini oluşturur. Gürültülü Konuşma sinyalini konuşma olan ve konuşma olmayan kısımlara ayıran birçok iyi bilinen Ses Aktivitesi Tespiti (SAT) metodu bulunmaktadır. SAT tarafından konuşma olmayan şeklinde ayrılan kısımlar gürültü modelini tahmin etmek için kullanılabilir. Bu tezde, gürültü modeli tahmin etmek için SAT yöntemlerini kullanmayı ve ayrıca PMT'de kullanmak için SAT yöntemi kullanarak tahmin edilen gürültü modeli ile temiz konuşma modelini gürültülü konuşma modelini tahmin etmek için kullanmayı önerdik. Bu çalışmada gürültülü konuşma tanıma için iyi bilinen üç SAT metodu ile baz performansı karşılaştırıldı. Baz metod için gürültü kullanılarak gürültü modeli tahmin edildi. Bütün metodlarda ise gürültülü konuşma modelini tahmin etmek için PMT kullanıldı. Bununla birlikte gürültü modelin parametrelerini tahmin etmek için yeni bir SAT önerildi. Önerilen SAT yönteminin konuşma tanıma performansı önerilen SAT metodu bu iyi bilinen SAT yöntemleri ile karşılaştırıldığında daha az hesaplama gerektirmesine rağmen iyi bilinen SAT yöntemlerinin çok daha iyidir.

Özet (Çeviri)

The main purpose of this study is to increase the performance of a speech recognition system under unknown noisy environments. Many methods have been proposed to increase the recognition performance of a speech recognition system for noisy speech. One of the most efficient techniques for dealing with the noisy speech is the Parallel Model Compensation (PMC) method. The speech recognition system will give the best results when there is no mismatch between the training and testing condition. Therefore, PMC method tries to minimize the mismatch between the training and testing conditions by estimating the speech recognition system parameters for noisy environment using the clean speech model and noise model. There are many well-known voice activity detection (VAD) methods for classifying the speech signal into speech and non-speech segments. Non-speech segments that are classified by a VAD method can be used to estimate the noise model. In this thesis, we propose to use VAD methods for estimating the noise model, and we also propose to use PMC for estimating the noisy speech model using the clean speech model and noise model which is estimated using a VAD method. In this study, performances of the baseline and three well know VAD methods have been compared for noisy speech recognition. Noise model were estimated using noise for the baseline method. For all method, PMC is used to estimate the noisy speech model. In addition to this, a new VAD method is proposed to estimate parameters of the noise model. The proposed VAD method's speech recognition performance is better than the most of the well-known VAD methods despite less computational requirement of the proposed VAD method compared to these well-known VAD methods.

Benzer Tezler

Tez No
390504
Noise robust speaker recognition under unknown noise environment
Bilinmeyen gürültü çevre ortamında güvenli konuşmacı tanımlama
MOHAMAD DIA ABDULKARİM
Yüksek Lisans
İngilizce
2015
Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol Çukurova Üniversitesi
Bilgisayar Mühendisliği Ana Bilim Dalı
DOÇ. DR. ZEKERİYA TÜFEKÇİ
Tez No
79308
Large vocabulary speech recognition in noisy environments
Gürültülü ortamlarda geniş kelime hazneli konuşma tanıma
FİRAS JABLOUN
Yüksek Lisans
İngilizce
1998
Elektrik ve Elektronik Mühendisliği İhsan Doğramacı Bilkent Üniversitesi
Elektrik-Elektronik Mühendisliği Ana Bilim Dalı
DOÇ. DR. A. ENİS ÇETİN
Tez No
389387
Örneklem tabanlı gürbüz konuşma tanıma
Exemplar based noise robust speech recognition
FATİH AKTÜRK
Yüksek Lisans
Türkçe
2015
Elektrik ve Elektronik Mühendisliği İstanbul Teknik Üniversitesi
Elektronik ve Haberleşme Mühendisliği Ana Bilim Dalı
DOÇ. DR. ENDER METE EKŞİOĞLU
Tez No
84138
Gürültülü ortamlarda Türkçe ayrık sözcük konuşma tanıma sistemi gerçekleştirimi
Realization of a Turkish isolated word speech recognition system under noisy environments
NEDİM KARACA
Doktora
Türkçe
1999
Elektrik ve Elektronik Mühendisliği Hacettepe Üniversitesi
Elektrik-Elektronik Mühendisliği Ana Bilim Dalı
PROF. DR. H. SELÇUK GEÇİM
Tez No
546270
Performance evaluation of real-time noisy speech recognition for mobile devices
Mobil cihazlarda gerçek zamanlı gürültülü konuşma tanıma performans değerlendirilmesi
YASER YURTCAN
Yüksek Lisans
İngilizce
2019
Bilim ve Teknoloji Orta Doğu Teknik Üniversitesi
Bilişim Sistemleri Ana Bilim Dalı
DOÇ. DR. BANU GÜNEL KILIÇ

Geri Dön