Modelling human-robot audition for sound localization with deep neural network

Turkish title: Derin sinir ağı ile ses yerelleştirmesi için insan-robot seçmelerinin modellenmesi

Thesis No: 937327
Author: MOHAMMAD IBRAHİM KHLİL AL KARAKI
Advisors: DR. IHAB ELAFF
Thesis Type: Master's
Subjects: Computer Engineering and Computer Science and Control
Keywords: Not specified.
Year: 2024
Language: English
University: Üsküdar Üniversitesi
Institute: Institute of Science
Department: Department of Computer Science and Engineering
Division: Not specified.
Page Count: 82

Abstract

In the field of computational auditory systems, this research takes a new approach by using a deep neural network for the complex task of modeling human sound localization. Aiming for practicality and efficiency, the study integrates this computational model into the compact and versatile Raspberry Pi platform, offering a concrete solution for real-world applications. Motivated by the constraints inherent in traditional sound localization methods, the research aims to establish a more accurate and adaptive auditory system.

Abstract (Translation)

In the field of computational auditory systems, this research proposes a novel approach that uses a deep neural network to model human sound localization. Aiming for practicality and efficiency, the study integrates this computational model into the compact and versatile Raspberry Pi platform, presenting a tangible solution for real-world applications. Motivated by the inherent constraints of traditional sound localization methods, it seeks to establish a more accurate and adaptive auditory system.

Objectives and Methodology

The primary objectives are the design and implementation of a customized deep neural network tailored to sound localization. The model is integrated into the Raspberry Pi environment, taking advantage of its accessibility and low cost. To make the model robust and versatile, diverse sound data is acquired using specific tools, including a Raspberry Pi 3 and 2mic.

Evaluation and Results

Beyond implementation, the research evaluates the accuracy and efficiency of the proposed system under various environmental conditions. The results are promising, with an error rate ranging from 0.45% to 1.15% across diverse sample sizes, which supports the effectiveness of the computational auditory system.

Significance and Applications

The significance of this research extends beyond the immediate application and contributes to both computational audition and machine learning. Combining human-like sound localization mechanisms with the adaptability of deep learning models could open new possibilities in fields such as robotics, assistive technologies, and virtual reality.

Structure and Content

The thesis is structured into four chapters. It opens with an introduction to the broader field of computer audition. Subsequent chapters cover the theoretical foundations and implementation details of the spherical head model, which provides the framework for sound localization. The system design is then described, covering architecture, components, and methodologies, with emphasis on integration with the Raspberry Pi environment.

Conclusion

This research aims to extend what is achievable in computational audition. By combining human auditory perception with deep learning, and by building on the accessible Raspberry Pi platform, the study aims to advance sound localization and to foster innovations in human-computer interaction and technological accessibility.
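
The abstract refers to a spherical head model but this record does not give its equations. As a rough illustration only, the sketch below uses the classic Woodworth approximation for the interaural time difference (ITD) of a far-field source around a rigid sphere, ITD = (a / c) (θ + sin θ), where a is the head radius, c the speed of sound, and θ the azimuth from straight ahead. The choice of this formula, the function names, the head radius, and the speed of sound are all assumptions made for the example and may differ from what the thesis implements.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at room temperature
HEAD_RADIUS = 0.0875    # m, an assumed average head radius (not from the thesis)


def woodworth_itd(azimuth_rad, radius=HEAD_RADIUS, c=SPEED_OF_SOUND):
    """Return the ITD in seconds for a far-field source.

    azimuth_rad is measured from straight ahead and must lie in
    [-pi/2, pi/2]; positive and negative angles give opposite signs.
    """
    if not -math.pi / 2 <= azimuth_rad <= math.pi / 2:
        raise ValueError("azimuth must be within [-pi/2, pi/2] radians")
    return (radius / c) * (azimuth_rad + math.sin(azimuth_rad))


def azimuth_from_itd(itd, radius=HEAD_RADIUS, c=SPEED_OF_SOUND, iterations=50):
    """Invert woodworth_itd by bisection (the formula is monotonic in azimuth)."""
    max_itd = woodworth_itd(math.pi / 2, radius, c)
    itd = max(-max_itd, min(max_itd, itd))  # clamp to the physically valid range
    lo, hi = -math.pi / 2, math.pi / 2
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if woodworth_itd(mid, radius, c) < itd:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


if __name__ == "__main__":
    for degrees in (0, 30, 60, 90):
        itd = woodworth_itd(math.radians(degrees))
        print(f"{degrees:3d} deg -> ITD = {itd * 1e6:.1f} microseconds")
```

With these assumed constants the largest ITD, at 90 degrees, is about 0.66 ms. A two-microphone setup could in principle compare a measured inter-channel delay against such a model, although a learned model such as the deep neural network described in the abstract would replace the explicit inversion shown here.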

Similar Theses

Thesis No: 753040
Title: Lifelong learning for auditory scene analysis
Turkish title: İşitsel sahne analizi için hayat boyu öğrenme
Author: BARIŞ BAYRAM
Thesis Type: Doctorate
Language: English
Year: 2022
Subjects: Computer Engineering and Computer Science and Control
University: İstanbul Teknik Üniversitesi
Department: Department of Computer Engineering
Advisors: ASSOC. PROF. DR. GÖKHAN İNCE

Thesis No: 400042
Title: Ego noise estimation for robot audition
Turkish title: No title translation available
Author: GÖKHAN İNCE
Thesis Type: Doctorate
Language: English
Year: 2011
Subjects: Mechanical Engineering
University: Tokyo Institute of Technology
Advisors: PROF. JUNİCHİ IMURA

Thesis No: 709912
Title: Identification of object manipulation anomalies for service robots
Turkish title: Servis robotları için nesne etkileşim anomalilerinin tanısı
Author: DOĞAN ALTAN
Thesis Type: Doctorate
Language: English
Year: 2021
Subjects: Computer Engineering and Computer Science and Control
University: İstanbul Teknik Üniversitesi
Department: Department of Computer Engineering
Advisors: ASSOC. PROF. DR. SANEM SARIEL UZER

Thesis No: 456241
Title: Development and implementation of control strategies on a parallel mechanism ankle rehabilitation robot
Turkish title: Paralel mekanizmalı ayak bileği rehabilitasyon robotu üzerinde kontrol stratejilerinin geliştirilmesi ve uygulanması
Author: MUSTAFA ŞİNASİ AYAS
Thesis Type: Doctorate
Language: Turkish
Year: 2017
Subjects: Electrical and Electronics Engineering
University: Karadeniz Teknik Üniversitesi
Department: Department of Electrical and Electronics Engineering
Advisors: PROF. DR. İSMAİL HAKKI ALTAŞ

Thesis No: 510217
Title: Mechanical design and implementation of advanced control algorithms of a variable parametered strengthening lower-extremity exoskeleton
Turkish title: Değişken parametreli bir alt ekstremite güçlendirme robotunun mekanik tasarımı ve ileri kontrol algoritmalarının uygulanması
Author: ALPER KADİR TANYILDIZI
Thesis Type: Doctorate
Language: Turkish
Year: 2018
Subjects: Mechanical Engineering
University: Fırat Üniversitesi
Department: Department of Mechanical Engineering
Advisors: ASSOC. PROF. OĞUZ YAKUT
