Ses karakteristiğinden yararlanarak anlamlı görüntü akışı elde etme

Production of meaningful image by using sound characteristics

Tez No: 106253
Yazar: ÖZNUR KILIÇ TAN
Danışmanlar: PROF. DR. M. YAHYA KARSLIGİL
Tez Türü: Yüksek Lisans
Konular: Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol, Computer Engineering and Computer Science and Control
Anahtar Kelimeler: Hızlı Fourier Dönüşümü, Ayrık Dalgacık Dönüşümü, Ölçek, Ses işareti. xı, Fast Fourier Transform, Discrete Wavelet Transform, Scale, Sound Signal xu
Yıl: 2001
Dil: Türkçe
Üniversite: Yıldız Teknik Üniversitesi
Enstitü: Fen Bilimleri Enstitüsü
Ana Bilim Dalı: Bilgisayar Bilimleri Ana Bilim Dalı
Bilim Dalı: Belirtilmemiş.
Sayfa Sayısı: 116

Özet

ÖZET Günümüzde bilgisayar programlarının çoğu bilginin görüntüyle ifade edilerek kullanıcıya aktarılması yöntemini benimsemiştir. Özellikle İnternet tabanlı programlar incelendiğinde resim, video gibi çoklu ortam veri tiplerine sesten daha çok yer verildiği görülmektedir. Bunun temelinde insanların görme duyusunun işitme duyusuna göre daha gelişmiş olduğu gerçeği yatmaktadır. Örnek olarak, tanıdığımız bir kişinin telefondaki sesini ayırt edememe olasılığımızın, bu kişiyi gördüğümüzde tanıyamama olasılığından daha fazla olması ya da eğitimde çoğunlukla görsel materyallerden faydalanılması gösterilebilir. Bu çalışmada, ses bilgisinin karakteristik özelliklerinin çeşitli yollarla görüntüye yansıtılması yardımıyla sesteki farklılıkların, değişimlerin, ayrıntıların kolaylıkla algılanmasını sağlayan bir yöntem geliştirilmesi amaçlanmıştır. Geliştirilen bu sistemde öncelikle ses bilgisi elde edilmektedir. Sesin elde edilmesi var olan bir ses dosyasının okunması ya da mikrofon kullanılarak ses kartı yardımıyla alınması olmak üzere iki şekilde gerçekleştirilir. Sesin işlenmesi aşamasında, örneklenmiş bilgiye Hızlı Fourier Dönüşümü uygulanarak frekans spektral yoğunluğu elde edilir. Bu uygulamada diğerlerinden farklı olarak, ses bilgisinin frekans-genlik gösterimi ile ifade edilmesi yanında, frekans-genlik-faz bilgileri kullanılarak anlamlı görüntü akışı oluşturulmuştur. Hızlı Fourier Dönüşümü'nün ses gibi dinamik işaretlerin analizinde yetersiz kalması dolayısıyla Ayrık Dalgacık Dönüşümü uygulanarak ses işaretinin zaman-ölçek düzleminde görüntülenebilmesi sağlanmıştır. Bu tez ile geliştirilen sistem, sese ait Ayrık Dalgacık Dönüşümü sonucunu üç boyutlu görüntüye dönüştürmesi acısından bu alandaki çalışmalardan farklıdır. Sonuç olarak geliştirilen bu program ile ses bilgisinden gerçek zamanlı görüntü akışı elde edilmiştir. Ayrık Dalgacık Dönüşümü yöntemi ile sesin hem zaman hem de frekans düzlemindeki karakteristiği aynı anda üç boyutlu olarak görüntülenerek, işaretin frekans bileşenlerinin hangi zaman aralığında oluştuğu gösterilip, ses gibi dinamik işaretlerin analizinde doğru ve kesin sonuçlar elde edilmesi sağlanmıştır.

Özet (Çeviri)

ABSTRACT Today, most of the computer programs are designed to present the data to the users by the images. Especially the Internet based programs seem to offer more video and images as the multimedia data than the sound. The basic reason of this lies in the fact that the visual perception of the human brain is much stronger than the perception by hearing. For example, it's highly probable that we may not recognize the voice of a person that we know on the phone whereas the probability of not recognizing this person when we see him is much lower. Using the video and images in the education is also a good example of the strength of the visual representation. In this thesis, it's aimed to develop a method to perceive the differences, changes and details in the sound signals by means of reflecting the characteristics of these signals in the images. In this developed system as the first step the digital sound data is acquired. This data is acquired by two methods, by reading an already existing wave file or by recording the audio through a digital sound card by means of a microphone. As the second step processing of this digital data takes place. In this step the sampled data is applied the Fast Fourier Transform to determine the spectral components. In this application, unlike the similar ones, besides the frequency-magnitude representation, a meaningful image is displayed by using the frequency, magnitude and phase data As the FFT is not sufficient for the analysis of dynamic signals like sound, the Discrete Wavelet Transform is used to yield the representation of the sound signal in the time-scale plane. The application developed in this thesis differs from the applications in this area in that it converts the output of the DWT to a 3D image. To conclude, the program developed yields real time image representation of the sound. Using the DWT; both the time and frequency characteristics of the sound is acquired and hence the exact time when a spectral component has occurred is shown in a single 3D representation and as a result correct and definite results of analysis of dynamic signals, like sound, are gained.

Benzer Tezler

Tez No
517431
Automatic posture evaluation for professional voice users
Profesyonel ses kullanıcıları için otomatik postür değerlendirmesi
ÇAĞATAY DEMİREL
Yüksek Lisans
İngilizce
2018
Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol İstanbul Teknik Üniversitesi
Bilgisayar Mühendisliği Ana Bilim Dalı
YRD. DOÇ. DR. GÖKHAN İNCE
Tez No
687997
Adaptif filtereme kullanılarak solunum hızı tespit sistem tasarımı
Respiratory rate detection system design using adaptive filtering
MOHAMED ALFITURI ALBASHIR ELHASHMI
Yüksek Lisans
Türkçe
2021
Bilim ve Teknoloji Karabük Üniversitesi
Biyomedikal Mühendisliği Ana Bilim Dalı
DR. ÖĞR. ÜYESİ AHMET REŞİT KAVSAOĞLU
Tez No
8151
Cenab Şehabeddin'in şiirleri üzerinde bir araştırma
Başlık çevirisi yok
HASAN AKAY
Doktora
Türkçe
1989
Türk Dili ve Edebiyatı İstanbul Üniversitesi
PROF. DR. ZEYNEP KERMAN
Tez No
180638
Speaker-dependent speech coding
Kişiye bağımlı ses kodlaması
METE KEMAL KART
Yüksek Lisans
İngilizce
2006
Elektrik ve Elektronik Mühendisliği İhsan Doğramacı Bilkent Üniversitesi
Elektrik-Elektronik Mühendisliği Ana Bilim Dalı
PROF. DR. ENİS ÇETİN
Tez No
474905
Prediction of sound transmission characteristics of multiple elastomeric bulb seals
Çoklu elastomerik balon fitillerin ses iletim karakteristiğinin incelenmesi
BURAK NEBİL BARUTÇU
Yüksek Lisans
İngilizce
2017
Makine Mühendisliği Orta Doğu Teknik Üniversitesi
Makine Mühendisliği Ana Bilim Dalı
PROF. DR. MEHMET ÇALIŞKAN

Geri Dön