Neurosymbolic representations for lifelong learning

Hayat boyu öğrenme için nörosembolik gösterimler

PDF İndir

Tez No: 859378
Yazar: ALPER AHMETOĞLU
Danışmanlar: DOÇ. DR. EMRE UĞUR, PROF. ERHAN ÖZTOP
Tez Türü: Doktora
Konular: Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol, Computer Engineering and Computer Science and Control
Anahtar Kelimeler: Belirtilmemiş.
Yıl: 2024
Dil: İngilizce
Üniversite: Boğaziçi Üniversitesi
Enstitü: Fen Bilimleri Enstitüsü
Ana Bilim Dalı: Bilgisayar Mühendisliği Ana Bilim Dalı
Bilim Dalı: Belirtilmemiş.
Sayfa Sayısı: 166

Özet

Bu tez, derin sinir ağı mimarilerinin yüksek boyutlu vektörleri işlemedeki avantajları ile klasik yapay zeka arama tekniklerini birleştirerek robot öğrenimi için yeni bir yöntem sunmaktadır. Bu yöntem, robotun sürekli gösterimdeki duyudevinimsel verileri ile sonlu sayıda nesneler içeren ortamlar arasında bir köprü kurmak için tasarlanmıştır. Önerilen yöntemin amacı ortam ile etkileşimler yoluyla toplanan verileri uygun sembolik yapılara çevirerek bu semboller üzerinden ağaç arama yöntemleri ile istenilen bir hedef durumu bulmaktır. Yöntemin genel yapısında gizyazıcı-gizçözücü tipinde bir sinir ağı bulunmaktadır. Sinir ağının darboğaz katmanında türev akışına izin veren i-kili etkinleştirme hücreleri bulunmaktadır. Ortamdaki nesnelerin öznitelikleri ile temsil edilen ortam durumu, gizyazıcıya girdi olarak verilmektedir. Gizyazıcı her nesne için ayrık bir vektör üretir, ve bu vektörler nesnelerin sembolleri olarak kullanılır. Nesne sembolleri eylem vektörü ile birlikte gizçözücüye girdi olarak verilir, ve gizçözücü robotun eyleminin yol açtığı etkiyi tahmin eder. Tüm yapı eğitildikten sonra sürekli bir şekilde gösterilen ortamın tanımı gizyazıcı kullanılarak sembolik hale dönüştürülebilir. Bu sayede ortamdaki geçişler semboller üzerinden kurallar tanımlanarak gösterilebilir. Bu kurallar planlama alan tanım diline çevrildiğinde çeşitli planlama yöntemleri kullanılarak hedef bir durum aranabilir. Masaüstü nesne etkileşimi deney düzeneklerinde yaptığımız deneyler bu sistemin ortama dair uygun semboller öğrendiğini göstermiştir. Gözetimsiz bir şekilde öğrenilmiş bu semboller üzerinden tanımlanan kurallar kullanılarak çeşitli yüksekliklerde nesne kuleleri ve nesneler arası ilişkilerin modellenmesini gerektiren karmaşık nesne yapıları kurulmuştur. Bu yöntem türevlenebilir yapı taşları ile kurulduğu için derin öğrenmedeki yenilikler ile çeşitli yönlerde genişletilebilir.

Özet (Çeviri)

This thesis presents a novel framework for robot learning that combines the advantages of deep neural architectures in processing high-dimensional vectors with classical AI search techniques to bridge the gap between continuous sensorimotor data of the robot and domains consisting of finite entities. The aim is to convert information about the environment collected through interactions into an appropriate symbolic form on which a search tree can be built to reach a desired state. The framework consists of an encoder-decoder type of network with binarized activations in the bottleneck layer. The state of the environment, represented as a set of object features, is given to the encoder as input. The output is a discrete vector, treated as the object's symbol, given to the decoder together with the action vector. The decoder predicts the effect observed by the agent due to the executed action. Once the network is trained, we can transform the continuously represented environment definition into symbolic vectors using the encoder. This allows us to build rules defining the transitions in the environment defined over these symbols. These rules can be translated into planning domain definition language (PDDL), allowing domain-independent off-the-shelf planners to be used to search for a goal state. Our experiments on tabletop object manipulation setups show that the system can learn appropriate symbols of the environment that allow it to build object towers with desired heights and complex object structures that require modeling the relations between objects by reasoning through the rules defined over the symbols learned in an unsupervised manner. As the framework is built with differentiable blocks, it affords appending recent advances in deep learning with ease, allowing it to be extensible in multiple directions.

Benzer Tezler

Geri Dön