Learning variable selection rules via reinforcement learning

Başlık çevirisi mevcut değil.

PDF İndir

Tez No: 755261
Yazar: GÜLÇİN ECE ÖZKAHYA
Danışmanlar: DR. JUAN VERA LİZCANO
Tez Türü: Yüksek Lisans
Konular: İşletme, Business Administration
Anahtar Kelimeler: Combinatorial optimization, mixed integer linear programming, branch and bound, variable selection, Reinforcement Learning, Q-learning, dimensionality reduction
Yıl: 2021
Dil: İngilizce
Üniversite: Tilburg University
Enstitü: Yurtdışı Enstitü
Ana Bilim Dalı: Belirtilmemiş.
Bilim Dalı: Belirtilmemiş.
Sayfa Sayısı: 60

Özet

Özet yok.

Özet (Çeviri)

Branch and bound algorithm is a traditional method that is used in combinatorial optimization problems, including Mixed Integer Linear Programming (MILP). One of the most important components in the algorithm that effects the efficiency of the algorithm is variable selection. Since there is no exact mathematical understanding behind variable selection step, there are many computational heuristics in the literature. In our thesis, we create our branch and bound environment and set hand-made branching rules. Then, we formulate Q-learning to learn variable selection rules and we apply the method to binary knapsack and set covering problem sets. However, Reinforcement learning algorithms cannot converge to good policy in the case of exponential growth of state space and action space. To tackle this problem, we apply one of the dimensionality reduction methods, PCA, to project state space from high-dimensional state space to lower-dimensional space. The learning phase in the Q-learning is executed in the lower dimensional space, which leads to faster convergence to better policy. However, there is a convergenceperformance trade-off due to loss of important data in the projection. Although experimental results shows that dimensionality reduced Q-learning cannot converge to objective value as good as hand-made branching rules, applyling PCA to state space improves the performance in comparison to learning in the full dimensional space. convergence rate.

Benzer Tezler

Tez No
439533
Türkçe tümcelerin yüklem odaklı anlam ve dilbilgisi çözümlemesi
Grammatical and semantic analysis of turkish sentence based on predicate
İLKNUR DÖNMEZ
Doktora
Türkçe
2016
Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol İstanbul Teknik Üniversitesi
Bilgisayar Mühendisliği Ana Bilim Dalı
PROF. DR. EŞREF ADALI
Tez No
39102
Değiştirilmiş kohonen ağı simülasyon programı
A Simulation program for modified kohonen network
ENDER TUNÇ EROĞLU
Yüksek Lisans
Türkçe
1993
Elektrik ve Elektronik Mühendisliği İstanbul Teknik Üniversitesi
PROF.DR. UĞUR ÇİLİNGİROĞLU
Tez No
685459
Evaluating performance of different remote sensing techniques and various interpolation approaches for soil salinity assessment
Toprak tuzluluğu değerlendirmesi için farklı uzaktan algılama teknikleri ve çeşitli interpolasyon yaklaşımlarının performansının değerlendirilmesi
TAHA GORJI
Doktora
İngilizce
2021
Bilim ve Teknoloji İstanbul Teknik Üniversitesi
Bilişim Uygulamaları Ana Bilim Dalı
PROF. DR. AYŞE GÜL TANIK
Tez No
349853
Köprü ayaklarında meydana gelen yerel oyulmaların veri analiz yöntemleri kullanılarak incelenmesi
Investigation of scour at bridge piers using data analysis methods
MEHMET ÖNER YELEĞEN
Yüksek Lisans
Türkçe
2014
İnşaat Mühendisliği İstanbul Teknik Üniversitesi
İnşaat Mühendisliği Ana Bilim Dalı
YRD. DOÇ. DR. ALİ UYUMAZ
Tez No
97124
Bankacılıkta değişim yönetimi
Change management in banking
AYDIN ARGIN
Doktora
Türkçe
2000
Bankacılık Marmara Üniversitesi
Bankacılık Ana Bilim Dalı
PROF. DR. NAZIM EKREN

Geri Dön