A sample-path approach to time-average Markov decision processes

Başlık çevirisi mevcut değil.

PDF İndir

Tez No: 400894
Yazar: MELİKE BAYKAL GÜRSOY
Danışmanlar: DR. KEITH W. ROSS
Tez Türü: Doktora
Konular: Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol, Computer Engineering and Computer Science and Control
Anahtar Kelimeler: Belirtilmemiş.
Yıl: 1988
Dil: İngilizce
Üniversite: University of Pennsylvania
Enstitü: Yurtdışı Enstitü
Ana Bilim Dalı: Belirtilmemiş.
Bilim Dalı: Belirtilmemiş.
Sayfa Sayısı: 111

Özet

Özet yok.

Özet (Çeviri)

Time-average Markov decision problems are considered for the finite state and action spaces. Several definitions of variability are introduced and compared. For multichain case, it is shown that a stationary policy maximizes one of the criteria, namely, the expected long-run average variability. An algorithm which uses a decomposition approach to locate such an optimal policy is given. The algorithm produces an optimal pure policy under convexity conditions for the variability function. The unichain semi-Markov decision processes are examined. It is shown that a stationary policy maximizes the expected average reward subject to the condition that the longrun average cost is below certain level with probability 1. A fractional program is presented which produces such an optimal stationary policy. Two-person zero-sum stochastic games are also considered. In the case that only one player controls the transition probabilities, stationary policies are shown to exist which give the saddlepoint solution for multichained expected long-run average reward. An algorithm using the decomposition theory is developed to find optimal stationary policies for both players. In the case that both players control the transition probabilities a generalized game is obtained. The solution of this game gives optimal stationary policies for the players if the game is irreducible.

Benzer Tezler

Tez No
66651
Türkiye'de kentsel sit alanı sorunları ve çözüm yolları için bir deneme/Galata örneği
A Survey on the solution of the problems in the urban şite areas in Turkey/ Galata case
YASEMİN AKSOY
Yüksek Lisans
Türkçe
1997
Şehircilik ve Bölge Planlama İstanbul Teknik Üniversitesi
Kentsel Tasarım Ana Bilim Dalı
PROF. DR. GÜNDÜZ ATALIK
Tez No
711472
Türkülerin dinamik yapısını oluşturan olguların öğretiminde izlenecek bir yaklaşım önerisi: Orta Anadolu ağzı örneği
A recommendation for an approach to teaching the phenomenons which form that the dynamic structure of turkus: Central Anatolia example
ERHAN USLU
Doktora
Türkçe
2022
Eğitim ve Öğretim İstanbul Teknik Üniversitesi
Müzikoloji ve Müzik Teorisi Ana Bilim Dalı
PROF. DR. NİLGÜN DOĞRUSÖZ DİŞİAÇIK
Tez No
335005
Bulanık çok modlu kaynak kısıtlı proje çizelgeleme problemlerinin çözümü için matematiksel bir model
A mathematical model for the solution of the fuzzy multi mode resource-constrained project scheduling problems
ÖMER ATLI
Doktora
Türkçe
2012
Endüstri ve Endüstri Mühendisliği Hava Harp Okulu Komutanlığı
Endüstri Mühendisliği Ana Bilim Dalı
PROF. DR. CENGİZ KAHRAMAN
Tez No
22021
PC ler arasında veri iletişimini sağlayan bir yazılım
A Software about data communication between PCs
OSMAN NURİ ÖZPINAR
Yüksek Lisans
Türkçe
1992
Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol İstanbul Teknik Üniversitesi
DOÇ. DR. MİTHAT UYSAL
Tez No
917614
Doğal kayaçlardan radyasyon kalkanlama özelliğine sahip camların geliştirilmesi
Development of Radiation Shielding Glasses from Natural Rocks
BİLGEHAN GÜVEN
Doktora
Türkçe
2025
Fizik ve Fizik Mühendisliği Sakarya Üniversitesi
Metalurji ve Malzeme Mühendisliği Ana Bilim Dalı
PROF. DR. ŞENOL YILMAZ

Geri Dön