Incorporation of ordinal optimization into learning automata for high learning efficiency

Junqi Zhang, Cheng Wang, Di Zang, Mengchu Zhou

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

Learning automata (LA) represent important leaning mechanisms with applications in automated system design, biological system modeling, computer vision, and transportation. They play the critical roles in modeling a process as well as generating the appropriate signal to control it. They update their action probabilities in accordance with the inputs received from the environment and can improve their own performance during operations. The action probability vector in LA takes charge of two functions: 1) The cost of convergence, i.e., the size of sampling budget; 2) The allocation of sampling budget among actions to identify the optimal one. These two intertwined functions lead to a problem: The sampling budget mostly goes to the currently estimated optimal action due to its high action probability regardless whether it can help identify the real optimal action or not. This work proposes a new class of LA that separates the allocation of sampling budget from the action probability vector. It uses the action probability vector to determine the size of sampling budget and then uses Optimal Computing Budget Allocation (OCBA) to accomplish the allocation of sampling budget in a way that maximizes the probability of identifying the true optimal action. Simulation results verify its significant speedup ranging from 10.93% to 65.94% over the best existing LA algorithms.

Original languageEnglish (US)
Title of host publication2015 IEEE Conference on Automation Science and Engineering
Subtitle of host publicationAutomation for a Sustainable Future, CASE 2015
PublisherIEEE Computer Society
Pages1206-1211
Number of pages6
ISBN (Electronic)9781467381833
DOIs
StatePublished - Oct 7 2015
Event11th IEEE International Conference on Automation Science and Engineering, CASE 2015 - Gothenburg, Sweden
Duration: Aug 24 2015Aug 28 2015

Publication series

NameIEEE International Conference on Automation Science and Engineering
Volume2015-October
ISSN (Print)2161-8070
ISSN (Electronic)2161-8089

Other

Other11th IEEE International Conference on Automation Science and Engineering, CASE 2015
Country/TerritorySweden
CityGothenburg
Period8/24/158/28/15

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Incorporation of ordinal optimization into learning automata for high learning efficiency'. Together they form a unique fingerprint.

Cite this