  • Simulation-based optimization [electronic resource] : parametric optimization techniques and reinforcement learning /
  • Record type: Language material (printed) : Monograph
    Classification number: 519.2
    Title / Author: Simulation-based optimization : parametric optimization techniques and reinforcement learning / by Abhijit Gosavi.
    Author: Gosavi, Abhijit.
    Published: Boston, MA : Springer US, 2015.
    Description: xxvi, 508 p. : ill., digital ; 24 cm.
    Contained in: Springer eBooks
    Subject: Probabilities.
    Subject: Mathematical optimization.
    Subject: Economics/Management Science.
    Subject: Operation Research/Decision Theory.
    Subject: Operations Research, Management Science.
    Subject: Simulation and Modeling.
    ISBN: 9781489974914 (electronic bk.)
    ISBN: 9781489974907 (paper)
    Contents: Background -- Simulation basics -- Simulation optimization: an overview -- Response surfaces and neural nets -- Parametric optimization -- Dynamic programming -- Reinforcement learning -- Stochastic search for controls -- Convergence: background material -- Convergence: parametric optimization -- Convergence: control optimization -- Case studies.
    Summary: Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning introduces the evolving area of static and dynamic simulation-based optimization. It covers in detail model-free optimization techniques designed for discrete-event, stochastic systems that can be simulated but whose analytical models are difficult to express in closed mathematical form. Key features of this revised and improved Second Edition include: (1) extensive coverage, via step-by-step recipes, of powerful new algorithms for static simulation optimization, including simultaneous perturbation, backtracking adaptive search, and nested partitions, in addition to traditional methods such as response surfaces, Nelder-Mead search, and meta-heuristics (simulated annealing, tabu search, and genetic algorithms); (2) detailed coverage of the Bellman equation framework for Markov Decision Processes (MDPs), along with dynamic programming (value and policy iteration) for discounted, average, and total reward performance metrics; (3) an in-depth consideration of dynamic simulation optimization via temporal differences and Reinforcement Learning: Q-Learning, SARSA, and R-SMART algorithms, and policy search via API, Q-P-Learning, actor-critics, and learning automata; (4) a special examination of neural-network-based function approximation for Reinforcement Learning, semi-Markov decision processes (SMDPs), finite-horizon problems, two time scales, case studies for industrial tasks, computer codes (placed online), and convergence proofs via Banach fixed point theory and Ordinary Differential Equations. Themed around three areas in separate sets of chapters (Static Simulation Optimization, Reinforcement Learning, and Convergence Analysis), this book is written for researchers and students in the fields of engineering (industrial, systems, electrical, and computer), operations research, computer science, and applied mathematics.
    Electronic resource: http://dx.doi.org/10.1007/978-1-4899-7491-4