序贯决策
- n. sequential decision
-
假设检验的Bayes序贯决策问题往往可用最优停止理论解决。
Bayes sequential decision problems in hypothesis testing can often be solved with optimal stopping theory.
-
在学习单元对环境信息未知的序贯决策问题中,强化学习(RL)是一种被广泛用于建立环境模型以及求解最优控制策略的有效技术。
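In sequential decision problems where the learning agent has no prior knowledge of the environment, reinforcement learning (RL) is an effective and widely used technique for building environment models and solving for optimal control policies.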
-
Bayes序贯决策法在舰炮武器系统试验中的应用
Application of Bayes SPRT to Shipborne Gun Weapon System Test
-
一类序贯决策问题的线性动态规划(LDP)算法
An LDP (Linear Dynamic Programming) Algorithm for a Class of Sequential Decision Problems
-
实际生活中的许多序贯决策问题,如柔性制造系统、交通指挥系统、排队系统等,都可以模型化为Markov决策过程(MDP)。
Many real-world sequential decision problems, such as flexible manufacturing systems, traffic command systems, and queuing systems, can be modeled as Markov decision processes (MDPs).
-
虚拟现实仿真下的序贯决策优化研究
Research on Optimization for Sequential Decision Making Under Virtual Reality Simulation
-
基于序贯决策的装备可靠性费用优化研究
Optimization of Military Equipment Reliability Expenses Based on Sequential Decision
-
农业灌溉序贯决策的广义密切值法
Generalized Close Value Method for Sequential Decision Making in Agricultural Irrigation
-
信息搜寻的序贯决策模型
Sequential decision model on information searching
-
项目计划过程采用序贯决策原理分为项目分解、资源配置、确定任务工期三个阶段进行。
Following the principle of sequential decision-making, project planning proceeds in three stages: project decomposition, resource allocation, and determination of task durations.
-
并采用主、客观相结合的方法来确定序贯决策的动态权重。
A method combining subjective and objective weighting is used to determine the dynamic weights in sequential decision-making.
-
其中,资源优化组合与协作作业规划过程采用序贯决策原理,分为项目分解、资源配置、确定任务工期三个阶段进行。
The resource optimization and collaborative task planning process follows the principle of sequential decision-making and proceeds in three stages: project decomposition, resource allocation, and determination of task durations.
-
通过融合机会约束优化策略与序贯决策方法,提出了机会约束序贯优化策略。
By combining the chance-constrained optimization strategy with the sequential decision method, a chance-constrained sequential optimization strategy is proposed.
-
应急决策是根据突发事件发展、演化的不同阶段,进行多阶段不确定性决策,从而生成应急处置方案的动态随机序贯决策问题。
Emergency decision-making is a dynamic stochastic sequential decision problem: multi-stage decisions under uncertainty are made according to the stages of an incident's development and evolution, producing an emergency response plan.
-
分析了供应商评估的决策过程,在序贯决策思想的基础上提出了供应商评估的3种决策过程模型。
This paper analyzes the decision process in supplier evaluation and, building on the idea of sequential decision-making, proposes three decision process models for supplier evaluation.
-
增强学习能有效解决不确定序贯决策优化问题,近年来已发展成为机器学习领域的一个研究热点。
Reinforcement learning (RL) can effectively solve sequential decision optimization problems under uncertainty, and in recent years it has become a major research topic in machine learning.
-
由于现实系统中,系统状态转移概率以及报酬函数往往不能显式得到,所以传统方法无法求解序贯决策问题。
Because the state transition probabilities and the reward function often cannot be obtained explicitly in real systems, traditional methods cannot solve such sequential decision problems.
-
激励学习智能体通过最优策略的学习与规划来求解序贯决策问题,因此如何定义策略的最优判据是激励学习研究的核心问题之一。
Reinforcement learning agents solve sequential decision problems by learning and planning optimal policies, so how to define the optimality criterion for a policy is one of the core questions in RL research.
-
序贯抽样决策中的抽样数量
Sample Size in Sequential Sampling Decisions
-
在此基础上,采用马占山(1988)的简易序贯抽样决策模型进行了简易序贯抽样分析,制作了简易序贯抽样分析图、表。
On this basis, Ma Zhan-shan's (1988) simplified sequential sampling decision model was used to carry out a simplified sequential sampling analysis, and the corresponding analysis charts and tables were produced.
-
并应用实物期权方法、序贯投资决策理论等方法创建了国家关键技术基于收益的创新模式选择模型。
In addition, the real options method and sequential investment decision theory are applied to build a return-based model for selecting the innovation mode of national critical technologies.
-
本文提出了交互式序贯多目标决策与模糊层次综合评价模型体系,就矿井通风系统进行分析。
This paper proposes a model system combining interactive sequential multi-objective decision-making with fuzzy hierarchical comprehensive evaluation, and uses it to analyze mine ventilation systems.
-
渠井灌区配水序贯多指标模糊决策模型与方法
Sequential Multi-index Fuzzy Decision Model and Method for Water Allocation in Canal-Well Irrigation Districts