Objective: To investigate how video and audio-visual materials based on visual thinking theory influence nursing students' professional cognition and emotion, and to find methods to improve nursing students' professional cognition and strengthen ...
BAI Jinlu (白金璐)+3 more
doaj +1 more source
Cooperative service caching and peer offloading in Internet of vehicles based on multi-agent meta-reinforcement learning [PDF]
To reduce computational complexity, a two-layer multi-RSU (road side unit) service caching and peer offloading algorithm (MPO) was proposed to decouple the optimization problem. In the designed MPO, the outer layer utilized multi-agent meta ...
Kaiyuan ZHANG+3 more
core +1 more source
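Objective/Significance: Artificial intelligence (AI) technology has set off a wave of research in academia and engineering, and has also shown strong potential for retrieving geophysical parameters and agrometeorological remote sensing parameters. At present, most applications of AI in geoscience and agronomy remain "black boxes" that have no physical meaning or lack interpretability and generality. To promote the application of AI in geoscience and agronomy and to cultivate interdisciplinary talent, this study proposes a paradigm theory for geophysical parameter retrieval based on AI coupled with physical and statistical methods. Methods: First, physical logical reasoning is carried out based on the physical energy balance equation, and the system of retrieval equations is constructed theoretically; then a generalized statistical method is built on the physical derivation ...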
MAO Kebiao+15 more
doaj +1 more source
Deep reinforcement learning based algorithm for real-time QoS optimization of software-defined security middle platform [PDF]
To overcome the problem that the real-time optimization of the quality of service (QoS) in software-defined security scenarios was hindered by the mismatch between security protection measures and business scenarios, which led to difficulties in ...
Yongtai QIN, Yuancheng LI
core +1 more source
TD algorithm based on double-layer fuzzy partitioning [PDF]
When dealing with continuous-space problems, traditional Q-iteration algorithms based on lookup tables or function approximation converge slowly and have difficulty producing a continuous policy. To overcome these weaknesses, an on-policy TD algorithm ...
Hong-kun SUN+4 more
core +1 more source
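Briefly reviews the history, current state, and future development paths of reinforcement learning. It argues that reinforcement learning should move from "act first, then know" and "know first, then act" toward parallel reinforcement learning that unifies knowing and acting, so that a setback suffered in the virtual world becomes wisdom gained in the physical world, and reinforcement learning truly becomes the foundational learning theory for intelligent mechanisms and intelligent algorithms.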
王飞跃, 曹东璞, 魏庆来
doaj
Deep Reinforcement Learning-driven Cross-Community Energy Interaction Optimal Scheduling
In order to coordinate energy interactions among various communities and energy conversions among multi-energy subsystems within the multi-community integrated energy system under uncertain conditions, and achieve overall optimization and scheduling of ...
Bu, Fanjin+5 more
core
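Deep reinforcement learning is mainly used to handle perception-decision problems and has become an important research branch of artificial intelligence. This paper outlines two classes of deep reinforcement learning algorithms, those based on value functions and those based on policy gradients; explains in detail the principles of the deep Q-network, deep policy gradient, and related improved algorithms; and reviews research progress on applications of deep reinforcement learning in video games, navigation, multi-agent cooperation, and recommender systems. Finally, it gives an outlook on the algorithms and applications of deep reinforcement learning and offers suggestions on future research directions and hot topics.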
刘朝阳, 穆朝絮, 孙长银
doaj
The classic literature of Traditional Chinese Medicine is the crystallization of the wisdom of China's ancient medical masters. It provides the essence of and basis for the principles of syndrome differentiation and treatment, as well as the application of ...
TIAN Yuan (田媛)
doaj +1 more source
Software-defined networking QoS optimization based on deep reinforcement learning [PDF]
To address the problem that QoS optimization schemes based on heuristic algorithms often degrade because of a mismatch between parameters and network characteristics in software-defined networking scenarios, a software-defined networking QoS ...
Julong LAN+3 more
core +1 more source