Results 11 to 20 of about 4,328 (167)
Adaptive pilot design for OFDM based on deep reinforcement learning [PDF]
For orthogonal frequency division multiplexing (OFDM) systems, an adaptive pilot design algorithm based on deep reinforcement learning was proposed.The pilot design problem was formulated as a Markov decision process, where the index of pilot positions ...
Qiaoshou LIU +3 more
core +1 more source
Study on Intelligent Recommendation Method of Dueling Network Reinforcement Learning Based on Regret Exploration [PDF]
In recent years,the application of deep reinforcement learning in recommendation system has attracted much attention.Based on the existing research,this paper proposes a new recommendation model RP-Dueling,which is based on the deep reinforcement ...
HONG Zhi-li, LAI Jun, CAO Lei, CHEN Xi-liang, XU Zhi-xiong
core +1 more source
Objective To analyze the development of self-regulated learning research in China based on CNKI database, and to provide reference for future research on the theory and practice of self-regulated learning.
CHEN Qiannan (陈倩楠) +4 more
doaj +1 more source
Joint beam hopping and coverage control optimization algorithm for multibeam satellite system [PDF]
To improve the performance of multibeam satellite (MBS) systems, a deep reinforcement learning-based algorithm to jointly optimize the beam hopping and coverage control (BHCC) algorithm for MBS was proposed.Firstly, the resource allocation problem in MBS
Feng CHEN +3 more
core +1 more source
游戏化课程设计及新型教学法在商务汉语中的综合应用 [PDF]
高年级商务汉语课由于其学生差异较大,教材内容专业性强、实效性低等特殊矛盾对我们的教 学也提出了特殊的要求。为了更好地应对上述挑战,本文的两位作者在密西根大学四年级商务 汉语课程中不断尝试,将多种创新教学法融入课程设计和教学实践当中,并取得了良好的效果。 本文将通过对课程架构及具体教学活动设计的展示,介绍并探讨新型教学法在高级汉语课程中 的综合使用。具体内容包括如何使用游戏化课程设计法构建一个更加灵活的平台,为学生提供 个性化的学习体验并激发其内在的学习动机。此外,通过项目及任务型教学法 ...
刘倩 , 于晓盈
doaj +1 more source
QL-STCT: an intelligent routing convergence method for SDN link failure [PDF]
Aiming at the problem of routing convergence when SDN link failure occurs, a Q-Learning sub-topological convergence technique (QL-STCT) was proposed to realize intelligent route convergence when SDN links fail.Firstly, some nodes were selected in the ...
Chao CHEN +7 more
core +1 more source
Survey on reinforcement learning based adaptive bit rate algorithm for mobile video streaming services [PDF]
In recent years, with the continuous release of HTTP adaptive streaming (HAS) video datasets and network trace datasets, the machine learning methods, such as deep learning and reinforcement learning, have been continuously applied to adaptive bit rate ...
Jiafeng LI +4 more
core +1 more source
在多智能体强化学习的研究中,参数共享作为学习过程中一种信息集中的方式,可以有效地缓解不稳定性导致的学习低效性。但是,在实际应用中智能体使用同样的策略往往会带来不利影响。为了解决此类过度共享的问题,提出了一种新的方法来赋予智能体自动识别可能受益于共享参数的智能体的能力,并且可以在学习过程中动态地选择共享参数的对象。具体来说,智能体需要将历史轨迹编码为可表示其潜在意图的隐信息,并通过与其余智能体隐信息的对比选择共享参数的对象。实验表明,提出的方法在多智能体系统中不仅可以提高参数共享的效率 ...
王涵, 俞扬, 姜远
doaj +1 more source
Software-defined networking QoS optimization based on deep reinforcement learning [PDF]
To solve the problem that the QoS optimization schemes which based on heuristic algorithm degraded often due to the mismatch between parameters and network characteristics in software-defined networking scenarios,a software-defined networking QoS ...
Julong LAN +3 more
core +1 more source
Research progress of deep reinforcement learning applied to text generation [PDF]
With the recent exciting achievements of Google’s artificial intelligence system in the game of Go, deep reinforcement learning (DRL) has witnessed considerable development.
Cong XU +4 more
core +1 more source

