Reinforcement Learning - Open Access .click

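Chinese Journal of Wildlife (野生动物学报), 2021
Modern zoos use positive-reinforcement behavioral training to teach animals new behaviors or to increase desired behaviors in which animals cooperate with keepers' routine husbandry operations. When an animal learns a complex desired behavior, the new behavior to be shaped must first be broken down into several simple steps, and these small steps are then combined to reach the final desired criterion. At Fuzhou Zoo, an adult male mandrill (Mandrillus sphinx) was reluctant to enter its cage and, even when food was plentiful, would squat at the feeding point for long periods and drive away subordinate group members. Using positive-reinforcement training, the trainers got the dominant animal to shift cages on cue, eat in the indoor enclosure, and allow group members to obtain food, and they reduced the fear of the subordinate animals, with good results. The training also reduced the difficulty of the keepers' daily operations ...
Chen Lin (陈霖)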
doaj

Self‐Consumption Translation: Exploring Interlingual Translation Within Multilingual (mainland) China

International Journal of Applied Linguistics, Volume 36, Issue 1, Page 714-724, February 2026.
ABSTRACT Interlingual translation, as defined by Roman Jakobson, refers to the transfer of meaning between languages. However, this concept has often been conflated with linguistic shifts between distinct cultures and nation‐states. To challenge this misconception, I propose the concept of self‐consumption translation (SCT), a subfield of interlingual ...
Bilin (Belen) Liu
wiley +1 more source

A Preliminary Exploration of Improving the Integrated Laboratory Curriculum System for Long-Program Medical Majors

Zhongguo shiyan zhenduanxue, 2017
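With the development of China's economy and society, the growth of higher medical education, and the progress of medical models, medical science is becoming increasingly integrated and socialized, and society places ever higher demands on medical students' knowledge, abilities, and overall quality. Cultivating medical students' scientific literacy and innovative ability has become an important part of modern medical education [1]. This pushes medical education to place greater emphasis on discovering and developing students' abilities, cultivating their capacity for self-directed and lifelong learning,
Guo Lirong (郭丽荣) +7 more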
doaj

Output Synchronization of Heterogeneous Multi-Agent Systems: A Data-Based Reinforcement Learning Approach

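Chinese Journal of Intelligent Science and Technology (智能科学与技术学报), 2020
This paper uses reinforcement learning to study the output synchronization problem of heterogeneous multi-agent systems. Based on the topology of the multi-agent system, a performance index and a value function that include the neighbors' control inputs are defined. To overcome the drawback that existing control methods require a system model, a reinforcement learning algorithm based on system data is proposed, so that the output synchronization controller can also be applied when the model is unknown. In addition, adjusting the weight matrices in the value function can reduce each agent's control cost. Finally, a simulation example verifies the effectiveness of the method and the advantage of the defined value function.
Liu Yingying (刘莹莹), Wang Zhanshan (王占山)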
doaj

Review of Application on Optimization Strategies for New-Type Power System Based on Reinforcement Learning [PDF]

Objectives: As power systems evolve toward higher levels of intelligence and automation, reinforcement learning (RL), a key technology in artificial intelligence, shows great potential in the intelligent development of the power sector.
WANG Kai, YAN Zhengyi, ZHAO Kang
core +1 more source

IEEE 802.15.4 differentiated service strategy based on reinforcement-learning [PDF]

2015
To provide better support for differentiated service in IEEE 802.15.4, a novel differentiated service mechanism was proposed based on BCS (backoff counter scheme) and reinforcement learning. In terms of the end device, the BCS backoff strategy was added to the original ...
Liang QIAN, Tian-ping LI, Wei QUAN, Zhi-hong QIAN +3 more
core +1 more source

The role of child language ability and parental mentalization in early child dysregulation

Infant Mental Health Journal: Infancy and Early Childhood, Volume 47, Issue 1, January 2026.
Abstract Dysregulation in early childhood is associated with increased vulnerability to psychopathology and poor psychosocial outcomes. While there is evidence that both child language ability and parental mentalization are associated with dysregulation in early childhood, there is little understanding of the relationships between these variables, and ...
Sara Cibralic +6 more
wiley +1 more source

High flexibility of heterogeneous tri-robot collaborative handling [PDF]

This paper proposes a reinforcement learning (RL)-based control framework utilizing the proximal policy optimization (PPO) algorithm to address compliance issues in cooperative transportation tasks for heterogeneous tri-robot systems.
Chunyu QI +5 more
core +1 more source

A Mini Review on Evolution of High‐Entropy Alloy Design: From Experimental Approaches to Machine Learning Integration

Rare Metals, Volume 45, Issue 1, January 2026.
ABSTRACT High‐entropy alloys (HEAs) have emerged as a transformative class of materials distinguished by their complex chemical compositions, unique microstructures, and remarkable mechanical and functional properties. Traditionally, the discovery and optimization of HEAs have relied on conventional methods, including trial‐and‐error experimentation ...
Chrispin Ouko Zamzu, Zhe Jia, Baolong Shen +2 more
wiley +1 more source

A Dual-Channel Semi-Supervised Network Representation Learning Model

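Big Data Research (大数据)
In semi-supervised network representation learning, node labels provide important guidance for establishing the mapping relations of a network across different spaces. In many practical tasks, however, the available label information is often limited or hard to obtain, so sufficient and effective supervision cannot be provided while learning low-dimensional network representations. To address this problem, a dual-channel semi-supervised network representation learning model is proposed. The model uses an autoencoder as its basic framework and consists of two information-passing channels, one self-supervised and one semi-supervised. The self-supervised signal and the label information guide the establishment of the network representation mapping in their respective channels, while complementing and reinforcing each other. Because the two channels may carry redundant information, a redundancy identification and elimination mechanism is designed from a mutual-information perspective. On this basis ...
Du Hangyuan (杜航原), Xie Fuzhong (谢富中), Wang Wenjian (王文剑), Bai Liang (白亮)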
doaj +1 more source
