Stochastic duel - Open Access .click

Results 11 to 20 of about 2,593 (166)

Price of Competition and Dueling Games [PDF]

, 2016
We study competition in a general framework introduced by Immorlica et al. and answer their main open question. Immorlica et al. considered classic optimization problems in terms of competition and introduced a general class of games called dueling games.
Dehghani, Sina +3 more
core +2 more sources

Distributional Reinforcement Learning with Quantile Regression

, 2017
In reinforcement learning an agent interacts with the environment by taking actions and observing the next state and reward. When sampled probabilistically, these state transitions, rewards, and actions can all induce randomness in the observed long-term
Bellemare, Marc G. +3 more
core +1 more source

Speed Bump and Stock Market Quality: Evidence From NYSE American

Financial Management, EarlyView.
ABSTRACT Should trading speed of high‐frequency traders be regulated? Using the data from the New York Stock Exchange American, this paper examines the impact of a speed bump on market liquidity and price discovery. Our results indicate that the use of a speed bump can lower the costs of adverse selection through reducing informed trading.
Bo Liu, Ke Xu
wiley +1 more source

Analysis of methods for simulating character encounters in a game with RPG elements

Journal of Computer Sciences Institute
This paper investigates algorithms that predict the outcome of a duel in a game with RPG elements and determine the losses incurred. The aim is to evaluate the effectiveness of the following approaches: based on Lanchester's laws and stochastic, using ...
Michał Zdybel, Jakub Smołka
doaj +1 more source

Rainbow: Combining Improvements in Deep Reinforcement Learning

, 2017
The deep reinforcement learning community has made several independent improvements to the DQN algorithm. However, it is unclear which of these extensions are complementary and can be fruitfully combined.
Azar, Mohammad +9 more
core +1 more source

Signal Timing and CAV Trajectory Joint Control Under Mixed Vehicular Environments With Hierarchical Proximal Policy Optimisation

IET Intelligent Transport Systems, Volume 20, Issue 1, January/December 2026.
This paper introduces SVCC‐HPPO, a novel Signal‐Vehicle Cooperative Control framework using an improved Hierarchical Proximal Policy Optimisation (H‐PPO) algorithm to jointly optimise traffic signal timing and Connected/Autonomous Vehicle (CAV) trajectories in mixed traffic environments.
Zongyuan Wu +5 more
wiley +1 more source

A Relative Exponential Weighing Algorithm for Adversarial Utility-based Dueling Bandits [PDF]

, 2015
We study the K-armed dueling bandit problem which is a variation of the classical Multi-Armed Bandit (MAB) problem in which the learner receives only relative feedback about the selected pairs of arms.
Clérot, Fabrice, Gajane, Pratik, Urvoy, Tanguy +2 more
core +2 more sources

Catch Laffer If You Can: Tax Take in an Evasion‐Detection Game

Journal of Public Economic Theory, Volume 27, Issue 6, December 2025.
ABSTRACT In a simple taxation framework, we analyze a taxpayer's decision of whether to report income truthfully or engage in an evasion game with the tax agency. Specifically, taxpayer and tax agency can expend efforts, respectively, to conceal income and detect evasion. These activities are costly, and the final outcome—whether evasion is detected or
Rosaria Distefano, Francesco Reito
wiley +1 more source

Deep Learning Algorithms for Autonomous Vehicle Communications: Technical Insights and Open Challenges

Concurrency and Computation: Practice and Experience, Volume 37, Issue 21-22, 25 September 2025.
ABSTRACT Autonomous vehicles (AVs) are one of the building blocks of modern intelligent transportation systems and have the potential to change some aspects related to mobility, safety, and operational efficiency. In this paper, we analyze recent progress in AV algorithms and simulation frameworks, emphasizing their roles in decision‐making processes ...
Majd Alkorabi, Alireza Souri, Nihat İnanç +2 more
wiley +1 more source

Dynamic Models and Nonlinear Filtering of Wave Propagation in Random Fields

, 2004
In this paper, a general model of wireless channels is established based on the physics of wave propagation. Then the problems of inverse scattering and channel prediction are formulated as nonlinear filtering problems.
Wei, Haiqing
core +2 more sources

16. peace & justice
duel game
stochastic model

fluctuation theory
2-person games
stochastic games, stochastic differential games

fos: mathematics
machine learning cs.lg
fos: computer and information sciences