Price of Competition and Dueling Games
We study competition in a general framework introduced by Immorlica et al. and answer their main open question. Immorlica et al. considered classic optimization problems in terms of competition and introduced a general class of games called dueling games.
Dehghani, Sina +3 more
core +2 more sources
Distributional Reinforcement Learning with Quantile Regression
In reinforcement learning an agent interacts with the environment by taking actions and observing the next state and reward. When sampled probabilistically, these state transitions, rewards, and actions can all induce randomness in the observed long-term
Bellemare, Marc G. +3 more
core +1 more source
Speed Bump and Stock Market Quality: Evidence From NYSE American
Should trading speed of high‐frequency traders be regulated? Using data from the New York Stock Exchange American, this paper examines the impact of a speed bump on market liquidity and price discovery. Our results indicate that the use of a speed bump can lower the costs of adverse selection by reducing informed trading.
Bo Liu, Ke Xu
wiley +1 more source
Analysis of methods for simulating character encounters in a game with RPG elements
This paper investigates algorithms that predict the outcome of a duel in a game with RPG elements and determine the losses incurred. The aim is to evaluate the effectiveness of the following approaches: one based on Lanchester's laws and a stochastic one, using ...
Michał Zdybel, Jakub Smołka
doaj +1 more source
Rainbow: Combining Improvements in Deep Reinforcement Learning
The deep reinforcement learning community has made several independent improvements to the DQN algorithm. However, it is unclear which of these extensions are complementary and can be fruitfully combined.
Azar, Mohammad +9 more
core +1 more source
This paper introduces SVCC‐HPPO, a novel Signal‐Vehicle Cooperative Control framework using an improved Hierarchical Proximal Policy Optimisation (H‐PPO) algorithm to jointly optimise traffic signal timing and Connected/Autonomous Vehicle (CAV) trajectories in mixed traffic environments.
Zongyuan Wu +5 more
wiley +1 more source
A Relative Exponential Weighing Algorithm for Adversarial Utility-based Dueling Bandits
We study the K-armed dueling bandit problem which is a variation of the classical Multi-Armed Bandit (MAB) problem in which the learner receives only relative feedback about the selected pairs of arms.
Clérot, Fabrice +2 more
core +2 more sources
Catch Laffer If You Can: Tax Take in an Evasion‐Detection Game
In a simple taxation framework, we analyze a taxpayer's decision of whether to report income truthfully or engage in an evasion game with the tax agency. Specifically, taxpayer and tax agency can expend efforts, respectively, to conceal income and detect evasion. These activities are costly, and the final outcome—whether evasion is detected or
Rosaria Distefano, Francesco Reito
wiley +1 more source
Autonomous vehicles (AVs) are one of the building blocks of modern intelligent transportation systems and have the potential to change some aspects related to mobility, safety, and operational efficiency. In this paper, we analyze recent progress in AV algorithms and simulation frameworks, emphasizing their roles in decision‐making processes ...
Majd Alkorabi +2 more
wiley +1 more source
Dynamic Models and Nonlinear Filtering of Wave Propagation in Random Fields
In this paper, a general model of wireless channels is established based on the physics of wave propagation. Then the problems of inverse scattering and channel prediction are formulated as nonlinear filtering problems.
Wei, Haiqing
core +2 more sources

