Results 101 to 110 of about 7,890 (182)
ABSTRACT It is an elementary fact in the scientific literature that the Lipschitz norm of the realization function of a feedforward fully connected rectified linear unit (ReLU) artificial neural network (ANN) can, up to a multiplicative constant, be bounded from above by sums of powers of the norm of the ANN parameter vector.
Arnulf Jentzen, Timo Kröger
wiley +1 more source
Statistical Analysis of Distance Estimators with Density Differences and Density Ratios
Estimating a discrepancy between two probability distributions from samples is an important task in statistics and machine learning. There are mainly two classes of discrepancy measures: distance measures based on the density difference, such as the Lp ...
Takafumi Kanamori, Masashi Sugiyama
doaj +1 more source
On the Centroids of Symmetrized Bregman Divergences
In this paper, we generalize the notions of centroids and barycenters to the broad class of information-theoretic distortion measures called Bregman divergences. Bregman divergences are versatile, and unify quadratic geometric distances with various statistical entropic measures.
Frank Nielsen, Richard Nock
openaire +2 more sources
General H-theorem and Entropies that Violate the Second Law
H-theorem states that the entropy production is nonnegative and, therefore, the entropy of a closed system should monotonically change in time. In information processing, the entropy production is positive for random transformation of signals (the ...
Alexander N. Gorban
doaj +1 more source
Proper scoring rules and Bregman divergence
We revisit the mathematical foundations of proper scoring rules (PSRs) and Bregman divergences and present their characteristic properties in a unified theoretical framework. In many situations it is preferable not to generate a PSR directly from its convex entropy on the unit simplex but instead by the sublinear extension of the entropy to the ...
openaire +3 more sources
An abstract, quantitative theory which connects elements of information —key ingredients in the cognitive proces—is developed. Seemingly unrelated results are thereby unified.
Flemming Topsøe
doaj +1 more source
Robustness Property of Robust-BD Wald-Type Test for Varying-Dimensional General Linear Models
An important issue for robust inference is to examine the stability of the asymptotic level and power of the test statistic in the presence of contaminated data.
Xiao Guo, Chunming Zhang
doaj +1 more source
Total Jensen divergences: Definition, Properties and k-Means++ Clustering
We present a novel class of divergences induced by a smooth convex function called total Jensen divergences. Those total Jensen divergences are invariant by construction to rotations, a feature yielding regularization of ordinary Jensen divergences by a ...
Nielsen, Frank, Nock, Richard
core
Bregman Divergences and Triangle Inequality [PDF]
Sreangsu Acharyya +2 more
openaire +1 more source
Income Distributions and Decomposable Divergence Measures [PDF]
Inequality indices (i) evaluate the divergence between the income distribution and the hypothetical situation where all individuals have the mean income and (ii) are unambiguously reduced by a Pigou-Dalton progressive transfer.
Brice Magdalou, Richard Nock
core

