Jump to content

Draft:Bayes Space (statistics)

From Wikipedia, the free encyclopedia


Bayes space is a function space defined as an equivalence class of measures with the same null-sets. Two measures are defined to be equivalent if they are proportional. The basic ideas of Bayes spaces have their roots in Compositional Data Analysis and the Aitchison geometry[1]. Applications are mainly in statistics, specifically functional data analysis of density functions[2][3].

The basic structure of the Bayes space is that of a vector space, with addition and multiplication being defined by perturbation and powering [4]. The space is formed over a -finite reference/base measure, denoted or depending on whether it is infinite or finite. Densities are considered as Radon-Nikodym derivatives of the measures with same null-sets as the base measure, and are equivalent if they are proportional. In case of finite base measures, Hilbert space structure can be achieved by defining a centered log-ratio on the measures, mapping them to a subset of consisting of funtions integrating to 0[5].

Definitions and main results

[edit]

Consider a finite base measure (not necessarily a probability measure) on a domain . This may be a uniform distribution on a bounded interval, or it can be a Radon-Nikodym derivative of the Lebesgue measure (the Gaussian distribution, for example). If we take two densities with respect to , they are said to be B-equivalent if there exists a s.t , denoted (the convention is used in cases where a measure is infinite). It can be shown that is an equivalence relation. The Bayes space is defined as the quotient space of all measures with the same null-sets in as under the equivalence relation .

The first challenge to analysing density functions is that is not linear space under ordinary addition and multiplication since the ordinary difference between two densities would not be non-negative everywhere. Like in the Aitchison geometry for finite dimensional data, perturbation and powering is defined for densities:

Perturbation

Powering

where are densities in and is some real number. It can be shown using the properties of multiplication and powering of real numbers that forms a vector space over the real numbers.

The definition of Bayes space does not strictly require a finite reference measure . If Bayes space is defined over an infinite reference measure , it must be -finite (like the Lebesgue measure). The finite reference measure is, however, necessary for adding Hilbert space structure to a subset of . Consider the subspace

. For , this is a linear subspace and isometrically isomorphic to the Hilbert space via the centered log-ratio (clr) transformation . The subspace of log-square integrable functions is termed the Bayes Hilbert space. It can be shown that the clr-transformation is a linear isomorphism between the two spaces. Defining an inner product on as the inner product of the clr-transformations will provide the Hilbert space structure for , obtaining the centered log-ratio as a linear isometry.

Multivariate densities

[edit]

The measure does not have to be univariate (1 dimensional), but can also be defined as a product measure on cartesian products, characterising bivariate (2 dimensional) or multivariate densities. The geometric structure of Hilbert spaces can be used to decompose multivariate densities into orthogonal independent and interaction parts[6][7], using the concept of "clr-marginals". This decomposition has relations to copula theory[7]. The geometry in defines norms on densities that can be used to quantify "relative simplicial deviance," which is measure of how much of a bivariate distribution can be explained by assuming independence[6].

See also

[edit]

References

[edit]
  1. ^ Egozcue, Juan José. "Hilbert space of probability density functions based on Aitchison geometry". Acta Mathematica Sinica: 1175–1183.
  2. ^ Hron, Karel (2015). "Simplicial principal component analysis for density functions in Bayes spaces". Computational Statistics & Data Analysis – via Elsevie Science Direct.
  3. ^ Talská, Renáta. "Compositional Scalar-on-Function Regression with Application to Sediment Particle Size Distributions". International Association for Mathematical Geosciences.
  4. ^ van den Boogart, Karl Gerald (2010). "Bayes linear spaces". SORT: statistics and operations research transactions. 34: 201–222.
  5. ^ van den Boogart, Karl Gerald (2014). "Bayes Hilbert Spaces". Australian & New Zealand Journal of Statistics: 171–194.
  6. ^ a b Hron, Karel. "Bivariate densities in Bayes spaces: orthogonal decomposition and spline representation". Stat papers. 64: 1629–1667 – via Springer Nature.
  7. ^ a b Škorňa, Stanislav. "Approximation of bivariate densities with compositional splines". arXiv.