6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Distribution of Euclidean Distances Between Randomly Distributed Gaussian Points in n-Space

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The curse of dimensionality is a common phenomenon which affects analysis of datasets characterized by large numbers of variables associated with each point. Problematic scenarios of this type frequently arise in classification algorithms which are heavily dependent upon distances between points, such as nearest-neighbor and \(k\)-means clustering. Given that contributing variables follow Gaussian distributions, this research derives the probability distribution that describes the distances between randomly generated points in n-space. The theoretical results are extended to examine additional properties of the distribution as the dimension becomes arbitrarily large. With this distribution of distances between randomly generated points in arbitrarily large dimensions, one can then determine the significance of distance measurements between any collection of individual points.

          Related collections

          Author and article information

          Journal
          10 August 2015
          Article
          1508.02238
          4e310ec3-b260-48ed-a6f0-dcf5ae238861

          http://arxiv.org/licenses/nonexclusive-distrib/1.0/

          History
          Custom metadata
          60D05 (Primary) 52A38, 53C65 (Secondary)
          13 pages, 4 figures
          math.PR

          Comments

          Comment on this article