      Is Open Access

      Additive Feature Hashing

      Preprint


          Abstract

          The hashing trick is a machine learning technique used to encode categorical features into a numerical vector representation of pre-defined fixed length. It works by using the categorical hash values as vector indices, and updating the vector values at those indices. Here we discuss a different approach based on additive-hashing and the "almost orthogonal" property of high-dimensional random vectors. That is, we show that additive feature hashing can be performed directly by adding the hash values and converting them into high-dimensional numerical vectors. We show that the performance of additive feature hashing is similar to the hashing trick, and we illustrate the results numerically using synthetic, language recognition, and SMS spam detection data.
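The two encodings described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation. The function names, the use of SHA-256 as the hash, and the choice of seeding a pseudo-random ±1 vector from each token's hash are all assumptions made for this example. The abstract only states that hash values are added and converted into high-dimensional vectors.

```python
import hashlib
import random


def stable_hash(token: str) -> int:
    """Deterministic 64-bit hash of a token (SHA-256 is an assumption)."""
    digest = hashlib.sha256(token.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def hashing_trick(tokens, dim=1024):
    """Standard hashing trick: the hash selects an index, which is incremented."""
    vec = [0.0] * dim
    for tok in tokens:
        vec[stable_hash(tok) % dim] += 1.0
    return vec


def additive_hashing(tokens, dim=1024):
    """Hypothetical additive variant: each token's hash seeds a pseudo-random
    +1/-1 vector, and the per-token vectors are summed."""
    vec = [0.0] * dim
    for tok in tokens:
        rng = random.Random(stable_hash(tok))
        for i in range(dim):
            vec[i] += 1.0 if rng.getrandbits(1) else -1.0
    return vec


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sum(a * a for a in u) ** 0.5
    norm_v = sum(b * b for b in v) ** 0.5
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    red = additive_hashing(["red"])
    blue = additive_hashing(["blue"])
    both = additive_hashing(["red", "blue"])
    # Distinct tokens should be nearly orthogonal in high dimension,
    # while the summed vector stays correlated with each of its parts.
    print("cos(red, blue):", cosine(red, blue))
    print("cos(both, red):", cosine(both, red))
```

In this sketch the hashing trick writes each token to a single coordinate, so two tokens can collide on the same index. The additive variant spreads each token over all coordinates and relies on the "almost orthogonal" property: the cosine between two independent random ±1 vectors of dimension d concentrates around zero with spread of order 1/sqrt(d).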


          Author and article information

          Date: 07 February 2021
          arXiv: 2102.03943
          da48ab60-ecf7-40ae-ac1f-4d64cfe49756

          http://arxiv.org/licenses/nonexclusive-distrib/1.0/

          11 pages, 3 figures
          arXiv category: cs.LG

          Artificial intelligence
