19
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Hardware Acceleration for RLNC: A Case Study Based on the Xtensa Processor with the Tensilica Instruction-Set Extension

      , , , , , , , ,
      Electronics
      MDPI AG

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Random linear network coding (RLNC) can greatly aid data transmission in lossy wireless networks. However, RLNC requires computationally complex matrix multiplications and inversions in finite fields (Galois fields). These computations are highly demanding for energy-constrained mobile devices. The presented case study evaluates hardware acceleration strategies for RLNC in the context of the Tensilica Xtensa LX5 processor with the tensilica instruction set extension (TIE). More specifically, we develop TIEs for multiply-accumulate (MAC) operations for accelerating matrix multiplications in Galois fields, single instruction multiple data (SIMD) instructions operating on consecutive memory locations, as well as the flexible-length instruction extension (FLIX). We evaluate the number of clock cycles required for RLNC encoding and decoding without and with the MAC, SIMD, and FLIX acceleration strategies. We also evaluate the RLNC encoding and decoding throughput and energy consumption for a range of RLNC generation and code word sizes. We find that for GF ( 2 8 ) and GF ( 2 16 ) RLNC encoding, the SIMD and FLIX acceleration strategies achieve speedups of approximately four hundred fold compared to a benchmark C code implementation without TIE. We also find that the unicore Xtensa LX5 with SIMD has seven to thirty times higher RLNC encoding and decoding throughput than the state-of-the-art ODROID XU3 system-on-a-chip (SoC) operating with a single core; the Xtensa LX5 with FLIX, in turn, increases the throughput by roughly 25% compared to utilizing only SIMD. Furthermore, the Xtensa LX5 with FLIX consumes roughly four orders of magnitude less energy than the ODROID XU3 SoC.

          Related collections

          Most cited references61

          • Record: found
          • Abstract: not found
          • Article: not found

          A Random Linear Network Coding Approach to Multicast

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            On the Delay-Storage Trade-Off in Content Download from Coded Distributed Storage Systems

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              VLSI Architectures for Computing Multiplications and Inverses in GF(2m)

              Shao, Troung, Omura (1985)
                Bookmark

                Author and article information

                Journal
                ELECGJ
                Electronics
                Electronics
                MDPI AG
                2079-9292
                September 2018
                September 08 2018
                : 7
                : 9
                : 180
                Article
                10.3390/electronics7090180
                8b6f5c48-b863-4403-bb65-f2e427407d70
                © 2018

                https://creativecommons.org/licenses/by/4.0/

                History

                Comments

                Comment on this article