RMIT University
Browse

Recurrent neural network FPGA hardware accelerator for delay-tolerant indoor optical wireless communications

journal contribution
posted on 2024-11-02, 17:49 authored by Jeonghun Lee, Tingting SongTingting Song, Jiayuan HeJiayuan He, Kandeepan SithamparanathanKandeepan Sithamparanathan, Ke WangKe Wang
The optical wireless communication (OWC) system has been widely studied as a promising solution for high-speed indoor applications. The transmitter diversity scheme has been proposed to improve the performance of high-speed OWC systems. However, the transmitter diversity is vulnerable to the delay of multiple channels. Recently neural networks have been studied to realize delay-tolerant indoor OWC systems, where long-short term memory (LSTM) and attention-augmented LSTM (ALSTM) recurrent neural networks (RNNs) have shown their capabilities. However, they have high computation complexity and long computation latency. In this paper, we propose a low complexity delay-tolerant RNN scheme for indoor OWC systems. In particular, an RNN with parallelized structure is proposed to reduce the computation cost. The proposed RNN schemes show comparable capability to the more complicated ALSTM, where a bit-error-rate (BER) performance within the forward-error-correction (FEC) limit is achieved for up to 5.5 symbol periods delays. In addition, previously studied LSTM/ALSTM schemes are implemented using high-end GPUs, which have high cost, high power consumption, and long processing latency. To solve these practical limitations, in this paper we further propose and demonstrate the FPGA-based RNN hardware accelerator for delay-tolerant indoor OWC systems. To optimize the processing latency and power consumption, we also propose two optimization methods: The parallel implementation with triple-phase clocking and the stream-in based computation with additive input data insertion. Results show that the FPGA-based RNN hardware accelerator with the proposed optimization methods achieves 96.75% effective latency reduction and 90.7% lower energy consumption per symbol compared with the FPGA-based RNN hardware accelerator without optimization. Compared to the GPU implementation, the latency is reduced by about 61% and the power consumption is reduced by about 58.1%.

Funding

Optical wireless frontier: Design challenges of multi gigabit wireless

Australian Research Council

Find out more...

History

Related Materials

  1. 1.
    DOI - Is published in 10.1364/OE.427250
  2. 2.
    ISSN - Is published in 10944087

Journal

Optics Express

Volume

29

Issue

16

Start page

26165

End page

26182

Total pages

18

Publisher

Optical Society of America

Place published

United States

Language

English

Copyright

© 2021 Optical Society of America under the terms of the OSA Open Access Publishing Agreemen

Former Identifier

2006110350

Esploro creation date

2021-10-17

Usage metrics

    Scholarly Works

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC