Source author record

Jiayuan He

Jiayuan He appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

eess.SP Machine Learning Computation and Language Computer Vision

Catalog footprint

What is connected

4works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal Data

Recently, Self-Supervised Representation Learning (SSRL) has attracted much attention in the field of computer vision, speech, natural language processing (NLP), and recently, with other types of modalities, including time series from sensors. The popularity of self-supervised learning is driven by the fact that traditional models typically require a huge amount of well-annotated data for training. Acquiring annotated data can be a difficult and costly process. Self-supervised methods have been introduced to improve the efficiency of training data through discriminative pre-training of models using supervisory signals that have been freely obtained from the raw data. Unlike existing reviews of SSRL that have pre-dominately focused upon methods in the fields of CV or NLP for a single modality, we aim to provide the first comprehensive review of multimodal self-supervised learning methods for temporal data. To this end, we 1) provide a comprehensive categorization of existing SSRL methods, 2) introduce a generic pipeline by defining the key components of a SSRL framework, 3) compare existing models in terms of their objective function, network architecture and potential applications, and 4) review existing multimodal techniques in each category and various modalities. Finally, we present existing weaknesses and future opportunities. We believe our work develops a perspective on the requirements of SSRL in domains that utilise multimodal and/or temporal data

preprint2021arXiv

Memorization vs. Generalization: Quantifying Data Leakage in NLP Performance Evaluation

Public datasets are often used to evaluate the efficacy and generalizability of state-of-the-art methods for many tasks in natural language processing (NLP). However, the presence of overlap between the train and test datasets can lead to inflated results, inadvertently evaluating the model's ability to memorize and interpreting it as the ability to generalize. In addition, such data sets may not provide an effective indicator of the performance of these methods in real world scenarios. We identify leakage of training data into test data on several publicly available datasets used to evaluate NLP tasks, including named entity recognition and relation extraction, and study them to assess the impact of that leakage on the model's ability to memorize versus generalize.

preprint2020arXiv

Experimental Demonstration of Millimeter-Wave Radio-over-Fiber System with Convolutional Neural Network (CNN) and Binary Convolutional Neural Network (BCNN)

The millimeter-wave (mm-wave) radio-over-fiber (RoF) systems have been widely studied as promising solutions to deliver high-speed wireless signals to end users, and neural networks have been studied to solve various linear and nonlinear impairments. However, high computation cost and large amounts of training data are required to effectively improve the system performance. In this paper, we propose and demonstrate highly computation efficient convolutional neural network (CNN) and binary convolutional neural network (BCNN) based decision schemes to solve these limitations. The proposed CNN and BCNN based decision schemes are demonstrated in a 5 Gbps 60 GHz RoF system for up to 20 km fiber distance. Compared with previously demonstrated neural networks, results show that the bit error rate (BER) performance and the computation intensive training process are improved. The number of training iterations needed is reduced by about 50 % and the amount of required training data is reduced by over 30 %. In addition, only one training is required for the entire measured received optical power range over 3.5 dB in the proposed CNN and BCNN schemes, to further reduce the computation cost of implementing neural networks decision schemes in mm-wave RoF systems.

preprint2020arXiv

FPGA-based Neural Network Accelerator for Millimeter-Wave Radio-over-Fiber Systems

With the rapidly-developing high-speed wireless communications, the 60 GHz millimeter-wave frequency range and radio-over-fiber systems have been investigated as a promising solution to deliver mm-wave signals. Neural networks have been studied to improve the mm-wave RoF system performances at the receiver side by suppressing linear and nonlinear impairments. However, previous neural network studies in mm-wave RoF systems focus on the off-line implementation with high-end GPUs , which is not practical for low power-consumption, low-cost and limited computation platform applications. To solve this issue, we investigate neural network hardware accelerator implementations using the field programmable gate array (FPGA), taking advantage of the low power consumption, parallel computation capability, and reconfigurablity features of FPGA. Convolutional neural network (CNN) and binary convolutional neural network (BCNN) hardware accelerators are demonstrated. In addition, to satisfy the low-latency requirement in mm-wave RoF systems and to enable the use of low-cost compact FPGA devices, a novel inner parallel optimization method is proposed. Compared with the embedded processor (ARM Cortex A9) execution latency, the CNN/BCNN FPGA-based hardware accelerator reduces their latency by over 92%. Compared with non-optimized FPGA implementations, the proposed optimization method reduces the processing latency by over 44% for CNN and BCNN. Compared with the GPU implementation, the latency of CNN implementation with the proposed optimization method is reduced by 85.49%, while the power consumption is reduced by 86.91%. Although the latency of BCNN implementation with the proposed optimization method is larger compared with the GPU implementation, the power consumption is reduced by 86.14%. The FPGA-based neural network hardware accelerators provide a promising solution for mm-wave RoF systems.

Jiayuan He

What is connected

Connect this record

See the researcher in context

Building this map preview

4 published item(s)

Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal Data

Memorization vs. Generalization: Quantifying Data Leakage in NLP Performance Evaluation

Experimental Demonstration of Millimeter-Wave Radio-over-Fiber System with Convolutional Neural Network (CNN) and Binary Convolutional Neural Network (BCNN)

FPGA-based Neural Network Accelerator for Millimeter-Wave Radio-over-Fiber Systems