Source author record

Chuang Zhang

Chuang Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Information Theory math.IT Computer Vision cond-mat.mes-hall Artificial Intelligence cond-mat.mtrl-sci Machine Learning Computation and Language Networking and Internet Architecture physics.comp-ph

Catalog footprint

What is connected

20works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

CLTS+: A New Chinese Long Text Summarization Dataset with Abstractive Summaries

The abstractive methods lack of creative ability is particularly a problem in automatic text summarization. The summaries generated by models are mostly extracted from the source articles. One of the main causes for this problem is the lack of dataset with abstractiveness, especially for Chinese. In order to solve this problem, we paraphrase the reference summaries in CLTS, the Chinese Long Text Summarization dataset, correct errors of factual inconsistencies, and propose the first Chinese Long Text Summarization dataset with a high level of abstractiveness, CLTS+, which contains more than 180K article-summary pairs and is available online. Additionally, we introduce an intrinsic metric based on co-occurrence words to evaluate the dataset we constructed. We analyze the extraction strategies used in CLTS+ summaries against other datasets to quantify the abstractiveness and difficulty of our new data and train several baselines on CLTS+ to verify the utility of it for improving the creative ability of models.

preprint2022arXiv

Mix-Teaching: A Simple, Unified and Effective Semi-Supervised Learning Framework for Monocular 3D Object Detection

Monocular 3D object detection is an essential perception task for autonomous driving. However, the high reliance on large-scale labeled data make it costly and time-consuming during model optimization. To reduce such over-reliance on human annotations, we propose Mix-Teaching, an effective semi-supervised learning framework applicable to employ both labeled and unlabeled images in training stage. Mix-Teaching first generates pseudo-labels for unlabeled images by self-training. The student model is then trained on the mixed images possessing much more intensive and precise labeling by merging instance-level image patches into empty backgrounds or labeled images. This is the first to break the image-level limitation and put high-quality pseudo labels from multi frames into one image for semi-supervised training. Besides, as a result of the misalignment between confidence score and localization quality, it's hard to discriminate high-quality pseudo-labels from noisy predictions using only confidence-based criterion. To that end, we further introduce an uncertainty-based filter to help select reliable pseudo boxes for the above mixing operation. To the best of our knowledge, this is the first unified SSL framework for monocular 3D object detection. Mix-Teaching consistently improves MonoFlex and GUPNet by significant margins under various labeling ratios on KITTI dataset. For example, our method achieves around +6.34% AP@0.7 improvement against the GUPNet baseline on validation set when using only 10% labeled data. Besides, by leveraging full training set and the additional 48K raw images of KITTI, it can further improve the MonoFlex by +4.65% improvement on AP@0.7 for car detection, reaching 18.54% AP@0.7, which ranks the 1st place among all monocular based methods on KITTI test leaderboard. The code and pretrained models will be released at https://github.com/yanglei18/Mix-Teaching.

preprint2022arXiv

Multi-class Label Noise Learning via Loss Decomposition and Centroid Estimation

In real-world scenarios, many large-scale datasets often contain inaccurate labels, i.e., noisy labels, which may confuse model training and lead to performance degradation. To overcome this issue, Label Noise Learning (LNL) has recently attracted much attention, and various methods have been proposed to design an unbiased risk estimator to the noise-free dataset to combat such label noise. Among them, a trend of works based on Loss Decomposition and Centroid Estimation (LDCE) has shown very promising performance. However, existing LNL methods based on LDCE are only designed for binary classification, and they are not directly extendable to multi-class situations. In this paper, we propose a novel multi-class robust learning method for LDCE, which is termed "MC-LDCE". Specifically, we decompose the commonly adopted loss (e.g., mean squared loss) function into a label-dependent part and a label-independent part, in which only the former is influenced by label noise. Further, by defining a new form of data centroid, we transform the recovery problem of a label-dependent part to a centroid estimation problem. Finally, by critically examining the mathematical expectation of clean data centroid given the observed noisy set, the centroid can be estimated which helps to build an unbiased risk estimator for multi-class learning. The proposed MC-LDCE method is general and applicable to different types (i.e., linear and nonlinear) of classification models. The experimental results on five public datasets demonstrate the superiority of the proposed MC-LDCE against other representative LNL methods in tackling multi-class label noise problem.

preprint2022arXiv

Non-monotonic heat dissipation phenomenon in close-packed quasi-2D and 3D hotspot system

Transient heat dissipation in close-packed quasi-2D nanoline and 3D nanocuboid hotspot systems is studied based on phonon Boltzmann transport equation. It is found that, counter-intuitively, the heat dissipation efficiency is not a monotonic function of the distance between adjacent nanoscale heat sources: the heat dissipation efficiency reaches the highest value when this distance is comparable to the phonon mean free path. This is due to the competition of two thermal transport processes: quasiballistic transport when phonons escape from the nanoscale heat source and the scattering among phonons originating from adjacent nanoscale heat source.

preprint2022arXiv

On the emergence of heat waves in the transient thermal grating geometry

The propagation of heat in the transient thermal grating geometry is studied based on phonon Boltzmann transport equation (BTE) in different phonon transport regimes. Our analytical and numerical results show that the phonon dispersion relation and temperature play a significant role in the emergence of heat wave. For the frequency-independent BTE, the heat wave appears as long as the phonon resistive scattering is not sufficient, while for the frequency-dependent BTE, the heat wave could disappear in the ballistic regime, depending on the grating period and temperature. We predict that the heat wave could appear in the suspended graphene and silicon in extremely low temperature but disappear at room temperature.

preprint2022arXiv

Self-Supervised Modality-Aware Multiple Granularity Pre-Training for RGB-Infrared Person Re-Identification

RGB-Infrared person re-identification (RGB-IR ReID) aims to associate people across disjoint RGB and IR camera views. Currently, state-of-the-art performance of RGB-IR ReID is not as impressive as that of conventional ReID. Much of that is due to the notorious modality bias training issue brought by the single-modality ImageNet pre-training, which might yield RGB-biased representations that severely hinder the cross-modality image retrieval. This paper makes first attempt to tackle the task from a pre-training perspective. We propose a self-supervised pre-training solution, named Modality-Aware Multiple Granularity Learning (MMGL), which directly trains models from scratch only on multi-modal ReID datasets, but achieving competitive results against ImageNet pre-training, without using any external data or sophisticated tuning tricks. First, we develop a simple-but-effective 'permutation recovery' pretext task that globally maps shuffled RGB-IR images into a shared latent permutation space, providing modality-invariant global representations for downstream ReID tasks. Second, we present a part-aware cycle-contrastive (PCC) learning strategy that utilizes cross-modality cycle-consistency to maximize agreement between semantically similar RGB-IR image patches. This enables contrastive learning for the unpaired multi-modal scenarios, further improving the discriminability of local features without laborious instance augmentation. Based on these designs, MMGL effectively alleviates the modality bias training problem. Extensive experiments demonstrate that it learns better representations (+8.03% Rank-1 accuracy) with faster training speed (converge only in few hours) and higher data efficiency (<5% data size) than ImageNet pre-training. The results also suggest it generalizes well to various existing models, losses and has promising transferability across datasets. The code will be released.

preprint2022arXiv

Towards Harnessing Feature Embedding for Robust Learning with Noisy Labels

The memorization effect of deep neural networks (DNNs) plays a pivotal role in recent label noise learning methods. To exploit this effect, the model prediction-based methods have been widely adopted, which aim to exploit the outputs of DNNs in the early stage of learning to correct noisy labels. However, we observe that the model will make mistakes during label prediction, resulting in unsatisfactory performance. By contrast, the produced features in the early stage of learning show better robustness. Inspired by this observation, in this paper, we propose a novel feature embedding-based method for deep learning with label noise, termed LabEl NoiseDilution (LEND). To be specific, we first compute a similarity matrix based on current embedded features to capture the local structure of training data. Then, the noisy supervision signals carried by mislabeled data are overwhelmed by nearby correctly labeled ones (\textit{i.e.}, label noise dilution), of which the effectiveness is guaranteed by the inherent robustness of feature embedding. Finally, the training data with diluted labels are further used to train a robust classifier. Empirically, we conduct extensive experiments on both synthetic and real-world noisy datasets by comparing our LEND with several representative robust learning approaches. The results verify the effectiveness of our LEND.

preprint2020arXiv

C-DLinkNet: considering multi-level semantic features for human parsing

Human parsing is an essential branch of semantic segmentation, which is a fine-grained semantic segmentation task to identify the constituent parts of human. The challenge of human parsing is to extract effective semantic features to resolve deformation and multi-scale variations. In this work, we proposed an end-to-end model called C-DLinkNet based on LinkNet, which contains a new module named Smooth Module to combine the multi-level features in Decoder part. C-DLinkNet is capable of producing competitive parsing performance compared with the state-of-the-art methods with smaller input sizes and no additional information, i.e., achiving mIoU=53.05 on the validation set of LIP dataset.

preprint2020arXiv

FGSD: A Dataset for Fine-Grained Ship Detection in High Resolution Satellite Images

Ship detection using high-resolution remote sensing images is an important task, which contribute to sea surface regulation. The complex background and special visual angle make ship detection relies in high quality datasets to a certain extent. However, there is few works on giving both precise classification and accurate location of ships in existing ship detection datasets. To further promote the research of ship detection, we introduced a new fine-grained ship detection datasets, which is named as FGSD. The dataset collects high-resolution remote sensing images that containing ship samples from multiple large ports around the world. Ship samples were fine categorized and annotated with both horizontal and rotating bounding boxes. To further detailed the information of the dataset, we put forward a new representation method of ships' orientation. For future research, the dock as a new class was annotated in the dataset. Besides, rich information of images were provided in FGSD, including the source port, resolution and corresponding GoogleEarth' s resolution level of each image. As far as we know, FGSD is the most comprehensive ship detection dataset currently and it'll be available soon. Some baselines for FGSD are also provided in this paper.

preprint2020arXiv

GINet: Graph Interaction Network for Scene Parsing

Recently, context reasoning using image regions beyond local convolution has shown great potential for scene parsing. In this work, we explore how to incorporate the linguistic knowledge to promote context reasoning over image regions by proposing a Graph Interaction unit (GI unit) and a Semantic Context Loss (SC-loss). The GI unit is capable of enhancing feature representations of convolution networks over high-level semantics and learning the semantic coherency adaptively to each sample. Specifically, the dataset-based linguistic knowledge is first incorporated in the GI unit to promote context reasoning over the visual graph, then the evolved representations of the visual graph are mapped to each local representation to enhance the discriminated capability for scene parsing. GI unit is further improved by the SC-loss to enhance the semantic representations over the exemplar-based semantic graph. We perform full ablation studies to demonstrate the effectiveness of each component in our approach. Particularly, the proposed GINet outperforms the state-of-the-art approaches on the popular benchmarks, including Pascal-Context and COCO Stuff.

preprint2016arXiv

Mobile Millimeter Wave Channel Acquisition, Tracking, and Abrupt Change Detection

Millimeter wave provides a promising approach for meeting the ever-growing traffic demand in next generation wireless networks. It is crucial to obtain relatively accurate channel state information so that beamforming/combining can be performed to compensate for severe path loss in this band. In contrast to lower frequencies, a typical mobile millimeter wave channel consists of a few dominant paths. It is generally sufficient to estimate the path gains, angles of departure (AoD), and angles of arrival (AoA) of those paths. In this paper, multiple transmit and receive antennas and beamforming with a single baseband processing chain are assumed. We propose a framework for estimating millimeter wave channels with intermittent abrupt changes (e.g., blockage or emergence of dominant paths) and slow variations of AoDs and AoAs. The solution consists of three components: tracking of the slow channel variations, detection of abrupt changes, followed by (re-)acquisition of channel (and back to the tracking stage). For acquisition, we formulate a least squares problem and find its solution based on the Levenberg-Marquardt algorithm. To track slow variations of AoDs and AoAs, we propose a new approach using Kalman filtering. Finally, an algorithm based on a likelihood test is devised for detecting abrupt changes. Simulation results show that, with moderate signal-to-noise ratios, the proposed scheme can achieve more than 8 dB higher estimation accuracy than several other methods using the same number of pilots.

preprint2016arXiv

Spintronics of Organometal Trihalide Perovskites

The family of organometal trihalide perovskite (OTP), CH3NH3PbX3 (where X is halogen) has recently revolutionized the photovoltaics field and shows promise in a variety of optoelectronic applications. The characteristic spin properties of charge and neutral excitations in OTPs are influenced by the large spin-orbit coupling of the Pb atoms, which may lead to spin-based device applications. Here we report the first studies of pure spin-current and spin-aligned carrier injection in OTP spintronics devices using spin-pumping and spin-injection, respectively. We measure a relatively large inverse-spin-Hall effect using pulsed microwave excitation in OTP devices at resonance with a ferromagnetic substrate, from which we derive room temperature spin diffusion length, lambda_sd~9nm; and low-temperature giant magnetoresistance in OTP-based spin-valves from which we estimate lambda_sd~85nm.

preprint2015arXiv

Inverse Spin Hall Effect from pulsed Spin Current in Organic Semiconductors with Tunable Spin-Orbit Coupling

Exploration of spin-currents in organic semiconductors (OSECs) induced by resonant microwave absorption in ferromagnetic substrates has been of great interest for potential spintronics applications. Due to the inherently weak spin-orbit coupling (SOC) of OSECs, their inverse spin Hall effect (ISHE) response is very subtle; limited by the microwave power applicable under continuous-wave (cw) excitation. Here we introduce a novel approach for generating significant ISHE signals using pulsed ferromagnetic resonance, where the ISHE is ~2-3 orders of magnitude larger compared to cw excitation. This strong ISHE enables us to investigate a variety of OSECs ranging from pi-conjugated polymers with strong SOC that contain intrachain platinum atoms, to weak SOC polymers, to C60 films, where the SOC is predominantly caused by the molecule surface curvature. The pulsed-ISHE technique offers a robust route for efficient injection and detection schemes of spin-currents at room temperature, and paves the way for spin-orbitronics in plastic materials.

preprint2015arXiv

Power Allocation for Mixed Traffic Broadcast with Service Outage Constraint

To transmit a mixture of real-time and non-real-time traffic in a broadcast system, we impose a basic service rate $r_0$ for real-time traffic and use the excess rate beyond $r_0$ to transmit non-real-time traffic. Considering the time-varying nature of wireless channels, the basic service rate is guaranteed with a service outage constraint, where service outage occurs when the channel capacity is below the basic service rate. This approach is well suited for providing growing services like video, real-time TV, etc., in group transportation systems such as coach, high-speed train, and airplane. We show that the optimal power allocation policy depends only on the statistics of the minimum gain of all user channels, and it is a combination of water-filling and channel inversion. We provide the optimal power allocation policy, which guarantees that real-time traffic be delivered with quality of service (QoS) for every user. Moreover, we show that the required minimum average power to satisfy the service outage constraint increases linearly with the number of users.

preprint2015arXiv

Tracking Angles of Departure and Arrival in a Mobile Millimeter Wave Channel

Millimeter wave provides a very promising approach for meeting the ever-growing traffic demand in next generation wireless networks. To utilize this band, it is crucial to obtain the channel state information in order to perform beamforming and combining to compensate for severe path loss. In contrast to lower frequencies, a typical millimeter wave channel consists of a few dominant paths. Thus it is generally sufficient to estimate the path gains, angles of departure (AoDs), and angles of arrival (AoAs) of those paths. Proposed in this paper is a dual timescale model to characterize abrupt channel changes (e.g., blockage) and slow variations of AoDs and AoAs. This work focuses on tracking the slow variations and detecting abrupt changes. A Kalman filter based tracking algorithm and an abrupt change detection method are proposed. The tracking algorithm is compared with the adaptive algorithm due to Alkhateeb, Ayach, Leus and Heath (2014) in the case with single radio frequency chain. Simulation results show that to achieve the same tracking performance, the proposed algorithm requires much lower signal-to-noise-ratio (SNR) and much fewer pilots than the other algorithm. Moreover, the change detection method can always detect abrupt changes with moderate number of pilots and SNR.

preprint2014arXiv

Downlink Resource Allocation for the High-speed Train and Local Users in OFDMA Systems

We consider providing services for passengers in a high-speed train and local users (quasi-static users) in a single OFDMA system. For the train, we apply a two-hop architecture, under which, passengers communicate with base stations (BSs) via a mobile relay (MR) installed in the train cabin. With this architecture, all passengers in the train can be represented by the MR. Since the channels of the MR and local users vary differently, we consider allocating system resources (power and subcarriers) over two time-scales for them. We formulate the problem as a capacity optimization problem for the MR subject to the sum capacity constraint of local users. We treat the inter-carrier interference (ICI) at the MR as additive Gaussian noise and derive an explicit expression for the ICI using the two-path Doppler spread model. Then we discuss the optimization problem and propose an optimal power and subcarrier allocation (OPSA) policy. The capacity obtained using OPSA is compared with that of constant power and subcarrier allocation (CPSA) policies. Simulation results justify the optimality of the OPSA. Besides, by comparing the capacity bounds achieved by OPSA with and without ICI, we find that only in specific regions, where the gap between the capacity bounds is large, do practical ICI cancellation methods provide meaningful rate gain.

preprint2014arXiv

Wireless Information and Energy Transfer for Two-Hop Non-Regenerative MIMO-OFDM Relay Networks

This paper investigates the simultaneous wireless information and energy transfer for the non-regenerative multipleinput multiple-output orthogonal frequency-division multiplexing (MIMO-OFDM) relaying system. By considering two practical receiver architectures, we present two protocols, time switchingbased relaying (TSR) and power splitting-based relaying (PSR). To explore the system performance limit, we formulate two optimization problems to maximize the end-to-end achievable information rate with the full channel state information (CSI) assumption. Since both problems are non-convex and have no known solution method, we firstly derive some explicit results by theoretical analysis and then design effective algorithms for them. Numerical results show that the performances of both protocols are greatly affected by the relay position. Specifically, PSR and TSR show very different behaviors to the variation of relay position. The achievable information rate of PSR monotonically decreases when the relay moves from the source towards the destination, but for TSR, the performance is relatively worse when the relay is placed in the middle of the source and the destination. This is the first time to observe such a phenomenon. In addition, it is also shown that PSR always outperforms TSR in such a MIMO-OFDM relaying system. Moreover, the effect of the number of antennas and the number of subcarriers are also discussed.

preprint2013arXiv

Channel Service Based High Speed Railway Base Station Arrangement

With the rapid development of high-speed railways, demands on high mobility wireless communication increase greatly. To provide stable and high data rate wireless access for users in the train, it is necessary to properly deploy base stations along the railway. In this paper, we consider this issue from the perspective of channel service which is defined as the integral of the time-varying instantaneous channel capacity. It will show that the total service quantity of each base station is a constant. In order to keep high service efficiency of the railway communication system with multiple base stations along the railway, we need to use the time division to schedule the multiple stations and allow one base station to work when the train is running close to it. In this way, we find a fact that if the ratio of the service quantity provided by each station to its total service quantity is given, the base station interval(i.e. the distance between two adjacent base stations) is a constant, regardless of the speed of the train. On the other hand, interval between two neighboring base stations will increase with the speed of the train. Furthermore, using the concept of channel service, we also analyze the transmission strategy of base stations.

preprint2013arXiv

Every-user delay guarantee for wireless multiple access systems

The quality of service (QoS) requirements are usually different from user to user in a multiaccess system, and it is necessary to take the different requirements into account when allocating the shared resources of the system. In this paper, we consider one QoS criterion--delay in a multiaccess system, and we combine information theory and queueing theory in an attempt to analyze whether a multiaccess system can meet the different delay requirements of users. For users with the same transmission power, we prove that only $N$ inequalities are necessary for the checking, and for users with different transmission powers, we provide a polynomial-time algorithm for such a decision. In cases where the system cannot satisfy the delay requirements of all users, we prove that as long as the sum power is larger than a threshold, there is always an approach to adjust the transmission power of each user to make the system delay feasible if power reallocation is available.

preprint2013arXiv

Service based hight-speed railway base station arrangement

To provide stable and high data rate wireless access for passengers in the train, it is necessary to properly deploy base stations along the railway. We consider this issue from the perspective of service, which is defined as the integral of the time-varying instantaneous channel capacity. With large-scale fading assumption, it will be shown that the total service of each base station is inversely proportional to the velocity of the train. Besides, we find that if the ratio of the service provided by a base station in its service region to its total service is given, the base station interval (i.e. the distance between two adjacent base stations) is a constant regardless of the velocity of the train. On the other hand, if a certain amount of service is required, the interval will increase with the velocity of the train. The above results apply not only to simple curve rails, like line rail and arc rail, but also to any irregular curve rail, provided that the train is travelling at a constant velocity. Furthermore, the new developed results are applied to analyze the on-off transmission strategy of base stations.

Chuang Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

20 published item(s)

CLTS+: A New Chinese Long Text Summarization Dataset with Abstractive Summaries

Mix-Teaching: A Simple, Unified and Effective Semi-Supervised Learning Framework for Monocular 3D Object Detection

Multi-class Label Noise Learning via Loss Decomposition and Centroid Estimation

Non-monotonic heat dissipation phenomenon in close-packed quasi-2D and 3D hotspot system

On the emergence of heat waves in the transient thermal grating geometry

Self-Supervised Modality-Aware Multiple Granularity Pre-Training for RGB-Infrared Person Re-Identification

Towards Harnessing Feature Embedding for Robust Learning with Noisy Labels

C-DLinkNet: considering multi-level semantic features for human parsing

FGSD: A Dataset for Fine-Grained Ship Detection in High Resolution Satellite Images

GINet: Graph Interaction Network for Scene Parsing

Mobile Millimeter Wave Channel Acquisition, Tracking, and Abrupt Change Detection

Spintronics of Organometal Trihalide Perovskites

Inverse Spin Hall Effect from pulsed Spin Current in Organic Semiconductors with Tunable Spin-Orbit Coupling

Power Allocation for Mixed Traffic Broadcast with Service Outage Constraint

Tracking Angles of Departure and Arrival in a Mobile Millimeter Wave Channel

Downlink Resource Allocation for the High-speed Train and Local Users in OFDMA Systems

Wireless Information and Energy Transfer for Two-Hop Non-Regenerative MIMO-OFDM Relay Networks

Channel Service Based High Speed Railway Base Station Arrangement

Every-user delay guarantee for wireless multiple access systems

Service based hight-speed railway base station arrangement