Source author record

Chun Yang

Chun Yang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision physics.flu-dyn Artificial Intelligence Cryptography and Security Machine Learning cond-mat.str-el Information Retrieval eess.IV Hardware Architecture hep-ph Programming Languages

Catalog footprint

What is connected

21works

11topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2024arXiv

Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling

Scene text spotting is a challenging task, especially for inverse-like scene text, which has complex layouts, e.g., mirrored, symmetrical, or retro-flexed. In this paper, we propose a unified end-to-end trainable inverse-like antagonistic text spotting framework dubbed IATS, which can effectively spot inverse-like scene texts without sacrificing general ones. Specifically, we propose an innovative reading-order estimation module (REM) that extracts reading-order information from the initial text boundary generated by an initial boundary module (IBM). To optimize and train REM, we propose a joint reading-order estimation loss consisting of a classification loss, an orthogonality loss, and a distribution loss. With the help of IBM, we can divide the initial text boundary into two symmetric control points and iteratively refine the new text boundary using a lightweight boundary refinement module (BRM) for adapting to various shapes and scales. To alleviate the incompatibility between text detection and recognition, we propose a dynamic sampling module (DSM) with a thin-plate spline that can dynamically sample appropriate features for recognition in the detected text region. Without extra supervision, the DSM can proactively learn to sample appropriate features for text recognition through the gradient returned by the recognition module. Extensive experiments on both challenging scene text and inverse-like scene text datasets demonstrate that our method achieves superior performance both on irregular and inverse-like text spotting.

preprint2022arXiv

A Novel Deep Parallel Time-series Relation Network for Fault Diagnosis

Considering the models that apply the contextual information of time-series data could improve the fault diagnosis performance, some neural network structures such as RNN, LSTM, and GRU were proposed to model the fault diagnosis effectively. However, these models are restricted by their serial computation and hence cannot achieve high diagnostic efficiency. Also the parallel CNN is difficult to implement fault diagnosis in an efficient way because it requires larger convolution kernels or deep structure to achieve long-term feature extraction capabilities. Besides, BERT model applies absolute position embedding to introduce contextual information to the model, which would bring noise to the raw data and therefore cannot be applied to fault diagnosis directly. In order to address the above problems, a fault diagnosis model named deep parallel time-series relation network(DPTRN) has been proposed in this paper. There are mainly three advantages for DPTRN: (1) Our proposed time relationship unit is based on full multilayer perceptron(MLP) structure, therefore, DPTRN performs fault diagnosis in a parallel way and improves computing efficiency significantly. (2) By improving the absolute position embedding, our novel decoupling position embedding unit could be applied on the fault diagnosis directly and learn contextual information. (3) Our proposed DPTRN has obvious advantage in feature interpretability. We confirm the effect of the proposed method on four datasets, and the results show the effectiveness, efficiency and interpretability of the proposed DPTRN model.

preprint2022arXiv

Abusing Cache Line Dirty States to Leak Information in Commercial Processors

Caches have been used to construct various types of covert and side channels to leak information. Most existing cache channels exploit the timing difference between cache hits and cache misses. However, we introduce a new and broader classification of cache covert channel attacks: Hit+Miss, Hit+Hit, and Miss+Miss. We highlight that cache misses for cache lines in different states may have more significant time differences, and these can be used as timing channels. Based on this classification, we propose a new stable and stealthy Miss+Miss cache channel. Write-back caches are widely deployed in modern processors. This paper presents in detail a way in which replacement latency differences can be used to construct timing-based channels (called WB channels) to leak information in a write-back cache. Any modification to a cache line by a sender will set it to the dirty state, and the receiver can observe this through measuring the latency of replacing this cache set. We also demonstrate how senders could exploit a different number of dirty cache lines in a cache set to improve transmission bandwidth with symbols encoding multiple bits. The peak transmission bandwidths of the WB channels in commercial systems can vary between 1300 and 4400~kbps per cache set in a hyper-threaded setting without shared memory between the sender and the receiver. In contrast to most existing cache channels, which always target specific memory addresses, the new WB channels focus on the cache set and cache line states, making it difficult for the channel to be disturbed by other processes on the core, and they can still work in a cache using a random replacement policy. We also analyzed the stealthiness of WB channels from the perspective of the number of cache loads and cache miss rates. We discuss and evaluate possible defenses. The paper finishes by discussing various forms of side-channel attack.

preprint2022arXiv

Channel Self-Supervision for Online Knowledge Distillation

Recently, researchers have shown an increased interest in the online knowledge distillation. Adopting an one-stage and end-to-end training fashion, online knowledge distillation uses aggregated intermediated predictions of multiple peer models for training. However, the absence of a powerful teacher model may result in the homogeneity problem between group peers, affecting the effectiveness of group distillation adversely. In this paper, we propose a novel online knowledge distillation method, \textbf{C}hannel \textbf{S}elf-\textbf{S}upervision for Online Knowledge Distillation (CSS), which structures diversity in terms of input, target, and network to alleviate the homogenization problem. Specifically, we construct a dual-network multi-branch structure and enhance inter-branch diversity through self-supervised learning, adopting the feature-level transformation and augmenting the corresponding labels. Meanwhile, the dual network structure has a larger space of independent parameters to resist the homogenization problem during distillation. Extensive quantitative experiments on CIFAR-100 illustrate that our method provides greater diversity than OKDDip and we also give pretty performance improvement, even over the state-of-the-art such as PCL. The results on three fine-grained datasets (StanfordDogs, StanfordCars, CUB-200-211) also show the significant generalization capability of our approach.

preprint2022arXiv

Excitonic density-waves, bi-excitons and orbital selective pairing in two-orbital correlated chains

We present a comprehensive study of a one-dimensional two-orbital model at and below quarter-filling that realizes a number of unconventional phases. In particular, we find an excitonic density wave in which excitons quasi-condense with finite center of mass momentum and an order parameter that changes phase with wave-vector $Q$. In this phase, excitons behave as hard-core bosons without charge order. In addition, excitons can pair to form bi-excitons in a state that is close to a charge density-wave instability. When pairing dominates over the inter-orbital repulsion, we encounter a regime in which one orbital is metallic, while the other forms a spin gapped superconductor, a genuine orbital selective paired state. All these results are supported by both, analytical and numerical calculations. By assuming a quasi-classical approximation, we solve the three-body hole-electron-spinon problem and show that excitons are held together by forming a bound state with spinons. In order to preserve the antiferromagnetic background, excitons acquire a dispersion that has a minimum away from $k=0$. The full characterization of the different phases is obtained by means of extensive density matrix renormalization group calculations.

preprint2022arXiv

Open-set Text Recognition via Character-Context Decoupling

The open-set text recognition task is an emerging challenge that requires an extra capability to cognize novel characters during evaluation. We argue that a major cause of the limited performance for current methods is the confounding effect of contextual information over the visual information of individual characters. Under open-set scenarios, the intractable bias in contextual information can be passed down to visual information, consequently impairing the classification performance. In this paper, a Character-Context Decoupling framework is proposed to alleviate this problem by separating contextual information and character-visual information. Contextual information can be decomposed into temporal information and linguistic information. Here, temporal information that models character order and word length is isolated with a detached temporal attention module. Linguistic information that models n-gram and other linguistic statistics is separated with a decoupled context anchor mechanism. A variety of quantitative and qualitative experiments show that our method achieves promising performance on open-set, zero-shot, and close-set text recognition datasets.

preprint2022arXiv

SPR:Supervised Personalized Ranking Based on Prior Knowledge for Recommendation

The goal of a recommendation system is to model the relevance between each user and each item through the user-item interaction history, so that maximize the positive samples score and minimize negative samples. Currently, two popular loss functions are widely used to optimize recommender systems: the pointwise and the pairwise. Although these loss functions are widely used, however, there are two problems. (1) These traditional loss functions do not fit the goals of recommendation systems adequately and utilize prior knowledge information sufficiently. (2) The slow convergence speed of these traditional loss functions makes the practical application of various recommendation models difficult. To address these issues, we propose a novel loss function named Supervised Personalized Ranking (SPR) Based on Prior Knowledge. The proposed method improves the BPR loss by exploiting the prior knowledge on the interaction history of each user or item in the raw data. Unlike BPR, instead of constructing <user, positive item, negative item> triples, the proposed SPR constructs <user, similar user, positive item, negative item> quadruples. Although SPR is very simple, it is very effective. Extensive experiments show that our proposed SPR not only achieves better recommendation performance, but also significantly accelerates the convergence speed, resulting in a significant reduction in the required training time.

preprint2022arXiv

Supervised Contrastive Learning for Recommendation

In this work, we aim to consider the application of contrastive learning in the scenario of the recommendation system adequately, making it more suitable for recommendation task. We propose a learning paradigm called supervised contrastive learning(SCL) to support the graph convolutional neural network. Specifically, we will calculate the similarity between different nodes in user side and item side respectively during data preprocessing, and then when applying contrastive learning, not only will the augmented views be regarded as the positive samples, but also a certain number of similar samples will be regarded as the positive samples, which is different with SimCLR that treats other samples in a batch as negative samples. We apply SCL on the most advanced LightGCN. In addition, in order to consider the uncertainty of node interaction, we also propose a new data augment method called node replication. Empirical research and ablation study on Gowalla, Yelp2018, Amazon-Book datasets prove the effectiveness of SCL and node replication, which improve the accuracy of recommendations and robustness to interactive noise.

preprint2022arXiv

Towards Open-Set Text Recognition via Label-to-Prototype Learning

Scene text recognition is a popular topic and extensively used in the industry. Although many methods have achieved satisfactory performance for the close-set text recognition challenges, these methods lose feasibility in open-set scenarios, where collecting data or retraining models for novel characters could yield a high cost. For example, annotating samples for foreign languages can be expensive, whereas retraining the model each time when a novel character is discovered from historical documents costs both time and resources. In this paper, we introduce and formulate a new open-set text recognition task which demands the capability to spot and recognize novel characters without retraining. A label-to-prototype learning framework is also proposed as a baseline for the proposed task. Specifically, the framework introduces a generalizable label-to-prototype mapping function to build prototypes (class centers) for both seen and unseen classes. An open-set predictor is then utilized to recognize or reject samples according to the prototypes. The implementation of rejection capability over out-of-set characters allows automatic spotting of unknown characters in the incoming data stream. Extensive experiments show that our method achieves promising performance on a variety of zero-shot, close-set, and open-set text recognition datasets

preprint2020arXiv

DangKiller: Eliminating Dangling Pointers Efficiently via Implicit Identifier

Use-After-Free vulnerabilities, allowing the attacker to access unintended memory via dangling pointers, are more threatening. However, most detection schemes can only detect dangling pointers and invalid them, but not provide a tolerance mechanism to repair the errors at runtime. Also, these techniques obtain and manage the metadata inefficiently with complex structures and too much scan (sweep). The goal of this paper is to use compiler instrumentation to eliminate dangling pointers automatically and efficiently. In this paper, we observe that most techniques lack accurate efficient pointer graph metadata maintaining methods, so they need to scan the log to reduce the redundancy and sweep the whole address space to find dangling pointers. Also, they lack a direct, efficiently obtaining metadata approach. The key insight of this paper is that a unique identifier can be used as a key to a hash or direct-map algorithm. Thus, this paper maintains the same implicit identifier with each memory object and its corresponding referent. Associating the unique ID with metadata for memory objects, obtaining and managing the pointer graph metadata can be efficiently. Therefore, with the delayed free technique adopted into C/C++, we present the DangKiller as a novel and lightweight dangling pointer elimination solution. We first demonstrate the MinFat Pointer, which can calculate unique implicit ID for each object and pointer quickly, and use hash algorithm to obtain metadata. Secondly, we propose the Log Cache and Log Compression mechanism based on the ID to decrease the redundancy of dangling pointer candidates. Coupled with the Address Tagging architecture on an ARM64 system, our experiments show that the DangKiller can eliminate use-after-free vulnerabilities at only 11% and 3% runtime overheads for the SPEC CPU2006 and 2017 benchmarks respectively, except for unique cases.

preprint2020arXiv

Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection

Arbitrary shape text detection is a challenging task due to the high variety and complexity of scenes texts. In this paper, we propose a novel unified relational reasoning graph network for arbitrary shape text detection. In our method, an innovative local graph bridges a text proposal model via Convolutional Neural Network (CNN) and a deep relational reasoning network via Graph Convolutional Network (GCN), making our network end-to-end trainable. To be concrete, every text instance will be divided into a series of small rectangular components, and the geometry attributes (e.g., height, width, and orientation) of the small components will be estimated by our text proposal model. Given the geometry attributes, the local graph construction model can roughly establish linkages between different text components. For further reasoning and deducing the likelihood of linkages between the component and its neighbors, we adopt a graph-based network to perform deep relational reasoning on local graphs. Experiments on public available datasets demonstrate the state-of-the-art performance of our method.

preprint2020arXiv

IF-Net: An Illumination-invariant Feature Network

Feature descriptor matching is a critical step is many computer vision applications such as image stitching, image retrieval and visual localization. However, it is often affected by many practical factors which will degrade its performance. Among these factors, illumination variations are the most influential one, and especially no previous descriptor learning works focus on dealing with this problem. In this paper, we propose IF-Net, aimed to generate a robust and generic descriptor under crucial illumination changes conditions. We find out not only the kind of training data important but also the order it is presented. To this end, we investigate several dataset scheduling methods and propose a separation training scheme to improve the matching accuracy. Further, we propose a ROI loss and hard-positive mining strategy along with the training scheme, which can strengthen the ability of generated descriptor dealing with large illumination change conditions. We evaluate our approach on public patch matching benchmark and achieve the best results compared with several state-of-the-arts methods. To show the practicality, we further evaluate IF-Net on the task of visual localization under large illumination changes scenes, and achieves the best localization accuracy.

preprint2020arXiv

Saturation Memory Access: Mitigating Memory Spatial Errors without Terminating Programs

Memory spatial errors, i.e., buffer overflow vulnerabilities, have been a well-known issue in computer security for a long time and remain one of the root causes of exploitable vulnerabilities. Most of the existing mitigation tools adopt a fail-stop strategy to protect programs from intrusions, which means the victim program will be terminated upon detecting a memory safety violation. Unfortunately, the fail-stop strategy harms the availability of software. In this paper, we propose Saturation Memory Access (SMA), a memory spatial error mitigation mechanism that prevents out-of-bounds access without terminating a program. SMA is based on a key observation that developers generally do not rely on out-of-bounds accesses to implement program logic. SMA modifies dynamic memory allocators and adds paddings to objects to form an enlarged object boundary. By dynamically correcting all the out-of-bounds accesses to operate on the enlarged protecting boundaries, SMA can tolerate out-of-bounds accesses. For the sake of compatibility, we chose tagged pointers to record the boundary metadata of a memory object in the pointer itself, and correct the address upon detecting out-of-bounds access. We have implemented the prototype of SMA on LLVM 10.0. Our results show that our compiler enables the programs to execute successfully through buffer overflow attacks. Experiments on MiBench show that our prototype incurs an overhead of 78\%. Further optimizations would require ISA supports.

preprint2015arXiv

Electroosmosis in conducting nanofluidic channels

Theoretical modeling of electroosmosis through conducting (ideally polarizable) nanochannels is reported. Based on the theory of induced charge electrokinetics, a novel nanofluidic system which possesses both adjustable ion selective characteristics and flexible flow control is proposed. Such nanofluidic devices operate only with very low gate control voltage applied on the conductive walls of nanochannels, and thus even can be energized by normal batteries. We believe that it is possible to use such metal-electrolyte configurations to overcome the difficulties met with conventional metal-isolator-electrolyte systems for nanofluidic applications.

preprint2015arXiv

Electroosmotic mobilities of non-Newtonian fluids

Numerical analyses of transient electro-osmosis of a typical non-Newtonian liquid induced by DC and AC electric fields in a rectangular microchannel are conducted in the framework of continuum fluid mechanics. The famous power-law constitutive model is used to express the fluid dynamic viscosity in terms of the velocity gradient. Transient start-up characteristics of electro-osmotic power-law liquid flow in rectangular microchannels are simulated by using finite element method. Under a DC electric field, it is found out and the fluid is more inert to the external electric field and the steady-state velocity profile becomes more plug-like with decrease of the flow behavior index of the power-law liquids. The numerical calculations also confirm the validity of the generalized Smoluchowski slip velocity which can serve as the counterpart for the classic Smoluchowski slip velocity when dealing with electrokinetic flow of non-Newtonian power-law fluids. Under AC electric fields, the fluid is more obviously accelerated during oscillations and the amplitude of the oscillating velocity is closer to the magnitude of the generalized Smoluchowski velocity as the fluid behavior index increases. These dynamic predictions are of practical significance for the design of microfluidic devices that manipulate non-Newtonian fluids such as biofluids, polymer solutions and colloidal suspensions.

preprint2015arXiv

Numerical analysis of dynamic electro-osmotic flows of non-Newtonian fluids in rectangular microchannels

preprint2015arXiv

Spectral function of the 2D Hubbard model: a density matrix renormalization group plus cluster perturbation theory study

We study the spectral function of the 2D Hubbard model using cluster perturbation theory, and the density matrix renormalization group as a cluster solver. We reconstruct the two-dimensional dispersion at, and away from half-filling using 2xL ladders, with L up to 80 sites, yielding results with unprecedented resolution in excellent agreement with quantum Monte Carlo. The main features of the spectrum can be described with a mean-field dispersion, while kinks and pseudogap traced back to scattering between spin and charge degrees of freedom.

preprint2014arXiv

Learning to Diversify via Weighted Kernels for Classifier Ensemble

Classifier ensemble generally should combine diverse component classifiers. However, it is difficult to give a definitive connection between diversity measure and ensemble accuracy. Given a list of available component classifiers, how to adaptively and diversely ensemble classifiers becomes a big challenge in the literature. In this paper, we argue that diversity, not direct diversity on samples but adaptive diversity with data, is highly correlated to ensemble accuracy, and we propose a novel technology for classifier ensemble, learning to diversify, which learns to adaptively combine classifiers by considering both accuracy and diversity. Specifically, our approach, Learning TO Diversify via Weighted Kernels (L2DWK), performs classifier combination by optimizing a direct but simple criterion: maximizing ensemble accuracy and adaptive diversity simultaneously by minimizing a convex loss function. Given a measure formulation, the diversity is calculated with weighted kernels (i.e., the diversity is measured on the component classifiers' outputs which are kernelled and weighted), and the kernel weights are automatically learned. We minimize this loss function by estimating the kernel weights in conjunction with the classifier weights, and propose a self-training algorithm for conducting this convex optimization procedure iteratively. Extensive experiments on a variety of 32 UCI classification benchmark datasets show that the proposed approach consistently outperforms state-of-the-art ensembles such as Bagging, AdaBoost, Random Forests, Gasen, Regularized Selective Ensemble, and Ensemble Pruning via Semi-Definite Programming.

preprint2010arXiv

AC electrokinetic phenomena over semiconductive surfaces: effective electric boundary conditions and their applications

Electrokinetic boundary conditions are derived for AC electrokinetic (ACEK) phenomena over leaky dielectric (i.e., semiconducting) surfaces. Such boundary conditions correlate the electric potentials across the semiconductor-electrolyte interface (consisting of the electric double layer (EDL) inside the electrolyte solutions and the space charge layer (SCL) inside the semiconductors) under AC electric fields with arbitrary wave forms. The present electrokinetic boundary conditions allow for evaluation of induced zeta potential contributed by both bond charges (due to electric polarization) and free charges (due to electric conduction) from the leaky dielectric materials. Subsequently, we demonstrate the applications of these boundary conditions in analyzing the ACEK phenomena around a semiconducting cylinder. It is concluded that the flow circulations exist around the semiconducting cylinder and are shown to be stronger under an AC field with lower frequency and around a cylinder with higher conductivity.

preprint2010arXiv

Fine Splitting in Charmonium Spectrum with Channel Coupling Effect

We study the fine splitting in charmonium spectrum in quark model with the channel coupling effect, including $DD$, $DD^*$, $D^*D^*$ and $D_sD_s$, $D_sD_s^*$, $D_s^*D_s^*$ channels. The interaction for channel coupling is constructed from the current-current Lagrangian related to the color confinement and the one-gluon exchange potentials. By adopting the massive gluon propagator from the lattice calculation in the nonperturbative region, the coupling interaction is further simplified to the four-fermion interaction. The numerical calculation still prefers the assignment $1^{++}$ of X(3872).

preprint2010arXiv

Induced Charge Electrokinetic Phenomena in Tapered Conducting Nanochannels

This paper has been withdrawn by the author due to a crucial sign error in equation 1

Chun Yang

What is connected

Connect this record

See the researcher in context

Building this map preview

21 published item(s)

Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling

A Novel Deep Parallel Time-series Relation Network for Fault Diagnosis

Abusing Cache Line Dirty States to Leak Information in Commercial Processors

Channel Self-Supervision for Online Knowledge Distillation

Excitonic density-waves, bi-excitons and orbital selective pairing in two-orbital correlated chains

Open-set Text Recognition via Character-Context Decoupling

SPR:Supervised Personalized Ranking Based on Prior Knowledge for Recommendation

Supervised Contrastive Learning for Recommendation

Towards Open-Set Text Recognition via Label-to-Prototype Learning

DangKiller: Eliminating Dangling Pointers Efficiently via Implicit Identifier

Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection

IF-Net: An Illumination-invariant Feature Network

Saturation Memory Access: Mitigating Memory Spatial Errors without Terminating Programs

Electroosmosis in conducting nanofluidic channels

Electroosmotic mobilities of non-Newtonian fluids

Numerical analysis of dynamic electro-osmotic flows of non-Newtonian fluids in rectangular microchannels

Spectral function of the 2D Hubbard model: a density matrix renormalization group plus cluster perturbation theory study

Learning to Diversify via Weighted Kernels for Classifier Ensemble

AC electrokinetic phenomena over semiconductive surfaces: effective electric boundary conditions and their applications

Fine Splitting in Charmonium Spectrum with Channel Coupling Effect

Induced Charge Electrokinetic Phenomena in Tapered Conducting Nanochannels