Source author record

Kun Zhao

Kun Zhao appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Catalog footprint

What is connected

42 works

33 topics

4 close collaborators


preprint, 2026, arXiv

Aligning Findings with Diagnosis: A Self-Consistent Reinforcement Learning Framework for Trustworthy Radiology Reporting

Multimodal Large Language Models (MLLMs) have shown strong potential for radiology report generation, yet their clinical translation is hindered by architectural heterogeneity and the prevalence of factual hallucinations. Standard supervised fine-tuning often fails to strictly align linguistic outputs with visual evidence, while existing reinforcement learning approaches struggle with either prohibitive computational costs or limited exploration. To address these challenges, we propose a comprehensive framework for self-consistent radiology report generation. First, we conduct a systematic evaluation to identify optimal vision encoder and LLM backbone configurations for medical imaging. Building on this foundation, we introduce a novel "Reason-then-Summarize" architecture optimized via Group Relative Policy Optimization (GRPO). This framework restructures generation into two distinct components: a think block for detailed findings and an answer block for structured disease labels. By utilizing a multi-dimensional composite reward function, we explicitly penalize logical discrepancies between the generated narrative and the final diagnosis. Extensive experiments on the MIMIC-CXR benchmark demonstrate that our method achieves state-of-the-art performance in clinical efficacy metrics and significantly reduces hallucinations compared to strong supervised baselines.
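The group-relative step in GRPO can be illustrated with a minimal sketch: each sampled report's reward is normalized against the mean and standard deviation of its own sampling group. The `composite_reward` function below is purely hypothetical. The abstract does not specify the reward terms, so the label-overlap accuracy term, the findings-versus-answer disagreement penalty, and `consistency_weight` are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against the mean and std of its sampling group."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


def composite_reward(findings_labels, answer_labels, reference_labels,
                     consistency_weight=0.5):
    """Hypothetical reward: label accuracy minus a self-consistency penalty.

    findings_labels: diseases implied by the narrative (think block)
    answer_labels: diseases listed in the structured answer block
    reference_labels: ground-truth diseases
    """
    findings = set(findings_labels)
    answer = set(answer_labels)
    reference = set(reference_labels)

    # Assumed accuracy term: Jaccard overlap between answer and reference.
    union_ref = answer | reference
    accuracy = len(answer & reference) / len(union_ref) if union_ref else 1.0

    # Assumed penalty: fraction of labels on which narrative and answer disagree.
    union_self = findings | answer
    inconsistency = len(findings ^ answer) / len(union_self) if union_self else 0.0

    return accuracy - consistency_weight * inconsistency


# Usage: score a group of sampled reports, then convert rewards to advantages.
group = [
    (["edema"], ["edema"], ["edema"]),
    (["edema"], ["pneumonia"], ["edema"]),
]
rewards = [composite_reward(f, a, ref) for f, a, ref in group]
advantages = group_relative_advantages(rewards)
```

Because the advantages are computed within a group, no separate value network is needed, which is the usual motivation for GRPO over critic-based methods.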

preprint, 2026, arXiv

GRAPHMOE: Amplifying Cognitive Depth of Mixture-of-Experts Network via Introducing Self-Rethinking Mechanism

Traditional Mixture-of-Experts (MoE) networks benefit from utilizing multiple smaller expert models as opposed to a single large network. However, these experts typically operate independently, leaving a question open about whether interconnecting these models could enhance the performance of MoE networks. In response, we introduce GRAPHMOE, a novel method aimed at augmenting the cognitive depth of language models via a self-rethinking mechanism constructed on Pseudo GraphMoE networks. GRAPHMOE employs a recurrent routing strategy to simulate iterative thinking steps, thereby facilitating the flow of information among expert nodes. We implement the GRAPHMOE architecture using Low-Rank Adaptation techniques (LoRA) and conduct extensive experiments on various benchmark datasets. The experimental results reveal that GRAPHMOE outperforms other LoRA based models, achieving state-of-the-art (SOTA) performance. Additionally, this study explores a novel recurrent routing strategy that may inspire further advancements in enhancing the reasoning capabilities of language models.

preprint, 2023, arXiv

Credible Remote Sensing Scene Classification Using Evidential Fusion on Aerial-Ground Dual-view Images

Due to their ability to offer more comprehensive information than data from a single view, multi-view (multi-source, multi-modal, multi-perspective, etc.) data are being used more frequently in remote sensing tasks. However, as the number of views grows, the issue of data quality becomes more apparent, limiting the potential benefits of multi-view data. Although recent deep neural network (DNN) based models can learn the weight of data adaptively, a lack of research on explicitly quantifying the data quality of each view when fusing them renders these models inexplicable, performing unsatisfactorily and inflexible in downstream remote sensing tasks. To fill this gap, in this paper, evidential deep learning is introduced to the task of aerial-ground dual-view remote sensing scene classification to model the credibility of each view. Specifically, the theory of evidence is used to calculate an uncertainty value which describes the decision-making risk of each view. Based on this uncertainty, a novel decision-level fusion strategy is proposed to ensure that the view with lower risk obtains more weight, making the classification more credible. On two well-known, publicly available datasets of aerial-ground dual-view remote sensing images, the proposed approach achieves state-of-the-art results, demonstrating its effectiveness. The code and datasets of this article are available at the following address: https://github.com/gaopiaoliang/Evidential.
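As a rough illustration of the idea, the sketch below derives a per-view uncertainty from non-negative class evidence using the standard subjective-logic form (Dirichlet parameters alpha = evidence + 1, uncertainty u = K / sum(alpha)), then weights each view's class probabilities by its confidence. The weighting rule in `fuse` is an assumption for illustration only; the paper's actual fusion strategy is in the linked repository and may differ.

```python
def opinion(evidence):
    """Return (expected class probabilities, uncertainty) for one view.

    evidence: non-negative evidence per class, e.g. from a network's output.
    """
    num_classes = len(evidence)
    alpha = [e + 1.0 for e in evidence]      # Dirichlet parameters
    strength = sum(alpha)                    # Dirichlet strength S
    probs = [a / strength for a in alpha]    # expected probabilities
    uncertainty = num_classes / strength     # u = K / S, in (0, 1]
    return probs, uncertainty


def fuse(aerial_evidence, ground_evidence):
    """Hypothetical decision-level fusion: lower-uncertainty view gets more weight."""
    p_a, u_a = opinion(aerial_evidence)
    p_g, u_g = opinion(ground_evidence)
    w_a, w_g = 1.0 - u_a, 1.0 - u_g
    total = w_a + w_g
    if total == 0.0:
        # Neither view carries any evidence: fall back to equal weights.
        w_a = w_g = 0.5
    else:
        w_a, w_g = w_a / total, w_g / total
    return [w_a * a + w_g * g for a, g in zip(p_a, p_g)]


# Usage: a confident aerial view and an uninformative ground view.
fused = fuse([9.0, 0.0, 0.0], [0.0, 0.0, 0.0])
```

A view with no evidence has uncertainty 1 and therefore contributes nothing to the fused prediction under this weighting.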

preprint, 2023, arXiv

Semantic Interleaving Global Channel Attention for Multilabel Remote Sensing Image Classification

Multi-Label Remote Sensing Image Classification (MLRSIC) has received increasing research interest. Taking the co-occurrence relationship of multiple labels as additional information helps to improve the performance of this task. Current methods focus on using it to constrain the final feature output of a Convolutional Neural Network (CNN). On the one hand, these methods do not make full use of label correlation to form feature representations. On the other hand, they increase the label noise sensitivity of the system, resulting in poor robustness. In this paper, a novel method called Semantic Interleaving Global Channel Attention (SIGNA) is proposed for MLRSIC. First, the label co-occurrence graph is obtained according to the statistical information of the data set. The label co-occurrence graph is used as the input of the Graph Neural Network (GNN) to generate optimal feature representations. Then, the semantic features and visual features are interleaved to guide the feature expression of the image from the original feature space to the semantic feature space with embedded label relations. SIGNA triggers global attention over feature map channels in a new semantic feature space to extract more important visual features. Multihead SIGNA-based feature adaptive weighting networks are proposed to act on any layer of a CNN in a plug-and-play manner. For remote sensing images, better classification performance can be achieved by inserting SIGNA into the shallow layers of the CNN. We conduct extensive experimental comparisons on three data sets: the UCM data set, the AID data set, and the DFC15 data set. Experimental results demonstrate that the proposed SIGNA achieves superior classification performance compared to state-of-the-art (SOTA) methods. Our code is available at https://github.com/kyle-one/SIGNA for reproducibility research.

preprint, 2022, arXiv

Context-Aware Scene Prediction Network (CASPNet)

Predicting the future motion of surrounding road users is a crucial and challenging task for autonomous driving (AD) and various advanced driver-assistance systems (ADAS). Planning a safe future trajectory heavily depends on understanding the traffic scene and anticipating its dynamics. The challenges do not only lie in understanding the complex driving scenarios but also the numerous possible interactions among road users and environments, which are practically not feasible for explicit modeling. In this work, we tackle the above challenges by jointly learning and predicting the motion of all road users in a scene, using a novel convolutional neural network (CNN) and recurrent neural network (RNN) based architecture. Moreover, by exploiting grid-based input and output data structures, the computational cost is independent of the number of road users and multi-modal predictions become inherent properties of our proposed method. Evaluation on the nuScenes dataset shows that our approach reaches state-of-the-art results in the prediction benchmark.

preprint, 2022, arXiv

Few-Shot Class-Incremental Learning from an Open-Set Perspective

The continual appearance of new objects in the visual world poses considerable challenges for current deep learning methods in real-world deployments. The challenge of new task learning is often exacerbated by the scarcity of data for the new categories due to rarity or cost. Here we explore the important task of Few-Shot Class-Incremental Learning (FSCIL) and its extreme data scarcity condition of one-shot. An ideal FSCIL model needs to perform well on all classes, regardless of their presentation order or paucity of data. It also needs to be robust to open-set real-world conditions and be easily adapted to the new tasks that always arise in the field. In this paper, we first reevaluate the current task setting and propose a more comprehensive and practical setting for the FSCIL task. Then, inspired by the similarity of the goals for FSCIL and modern face recognition systems, we propose our method -- Augmented Angular Loss Incremental Classification or ALICE. In ALICE, instead of the commonly used cross-entropy loss, we propose to use the angular penalty loss to obtain well-clustered features. As the obtained features not only need to be compactly clustered but also diverse enough to maintain generalization for future incremental classes, we further discuss how class augmentation, data augmentation, and data balancing affect classification performance. Experiments on benchmark datasets, including CIFAR100, miniImageNet, and CUB200, demonstrate the improved performance of ALICE over the state-of-the-art FSCIL methods.

preprint, 2022, arXiv

Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis

In this paper, we propose a novel prosody disentanglement method for prosodic Text-to-Speech (TTS) models, which introduces vector quantization (VQ) into the auxiliary prosody encoder to obtain decomposed prosody representations in an unsupervised manner. With this approach, speaking styles such as pitch, speaking velocity, and local pitch variance are decomposed automatically into the latent quantized vectors. We also investigate the internal mechanism of the VQ disentanglement process by means of a latent variable counter and find that higher-value dimensions usually represent prosody information. Experiments show that our model can control the speaking styles of synthesis results by directly manipulating the latent variables. Objective and subjective evaluations show that our model outperforms popular models.

preprint, 2021, arXiv

Genetic Algorithm based hyper-parameters optimization for transfer Convolutional Neural Network

Hyperparameter optimization is a challenging problem in developing deep neural networks. Deciding which layers to transfer and which to train is a major task in the design of transfer convolutional neural networks (CNNs). Conventional transfer CNN models are usually manually designed based on intuition. In this paper, a genetic algorithm is applied to select the trainable layers of the transfer model. The filter criterion is constructed from the accuracy and the count of trainable layers. The results show that the method is competent in this task. The system converges with a precision of 97% in the classification of the Cats and Dogs dataset in no more than 15 generations. Moreover, backward inference according to the results of the genetic algorithm shows that our method can capture the gradient features in network layers, which contributes to the understanding of transfer AI models.

preprint, 2020, arXiv

Inorganic photovoltaic cells based on BiFeO3: spontaneous polarization, lattice matching, light polarization and their relationship to photovoltaic performance

Inorganic ferroelectric perovskite oxides are more stable than hybrid perovskites. However, their solar energy harvesting efficiency is relatively poor. Here, by constructing a series of BiFeO3-based devices (solar cells), we investigated three factors that influence the photovoltaic performance: spontaneous polarization, the terminating ion species at the interface between BiFeO3 and the electrode, and polarized light irradiation. This work was carried out in the framework of density functional theory combined with non-equilibrium Green's function theory under a built-in electric field or finite bias. The results showed that (1) the photocurrent is larger only under a suitable electronic band gap rather than larger spontaneous polarization; (2) with a SrTiO3 electrode, the photocurrent is largest for a Bi-ion-terminated interface compared with Fe-ion or O-ion termination; and (3) the photocurrent could be largely enhanced if the polarization direction of the monochromatic light is perpendicular to the spontaneous polarization direction. The results would deepen the understanding of some experimental results on BiFeO3-based solar cells.

preprint, 2020, arXiv

Large Time Behavior and Diffusion Limit for a System of Balance Laws From Chemotaxis in Multi-dimensions

We consider the Cauchy problem for a system of balance laws derived from a chemotaxis model with singular sensitivity in multiple space dimensions. Utilizing energy methods, we first prove the global well-posedness of classical solutions to the Cauchy problem when only the energy of the first order spatial derivatives of the initial data is sufficiently small, and the solutions are shown to converge to the prescribed constant equilibrium states as time goes to infinity. Then we prove that the solutions of the fully dissipative model converge to those of the corresponding partially dissipative model when the chemical diffusion coefficient tends to zero.

preprint, 2020, arXiv

Learning to Estimate Driver Drowsiness from Car Acceleration Sensors using Weakly Labeled Data

This paper addresses the learning task of estimating driver drowsiness from the signals of car acceleration sensors. Since even drivers themselves cannot perceive their own drowsiness in a timely manner unless they use burdensome invasive sensors, obtaining labeled training data for each timestamp is not a realistic goal. To deal with this difficulty, we formulate the task as a weakly supervised learning. We only need to add labels for each complete trip, not for every timestamp independently. By assuming that some aspects of driver drowsiness increase over time due to tiredness, we formulate an algorithm that can learn from such weakly labeled data. We derive a scalable stochastic optimization method as a way of implementing the algorithm. Numerical experiments on real driving datasets demonstrate the advantages of our algorithm against baseline methods.

preprint, 2020, arXiv

SID: Incremental Learning for Anchor-Free Object Detection via Selective and Inter-Related Distillation

Incremental learning requires a model to continually learn new tasks from streaming data. However, traditional fine-tuning of a well-trained deep neural network on a new task will dramatically degrade performance on the old task -- a problem known as catastrophic forgetting. In this paper, we address this issue in the context of anchor-free object detection, which is a new trend in computer vision as it is simple, fast, and flexible. Simply adapting current incremental learning strategies fails on these anchor-free detectors due to lack of consideration of their specific model structures. To deal with the challenges of incremental learning on anchor-free object detectors, we propose a novel incremental learning paradigm called Selective and Inter-related Distillation (SID). In addition, a novel evaluation metric is proposed to better assess the performance of detectors under incremental learning conditions. By selective distilling at the proper locations and further transferring additional instance relation knowledge, our method demonstrates significant advantages on the benchmark datasets PASCAL VOC and COCO.

preprint, 2020, arXiv

SOS: Selective Objective Switch for Rapid Immunofluorescence Whole Slide Image Classification

The difficulty of processing gigapixel whole slide images (WSIs) in clinical microscopy has been a long-standing barrier to implementing computer aided diagnostic systems. Since modern computing resources are unable to perform computations at this extremely large scale, current state of the art methods utilize patch-based processing to preserve the resolution of WSIs. However, these methods are often resource intensive and make significant compromises on processing time. In this paper, we demonstrate that conventional patch-based processing is redundant for certain WSI classification tasks where high resolution is only required in a minority of cases. This reflects what is observed in clinical practice; where a pathologist may screen slides using a low power objective and only switch to a high power in cases where they are uncertain about their findings. To eliminate these redundancies, we propose a method for the selective use of high resolution processing based on the confidence of predictions on downscaled WSIs --- we call this the Selective Objective Switch (SOS). Our method is validated on a novel dataset of 684 Liver-Kidney-Stomach immunofluorescence WSIs routinely used in the investigation of autoimmune liver disease. By limiting high resolution processing to cases which cannot be classified confidently at low resolution, we maintain the accuracy of patch-level analysis whilst reducing the inference time by a factor of 7.74.
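The confidence-gated switch described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: `low_res_model`, `high_res_model`, and the `threshold` value are hypothetical stand-ins, each model being any callable that maps a slide to a list of class probabilities.

```python
def selective_objective_switch(slide, low_res_model, high_res_model,
                               threshold=0.9):
    """Classify at low resolution; escalate only when confidence is low.

    Returns (predicted class index, which resolution produced it).
    """
    probs = low_res_model(slide)
    label = max(range(len(probs)), key=probs.__getitem__)
    if probs[label] >= threshold:
        # Confident enough: skip the expensive patch-level pass.
        return label, "low"

    # Uncertain: fall back to high-resolution processing.
    probs = high_res_model(slide)
    label = max(range(len(probs)), key=probs.__getitem__)
    return label, "high"


# Usage with stand-in models (real models would run a network on the slide).
def cheap_model(slide):
    return [0.55, 0.45]


def costly_model(slide):
    return [0.10, 0.90]


prediction, source = selective_objective_switch("slide-001", cheap_model, costly_model)
```

The inference-time saving depends on how often the low-resolution pass clears the threshold, so the threshold trades speed against the risk of accepting a wrong low-resolution prediction.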

preprint, 2020, arXiv

The existence of optimal control for continuous-time Markov decision processes in random environments

In this work, we investigate the optimal control problem for continuous-time Markov decision processes with the random impact of the environment. We provide conditions to show the existence of optimal controls under finite-horizon criteria. Under appropriate conditions, the value function is continuous and satisfies the dynamic programming principle. These results are established by introducing some restriction on the regularity of the optimal controls and by developing a new compactification method for continuous-time Markov decision processes, which is originally used to solve the optimal control problem for jump-diffusion processes.

preprint, 2019, arXiv

CORAL8: Concurrent Object Regression for Area Localization in Medical Image Panels

This work tackles the problem of generating a medical report for multi-image panels. We apply our solution to the Renal Direct Immunofluorescence (RDIF) assay which requires a pathologist to generate a report based on observations across the eight different WSI in concert with existing clinical features. To this end, we propose a novel attention-based multi-modal generative recurrent neural network (RNN) architecture capable of dynamically sampling image data concurrently across the RDIF panel. The proposed methodology incorporates text from the clinical notes of the requesting physician to regulate the output of the network to align with the overall clinical context. In addition, we found the importance of regularizing the attention weights for word generation processes. This is because the system can ignore the attention mechanism by assigning equal weights for all members. Thus, we propose two regularizations which force the system to utilize the attention mechanism. Experiments on our novel collection of RDIF WSIs provided by a large clinical laboratory demonstrate that our framework offers significant improvements over existing methods.

preprint, 2019, arXiv

Dynamics in two-dimensional glassy systems of crowded Penrose kites

We investigate the translational and rotational relaxation dynamics of a crowded two-dimensional system of monodisperse Penrose kites, in which crystallization, quasi-crystallization, and nematic ordering are suppressed, from low to high area fractions along the metastable ergodic fluid branch. First, we demonstrate a decoupling between both the translational and the rotational diffusion coefficients and the relaxation time: the diffusivities are not inversely proportional to the relaxation time, neither in the low-density normal liquid regime nor in the high-density supercooled regime. Our simulations reveal that this inverse proportionality breaks down in the normal liquid regime due to the Mermin-Wagner long-wavelength fluctuations and in the supercooled regime due to the dynamical heterogeneities. We then show that dynamical heterogeneities are mainly spatial for translational degrees of freedom and temporal for rotational ones, that there is no correlation between the particles with the largest translational and rotational displacements, and that different dynamical length scales characterize the translational and the rotational motion. Hence, although the translational and the rotational glass-transition densities coincide according to a mode-coupling fit, translations and rotations appear to decorrelate via different dynamical processes.

preprint, 2019, arXiv

Long-wavelength fluctuations and anomalous dynamics in two-dimensional liquids

Long-wavelength Mermin-Wagner fluctuations prevent the existence of translational long-range order in two-dimensional systems at finite temperature. Their dynamical signature, which is the divergence of the vibrational amplitude with the system size, also affects disordered solids and washes out the transient solid-like response generally exhibited by liquids cooled below their melting temperature. Through a combined numerical and experimental investigation, here we show that long-wavelength fluctuations are also relevant at high temperature, where the liquid dynamics does not reveal a transient solid-like response. In this regime, they induce an unusual but ubiquitous decoupling between the long-time diffusion coefficient $D$ and the structural relaxation time $τ$, where $D\propto τ^{-κ}$, with $κ> 1$. Long-wavelength fluctuations have a negligible influence on the relaxation dynamics only at extremely high temperatures, in molecular liquids, or extremely low densities, in colloidal systems.
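A power-law relation of the form $D\propto τ^{-κ}$ is a straight line in log-log space, so κ can be estimated as the negative slope of a least-squares fit of log D against log τ. The sketch below shows that fit; the function name and the synthetic data are illustrative assumptions, not values from the paper.

```python
import math


def fit_decoupling_exponent(taus, diffusivities):
    """Estimate kappa in D ~ tau**(-kappa) by least squares on log-log data."""
    xs = [math.log(t) for t in taus]
    ys = [math.log(d) for d in diffusivities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return -slope  # log D = -kappa * log tau + const


# Synthetic example: data generated with kappa = 1.3 (assumed, for illustration).
taus = [1.0, 10.0, 100.0, 1000.0]
diffusivities = [2.0 * t ** -1.3 for t in taus]
kappa = fit_decoupling_exponent(taus, diffusivities)
```

A fitted κ of 1 would correspond to the inverse proportionality between D and τ; values above 1 indicate the decoupling described in the abstract.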

preprint, 2016, arXiv

A Novel Framework for Online Amnesic Trajectory Compression in Resource-constrained Environments

State-of-the-art trajectory compression methods usually involve high space-time complexity or yield unsatisfactory compression rates, leading to rapid exhaustion of memory, computation, storage and energy resources. Their ability is commonly limited when operating in a resource-constrained environment especially when the data volume (even when compressed) far exceeds the storage limit. Hence we propose a novel online framework for error-bounded trajectory compression and ageing called the Amnesic Bounded Quadrant System (ABQS), whose core is the Bounded Quadrant System (BQS) algorithm family that includes a normal version (BQS), Fast version (FBQS), and a Progressive version (PBQS). ABQS intelligently manages a given storage and compresses the trajectories with different error tolerances subject to their ages. In the experiments, we conduct comprehensive evaluations for the BQS algorithm family and the ABQS framework. Using empirical GPS traces from flying foxes and cars, and synthetic data from simulation, we demonstrate the effectiveness of the standalone BQS algorithms in significantly reducing the time and space complexity of trajectory compression, while greatly improving the compression rates of the state-of-the-art algorithms (up to 45%). We also show that the operational time of the target resource-constrained hardware platform can be prolonged by up to 41%. We then verify that with ABQS, given data volumes that are far greater than storage space, ABQS is able to achieve 15 to 400 times smaller errors than the baselines. We also show that the algorithm is robust to extreme trajectory shapes.

preprint, 2016, arXiv

Determining the best attributes for surveillance video keywords generation

Automatic video keyword generation is one of the key ingredients in reducing the burden of security officers in analyzing surveillance videos. Keywords or attributes are generally chosen manually based on expert knowledge of surveillance. Most existing works primarily aim at either supervised learning approaches relying on extensive manual labelling or hierarchical probabilistic models that assume the features are extracted using the bag-of-words approach, thus limiting the utilization of other features. To address this, we turn our attention to automatic attribute discovery approaches. However, it is not clear which automatic discovery approach can discover the most meaningful attributes. Furthermore, little research has been done on how to compare and choose the best automatic attribute discovery methods. In this paper, we propose a novel approach, based on the shared structure exhibited amongst meaningful attributes, that enables us to compare different automatic attribute discovery approaches. We then validate our approach by comparing various attribute discovery methods such as PiCoDeS on two attribute datasets. The evaluation shows that our approach is able to select the automatic discovery approach that discovers the most meaningful attributes. We then employ the best discovery approach to generate keywords for videos recorded from a surveillance system. This work shows it is possible to massively reduce the amount of manual work in generating video keywords without limiting ourselves to a particular video feature descriptor.

preprint, 2016, arXiv

Effective dynamics of microorganisms that interact with their own trail

Like ants, some microorganisms are known to leave trails on surfaces to communicate. We explore how trail-mediated self-interaction could affect the behavior of individual microorganisms when diffusive spreading of the trail is negligible on the timescale of the microorganism using a simple phenomenological model for an actively moving particle and a finite-width trail. The effective dynamics of each microorganism takes on the form of a stochastic integral equation with the trail interaction appearing in the form of short-term memory. For moderate coupling strength below an emergent critical value, the dynamics exhibits effective diffusion in both orientation and position after a phase of superdiffusive reorientation. We report experimental verification of a seemingly counterintuitive perpendicular alignment mechanism that emerges from the model.

preprint, 2016, arXiv

High-temperature superconductivity in single-unit-cell FeSe films on anatase TiO2(001)

We report on the observation of high-temperature ($T_\textrm{c}$) superconductivity and magnetic vortices in single-unit-cell FeSe films on anatase TiO$_2$(001) substrate by using scanning tunneling microscopy. A systematic study and engineering of interfacial properties has clarified the essential roles of substrate in realizing the high-$T_\textrm{c}$ superconductivity, probably via interface-induced electron-phonon coupling enhancement and charge transfer. By visualizing and tuning the oxygen vacancies at the interface, we find their very limited effect on the superconductivity, which excludes interfacial oxygen vacancies as the primary source for charge transfer between the substrate and FeSe films. Our findings have placed severe constraints on any microscopic model for the high-$T_\textrm{c}$ superconductivity in FeSe-related heterostructures.

preprint, 2016, arXiv

Multicellular self-organization of P. aeruginosa due to interactions with secreted trails

Guided movement in response to slowly diffusing polymeric trails provides a unique mechanism for self-organization of some microorganisms. To elucidate how this signaling route leads to microcolony formation, we experimentally probe the trajectory and orientation of Pseudomonas aeruginosa that propel themselves on a surface using type IV pili motility appendages, which preferentially attach to deposited exopolysaccharides. We construct a stochastic model by analyzing single-bacterium trajectories, and show that the resulting theoretical prediction for the many-body behavior of the bacteria is in quantitative agreement with our experimental characterization of how cells explore the surface via a power law strategy.

preprint, 2016, arXiv

Understanding the spatiotemporal pattern of grazing cattle movement

In this study, we analyse a high-frequency movement dataset for a group of grazing cattle and investigate their spatiotemporal patterns using a simple two-state `stop-and-move' mobility model. We find that the dispersal kernel in the moving state is best described by a mixture exponential distribution, indicating the hierarchical nature of the movement. On the other hand, the waiting time appears to be scale-invariant below a certain cut-off and is best described by a truncated power-law distribution, suggesting heterogenous dynamics in the non-moving state. We explore possible explanations for the observed phenomena, covering factors that can play a role in the generation of mobility patterns, such as the context of grazing environment, the intrinsic decision-making mechanism or the energy status of different activities. In particular, we propose a new hypothesis that the underlying movement pattern can be attributed to the most probable observable energy status under the maximum entropy configuration. These results are not only valuable for modelling cattle movement but also provide new insights for understanding the underlying biological basis of grazing behaviour.

preprint, 2015, arXiv

Diffusion in Colocation Contact Networks: the Impact of Nodal Spatiotemporal Dynamics

Temporal contact networks are studied to understand dynamic spreading phenomena such as communicable diseases or information dissemination. To establish how spatiotemporal dynamics of nodes impact spreading potential in colocation contact networks, we propose "inducement-shuffling" null models which break one or more correlations between times, locations and nodes. By reconfiguring the time and/or location of each node's presence in the network, these models induce alternative sets of colocation events giving rise to contact networks with varying spreading potential. This enables second-order causal reasoning about how correlations in nodes' spatiotemporal preferences not only lead to a given contact network but ultimately influence the network's spreading potential. We find the correlation between nodes and times to be the greatest impediment to spreading, while the correlation between times and locations slightly catalyzes spreading. Under each of the presented null models we measure both the number of contacts and infection prevalence as a function of time, with the surprising finding that the two have no direct causality.

preprint, 2015, arXiv

Efficient Clustering on Riemannian Manifolds: A Kernelised Random Projection Approach

Reformulating computer vision problems over Riemannian manifolds has demonstrated superior performance in various computer vision applications. This is because visual data often forms a special structure lying on a lower dimensional space embedded in a higher dimensional space. However, since these manifolds belong to non-Euclidean topological spaces, exploiting their structures is computationally expensive, especially when one considers the clustering analysis of massive amounts of data. To this end, we propose an efficient framework to address the clustering problem on Riemannian manifolds. This framework implements random projections for manifold points via kernel space, which can preserve the geometric structure of the original space, but is computationally efficient. Here, we introduce three methods that follow our framework. We then validate our framework on several computer vision applications by comparing against popular clustering methods on Riemannian manifolds. Experimental results demonstrate that our framework maintains the performance of the clustering whilst massively reducing computational complexity by over two orders of magnitude in some cases.

preprint2015arXiv

Molecular Beam Epitaxy Growth of Superconducting LiFeAs Film on SrTiO3(001) Substrate

The stoichiometric "111" iron-based superconductor, LiFeAs, has attracted great research interest in recent years. For the first time, we have successfully grown LiFeAs thin film by molecular beam epitaxy (MBE) on SrTiO3(001) substrate, and studied the interfacial growth behavior by reflection high energy electron diffraction (RHEED) and low-temperature scanning tunneling microscopy (LT-STM). The effects of substrate temperature and Li/Fe flux ratio were investigated. Uniform LiFeAs film as thin as 3 quintuple-layer (QL) is formed. A superconducting gap appears in LiFeAs films thicker than 4 QL at 4.7 K. When the film is thicker than 13 QL, the superconducting gap determined by the distance between coherence peaks is about 7 meV, close to the value of bulk material. The ex situ transport measurement of thick LiFeAs film shows a sharp superconducting transition around 16 K. The upper critical field, Hc2(0)=13.0 T, is estimated from the temperature dependent magnetoresistance. The precise thickness and quality control of LiFeAs film paves the way for growing similar ultrathin iron arsenide films.

preprint2015arXiv

Temporal Embedding in Convolutional Neural Networks for Robust Learning of Abstract Snippets

The prediction of periodical time-series remains challenging due to various types of data distortions and misalignments. Here, we propose a novel model called Temporal embedding-enhanced convolutional neural Network (TeNet) to learn repeatedly-occurring-yet-hidden structural elements in periodical time-series, called abstract snippets, for predicting future changes. Our model uses convolutional neural networks and embeds a time-series with its potential neighbors in the temporal domain for aligning it to the dominant patterns in the dataset. The model is robust to distortions and misalignments in the temporal domain and demonstrates strong prediction power for periodical time-series. We conduct extensive experiments and discover that the proposed model shows significant and consistent advantages over existing methods on a variety of data modalities ranging from human mobility to household power consumption records. Empirical results indicate that the model is robust to various factors such as number of samples, variance of data, numerical ranges of data etc. The experiments also verify that the intuition behind the model can be generalized to multiple data types and applications and promises significant improvement in prediction performances across the datasets studied.

preprint2015arXiv

Understanding Human Mobility from Twitter

Understanding human mobility is crucial for a broad range of applications from disease prediction to communication networks. Most efforts on studying human mobility have so far used private and low resolution data, such as call data records. Here, we propose Twitter as a proxy for human mobility, as it relies on publicly available data and provides high resolution positioning when users opt to geotag their tweets with their current location. We analyse a Twitter dataset with more than six million geotagged tweets posted in Australia, and we demonstrate that Twitter can be a reliable source for studying human mobility patterns. Our analysis shows that geotagged tweets can capture rich features of human mobility, such as the diversity of movement orbits among individuals and of movements within and between cities. We also find that short and long-distance movers both spend most of their time in large metropolitan areas, in contrast with intermediate-distance movers, reflecting the impact of different modes of travel. Our study provides solid evidence that Twitter can indeed be a useful proxy for tracking and predicting human movement.

preprint2014arXiv

A Dynamic Network Formation Model for Understanding Bacterial Self-Organization into Micro-Colonies

We propose a general parametrizable model to capture the dynamic interaction among bacteria in the formation of micro-colonies. Micro-colonies represent the first social step towards the formation of structured multicellular communities known as bacterial biofilms, which protect the bacteria against antimicrobials. In our model, bacteria can form links in the form of intercellular adhesins (such as polysaccharides) to collaborate in the production of resources that are fundamental to protect them against antimicrobials. Since maintaining a link can be costly, we assume that each bacterium forms and maintains a link only if the benefit received from the link is larger than the cost, and we formalize the interaction among bacteria as a dynamic network formation game. We rigorously characterize some of the key properties of the network evolution depending on the parameters of the system. In particular, we derive the parameters under which it is guaranteed that all bacteria will join micro-colonies and the parameters under which it is guaranteed that some bacteria will not join micro-colonies. Importantly, our study not only characterizes the properties of networks emerging in equilibrium, but also provides important insights on how the network dynamically evolves and on how the formation history impacts the emerging networks in equilibrium. This analysis can be used to develop methods to influence on-the-fly the evolution of the network, and such methods can be useful to treat or prevent biofilm-related diseases.

preprint2014arXiv

Bounded Quadrant System: Error-bounded Trajectory Compression on the Go

Long-term location tracking, where trajectory compression is commonly used, has gained high interest for many applications in transport, ecology, and wearable computing. However, state-of-the-art compression methods involve high space-time complexity or achieve unsatisfactory compression rate, leading to rapid exhaustion of memory, computation, storage and energy resources. We propose a novel online algorithm for error-bounded trajectory compression called the Bounded Quadrant System (BQS), which compresses trajectories with extremely small costs in space and time using convex-hulls. In this algorithm, we build a virtual coordinate system centered at a start point, and establish a rectangular bounding box as well as two bounding lines in each of its quadrants. In each quadrant, the points to be assessed are bounded by the convex-hull formed by the box and lines. Various compression error-bounds are therefore derived to quickly draw compression decisions without expensive error computations. In addition, we also propose a light version of the BQS that achieves $\mathcal{O}(1)$ complexity in both time and space for processing each point to suit the most constrained computation environments. Furthermore, we briefly demonstrate how this algorithm can be naturally extended to the 3-D case. Using empirical GPS traces from flying foxes, cars and simulation, we demonstrate the effectiveness of our algorithm in significantly reducing the time and space complexity of trajectory compression, while greatly improving the compression rates of the state-of-the-art algorithms (up to 47%). We then show that with this algorithm, the operational time of the target resource-constrained hardware platform can be prolonged by up to 41%.

preprint2014arXiv

Multi-scale Population and Mobility Estimation with Geo-tagged Tweets

Recent outbreaks of Ebola and Dengue viruses have again elevated the significance of the capability to quickly predict disease spread in an emergent situation. However, existing approaches usually rely heavily on time-consuming census processes or privacy-sensitive call logs, which makes them unresponsive to the abruptly changing dynamics of an outbreak. In this paper we study the feasibility of using large-scale Twitter data as a proxy of human mobility to model and predict disease spread. We report that for Australia, Twitter users' distribution correlates well with the census-based population distribution, and that the Twitter users' travel patterns appear to loosely follow the gravity law at multiple scales of geographic distances, i.e. national level, state level and metropolitan level. The radiation model is also evaluated on this dataset though it has shown inferior fitness as a result of Australia's sparse population and large landmass. The outcomes of the study form the cornerstones for future work towards a model-based, responsive prediction method from Twitter data for disease spread.
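
For readers unfamiliar with the gravity law mentioned in this abstract, the following is a minimal sketch of its commonly used textbook form, in which the expected flow between two places grows with their populations and decays with distance. The function name `gravity_flow`, the constant `k` and the exponents `alpha`, `beta` and `gamma` are illustrative assumptions; the abstract does not state the fitted form or values used in the paper.

```python
def gravity_flow(pop_i, pop_j, distance, k=1.0, alpha=1.0, beta=1.0, gamma=2.0):
    """Expected flow between two places under a generic gravity law.

    T_ij = k * pop_i**alpha * pop_j**beta / distance**gamma

    All parameter defaults are placeholders for illustration, not values
    taken from the paper.
    """
    if distance <= 0:
        raise ValueError("distance must be positive")
    return k * (pop_i ** alpha) * (pop_j ** beta) / (distance ** gamma)


# With gamma = 2, doubling the distance cuts the expected flow to a quarter.
near = gravity_flow(100, 200, 10)
far = gravity_flow(100, 200, 20)
```

In practice the exponents would be fitted to observed trips at each geographic scale rather than fixed in advance.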

preprint2014arXiv

Optimal Lévy-flight foraging in a finite landscape

We present a simple model to study Lévy-flight foraging in a finite landscape with countable targets. In our approach, foraging is a step-based exploratory random search process with a power-law step-size distribution $P(l) \propto l^{-\mu}$. We find that, when the termination is regulated by a finite number of steps $N$, the optimum value of $\mu$ that maximises the foraging efficiency can vary substantially in the interval $\mu \in (1,3)$, depending on the landscape features (landscape size and number of targets). We further demonstrate that subjective returning can be another significant factor that affects the foraging efficiency in such context. Our results suggest that Lévy-flight foraging may arise through an interaction between the environmental context and the termination of exploitation, and particularly that the number of steps can play an important role in this scenario which is overlooked by most previous work. Our study not only provides a new perspective on Lévy-flight foraging, but also opens new avenues for investigating the interaction between foraging dynamics and environment as well as offers a realistic framework for analysing animal movement patterns from empirical data.
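
As an illustration of the power-law step-size distribution in this abstract, the sketch below draws step lengths by inverse-transform sampling. It assumes a lower cutoff `l_min` so that the distribution can be normalised for exponents greater than 1; the cutoff, the function names and the seed handling are assumptions made for this example and are not taken from the paper.

```python
import random


def sample_levy_step(mu, l_min=1.0, rng=random):
    """Draw one step length l >= l_min with density proportional to l**(-mu).

    Uses the inverse CDF of a Pareto-type law, which requires mu > 1.
    """
    if mu <= 1.0:
        raise ValueError("mu must exceed 1 for a normalisable distribution")
    u = rng.random()  # uniform in [0, 1)
    return l_min * (1.0 - u) ** (-1.0 / (mu - 1.0))


def forage_steps(mu, n_steps, l_min=1.0, seed=0):
    """Return n_steps step lengths, i.e. a search that terminates after N steps."""
    rng = random.Random(seed)
    return [sample_levy_step(mu, l_min, rng) for _ in range(n_steps)]


steps = forage_steps(mu=2.0, n_steps=1000)
```

Smaller exponents produce heavier tails, so occasional very long steps become more frequent as the exponent approaches 1.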

preprint2013arXiv

Broadband Isolated Attosecond Pulses: Generation, Characterization, and Applications

The generation of the shortest isolated attosecond pulses requires both broad spectral bandwidth and control of the spectral phase. Rapid progress has been made in both aspects, leading to the generation of the world-record-shortest 67 as light pulses in 2012, and broadband attosecond continua covering a wide range of extreme ultraviolet and soft x-ray wavelengths. Such pulses have been successfully applied in photoelectron/photoion spectroscopy and the recently developed attosecond transient absorption spectroscopy to study electron dynamics in matter. In this Review, we discuss the significant recent advancement in the generation, characterization, and application of ultrabroadband isolated attosecond pulses with spectral bandwidth comparable to the central frequency, which can in principle be compressed to a single optical cycle.

preprint2013arXiv

Communicating Is Crowdsourcing: Wi-Fi Indoor Localization with CSI-based Speed Estimation

Numerous indoor localization techniques have been proposed recently to meet the intensive demand for location based service, and Wi-Fi fingerprint-based approaches are the most popular and inexpensive solutions. Among them, one of the main trends is to incorporate the built-in sensors of smartphone and to exploit crowdsourcing potentials. However the noisy built-in sensors and multi-tasking limitation of the underlying OS often hinder the effectiveness of these schemes. In this work, we propose a passive crowdsourcing CSI-based indoor localization scheme, C2 IL. Our scheme C2 IL only requires the locating-device (e.g., a phone) to have an 802.11n wireless connection, and it does not rely on inertial sensors only existing in some smartphones. C2 IL is built upon our innovative method to accurately estimate the moving distance purely based on 802.11n Channel State Information (CSI). Our extensive evaluations show that the moving distance estimation error of our scheme is within 3% of the actual moving distance regardless of varying speeds and environment. Relying on the accurate moving distance estimation as constraints, we are able to construct a more accurate mapping between RSS fingerprints and location. To address the challenges of collecting fingerprints, a crowdsourcing-based scheme is designed to gradually establish the mapping and populate the fingerprints. In C2 IL, we design a trajectory clustering-based localization algorithm to provide precise real-time indoor localization and tracking. We developed and deployed a practical working system of C2 IL in a large office environment. Extensive evaluation results indicate that our scheme C2 IL provides accurate localization with error 2m at 80% in a very complex indoor environment with minimal overhead.

preprint2013arXiv

Connect and win: The role of social networks in political elections

Many networks do not live in isolation but are strongly interacting, with profound consequences on their dynamics. Here, we consider the case of two interacting social networks and, in the context of a simple model, we address the case of political elections. Each network represents a competing party and every agent on the election day can choose to be either active in one of the two networks (vote for the corresponding party) or to be inactive in both (not vote). The opinion dynamics during the election campaign is described through a simulated annealing algorithm. We find that for a large region of the parameter space the result of the competition between the two parties allows for the existence of pluralism in the society, where both parties have a finite share of the votes. The central result is that a densely connected social network is key for the final victory of a party. However, small committed minorities can play a crucial role, and even reverse the election outcome.

preprint2013arXiv

Models, Entropy and Information of Temporal Social Networks

Temporal social networks are characterized by heterogeneous duration of contacts, which can either follow a power-law distribution, such as in face-to-face interactions, or a Weibull distribution, such as in mobile-phone communication. Here we model the dynamics of face-to-face interaction and mobile phone communication by a reinforcement dynamics, which explains the data observed in these different types of social interactions. We quantify the information encoded in the dynamics of these networks by the entropy of temporal networks. Finally, we show evidence that human dynamics is able to modulate the information present in social network dynamics when it follows circadian rhythms and when it is interfacing with a new technology such as the mobile-phone communication technology.

preprint2013arXiv

Percolation on interacting, antagonistic networks

Recently, new results on percolation of interdependent networks have shown that the percolation transition can be first order. In this paper we show that, when considering antagonistic interactions between interacting networks, the percolation process might present a bistability of the equilibrium solution. To this end, we introduce antagonistic interactions for which the functionality, or activity, of a node in a network is incompatible with the functionality of the linked nodes in the other interacting networks. In particular, we study the percolation transition in two interacting networks with purely antagonistic interaction and different topology. For two antagonistic Poisson networks of different average degree we found a large region in the phase diagram in which there is a bistability of the steady state solutions of the percolation process, i.e. we can find that either one of the two networks might percolate. For two antagonistic scale-free networks we found that there is a region in the phase diagram in which, despite the antagonistic interactions, both networks are percolating. Finally we characterize the rich phase diagram of the percolation problems on two antagonistic networks, the first one of the two being a Poisson network and the second one being a scale-free network.

preprint2013arXiv

Percolation on interdependent networks with a fraction of antagonistic interactions

Recently, the percolation transition has been characterized on interacting networks both in presence of interdependent and antagonistic interactions. Here we characterize the phase diagram of the percolation transition in two Poisson interdependent networks with a percentage q of antagonistic nodes. We show that this system can present a bistability of the steady state solutions, and both first- and second-order phase transitions. In particular, we observe a bistability of the solutions in some regions of the phase space also for a small fraction of antagonistic interactions 0<q<0.4. Moreover, we show that a fraction q>q_c=2/3 of antagonistic interactions is necessary to strongly reduce the region in phase-space in which both networks are percolating. This last result suggests that interdependent networks are robust to the presence of antagonistic interactions. Our approach can be extended to multiple networks, and to complex boolean rules for regulating the percolation phase transition.

preprint2012arXiv

Analysis of a Mixture Model of Tumor Growth

We study an initial-boundary value problem (IBVP) for a coupled Cahn-Hilliard-Hele-Shaw system that models tumor growth. For large initial data with finite energy, we prove global (local resp.) existence, uniqueness, higher order spatial regularity and Gevrey spatial regularity of strong solutions to the IBVP in 2D (3D resp.). Asymptotically in time, we show that the solution converges to a constant state exponentially fast as time tends to infinity under certain assumptions.

preprint2012arXiv

Entropy of dynamical social networks

Human dynamical social networks encode information and are highly adaptive. To characterize the information encoded in the fast dynamics of social interactions, here we introduce the entropy of dynamical social networks. By analysing a large dataset of phone-call interactions we show evidence that the dynamical social network has an entropy that depends on the time of the day in a typical week-day. Moreover we show evidence for adaptability of human social behavior showing data on duration of phone-call interactions that significantly deviates from the statistics of duration of face-to-face interactions. This adaptability of behavior corresponds to a different information content of the dynamics of social human interactions. We quantify this information by the use of the entropy of dynamical networks on realistic models of social interactions.

preprint2011arXiv

Entropy rate of non-equilibrium growing networks

New entropy measures have been recently introduced for the quantification of the complexity of networks. Most of these entropy measures apply to static networks or to dynamical processes defined on static complex networks. In this paper we define the entropy rate of growing network models. This entropy rate quantifies how many labeled networks are typically generated by the growing network models. We analytically evaluate the difference between the entropy rate of growing tree network models and the entropy of tree networks that have the same asymptotic degree distribution. We find that the growing networks with linear preferential attachment generated by dynamical models are exponentially less than the static networks with the same degree distribution for a large variety of relevant growing network models. We study the entropy rate for growing network models showing structural phase transitions including models with non-linear preferential attachment. Finally, we bring numerical evidence that the entropy rate above and below the structural phase transitions follow a different scaling with the network size.

preprint2011arXiv

Social network dynamics of face-to-face interactions

The recent availability of data describing social networks is changing our understanding of the "microscopic structure" of a social tie. A social tie indeed is an aggregated outcome of many social interactions such as face-to-face conversations or phone-calls. Analysis of data on face-to-face interactions shows that such events, as many other human activities, are bursty, with very heterogeneous durations. In this paper we present a model for social interactions at short time scales, aimed at describing contexts such as conference venues in which individuals interact in small groups. We present a detailed analytical and numerical study of the model's dynamical properties, and show that it reproduces important features of empirical data. The model allows for many generalizations toward an increasingly realistic description of social interactions. In particular in this paper we investigate the case where the agents have intrinsic heterogeneities in their social behavior, or where dynamic variations of the local number of individuals are included. Finally we propose this model as a very flexible framework to investigate how dynamical processes unfold in social networks.

42 published item(s)

Aligning Findings with Diagnosis: A Self-Consistent Reinforcement Learning Framework for Trustworthy Radiology Reporting

GRAPHMOE: Amplifying Cognitive Depth of Mixture-of-Experts Network via Introducing Self-Rethinking Mechanism

Credible Remote Sensing Scene Classification Using Evidential Fusion on Aerial-Ground Dual-view Images

Semantic Interleaving Global Channel Attention for Multilabel Remote Sensing Image Classification

Context-Aware Scene Prediction Network (CASPNet)

Few-Shot Class-Incremental Learning from an Open-Set Perspective

Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis

Genetic Algorithm based hyper-parameters optimization for transfer Convolutional Neural Network

Inorganic photovoltaic cells based on BiFeO3: spontaneous polarization, lattice matching, light polarization and their relationship to photovoltaic performance

Large Time Behavior and Diffusion Limit for a System of Balance Laws From Chemotaxis in Multi-dimensions

Learning to Estimate Driver Drowsiness from Car Acceleration Sensors using Weakly Labeled Data

SID: Incremental Learning for Anchor-Free Object Detection via Selective and Inter-Related Distillation

SOS: Selective Objective Switch for Rapid Immunofluorescence Whole Slide Image Classification

The existence of optimal control for continuous-time Markov decision processes in random environments

CORAL8: Concurrent Object Regression for Area Localization in Medical Image Panels

Dynamics in two-dimensional glassy systems of crowded Penrose kites

Long-wavelength fluctuations and anomalous dynamics in two-dimensional liquids

A Novel Framework for Online Amnesic Trajectory Compression in Resource-constrained Environments

Determining the best attributes for surveillance video keywords generation

Effective dynamics of microorganisms that interact with their own trail

High-temperature superconductivity in single-unit-cell FeSe films on anatase TiO2(001)

Multicellular self-organization of P. aeruginosa due to interactions with secreted trails

Understanding the spatiotemporal pattern of grazing cattle movement

Diffusion in Colocation Contact Networks: the Impact of Nodal Spatiotemporal Dynamics

Efficient Clustering on Riemannian Manifolds: A Kernelised Random Projection Approach

Molecular Beam Epitaxy Growth of Superconducting LiFeAs Film on SrTiO3(001) Substrate

Temporal Embedding in Convolutional Neural Networks for Robust Learning of Abstract Snippets

Understanding Human Mobility from Twitter

A Dynamic Network Formation Model for Understanding Bacterial Self-Organization into Micro-Colonies

Bounded Quadrant System: Error-bounded Trajectory Compression on the Go

Multi-scale Population and Mobility Estimation with Geo-tagged Tweets

Optimal Lévy-flight foraging in a finite landscape

Broadband Isolated Attosecond Pulses: Generation, Characterization, and Applications

Communicating Is Crowdsourcing: Wi-Fi Indoor Localization with CSI-based Speed Estimation

Connect and win: The role of social networks in political elections

Models, Entropy and Information of Temporal Social Networks

Percolation on interacting, antagonistic networks

Percolation on interdependent networks with a fraction of antagonistic interactions

Analysis of a Mixture Model of Tumor Growth

Entropy of dynamical social networks

Entropy rate of non-equilibrium growing networks

Social network dynamics of face-to-face interactions