Source author record

Biao Li

Biao Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

astro-ph.GA Computer Vision eess.IV Information Retrieval Machine Learning Artificial Intelligence astro-ph.CO cond-mat.quant-gas cond-mat.stat-mech Human-Computer Interaction Information Theory math.IT nlin.SI Quantitative Methods

Catalog footprint

What is connected

15works

14topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

A Dual Neighborhood Hypergraph Neural Network for Change Detection in VHR Remote Sensing Images

The very high spatial resolution (VHR) remote sensing images have been an extremely valuable source for monitoring changes occurred on the earth surface. However, precisely detecting relevant changes in VHR images still remains a challenge, due to the complexity of the relationships among ground objects. To address this limitation, a dual neighborhood hypergraph neural network is proposed in this article, which combines the multiscale superpixel segmentation and hypergraph convolution to model and exploit the complex relationships. First, the bi-temporal image pairs are segmented under two scales and fed to a pre-trained U-net to obtain node features by treating each object under the fine scale as a node. The dual neighborhood is then defined using the father-child and adjacent relationships of the segmented objects to construct the hypergraph, which permits models to represent the higher-order structured information far more complex than just pairwise relationships. The hypergraph convolutions are conducted on the constructed hypergraph to propagate the label information from a small amount of labeled nodes to the other unlabeled ones by the node-edge-node transform. Moreover, to alleviate the problem of imbalanced sample, the focal loss function is adopted to train the hypergraph neural network. The experimental results on optical, SAR and heterogeneous optical/SAR data sets demonstrate that the proposed method comprises better effectiveness and robustness compared to many state-of-the-art methods.

preprint2022arXiv

KuaiRand: An Unbiased Sequential Recommendation Dataset with Randomly Exposed Videos

Recommender systems deployed in real-world applications can have inherent exposure bias, which leads to the biased logged data plaguing the researchers. A fundamental way to address this thorny problem is to collect users' interactions on randomly expose items, i.e., the missing-at-random data. A few works have asked certain users to rate or select randomly recommended items, e.g., Yahoo!, Coat, and OpenBandit. However, these datasets are either too small in size or lack key information, such as unique user ID or the features of users/items. In this work, we present KuaiRand, an unbiased sequential recommendation dataset containing millions of intervened interactions on randomly exposed videos, collected from the video-sharing mobile App, Kuaishou. Different from existing datasets, KuaiRand records 12 kinds of user feedback signals (e.g., click, like, and view time) on randomly exposed videos inserted in the recommendation feeds in two weeks. To facilitate model learning, we further collect rich features of users and items as well as users' behavior history. By releasing this dataset, we enable the research of advanced debiasing large-scale recommendation scenarios for the first time. Also, with its distinctive features, KuaiRand can support various other research directions such as interactive recommendation, long sequential behavior modeling, and multi-task learning. The dataset and its news will be available at https://kuairand.com.

preprint2022arXiv

KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender Systems

The progress of recommender systems is hampered mainly by evaluation as it requires real-time interactions between humans and systems, which is too laborious and expensive. This issue is usually approached by utilizing the interaction history to conduct offline evaluation. However, existing datasets of user-item interactions are partially observed, leaving it unclear how and to what extent the missing interactions will influence the evaluation. To answer this question, we collect a fully-observed dataset from Kuaishou's online environment, where almost all 1,411 users have been exposed to all 3,327 items. To the best of our knowledge, this is the first real-world fully-observed data with millions of user-item interactions. With this unique dataset, we conduct a preliminary analysis of how the two factors - data density and exposure bias - affect the evaluation results of multi-round conversational recommendation. Our main discoveries are that the performance ranking of different methods varies with the two factors, and this effect can only be alleviated in certain cases by estimating missing interactions for user simulation. This demonstrates the necessity of the fully-observed dataset. We release the dataset and the pipeline implementation for evaluation at https://kuairec.com

preprint2022arXiv

LBCF: A Large-Scale Budget-Constrained Causal Forest Algorithm

Offering incentives (e.g., coupons at Amazon, discounts at Uber and video bonuses at Tiktok) to user is a common strategy used by online platforms to increase user engagement and platform revenue. Despite its proven effectiveness, these marketing incentives incur an inevitable cost and might result in a low ROI (Return on Investment) if not used properly. On the other hand, different users respond differently to these incentives, for instance, some users never buy certain products without coupons, while others do anyway. Thus, how to select the right amount of incentives (i.e. treatment) to each user under budget constraints is an important research problem with great practical implications. In this paper, we call such problem as a budget-constrained treatment selection (BTS) problem. The challenge is how to efficiently solve BTS problem on a Large-Scale dataset and achieve improved results over the existing techniques. We propose a novel tree-based treatment selection technique under budget constraints, called Large-Scale Budget-Constrained Causal Forest (LBCF) algorithm, which is also an efficient treatment selection algorithm suitable for modern distributed computing systems. A novel offline evaluation method is also proposed to overcome an intrinsic challenge in assessing solutions' performance for BTS problem in randomized control trials (RCT) data. We deploy our approach in a real-world scenario on a large-scale video platform, where the platform gives away bonuses in order to increase users' campaign engagement duration. The simulation analysis, offline and online experiments all show that our method outperforms various tree-based state-of-the-art baselines. The proposed approach is currently serving over hundreds of millions of users on the platform and achieves one of the most tremendous improvements over these months.

preprint2021arXiv

A Multiscale Graph Convolutional Network for Change Detection in Homogeneous and Heterogeneous Remote Sensing Images

Change detection (CD) in remote sensing images has been an ever-expanding area of research. To date, although many methods have been proposed using various techniques, accurately identifying changes is still a great challenge, especially in the high resolution or heterogeneous situations, due to the difficulties in effectively modeling the features from ground objects with different patterns. In this paper, a novel CD method based on the graph convolutional network (GCN) and multiscale object-based technique is proposed for both homogeneous and heterogeneous images. First, the object-wise high level features are obtained through a pre-trained U-net and the multiscale segmentations. Treating each parcel as a node, the graph representations can be formed and then, fed into the proposed multiscale graph convolutional network with each channel corresponding to one scale. The multiscale GCN propagates the label information from a small number of labeled nodes to the other ones which are unlabeled. Further, to comprehensively incorporate the information from the output channels of multiscale GCN, a fusion strategy is designed using the father-child relationships between scales. Extensive Experiments on optical, SAR and heterogeneous optical/SAR data sets demonstrate that the proposed method outperforms some state-of the-art methods in both qualitative and quantitative evaluations. Besides, the Influences of some factors are also discussed.

preprint2020arXiv

Axiom-based Grad-CAM: Towards Accurate Visualization and Explanation of CNNs

To have a better understanding and usage of Convolution Neural Networks (CNNs), the visualization and interpretation of CNNs has attracted increasing attention in recent years. In particular, several Class Activation Mapping (CAM) methods have been proposed to discover the connection between CNN's decision and image regions. In spite of the reasonable visualization, lack of clear and sufficient theoretical support is the main limitation of these methods. In this paper, we introduce two axioms -- Conservation and Sensitivity -- to the visualization paradigm of the CAM methods. Meanwhile, a dedicated Axiom-based Grad-CAM (XGrad-CAM) is proposed to satisfy these axioms as much as possible. Experiments demonstrate that XGrad-CAM is an enhanced version of Grad-CAM in terms of conservation and sensitivity. It is able to achieve better visualization performance than Grad-CAM, while also be class-discriminative and easy-to-implement compared with Grad-CAM++ and Ablation-CAM. The code is available at https://github.com/Fu0511/XGrad-CAM.

preprint2018arXiv

Tensor Alignment Based Domain Adaptation for Hyperspectral Image Classification

This paper presents a tensor alignment (TA) based domain adaptation method for hyperspectral image (HSI) classification. To be specific, HSIs in both domains are first segmented into superpixels and tensors of both domains are constructed to include neighboring samples from single superpixel. Then we consider the subspace invariance between two domains as projection matrices and original tensors are projected as core tensors with lower dimensions into the invariant tensor subspace by applying Tucker decomposition. To preserve geometric information in original tensors, we employ a manifold regularization term for core tensors into the decomposition progress. The projection matrices and core tensors are solved in an alternating optimization manner and the convergence of TA algorithm is analyzed. In addition, a post-processing strategy is defined via pure samples extraction for each superpixel to further improve classification performance. Experimental results on four real HSIs demonstrate that the proposed method can achieve better performance compared with the state-of-the-art subspace learning methods when a limited amount of source labeled samples are available.

preprint2016arXiv

An expanded evaluation of protein function prediction methods shows an improvement in accuracy

Background: The increasing volume and variety of genotypic and phenotypic data is a major defining characteristic of modern biomedical sciences. At the same time, the limitations in technology for generating data and the inherently stochastic nature of biomolecular events have led to the discrepancy between the volume of data and the amount of knowledge gleaned from it. A major bottleneck in our ability to understand the molecular underpinnings of life is the assignment of function to biological macromolecules, especially proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, accurately assessing methods for protein function prediction and tracking progress in the field remain challenging. Methodology: We have conducted the second Critical Assessment of Functional Annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. One hundred twenty-six methods from 56 research groups were evaluated for their ability to predict biological functions using the Gene Ontology and gene-disease associations using the Human Phenotype Ontology on a set of 3,681 proteins from 18 species. CAFA2 featured significantly expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis also compared the best methods participating in CAFA1 to those of CAFA2. Conclusions: The top performing methods in CAFA2 outperformed the best methods from CAFA1, demonstrating that computational function prediction is improving. This increased accuracy can be attributed to the combined effect of the growing number of experimental annotations and improved methods for function prediction.

preprint2016arXiv

The Next Generation Virgo Cluster Survey XVI. The Angular Momentum of Dwarf Early-Type Galaxies from Globular Cluster Satellites

We analyze the kinematics of six Virgo cluster dwarf early-type galaxies (dEs) from their globular cluster (GC) systems. We present new Keck/DEIMOS spectroscopy for three of them and reanalyze the data found in the literature for the remaining three. We use two independent methods to estimate the rotation amplitude (Vmax) and velocity dispersion (sigma_GC) of the GC systems and evaluate their statistical significance by simulating non-rotating GC systems with the same number of GC satellites and velocity uncertainties. Our measured kinematics agree with the published values for the three galaxies from the literature and, in all cases, some rotation is measured. However, our simulations show that the null hypothesis of being non-rotating GC systems cannot be ruled out. In the case of VCC1861, the measured Vmax and the simulations indicate that it is not rotating. In the case of VCC1528, the null hypothesis can be marginally ruled out, thus, it might be rotating although further confirmation is needed. In our analysis, we find that, in general, the measured Vmax tends to be overestimated and the measured sigma_GC tends to be underestimated by amounts that depend on the intrinsic Vmax/sigma_GC, the number of observed GCs (N_GC), and the velocity uncertainties. The bias is negligible when N_GC>~20. In those cases where a large N_GC is not available, it is imperative to obtain data with small velocity uncertainties. For instance, errors of <2km/s lead to Vmax<10km/s for a system that is intrinsically not rotating.

preprint2015arXiv

A Gemini/GMOS Study of Intermediate Luminosity Early-Type Virgo Cluster Galaxies. I. Globular Cluster and Stellar Kinematics

We present a kinematic analysis of the globular cluster systems and diffuse stellar light of four intermediate luminosity (sub-$L^{\ast}$) early-type galaxies in the Virgo cluster based on Gemini/GMOS data. Our galaxy sample is fainter ($-23.8<M_K<-22.7$) than most previous studies, nearly doubling the number of galaxies in this magnitude range that now have GC kinematics. The data for the diffuse light extends to $4R_e$, and the data for the globular clusters reaches 8--$12R_e$. We find that the kinematics in these outer regions are all different despite the fact that these four galaxies have similar photometric properties, and are uniformly classified as "fast rotators" from their stellar kinematics within $1R_e$. The globular cluster systems exhibit a wide range of kinematic morphology. The rotation axis and amplitude can change between the inner and outer regions, including a case of counter-rotation. This difference shows the importance of wide-field kinematic studies, and shows that stellar and GC kinematics can change significantly as one moves beyond the inner regions of galaxies. Moreover, the kinematics of the globular cluster systems can differ from that of the stars, suggesting that the formation of the two populations are also distinct.

preprint2015arXiv

The Next Generation Virgo Cluster Survey. VI. The Kinematics of Ultra-compact Dwarfs and Globular Clusters in M87

The origin of ultra-compact dwarfs (UCDs)--objects larger and more massive than typical globular clusters (GCs), but more compact than typical dwarf galaxies--has been hotly debated in the 15 years since their discovery. Even whether UCDs should be considered galactic in origin, or simply the most extreme GCs, is not yet settled. We present the dynamical properties of 97 spectroscopically confirmed UCDs (rh >~10 pc) and 911 GCs associated with central cD galaxy of the Virgo cluster, M87. Our UCDs, of which 89% have M_star > ~2X10^6 M_sun and 92% are as blue as the classic blue GCs, nearly triple the sample of previous confirmed Virgo UCDs, providing by far the best opportunity for studying the global dynamics of a UCD system. We found that (1) UCDs have a surface number density profile that is shallower than that of the blue GCs in the inner ~ 70 kpc and as steep as that of the red GCs at larger radii; (2) UCDs exhibit a significantly stronger rotation than the GCs, and the blue GCs seem to have a velocity field that is more consistent with that of the surrounding dwarf ellipticals than with that of UCDs; (3) UCDs have a radially increasing orbital anisotropy profile, and are tangentially-biased at radii < ~ 40 kpc and radially-biased further out. In contrast, the blue GCs become more tangentially-biased at larger radii beyond ~ 40 kpc; (4) GCs with M_star > 2X10^6 M_sun have rotational properties indistinguishable from the less massive ones, suggesting that it is the size, instead of mass, that differentiates UCDs from GCs as kinematically distinct populations. We conclude that most UCDs in M87 are not consistent with being merely the most luminous and extended examples of otherwise normal GCs. The radially-biased orbital structure of UCDs at large radii is in general agreement with the "tidally threshed dwarf galaxy" scenario.

preprint2014arXiv

The Next Generation Virgo cluster Survey. V. modelling the dynamics of M87 with the Made-to-Measure method

We study the dynamics of the giant elliptical galaxy M87 from the central to the outermost regions with the made-to-measure (M2M) method. We use a new catalogue of 922 globular cluster line-of- sight velocities extending to a projected radius of 180 kpc (equivalent to 25 M87 effective radii), and SAURON integral field unit data within the central 2.4 kpc. 263 globular clusters, mainly located beyond 40 kpc, are newly observed by the Next Generation Virgo Survey (NGVS). For the M2M modelling, the gravitational potential is taken as a combination of a luminous matter potential with a constant stellar mass-to-light ratio and a dark matter potential modelled as a logarithmic potential. Our best dynamical model returns a stellar mass-to-light ratio in the I band of M/LI = 6.0(+ -0.3) M_sun/L_sun with a dark matter potential scale velocity of 591(+ -50) km/s and scale radius of 42(+ -10) kpc. We determine the total mass of M87 within 180 kpc to be (1.5 + - 0.2) 10^13 M_sun. The mass within 40 kpc is smaller than previous estimates determined using globular cluster kinematics that did not extend beyond 45 kpc. With our new globular cluster velocities at much larger radii, we see that globular clusters around 40 kpc show an anomalously large velocity dispersion which affected previous results. The mass we derived is in good agreement with that inferred from ROSAT X-ray observation out to 180 kpc. Within 30 kpc our mass is also consistent with that inferred from Chandra and XMM-Newton X-ray observations, while within 120 kpc it is about 20% smaller. The model velocity dispersion anisotropy beta parameter for the globular clusters in M87 is small, varying from -0.2 at the centre to 0.2 at 40 kpc, and gradually decreasing to zero at 120 kpc.

preprint2013arXiv

Some Numerical Results For Ito Equation

By the method of invariant manifold, we investigate the Ito equation numerically with high precision. By the numerical results, we can completely determine the form of analytic soliton solutions for the Ito equation. In fact, by the numerical data we have succeeded in deciding the analytic form of the $τ$-function, which is more general than the old assumptions. This may suggest that we should think more deeply about what the soliton is.

preprint2011arXiv

Exact dark state solutions of the coupled atomic-molecular Bose-Einstein condensates in an external potential

We consider a coupled nonlinear Schrödinger equations describing an atomic Bose-Einstein condensates coupled to a molecular condensates through the stimulated Raman adiabatic passage loaded in an external potential. The existence of dark state are investigated within the full parameter space accounts for all the nonlinear collisions, together with the atom-molecule conversion coupling and external potential. The results show that there are a class of external potentials such as double-well, periodical and double periodical, and harmonic potentials, where the exact dark solutions can be formed. Our results may raise the possibility of relative experiments for dark state in the coupled atomic-molecular Bose-Einstein condensates.

preprint2008arXiv

Critical properties of a dilute O($n$) model on the kagome lattice

A critical dilute O($n$) model on the kagome lattice is investigated analytically and numerically. We employ a number of exact equivalences which, in a few steps, link the critical O($n$) spin model on the kagome lattice to the exactly solvable critical $q$-state Potts model on the honeycomb lattice with $q=(n+1)^2$. The intermediate steps involve the random-cluster model on the honeycomb lattice, and a fully packed loop model with loop weight $n'=\sqrt{q}$ and a dilute loop model with loop weight $n$, both on the kagome lattice. This mapping enables the determination of a branch of critical points of the dilute O($n$) model, as well as some of its critical properties. For $n=0$, this model reproduces the known universal properties of the $θ$ point describing the collapse of a polymer. For $n\neq 0$ it displays a line of multicritical points, with the same universal properties as a branch of critical behavior that was found earlier in a dilute O($n$) model on the square lattice. These findings are supported by a finite-size-scaling analysis in combination with transfer-matrix calculations.

Biao Li

What is connected

Connect this record

See the researcher in context

Building this map preview

15 published item(s)

A Dual Neighborhood Hypergraph Neural Network for Change Detection in VHR Remote Sensing Images

KuaiRand: An Unbiased Sequential Recommendation Dataset with Randomly Exposed Videos

KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender Systems

LBCF: A Large-Scale Budget-Constrained Causal Forest Algorithm

A Multiscale Graph Convolutional Network for Change Detection in Homogeneous and Heterogeneous Remote Sensing Images

Axiom-based Grad-CAM: Towards Accurate Visualization and Explanation of CNNs

Tensor Alignment Based Domain Adaptation for Hyperspectral Image Classification

An expanded evaluation of protein function prediction methods shows an improvement in accuracy

The Next Generation Virgo Cluster Survey XVI. The Angular Momentum of Dwarf Early-Type Galaxies from Globular Cluster Satellites

A Gemini/GMOS Study of Intermediate Luminosity Early-Type Virgo Cluster Galaxies. I. Globular Cluster and Stellar Kinematics

The Next Generation Virgo Cluster Survey. VI. The Kinematics of Ultra-compact Dwarfs and Globular Clusters in M87

The Next Generation Virgo cluster Survey. V. modelling the dynamics of M87 with the Made-to-Measure method

Some Numerical Results For Ito Equation

Exact dark state solutions of the coupled atomic-molecular Bose-Einstein condensates in an external potential

Critical properties of a dilute O($n$) model on the kagome lattice