Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
19works
0followers
19topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

19 published item(s)

preprint2026arXiv

When 2D Tasks Meet 1D Serialization: On Serialization Friction in Structured Tasks

Large language models (LLMs) conventionally process structured inputs as 1D token sequences. While natural for prose, such linearization may introduce additional representational burden for tasks whose computation depends directly on explicit 2D structure, because row--column alignment and local neighborhoods are no longer directly expressed in the input. We study this setting, which we refer to as serialization friction, on a small diagnostic testbed of synthetic tasks with explicit 2D structure: matrix transpose, Conway's Game of Life, and LU decomposition. To examine this question, we compare a text-only language pathway over serialized inputs with a vision-augmented pathway, built on the same language backbone, that receives the same underlying content rendered in task-faithful 2D layout, yielding a system-level comparison between two end-to-end input pathways. Across the tasks and settings we study, the visual pathway consistently outperforms the textual pathway; the gap often widens at larger dimensions, and error patterns under serialization become increasingly spatially structured. These findings indicate that the relationship between input representation and model performance on such tasks warrants further investigation, and suggest that preserving task-relevant 2D layout is a promising direction for structured 2D tasks.

preprint2022arXiv

Bessel Vortices in Spin-Orbit Coupled Spin-1 Bose-Einstein Condensates

We investigate the stationary vortex solutions in two-dimensional (2D) Rashba spin-orbit (SO) coupled spin-1 Bose-Einstein condensate (BEC). By introducing the generalized momentum operator, the linear version of the system can be solved exactly and its solutions are a set of the Bessel vortices. Based on the linear version solutions, the stationary vortex solutions of the full nonlinear system are constructed and determined entirely by the variational approximation. The results show that the variational results are in good agreement with the numerical ones. By means of the variational results, the vortex ground state phase-transition between the stationary vortex solutions, stability, and the unit Bloch vector textures are discussed in detail. The results have the potential to be realized in experiment.

preprint2022arXiv

Content-Augmented Feature Pyramid Network with Light Linear Spatial Transformers for Object Detection

As one of the prevalent components, Feature Pyramid Network (FPN) is widely used in current object detection models for improving multi-scale object detection performance. However, its feature fusion mode is still in a misaligned and local manner, thus limiting the representation power. To address the inherit defects of FPN, a novel architecture termed Content-Augmented Feature Pyramid Network (CA-FPN) is proposed in this paper. Firstly, a Global Content Extraction Module (GCEM) is proposed to extract multi-scale context information. Secondly, lightweight linear spatial Transformer connections are added in the top-down pathway to augment each feature map with multi-scale features, where a linearized approximate self-attention function is designed for reducing model complexity. By means of the self-attention mechanism in Transformer, there is no longer need to align feature maps during feature fusion, thus solving the misaligned defect. By setting the query scope to the entire feature map, the local defect can also be solved. Extensive experiments on COCO and PASCAL VOC datasets demonstrated that our CA-FPN outperforms other FPN-based detectors without bells and whistles and is robust in different settings.

preprint2022arXiv

Design of a Biomimetic Tactile Sensor for Material Classification

Tactile sensing typically involves active exploration of unknown surfaces and objects, making it especially effective at processing the characteristics of materials and textures. A key property extracted by human tactile perception is surface roughness, which relies on measuring vibratory signals using the multi-layered fingertip structure. Existing robotic systems lack tactile sensors that are able to provide high dynamic sensing ranges, perceive material properties, and maintain a low hardware cost. In this work, we introduce the reference design and fabrication procedure of a miniature and low-cost tactile sensor consisting of a biomimetic cutaneous structure, including the artificial fingerprint, dermis, epidermis, and an embedded magnet-sensor structure which serves as a mechanoreceptor for converting mechanical information to digital signals. The presented sensor is capable of detecting high-resolution magnetic field data through the Hall effect and creating high-dimensional time-frequency domain features for material texture classification. Additionally, we investigate the effects of different superficial sensor fingerprint patterns for classifying materials through both simulation and physical experimentation. After extracting time series and frequency domain features, we assess a k-nearest neighbors classifier for distinguishing between different materials. The results from our experiments show that our biomimetic tactile sensors with fingerprint ridges can classify materials with more than 8% higher accuracy and lower variability than ridge-less sensors. These results, along with the low cost and customizability of our sensor, demonstrate high potential for lowering the barrier to entry for a wide array of robotic applications, including model-less tactile sensing for texture classification, material inspection, and object recognition.

preprint2022arXiv

Hall anomaly, Quantum Oscillations and Possible Lifshitz Transitions in Kondo Insulator YbB$_{12}$: Evidence for Unconventional Charge Transport

In correlated electronic systems, strong interactions and the interplay between different degrees of freedom may give rise to anomalous charge transport properties, which can be tuned by external parameters like temperature and magnetic field. Recently, magnetic quantum oscillations and metallic low-temperature thermal conductivity have been observed in the Kondo insulator YbB$_{12}$, whose resistivity is a few orders of magnitude higher than those of conventional metals. As yet, these unusual observations are not fully understood. Here we present a detailed investigation of the behavior of YbB$_{12}$ under intense magnetic fields using both transport and torque magnetometry measurements. A low-field Hall anomaly, reminiscent of the Hall response associated with &#34;strange-metal&#34; physics, develops at $T < 1.5$ K. At two characteristic magnetic fields ($μ_0H_1= 19.6$ T and $μ_0H_2 \sim 31$ T), signatures appear in the Hall coefficient, magnetic torque, and magnetoresistance. We suggest that they are likely to be field-induced Lifshitz transitions. Moreover, above 35 T, the background resistivity displays an unusual, nonmetallic $T^α$-behavior, with $α$ being field-dependent and varying between -1.5 and -2. By normalizing the Shubnikov-de Haas oscillation amplitude to this $T^α$-dependence, the calculated cyclotron mass becomes more consistent with that deduced from de Haas-van Alphen oscillations. Our results support a novel two-fluid scenario in YbB$_{12}$: a Fermi-liquid-like fluid of charge-neutral quasiparticles coexists with charge carriers that remain in a nonmetallic state. The former experience successive Lifshitz transitions and develop Landau quantization in applied magnetic fields, whilst scattering between both fluids allows the Shubnikov-de Haas effect to be observed in the electrical transport.

preprint2022arXiv

MiMO: Mixture Model for Open Clusters in Color-Magnitude Diagrams

We propose a mixture model of open clusters (OCs) in the color-magnitude diagrams (CMDs) to measure the OC properties, including isochrone parameters (age, distance, metallicity, and dust extinction), stellar mass function (MF), and binary parameters (binary fraction and mass-ratio distribution), with high precision and reliability. The model treats an OC in the CMD as a mixture of single and binary member stars and field stars in the same region. The cluster members are modeled using a theoretical stellar model, MF and binary properties. The field component is modeled nonparametrically using a separate field-star sample in the vicinity of the cluster. Unlike conventional methods that rely on stringent member selection, ours allows us to use a sample of more complete cluster members and attendant field stars. The larger star sample reduces the statistical error and diminishes the potential bias by retaining more stars that are crucial for age estimation and MF measurement. After validating the method with 1000 mock clusters, we measured the parameters of 10 real OCs using Gaia EDR3 data. The best-fit isochrones are consistent with previous measurements in general but with more precise age estimates for several OCs. The inferred MF slope is -2.7 to -1.6 for clusters younger than 2 Gyr, while older clusters appear to have significantly flatter MFs. The binary fraction is 30% to 50%. The photometric and astrometric distances agree well.

preprint2022arXiv

PSGCNet: A Pyramidal Scale and Global Context Guided Network for Dense Object Counting in Remote Sensing Images

Object counting, which aims to count the accurate number of object instances in images, has been attracting more and more attention. However, challenges such as large scale variation, complex background interference, and non-uniform density distribution greatly limit the counting accuracy, particularly striking in remote sensing imagery. To mitigate the above issues, this paper proposes a novel framework for dense object counting in remote sensing images, which incorporates a pyramidal scale module (PSM) and a global context module (GCM), dubbed PSGCNet, where PSM is used to adaptively capture multi-scale information and GCM is to guide the model to select suitable scales generated from PSM. Moreover, a reliable supervision manner improved from Bayesian and Counting loss (BCL) is utilized to learn the density probability and then compute the count expectation at each annotation. It can relieve non-uniform density distribution to a certain extent. Extensive experiments on four remote sensing counting datasets demonstrate the effectiveness of the proposed method and the superiority of it compared with state-of-the-arts. Additionally, experiments extended on four commonly used crowd counting datasets further validate the generalization ability of the model. Code is available at https://github.com/gaoguangshuai/PSGCNet.

preprint2022arXiv

READ: Large-Scale Neural Scene Rendering for Autonomous Driving

Synthesizing free-view photo-realistic images is an important task in multimedia. With the development of advanced driver assistance systems~(ADAS) and their applications in autonomous vehicles, experimenting with different scenarios becomes a challenge. Although the photo-realistic street scenes can be synthesized by image-to-image translation methods, which cannot produce coherent scenes due to the lack of 3D information. In this paper, a large-scale neural rendering method is proposed to synthesize the autonomous driving scene~(READ), which makes it possible to synthesize large-scale driving scenarios on a PC through a variety of sampling schemes. In order to represent driving scenarios, we propose an ω rendering network to learn neural descriptors from sparse point clouds. Our model can not only synthesize realistic driving scenes but also stitch and edit driving scenes. Experiments show that our model performs well in large-scale driving scenarios.

preprint2021arXiv

On the statistical complexity of quantum circuits

In theoretical machine learning, the statistical complexity is a notion that measures the richness of a hypothesis space. In this work, we apply a particular measure of statistical complexity, namely the Rademacher complexity, to the quantum circuit model in quantum computation and study how the statistical complexity depends on various quantum circuit parameters. In particular, we investigate the dependence of the statistical complexity on the resources, depth, width, and the number of input and output registers of a quantum circuit. To study how the statistical complexity scales with resources in the circuit, we introduce a resource measure of magic based on the $(p,q)$ group norm, which quantifies the amount of magic in the quantum channels associated with the circuit. These dependencies are investigated in the following two settings: (i) where the entire quantum circuit is treated as a single quantum channel, and (ii) where each layer of the quantum circuit is treated as a separate quantum channel. The bounds we obtain can be used to constrain the capacity of quantum neural networks in terms of their depths and widths as well as the resources in the network.

preprint2021arXiv

Rademacher complexity of noisy quantum circuits

Noise in quantum systems is a major obstacle to implementing many quantum algorithms on large quantum circuits. In this work, we study the effects of noise on the Rademacher complexity of quantum circuits, which is a measure of statistical complexity that quantifies the richness of classes of functions generated by these circuits. We consider noise models that are represented by convex combinations of unitary channels and provide both upper and lower bounds for the Rademacher complexities of quantum circuits characterized by these noise models. In particular, we find a lower bound for the Rademacher complexity of noisy quantum circuits that depends on the Rademacher complexity of the corresponding noiseless quantum circuit as well as the free robustness of the circuit. Our results show that the Rademacher complexity of quantum circuits decreases with the increase in noise.

preprint2020arXiv

CLUENER2020: Fine-grained Named Entity Recognition Dataset and Benchmark for Chinese

In this paper, we introduce the NER dataset from CLUE organization (CLUENER2020), a well-defined fine-grained dataset for named entity recognition in Chinese. CLUENER2020 contains 10 categories. Apart from common labels like person, organization, and location, it contains more diverse categories. It is more challenging than current other Chinese NER datasets and could better reflect real-world applications. For comparison, we implement several state-of-the-art baselines as sequence labeling tasks and report human performance, as well as its analysis. To facilitate future work on fine-grained NER for Chinese, we release our dataset, baselines, and leader-board.

preprint2020arXiv

Exploring open cluster properties with Gaia and LAMOST

In Gaia DR2, the unprecedented high-precision level reached in sub-mas for astrometry and mmag for photometry. Using cluster members identified with these astrometry and photometry in Gaia DR2, we can obtain a reliable determination of cluster properties. However, because of the shortcoming of Gaia spectroscopic observation in dealing with densely crowded cluster region, the number of radial velocity and metallicity for cluster member stars from Gaia DR2 is still lacking. In this study, we aim to improve the cluster properties by combining the LAMOST spectra. In particular, we provide the list of cluster members with spectroscopic parameters as an add-value catalog in LAMOST DR5, which can be used to perform detailed study for a better understanding on the stellar properties, by using their spectra and fundamental properties from the host cluster. We cross-matched the spectroscopic catalog in LAMOST DR5 with the identified cluster members in Cantat-Gaudin et al.2018 and then used members with spectroscopic parameters to derive statistical properties of open clusters. We obtained a list of 8811 members with spectroscopic parameters and a catalog of 295 cluster properties. In addition, we study the radial and vertical metallicity gradient and age-metallicity relation with the compiled open clusters as tracers, finding slopes of -0.053$\pm$0.004 dex kpc$^{-1}$, -0.252$\pm$0.039 dex kpc$^{-1}$ and 0.022$\pm$0.008 dex Gyr$^{-1}$, respectively. Both slopes of metallicity distribution relation for young clusters (0.1 Gyr < Age < 2 Gyr) and the age-metallicity relation for clusters within 6 Gyr are consistent with literature results. In order to fully study the chemical evolution history in the disk, more spectroscopic observations for old and distant open clusters are needed for further investigation.

preprint2020arXiv

Learning Spatiotemporal Features of Ride-sourcing Services with Fusion Convolutional Network

To collectively forecast the demand for ride-sourcing services in all regions of a city, the deep learning approaches have been applied with commendable results. However, the local statistical differences throughout the geographical layout of the city make the spatial stationarity assumption of the convolution invalid, which limits the performance of CNNs on the demand forecasting task. In this paper, we propose a novel deep learning framework called LC-ST-FCN (locally connected spatiotemporal fully-convolutional neural network) to address the unique challenges of the region-level demand forecasting problem within one end-to-end architecture (E2E). We first employ the 3D convolutional layers to fuse the spatial and temporal information existed in the input and then feed the spatiotemporal features extracted by the 3D convolutional layers to the subsequent 2D convolutional layers. Afterward, the prediction value of each region is obtained by the locally connected convolutional layers which relax the parameter sharing scheme. We evaluate the proposed model on a real dataset from a ride-sourcing service platform (DiDiChuxing) and observe significant improvements compared with a bunch of baseline models. Besides, we also illustrate the effectiveness of our proposed model by visualizing how different types of convolutional layers transform their input and capture useful features. The visualization results show that fully convolutional architecture enables the model to better localize the related regions. And the locally connected layers play an important role in dealing with the local statistical differences and activating useful regions.

preprint2020arXiv

Linearly implicit local and global energy-preserving methods for PDEs with a cubic Hamiltonian

We present linearly implicit methods that preserve discrete approximations to local and global energy conservation laws for multi-symplectic PDEs with cubic invariants. The methods are tested on the one-dimensional Korteweg-de Vries equation and the two-dimensional Zakharov-Kuznetsov equation; the numerical simulations confirm the conservative properties of the methods, and demonstrate their good stability properties and superior running speed when compared to fully implicit schemes.

preprint2020arXiv

Linearly implicit structure-preserving schemes for Hamiltonian systems

Kahan&#39;s method and a two-step generalization of the discrete gradient method are both linearly implicit methods that can preserve a modified energy for Hamiltonian systems with a cubic Hamiltonian. These methods are here investigated and compared. The schemes are applied to the Korteweg-de Vries equation and the Camassa-Holm equation, and the numerical results are presented and analysed.

preprint2020arXiv

Organ size increases with obesity and correlates with cancer risk

Obesity increases significantly cancer risk in various organs. Although this has been recognized for decades, the mechanism through which this happens has never been explained. Here, we show that the volumes of kidneys, pancreas, and liver are strongly correlated (median correlation = 0.625; P-value<10-47) with the body mass index (BMI) of an individual. We also find a significant relationship between the increase in organ volume and the increase in cancer risk (P-value<10-12). These results provide a mechanism explaining why obese individuals have higher cancer risk in several organs: the larger the organ volume the more cells at risk of becoming cancerous. These findings are important for a better understanding of the effects obesity has on cancer risk and, more generally, for the development of better preventive strategies to limit the mortality caused by obesity.

preprint2020arXiv

Outcome regression-based estimation of conditional average treatment effect

The research is about a systematic investigation on the following issues. First, we construct different outcome regression-based estimators for conditional average treatment effect under, respectively, true (oracle), parametric, nonparametric and semiparametric dimension reduction structure. Second, according to the corresponding asymptotic variance functions, we answer the following questions when supposing the models are correctly specified: what is the asymptotic efficiency ranking about the four estimators in general? how is the efficiency related to the affiliation of the given covariates in the set of arguments of the regression functions? what do the roles of bandwidth and kernel function selections play for the estimation efficiency; and in which scenarios should the estimator under semiparametric dimension reduction regression structure be used in practice? As a by-product, the results show that any outcome regression-based estimation should be asymptotically more efficient than any inverse probability weighting-based estimation. All these results give a relatively complete picture of the outcome regression-based estimation such that the theoretical conclusions could provide guidance for practical use when more than one estimations can be applied to the same problem. Several simulation studies are conducted to examine the performances of these estimators in finite sample cases and a real dataset is analyzed for illustration.

preprint2020arXiv

Unveiling the Hierarchical Structure of Open Star Clusters: the Perseus Double Cluster

We introduce a new kinematic method to investigate the structure of open star clusters. We adopt a hierarchical clustering algorithm that uses the celestial coordinates and the proper motions of the stars in the field of view of the cluster to estimate a proxy of the pairwise binding energy of the stars and arrange them in a binary tree. The cluster substructures and their members are identified by trimming the tree at two thresholds, according to the $σ$-plateau method. Testing the algorithm on 100 mock catalogs shows that, on average, the membership of the identified clusters is $(91.5\pm 3.5)$\% complete and the fraction of unrelated stars is $(10.4\pm 2.0)$\%. We apply the algorithm to the stars in the field of view of the Perseus double cluster from the Data Release 2 of Gaia. This approach identifies a single structure, Sub1, that separates into two substructures, Sub1-1 and Sub1-2. These substructures coincide with $h$ Per and $χ$ Per: the distributions of the proper motions and the color-magnitude diagrams of the members of Sub1-1 and Sub1-2 are fully consistent with those of $h$ Per and $χ$ Per reported in the literature. These results suggest that our hierarchical clustering algorithm can be a powerful tool to unveil the complex kinematic information of star clusters.

preprint2019arXiv

Quantifying the resource content of quantum channels: An operational approach

We propose a general method to operationally quantify the resourcefulness of quantum channels via channel discrimination, an important information processing task. A main result is that the maximum success probability of distinguishing a given channel from the set of free channels by free probe states is exactly characterized by the resource generating power, i.e. the maximum amount of resource produced by the action of the channel, given by the trace distance to the set of free states. We apply this framework to the resource theory of quantum coherence, as an informative example. The general results can also be easily applied to other resource theories such as entanglement, magic states, and asymmetry.