Researcher profile

Hao Ding

Hao Ding contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
15topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2025arXiv

The systemic recoil velocity distribution and the scale height of field millisecond pulsar systems: Implications on neutron star retention fractions in star clusters

The systemic recoil velocity ($v_\mathrm{sys}$) distribution of millisecond pulsars (MSPs) is essential for understanding the MSP formation channel(s) and for estimating the retention fractions of MSPs in star clusters. However, the determination is complicated by MSPs' long-term dynamic evolution and the scarcity of radial velocity measurements. We compiled 64 field MSP systems that are well astrometrically determined, and calculated their transverse peculiar velocities $\boldsymbol{v}_\perp$ and Galactic heights $z$. Assuming that the Galactic-longitude components $v_\mathrm{l}$ of $\boldsymbol{v}_\perp$ are statistically stable over time (the "stable-$v_\mathrm{l}$" assumption), we approached the distribution of the $v_\mathrm{l}$ components of $\boldsymbol{v}_\mathrm{sys}$ by the observed $v_\mathrm{l}$ sample. We find that the observed $v_\mathrm{l}$ can be well described by a linear combination of three normal distributions. Accordingly, the MSP $v_\mathrm{sys}$ distribution can be approximated by a linear combination of three Maxwellian components under the assumption that $\boldsymbol{v}_\mathrm{sys}$ directions are uniformly distributed. Our dynamical population synthesis analysis based on the derived $v_\mathrm{sys}$ distribution verified the "stable-$v_\mathrm{l}$" assumption in the parameter space of this work, and estimated the initial and the current Galaxy-wide scale heights of field MSP systems to be about 0.32 kpc and 0.68 kpc, respectively. According to the MSP $v_\mathrm{sys}$ distribution, $\approx14$% of all the MSPs born in a globular cluster with the nominal 50 $\mathrm{km~s^{-1}}$ central escape velocity can be retained. Therefore, the $v_\mathrm{sys}$ distribution of field MSP systems may account for the high number of MSPs discovered in globular clusters, which implies that MSPs in star clusters may follow the same formation channel(s) as field MSP systems.

preprint2024arXiv

Consistent Intradecadal/Interdecadal Oscillations in the Surface Geomagnetic Observations and in the $Δ$LOD: New Findings and Unresolved Problems

Surface geomagnetic observations and length of day variations (dLOD) have played an important role in interpreting the long-period motions of the Earth's core. Focusing on the 5-30yr period band, we use the optimal sequence estimation method to analyze the global geomagnetic vertical observations, and newly find that the about 6yr/8.6yr signals, the about7.6yr/13.6yr/22.5yr signals, the about 15.4yr and about 18.6yr signals respectively have the Y2,2, Y2,-2, Y2,-1 and Y2,0 spatial patterns; in which, the about 7.6yr/8.6yr/13.6yr/15.4yr/18.6yr signals are clearly detected for the first time. We also find that the five Y2,+/-2-related "equivalent" excitation signals have good phase consistency with the corresponding signals in the dLOD, and they have the same about 0.042 amplitude scaling factor. Determining the physical mechanisms behind those seven signals still leaves many unresolved problems, and our new findings make those problems more complicated. As a preliminary thought, we suggest the five Y2,+/-2-related signals may be originated from the Magnetic-Archimedes-Coriolis waves

preprint2024arXiv

Pre-trained Recommender Systems: A Causal Debiasing Perspective

Recent studies on pre-trained vision/language models have demonstrated the practical benefit of a new, promising solution-building paradigm in AI where models can be pre-trained on broad data describing a generic task space and then adapted successfully to solve a wide range of downstream tasks, even when training data is severely limited (e.g., in zero- or few-shot learning scenarios). Inspired by such progress, we investigate in this paper the possibilities and challenges of adapting such a paradigm to the context of recommender systems, which is less investigated from the perspective of pre-trained model. In particular, we propose to develop a generic recommender that captures universal interaction patterns by training on generic user-item interaction data extracted from different domains, which can then be fast adapted to improve few-shot learning performance in unseen new domains (with limited data). However, unlike vision/language data which share strong conformity in the semantic space, universal patterns underlying recommendation data collected across different domains (e.g., different countries or different E-commerce platforms) are often occluded by both in-domain and cross-domain biases implicitly imposed by the cultural differences in their user and item bases, as well as their uses of different e-commerce platforms. As shown in our experiments, such heterogeneous biases in the data tend to hinder the effectiveness of the pre-trained model. To address this challenge, we further introduce and formalize a causal debiasing perspective, which is substantiated via a hierarchical Bayesian deep learning model, named PreRec. Our empirical studies on real-world data show that the proposed model could significantly improve the recommendation performance in zero- and few-shot learning settings under both cross-market and cross-platform scenarios.

preprint2022arXiv

CaRTS: Causality-driven Robot Tool Segmentation from Vision and Kinematics Data

Vision-based segmentation of the robotic tool during robot-assisted surgery enables downstream applications, such as augmented reality feedback, while allowing for inaccuracies in robot kinematics. With the introduction of deep learning, many methods were presented to solve instrument segmentation directly and solely from images. While these approaches made remarkable progress on benchmark datasets, fundamental challenges pertaining to their robustness remain. We present CaRTS, a causality-driven robot tool segmentation algorithm, that is designed based on a complementary causal model of the robot tool segmentation task. Rather than directly inferring segmentation masks from observed images, CaRTS iteratively aligns tool models with image observations by updating the initially incorrect robot kinematic parameters through forward kinematics and differentiable rendering to optimize image feature similarity end-to-end. We benchmark CaRTS with competing techniques on both synthetic as well as real data from the dVRK, generated in precisely controlled scenarios to allow for counterfactual synthesis. On training-domain test data, CaRTS achieves a Dice score of 93.4 that is preserved well (Dice score of 91.8) when tested on counterfactually altered test data, exhibiting low brightness, smoke, blood, and altered background patterns. This compares favorably to Dice scores of 95.0 and 86.7, respectively, of the SOTA image-based method. Future work will involve accelerating CaRTS to achieve video framerate and estimating the impact occlusion has in practice. Despite these limitations, our results are promising: In addition to achieving high segmentation accuracy, CaRTS provides estimates of the true robot kinematics, which may benefit applications such as force estimation. Code is available at: https://github.com/hding2455/CaRTS

preprint2022arXiv

Context Uncertainty in Contextual Bandits with Applications to Recommender Systems

Recurrent neural networks have proven effective in modeling sequential user feedbacks for recommender systems. However, they usually focus solely on item relevance and fail to effectively explore diverse items for users, therefore harming the system performance in the long run. To address this problem, we propose a new type of recurrent neural networks, dubbed recurrent exploration networks (REN), to jointly perform representation learning and effective exploration in the latent space. REN tries to balance relevance and exploration while taking into account the uncertainty in the representations. Our theoretical analysis shows that REN can preserve the rate-optimal sublinear regret even when there exists uncertainty in the learned representations. Our empirical study demonstrates that REN can achieve satisfactory long-term rewards on both synthetic and real-world recommendation datasets, outperforming state-of-the-art models.

preprint2022arXiv

Hankel Spectrum Analysis: A novel signal decomposition method and its geophysical applications

To analyze non-stationary harmonic signals typically contained in geophysical observables is a quest that has seen continual advances in numerical techniques over the decades. In this paper, based on transient z-pole estimation (in Hankel matrices), a novel state-space analysis referred to as Hankel Spectral Analysis (HSA), was developed. Depended on the Hankel total least square (HTLS), the HSA incorporates truncated singular value decomposition (TSVD) and its shift-invariant property in robustly decomposing the closely-spaced sinusoids. Resorted to a sliding window processing, HSA can be used to analyze non-stationary sequential structures, in the support of consecutive quaternary parameters {Ai, αi, fi, θi}. Based on a series of experiments with special features commonly in real measurements, the availabilities of HSA in complex harmonic constituents (e.g., the time-variant amplitude/frequency, mutation, the episodic recording signals) with low Signal-to-Noise Ratio are confirmed. In real applications, we use HSA to analyze both global geophysical observables, including polar motion (PM) and earth's dynamic oblateness (ΔJ2), and some new findings are obtained. In the PM series since the 1900s, a total of triple jumps from Chandler wobble (CW) are firstly confirmed; and all of them are synchronized by the sharp decrease of Chandler intensity and period. In the ΔJ2 series, two decadal signals (18.6 yr, 10.5 yr) are identified to be associated with the tide effect, and solar activity; and its interannual-to-decadal oscillations contribute to multiple global gravity anomalies. These findings implied the great potential of the HSA in searching hitherto signals of geophysical observations.

preprint2021arXiv

Gender Inequality in Research Productivity During the COVID-19 Pandemic

We study the disproportionate impact of the lockdown as a result of the COVID-19 outbreak on female and male academics' research productivity in social science. The lockdown has caused substantial disruptions to academic activities, requiring people to work from home. How this disruption affects productivity and the related gender equity is an important operations and societal question. We collect data from the largest open-access preprint repository for social science on 41,858 research preprints in 18 disciplines produced by 76,832 authors across 25 countries over a span of two years. We use a difference-in-differences approach leveraging the exogenous pandemic shock. Our results indicate that, in the 10 weeks after the lockdown in the United States, although the total research productivity increased by 35%, female academics' productivity dropped by 13.9% relative to that of male academics. We also show that several disciplines drive such gender inequality. Finally, we find that this intensified productivity gap is more pronounced for academics in top-ranked universities, and the effect exists in six other countries. Our work points out the fairness issue in productivity caused by the lockdown, a finding that universities will find helpful when evaluating faculty productivity. It also helps organizations realize the potential unintended consequences that can arise from telecommuting.

preprint2020arXiv

Cross-View Image Synthesis with Deformable Convolution and Attention Mechanism

Learning to generate natural scenes has always been a daunting task in computer vision. This is even more laborious when generating images with very different views. When the views are very different, the view fields have little overlap or objects are occluded, leading the task very challenging. In this paper, we propose to use Generative Adversarial Networks(GANs) based on a deformable convolution and attention mechanism to solve the problem of cross-view image synthesis (see Fig.1). It is difficult to understand and transform scenes appearance and semantic information from another view, thus we use deformed convolution in the U-net network to improve the network's ability to extract features of objects at different scales. Moreover, to better learn the correspondence between images from different views, we apply an attention mechanism to refine the intermediate feature map thus generating more realistic images. A large number of experiments on different size images on the Dayton dataset[1] show that our model can produce better results than state-of-the-art methods.

preprint2020arXiv

Discovery of dissipative microwave photonic solitons

Dissipative solitons rely on the double balance between nonlinearity and dispersion as well as gain and loss have attracted a lot of attention in optics, since it gives rise to ultrashort pulses and broadband frequency combs with good stability and smooth spectral envelopes. Here we observe a novel dissipative solitons in microwave photonics that gives rise to wideband tunable frequency hopping microwave signals with fast frequency switching speed. The dissipative microwave photonic solitions are achieved through the double balance between nonlinear gain saturation and linear filtering as well as gain and loss in a microwave photonic resonant cavity. The generation of dissipative solitons with different pulse width, repletion rate and number of solitons per round-trip time are observed, together with the corresponding wideband tunable frequency hopping microwave signals. This work opens new avenues for signal generation, processing and control based on the principle of solitons in microwave photonics, and has great potential in many applications such as modern radars, electronic warfare systems, and telecommunications.

preprint2020arXiv

New evidences for the fluctuation characteristic of intradecadal periodic signals in length-of-day variation

The intradecadal fluctuations in the length-of-day variation (dLOD) are considered likely to play an important role in core motions. Two intradecadal oscillations, with 5.9yr and 8.5yr periods (referred to as SYO and EYO, respectively), have been detected in previous studies. However, whether the SYO and the EYO have stable damping trends since 1962 and whether geomagnetic jerks are possible excitation sources for the SYO/EYO are still debated. In this study, based on different methods and dLOD records with different time span, we show robust evidences to prove that the SYO and the EYO have no stable damping trends since 1962, and we find that there is also a possible 7.6yr signal. To prove whether it is a periodic signal, we use the optimal sequence estimation method to stack 35 global geomagnetic records, the results also show an 7.6yr periodic signal which has an Y2,-2 spatial distribution, and it has a high degree of consistent synchronicity with the 7.6yr signal in dLOD. After confirming that the jerks have no special consistency with the peaks/valleys of the EYO/SYO, we confirm that the geomagnetic jerks seem to be related to sudden changes in the SYO/EYO time series and their excitation series; so we finally suggest that jerks are possible excitation sources of the SYO/EYO. Meanwhile, after using a deconvolution method, we estimate that the period P and quality factor Q of the SYO and the EYO are [P=5.85+/-0.06yr, Q larger than 180] and [P=8.455+/-0.17yr, Q larger than 350], respectively.

preprint2020arXiv

SimGNN: A Neural Network Approach to Fast Graph Similarity Computation

Graph similarity search is among the most important graph-based applications, e.g. finding the chemical compounds that are most similar to a query compound. Graph similarity computation, such as Graph Edit Distance (GED) and Maximum Common Subgraph (MCS), is the core operation of graph similarity search and many other applications, but very costly to compute in practice. Inspired by the recent success of neural network approaches to several graph applications, such as node or graph classification, we propose a novel neural network based approach to address this classic yet challenging graph problem, aiming to alleviate the computational burden while preserving a good performance. The proposed approach, called SimGNN, combines two strategies. First, we design a learnable embedding function that maps every graph into a vector, which provides a global summary of a graph. A novel attention mechanism is proposed to emphasize the important nodes with respect to a specific similarity metric. Second, we design a pairwise node comparison method to supplement the graph-level embeddings with fine-grained node-level information. Our model achieves better generalization on unseen graphs, and in the worst case runs in quadratic time with respect to the number of nodes in two graphs. Taking GED computation as an example, experimental results on three real graph datasets demonstrate the effectiveness and efficiency of our approach. Specifically, our model achieves smaller error rate and great time reduction compared against a series of baselines, including several approximation algorithms on GED computation, and many existing graph neural network based models. To the best of our knowledge, we are among the first to adopt neural networks to explicitly model the similarity between two graphs, and provide a new direction for future research on graph similarity computation and graph similarity search.

preprint2020arXiv

Very long baseline astrometry of PSR J1012+5307 and its implications on alternative theories of gravity

PSR J1012+5307, a millisecond pulsar in orbit with a helium white dwarf (WD), has been timed with high precision for about 25 years. One of the main objectives of this long-term timing is to use the large asymmetry in gravitational binding energy between the neutron star and the WD to test gravitational theories. Such tests, however, will be eventually limited by the accuracy of the distance to the pulsar. Here, we present VLBI (very long baseline interferometry) astrometry results spanning approximately 2.5 years for PSR J1012+5307, obtained with the Very Long Baseline Array as part of the MSPSRPI project. These provide the first proper motion and absolute position for PSR J1012+5307 measured in a quasi-inertial reference frame. From the VLBI results, we measure a distance of $0.83^{+0.06}_{-0.02}$kpc (all the estimates presented in the abstract are at 68% confidence) for PSR J1012+5307, which is the most precise obtained to date. Using the new distance, we improve the uncertainty of measurements of the unmodeled contributions to orbital period decay, which, combined with three other pulsars, places new constraints on the coupling constant for dipole gravitational radiation $κ_D=(-1.7\pm1.7)\times 10^{-4}$ and the fractional time derivative of Newton's gravitational constant $\dot{G}/G = -1.8^{\,+5.6}_{\,-4.7}\times 10^{-13}\,{\rm yr^{-1}}$ in the local universe. As the uncertainties of the observed decays of orbital period for the four leading pulsar-WD systems become negligible in $\approx10$ years, the uncertainties for $\dot{G}/G$ and $κ_D$ will be improved to $\leq1.5\times10^{-13}\,{\rm yr^{-1}}$ and $\leq1.0\times10^{-4}$, respectively, predominantly limited by the distance uncertainties.