Researcher profile

Lei Dong

Lei Dong contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2026arXiv

A Geometric Analysis of Sign-Magnitude Asymmetry in a ReLU + RMSNorm Block under Ternary Quantization

Pre-norm Transformers with RMSNorm tolerate ternary {-1,0,+1} weight quantization with surprisingly small loss (Ma et al., 2024). We give a geometric explanation via sign-magnitude decomposition of weight perturbations. In a two-layer ReLU + RMSNorm model with i.i.d. Gaussian weights, sign-flips produce $π/(π-2) \approx 2.75$ times more transverse output energy than sign-preserving magnitude perturbations of equal Frobenius norm, as the flip rate $p \to 0$ (Theorem 3). The mechanism: ReLU creates a hidden-space directional asymmetry between the two perturbation types, which RMSNorm's transverse-projection Fréchet derivative selectively exposes. Sign-quantization error is itself a sign-preserving perturbation with angular alignment $\cos^2 \to 2/π$ (Theorem 4); its post-ReLU radial fraction ($0.365$) matches the pre-ReLU value $1-2/π$ within $0.4\%$, so ReLU is approximately transparent to ternary error. Multi-layer compounding of the $2.75\times$ factor is not experimentally supported; the gap to real-model sign sensitivity arises from outlier features violating delocalization. For an input dimension with amplitude $α$, a single sign-flip produces post-ReLU energy amplified by $R \approx nα^2$ relative to a delocalized entry. On TinyLlama-1.1B, at linear response ($p \leq 0.5\%$), count-matched NLL leverage stabilizes at $\sim 10\times \approx n\mathbb{E}[α^2]$, matching the per-entry theory; the all-column NLL ratio of $5.0\times$ falls within $R_{\mathrm{col}} \leq 19$ ($67\times$ PPL gap reflects metric nonlinearity). Measured outlier $α$ at layer 12 (median $0.024$, max $0.26$) confirms heavy-tailed concentration. The Bussgang constant $2/π$, RMSNorm geometry, and ReLU half-space structure together explain sign-magnitude asymmetry in pre-norm models, with $R \propto nα^2$ accounting for real-model deviations.

preprint2022arXiv

Mapping evolving population geography in China

China's demographic changes have important global economic and geopolitical implications. Yet, our understanding of such transitions at the micro-spatial scale remains limited due to spatial inconsistency of the census data caused by administrative boundary adjustments. To fill this gap, we manually collected and built a population census panel from 2010 to 2020 at both the county and prefectural-city levels. We show that the massive internal migration drives China's increasing population concentration and regional disparity, resulting in severe population aging in shrinking cities and increasing gender imbalance in growing cities.

preprint2022arXiv

MetroGAN: Simulating Urban Morphology with Generative Adversarial Network

Simulating urban morphology with location attributes is a challenging task in urban science. Recent studies have shown that Generative Adversarial Networks (GANs) have the potential to shed light on this task. However, existing GAN-based models are limited by the sparsity of urban data and instability in model training, hampering their applications. Here, we propose a GAN framework with geographical knowledge, namely Metropolitan GAN (MetroGAN), for urban morphology simulation. We incorporate a progressive growing structure to learn hierarchical features and design a geographical loss to impose the constraints of water areas. Besides, we propose a comprehensive evaluation framework for the complex structure of urban systems. Results show that MetroGAN outperforms the state-of-the-art urban simulation methods by over 20% in all metrics. Inspiringly, using physical geography features singly, MetroGAN can still generate shapes of the cities. These results demonstrate that MetroGAN solves the instability problem of previous urban simulation GANs and is generalizable to deal with various urban attributes.

preprint2022arXiv

Quantifying navigation complexity in transportation networks

The complexity of navigation in cities has increased with the expansion of urban areas, creating challenging transportation problems that drive many studies on the navigability of networks. However, due to the lack of individual mobility data, large-scale empirical analysis of the wayfinder's real-world navigation is rare. Here, using 225 million subway trips from three major cities in China, we quantify navigation difficulty from an information perspective. Our results reveal that 1) people conserve a small number of repeatedly used routes, and 2) the navigation information in the subnetworks formed by those routes is much smaller than the theoretical value in the global network, suggesting that the decision cost for actual trips is significantly smaller than the theoretical upper limit found in previous studies. By modeling routing behaviors in growing networks, we show that while the global network becomes difficult to navigate, navigability can be improved in subnetworks. We further present a universal linear relationship between the empirical and theoretical search information, which allows the two metrics to predict each other. Our findings demonstrate how large-scale observations can quantify real-world navigation behaviors and aid in evaluating transportation planning.

preprint2022arXiv

The universality in urban commuting across and within cities

Commuting is a key mechanism that governs the dynamics of cities. Despite its importance, very little is known of the properties and mechanisms underlying this crucial urban process. Here, we capitalize on $\sim$ 50 million individuals' smartphone data from 234 Chinese cities to show that urban commuting obeys remarkable regularities. These regularities can be generalized as two laws: (i) the scale-invariance of the average commuting distance across cities, which is a long-awaited validation of Marchetti's constant conjecture, and (ii) a universal inverted U-shape of the commuting distance as a function of the distance from the city centre within cities, indicating that the city centre's attraction is bounded. Motivated by such empirical findings, we develop a simple urban growth model that connects individual-level mobility choices with macroscopic urban spatial structure and faithfully explains both commuting laws. Our results further show that the scale-invariants of human mobility will ultimately lead to the polycentric transition in cities, which could be used to better inform urban development strategies.

preprint2020arXiv

A gridded establishment dataset as a proxy for economic activity in China

Measuring the geographical distribution of economic activity plays a key role in scientific research and policymaking. However, previous studies and data on economic activity either have a coarse spatial resolution or cover a limited time span, and the high-resolution characteristics of socioeconomic dynamics are largely unknown. Here, we construct a dataset on the economic activity of mainland China, the gridded establishment dataset (GED), which measures the volume of establishments at a 0.01$^{\circ}$ latitude by 0.01$^{\circ}$ longitude scale. Specifically, our dataset captures the geographically based opening and closing of approximately 25.5 million firms that registered in mainland China over the period 2005-2015. The characteristics of fine granularity and long-term observability give the GED a high application value. The dataset not only allows us to quantify the spatiotemporal patterns of the establishments, urban vibrancy and socioeconomic activity, but also helps us uncover the fundamental principles underlying the dynamics of industrial and economic development.

preprint2020arXiv

Quantifying urban areas with multi-source data based on percolation theory

Quantifying urban areas is crucial for addressing associated urban issues such as environmental and sustainable problems. Remote sensing data, especially the nighttime light images, have been widely used to delineate urbanized areas across the world. Meanwhile, some emerging urban data, such as volunteered geographical information (e.g., OpenStreetMap) and social sensing data (e.g., mobile phone and social media), have also shown great potential in revealing urban boundaries and dynamics. However, consistent and robust methods to quantify urban areas from these multi-source data have remained elusive. Here, we propose a percolation-based method to extract urban areas from these multi-source urban data. We derive the optimal urban/non-urban threshold by considering the critical nature of urban systems with the support of the percolation theory. Furthermore, we apply the method with three open-source datasets - population, road, and nighttime light - to 28 countries. We show that the proposed method captures the similar urban characteristics in terms of urban areas from multi-source data, and Zipf's law holds well in most countries. The accuracy of the derived urban areas by different datasets has been validated with the Landsat-based reference data in 10 cities, and the accuracy can be further improved through data fusion ($κ=0.69-0.85$, mean $κ=0.78$). Our study not only provides an efficient method to quantify urban areas with open-source data, but also deepens the understanding of urban systems and sheds some light on multi-source data fusion in geographical fields.

preprint2020arXiv

The spectral dimension of human mobility

Human mobility patterns are surprisingly structured. In spite of many hard to model factors, such as climate, culture, and socioeconomic opportunities, aggregate migration rates obey a universal, parameter-free, `radiation' model. Recent work has further shown that the detailed spectral decomposition of these flows -- defined as the number of individuals that visit a given location with frequency $f$ from a distance $r$ away -- also obeys simple rules, namely, scaling as a universal inverse square law in the combination, $rf$. However, this surprising regularity, derived on general grounds, has not been explained through microscopic mechanisms of individual behavior. Here we confirm this by analyzing large-scale cell phone datasets from three distinct regions and show that a direct consequence of this scaling law is that the average `travel energy' spent by visitors to a given location is constant across space, a finding reminiscent of the well-known travel budget hypothesis of human movement. The attractivity of different locations, which we define by the total number of visits to that location, also admits non-trivial, spatially-clustered structure. The observed pattern is consistent with the well-known central place theory in urban geography, as well as with the notion of Weber optimality in spatial economy, hinting to a collective human capacity of optimizing recurrent movements. We close by proposing a simple, microscopic human mobility model which simultaneously captures all our empirical findings. Our results have relevance for transportation, urban planning, geography, and other disciplines in which a deeper understanding of aggregate human mobility is key.

preprint2020arXiv

Understanding the mesoscopic scaling patterns within cities

Understanding quantitative relationships between urban elements is crucial for a wide range of applications. The observation at the macroscopic level demonstrates that the aggregated urban quantities (e.g., gross domestic product) scale systematically with population sizes across cities, also known as urban scaling laws. However, at the mesoscopic level, we lack an understanding of whether the simple scaling relationship holds within cities, which is a fundamental question regarding the spatial origin of scaling in urban systems. Here, by analyzing four extensive datasets covering millions of mobile phone users and urban facilities, we investigate the scaling phenomena within cities. We find that the mesoscopic infrastructure volume and socioeconomic activity scale sub- and super-linearly with the active population, respectively. For a same scaling phenomenon, however, the exponents vary in cities of similar population sizes. To explain these empirical observations, we propose a conceptual framework by considering the heterogeneous distributions of population and facilities, and the spatial interactions between them. Analytical and numerical results suggest that, despite the large number of complexities that influence urban activities, the simple interaction rules can effectively explain the observed regularity and heterogeneity in scaling behaviors within cities.