Source author record

Lei Dong

Lei Dong appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

physics.soc-ph Social and Information Networks cs.CY econ.GN q-fin.EC Computational Geometry cond-mat.mtrl-sci Machine Learning math.MG physics.geo-ph

Catalog footprint

What is connected

12works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A Geometric Analysis of Sign-Magnitude Asymmetry in a ReLU + RMSNorm Block under Ternary Quantization

Pre-norm Transformers with RMSNorm tolerate ternary {-1,0,+1} weight quantization with surprisingly small loss (Ma et al., 2024). We give a geometric explanation via sign-magnitude decomposition of weight perturbations. In a two-layer ReLU + RMSNorm model with i.i.d. Gaussian weights, sign-flips produce $π/(π-2) \approx 2.75$ times more transverse output energy than sign-preserving magnitude perturbations of equal Frobenius norm, as the flip rate $p \to 0$ (Theorem 3). The mechanism: ReLU creates a hidden-space directional asymmetry between the two perturbation types, which RMSNorm's transverse-projection Fréchet derivative selectively exposes. Sign-quantization error is itself a sign-preserving perturbation with angular alignment $\cos^2 \to 2/π$ (Theorem 4); its post-ReLU radial fraction ($0.365$) matches the pre-ReLU value $1-2/π$ within $0.4\%$, so ReLU is approximately transparent to ternary error. Multi-layer compounding of the $2.75\times$ factor is not experimentally supported; the gap to real-model sign sensitivity arises from outlier features violating delocalization. For an input dimension with amplitude $α$, a single sign-flip produces post-ReLU energy amplified by $R \approx nα^2$ relative to a delocalized entry. On TinyLlama-1.1B, at linear response ($p \leq 0.5\%$), count-matched NLL leverage stabilizes at $\sim 10\times \approx n\mathbb{E}[α^2]$, matching the per-entry theory; the all-column NLL ratio of $5.0\times$ falls within $R_{\mathrm{col}} \leq 19$ ($67\times$ PPL gap reflects metric nonlinearity). Measured outlier $α$ at layer 12 (median $0.024$, max $0.26$) confirms heavy-tailed concentration. The Bussgang constant $2/π$, RMSNorm geometry, and ReLU half-space structure together explain sign-magnitude asymmetry in pre-norm models, with $R \propto nα^2$ accounting for real-model deviations.

preprint2022arXiv

Mapping evolving population geography in China

China's demographic changes have important global economic and geopolitical implications. Yet, our understanding of such transitions at the micro-spatial scale remains limited due to spatial inconsistency of the census data caused by administrative boundary adjustments. To fill this gap, we manually collected and built a population census panel from 2010 to 2020 at both the county and prefectural-city levels. We show that the massive internal migration drives China's increasing population concentration and regional disparity, resulting in severe population aging in shrinking cities and increasing gender imbalance in growing cities.

preprint2022arXiv

MetroGAN: Simulating Urban Morphology with Generative Adversarial Network

Simulating urban morphology with location attributes is a challenging task in urban science. Recent studies have shown that Generative Adversarial Networks (GANs) have the potential to shed light on this task. However, existing GAN-based models are limited by the sparsity of urban data and instability in model training, hampering their applications. Here, we propose a GAN framework with geographical knowledge, namely Metropolitan GAN (MetroGAN), for urban morphology simulation. We incorporate a progressive growing structure to learn hierarchical features and design a geographical loss to impose the constraints of water areas. Besides, we propose a comprehensive evaluation framework for the complex structure of urban systems. Results show that MetroGAN outperforms the state-of-the-art urban simulation methods by over 20% in all metrics. Inspiringly, using physical geography features singly, MetroGAN can still generate shapes of the cities. These results demonstrate that MetroGAN solves the instability problem of previous urban simulation GANs and is generalizable to deal with various urban attributes.

preprint2022arXiv

Quantifying navigation complexity in transportation networks

The complexity of navigation in cities has increased with the expansion of urban areas, creating challenging transportation problems that drive many studies on the navigability of networks. However, due to the lack of individual mobility data, large-scale empirical analysis of the wayfinder's real-world navigation is rare. Here, using 225 million subway trips from three major cities in China, we quantify navigation difficulty from an information perspective. Our results reveal that 1) people conserve a small number of repeatedly used routes, and 2) the navigation information in the subnetworks formed by those routes is much smaller than the theoretical value in the global network, suggesting that the decision cost for actual trips is significantly smaller than the theoretical upper limit found in previous studies. By modeling routing behaviors in growing networks, we show that while the global network becomes difficult to navigate, navigability can be improved in subnetworks. We further present a universal linear relationship between the empirical and theoretical search information, which allows the two metrics to predict each other. Our findings demonstrate how large-scale observations can quantify real-world navigation behaviors and aid in evaluating transportation planning.

preprint2022arXiv

The universality in urban commuting across and within cities

Commuting is a key mechanism that governs the dynamics of cities. Despite its importance, very little is known of the properties and mechanisms underlying this crucial urban process. Here, we capitalize on $\sim$ 50 million individuals' smartphone data from 234 Chinese cities to show that urban commuting obeys remarkable regularities. These regularities can be generalized as two laws: (i) the scale-invariance of the average commuting distance across cities, which is a long-awaited validation of Marchetti's constant conjecture, and (ii) a universal inverted U-shape of the commuting distance as a function of the distance from the city centre within cities, indicating that the city centre's attraction is bounded. Motivated by such empirical findings, we develop a simple urban growth model that connects individual-level mobility choices with macroscopic urban spatial structure and faithfully explains both commuting laws. Our results further show that the scale-invariants of human mobility will ultimately lead to the polycentric transition in cities, which could be used to better inform urban development strategies.

preprint2020arXiv

A gridded establishment dataset as a proxy for economic activity in China

Measuring the geographical distribution of economic activity plays a key role in scientific research and policymaking. However, previous studies and data on economic activity either have a coarse spatial resolution or cover a limited time span, and the high-resolution characteristics of socioeconomic dynamics are largely unknown. Here, we construct a dataset on the economic activity of mainland China, the gridded establishment dataset (GED), which measures the volume of establishments at a 0.01$^{\circ}$ latitude by 0.01$^{\circ}$ longitude scale. Specifically, our dataset captures the geographically based opening and closing of approximately 25.5 million firms that registered in mainland China over the period 2005-2015. The characteristics of fine granularity and long-term observability give the GED a high application value. The dataset not only allows us to quantify the spatiotemporal patterns of the establishments, urban vibrancy and socioeconomic activity, but also helps us uncover the fundamental principles underlying the dynamics of industrial and economic development.

preprint2020arXiv

Quantifying urban areas with multi-source data based on percolation theory

Quantifying urban areas is crucial for addressing associated urban issues such as environmental and sustainable problems. Remote sensing data, especially the nighttime light images, have been widely used to delineate urbanized areas across the world. Meanwhile, some emerging urban data, such as volunteered geographical information (e.g., OpenStreetMap) and social sensing data (e.g., mobile phone and social media), have also shown great potential in revealing urban boundaries and dynamics. However, consistent and robust methods to quantify urban areas from these multi-source data have remained elusive. Here, we propose a percolation-based method to extract urban areas from these multi-source urban data. We derive the optimal urban/non-urban threshold by considering the critical nature of urban systems with the support of the percolation theory. Furthermore, we apply the method with three open-source datasets - population, road, and nighttime light - to 28 countries. We show that the proposed method captures the similar urban characteristics in terms of urban areas from multi-source data, and Zipf's law holds well in most countries. The accuracy of the derived urban areas by different datasets has been validated with the Landsat-based reference data in 10 cities, and the accuracy can be further improved through data fusion ($κ=0.69-0.85$, mean $κ=0.78$). Our study not only provides an efficient method to quantify urban areas with open-source data, but also deepens the understanding of urban systems and sheds some light on multi-source data fusion in geographical fields.

preprint2020arXiv

The spectral dimension of human mobility

Human mobility patterns are surprisingly structured. In spite of many hard to model factors, such as climate, culture, and socioeconomic opportunities, aggregate migration rates obey a universal, parameter-free, `radiation' model. Recent work has further shown that the detailed spectral decomposition of these flows -- defined as the number of individuals that visit a given location with frequency $f$ from a distance $r$ away -- also obeys simple rules, namely, scaling as a universal inverse square law in the combination, $rf$. However, this surprising regularity, derived on general grounds, has not been explained through microscopic mechanisms of individual behavior. Here we confirm this by analyzing large-scale cell phone datasets from three distinct regions and show that a direct consequence of this scaling law is that the average `travel energy' spent by visitors to a given location is constant across space, a finding reminiscent of the well-known travel budget hypothesis of human movement. The attractivity of different locations, which we define by the total number of visits to that location, also admits non-trivial, spatially-clustered structure. The observed pattern is consistent with the well-known central place theory in urban geography, as well as with the notion of Weber optimality in spatial economy, hinting to a collective human capacity of optimizing recurrent movements. We close by proposing a simple, microscopic human mobility model which simultaneously captures all our empirical findings. Our results have relevance for transportation, urban planning, geography, and other disciplines in which a deeper understanding of aggregate human mobility is key.

preprint2020arXiv

Understanding the mesoscopic scaling patterns within cities

Understanding quantitative relationships between urban elements is crucial for a wide range of applications. The observation at the macroscopic level demonstrates that the aggregated urban quantities (e.g., gross domestic product) scale systematically with population sizes across cities, also known as urban scaling laws. However, at the mesoscopic level, we lack an understanding of whether the simple scaling relationship holds within cities, which is a fundamental question regarding the spatial origin of scaling in urban systems. Here, by analyzing four extensive datasets covering millions of mobile phone users and urban facilities, we investigate the scaling phenomena within cities. We find that the mesoscopic infrastructure volume and socioeconomic activity scale sub- and super-linearly with the active population, respectively. For a same scaling phenomenon, however, the exponents vary in cities of similar population sizes. To explain these empirical observations, we propose a conceptual framework by considering the heterogeneous distributions of population and facilities, and the spatial interactions between them. Analytical and numerical results suggest that, despite the large number of complexities that influence urban activities, the simple interaction rules can effectively explain the observed regularity and heterogeneity in scaling behaviors within cities.

preprint2016arXiv

Measuring Economic Activities of China with Mobile Big Data

Emerging trends in smartphones, online maps, social media, and the resulting geo-located data, provide opportunities to collect traces of people's socio-economical activities in a much more granular and direct fashion, triggering a revolution in empirical research. These vast mobile data offer new perspectives and approaches for measurements of economic dynamics and are broadening the research fields of social science and economics. In this paper, we explore the potential of using mobile big data for measuring economic activities of China. Firstly, We build indices for gauging employment and consumer trends based on billions of geo-positioning data. Secondly, we advance the estimation of store offline foot traffic via location search data derived from Baidu Maps, which is then applied to predict revenues of Apple in China and detect box-office fraud accurately. Thirdly, we construct consumption indicators to track the trends of various industries in service sector, which are verified by several existing indicators. To the best of our knowledge, we are the first to measure the second largest economy by mining such unprecedentedly large scale and fine granular spatial-temporal data. Our research provides new approaches and insights on measuring economic activities.

preprint2015arXiv

Three-Dimensional Projective Geometry with Geometric Algebra

The line geometric model of 3-D projective geometry has the nice property that the Lie algebra sl(4) of 3-D projective transformations is isomorphic to the bivector algebra of CL(3,3), and line geometry is closely related to the classical screw theory for 3-D rigid-body motions. The canonical homomorphism from SL(4) to Spin(3,3) is not satisfying because it is not surjective, and the projective transformations of negative determinant do induce orthogonal transformations in the Plücker coordinate space of lines. This paper presents our contributions in developing a rigorous and convenient algebraic framework for the study of 3-D projective geometry with Clifford algebra. To overcome the unsatisfying defects of the Plücker correspondence, we propose a group Pin^{sp}(3,3) with Pin(3,3) as its normal subgroup, to quadruple-cover the group of 3-D projective transformations and polarities. We construct spinors in factored form that generate 3-D reflections and rigid-body motions, and extend screw algebra from the Lie algebra of rigid-body motions to other 6-D Lie subalgebras of sl(4), and construct the corresponding cross products and virtual works.

preprint2010arXiv

Extending Electrical Resistivity Measurements in Micro-scratching of Silicon to Determine Thermal Conductivity of the Metallic Phase Si-II

Ductile Regime Machining of semiconductors (DRM) offers higher quality of the resulting surfaces. To optimize this process, it is required to understand the thermal kinetics of silicon metallization under pressure. Such understanding is not yet possible since the metallic phases of silicon aren't readily amenable to thermal characterization through direct measurements. This being the case, one has to rely on processing indirect measurement data to deduce refined estimates of the thermal transport properties of pressurized Silicon. A feasible measurement of this sort is the electrical resistivity, since its' variation during indentation and consider is often an indicator of the formation of the metallic silicon phase Si-II under the indenter. This paper, therefore, describes a procedure by which the average thermal conductivity of the metallic phase of silicon, Si-II is extracted from electrical resistivity measurements taken in real time. The procedure is based on teaming a temperature evaluation code to the resistivity measurements thus allowing the extraction of the conductivity as a function of temperature.

Lei Dong

What is connected

Connect this record

See the researcher in context

Building this map preview

12 published item(s)

A Geometric Analysis of Sign-Magnitude Asymmetry in a ReLU + RMSNorm Block under Ternary Quantization

Mapping evolving population geography in China

MetroGAN: Simulating Urban Morphology with Generative Adversarial Network

Quantifying navigation complexity in transportation networks

The universality in urban commuting across and within cities

A gridded establishment dataset as a proxy for economic activity in China

Quantifying urban areas with multi-source data based on percolation theory

The spectral dimension of human mobility

Understanding the mesoscopic scaling patterns within cities

Measuring Economic Activities of China with Mobile Big Data

Three-Dimensional Projective Geometry with Geometric Algebra

Extending Electrical Resistivity Measurements in Micro-scratching of Silicon to Determine Thermal Conductivity of the Metallic Phase Si-II