Source author record

Ge Zhang

Ge Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Catalog footprint

What is connected

27 works

20 topics

4 close collaborators


preprint, 2026, arXiv

Adaptive Dual-Weighted Gravitational Point Cloud Denoising Method

High-quality point cloud data is a critical foundation for tasks such as autonomous driving and 3D reconstruction. However, LiDAR-based point cloud acquisition is often affected by various disturbances, resulting in a large number of noise points that degrade the accuracy of subsequent point cloud object detection and recognition. Moreover, existing point cloud denoising methods typically sacrifice computational efficiency in pursuit of higher denoising accuracy, or, conversely, improve processing speed at the expense of preserving object boundaries and fine structural details, making it difficult to simultaneously achieve high denoising accuracy, strong edge preservation, and real-time performance. To address these limitations, this paper proposes an adaptive dual-weighted gravitational point cloud denoising method. First, an octree is employed to perform spatial partitioning of the global point cloud, enabling parallel acceleration. Then, within each leaf node, adaptive voxel-based occupancy statistics and k-nearest neighbor (kNN) density estimation are applied to rapidly remove clearly isolated and low-density noise points, thereby reducing the effective candidate set. Finally, a gravitational scoring function that combines density weights with adaptive distance weights is constructed to finely distinguish noise points from object points. Experiments conducted on the Stanford 3D Scanning Repository, the Canadian Adverse Driving Conditions (CADC) dataset, and in-house RUBY PLUS LiDAR point clouds acquired in our laboratory demonstrate that, compared with existing methods, the proposed approach achieves consistent improvements in F1, PSNR, and Chamfer Distance (CD) across various noise conditions while reducing the single-frame processing time, thereby validating its high accuracy, robustness, and real-time performance in multi-noise scenarios.
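The kNN density step mentioned in this abstract can be illustrated with a small sketch. This is not the paper's method: the function name `knn_density_filter`, the parameters `k` and `ratio`, the median-based threshold, and the synthetic cloud are all assumptions chosen for illustration, and the octree partitioning, voxel statistics, and gravitational scoring are left out.

```python
import numpy as np

def knn_density_filter(points, k=4, ratio=2.0):
    """Return a boolean mask that is True for points kept as 'dense'.

    Hypothetical rule: a point is kept when its mean distance to its k
    nearest neighbours is at most `ratio` times the median of that value
    over the whole cloud.
    """
    pts = np.asarray(points, dtype=float)
    # Pairwise Euclidean distances (O(n^2) memory; fine for a small example).
    diff = pts[:, None, :] - pts[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    # Sort each row; column 0 is the zero distance of a point to itself.
    knn = np.sort(dist, axis=1)[:, 1:k + 1]
    mean_knn = knn.mean(axis=1)
    threshold = ratio * np.median(mean_knn)
    return mean_knn <= threshold

# Synthetic data: one tight cluster plus two far-away points.
rng = np.random.default_rng(0)
cluster = rng.normal(0.0, 0.05, size=(200, 3))
outliers = np.array([[5.0, 5.0, 5.0], [-4.0, 3.0, 6.0]])
cloud = np.vstack([cluster, outliers])
mask = knn_density_filter(cloud)
```

Here `mask` flags the two isolated points for removal while keeping most of the cluster. A brute-force distance matrix is used only to keep the sketch dependency-free; a spatial index would be the usual choice for real LiDAR frames.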

preprint, 2026, arXiv

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Large Language Models (LLMs) apply uniform computation to all tokens, despite language exhibiting highly non-uniform information density. This token-uniform regime wastes capacity on locally predictable spans while under-allocating computation to semantically critical transitions. We propose Dynamic Large Concept Models (DLCM), a hierarchical language modeling framework that learns semantic boundaries from latent representations and shifts computation from tokens to a compressed concept space where reasoning is more efficient. DLCM discovers variable-length concepts end-to-end without relying on predefined linguistic units. Hierarchical compression fundamentally changes scaling behavior. We introduce the first compression-aware scaling law, which disentangles token-level capacity, concept-level reasoning capacity, and compression ratio, enabling principled compute allocation under fixed FLOPs. To stably train this heterogeneous architecture, we further develop a decoupled μP parametrization that supports zero-shot hyperparameter transfer across widths and compression regimes. At a practical setting ($R=4$, corresponding to an average of four tokens per concept), DLCM reallocates roughly one-third of inference compute into a higher-capacity reasoning backbone, achieving a +2.69% average improvement across 12 zero-shot benchmarks under matched inference FLOPs.

preprint, 2026, arXiv

Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements

Benchmarks play a crucial role in tracking the rapid advancement of large language models (LLMs) and identifying their capability boundaries. However, existing benchmarks predominantly curate questions at the question level, suffering from three fundamental limitations: vulnerability to data contamination, restriction to single-knowledge-point assessment, and reliance on costly domain expert annotation. We propose Encyclo-K, a statement-based benchmark that rethinks benchmark construction from the ground up. Our key insight is that knowledge statements, not questions, can serve as the unit of curation, and questions can then be constructed from them. We extract standalone knowledge statements from authoritative textbooks and dynamically compose them into evaluation questions through random sampling at test time. This design directly addresses all three limitations: the combinatorial space is too vast to memorize, and model rankings remain stable across dynamically generated question sets, enabling reliable periodic dataset refresh; each question aggregates 8-10 statements for comprehensive multi-knowledge assessment; annotators only verify formatting compliance without requiring domain expertise, substantially reducing annotation costs. Experiments on over 50 LLMs demonstrate that Encyclo-K poses substantial challenges with strong discriminative power. Even the top-performing OpenAI-GPT-5.1 achieves only 62.07% accuracy, and model performance displays a clear gradient distribution--reasoning models span from 16.04% to 62.07%, while chat models range from 9.71% to 50.40%. These results validate the challenges introduced by dynamic evaluation and multi-statement comprehensive understanding. These findings establish Encyclo-K as a scalable framework for dynamic evaluation of LLMs' comprehensive understanding over multiple fine-grained disciplinary knowledge statements.
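The idea of composing questions from a statement pool at test time can be sketched as follows. This is a hypothetical illustration, not the benchmark's code: the function `compose_question`, the (text, is_true) statement format, the "which are correct" prompt wording, and the toy pool are all assumptions. Only the sampling of 8 to 10 statements per question comes from the abstract.

```python
import random

def compose_question(statements, n_statements=8, seed=None):
    """Sample statements and build one multi-statement question.

    `statements` is a list of (text, is_true) pairs (assumed format).
    Returns the prompt text and the list of correct statement numbers.
    """
    rng = random.Random(seed)
    # Sampling without replacement; raises ValueError if the pool is too small.
    chosen = rng.sample(statements, n_statements)
    lines = ["Which of the following statements are correct?"]
    answer = []
    for number, (text, is_true) in enumerate(chosen, start=1):
        lines.append(f"{number}. {text}")
        if is_true:
            answer.append(number)
    return "\n".join(lines), answer

# Toy pool: even-numbered statements are marked true.
pool = [(f"Statement {n}", n % 2 == 0) for n in range(50)]
prompt, answer = compose_question(pool, n_statements=8, seed=1)
```

With a pool of 50 statements there are already more than 500 million distinct 8-statement subsets, which gives a feel for why the abstract describes the combinatorial space as too large to memorize.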

preprint, 2026, arXiv

FutureX-Pro: Extending Future Prediction to High-Value Vertical Domains

Building upon FutureX, which established a live benchmark for general-purpose future prediction, this report introduces FutureX-Pro, including FutureX-Finance, FutureX-Retail, FutureX-PublicHealth, FutureX-NaturalDisaster, and FutureX-Search. These together form a specialized framework extending agentic future prediction to high-value vertical domains. While generalist agents demonstrate proficiency in open-domain search, their reliability in capital-intensive and safety-critical sectors remains under-explored. FutureX-Pro targets four economically and socially pivotal verticals: Finance, Retail, Public Health, and Natural Disaster. We benchmark agentic Large Language Models (LLMs) on entry-level yet foundational prediction tasks -- ranging from forecasting market indicators and supply chain demands to tracking epidemic trends and natural disasters. By adapting the contamination-free, live-evaluation pipeline of FutureX, we assess whether current State-of-the-Art (SOTA) agentic LLMs possess the domain grounding necessary for industrial deployment. Our findings reveal the performance gap between generalist reasoning and the precision required for high-value vertical applications.

preprint, 2026, arXiv

Is Grokking a Computational Glass Relaxation?

Understanding neural networks' (NNs') generalizability remains a central question in deep learning research. The special phenomenon of grokking, where NNs abruptly generalize long after the training performance reaches a near-perfect level, offers a unique window to investigate the underlying mechanisms of NNs' generalizability. Here we propose an interpretation for grokking by framing it as a computational glass relaxation: viewing NNs as a physical system where parameters are the degrees of freedom and train loss is the system energy, we find that the memorization process resembles a rapid cooling of a liquid into a non-equilibrium glassy state at low temperature, and the later generalization is like a slow relaxation towards a more stable configuration. This mapping enables us to sample NNs' Boltzmann entropy (density of states) landscape as a function of training loss and test accuracy. Our experiments in transformers on arithmetic tasks suggest that there is NO entropy barrier in the memorization-to-generalization transition of grokking, challenging previous theory that defines grokking as a first-order phase transition. We identify a high-entropy advantage under grokking, an extension of prior work linking entropy to generalizability but much more significant. Inspired by grokking's far-from-equilibrium nature, we develop a toy optimizer WanD based on Wang-Landau molecular dynamics, which can eliminate grokking without any constraints and find high-norm generalizing solutions. This provides strictly-defined counterexamples to theory attributing grokking solely to weight norm evolution towards the Goldilocks zone and also suggests new potential ways for optimizer design.

preprint, 2026, arXiv

LPFQA: A Long-Tail Professional Forum-based Benchmark for LLM Evaluation

Large Language Models (LLMs) perform well on standard reasoning and question-answering benchmarks, yet such evaluations often fail to capture their ability to handle long-tail, expertise-intensive knowledge in real-world professional scenarios. We introduce LPFQA, a long-tail knowledge benchmark derived from authentic professional forum discussions, covering 7 academic and industrial domains with 430 curated tasks grounded in practical expertise. LPFQA evaluates specialized reasoning, domain-specific terminology understanding, and contextual interpretation, and adopts a hierarchical difficulty structure to ensure semantic clarity and uniquely identifiable answers. Experiments on multiple mainstream LLMs reveal substantial performance gaps, particularly on tasks requiring deep domain reasoning, exposing limitations overlooked by existing benchmarks. Overall, LPFQA provides an authentic and discriminative evaluation framework that complements prior benchmarks and informs future LLM development.

preprint, 2026, arXiv

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Recent advances in coding agents suggest rapid progress toward autonomous software development, yet existing benchmarks fail to rigorously evaluate the long-horizon capabilities required to build complete software systems. Most prior evaluations focus on localized code generation, scaffolded completion, or short-term repair tasks, leaving open the question of whether agents can sustain coherent reasoning, planning, and execution over the extended horizons demanded by real-world repository construction. To address this gap, we present NL2Repo-Bench, a benchmark explicitly designed to evaluate the long-horizon repository generation ability of coding agents. Given only a single natural-language requirements document and an empty workspace, agents must autonomously design the architecture, manage dependencies, implement multi-module logic, and produce a fully installable Python library. Our experiments across state-of-the-art open- and closed-source models reveal that long-horizon repository generation remains largely unsolved: even the strongest agents achieve below 40% average test pass rates and rarely complete an entire repository correctly. Detailed analysis uncovers fundamental long-horizon failure modes, including premature termination, loss of global coherence, fragile cross-file dependencies, and inadequate planning over hundreds of interaction steps. NL2Repo-Bench establishes a rigorous, verifiable testbed for measuring sustained agentic competence and highlights long-horizon reasoning as a central bottleneck for the next generation of autonomous coding agents.

preprint, 2026, arXiv

Structure factor realizability reveals the glass-dynamics onset temperature

When liquids are cooled rapidly, they bypass crystallization and instead enter a supercooled state and then a glass state. Previous studies have shown that the static structure factors of high-temperature liquids, supercooled liquids, and glasses exhibit only subtle differences, leading to the conclusion that the glass transition cannot be predicted solely from structure factor changes. Our research challenges this limitation. Specifically, we studied the difficulty of generating configurations corresponding to target structure factors using stochastic gradient descent optimizations. While such optimizations easily converge when targeting the structure factors of higher temperature liquids, the difficulty significantly increases for lower temperature liquids and glasses. By quantifying this difficulty through the mean squared error achieved, we found a kink at the onset temperature of glass dynamics. Our results suggest that the onset of glass dynamics can be explained by a topological change in the realizability landscape of the structure factor, even though the structure factor itself only changes smoothly. Our results are currently based on computer simulations using original and modified Dzugutov interactions, and future work will determine whether our theory is applicable to other glass-forming systems.
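The quantity used as a difficulty measure here, the mean squared error between an achieved and a target structure factor, can be sketched as follows. This is an assumption-laden illustration rather than the paper's procedure: it uses the standard definition $S(\mathbf{k}) = |\sum_j e^{-i\mathbf{k}\cdot\mathbf{r}_j}|^2 / N$, a one-dimensional periodic toy system, and the hypothetical helper names `structure_factor` and `structure_factor_mse`. The stochastic gradient descent optimization itself is omitted.

```python
import numpy as np

def structure_factor(positions, k_vectors):
    """S(k) = |sum_j exp(-i k.r_j)|^2 / N for each wave vector k."""
    r = np.asarray(positions, dtype=float)   # shape (N, d)
    k = np.asarray(k_vectors, dtype=float)   # shape (n_k, d)
    phase = k @ r.T                          # k.r_j for every (k, j) pair
    rho_k = np.exp(-1j * phase).sum(axis=1)  # collective density variables
    return np.abs(rho_k) ** 2 / len(r)

def structure_factor_mse(positions, k_vectors, target):
    """Mean squared error between the configuration's S(k) and a target."""
    s = structure_factor(positions, k_vectors)
    return float(np.mean((s - np.asarray(target, dtype=float)) ** 2))

# Toy 1D system: n particles on a unit-spacing lattice in a box of length n.
n = 16
lattice = np.arange(n, dtype=float)[:, None]
# Wave vectors compatible with the periodic box: k = 2*pi*m/n, m = 1..n.
k_vectors = (2.0 * np.pi * np.arange(1, n + 1) / n)[:, None]
s_lattice = structure_factor(lattice, k_vectors)

# A random configuration scored against the lattice structure factor.
rng = np.random.default_rng(0)
random_config = rng.uniform(0.0, n, size=(n, 1))
mse = structure_factor_mse(random_config, k_vectors, s_lattice)
```

For the lattice, $S(k)$ vanishes at every allowed wave vector except the Bragg peak at $k = 2\pi$, where it equals $N$. An optimizer of the kind the abstract describes would adjust particle positions to drive such an error toward zero.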

preprint, 2026, arXiv

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Large language models (LLMs) often fail to learn effective long chain-of-thought (Long CoT) reasoning by imitating humans or non-Long-CoT LLMs. To understand this, we propose that effective and learnable Long CoT trajectories feature stable molecular-like structures in a unified view, which are formed by three interaction types: Deep-Reasoning (covalent-like), Self-Reflection (hydrogen-bond-like), and Self-Exploration (van der Waals-like). Analysis of distilled trajectories reveals these structures emerge from Long CoT fine-tuning, not keyword imitation. We introduce Effective Semantic Isomers and show that only bonds promoting fast entropy convergence support stable Long CoT learning, while structural competition impairs training. Drawing on these findings, we present Mole-Syn, a distribution-transfer-graph method that guides synthesis of effective Long CoT structures, boosting performance and RL stability across benchmarks.

preprint, 2025, arXiv

OmniBench: Towards The Future of Universal Omni-Language Models

Recent advancements in multimodal large language models (MLLMs) have aimed to integrate and interpret data across diverse modalities. However, the capacity of these models to concurrently process and reason about multiple modalities remains underexplored, partly due to the lack of comprehensive modality-wise benchmarks. We introduce OmniBench, a novel benchmark designed to rigorously evaluate models' ability to recognize, interpret, and reason across visual, acoustic, and textual inputs simultaneously. We define language models capable of such tri-modal processing as omni-language models (OLMs). OmniBench is distinguished by high-quality human annotations, ensuring that accurate responses require integrated understanding and reasoning across all three modalities. Our main findings reveal that: i) open-source OLMs exhibit critical limitations in instruction-following and reasoning capabilities within tri-modal contexts; and ii) most baseline models perform poorly (below 50% accuracy) even when provided with alternative textual representations of images and/or audio. These results suggest that the ability to construct a consistent context from text, image, and audio is often overlooked in existing MLLM training paradigms. To address this gap, we curate an instruction tuning dataset of 84.5K training samples, OmniInstruct, for training OLMs to adapt to tri-modal contexts. We advocate for future research to focus on developing more robust tri-modal integration techniques and training strategies to enhance OLMs. Code, data, and a live leaderboard can be found at https://m-a-p.ai/OmniBench.

preprint, 2023, arXiv

CORGI-PM: A Chinese Corpus For Gender Bias Probing and Mitigation

As natural language processing (NLP) for gender bias becomes a significant interdisciplinary topic, the prevalent data-driven techniques such as large-scale language models suffer from data inadequacy and biased corpora, especially for languages with insufficient resources such as Chinese. To this end, we propose a Chinese cOrpus foR Gender bIas Probing and Mitigation (CORGI-PM), which contains 32.9k sentences with high-quality labels derived by following an annotation scheme specifically developed for gender bias in the Chinese context. Moreover, we address three challenges for automatic textual gender bias mitigation, which requires the models to detect, classify, and mitigate textual gender bias. We also conduct experiments with state-of-the-art language models to provide baselines. To the best of our knowledge, CORGI-PM is the first sentence-level Chinese corpus for gender bias probing and mitigation.

preprint, 2023, arXiv

Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits

We present Second Thought, a new learning paradigm that enables language models (LMs) to re-align with human values. By modeling the chain-of-edits between value-unaligned and value-aligned text, with LM fine-tuning and additional refinement through reinforcement learning, Second Thought not only achieves superior performance in three value alignment benchmark datasets but also shows strong human-value transfer learning ability in few-shot scenarios. The generated editing steps also offer better interpretability and ease for interactive error correction. Extensive human evaluations further confirm its effectiveness.

preprint, 2022, arXiv

1Cademy at Semeval-2022 Task 1: Investigating the Effectiveness of Multilingual, Multitask, and Language-Agnostic Tricks for the Reverse Dictionary Task

This paper describes our system for the SemEval-2022 task of matching dictionary glosses to word embeddings. We focus on the Reverse Dictionary Track of the competition, which maps multilingual glosses to reconstructed vector representations. More specifically, models convert input sentences to three types of embeddings: SGNS, Char, and Electra. We propose several experiments for applying neural network cells, general multilingual and multitask structures, and language-agnostic tricks to the task. We also provide comparisons over different types of word embeddings and ablation studies to suggest helpful strategies. Our initial transformer-based model achieves relatively low performance. However, trials on different retokenization methodologies indicate improved performance. Our proposed ELMo-based monolingual model achieves the best results, and its multitask and multilingual variants show competitive results as well.

preprint, 2022, arXiv

Correlation between entropy and generalizability in a neural network

Although neural networks can solve very complex machine-learning problems, the theoretical reason for their generalizability is still not fully understood. Here we use the Wang-Landau Monte Carlo algorithm to calculate the entropy (logarithm of the volume of a part of the parameter space) at a given test accuracy, and a given training loss function value or training accuracy. Our results show that entropic forces help generalizability. Although our study is on a very simple application of neural networks (a spiral dataset and a small, fully-connected neural network), our approach should be useful in explaining the generalizability of more complicated neural networks in future works.

preprint, 2022, arXiv

Graph-level Neural Networks: Current Progress and Future Directions

Graph-structured data consisting of objects (i.e., nodes) and relationships among objects (i.e., edges) are ubiquitous. Graph-level learning studies a collection of graphs instead of a single graph. Traditional graph-level learning methods used to be the mainstream. However, with the increasing scale and complexity of graphs, Graph-level Neural Networks (GLNNs, deep learning-based graph-level learning methods) have become attractive due to their superiority in modeling high-dimensional data. Thus, a survey on GLNNs is necessary. To frame this survey, we propose a systematic taxonomy covering GLNNs built upon deep neural networks, graph neural networks, and graph pooling. This survey focuses on the representative and state-of-the-art models in each category. We also investigate the reproducibility, benchmarks, and new graph datasets of GLNNs. Finally, we outline future directions to further advance GLNNs. The repository of this survey is available at https://github.com/GeZhangMQ/Awesome-Graph-level-Neural-Networks.

preprint, 2022, arXiv

Structuro-elasto-plasticity (StEP) model for plasticity in disordered solids

Elastoplastic lattice models for the response of solids to deformation typically incorporate structure only implicitly via a local yield strain that is assigned to each site. However, the local yield strain can change in response to a nearby or even distant plastic event in the system. This interplay is key to understanding phenomena such as avalanches in which one plastic event can trigger another, leading to a cascade of events, but typically is neglected in elastoplastic models. To include the interplay one could calculate the local yield strain for a given particulate system and follow its evolution, but this is expensive and requires knowledge of particle interactions, which is often hard to extract from experiments. Instead, we introduce a structural quantity, "softness," obtained using machine learning to correlate with imminent plastic rearrangements. We show that softness also correlates with local yield strain. We incorporate softness to construct a "structuro-elasto-plasticity" model that reproduces particle simulation results quantitatively for several observable quantities, confirming that we capture the influence of the interplay of local structure, plasticity, and elasticity on material response.

preprint, 2021, arXiv

Delineation of the flow and mixing induced by Rayleigh-Taylor instability through tracers

Rayleigh-Taylor instability (RTI) induced flow and mixing are of great importance in both natural and engineering scenarios. To capture the underlying physics, tracers are introduced to supplement the discrete Boltzmann simulation of RTI in compressible flows. By marking two types of tracers with different colors, the tracer distribution provides a clear boundary between the two fluids during the RTI evolution. Fine structures of the flow and thermodynamic nonequilibrium behavior around the interface in a miscible two-fluid system are delineated. The distribution of tracers in velocity phase space forms a pattern that carries dense information on the flow behavior, which opens a new perspective for analyzing and gaining deeper insight into the flow system. RTI mixing is further investigated via a tracer-defined local mixedness. The appearance of Kelvin-Helmholtz instability is quantitatively captured by the mixedness averaged along the direction of the pressure gradient. The roles of compressibility and viscosity in mixing are investigated separately; both show a two-stage effect. The underlying mechanism of the two-stage effect is interpreted as the development of large structures at the initial stage and the generation of small structures at the late stage. At the late stage, for a fixed time, a saturation phenomenon of viscosity is found: a further increase of viscosity does not produce an evident decline in mixedness. The mixing states of the heavy and light fluids are not synchronous, and the mixing of an RTI system is heterogeneous. The results are helpful for understanding the mechanism of flow and mixing induced by RTI.

preprint, 2021, arXiv

Spectral Variability Augmented Sparse Unmixing of Hyperspectral Images

Spectral unmixing (SU) expresses the mixed pixels in hyperspectral images as the product of endmembers and abundances, and has been widely used in hyperspectral imagery analysis. However, the influence of light, acquisition conditions, and the inherent properties of materials means that the identified endmembers can vary spectrally within a given image (construed as spectral variability). To address this issue, recent methods usually use an a priori spectral library to represent multiple characteristic spectra of the same object, but few of them extract the spectral variability explicitly. In this paper, a spectral variability augmented sparse unmixing model (SVASU) is proposed, in which the spectral variability is extracted for the first time. The variable spectra are divided into two parts, intrinsic spectrum and spectral variability, for spectral reconstruction, and are modeled simultaneously in the SU model with added regularization terms restricting the sparsity of abundance and the generalization of the variability coefficient. Notably, the spectral variability library and the intrinsic spectral library are both constructed from the in-situ observed image. Experimental results on both synthetic and real-world data sets demonstrate that the decomposition augmented by spectral variability significantly improves unmixing performance over decomposition by spectral library alone, and over state-of-the-art algorithms.

preprint, 2020, arXiv

CORAL: COde RepresentAtion Learning with Weakly-Supervised Transformers for Analyzing Data Analysis

Large scale analysis of source code, and in particular scientific source code, holds the promise of better understanding the data science process, identifying analytical best practices, and providing insights to the builders of scientific toolkits. However, large corpora have remained unanalyzed in depth, as descriptive labels are absent and require expert domain knowledge to generate. We propose a novel weakly supervised transformer-based architecture for computing joint representations of code from both abstract syntax trees and surrounding natural language comments. We then evaluate the model on a new classification task for labeling computational notebook cells as stages in the data analysis process from data import to wrangling, exploration, modeling, and evaluation. We show that our model, leveraging only easily-available weak supervision, achieves a 38% increase in accuracy over expert-supplied heuristics and outperforms a suite of baselines. Our model enables us to examine a set of 118,000 Jupyter Notebooks to uncover common data analysis patterns. Focusing on notebooks with relationships to academic articles, we conduct the largest ever study of scientific code and find that notebook composition correlates with the citation count of corresponding papers.

preprint, 2020, arXiv

Realizable hyperuniform and nonhyperuniform particle configurations with targeted spectral functions via effective pair interactions

The capacity to identify realizable many-body configurations associated with targeted functional forms for the pair correlation function $g_2(r)$ or its corresponding structure factor $S(k)$ is of great fundamental and practical importance. A major aim of this paper is to expand our theoretical knowledge of the class of pair correlation functions or structure factors that are realizable by classical disordered ensembles of particle configurations. We first introduce a theoretical formalism that provides a means to draw classical particle configurations from canonical ensembles with certain pairwise-additive potentials that could correspond to targeted analytical functional forms for the structure factor. This formulation enables us to devise an improved algorithm to construct systematically canonical-ensemble particle configurations with such targeted pair statistics. We test the algorithm by targeting several different structure factors across dimensions that are known to be realizable and one hyperuniform target that is known to be nontrivially unrealizable. Subsequently, we also target several families of structure-factor functions that meet the known necessary realizability conditions but were heretofore not known to be realizable. Moreover, we also explore unusual nonhyperuniform targets, including "hyposurficial" and "anti-hyperuniform" examples. In all of these instances, the targeted structure factors were achieved with high accuracy, suggesting that they are indeed realizable by equilibrium configurations with pairwise interactions at positive temperatures. Our findings lead us to the conjecture that any realizable structure factor corresponding to either a translationally invariant equilibrium or nonequilibrium system can be attained by an equilibrium ensemble involving only effective pair interactions.

preprint, 2020, arXiv

Self-Supervised Joint Learning Framework of Depth Estimation via Implicit Cues

In self-supervised monocular depth estimation, depth discontinuities and artifacts from moving objects remain challenging problems. Existing self-supervised methods usually utilize a single view to train the depth estimation network. Compared with static views, abundant dynamic properties between video frames are beneficial to refined depth estimation, especially for dynamic objects. In this work, we propose a novel self-supervised joint learning framework for depth estimation using consecutive frames from monocular and stereo videos. The main idea is to use an implicit depth cue extractor that leverages dynamic and static cues to generate useful depth proposals. These cues can predict distinguishable motion contours and geometric scene structures. Furthermore, a new high-dimensional attention module is introduced to extract a clear global transformation, which effectively suppresses the uncertainty of local descriptors in high-dimensional space, resulting in more reliable optimization in the learning framework. Experiments demonstrate that the proposed framework outperforms the state of the art (SOTA) on the KITTI and Make3D datasets.

preprint, 2020, arXiv

Stone-Wales Defects Preserve Hyperuniformity in Amorphous Two-Dimensional Materials

Crystalline two-dimensional (2D) materials such as graphene possess unique physical properties absent in their bulk form, enabling many novel device applications. Yet, little is known about their amorphous counterparts, which can be obtained by introducing the Stone-Wales (SW) topological defects via proton radiation. Here we provide strong numerical evidence that SW defects preserve hyperuniformity in hexagonal 2D materials, a recently discovered new state of matter characterized by vanishing normalized infinite-wavelength density fluctuations, which implies that all amorphous states of these materials are hyperuniform. Specifically, the static structure factor S(k) of these materials possesses the scaling S(k) ~ k^α for small wave number k, where 1<=α(p)<=2 is monotonically decreasing as the SW defect concentration p increases, indicating a transition from type-I to type-II hyperuniformity at p ~= 0.12 induced by the saturation of the SW defects. This hyperuniformity transition marks a structural transition from perturbed lattice structures to truly amorphous structures, and underlies the onset of strong correlation among the SW defects as well as a transition between distinct electronic transport mechanisms associated with different hyperuniformity classes.
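The small-wave-number scaling S(k) ~ k^α quoted in this abstract is commonly estimated with a straight-line fit in log-log space. The sketch below shows that generic technique on synthetic data. It is not taken from the paper: the function name `fit_scaling_exponent`, the chosen k range, and the test values (α = 1.5, prefactor 0.3) are illustrative assumptions.

```python
import numpy as np

def fit_scaling_exponent(k, s):
    """Fit S(k) = c * k**alpha by linear regression of log S on log k.

    Returns (alpha, c). Assumes all k and S values are strictly positive.
    """
    log_k = np.log(np.asarray(k, dtype=float))
    log_s = np.log(np.asarray(s, dtype=float))
    # polyfit with degree 1 returns [slope, intercept].
    slope, intercept = np.polyfit(log_k, log_s, 1)
    return float(slope), float(np.exp(intercept))

# Synthetic, noise-free data with a known exponent (not measured data).
k = np.linspace(0.05, 0.5, 20)
s = 0.3 * k ** 1.5
alpha, prefactor = fit_scaling_exponent(k, s)
```

On real structure-factor data the fit window has to be restricted to the smallest accessible wave numbers, and the estimate is sensitive to that choice.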

preprint, 2016, arXiv

Critical slowing down and hyperuniformity on approach to jamming

Hyperuniformity characterizes a state of matter that is poised at a critical point at which density or volume-fraction fluctuations are anomalously suppressed at infinite wavelengths. Recently, much attention has been given to the link between strict jamming and hyperuniformity in frictionless hard-particle packings. Doing so requires one to study very large packings, which can be difficult to jam properly. We modify the rigorous linear programming method of Donev et al. [J. Comp. Phys. 197, 139 (2004)] in order to test for jamming in putatively jammed packings of hard-disks in two dimensions. We find that various standard packing protocols struggle to reliably create packings that are jammed for even modest system sizes; importantly, these packings appear to be jammed by conventional tests. We present evidence that suggests that deviations from hyperuniformity in putative maximally random jammed (MRJ) packings can in part be explained by a shortcoming in generating exactly-jammed configurations due to a type of "critical slowing down" as the necessary rearrangements become difficult to realize by numerical protocols. Additionally, various protocols are able to produce packings exhibiting hyperuniformity to different extents, but this is because certain protocols are better able to approach exactly-jammed configurations. Nonetheless, while one should not generally expect exact hyperuniformity for disordered packings with rattlers, we find that when jamming is ensured, our packings are very nearly hyperuniform, and deviations from hyperuniformity correlate with an inability to ensure jamming, suggesting that strict jamming and hyperuniformity are indeed linked. This raises the possibility that the ideal MRJ packings have no rattlers. Our work provides the impetus for the development of packing algorithms that produce large disordered strictly jammed packings that are rattler-free.

preprint 2016 arXiv

Spatial modulation of a unitary impurity-induced resonances in superconducting CeCoIn$_{5}$

Motivated by recent experimental progress in high-resolution scanning tunneling microscopy (STM) techniques, we propose to investigate the local quasiparticle density of states around a unitary impurity in the heavy fermion superconductor CeCoIn$_{5}$. Based on the T-matrix approach, we obtain a sharp, nearly zero-energy resonance state (ZERS), localized around the impurity, for strong impurity potential scattering, and find qualitative differences in the spatial pattern of the tunneling conductance modulated by the nodal structure of the superconducting gap. These unique features may be used as a probe of the superconducting gap symmetry and, in combination with further STM measurements, may help to confirm the $d_{x^{2}-y^{2}}$ pairing in CeCoIn$_{5}$ at ambient pressure.

preprint 2015 arXiv

Ground states of stealthy hyperuniform potentials: I. Entropically favored configurations

Systems of particles interacting with "stealthy" pair potentials have been shown to possess infinitely degenerate disordered hyperuniform classical ground states with novel physical properties. Previous attempts to sample the infinitely degenerate ground states used energy minimization techniques, introducing algorithmic dependence that is artificial in nature. Recently, an ensemble theory of stealthy hyperuniform ground states was formulated to predict their structure and thermodynamics, and its predictions were shown to be in excellent agreement with corresponding computer simulation results in the canonical ensemble (in the zero-temperature limit). In this paper, we provide details and justifications of the simulation procedure, which involves performing molecular dynamics simulations at sufficiently low temperatures and minimizing the energy of the snapshots, both for the high-density disordered regime, where the theory applies, and for lower densities. We also use numerical simulations to extend our study to the lower-density regime. We report results for the pair correlation functions, structure factors, and Voronoi cell statistics. In the high-density regime, we verify the theoretical ansatz that stealthy disordered ground states behave like "pseudo" disordered equilibrium hard-sphere systems in Fourier space. These results show that as the density decreases from the high-density limit, the disordered ground states in the canonical ensemble are characterized by an increasing degree of short-range order and eventually the system undergoes a phase transition to crystalline ground states. We also provide numerical evidence suggesting that different forms of stealthy pair potentials produce the same ground-state ensemble in the zero-temperature limit. Our techniques may be applied to sample this limit of the canonical ensemble of other potentials with highly degenerate ground states.

preprint 2015 arXiv

Ground states of stealthy hyperuniform potentials. II. Stacked-slider phases

Stealthy potentials, a family of long-range isotropic pair potentials, produce infinitely degenerate disordered ground states at high densities and crystalline ground states at low densities in d-dimensional Euclidean space R^d. In the previous paper in this series, we numerically studied the entropically favored ground states in the canonical ensemble in the zero-temperature limit across the first three Euclidean space dimensions. In this paper, we use both numerical and theoretical techniques to investigate metastable stacked-slider phases, which are part of the ground-state manifold of stealthy potentials at densities in which crystal ground states are favored entropically. Our numerical results enable us to devise analytical models of this phase in two, three, and higher dimensions. Utilizing these models, we estimate the size of the feasible region in configuration space of the stacked-slider phase, finding it to be smaller than that of crystal structures in the infinite-system-size limit, which is consistent with our previous work. In two dimensions, we also determine exact expressions for the pair correlation function and structure factor of the analytical model of stacked-slider phases and analyze the connectedness of the ground-state manifold of stealthy potentials in this density regime. We demonstrate that stacked-slider phases are distinguishable states of matter; they are nonperiodic, statistically anisotropic structures that possess long-range orientational order but have zero shear modulus. We outline some possible future avenues of research to improve our understanding of this unusual phase of matter.

preprint 2015 arXiv

Inverse design of disordered stealthy hyperuniform spin chains

Positioned between crystalline solids and liquids, disordered many-particle systems which are stealthy and hyperuniform represent new states of matter that are endowed with novel physical and thermodynamic properties. Such stealthy and hyperuniform states are unique in that they are transparent to radiation for a range of wavenumbers around the origin. In this work, we employ recently developed inverse statistical-mechanical methods, which seek to obtain the optimal set of interactions that will spontaneously produce a targeted structure or configuration as a unique ground state, to investigate the spin-spin interaction potentials required to stabilize disordered stealthy hyperuniform one-dimensional (1D) Ising-like spin chains. By performing an exhaustive search over the spin configurations that can be enumerated on periodic 1D integer lattices containing $N=2,3,\ldots,36$ sites, we were able to identify and structurally characterize all stealthy hyperuniform spin chains in this range of system sizes. Within this pool of stealthy hyperuniform spin configurations, we then utilized such inverse optimization techniques to demonstrate that stealthy hyperuniform spin chains can be realized as either unique or degenerate disordered ground states of radial long-ranged (relative to the spin chain length) spin-spin interactions. Such exotic ground states are distinctly different from spin glasses in both their inherent structural properties and the nature of the spin-spin interactions required to stabilize them. As such, the implications and significance of the existence of such disordered stealthy hyperuniform ground state spin systems warrant further study, including whether their bulk physical properties and excited states, like their many-particle system counterparts, are singularly remarkable, and can be experimentally realized.
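
The exhaustive search mentioned in this abstract can be sketched for a very small chain. The Python snippet below is an illustration under stated assumptions, not the authors' procedure: it assumes spins σ_j = ±1 on a periodic chain of N sites, a structure factor S(k_m) = |sum_j σ_j exp(-i k_m j)|^2 / N at wavenumbers k_m = 2πm/N, and a stealthiness test that requires S(k_m) to vanish for m = 1, ..., m_max. The names `spin_structure_factor` and `stealthy_chains`, the tolerance, and the choices N = 8 and m_max = 1 are hypothetical.

```python
import cmath
import itertools
import math


def spin_structure_factor(spins, m):
    """S(k_m) for a periodic +/-1 spin chain, with k_m = 2*pi*m / N."""
    n = len(spins)
    k = 2 * math.pi * m / n
    amplitude = sum(s * cmath.exp(-1j * k * j) for j, s in enumerate(spins))
    return abs(amplitude) ** 2 / n


def stealthy_chains(n, m_max, tol=1e-9):
    """Enumerate all 2**n spin chains and keep those with S(k_m) below tol
    for every m in 1..m_max (the assumed stealthiness criterion)."""
    return [
        spins
        for spins in itertools.product((-1, 1), repeat=n)
        if all(spin_structure_factor(spins, m) < tol for m in range(1, m_max + 1))
    ]


# Brute-force search for a short chain; the cost grows as 2**n.
chains = stealthy_chains(n=8, m_max=1)
print(f"{len(chains)} of {2 ** 8} configurations pass the test")
```

This sketch neither separates disordered configurations from periodic ones nor removes configurations related by translation or a global spin flip, both of which a fuller treatment would need to handle. Direct enumeration is also only practical for small N, because the number of configurations doubles with each added site.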

27 published item(s)

Adaptive Dual-Weighted Gravitational Point Cloud Denoising Method

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements

FutureX-Pro: Extending Future Prediction to High-Value Vertical Domains

Is Grokking a Computational Glass Relaxation?

LPFQA: A Long-Tail Professional Forum-based Benchmark for LLM Evaluation

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Structure factor realizability reveals the glass-dynamics onset temperature

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

OmniBench: Towards The Future of Universal Omni-Language Models

CORGI-PM: A Chinese Corpus For Gender Bias Probing and Mitigation

Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits

1Cademy at Semeval-2022 Task 1: Investigating the Effectiveness of Multilingual, Multitask, and Language-Agnostic Tricks for the Reverse Dictionary Task

Correlation between entropy and generalizability in a neural network

Graph-level Neural Networks: Current Progress and Future Directions

Structuro-elasto-plasticity (StEP) model for plasticity in disordered solids

Delineation of the flow and mixing induced by Rayleigh-Taylor instability through tracers

Spectral Variability Augmented Sparse Unmixing of Hyperspectral Images

CORAL: COde RepresentAtion Learning with Weakly-Supervised Transformers for Analyzing Data Analysis

Realizable hyperuniform and nonhyperuniform particle configurations with targeted spectral functions via effective pair interactions

Self-Supervised Joint Learning Framework of Depth Estimation via Implicit Cues

Stone-Wales Defects Preserve Hyperuniformity in Amorphous Two-Dimensional Materials

Critical slowing down and hyperuniformity on approach to jamming

Spatial modulation of a unitary impurity-induced resonances in superconducting CeCoIn$_{5}$

Ground states of stealthy hyperuniform potentials: I. Entropically favored configurations

Ground states of stealthy hyperuniform potentials. II. Stacked-slider phases

Inverse design of disordered stealthy hyperuniform spin chains