Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
23works
0followers
19topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

23 published item(s)

preprint2026arXiv

Deploy-Master: Automating the Deployment of 50,000+ Agent-Ready Scientific Tools in One Day

Open-source scientific software is abundant, yet most tools remain difficult to compile, configure, and reuse, sustaining a small-workshop mode of scientific computing. This deployment bottleneck limits reproducibility, large-scale evaluation, and the practical integration of scientific tools into modern AI-for-Science (AI4S) and agentic workflows. We present Deploy-Master, a one-stop agentic workflow for large-scale tool discovery, build specification inference, execution-based validation, and publication. Guided by a taxonomy spanning 90+ scientific and engineering domains, our discovery stage starts from a recall-oriented pool of over 500,000 public repositories and progressively filters it to 52,550 executable tool candidates under license- and quality-aware criteria. Deploy-Master transforms heterogeneous open-source repositories into runnable, containerized capabilities grounded in execution rather than documentation claims. In a single day, we performed 52,550 build attempts and constructed reproducible runtime environments for 50,112 scientific tools. Each successful tool is validated by a minimal executable command and registered in SciencePedia for search and reuse, enabling direct human use and optional agent-based invocation. Beyond delivering runnable tools, we report a deployment trace at the scale of 50,000 tools, characterizing throughput, cost profiles, failure surfaces, and specification uncertainty that become visible only at scale. These results explain why scientific software remains difficult to operationalize and motivate shared, observable execution substrates as a foundation for scalable AI4S and agentic science.

preprint2026arXiv

Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework

Geolocation, the task of identifying an image's location, requires complex reasoning and is crucial for navigation, monitoring, and cultural preservation. However, current methods often produce coarse, imprecise, and non-interpretable localization. A major challenge lies in the quality and scale of existing geolocation datasets. These datasets are typically small-scale and automatically constructed, leading to noisy data and inconsistent task difficulty, with images that either reveal answers too easily or lack sufficient clues for reliable inference. To address these challenges, we introduce a comprehensive geolocation framework with three key components: GeoComp, a large-scale dataset; GeoCoT, a novel reasoning method; and GeoEval, an evaluation metric, collectively designed to address critical challenges and drive advancements in geolocation research. At the core of this framework is GeoComp (Geolocation Competition Dataset), a large-scale dataset collected from a geolocation game platform involving 740K users over two years. It comprises 25 million entries of metadata and 3 million geo-tagged locations spanning much of the globe, with each location annotated thousands to tens of thousands of times by human users. The dataset offers diverse difficulty levels for detailed analysis and highlights key gaps in current models. Building on this dataset, we propose Geographical Chain-of-Thought (GeoCoT), a novel multi-step reasoning framework designed to enhance the reasoning capabilities of Large Vision Models (LVMs) in geolocation tasks. GeoCoT improves performance by integrating contextual and spatial cues through a multi-step process that mimics human geolocation reasoning. Finally, using the GeoEval metric, we demonstrate that GeoCoT significantly boosts geolocation accuracy by up to 25% while enhancing interpretability.

preprint2026arXiv

Inverse Knowledge Search over Verifiable Reasoning: Synthesizing a Scientific Encyclopedia from a Long Chains-of-Thought Knowledge Base

Most scientific materials compress reasoning, presenting conclusions while omitting the derivational chains that justify them. This compression hinders verification by lacking explicit, step-wise justifications and inhibits cross-domain links by collapsing the very pathways that establish the logical and causal connections between concepts. We introduce a scalable framework that decompresses scientific reasoning, constructing a verifiable Long Chain-of-Thought (LCoT) knowledge base and projecting it into an emergent encyclopedia, SciencePedia. Our pipeline operationalizes an endpoint-driven, reductionist strategy: a Socratic agent, guided by a curriculum of around 200 courses, generates approximately 3 million first-principles questions. To ensure high fidelity, multiple independent solver models generate LCoTs, which are then rigorously filtered by prompt sanitization and cross-model answer consensus, retaining only those with verifiable endpoints. This verified corpus powers the Brainstorm Search Engine, which performs inverse knowledge search -- retrieving diverse, first-principles derivations that culminate in a target concept. This engine, in turn, feeds the Plato synthesizer, which narrates these verified chains into coherent articles. The initial SciencePedia comprises approximately 200,000 fine-grained entries spanning mathematics, physics, chemistry, biology, engineering, and computation. In evaluations across six disciplines, Plato-synthesized articles (conditioned on retrieved LCoTs) exhibit substantially higher knowledge-point density and significantly lower factual error rates than an equally-prompted baseline without retrieval (as judged by an external LLM). Built on this verifiable LCoT knowledge base, this reasoning-centric approach enables trustworthy, cross-domain scientific synthesis at scale and establishes the foundation for an ever-expanding encyclopedia.

preprint2025arXiv

CoHalLo: code hallucination localization via probing hidden layer vector

The localization of code hallucinations aims to identify specific lines of code containing hallucinations, helping developers to improve the reliability of AI-generated code more efficiently. Although recent studies have adopted several methods to detect code hallucination, most of these approaches remain limited to coarse-grained detection and lack specialized techniques for fine-grained hallucination localization. This study introduces a novel method, called CoHalLo, which achieves line-level code hallucination localization by probing the hidden-layer vectors from hallucination detection models. CoHalLo uncovers the key syntactic information driving the model's hallucination judgments and locates the hallucinating code lines accordingly. Specifically, we first fine-tune the hallucination detection model on manually annotated datasets to ensure that it learns features pertinent to code syntactic information. Subsequently, we designed a probe network that projects high-dimensional latent vectors onto a low-dimensional syntactic subspace, generating vector tuples and reconstructing the predicted abstract syntax tree (P-AST). By comparing P-AST with the original abstract syntax tree (O-AST) extracted from the input AI-generated code, we identify the key syntactic structures associated with hallucinations. This information is then used to pinpoint hallucinated code lines. To evaluate CoHalLo's performance, we manually collected a dataset of code hallucinations. The experimental results show that CoHalLo achieves a Top-1 accuracy of 0.4253, Top-3 accuracy of 0.6149, Top-5 accuracy of 0.7356, Top-10 accuracy of 0.8333, IFA of 5.73, Recall@1% Effort of 0.052721, and Effort@20% Recall of 0.155269, which outperforms the baseline methods.

preprint2022arXiv

Atomic-Scale Visualization of Chiral Charge Density Wave States and Their Reversible Transition

Chirality is essential for various amazing phenomena in life and matter. However,chirality and its switching in electronic superlattices, such as charge density wave(CDW) arrays, remain elusive. In this study, we characterize the chirality transition with atom-resolution imaging in a single-layer NbSe2 CDW pattern by technique of scanning tunneling microscopy. The atomic lattice of the CDW array is found continuous and intact although its chirality is switched. Several intermediate states are tracked by time-resolved imaging, revealing the fast and dynamic chirality transition. Importantly, the switching is reversibly realized with an external electric-field. Our findings unveil the delicate transition process of chiral CDW array in a 2D crystal down to the atomic scale and may be applicable for future nanoscale devices.

preprint2022arXiv

Composite Expectile Regression with Gene-environment Interaction

If error distribution has heteroscedasticity, it voliates the assumption of linear regression. Expectile regression is a powerful tool for estimating the conditional expectiles of a response variable in this setting. Since multiple levels of expectile regression modelhas been well studied, we propose composite expectile regression by combining different levels of expectile regression to improve the efficacy. In this paper, we study the sparse composite expectile regression under high dimensional setting. It is realized by implementing a coordinate descent algorithm. We also prove its selection and estimation consistency. Simulations are conducted to demonstrate its performance, which is comparable to or better than the alternatives. We apply the proposed method to analyze Lung adenocarcinoma(LUAD) real data set, investigating the G-E interaction.

preprint2022arXiv

Exfoliation of 2D van der Waals crystals in ultrahigh vacuum for interface engineering

Two-dimensional (2D) materials and their heterostructures have been intensively studied in recent years due to their potential applications in electronic, optoelectronic, and spintronic devices. Nonetheless, the realization of 2D heterostructures with atomically flat and clean interfaces remains challenging, especially for air-sensitive materials, which hinders the in-depth investigation of interface-induced phenomena and the fabrication of high-quality devices. Here, we circumvented this challenge by exfoliating 2D materials in an ultrahigh vacuum. Remarkably, ultraflat and clean substrate surfaces can assist the exfoliation of 2D materials, regardless of the substrate and 2D material, thus providing a universal method for the preparation of heterostructures with ideal interfaces. In addition, we studied the properties of two prototypical systems that cannot be achieved previously, including the electronic structure of monolayer phospherene and optical responses of transition metal dichalcogenides on different metal substrates. Our work paves the way to engineer rich interface-induced phenomena, such as proximity effects and moiré superlattices.

preprint2022arXiv

One-step exfoliation method for plasmonic activation of large-area 2D crystals

Advanced exfoliation techniques are crucial for exploring the intrinsic properties and applications of 2D materials. Though the recently discovered Au-enhanced exfoliation technique provides an effective strategy for preparation of large-scale 2D crystals, the high cost of gold hinders this method from being widely adopted in industrial applications. In addition, direct Au contact could significantly quench photoluminescence (PL) emission in 2D semiconductors. It is therefore crucial to find alternative metals that can replace gold to achieve efficient exfoliation of 2D materials. Here, we present a one-step Ag-assisted method that can efficiently exfoliate many large-area 2D monolayers, where the yield ratio is comparable to Au-enhanced exfoliation method. Differing from Au film, however, the surface roughness of as-prepared Ag films on SiO2/Si substrate is much higher, which facilitates the generation of surface plasmons resulting from the nanostructures formed on the rough Ag surface. More interestingly, the strong coupling between 2D semiconductor crystals (e.g. MoS2, MoSe2) and Ag film leads to a unique PL enhancement that has not been observed in other mechanical exfoliation techniques, which can be mainly attributed to enhanced light-matter interaction as a result of extended propagation of surface plasmonic polariton (SPP). Our work provides a lower-cost and universal Ag-assisted exfoliation method, while at the same offering enhanced SPP-matter interactions.

preprint2022arXiv

Towards Exploring the Code Reuse from Stack Overflow during Software Development

As one of the most well-known programmer Q&A websites, Stack Overflow (i.e., SO) is serving tens of thousands of developers every day. Previous work has shown that many developers reuse the code snippets on SO when they find an answer (from SO) that functionally matches the programming problem they encounter in their development activities. To study how programmers reuse code on SO during project development, we conduct a comprehensive empirical study. First, to capture the development activities of programmers, we collect 342,148 modified code snippets in commits from 793 open-source Java projects, and these modified code can reflect the programming problems encountered during development. We also collect the code snippets from 1,355,617 posts on SO. Then, we employ CCFinder to detect the code clone between the modified code from commits and the code from SO, and further analyze the code reuse when programmer solves a programming problem during development. We count the code reuse ratios of the modified code snippets in the commits of each project in different years, the results show that the average code reuse ratio is 6.32%, and the maximum is 8.38%. The code reuse ratio in project commits has increased year by year, and the proportion of code reuse in the newly established project is higher than that of old projects. We also find that some projects reuse the code snippets from many years ago. Additionally, we find that experienced developers seem to be more likely to reuse the knowledge on SO. Moreover, we find that the code reuse ratio in bug-related commits (6.67%) is slightly higher than that of in non-bug-related commits (6.59%). Furthermore, we also find that the code reuse ratio (14.44%) in Java class files that have undergone multiple modifications is more than double the overall code reuse ratio (6.32%).

preprint2021arXiv

Beyond Fine-tuning: Classifying High Resolution Mammograms using Function-Preserving Transformations

The task of classifying mammograms is very challenging because the lesion is usually small in the high resolution image. The current state-of-the-art approaches for medical image classification rely on using the de-facto method for ConvNets - fine-tuning. However, there are fundamental differences between natural images and medical images, which based on existing evidence from the literature, limits the overall performance gain when designed with algorithmic approaches. In this paper, we propose to go beyond fine-tuning by introducing a novel framework called MorphHR, in which we highlight a new transfer learning scheme. The idea behind the proposed framework is to integrate function-preserving transformations, for any continuous non-linear activation neurons, to internally regularise the network for improving mammograms classification. The proposed solution offers two major advantages over the existing techniques. Firstly and unlike fine-tuning, the proposed approach allows for modifying not only the last few layers but also several of the first ones on a deep ConvNet. By doing this, we can design the network front to be suitable for learning domain specific features. Secondly, the proposed scheme is scalable to hardware. Therefore, one can fit high resolution images on standard GPU memory. We show that by using high resolution images, one prevents losing relevant information. We demonstrate, through numerical and visual experiments, that the proposed approach yields to a significant improvement in the classification performance over state-of-the-art techniques, and is indeed on a par with radiology experts. Moreover and for generalisation purposes, we show the effectiveness of the proposed learning scheme on another large dataset, the ChestX-ray14, surpassing current state-of-the-art techniques.

preprint2021arXiv

Isospin competitions and valley polarized correlated insulators in twisted double bilayer graphene

New phase of matter usually emerges when a given symmetry breaks spontaneously, which can involve charge, spin, and valley degree of freedoms. Here, we report an observation of new correlated insulators evolved from spin polarized states to valley polarized states in AB-BA stacked twisted double bilayer graphene (TDBG). The transition of the isospin polarization is a result of the competition between spin and valley, driven by the displacement field (D). At a high field |D| > 0.7 V/nm, we observe valley polarized correlated insulators with a big Zeeman g factor of ~10, both at v = 2 in the moiré conduction band and more surprisingly at v = -2 in the moiré valence band. At a medium field |D| < 0.6 V/nm, by contrast, it is a conventional spin polarized correlated insulator at v = 2 and a featureless metal at v = -2. Moreover, we observe a valley polarized Chern insulator with C = 2 emanating at v = 2 in the electron side and a valley polarized Fermi surface around v = -2 in the hole side. The valley Chern insulator with C = 2 is evident from a well quantized Hall conductance plateau at 2e^2/h and correspondingly a vanishing longitudinal component. The valley polarized Fermi surface is topologically trivial with C = 0, and it shows a series of quantized Landau levels with v_LL = 0, 1, 2, 3, 4 and others. These observations are in good agreements with our band and topology calculations. Our results demonstrate a feasible way to realize isospin control and to obtain new phases of matter in TDBG by the displacement field, and might benefit other twisted or non-twisted multilayer systems.

preprint2020arXiv

A Bayesian Updating Scheme for Pandemics: Estimating the Infection Dynamics of COVID-19

Epidemic models play a key role in understanding and responding to the emerging COVID-19 pandemic. Widely used compartmental models are static and are of limited use to evaluate intervention strategies with the emerging pandemic. Applying the technology of data assimilation, we propose a Bayesian updating approach for estimating epidemiological parameters using observable information for the purpose of assessing the impacts of different intervention strategies. We adopt a concise renewal model and propose new parameters by disentangling the reduction of instantaneous reproduction number Rt into mitigation and suppression factors for quantifying intervention impacts at a finer granularity. Then we developed a data assimilation framework for estimating these parameters including constructing an observation function and developing a Bayesian updating scheme. A statistical analysis framework is then built to quantify the impact of intervention strategies by monitoring the evolution of these estimated parameters. By Investigating the impacts of intervention measures of European countries, the United States and Wuhan with the framework, we reveal the effects of interventions in these countries and the resurgence risk in the USA.

preprint2020arXiv

Direct Measurement of the Electronic Structure and band gap nature of atomic-layer-thick 2H-MoTe2

The millimeter sized monolayer and bilayer 2H-MoTe2 single crystal samples are prepared by a new mechanical exfoliation method. Based on such high-quality samples, we report the first direct electronic structure study on them, using standard high resolution angle-resolved photoemission spectroscopy (ARPES). A direct band gap of 0.924eV is found at K in the rubidium-doped monolayer MoTe2. Similar valence band alignment is also observed in bilayer MoTe2,supporting an assumption of a analogous direct gap semiconductor on it. Our measurements indicate a rather large band splitting of 212meV at the valence band maximum (VBM) in monolayer MoTe2, and the splitting is systematically enlarged with layer stacking, from monolayer to bilayer and to bulk. Meanwhile, our PBE band calculation on these materials show excellent agreement with ARPES results. Some fundamental electronic parameters are derived from the experimental and calculated electronic structures. Our findings lay a foundation for further application-related study on monolayer and bilayer MoTe2.

preprint2020arXiv

Electronic Evolution from the Parent Mott Insulator to a Superconductor in Lightly Hole-Doped Bi2Sr2CaCu2O8+delta

High temperature superconductivity in cuprates is realized by doping the Mott insulator with charge carriers. A central issue is how such an insulating state can evolve into a conducting or superconducting state when charge carriers are introduced. Here, by in situ vacuum annealing and Rb deposition on the Bi2Sr2Ca0.6Dy0.4Cu2O8+delta (Bi2212) sample surface to push its doping level continuously from deeply underdoped (Tc=25 K, doping level p-0.066) to the near zero doping parent Mott insulator, angle-resolved photoemission spectroscopy measurements are carried out to observe the detailed electronic structure evolution in lightly hole-doped region for the first time. Our results indicate that the chemical potential lies at about 1 eV above the charge transfer band for the parent state at zero doping which is quite close to the upper Hubbard band. With increasing hole doping, the chemical potential moves continuously towards the charge transfer band and the band structure evolution exhibits a rigid band shift-like behavior. When the chemical potential approaches the charge transfer band at a doping level of -0.05, the nodal spectral weight near the Fermi level increases, followed by the emergence of the coherent quasiparticle peak and the insulator-superconductor transition. Our observations provide key insights in understanding the insulator-superconductor transition in doping the parent cuprate compound and for establishing related theories.

preprint2020arXiv

Simultaneous Generation of Direct- and Indirect-Gap Photoluminescence in Multilayer MoS2 Bubbles

Transition metal dichalcogenide (TMD) materials have received enormous attention due to their extraodinary optical and electrical properties, among which MoS2 is the most typical one. As thickness increases from monolayer to multilayer, the photoluminescence (PL) of MoS2 is gradually quenched due to the direct-to-indirect band gap transition. How to enhance PL response and decrease the layer dependence in multilayer MoS2 is still a challenging task. In this work, we report, for the first time, simultaneous generation of three PL peaks at around 1.3, 1.4 and 1.8 eV on multilayer MoS2 bubbles. The temperature dependent PL measurements indicate that the two peaks at 1.3 and 1.4 eV are phonon-assisted indirect-gap transitions while the peak at 1.8 eV is the direct-gap transition. Using first-principles calculations, the band structure evolution of multilayer MoS2 under strain is studied, from which the origin of the three PL peaks of MoS2 bubbles is further confirmed. Moreover, PL standing waves are observed in MoS2 bubbles that creates Newton-Ring-like patterns. This work demonstrates that the bubble structure may provide new opportunities for engineering the electronic structure and optical properties of layered materials.

preprint2020arXiv

Spectroscopic Evidence of Bilayer Splitting and Interlayer Pairing in an Iron Based Superconductor

In high temperature cuprate superconductors, the interlayer coupling between the CuO$_2$ planes plays an important role in dictating superconductivity, as indicated by the sensitive dependence of the critical temperature (T$_C$) on the number of CuO$_2$ planes in one structural unit. In Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$ superconductor with two CuO$_2$ planes in one structural unit, the interaction between the two CuO$_2$ planes gives rise to band splitting into two Fermi surface sheets (bilayer splitting) that have distinct superconducting gap. The iron based superconductors are composed of stacking of the FeAs/FeSe layers; whether the interlayer coupling can cause similar band splitting and its effect on superconductivity remain unclear. Here we report high resolution laser-based angle-resolved photoemission spectroscopy (ARPES) measurements on a newly discovered iron based superconductor, KCa$_2$Fe$_4$As$_4$F$_2$ (T$_C$=33.5\,K) which consists of stacking FeAs blocks with two FeAs layers separated by insulating Ca$_2$F$_2$ blocks. Bilayer splitting effect is observed for the first time that gives rise to totally five hole-like Fermi surface sheets around the Brilliouin zone center. Band structure calculations reproduce the observed bilayer splitting by identifying interlayer interorbital interaction between the two FeAs layers within one FeAs block. All the hole-like pockets around the zone center exhibit Fermi surface-dependent and nodeless superconducting gap. The gap functions with short-range antiferromagetic fluctuations are proposed and the gap symmetry can be well understood when the interlayer pairing is considered. The particularly strong interlayer pairing is observed for one of the bands. Our observations provide key information on the interlayer coupling and interlayer pairing in understanding superconductivity in iron based superconductors.

preprint2020arXiv

Universal mechanical exfoliation of large-area 2D crystals

Two-dimensional (2D) materials provide extraordinary opportunities for exploring phenomena arising in atomically thin crystals. Beginning with the first isolation of graphene, mechanical exfoliation has been a key to provide high-quality 2D materials but despite improvements it is still limited in yield, lateral size and contamination. Here we introduce a contamination-free, one-step and universal Au-assisted mechanical exfoliation method and demonstrate its effectiveness by isolating 40 types of single-crystalline monolayers, including elemental 2D crystals, metal-dichalcogenides, magnets and superconductors. Most of them are of millimeter-size and high-quality, as shown by transfer-free measurements of electron microscopy, photo spectroscopies and electrical transport. Large suspended 2D crystals and heterojunctions were also prepared with high-yield. Enhanced adhesion between the crystals and the substrates enables such efficient exfoliation, for which we identify a common rule that underpins a universal route for producing large-area monolayers and thus supports studies of fundamental properties and potential application of 2D materials.

preprint2019arXiv

Evidence for an Additional Symmetry Breaking from Direct Observation of Band Splitting in the Nematic State of FeSe Superconductor

The iron-based superconductor FeSe has attracted much recent attention because of its simple crystal structure, distinct electronic structure and rich physics exhibited by itself and its derivatives. Determination of its intrinsic electronic structure is crucial to understand its physical properties and superconductivity mechanism. Both theoretical and experimental studies so far have provided a picture that FeSe consists of one hole-like Fermi surface around the Brillouin zone center in its nematic state. Here we report direct observation of two hole-like Fermi surface sheets around the Brillouin zone center, and the splitting of the associated bands, in the nematic state of FeSe by taking high resolution laser-based angle-resolved photoemission measurements. These results indicate that, in addition to nematic order and spin-orbit coupling, there is an additional order in FeSe that breaks either inversion or time reversal symmetries. The new Fermi surface topology asks for reexamination of the existing theoretical and experimental understanding of FeSe and stimulates further efforts to identify the origin of the hidden order in its nematic state.

preprint2019arXiv

Selective Hybridization between Main Band and Superstructure Band in Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$ Superconductor

High-resolution laser-based angle-resolved photoemission measurements have been carried out on Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$ (Bi2212) and Bi$_2$Sr$_{2-x}$La$_x$CuO$_{6+δ}$ (Bi2201) superconductors. Unexpected hybridization between the main band and the superstructure band in Bi2212 is clearly revealed. In the momentum space where one main Fermi surface intersects with one superstructure Fermi surface, four bands are observed instead of two. The hybridization exists in both superconducting state and normal state, and in Bi2212 samples with different doping levels. Such a hybridization is not observed in Bi2201. This phenomenon can be understood by considering the bilayer splitting in Bi2212, the selective hybridization of two bands with peculiar combinations, and the altered matrix element effects of the hybridized bands. These observations provide strong evidence on the origin of the superstructure band which is intrinsic to the CuO$_2$ planes. Therefore, understanding physical properties and superconductivity mechanism in Bi2212 should consider the complete Fermi surface topology which involves the main bands, the superstructure bands and their interactions.

preprint2015arXiv

Promoting Similarity of Sparsity Structures in Integrative Analysis with Penalization

For data with high-dimensional covariates but small to moderate sample sizes, the analysis of single datasets often generates unsatisfactory results. The integrative analysis of multiple independent datasets provides an effective way of pooling information and outperforms single-dataset analysis and some alternative multi-datasets approaches including meta-analysis. Under certain scenarios, multiple datasets are expected to share common important covariates, that is, the multiple models have similarity in sparsity structures. However, the existing methods do not have a mechanism to {\it promote} the similarity of sparsity structures in integrative analysis. In this study, we consider penalized variable selection and estimation in integrative analysis. We develop an $L_0$-penalty based approach, which is the first to explicitly promote the similarity of sparsity structures. Computationally it is realized using a coordinate descent algorithm. Theoretically it has the much desired consistency properties. In simulation, it significantly outperforms the competing alternative when the models in multiple datasets share common important covariates. It has better or similar performance as the alternative when the sparsity structures share no similarity. Thus it provides a &#34;safe&#34; choice for data analysis. Applying the proposed method to three lung cancer datasets with gene expression measurements leads to models with significantly more similar sparsity structures and better prediction performance.

preprint2013arXiv

An innovative way of etching MoS2: Characterization and mechanistic investigation

We report a systematic study of the etching of MoS2 crystals by using XeF2 as a gaseous reactant. By controlling the etching process, monolayer MoS2 with uniform morphology can be obtained. The Raman and photoluminescence spectra of the resulting material were similar to those of exfoliated MoS2. Utilizing this strategy, different patterns such as a Hall bar structure and a hexagonal array can be realized. Furthermore, the etching mechanism was studied by introducing graphene as an etching mask. We believe our technique opens an easy and controllable way of etching MoS2, which can be used to fabricate complex nanostructures, such as nanoribbons, quantum dots and transistor structures. This etching process using XeF2 can also be extended to other interesting two-dimensional crystals.

preprint2013arXiv

Two-dimensional Potts antiferromagnets with a phase transition at arbitrarily large q

We exhibit infinite families of two-dimensional lattices (some of which are triangulations or quadrangulations of the plane) on which the q-state Potts antiferromagnet has a finite-temperature phase transition at arbitrarily large values of q. This unexpected result is proven rigorously by using a Peierls argument to measure the entropic advantage of sublattice long-range order. Additional numerical data are obtained using transfer matrices, Monte Carlo simulation, and a high-precision graph-theoretic method.

preprint2011arXiv

Finite-temperature phase transition in a class of 4-state Potts antiferromagnets

We argue that the 4-state Potts antiferromagnet has a finite-temperature phase transition on any Eulerian plane triangulation in which one sublattice consists of vertices of degree 4. We furthermore predict the universality class of this transition. We then present transfer-matrix and Monte Carlo data confirming these predictions for the cases of the union-jack and bisected hexagonal lattices.