Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
40works
0followers
22topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

40 published item(s)

preprint2026arXiv

Attribution-Guided Continual Learning for Large Language Models

Large language models (LLMs) often suffer from catastrophic forgetting in continual learning: after learning new tasks sequentially, they perform worse on earlier tasks. Existing methods mitigate catastrophic forgetting by data replay, parameter freezing, or regularization. However, these methods lack semantic awareness of internal knowledge distribution in LLMs. As a result, they cannot distinguish parameters that should be preserved or updated. We propose an attribution-guided continual fine-tuning framework for LLMs. Our method estimates task-specific, element-wise parameter importance in each Transformer layer and uses these scores to modulate gradients. Parameters important to previous tasks receive smaller updates, while less relevant ones remain plastic for learning new tasks. Experiments on continual learning benchmarks show that our method consistently outperforms baselines, achieving better retention of old tasks while maintaining competitive performance on new tasks.

preprint2024arXiv

The Fate of Simple Organics on Titan's Surface: A Theoretical Perspective

Atmospheric photochemistry on Titan continuously transforms methane and nitrogen gases into various organic compounds. This study explores the fate of these molecules when they land on Titan's surface. Our analytical exploration reveals that most simple organics found in Titan's atmosphere, including all nitriles, triple-bonded hydrocarbons, and benzene, land as solids. Only a few compounds are in the liquid phase, while only ethylene remains gaseous. For the simple organics that land as solids, we further examine their interactions with Titan's lake liquids. Utilizing principles of buoyancy, we found that flotation can be achieved via porosity-induced (25-60% porosity) or capillary force-induced buoyancy for HCN ices on ethane-rich lakes. Otherwise, these ices would sink and become lakebed sediments. By evaluating the timescale of flotation, our findings suggest that porosity-induced flotation of millimeter-sized and larger sediments is the only plausible mechanism for floating solids to explain the transient "magic islands" phenomena on Titan's lakes.

preprint2022arXiv

A bimodal distribution of haze in Pluto's atmosphere

Pluto, Titan, and Triton make up a unique class of solar system bodies, with icy surfaces and chemically reducing atmospheres rich in organic photochemistry and haze formation. Hazes play important roles in these atmospheres, with physical and chemical processes highly dependent on particle sizes, but the haze size distribution in reducing atmospheres is currently poorly understood. Here we report observational evidence that Pluto's haze particles are bimodally distributed, which successfully reproduces the full phase scattering observations from New Horizons. Combined with previous simulations of Titan's haze, this result suggests that haze particles in reducing atmospheres undergo rapid shape change near pressure levels ~0.5Pa and favors a photochemical rather than a dynamical origin for the formation of Titan's detached haze. It also demonstrates that both oxidizing and reducing atmospheres can produce multi-modal hazes, and encourages reanalysis of observations of hazes on Titan and Triton.

preprint2022arXiv

A Fast, Semi-analytical Model for the Venusian Binary Cloud System

The Venusian clouds originate from the binary condensation of H$_{2}$SO$_{4}$ and H$_{2}$O. The two components strongly interact with each other via chemistry and cloud formation. Previous works adopted sophisticated microphysical approaches to understand the clouds. Here we show that the observed vapor and cloud distributions on Venus can be well explained by a semi-analytical model. Our model assumes local thermodynamical equilibrium for water vapor but not for sulfuric acid vapor, and includes the feedback of cloud condensation and acidity to vapor distributions. The model predicts strong supersaturation of the H$_{2}$SO$_{4}$ vapor above 60 km, consistent with our recent cloud condensation model. The semi-analytical model is 100 times faster than the condensation model and 1000 times faster than the microphysical models. This allows us to quickly explore a large parameter space of the sulfuric acid gas-cloud system. We found that the cloud mass loading in the upper clouds has an opposite response of that in the lower clouds to the vapor mixing ratios in the lower atmosphere. The transport of water vapor influences the cloud acidity in all cloud layers while the transport of sulfuric acid vapor only dominates in the lower clouds. This cloud model is fast enough to be coupled with the climate models and chemistry models to understand the cloudy atmospheres of Venus and Venus-like extra-solar planets.

preprint2022arXiv

A Simple Condensation Model for the H2SO4-H2O Gas-cloud System on Venus

The current Venus climate is largely regulated by globally-covered concentrated sulfuric acid clouds from binary condensation of sulfuric acid (H2SO4) and water (H2O). To understand this complicated H2SO4-H2O gas-cloud system, previous theoretical studies either adopted complicated microphysical calculations or assumed that both H2SO4 and H2O vapor follow their saturation vapor pressure. In this study, we developed a simple one-dimensional cloud condensation model including condensation, diffusion and sedimentation of H2SO4 and H2O but without detailed microphysics. Our model is able to explain the observed vertical structure of cloud and upper haze mass loading, cloud acidity, H2SO4, and H2O vapor, and the mode-2 particle size on Venus. We found that most H2SO4 is stored in the condensed phase above 48 km, while the partitioning of H2O between the vapor and clouds is complicated. The cloud cycle is mostly driven by evaporation and condensation of H2SO4 rather than H2O and is about seven times stronger than the H2SO4 photochemical cycle. Most of the condensed H2O in the upper clouds is evaporated before the falling particles reach the middle clouds. The cloud acidity is affected by the temperature and the condensation-evaporation cycles of both H2SO4 and H2O. Because of the large chemical production of H2SO4 vapor and relatively inefficient cloud condensation, the simulated H2SO4 vapor above 60 km is largely supersaturated by more than two orders of magnitude, which could be tested by future observations.

preprint2022arXiv

ABO: Dataset and Benchmarks for Real-World 3D Object Understanding

We introduce Amazon Berkeley Objects (ABO), a new large-scale dataset designed to help bridge the gap between real and virtual 3D worlds. ABO contains product catalog images, metadata, and artist-created 3D models with complex geometries and physically-based materials that correspond to real, household objects. We derive challenging benchmarks that exploit the unique properties of ABO and measure the current limits of the state-of-the-art on three open problems for real-world 3D object understanding: single-view 3D reconstruction, material estimation, and cross-domain multi-view object retrieval.

preprint2022arXiv

Deep Decoding of $\ell_\infty$-coded Light Field Images

To enrich the functionalities of traditional cameras, light field cameras record both the intensity and direction of light rays, so that images can be rendered with user-defined camera parameters via computations. The added capability and flexibility are gained at the cost of gathering typically more than $100\times$ greater amount of information than conventional images. To cope with this issue, several light field compression schemes have been introduced. However, their ways of exploiting correlations of multidimensional light field data are complex and are hence not suited for inexpensive light field cameras. In this work, we propose a novel $\ell_\infty$-constrained light-field image compression system that has a very low-complexity DPCM encoder and a CNN-based deep decoder. Targeting high-fidelity reconstruction, the CNN decoder capitalizes on the $\ell_\infty$-constraint and light field properties to remove the compression artifacts and achieves significantly better performance than existing state-of-the-art $\ell_2$-based light field compression methods.

preprint2022arXiv

Depletion of gaseous CO in protoplanetary disks by surface-energy-regulated ice formation

Empirical constraints of fundamental properties of protoplanetary disks are essential for understanding planet formation and planetary properties (1,2). Carbon monoxide (CO) gas is often used to constrain disk properties (3). However, estimates show that the CO gas abundance in disks is depleted relative to expected values (4,5,6,7), and models of various disk processes impacting the CO abundance could not explain this depletion on observed 1Myr timescales (8,9,10,11,12,13,14). Here we demonstrate that surface energy effects on particles in disks, such as the Kelvin effect, that arise when ice heterogeneously nucleates onto an existing particle can efficiently trap CO in its ice phase. In previous ice formation models, CO gas was released when small ice-coated particles were lofted to warmed disk layers. Our model can reproduce the observed abundance, distribution and time evolution of gaseous CO in the four most studied protoplanetary disks (7). We constrain the solid and gaseous CO inventory at the midplane and disk diffusivities and resolve inconsistencies in estimates of the disk mass -- three crucial parameters that control planetary formation.

preprint2022arXiv

Mechanism-based Tuning of Room-temperature Ferromagnetism in Mn-doped \b{eta}-Ga2O3 by Annealing Atmospher

Mn-doped \b{eta}-Ga2O3 (GMO) films with room-temperature ferromagnetism (RTFM) are synthesized by polymer-assisted deposition and the effects of annealing atmosphere (air or pure O2 gas) on their structures and physical properties are investigated. The characterizations show that the concentrations of vacancy defects and Mn dopants in various valence states and lattice constants of the samples are all modulated by the annealing atmosphere. Notably, the samples annealed in air (GMO-air) exhibit a saturation magnetization as strong as 170% times that of the samples annealed in pure O2 gas (GMO-O2), which can be quantitatively explained by oxygen vacancy (VO) controlled ferromagnetism due to bound magnetic polarons established between delocalized hydrogenic electrons of VOs and local magnetic moments of Mn2+, Mn3+, and Mn4+ ions in the samples. Our results provide insights into mechanism-based tuning of RTFM in Ga2O3 and may be useful for design, fabrication, and application of related spintronic materials.

preprint2022arXiv

Multi-modality Deep Restoration of Extremely Compressed Face Videos

Arguably the most common and salient object in daily video communications is the talking head, as encountered in social media, virtual classrooms, teleconferences, news broadcasting, talk shows, etc. When communication bandwidth is limited by network congestions or cost effectiveness, compression artifacts in talking head videos are inevitable. The resulting video quality degradation is highly visible and objectionable due to high acuity of human visual system to faces. To solve this problem, we develop a multi-modality deep convolutional neural network method for restoring face videos that are aggressively compressed. The main innovation is a new DCNN architecture that incorporates known priors of multiple modalities: the video-synchronized speech signal and semantic elements of the compression code stream, including motion vectors, code partition map and quantization parameters. These priors strongly correlate with the latent video and hence they are able to enhance the capability of deep learning to remove compression artifacts. Ample empirical evidences are presented to validate the superior performance of the proposed DCNN method on face videos over the existing state-of-the-art methods.

preprint2022arXiv

New Open Cluster candidates Found in Galactic Disk Using Gaia DR2/EDR3 Data

We report 541 new open cluster candidates in Gaia EDR3 through revisiting the cluster results from an earlier analysis of the Gaia DR2, which revealed nearly a thousand open cluster candidates in the solar neighborhood (mostly d < 3 kpc) resideing at Galactic latitudes |b| < 20 degrees. A subsequent comparison with lists of known clusters shows a large increases of the cluster samples within 2 kpc from the Sun. We assign membership probabilities to the stars through the open source pyUPMASK algorithm, and also estimate the physical parameters through isochrone fitting for each candidate. Most of the new candidates show small total proper motion dispersions and clear features in the color-magnitude diagrams. Besides, the metallicity gradient of the new candidates is consistent with those found in the literature. The cluster parameters and member stars are available at CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via https://cdsarc.unistra.fr/viz-bin/cat/J/ApJS. The discovery of these new objects shows that the open cluster samples in Gaia data is still not complete, and more discoveries are expected in the future researches.

preprint2022arXiv

Strain tunability of perpendicular magnetic anisotropy in van der Waals ferromagnets VI3

Layered ferromagnets with high coercivity have special applications in nanoscale memory elements in electronic circuits, such as data storage. Therefore, searching for new hard ferromagnets and effectively tuning or enhancing the coercivity are the hottest topics in layered magnets today. Here, we report a strain tunability of perpendicular magnetic anisotropy in van der Waals (vdW) ferromagnets VI3 using magnetic circular dichroism measurements. For an unstrained flake, the M-H curve shows a rectangular-shaped hysteresis loop with perpendicular magnetic anisotropy and a large coercivity (up to 1.775 T at 10 K). Furthermore, the coercivity can be enhanced to a maximum of 2.6 T at 10 K under a 2.9% in-plane tensile strain. Our DFT calculations show that the magnetic anisotropy energy (MAE) can be dramatically increased after applying an in-plain tensile strain, which contributes to the enhancement of coercivity in the VI3 flake. Meanwhile, the strain tunability on the coercivity of CrI3, with a similar crystal structure, is limited. The main reason is the strong spin-orbital coupling in V3+ in VI6 octahedra in comparison with that in Cr3+. The strain tunability of coercivity in VI3 flakes highlights its potential for integration into vdW heterostructures, paving the way toward nanoscale spintronic devices and applications in the future.

preprint2022arXiv

Structured Graph Variational Autoencoders for Indoor Furniture layout Generation

We present a structured graph variational autoencoder for generating the layout of indoor 3D scenes. Given the room type (e.g., living room or library) and the room layout (e.g., room elements such as floor and walls), our architecture generates a collection of objects (e.g., furniture items such as sofa, table and chairs) that is consistent with the room type and layout. This is a challenging problem because the generated scene should satisfy multiple constrains, e.g., each object must lie inside the room and two objects cannot occupy the same volume. To address these challenges, we propose a deep generative model that encodes these relationships as soft constraints on an attributed graph (e.g., the nodes capture attributes of room and furniture elements, such as class, pose and size, and the edges capture geometric relationships such as relative orientation). The architecture consists of a graph encoder that maps the input graph to a structured latent space, and a graph decoder that generates a furniture graph, given a latent code and the room graph. The latent space is modeled with auto-regressive priors, which facilitates the generation of highly structured scenes. We also propose an efficient training procedure that combines matching and constrained learning. Experiments on the 3D-FRONT dataset show that our method produces scenes that are diverse and are adapted to the room layout.

preprint2022arXiv

THP: Topological Hawkes Processes for Learning Causal Structure on Event Sequences

Learning causal structure among event types on multi-type event sequences is an important but challenging task. Existing methods, such as the Multivariate Hawkes processes, mostly assumed that each sequence is independent and identically distributed. However, in many real-world applications, it is commonplace to encounter a topological network behind the event sequences such that an event is excited or inhibited not only by its history but also by its topological neighbors. Consequently, the failure in describing the topological dependency among the event sequences leads to the error detection of the causal structure. By considering the Hawkes processes from the view of temporal convolution, we propose a Topological Hawkes process (THP) to draw a connection between the graph convolution in the topology domain and the temporal convolution in time domains. We further propose a causal structure learning method on THP in a likelihood framework. The proposed method is featured with the graph convolution-based likelihood function of THP and a sparse optimization scheme with an Expectation-Maximization of the likelihood function. Theoretical analysis and experiments on both synthetic and real-world data demonstrate the effectiveness of the proposed method

preprint2022arXiv

Unsupervised Scene Sketch to Photo Synthesis

Sketches make an intuitive and powerful visual expression as they are fast executed freehand drawings. We present a method for synthesizing realistic photos from scene sketches. Without the need for sketch and photo pairs, our framework directly learns from readily available large-scale photo datasets in an unsupervised manner. To this end, we introduce a standardization module that provides pseudo sketch-photo pairs during training by converting photos and sketches to a standardized domain, i.e. the edge map. The reduced domain gap between sketch and photo also allows us to disentangle them into two components: holistic scene structures and low-level visual styles such as color and texture. Taking this advantage, we synthesize a photo-realistic image by combining the structure of a sketch and the visual style of a reference photo. Extensive experimental results on perceptual similarity metrics and human perceptual studies show the proposed method could generate realistic photos with high fidelity from scene sketches and outperform state-of-the-art photo synthesis baselines. We also demonstrate that our framework facilitates a controllable manipulation of photo synthesis by editing strokes of corresponding sketches, delivering more fine-grained details than previous approaches that rely on region-level editing.

preprint2021arXiv

Deep Agent: Studying the Dynamics of Information Spread and Evolution in Social Networks

This paper explains the design of a social network analysis framework, developed under DARPA&#39;s SocialSim program, with novel architecture that models human emotional, cognitive and social factors. Our framework is both theory and data-driven, and utilizes domain expertise. Our simulation effort helps in understanding how information flows and evolves in social media platforms. We focused on modeling three information domains: cryptocurrencies, cyber threats, and software vulnerabilities for the three interrelated social environments: GitHub, Reddit, and Twitter. We participated in the SocialSim DARPA Challenge in December 2018, in which our models were subjected to extensive performance evaluation for accuracy, generalizability, explainability, and experimental power. This paper reports the main concepts and models, utilized in our social media modeling effort in developing a multi-resolution simulation at the user, community, population, and content levels.

preprint2021arXiv

Fluence Adaptation for Task-based Dose Optimization in X-ray Phase-Contrast Imaging

Purpose: Grating-based imaging (GBI) and edge-illumination (EI) are two promising types of XPCI as the conventional x-ray sources can be directly utilized. For GBI and EI systems, the phase-stepping acquisition with multiple exposures at a constant fluence is usually adopted in the literature. This work, however, attempts to challenge such a constant fluence concept during the phase-stepping process and proposes a fluence adaptation mechanism for dose reduction. Method: Recently, analytic multi-order moment analysis has been proposed to improve the computing efficiency. In these algorithms, multiple contrasts can be calculated by summing together the weighted phase-stepping curves (PSCs) with some kernel functions, which suggests us that the raw data at different steps have different contributions for the noise in retrieved contrasts. Based on analytic retrieval formulas and the Gaussian noise model for detected signals, we derived an optimal adaptive fluence distribution, which is proportional to the absolute weighting kernel functions and the root of original sample PSCs acquired under the constant fluence. Results: To validate our analyses, simulations and experiments are conducted for GBI and EI systems. Simulated results demonstrate that the dose reduction ratio between our proposed fluence distributions and the typical constant one can be about 20% for the phase contrast, which is consistent with our theoretical predictions. Although the experimental noise reduction ratios are a little smaller than the theoretical ones, synthetic and real experiments both observe better noise performance by our proposed method. Our simulated results also give out the effective ranges of the parameters of the PSCs, such as the visibility in GBI, the standard deviation and the mean value in EI, providing a guidance for the use of our proposed approach in practice.

preprint2020arXiv

A Global Non-Hydrostatic Atmospheric Model with a Mass and Energy Conserving Vertically-Implicit-Correction (VIC) Scheme

Global non-hydrostatic atmospheric models are becoming increasingly important for studying the climates of planets and exoplanets. However, such models suffer from computational difficulties due to the large aspect ratio between the horizontal and vertical directions. To overcome this problem, we developed a global model using a vertically-implicit-correction (VIC) scheme in which the integration time step is no longer limited by the propagation of acoustic waves in the vertical. We proved that our model, based on the $\rm Athena^{++}$ framework and its extension for planetary atmospheres - SNAP (Simulating Non-hydrostatic Atmosphere on Planets), rigorously conserves mass and energy in finite volume simulations. We found that traditional numerical stabilizers such as hyper-viscosity and divergence damping are not needed when using the VIC scheme, which greatly simplifies the numerical implementation and improves stability. We present simulation results ranging from 1D linear waves to 3D global circulations with and without the VIC scheme. These tests demonstrate that our formulation correctly tracks local turbulent motions, produces Kelvin-Helmholtz instability, and generates a super-rotating jet on hot Jupiters. Employing this VIC scheme improves the computational efficiency of global simulations by more than two orders of magnitude compared to an explicit model and facilitates the capability of simulating a wide range of planetary atmospheres both regionally and globally.

preprint2020arXiv

Atmospheric Regimes and Trends on Exoplanets and Brown Dwarfs

A planetary atmosphere is the outer gas layer of a planet. Besides its scientific significance among the first and most accessible planetary layers observed from space, it is closely connected with planetary formation and evolution, surface and interior processes, and habitability of planets. Current theories of the planetary atmosphere were primarily obtained through the studies of eight large planets, Pluto and three large moons (Io, Titan, and Triton) in the Solar System. Outside the Solar System, more than four thousand extra-solar planets (exoplanets) and two thousand brown dwarfs have been confirmed in our galaxy, and their population is rapidly growing. The rich information from these exotic bodies offers a database to test, in a statistical sense, the fundamental theories of planetary climates. Here we review the current knowledge of atmospheres of exoplanets and brown dwarfs from recent observations and theories. This review highlights important regimes and statistical trends in an ensemble of atmospheres as an initial step towards fully characterizing diverse substellar atmospheres, that illustrates the underlying principles and critical problems. Insights are obtained through analysis of the dependence of atmospheric characteristics on basic planetary parameters. Dominant processes that influence atmospheric stability, energy transport, temperature, composition, and flow pattern are discussed and elaborated with simple scaling laws. We dedicate this review to Dr. Adam P. Showman (1968-2020) in recognition of his fundamental contribution to the understanding of atmospheric dynamics on giant planets, exoplanets, and brown dwarfs.

preprint2020arXiv

Challenge of Spatial Cognition for Deep Learning

Given the success of the deep convolutional neural networks (DCNNs) in applications of visual recognition and classification, it would be tantalizing to test if DCNNs can also learn spatial concepts, such as straightness, convexity, left/right, front/back, relative size, aspect ratio, polygons, etc., from varied visual examples of these concepts that are simple and yet vital for spatial reasoning. Much to our dismay, extensive experiments of the type of cognitive psychology demonstrate that the data-driven deep learning (DL) cannot see through superficial variations in visual representations and grasp the spatial concept in abstraction. The root cause of failure turns out to be the learning methodology, not the computational model of the neural network itself. By incorporating task-specific convolutional kernels, we are able to construct DCNNs for spatial cognition tasks that can generalize to input images not drawn from the same distribution of the training set. This work raises a precaution that without manually-incorporated priors or features DCCNs may fail spatial cognitive tasks at rudimentary level.

preprint2020arXiv

Deep Multi-modality Soft-decoding of Very Low Bit-rate Face Videos

We propose a novel deep multi-modality neural network for restoring very low bit rate videos of talking heads. Such video contents are very common in social media, teleconferencing, distance education, tele-medicine, etc., and often need to be transmitted with limited bandwidth. The proposed CNN method exploits the correlations among three modalities, video, audio and emotion state of the speaker, to remove the video compression artifacts caused by spatial down sampling and quantization. The deep learning approach turns out to be ideally suited for the video restoration task, as the complex non-linear cross-modality correlations are very difficult to model analytically and explicitly. The new method is a video post processor that can significantly boost the perceptual quality of aggressively compressed talking head videos, while being fully compatible with all existing video compression standards.

preprint2020arXiv

Deflating Super-Puffs: Impact of Photochemical Hazes on the Observed Mass-Radius Relationship of Low Mass Planets

The observed mass-radius relationship of low-mass planets informs our understanding of their composition and evolution. Recent discoveries of low mass, large radii objects (&#34;super-puffs&#34;) have challenged theories of planet formation and atmospheric loss, as their high inferred gas masses make them vulnerable to runaway accretion and hydrodynamic escape. Here we propose that high altitude photochemical hazes could enhance the observed radii of low-mass planets and explain the nature of super-puffs. We construct model atmospheres in radiative-convective equilibrium and compute rates of atmospheric escape and haze distributions, taking into account haze coagulation, sedimentation, diffusion, and advection by an outflow wind. We develop mass-radius diagrams that include atmospheric lifetimes and haze opacity, which is enhanced by the outflow, such that young (~0.1-1 Gyr), warm (T$_{eq}$ $\geq$ 500 K), low mass objects ($M_c$ < 4M$_{\rm Earth}$) should experience the most apparent radius enhancement due to hazes, reaching factors of three. This reconciles the densities and ages of the most extreme super-puffs. For Kepler-51b, the inclusion of hazes reduces its inferred gas mass fraction to <10%, similar to that of planets on the large radius side of the sub-Neptune radius gap. This suggests that Kepler-51b may be evolving towards that population, and that some warm sub-Neptunes may have evolved from super-puffs. Hazes also render transmission spectra of super-puffs and sub-Neptunes featureless, consistent with recent measurements. Our hypothesis can be tested by future observations of super-puffs&#39; transmission spectra at mid-infrared wavelengths, where we predict that the planet radius will be half of that observed in the near-infrared.

preprint2020arXiv

Demarcating circulation regimes of synchronously rotating terrestrial planets within the habitable zone

We investigate the atmospheric dynamics of terrestrial planets in synchronous rotation within the habitable zone of low-mass stars using the Community Atmosphere Model (CAM). The surface temperature contrast between day and night hemispheres decreases with an increase in incident stellar flux, which is opposite the trend seen on gas giants. We define three dynamical regimes in terms of the equatorial Rossby deformation radius and the Rhines length. The slow rotation regime has a mean zonal circulation that spans from day to night side, with both the Rossby deformation radius and the Rhines length exceeding planetary radius, which occurs for planets around stars with effective temperatures of 3300 K to 4500 K (rotation period > 20 days). Rapid rotators have a mean zonal circulation that partially spans a hemisphere and with banded cloud formation beneath the substellar point, with the Rossby deformation radius is less than planetary radius, which occurs for planets orbiting stars with effective temperatures of less than 3000 K (rotation period < 5 days). In between is the Rhines rotation regime, which retains a thermally-direct circulation from day to night side but also features midlatitude turbulence-driven zonal jets. Rhines rotators occur for planets around stars in the range of 3000 K to 3300 K (rotation period ~ 5 to 20 days), where the Rhines length is greater than planetary radius but the Rossby deformation radius is less than planetary radius. The dynamical state can be observationally inferred from comparing the morphology of the thermal emission phase curves of synchronously rotating planets.

preprint2020arXiv

Eightfold Fermionic Excitation in a Charge Density Wave Compound

Unconventional quasiparticle excitations in condensed matter systems have become one of the most important research frontiers. Beyond two- and fourfold degenerate Weyl and Dirac fermions, three-, six- and eightfold symmetry protected degeneracies have been predicted however remain challenging to realize in solid state materials. Here, charge density wave compound TaTe4 is proposed to hold eightfold fermionic excitation and Dirac point in energy bands. High quality TaTe4 single crystals are prepared, where the charge density wave is revealed by directly imaging the atomic structure and a pseudogap of about 45 meV on the surface. Shubnikov de-Haas oscillations of TaTe4 are consistent with band structure calculation. Scanning tunneling microscopy reveals atomic step edge states on the surface of TaTe4. This work uncovers that charge density wave is able to induce new topological phases and sheds new light on the novel excitations in condensed matter materials.

preprint2020arXiv

Image polaritons in boron nitride for extreme polariton confinement with low losses

Polaritons in two-dimensional materials provide extreme light confinement that is difficult to achieve with metal plasmonics. However, such tight confinement inevitably increases optical losses through various damping channels. Here we demonstrate that hyperbolic phonon polaritons in hexagonal boron nitride can overcome this fundamental trade-off. Among two observed polariton modes, featuring a symmetric and antisymmetric charge distribution, the latter exhibits lower optical losses and tighter polariton confinement. Far-field excitation and detection of this high-momenta mode becomes possible with our resonator design that can boost the coupling efficiency via virtual polariton modes with image charges that we dub image polaritons. Using these image polaritons, we experimentally observe a record-high effective index of up to 132 and quality factors as high as 501. Further, our phenomenological theory suggests an important role of hyperbolic surface scattering in the damping process of hyperbolic phonon polaritons.

preprint2020arXiv

Local Causal Structure Learning and its Discovery Between Type 2 Diabetes and Bone Mineral Density

Type 2 diabetes (T2DM), one of the most prevalent chronic diseases, affects the glucose metabolism of the human body, which decreases the quantity of life and brings a heavy burden on social medical care. Patients with T2DM are more likely to suffer bone fragility fracture as diabetes affects bone mineral density (BMD). However, the discovery of the determinant factors of BMD in a medical way is expensive and time-consuming. In this paper, we propose a novel algorithm, Prior-Knowledge-driven local Causal structure Learning (PKCL), to discover the underlying causal mechanism between BMD and its factors from the clinical data. Since there exist limited data but redundant prior knowledge for medicine, PKCL adequately utilize the prior knowledge to mine the local causal structure for the target relationship. Combining the medical prior knowledge with the discovered causal relationships, PKCL can achieve more reliable results without long-standing medical statistical experiments. Extensive experiments are conducted on a newly provided clinical data set. The experimental study of PKCL on the data is proved to highly corresponding with existing medical knowledge, which demonstrates the superiority and effectiveness of PKCL. To illustrate the importance of prior knowledge, the result of the algorithm without prior knowledge is also investigated.

preprint2020arXiv

MDLdroid: a ChainSGD-reduce Approach to Mobile Deep Learning for Personal Mobile Sensing

Personal mobile sensing is fast permeating our daily lives to enable activity monitoring, healthcare and rehabilitation. Combined with deep learning, these applications have achieved significant success in recent years. Different from conventional cloud-based paradigms, running deep learning on devices offers several advantages including data privacy preservation and low-latency response for both model inference and update. Since data collection is costly in reality, Google&#39;s Federated Learning offers not only complete data privacy but also better model robustness based on multiple user data. However, personal mobile sensing applications are mostly user-specific and highly affected by environment. As a result, continuous local changes may seriously affect the performance of a global model generated by Federated Learning. In addition, deploying Federated Learning on a local server, e.g., edge server, may quickly reach the bottleneck due to resource constraint and serious failure by attacks. Towards pushing deep learning on devices, we present MDLdroid, a novel decentralized mobile deep learning framework to enable resource-aware on-device collaborative learning for personal mobile sensing applications. To address resource limitation, we propose a ChainSGD-reduce approach which includes a novel chain-directed Synchronous Stochastic Gradient Descent algorithm to effectively reduce overhead among multiple devices. We also design an agent-based multi-goal reinforcement learning mechanism to balance resources in a fair and efficient manner. Our evaluations show that our model training on off-the-shelf mobile devices achieves 2x to 3.5x faster than single-device training, and 1.5x faster than the master-slave approach.

preprint2020arXiv

Near-lossless $\ell_\infty$-constrained Image Decompression via Deep Neural Network

Recently a number of CNN-based techniques were proposed to remove image compression artifacts. As in other restoration applications, these techniques all learn a mapping from decompressed patches to the original counterparts under the ubiquitous $\ell_\infty$ metric. However, this approach is incapable of restoring distinctive image details which may be statistical outliers but have high semantic importance (e.g., tiny lesions in medical images). To overcome this weakness, we propose to incorporate an $\ell_\infty$ fidelity criterion in the design of neural network so that no small, distinctive structures of the original image can be dropped or distorted. Experimental results demonstrate that the proposed method outperforms the state-of-the-art methods in $\ell_\infty$ error metric and perceptual quality, while being competitive in $\ell_2$ error metric as well. It can restore subtle image details that are otherwise destroyed or missed by other algorithms. Our research suggests a new machine learning paradigm of ultra high fidelity image compression that is ideally suited for applications in medicine, space, and sciences.

preprint2020arXiv

Projectively flat bundles and semi-stable Higgs bundles

The Corlette-Donaldson-Hitchin-Simpson&#39;s correspondence states that, on a compact Kähler manifold $(X, ω)$, there is a one-to-one correspondence between the moduli space of semisimple flat complex vector bundles and the moduli space of poly-stable Higgs bundles with vanishing Chern numbers. In this paper, we extend this correspondence to the projectively flat bundles case. We prove that there is an equivalence of categories between the category of $ω$-semi-stable (poly-stable) Higgs bundles $(E, \overline{\partial}_{E}, ϕ)$ with $(2rc_{2}(E)-(r-1)c_{1}^{2}(E))\cdot [ω]^{n-2}=0 $ and the category of (semi-simple) projectively flat bundles $(E, D)$ with $\sqrt{-1}F_{D}=α\otimes \mbox{Id}_{E}$ for some real (1,1)-form $α$. Furthermore, we also establish the above correspondence on some compact non-Kähler manifolds. As its application, we obtain a vanishing theorem of characteristic classes of projectively flat bundles.

preprint2020arXiv

QSAN: A Quantum-probability based Signed Attention Network for Explainable False Information Detection

False information detection on social media is challenging as it commonly requires tedious evidence-collecting but lacks available comparative information. Clues mined from user comments, as the wisdom of crowds, could be of considerable benefit to this task. However, it is non-trivial to capture the complex semantics from the contents and comments in consideration of their implicit correlations. Although deep neural networks have good expressive power, one major drawback is the lack of explainability. In this paper, we focus on how to learn from the post contents and related comments in social media to understand and detect the false information more effectively, with explainability. We thus propose a Quantum-probability based Signed Attention Network (QSAN) that integrates the quantum-driven text encoding and a novel signed attention mechanism in a unified framework. QSAN is not only able to distinguish important comments from the others, but also can exploit the conflicting social viewpoints in the comments to facilitate the detection. Moreover, QSAN is advantageous with its explainability in terms of transparency due to quantum physics meanings and the attention weights. Extensive experiments on real-world datasets show that our approach outperforms state-of-the-art baselines and can provide different kinds of user comments to explain why a piece of information is detected as false.

preprint2020arXiv

Rapid Synthesis of Thermoelectric SnSe Thin Films by MPCVD

Microwave plasma chemical vapor deposition (MPCVD) has been traditionally used to synthesize carbon-based materials such as diamonds, carbon nanotubes and graphene. Here we report that a rapid and catalyst-free growth of SnSe thin films can be achieved by using single-mode MPCVD with appropriate source materials. The analysis combing microscope images, X-ray diffraction patterns and lattice vibration modes shows that the grown thin films were composed of orthorhombic structured SnSe polycrystals mainly along the (111) direction. Further thermoelectric (TE) characterizations reveal that the power factor of the SnSe films reached 3.98 μW cm-1K-2 at 600 K, comparable to the highest reported values of SnSe thin films. Our results may open an avenue for rapid synthesis of new types of materials such as IV-VI compounds and be useful for TE application of these materials.

preprint2020arXiv

Revisiting the Sulfur-Water Chemical System in the Middle Atmosphere of Venus

Sulfur-water chemistry plays an important role in the middle atmosphere of Venus. Ground based observations have found that simultaneously observed SO2 and H2O at ~64 km vary with time and are temporally anti-correlated. To understand these observations, we explore the sulfur-water chemical system using a one-dimensional chemistry-diffusion model. We find that SO2 and H2O mixing ratios above the clouds are highly dependent on mixing ratios of the two species at the middle cloud top (58 km). The behavior of sulfur-water chemical system can be classified into three regimes but there is no abrupt transition among these regimes. In particular, there is no bifurcation behavior as previously claimed. We also find that the SO2 self-shielding effect causes H2O above the clouds to respond to the middle cloud top in a non-monotonic fashion. Through comparison with observations, we find that mixing ratio variations at the middle cloud top can explain the observed variability of SO2 and H2O. The sulfur-water chemistry in the middle atmosphere is responsible for the H2O-SO2 anti-correlation at 64 km. Eddy transport change alone cannot explain the variations of both species. These results imply that variations of species abundance in the middle atmosphere are significantly influenced by the lower atmospheric processes. Continued ground-based measurements of the co-evolution of SO2 and H2O above the clouds and new spacecraft missions will be crucial for uncover the complicated processes underlying the interaction among the lower atmosphere, the clouds and the middle atmosphere of Venus.

preprint2020arXiv

Rigorous Explanation of Inference on Probabilistic Graphical Models

Probabilistic graphical models, such as Markov random fields (MRF), exploit dependencies among random variables to model a rich family of joint probability distributions. Sophisticated inference algorithms, such as belief propagation (BP), can effectively compute the marginal posteriors. Nonetheless, it is still difficult to interpret the inference outcomes for important human decision making. There is no existing method to rigorously attribute the inference outcomes to the contributing factors of the graphical models. Shapley values provide an axiomatic framework, but naively computing or even approximating the values on general graphical models is challenging and less studied. We propose GraphShapley to integrate the decomposability of Shapley values, the structure of MRFs, and the iterative nature of BP inference in a principled way for fast Shapley value computation, that 1) systematically enumerates the important contributions to the Shapley values of the explaining variables without duplicate; 2) incrementally compute the contributions without starting from scratches. We theoretically characterize GraphShapley regarding independence, equal contribution, and additivity. On nine graphs, we demonstrate that GraphShapley provides sensible and practical explanations.

preprint2020arXiv

Synthesis and temperature-dependent photoluminescence of high density GeSe triangular nanoplate arrays on Si substrates

We have grown germanium selenide (GeSe) triangular nanoplate arrays (TNAs) with a high density (3.82E+6 / mm2) on the Si (111) substrate using a simple thermal evaporation method. The thickness and trilateral lengths of a single triangular nanoplate were statistically estimated by atomic force microscopy (AFM) as 44 nm, 365 nm, 458 nm and 605 nm, respectively. Transmission electron microscopy (TEM) images and X-ray diffraction (XRD) patterns show that the TNAs were composed of single crystalline GeSe phase. The Se-related defects in the lattice were also revealed by TEM images and Raman vibration modes. Unlike previously reported GeSe compounds, the GeSe TNAs exhibited temperature-dependent photoluminescence (PL). In addition, not previously reported PL peak (1.25 eV) of the 44 nm thick TNAs at 5 K was in the gaps between those of GeSe monolayers (1.5 nm) and thin films (400 nm), revealing a close relationship between the PL peak and the thickness of GeSe. The high-density structure and temperature-dependent PL of the TNAs on the Si substrate may be useful for temperature controllable semiconductor nanodevices.

preprint2020arXiv

The Science Case for a Titan Flagship-class Orbiter with Probes

We outline a flagship-class mission concept focused on studying Titan as a global system, with particular emphasis on the polar regions. Investigating Titan from the unique standpoint of a polar orbit would enable comprehensive global maps to uncover the physics and chemistry of the atmosphere, and the topography and geophysical environment of the surface and subsurface. The mission includes two key elements: (1) an orbiter spacecraft, which also acts as a data relay, and (2) one or more small probes to directly investigate Titan&#39;s seas and make the first direct measurements of their liquid composition and physical environment. The orbiter would carry a sophisticated remote sensing payload, including a novel topographic lidar, a long-wavelength surface-penetrating radar, a sub-millimeter sounder for winds and for mesospheric/thermospheric composition, and a camera and near-infrared spectrometer. An instrument suite to analyze particles and fields would include a mass spectrometer to focus on the interactions between Titan&#39;s escaping upper atmosphere and the solar wind and Saturnian magnetosphere. The orbiter would enter a stable polar orbit around 1500 to 1800 km, from which vantage point it would make global maps of the atmosphere and surface. One or more probes, released from the orbiter, would investigate Titan&#39;s seas in situ, including possible differences in composition between higher and lower latitude seas, as well as the atmosphere during the parachute descent. The number of probes, as well as the instrument complement on the orbiter and probe, remain to be finalized during a mission study that we recommend to NASA as part of the NRC Decadal Survey for Planetary Science now underway, with the goal of an overall mission cost in the &#34;small flagship&#34; category of ~$2 bn. International partnerships, similar to Cassini-Huygens, may also be included for consideration.

preprint2020arXiv

Two-Dimensional Si-Ge Monolayers: Stabilities, Structures and Electronic Properties

Si-Ge monolayers (SiGeM) with different elementary proportion x (0<x<1) were systematically studied for the first-time using ab initio calculations in this work. The structural stabilities of the Si1-xGexM with different symmetries were investigated using phonon spectra, and an infinite miscibility between Si and Ge elements were revealed in the 2D honeycomb structures. The simulated scanning tunneling microscope images and Raman and infrared active modes of the Si1-xGexM were then obtained for structural characterizations. Interestingly, the study of electronic properties revealed not previously reported oscillatory nonlinear dependence of band gap values on the elementary proportion x in the Si1-xGexM, which suggests an alternative way for tuning the band gaps of 2D materials. Additionally, low effective masses (0.008m0 ~ 0.021m0) of the carriers in the semiconducting Si1-xGexM were found, which has potentials for high-speed applications. Considering the advantage of their compatibility with current Si-based technology and the trend of miniature of electronic devices, the Si1-xGexM with stable structures and excellent properties would be important for 2D applications based on group IV materials.

preprint2020arXiv

Ultra High Fidelity Image Compression with $\ell_\infty$-constrained Encoding and Deep Decoding

In many professional fields, such as medicine, remote sensing and sciences, users often demand image compression methods to be mathematically lossless. But lossless image coding has a rather low compression ratio (around 2:1 for natural images). The only known technique to achieve significant compression while meeting the stringent fidelity requirements is the methodology of $\ell_\infty$-constrained coding that was developed and standardized in nineties. We make a major progress in $\ell_\infty$-constrained image coding after two decades, by developing a novel CNN-based soft $\ell_\infty$-constrained decoding method. The new method repairs compression defects by using a restoration CNN called the $\ell_\infty\mbox{-SDNet}$ to map a conventionally decoded image to the latent image. A unique strength of the $\ell_\infty\mbox{-SDNet}$ is its ability to enforce a tight error bound on a per pixel basis. As such, no small distinctive structures of the original image can be dropped or distorted, even if they are statistical outliers that are otherwise sacrificed by mainstream CNN restoration methods. More importantly, this research ushers in a new image compression system of $\ell_\infty$-constrained encoding and deep soft decoding ($\ell_\infty\mbox{-ED}^2$). The $\ell_\infty \mbox{-ED}^2$ approach beats the best of existing lossy image compression methods (e.g., BPG, WebP, etc.) not only in $\ell_\infty$ but also in $\ell_2$ error metric and perceptual quality, for bit rates near the threshold of perceptually transparent reconstruction. Operationally, the new compression system is practical, with a low-complexity real-time encoder and a cascade decoder consisting of a fast initial decoder and an optional CNN soft decoder.

preprint2019arXiv

Transit Signatures of Inhomogeneous Clouds on Hot Jupiters: Insights From Microphysical Cloud Modeling

We determine the observability in transmission of inhomogeneous cloud cover on the limbs of hot Jupiters through post processing a general circulation model to include cloud distributions computed using a cloud microphysics model. We find that both the east and west limb often form clouds, but that the different properties of these clouds enhances the limb to limb differences compared to the clear case. Using JWST it should be possible to detect the presence of cloud inhomogeneities by comparing the shape of the transit lightcurve at multiple wavelengths because inhomogeneous clouds impart a characteristic, wavelength dependent signature. This method is statistically robust even with limited wavelength coverage, uncertainty on limb darkening coefficients, and imprecise transit times. We predict that the short wavelength slope varies strongly with temperature. The hot limb of the hottest planets form higher altitude clouds composed of smaller particles leading to a strong rayleigh slope. The near infrared spectral features of clouds are almost always detectable, even when no spectral slope is visible in the optical. In some of our models a spectral window between 5 and 9 microns can be used to probe through the clouds and detect chemical spectral features. Our cloud particle size distributions are not log-normal and differ from species to species. Using the area or mass weighted particle size significantly alters the relative strength of the cloud spectral features compared to using the predicted size distribution. Finally, the cloud content of a given planet is sensitive to a species&#39; desorption energy and contact angle, two parameters that could be constrained experimentally in the future.