Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
61works
0followers
22topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

61 published item(s)

preprint2026arXiv

BitLM: Unlocking Multi-Token Language Generation with Bitwise Continuous Diffusion

Autoregressive language models generate text one token at a time, yet natural language is inherently structured in multi-token units, including phrases, n-grams, and collocations that carry meaning jointly. This one-token bottleneck limits both the expressiveness of the model during pre-training and its throughput at inference time. Existing remedies such as speculative decoding or diffusion-based language models either leave the underlying bottleneck intact or sacrifice the causal structure essential to language modeling. We propose BitLM, a language model that represents each token as a fixed-length binary code and employs a lightweight diffusion head to denoise multiple tokens in parallel within each block. Crucially, BitLM preserves left-to-right causal attention across blocks while making joint lexical decisions within each block, combining the reliability of autoregressive modeling with the parallelism of iterative refinement. By replacing the large-vocabulary softmax with bitwise denoising, BitLM reframes token generation as iterative commitment in a compact binary space, enabling more efficient pre-training and substantially faster inference without altering the causal foundation that makes language models effective. Our results demonstrate that the one-token-at-a-time paradigm is not a fundamental requirement but an interface choice, and that changing it can yield a stronger and faster language model. We hope BitLM points toward a promising direction for next-generation language model architectures.

preprint2025arXiv

A Gas-Kinetic Scheme for Maxwell Equations

The Gas-Kinetic Scheme (GKS), widely used in computational fluid dynamics for simulating hypersonic and other complicated flow phenomena, is extended in this work to electromagnetic problems by solving Maxwell's equations. In contrast to the classical GKS formulation, the proposed scheme employs a discrete rather than a continuous velocity space. By evaluating a time-accurate numerical flux at cell interfaces, the proposed scheme attains second-order accuracy within a single step. Its kinetic formulation provides an inherently multidimensional framework, while the finite-volume formulation ensures straightforward extension to unstructured meshes. Through the incorporation of a collision process, the scheme exhibits lower numerical dissipation than classical flux-vector splitting (FVS) methods. Furthermore, the kinetic decomposition enables direct implementation of non-reflecting boundary conditions. The proposed scheme is validated against several benchmark problems and compared with established methods, including the Finite-Difference Time-Domain (FDTD) method and FVS. A lattice Boltzmann method (LBM) implementation is also included for comparative analysis. Finally, the technique is applied to simulate electromagnetic wave propagation in a realistic aircraft configuration, demonstrating its ability to model complex geometries.

preprint2024arXiv

Formation of PSR J1012+5307 with an extremely low-mass white dwarf: testing magnetic braking models

PSR J1012+5307 is a millisecond pulsar with an extremely low-mass (ELM) white dwarf (WD) companion in an orbit of 14.5 hours. Magnetic braking (MB) plays an important role in influencing the orbital evolution of binary systems with a low-mass ($\lt 1-2~M_{\odot}$) donor star. At present, there exist several different MB descriptions. In this paper, we investigate the formation of PSR J1012+5307 as a probe to test the plausible MB model. Employing a detailed stellar evolution model by the MESA code, we find that the Convection And Rotation Boosted MB and the 'Intermediate' MB models can reproduce the WD mass, WD radius, WD surface gravity, neutron-star mass, and orbital period observed in PSR J1012+5307. However, our simulated WD has higher effective temperature than the observation. Other three MB mechanisms including the standard MB model are too weak to account for the observed orbital period in a Hubble time. A long cooling timescale caused by H-shell flashes of the WD may alleviate the discrepancy between the simulated effective temperature and the observed value.

preprint2024arXiv

High-order compact gas-kinetic scheme in arbitrary Lagrangian-Eulerian formulation

This study proposes an extension of the high-order compact gas-kinetic scheme (CGKS) to compressible flow simulation in an arbitrary Lagrangian-Eulerian (ALE) formulation in unstructured mesh. The ALE method is achieved by subdividing arbitrary mesh into tetrahedrons and integrating flux function in a local coordinate system at the cell interface to ensure geometric conservation law. The scheme incorporates a compact reconstruction with third-order accuracy for updating both cell-averaged conservative flow variables and their gradients. HWENO-type nonlinear reconstruction and gradient compression factors are adopted to improve the accuracy and robustness of the scheme. A multi-stage multi-derivative (MSMD) time-stepping method is also implemented to achieve high-order time accuracy with fewer middle stages. The scheme is used to study problems involving moving boundaries. The numerical experiments demonstrate the effectiveness of the scheme in capturing the accurate solutions of both low-speed smooth flow and highly compressible ones with strong shock waves.

preprint2022arXiv

60-nm-span wavelength-tunable vortex fiber laser with intracavity plasmon metasurfaces

Wavelength-tunable vortex fiber lasers that could generate beams carrying orbital angular momentum (OAM) hold great interest in large-capacity optical communications. The wavelength tunability of conventional vortex fiber lasers is however limited by the range of 35 nm due to narrow bandwidth and/or insertion loss of mode conversion components. Optical metasurfaces apart from being compact planar components can flexibly manipulate light with high efficiency in a broad wavelength range. Here, we propose and demonstrate for the first time, to the best of our knowledge, a metasurface-assisted vortex fiber laser that can directly generate OAM beams with changeable topological charges. Due to the designed broadband gap-surface plasmon metasurface, combined with an intracavity tunable filter, the laser enables OAM beam with center wavelength continuously tunable from 1015 nm to 1075 nm, nearly twice of other vortex fiber lasers ever reported. The metasurface can be designed at will to satisfy requirements for either low pump threshold or high slope efficiency of the laser. Furthermore, the cavity-metasurface configuration can be extended to generate higher-order OAM beams or more complex structured beams in different wavelength regions, which greatly broadens the possibilities for developing low-cost and high-quality structured-beam laser sources.

preprint2022arXiv

A Survey of Adversarial Learning on Graphs

Deep learning models on graphs have achieved remarkable performance in various graph analysis tasks, e.g., node classification, link prediction, and graph clustering. However, they expose uncertainty and unreliability against the well-designed inputs, i.e., adversarial examples. Accordingly, a line of studies has emerged for both attack and defense addressed in different graph analysis tasks, leading to the arms race in graph adversarial learning. Despite the booming works, there still lacks a unified problem definition and a comprehensive review. To bridge this gap, we investigate and summarize the existing works on graph adversarial learning tasks systemically. Specifically, we survey and unify the existing works w.r.t. attack and defense in graph analysis tasks, and give appropriate definitions and taxonomies at the same time. Besides, we emphasize the importance of related evaluation metrics, investigate and summarize them comprehensively. Hopefully, our works can provide a comprehensive overview and offer insights for the relevant researchers. Latest advances in graph adversarial learning are summarized in our GitHub repository https://github.com/EdisonLeeeee/Graph-Adversarial-Learning.

preprint2022arXiv

Artificial Cnoidal Wave Breathers in Optical Microresonators

Breathers are localized structures that undergo a periodic oscillation in their duration and amplitude. Optical microresonators, benefiting from their high quality factor, provide an ideal test bench for studying the breathing phenomena. In the monochromatically pumped microresonator system, intrinsic breathing instabilities are widely observed in the form of temporal dissipative Kerr solitons which only exist in the effectively red detuned regime. Here, we proposed a novel bichromatic pumping scheme to create compulsive breathing microcombs via respectively distributing two pump lasers at the effectively blue and red detuned side of a single resonance. We experimentally discover the artificial cnoidal wave breathers and molecular crystal-like breathers in a chip-based silicon nitride microresonator, and theoretically describe their intriguing temporal dynamics based on the bichromatic pumping Lugiato-Lefever equation. In particular, the corresponding breathing microcombs exhibit diverse comb line spacing ranging from 2 to 17 times of the free spectral range of the resonator. Our discovery not only provides a simple and robust method to produce microcombs with reconfigurable comb line spacing, but also reveals a new type of breathing waves in driven dissipative nonlinear systems.

preprint2022arXiv

Construct the emission line galaxy-host halo connection through auto and cross correlations

We investigate the [O\,II] emission line galaxy (ELG)-host halo connection via auto and cross correlations, and propose a concise and effective method to populate ELGs in dark matter halos without assuming a parameterized halo occupation distribution (HOD) model. Using the observational data from VIMOS Public Extragalactic Redshift Survey (VIPERS), we measure the auto and cross correlation functions between ELGs selected by [O\,II] luminosity and normal galaxies selected by stellar mass. Combining the stellar-halo mass relation (SHMR) derived for the normal galaxies and the fraction of ELGs observed in the normal galaxy population, we demonstrate that we can establish an accurate ELG-halo connection. With the ELG-halo connection, we can accurately reproduce the auto and cross correlation functions of ELGs and normal galaxies both in real-space and in redshift-space, once the satellite fraction is properly reduced. Our method provides a novel strategy to generate ELG mock catalogs for ongoing and upcoming galaxy redshift surveys. We also provide a simple description for the HOD of ELGs.

preprint2022arXiv

Distant finetuning with discourse relations for stance classification

Approaches for the stance classification task, an important task for understanding argumentation in debates and detecting fake news, have been relying on models which deal with individual debate topics. In this paper, in order to train a system independent from topics, we propose a new method to extract data with silver labels from raw text to finetune a model for stance classification. The extraction relies on specific discourse relation information, which is shown as a reliable and accurate source for providing stance information. We also propose a 3-stage training framework where the noisy level in the data used for finetuning decreases over different stages going from the most noisy to the least noisy. Detailed experiments show that the automatically annotated dataset as well as the 3-stage training help improve model performance in stance classification. Our approach ranks 1st among 26 competing teams in the stance classification track of the NLPCC 2021 shared task Argumentative Text Understanding for AI Debater, which confirms the effectiveness of our approach.

preprint2022arXiv

DyLex: Incorporating Dynamic Lexicons into BERT for Sequence Labeling

Incorporating lexical knowledge into deep learning models has been proved to be very effective for sequence labeling tasks. However, previous works commonly have difficulty dealing with large-scale dynamic lexicons which often cause excessive matching noise and problems of frequent updates. In this paper, we propose DyLex, a plug-in lexicon incorporation approach for BERT based sequence labeling tasks. Instead of leveraging embeddings of words in the lexicon as in conventional methods, we adopt word-agnostic tag embeddings to avoid re-training the representation while updating the lexicon. Moreover, we employ an effective supervised lexical knowledge denoising method to smooth out matching noise. Finally, we introduce a col-wise attention based knowledge fusion mechanism to guarantee the pluggability of the proposed framework. Experiments on ten datasets of three tasks show that the proposed framework achieves new SOTA, even with very large scale lexicons.

preprint2022arXiv

High-order Compact Gas-kinetic Schemes for Three-dimensional Flow Simulation on Tetrahedral Mesh

A general framework for the development of high-order compact schemes has been proposed recently. The core steps of the schemes are composed of the following. 1). Based on a kinetic model equation, from a generalized initial distribution of flow variables construct a time-accurate evolution solution of gas distribution function at a cell interface ; 2). Introduce the WENO-type weighting functions into the time-derivative of the cell interface flux function in the multistage multi-derivative time stepping scheme to cope with the possible impingement of a shock wave on a cell interface within a time step; 3). Take moments of interface gas distribution function to obtain the time-accurate flow variables and the corresponding fluxes at the cell interface, and update the cell-averaged flow variables and their gradients inside each control volume; 4). Within the physical domain of dependence of the reconstructed cell, based on the cell-averaged flow variables and their gradients develop compact initial data reconstruction to get initial flow distributions at the beginning of next time step. A compact gas-kinetic scheme (GKS) up to sixth-order accuracy in space and fourth-order in time has been constructed on 2D unstructured mesh before. In this paper, the compact GKS up to fourth-order accuracy on 3D tetrahedral mesh will be further constructed with the focus on the WENO-type initial data reconstruction. Nonlinear weights are designed to achieve high-order accuracy for the smooth Navier-Stokes solution and keep super robustness in 3D computation with strong shock interactions. The fourth-order compact GKS can use a large time step with CFL number $0.6$ in the simulations from subsonic to hypersonic flow. A series of test cases are used to validate the scheme. The high-order compact GKS is ready for 3D applications with complex geometry.

preprint2022arXiv

High-order Gas-kinetic Schemes with Non-compact and Compact Reconstruction for Implicit Large Eddy Simulation

High-order gas-kinetic scheme (HGKS) with 5th-order non-compact reconstruction has been well implemented for implicit large eddy simulation (ILES) in nearly incompressible turbulent channel flows. In this study, the HGKS with higher-order non-compact reconstruction and compact reconstruction will be validated in turbulence simulation. For higher-order non-compact reconstruction, 7th-order normal reconstruction and tangential reconstruction are implemented. In terms of compact reconstruction, 5th-order normal reconstruction is adopted. Current work aims to show the benefits of high-order non-compact reconstruction and compact reconstruction for ILES. The accuracy of HGKS is verified by numerical simulation of three-dimensional advection of density perturbation. For the non-compact 7th-order scheme, 16 Gaussian points are required on the cell interface to preserve the order of accuracy. Then, HGKS with non-compact and compact reconstruction is used in the three-dimensional Taylor-Green vortex (TGV) problem and turbulent channel flows. Accurate ILES solutions have been obtained from HGKS. In terms of the physical modeling underlying the numerical algorithms, the compact reconstruction has the consistent physical and numerical domains of dependence without employing additional information from cells which have no any direct physical connection with the targeted cell. The compact GKS shows a favorable performance for turbulence simulation in resolving the multi-scale structures.

preprint2022arXiv

Magnetism of QCD matter and pion mass from tensor-type spin polarization and anomalous magnetic moment of quarks

We investigate the magnetism of QCD matter and pion mass under magnetic field considering the contribution from the tensor-type spin polarization and the anomalous magnetic moment (AMM) of quarks. It is found that the tensor-type spin polarization (TSP) induces the magnetic catalysis of chiral condensate and diamagnetism (negative magnetic susceptibility) of quark matter at low temperature, both neutral and charged pion masses increase quickly with magnetic field in the case of TSP. The anomalous magnetic moment (AMM) of quarks induces magnetic inhibition and a magnetic dependent AMM causes inverse magnetic catalysis at finite temperature, and the neutral pion mass decreases with magnetic field while the charged pion mass shows nonmonotonic behavior with the magnetic field, which is qualitatively in agreement with lattice result. However, the magnetic susceptibility is positive at low temperature with AMM. In the current framework, our results show the irreconcilable contradiction between the diamagnetism and inverse magnetic catalysis.

preprint2022arXiv

Neuromorphic computing using wavelength-division multiplexing

Optical neural networks (ONNs), or optical neuromorphic hardware accelerators, have the potential to dramatically enhance the computing power and energy efficiency of mainstream electronic processors, due to their ultralarge bandwidths of up to 10s of terahertz together with their analog architecture that avoids the need for reading and writing data back and forth. Different multiplexing techniques have been employed to demonstrate ONNs, amongst which wavelength division multiplexing (WDM) techniques make sufficient use of the unique advantages of optics in terms of broad bandwidths. Here, we review recent advances in WDM based ONNs, focusing on methods that use integrated microcombs to implement ONNs. We present results for human image processing using an optical convolution accelerator operating at 11 Tera operations per second. The open challenges and limitations of ONNs that need to be addressed for future applications are also discussed.

preprint2022arXiv

Photometric Objects around Cosmic Webs (PAC) Delineated in a Spectroscopic Survey. I. Methods

We provide a method for estimating the projected density distribution $\bar{n}_2w_p(r_p)$ of photometric objects around spectroscopic objects in a redshift survey. This quantity describes the distribution of Photometric sources with certain physical properties (e.g. luminosity, mass, color etc) Around Cosmic webs (PAC) traced by the spectroscopic objects. The method can make full use of current and future deep and wide photometric surveys to explore the formation of galaxies up to medium redshift ($z_s < 2$), with the aid of cosmological redshift surveys that sample only a fairly limited species of objects (e.g. Emission Line Galaxies). As an example, we apply the PAC method to the CMASS spectroscopic and HSC-SSP PDR2 photometric samples to explore the distribution of galaxies for a wide range of stellar mass from $10^{9.0}{\rm M_\odot}$ to $10^{12.0}{\rm M_\odot}$ around massive ones at $z_s\approx 0.6$. Using the abundance matching method, we model $\bar{n}_2w_p(r_p)$ in N-body simulation using MCMC sampling, and accurately measure the stellar-halo mass relation (SHMR) and stellar mass function (SMF) for the whole mass range. We can also measure the conditional stellar mass function (CSMF) of satellites for central galaxies of different mass. The PAC method has many potential applications for studying the evolution of galaxies.

preprint2022arXiv

Satellite galaxies&#39; drag on field stars in the Milky Way

With Gaia EDR3 data, velocity dispersion of Milky Way field stars around satellite galaxies have been investigated. We have fitted velocity dispersion against distance to satellite galaxy and found the gradient of velocity dispersion is related to the mass of satellite galaxy. With order-of-magnitude approximations, a linear correlation has been fitted between the mass of satellite galaxy and gradient of velocity dispersion caused by its gravitational drag. Though our result is an observational qualitative result, it shows better relation could be obtained with more observations in the future.

preprint2022arXiv

Semantic optical fiber communication system

The current optical communication systems minimize bit or symbol errors without considering the semantic meaning behind digital bits, thus transmitting a lot of unnecessary information. We propose and experimentally demonstrate a semantic optical fiber communication (SOFC) system. Instead of encoding information into bits for transmission, semantic information is extracted from the source using deep learning. The generated semantic symbols are then directly transmitted through an optical fiber. Compared with the bit-based structure, the SOFC system achieved higher information compression and a more stable performance, especially in the low received optical power regime, and enhanced the robustness against optical link impairments. This work introduces an intelligent optical communication system at the human analytical thinking level, which is a significant step toward a breakthrough in the current optical communication architecture.

preprint2022arXiv

Three-dimensional third-order gas-kinetic scheme on hybrid unstructured meshes for Euler and Navier-Stokes equations

In this paper, a third order gas kinetic scheme is developed on the three dimensional hybrid unstructured meshes for the numerical simulation of compressible inviscid and viscous flows. A third-order WENO reconstruction is developed on the hybrid unstructured meshes, including tetrahedron, pyramid, prism and hexahedron. A simple strategy is adopted for the selection of big stencil and sub-stencils. Incorporate with the two-stage fourth-order temporal discretization and lower-upper symmetric Gauss-Seidel methods, both explicit and implicit high-order gas-kinetic schemes are developed. A variety of numerical examples, from the subsonic to supersonic flows, are presented to validate the accuracy and robustness for both inviscid and viscous flows.

preprint2022arXiv

UGKWP for three-dimensional simulation of gas-particle fluidized bed

The gas-solid particle two-phase flow in a fluidized bed shows complex physics. Following our previous work, the multi-scale framework based on gas-kinetic scheme (GKS) and unified gas-kinetic wave-particle method (UGKWP) for the gas-particle system is firstly extended to the three-dimensional simulation of the fluidized bed. For the solid particle evolution, different from the widely-used Eulerian and Lagrangian approaches, the UGKWP unifies the wave (dense particle region) and discrete particle (dilute particle region) formulation seamlessly according to a continuous variation of particle cell&#39;s Kundsen number ($Kn$). The GKS-UGKWP for the coupled gas-particle evolution system can automatically become an Eulerian-Eulerian (EE) method in the high particle collision regime and Eulerian-Lagrangian (EL) formulation in the collisionless particle regime. In the transition regime, the UGKWP can achieve a smooth transition between the Eulerian and Lagrangian limiting formulation. More importantly, the weights of mass distributions from analytical wave and discrete particle are related to the local $Kn$ by $\exp(-1/Kn)$ for wave and $(1-\exp(-1/Kn))$ for discrete particle. As a result, the UGKWP provides an optimal modeling for capturing the particle phase in terms of physical accuracy and numerical efficiency. In the numerical simulation, the UGKWP does not need any prior division of dilute/dense regions, which makes it suitable for the fluidized bed problem, where the dilute/transition/dense regions instantaneously coexist and are dynamically interconvertible. In this paper, based on the GKS-UGKWP formulation two lab-scale fluidization cases are simulated in 3D and the simulation results are compared with the experimental measurements. The typical heterogeneous flow features of the fluidized bed are well captured and the statistics are in good agreement with experiment data.

preprint2022arXiv

Zero-shot Cross-lingual Conversational Semantic Role Labeling

While conversational semantic role labeling (CSRL) has shown its usefulness on Chinese conversational tasks, it is still under-explored in non-Chinese languages due to the lack of multilingual CSRL annotations for the parser training. To avoid expensive data collection and error-propagation of translation-based methods, we present a simple but effective approach to perform zero-shot cross-lingual CSRL. Our model implicitly learns language-agnostic, conversational structure-aware and semantically rich representations with the hierarchical encoders and elaborately designed pre-training objectives. Experimental results show that our model outperforms all baselines by large margins on two newly collected English CSRL test sets. More importantly, we confirm the usefulness of CSRL to non-Chinese conversational tasks such as the question-in-context rewriting task in English and the multi-turn dialogue response generation tasks in English, German and Japanese by incorporating the CSRL information into the downstream conversation-based models. We believe this finding is significant and will facilitate the research of non-Chinese dialogue tasks which suffer the problems of ellipsis and anaphora.

preprint2021arXiv

A giant central red disk galaxy at redshift $z=0.76$: challenge to theories of galaxy formation

We report a giant red central disk galaxy in the XMM-LSS north region. The region is covered with a rich variety of multiband photometric and spectroscopic observations. Using the photometric data of the Canada-France-Hawaii Telescope Legacy Survey (CFHTLS) and spectroscopic observation of the Baryon Oscillation Spectroscopic Survey (BOSS), we find that the galaxy has a stellar mass of $\sim 10^{11.6}$ times of the solar mass $M_\odot$. The galaxy has a red color and has an old stellar population, and thus its star formation has stopped. With the photometric image data of Hyper Suprime-Cam (HSC) Subaru Strategic Program, we demonstrate that its luminosity profile is perfectly described by a Sérsic form with $n=1.22$ indicating disk morphology. We also analyze its environment based on the VIMOS Public Extragalactic Redshift Survey (VIPERS) photometric catalog, and find that its close neighbors are all less massive, indicating that our observed galaxy is sitting at the center of its host halo. Existence of the giant red central disk galaxy seriously challenges the current standard paradigm of galaxy formation, as there is no known physical mechanism to explain the quenching of its star formation. This conclusion is supported by state-of-the-art hydrodynamical simulations of galaxy formation.

preprint2021arXiv

Are there magnetars in high-mass X-ray binaries?

Magnetars form a special population of neutron stars with strong magnetic fields and long spin periods. About 30 magnetars and magnetar candidates known currently are probably isolated. But the possibility that magnetars are in binaries hasn&#39;t been excluded. In this work, we perform spin evolution of neutron stars with different magnetic fields in wind-fed high-mass X-ray binaries and compare the spin period distribution with observations, aiming to find magnetars in binaries. Our simulation shows that some of the neutron stars, which have long spin periods or in wide-separation systems, need strong magnetic fields to explain their spin evolution. This implies that there are probably magnetars in high-mass X-ray binaries. Moreover, this can further provide a theoretical basis for some unclear astronomical phenomena, such as the possible origin of periodic fast radio bursts from magnetars in binary systems.

preprint2021arXiv

GraphGallery: A Platform for Fast Benchmarking and Easy Development of Graph Neural Networks Based Intelligent Software

Graph Neural Networks (GNNs) have recently shown to be powerful tools for representing and analyzing graph data. So far GNNs is becoming an increasingly critical role in software engineering including program analysis, type inference, and code representation. In this paper, we introduce GraphGallery, a platform for fast benchmarking and easy development of GNNs based software. GraphGallery is an easy-to-use platform that allows developers to automatically deploy GNNs even with less domain-specific knowledge. It offers a set of implementations of common GNN models based on mainstream deep learning frameworks. In addition, existing GNNs toolboxes such as PyG and DGL can be easily incorporated into the platform. Experiments demonstrate the reliability of implementations and superiority in fast coding. The official source code of GraphGallery is available at https://github.com/EdisonLeeeee/GraphGallery and a demo video can be found at https://youtu.be/mv7Zs1YeaYo.

preprint2021arXiv

Joint Coreference Resolution and Character Linking for Multiparty Conversation

Character linking, the task of linking mentioned people in conversations to the real world, is crucial for understanding the conversations. For the efficiency of communication, humans often choose to use pronouns (e.g., &#34;she&#34;) or normal phrases (e.g., &#34;that girl&#34;) rather than named entities (e.g., &#34;Rachel&#34;) in the spoken language, which makes linking those mentions to real people a much more challenging than a regular entity linking task. To address this challenge, we propose to incorporate the richer context from the coreference relations among different mentions to help the linking. On the other hand, considering that finding coreference clusters itself is not a trivial task and could benefit from the global character information, we propose to jointly solve these two tasks. Specifically, we propose C$^2$, the joint learning model of Coreference resolution and Character linking. The experimental results demonstrate that C$^2$ can significantly outperform previous works on both tasks. Further analyses are conducted to analyze the contribution of all modules in the proposed model and the effect of all hyper-parameters.

preprint2021arXiv

Principle-driven Fiber Transmission Model based on PINN Neural Network

In this paper, a novel principle-driven fiber transmission model based on physical induced neural network (PINN) is proposed. Unlike data-driven models which regard fiber transmission problem as data regression tasks, this model views it as an equation solving problem. Instead of adopting input signals and output signals which are calculated by SSFM algorithm in advance before training, this principle-driven PINN based fiber model adopts frames of time and distance as its inputs and the corresponding real and imaginary parts of NLSE solutions as its outputs. By taking into account of pulses and signals before transmission as initial conditions and fiber physical principles as NLSE in the design of loss functions, this model will progressively learn the transmission rules. Therefore, it can be effectively trained without the data labels, referred as the pre-calculated signals after transmission in data-driven models. Due to this advantage, SSFM algorithm is no longer needed before the training of principle-driven fiber model which can save considerable time consumption. Through numerical demonstration, the results show that this principle-driven PINN based fiber model can handle the prediction tasks of pulse evolution, signal transmission and fiber birefringence for different transmission parameters of fiber telecommunications.

preprint2021arXiv

Programmable Multifunctional Plasmonic Waveguide System based on Coding Metamaterials and Inverse Design

In this article, we propose a programmable plasmonic waveguide system (PPWS) to achieve several different functions based on metal coding metamaterials (MCMs) and inverse design technology. There is no need to spend much time on considering the relation between the function and the structure because the MCMs in the PPWS are reprogrammable. In order to demonstrate the effectiveness of the PPWS, we utilize it to achieve several filtering functions, including bandstop and bandpass filters. The simulation results exhibit that the performance of filters is improved based on genetic algorithm, particle swarm optimization, multi-traversal direct-binary search and simulated annealing. Especially, the bandwidth and quality factor for the narrow-band filter can reach 6.5 nm and 200.5. In addition to the simple filtering functions, some relatively complex transmission characteristics can be obtained by using the PPWS, such as plasmon-induced transparency-like effects. In conclusion, genetic algorithm is considered as the most efficient inverse design method for our system due to its less optimization time and stable performance. In comparison with the previous works, our proposed PPWS not only provides a general framework for obtaining an effective, flexible and compact plasmonic device but also shows the applications of inverse design on photonics devices.

preprint2021arXiv

Structural Information Preserving for Graph-to-Text Generation

The task of graph-to-text generation aims at producing sentences that preserve the meaning of input graphs. As a crucial defect, the current state-of-the-art models may mess up or even drop the core structural information of input graphs when generating outputs. We propose to tackle this problem by leveraging richer training signals that can guide our model for preserving input information. In particular, we introduce two types of autoencoding losses, each individually focusing on different aspects (a.k.a. views) of input graphs. The losses are then back-propagated to better calibrate our model via multi-task training. Experiments on two benchmarks for graph-to-text generation show the effectiveness of our approach over a state-of-the-art baseline. Our code is available at \url{http://github.com/Soistesimmer/AMR-multiview}.

preprint2021arXiv

The first decade of unified gas kinetic scheme

In 2010, the unified gas kinetic scheme (UGKS) was proposed by Xu et al . (A unified gas-kinetic scheme for continuum and rarefied flows, Journal of Computational Physics, 2010). In the past decade, many numerical techniques have been developed to improve the capability of the UGKS in the aspects of efficiency increment, memory reduction, and physical modeling. The methodology of the direct modeling of the UGKS on discretization scale provides a general framework for construction of multiscale method for multiscale transport processes. This paper reviews the development and extension of the UGKS in its first decade.

preprint2021arXiv

Two-step multi-resolution reconstruction-based compact gas-kinetic scheme on tetrahedral mesh

In this paper, a third-order compact gas-kinetic scheme (GKS) on unstructured tetrahedral mesh is constructed for the compressible Euler and Navier-Stokes solutions. The time-dependent gas distribution function at a cell interface is used to calculate the fluxes for the updating the cell-averaged flow variables and to evaluate the time accurate cell-averaged flow variables as well for evolving the cell-averaged gradients of flow variables. With the accurate evolution model for both flow variables and their slopes, the quality of the scheme depends closely on the accuracy and reliability of the initial reconstruction of flow variables. The reconstruction scheme becomes more challenge on tetrahedral mesh, where the conventional second-order unlimited least-square reconstruction can make the scheme be linearly unstable when using cell-averaged conservative variables alone with von Neumann neighbors. Benefiting from the evolved cell-averaged slopes, on tetrahedral mesh the GKS is linearly stable from a compact third-order smooth reconstruction with a large CFL number. In order to further increase the robustness of the high-order compact GKS for capturing discontinuous solution, a new two-step multi-resolution weighted essentially non-oscillatory (WENO) reconstruction will be proposed. The novelty of the reconstruction includes the following. Firstly, it releases the stability issue from a second-order compact reconstruction through the introduction of a pre-reconstruction step. Secondly, in the third-order non-linear reconstruction, only one more large stencil is added beside those in the second-order one, which significantly simplifies the high-order reconstruction. The proposed third-order scheme shows good robustness in high speed flow computation and favorable mesh adaptability in cases with complex geometry.

preprint2021arXiv

Unified gas-kinetic wave-particle method for gas-particle two phase flow from dilute to dense solid-particle limit

In this paper, a unified framework for particulate two-phase flow will be presented with a wide range of solid-particle concentration from dilute to dense limit. The two phase flow is simulated by two coupled flow solvers, i.e., the gas-kinetic scheme (GKS) for the gas phase and unified gas-kinetic wave-particle method (UGKWP) for the particle phase. The GKS is a second-order Navier-Stokes flow solver for the continuum flow. The UGKWP is a multiscale method for all flow regimes. The wave and particle decomposition in UGKWP depends on the cell&#39;s Knudsen number (Kn). At a small Kn number, the high concentrated solid particle phase will be modeled by the Eulerian hydrodynamic wave due to the intensive particle-particle collisions. At a large Kn number, the dilute solid particle will be sampled and followed by the Lagrangian particle formulation to capture the non-equilibrium transport. In the transition regime, the distribution and evolution of particle and wave in UGKWP are controlled by the local Kn number with a smooth transition between the above limits. In the current scheme, the two phase model improves the previous one in all following aspects: drag force model for different solid particle concentrations; the frictional pressure in inter-particle contacts at high solid-particle concentration; a flux limiting model to avoid solid particles&#39; over-packing; additional non-conservative nozzle and work terms for the gas phase. Besides, the inter-particle collisions have been refined numerically for the dense particle flow through the discretization of the collision term and numerical flux function. The numerical scheme is tested in a series of typical gas-particle problems. The results validate the accuracy and reliability of the proposed method for gas-particle flow.

preprint2021arXiv

Unified gas-kinetic wave-particle methods VI: Disperse dilute gas-particle multiphase flow

In this paper, a unified gas-kinetic wave-particle scheme (UGKWP) for the disperse dilute gas-particle multiphase flow is proposed. The gas phase is always in the hydrodynamic regime. However, the particle phase covers different flow regimes from particle trajectory crossing to the hydrodynamic wave interaction with the variation of local particle phase Knudsen number. The UGKWP is an appropriate method for the capturing of the multiscale transport mechanism in the particle phase through its coupled wave-particle formulation. In the regime with intensive particle collision, the evolution of solid particle will be followed by the analytic wave with quasi-equilibrium distribution; while in the rarefied regime the non-equilibrium particle phase will be captured through particle tracking and collision, which plays a decisive role in recovering particle trajectory crossing behavior. The gas-kinetic scheme (GKS) is employed for the simulation of gas flow. In the highly collision regime for the particles, no particles will be sampled in UGKWP and the wave formulation for solid particle with the hydrodynamic gas phase will reduce the system to the two-fluid Eulerian model. On the other hand, in the collisionless regime for the solid particle, the free transport of solid particle will be followed in UGKWP, and coupled system will return to the Eulerian-Lagrangian formulation for the gas and particle. The scheme will be tested for in all flow regimes, which include the non-equilibrium particle trajectory crossing, the particle concentration under different Knudsen number, and the dispersion of particle flow with the variation of Stokes number. A experiment of shock-induced particle bed fluidization is simulated and the results are compared with experimental measurements. These numerical solutions validate suitability of the proposed scheme for the simulation of gas-particle multiphase flow.

preprint2020arXiv

A three-dimensional compact high-order gas-kinetic scheme on structured mesh

In this paper, a third-order compact gas-kinetic scheme is firstly proposed for three-dimensional computation for the compressible Euler and Navier-Stokes solutions. The scheme achieves its compactness due to the time-dependent gas distribution function in GKS, which provides not only the fluxes but also the time accurate flow variables in the next time level at a cell interface. As a result, the cell averaged first-order spatial derivatives of flow variables can be obtained naturally through the Gauss&#39;s theorem. Then, a third-order compact reconstruction involving the cell averaged values and their first-order spatial derivatives can be achieved. The trilinear interpolation is used to treat possible non-coplanar elements on general hexahedral mesh. The constrained least-square technique is applied to improve the accuracy in the smooth case. To deal with both smooth and discontinuous flows, a new HWENO reconstruction is designed in the current scheme by following the ideas in Zhu, 2018. No identification of troubled cells is needed in the current scheme. In contrast to the Riemann solver-based method, the compact scheme can achieve a third-order temporal accuracy with the two-stage two-derivative temporal discretization, instead of the three-stage Runge-Kutta method. Overall, the proposed scheme inherits the high accuracy and efficiency of the previous ones in two-dimensional case. The desired third-order accuracy can be obtained with curved boundary. The robustness of the scheme has been validated through many cases, including strong shocks in both inviscid and viscous flow computations. Quantitative comparisons for both smooth and discontinuous cases show that the current third-order scheme can give competitive results against the fifth-order non-compact GKS under the same mesh. A large CFL number around 0.5 can be used in the present scheme.

preprint2020arXiv

A Unified Gas-kinetic Scheme for Micro Flow Simulation Based on Linearized Kinetic Equation

The flow regime of micro flow varies from collisionless regime to hydrodynamic regime according to the Knudsen number. On the kinetic scale, the dynamics of micro flow can be described by the linearized kinetic equation. In the continuum regime, hydrodynamic equations such as linearized Navier-Stokes equations and Euler equations can be derived from the linearized kinetic equation by the Chapman-Enskog asymptotic analysis. In this paper, based on the linearized kinetic equation we are going to propose a unified gas kinetic scheme scheme (UGKS) for micro flow simulation, which is an effective multiscale scheme in the whole micro flow regime. The important methodology of UGKS is the following. Firstly, the evolution of microscopic distribution function is coupled with the evolution of macroscopic flow quantities. Secondly, the numerical flux of UGKS is constructed based on the integral solution of kinetic equation, which provides a genuinely multiscale and multidimensional numerical flux. The UGKS recovers the linear kinetic solution in the rarefied regime, and converges to the linear hydrodynamic solution in the continuum regime. An outstanding feature of UGKS is its capability of capturing the accurate viscous solution even when the cell size is much larger than the kinetic kinetic length scale, such as the capturing of the viscous boundary layer with a cell size ten times larger than the particle mean free path. Such a multiscale property is called unified preserving (UP) which has been studied in \cite{guo2019unified}. In this paper, we are also going to give a mathematical proof for the UP property of UGKS.

preprint2020arXiv

Comparison of the performance of high-order schemes based on the gas-kinetic and HLLC fluxes

In this paper, a comparison of the performance of two high-order finite volume methods based on the gas-kinetic scheme (GKS) and HLLC fluxes is carried out in structured rectangular mesh. For both schemes, the fifth-order WENO-AO reconstruction is adopted to achieve a high-order spatial accuracy. In terms of temporal discretization, a two-stage fourth-order (S2O4) time marching strategy is adopted for WENO5-AO-GKS scheme, and the fourth-order Runge-Kutta (RK4) method is employed for WENO5-AO-HLLC scheme. For the viscous flow computation, the GKS includes both inviscid and viscous fluxes in the evolution of a single cell interface gas distribution function. While for the WENO5-AO-HLLC scheme, the inviscid flux is provided by HLLC Riemann solver, and the viscous flux is discretized by a sixth-order central difference method. Based on the tests of forward Mach step and viscous shock tube, both schemes show outstanding shock capturing property. From the Titarev-Toro and double shear layer tests, WENO5-AO-GKS scheme seems to have a better resolution than WENO5-AO-HLLC scheme. Both schemes show excellent robustness in extreme cases, such as the Le Blanc problem. From the cases of the Noh problem and the compressible isotropic turbulence, WENO5-AO-GKS scheme shows favorite robustness. In the compressible isotropic turbulence and three-dimensional Taylor-Green vortex problems, WENO-AO-GKS can use a CFL number up to 0.5, instead of 0.3 for WENO5-AO-HLLC. In terms of computational efficiency, WENO5-AO-HLLC scheme is about 27% more expensive than WENO5-AO-GKS scheme in the two-dimensional viscous flow problems, but is about 15% faster in the three-dimensional case. Due to the multi-dimensionality, WENO5-AO-GKS scheme performs better than WENO5-AO-HLLC scheme in the laminar boundary layer and the double shear layer test.

preprint2020arXiv

Coordinated Reasoning for Cross-Lingual Knowledge Graph Alignment

Existing entity alignment methods mainly vary on the choices of encoding the knowledge graph, but they typically use the same decoding method, which independently chooses the local optimal match for each source entity. This decoding method may not only cause the &#34;many-to-one&#34; problem but also neglect the coordinated nature of this task, that is, each alignment decision may highly correlate to the other decisions. In this paper, we introduce two coordinated reasoning methods, i.e., the Easy-to-Hard decoding strategy and joint entity alignment algorithm. Specifically, the Easy-to-Hard strategy first retrieves the model-confident alignments from the predicted results and then incorporates them as additional knowledge to resolve the remaining model-uncertain alignments. To achieve this, we further propose an enhanced alignment model that is built on the current state-of-the-art baseline. In addition, to address the many-to-one problem, we propose to jointly predict entity alignments so that the one-to-one constraint can be naturally incorporated into the alignment prediction. Experimental results show that our model achieves the state-of-the-art performance and our reasoning methods can also significantly improve existing baselines.

preprint2020arXiv

Extracting the magnitude of magnetic field at freeze-out in heavy-ion collisions

A strong magnetic field influences significantly the masses of the charged light mesons. For example, the mass of charged pion increases with the magnetic field increasing. We propose this mechanism as a possible way to extract the magnitude of magnetic field at freeze-out in heavy ion collisions and thus help constrain its lifetime which is currently a major open question to resolve. Specifically we show that the ratio between the yield of charged pions and that of charged rhos is very sensitive to the magnetic field value at freeze-out. By using a viscous-hydrodynamic framework (iEBE-VISHNU) to simulate heavy ion collisions and implementing magnetic-field-dependent meson masses, we compute their yields and predict the dependence of such ratio on the magnetic field. We suggest to use this ratio of charged rho yield over charged pion yield as an experimental observable to extract the possible magnetic field at freeze-out in heavy ion collisions.

preprint2020arXiv

High-order gas-kinetic scheme with parallel computation for direct numerical simulation of turbulent flows

The performance of high-order gas-kinetic scheme (HGKS) has been investigated for the direct numerical simulation (DNS) of isotropic compressible turbulence up to the supersonic regime. Due to the multi-scale nature and coupled temporal-spatial evolution process, HGKS provides a valid tool for the numerical simulation of compressible turbulent flow. Based on the domain decomposition and message passing interface (MPI), a parallel HGKS code is developed for large-scale computation in this paper. The standard tests from the nearly incompressible flow to the supersonic one, including Taylor-Green vortex problem, turbulent channel flow and isotropic compressible turbulence, are presented to validate the parallel scalability, efficiency, accuracy and robustness of parallel implementation. The performance of HGKS for the nearly incompressible turbulence is comparable with the high-order finite difference scheme, including the resolution of flow structure and efficiency of computation. Based on the accuracy of the numerical solution, the numerical dissipation of the scheme in the turbulence simulation is quantitatively evaluated. As a mesoscopic method, HGKS performs better than both lattice Boltzmann method (LBM) and discrete unified gas-kinetic scheme (DUGKS), due to its high-order accuracy. Meanwhile, based on the kinetic formulation HGKS shows advantage for supersonic turbulent flow simulation with its accuracy and robustness. The current work demonstrates the capability of HGKS as a powerful DNS tool from the low speed to supersonic turbulence study, which is less reported under the framework of finite volume scheme.

preprint2020arXiv

Learning Implicit Generative Models by Teaching Explicit Ones

Implicit generative models are difficult to train as no explicit density functions are defined. Generative adversarial nets (GANs) present a minimax framework to train such models, which however can suffer from mode collapse due to the nature of the JS-divergence. This paper presents a learning by teaching (LBT) approach to learning implicit models, which intrinsically avoids the mode collapse problem by optimizing a KL-divergence rather than the JS-divergence in GANs. In LBT, an auxiliary density estimator is introduced to fit the implicit model&#39;s distribution while the implicit model teaches the density estimator to match the data distribution. LBT is formulated as a bilevel optimization problem, whose optimal generator matches the true data distribution. LBT can be naturally integrated with GANs to derive a hybrid LBT-GAN that enjoys complimentary benefits. Finally, we present a stochastic gradient ascent algorithm with unrolling to solve the challenging learning problems. Experimental results demonstrate the effectiveness of our method.

preprint2020arXiv

Mixup Inference: Better Exploiting Mixup to Defend Adversarial Attacks

It has been widely recognized that adversarial examples can be easily crafted to fool deep networks, which mainly root from the locally non-linear behavior nearby input examples. Applying mixup in training provides an effective mechanism to improve generalization performance and model robustness against adversarial perturbations, which introduces the globally linear behavior in-between training examples. However, in previous work, the mixup-trained models only passively defend adversarial attacks in inference by directly classifying the inputs, where the induced global linearity is not well exploited. Namely, since the locality of the adversarial perturbations, it would be more efficient to actively break the locality via the globality of the model predictions. Inspired by simple geometric intuition, we develop an inference principle, named mixup inference (MI), for mixup-trained models. MI mixups the input with other random clean samples, which can shrink and transfer the equivalent perturbation if the input is adversarial. Our experiments on CIFAR-10 and CIFAR-100 demonstrate that MI can further improve the adversarial robustness for the models trained by mixup and its variants.

preprint2020arXiv

Modeling and computation for non-equilibrium gas dynamics: beyond kinetic relaxation model

The non-equilibrium gas dynamics is described by the Boltzmann equation, which can be solved numerically through the deterministic and stochastic methods. Due to the complicated collision term of the Boltzmann equation, many kinetic relaxation models have been proposed and used in the past seventy years for the study of rarefied flow. In order to develop a multiscale method for the rarefied and continuum flow simulation, by adopting the integral solution of the kinetic model equation a DVM-type unified gas-kinetic scheme (UGKS) has been constructed. The UGKS models the gas dynamics on the cell size and time step scales while the accumulating effect from particle transport and collision has been taken into account within a time step. Under the UGKS framework, a unified gas-kinetic wave-particle (UGKWP) method has been further developed for non-equilibrium flow simulation, where the time evolution of gas distribution function is composed of analytical wave and individual particle. In the highly rarefied regime, particle transport and collision will play a dominant role. Due to the single relaxation time model for particle collision, there is a noticeable discrepancy between the UGKWP solution and the full Boltzmann or DSMC result, especially in the high Mach and Knudsen number cases. In this paper, besides the kinetic relaxation model, a modification of particle collision time according to the particle velocity will be implemented in UGKWP. As a result, the new model greatly improves the performance of UGKWP in the capturing of non-equilibrium flow. There is a perfect match between UGKWP and DSMC or Boltzmann solution in the highly rarefied regime. In the near continuum and continuum flow regime, the UGKWP will gradually get back to the macroscopic variables based Navier-Stokes flow solver at small cell Knudsen number.

preprint2020arXiv

Multiplex Word Embeddings for Selectional Preference Acquisition

Conventional word embeddings represent words with fixed vectors, which are usually trained based on co-occurrence patterns among words. In doing so, however, the power of such representations is limited, where the same word might be functionalized separately under different syntactic relations. To address this limitation, one solution is to incorporate relational dependencies of different words into their embeddings. Therefore, in this paper, we propose a multiplex word embedding model, which can be easily extended according to various relations among words. As a result, each word has a center embedding to represent its overall semantics, and several relational embeddings to represent its relational dependencies. Compared to existing models, our model can effectively distinguish words with respect to different relations without introducing unnecessary sparseness. Moreover, to accommodate various relations, we use a small dimension for relational embeddings and our model is able to keep their effectiveness. Experiments on selectional preference acquisition and word similarity demonstrate the effectiveness of the proposed model, and a further study of scalability also proves that our embeddings only need 1/20 of the original embedding size to achieve better performance.

preprint2020arXiv

On the Role of Conceptualization in Commonsense Knowledge Graph Construction

Commonsense knowledge graphs (CKGs) like Atomic and ASER are substantially different from conventional KGs as they consist of much larger number of nodes formed by loosely-structured text, which, though, enables them to handle highly diverse queries in natural language related to commonsense, leads to unique challenges for automatic KG construction methods. Besides identifying relations absent from the KG between nodes, such methods are also expected to explore absent nodes represented by text, in which different real-world things, or entities, may appear. To deal with the innumerable entities involved with commonsense in the real world, we introduce to CKG construction methods conceptualization, i.e., to view entities mentioned in text as instances of specific concepts or vice versa. We build synthetic triples by conceptualization, and further formulate the task as triple classification, handled by a discriminatory model with knowledge transferred from pretrained language models and fine-tuned by negative sampling. Experiments demonstrate that our methods can effectively identify plausible triples and expand the KG by triples of both new nodes and edges of high diversity and novelty.

preprint2020arXiv

Rethinking Softmax Cross-Entropy Loss for Adversarial Robustness

Previous work shows that adversarially robust generalization requires larger sample complexity, and the same dataset, e.g., CIFAR-10, which enables good standard accuracy may not suffice to train robust models. Since collecting new training data could be costly, we focus on better utilizing the given data by inducing the regions with high sample density in the feature space, which could lead to locally sufficient samples for robust learning. We first formally show that the softmax cross-entropy (SCE) loss and its variants convey inappropriate supervisory signals, which encourage the learned feature points to spread over the space sparsely in training. This inspires us to propose the Max-Mahalanobis center (MMC) loss to explicitly induce dense feature regions in order to benefit robustness. Namely, the MMC loss encourages the model to concentrate on learning ordered and compact representations, which gather around the preset optimal centers for different classes. We empirically demonstrate that applying the MMC loss can significantly improve robustness even under strong adaptive attacks, while keeping state-of-the-art accuracy on clean inputs with little extra computation compared to the SCE loss.

preprint2020arXiv

Robust Dialogue Utterance Rewriting as Sequence Tagging

The task of dialogue rewriting aims to reconstruct the latest dialogue utterance by copying the missing content from the dialogue context. Until now, the existing models for this task suffer from the robustness issue, i.e., performances drop dramatically when testing on a different domain. We address this robustness issue by proposing a novel sequence-tagging-based model so that the search space is significantly reduced, yet the core of this task is still well covered. As a common issue of most tagging models for text generation, the model&#39;s outputs may lack fluency. To alleviate this issue, we inject the loss signal from BLEU or GPT-2 under a REINFORCE framework. Experiments show huge improvements of our model over the current state-of-the-art systems on domain transfer.

preprint2020arXiv

Star Formation in Massive Galaxies at Redshift $z \sim 0.5$

It is believed that massive galaxies have quenched their star formation because of active galactic nucleus feedback. However, recent studies have shown that some massive galaxies are still forming stars. We analyze the morphology of star formation regions for galaxies of stellar mass larger than 10$^{11.3}$ M$_{\odot}$ at around redshift $z_r=0.5$ using $u-z$ color images. We find that about $20\%$ of the massive galaxies are star-forming (SF) galaxies, and most of them ($\sim 85\%$) have asymmetric structures induced by recent mergers. Moreover, for these asymmetric galaxies, we find that the asymmetry of the SF regions becomes larger for bluer galaxies. Using the Illustris simulation, we can qualitatively reproduce the observed relation between asymmetry parameter and color. Furthermore, using the merger trees in the simulation, we find a correlation between the color of the main branch galaxies at $z_r=0.5$ and the sum of the Star Formation Rates (SFRs) of the recently accreted galaxies, which implies that star formation of the accreted galaxies has contributed to the observed star formation of the massive (host) galaxies (ex situ star formation). Furthermore, we find two blue and symmetric galaxies, candidates for massive blue disks, in our observed sample, which indicates that about $\sim 10\%$ of massive SF galaxies are forming stars in the normal mode of disk star formation (in situ star formation). With the simulation, we find that the disk galaxies at $z_r \approx 0.5$ should have experienced few major mergers during the last 4.3 Gyrs.

preprint2020arXiv

TexSmart: A Text Understanding System for Fine-Grained NER and Enhanced Semantic Analysis

This technique report introduces TexSmart, a text understanding system that supports fine-grained named entity recognition (NER) and enhanced semantic analysis functionalities. Compared to most previous publicly available text understanding systems and tools, TexSmart holds some unique features. First, the NER function of TexSmart supports over 1,000 entity types, while most other public tools typically support several to (at most) dozens of entity types. Second, TexSmart introduces new semantic analysis functions like semantic expansion and deep semantic representation, that are absent in most previous systems. Third, a spectrum of algorithms (from very fast algorithms to those that are relatively slow but more accurate) are implemented for one function in TexSmart, to fulfill the requirements of different academic and industrial applications. The adoption of unsupervised or weakly-supervised algorithms is especially emphasized, with the goal of easily updating our models to include fresh data with less human annotation efforts. The main contents of this report include major functions of TexSmart, algorithms for achieving these functions, how to use the TexSmart toolkit and Web APIs, and evaluation results of some key algorithms.

preprint2020arXiv

Three dimensional high-order gas-kinetic scheme for supersonic isotropic turbulence II: coarse-grained analysis of compressible $K_{sgs}$ budget

The direct numerical simulation (DNS) of compressible isotropic turbulence up to the supersonic regime $Ma_{t} = 1.2$ has been investigated by high-order gas-kinetic scheme (HGKS) [{\it{Computers}} \& {\it{Fluids, 192, 2019}}]. In this study, the coarse-grained analysis of subgrid-scale (SGS) turbulent kinetic energy $K_{sgs}$ budget is fully analyzed for constructing one-equation SGS model in the compressible large eddy simulation (LES). The DNS on a much higher turbulent Mach number up to $Ma_{t} = 2.0$ has been obtained by HGKS, which confirms the super robustness of HGKS. Then, the exact compressible SGS turbulent kinetic energy $K_{sgs}$ transport equation is derived with density weighted filtering process. Based on the compressible $K_{sgs}$ transport equation, the coarse-grained processes are implemented on three sets of unresolved grids with the Box filter. The coarse-grained analysis of compressible $K_{sgs}$ budgets shows that all unresolved source terms are dominant terms in current system. Especially, the magnitude of SGS pressure-dilation term is in the order of SGS solenoidal dissipation term within the initial acoustic time scale. Therefore, it can be concluded that the SGS pressure-dilation term cannot be neglected as the previous work. The delicate coarse-grained analysis of SGS diffusion terms in compressible $K_{sgs}$ equation confirms that both the fluctuation velocity triple correlation term and the pressure-velocity correlation term are dominant terms. Current coarse-grained analysis gives an indication of the order of magnitude of all SGS terms in compressible $K_{sgs}$ budget, which provides a solid basis for compressible LES modeling in high Mach number turbulent flow.

preprint2020arXiv

To Relieve Your Headache of Training an MRF, Take AdVIL

We propose a black-box algorithm called {\it Adversarial Variational Inference and Learning} (AdVIL) to perform inference and learning on a general Markov random field (MRF). AdVIL employs two variational distributions to approximately infer the latent variables and estimate the partition function of an MRF, respectively. The two variational distributions provide an estimate of the negative log-likelihood of the MRF as a minimax optimization problem, which is solved by stochastic gradient descent. AdVIL is proven convergent under certain conditions. On one hand, compared with contrastive divergence, AdVIL requires a minimal assumption about the model structure and can deal with a broader family of MRFs. On the other hand, compared with existing black-box methods, AdVIL provides a tighter estimate of the log partition function and achieves much better empirical results.

preprint2020arXiv

Triple Generative Adversarial Networks

We propose a unified game-theoretical framework to perform classification and conditional image generation given limited supervision. It is formulated as a three-player minimax game consisting of a generator, a classifier and a discriminator, and therefore is referred to as Triple Generative Adversarial Network (Triple-GAN). The generator and the classifier characterize the conditional distributions between images and labels to perform conditional generation and classification, respectively. The discriminator solely focuses on identifying fake image-label pairs. Under a nonparametric assumption, we prove the unique equilibrium of the game is that the distributions characterized by the generator and the classifier converge to the data distribution. As a byproduct of the three-player mechanism, Triple-GAN is flexible to incorporate different semi-supervised classifiers and GAN architectures. We evaluate Triple-GAN in two challenging settings, namely, semi-supervised learning and the extreme low data regime. In both settings, Triple-GAN can achieve excellent classification results and generate meaningful samples in a specific class simultaneously. In particular, using a commonly adopted 13-layer CNN classifier, Triple-GAN outperforms extensive semi-supervised learning methods substantially on more than 10 benchmarks no matter data augmentation is applied or not.

preprint2020arXiv

Understanding and Stabilizing GANs&#39; Training Dynamics with Control Theory

Generative adversarial networks (GANs) are effective in generating realistic images but the training is often unstable. There are existing efforts that model the training dynamics of GANs in the parameter space but the analysis cannot directly motivate practically effective stabilizing methods. To this end, we present a conceptually novel perspective from control theory to directly model the dynamics of GANs in the function space and provide simple yet effective methods to stabilize GANs&#39; training. We first analyze the training dynamic of a prototypical Dirac GAN and adopt the widely-used closed-loop control (CLC) to improve its stability. We then extend CLC to stabilize the training dynamic of normal GANs, where CLC is implemented as a squared $L2$ regularizer on the output of the discriminator. Empirical results show that our method can effectively stabilize the training and obtain state-of-the-art performance on data generation tasks.

preprint2020arXiv

Unified Gas-kinetic Wave-Particle Method IV: Multi-species Gas Mixture and Plasma Transport

In this paper, we extend the unified gas-kinetic wave-particle (UGKWP) method to the multi-species gas mixture and multiscale plasma transport. The construction of the scheme is based on the direct modeling on the mesh size and time step scales, and the local cell&#39;s Knudsen number determines the flow physics. The proposed scheme has the multiscale and asymptotic complexity diminishing properties. The multiscale property means that according to cell&#39;s Knudsen number the scheme can capture the non-equilibrium flow physics in the rarefied flow regime, and preserve the asymptotic Euler, Navier-Stokes, and magnetohydrodynamics limit in the continuum regime. The asymptotic complexity diminishing property means that the total degree of freedom of the scheme automatically decreases as cell&#39;s Knudsen number decreases. In the continuum regime, the scheme automatically degenerates from a kinetic solver to a hydrodynamic solver. In UGKWP, the evolution of microscopic velocity distribution is coupled with the evolution of macroscopic variables, and the particle evolution as well as the macroscopic fluxes are modeled from the time accumulating solution up to a time step scale from the kinetic model equation. For plasma transport, current scheme provides a smooth transition from particle in cell (PIC) method in the rarefied regime to the magnetohydrodynamic (MHD) solver in the continuum regime. In the continuum limit, the cell size and time step of the UGKWP method is not restricted to be less than the mean free path and mean collision time. In the highly magnetized regime, the cell size and time step are not restricted by the Debye length and plasma cyclotron period. The multiscale and asymptotic complexity diminishing properties of the scheme are verified by numerical tests in multiple flow regimes.

preprint2019arXiv

A well-balanced gas kinetic scheme for Navier-Stokes equations with gravitational potential

The hydrostatic equilibrium state is the consequence of the exact hydrostatic balance between hydrostatic pressure and external force. Standard finite volume or finite difference schemes cannot keep this balance exactly due to their unbalanced truncation errors. In this study, we introduce an auxiliary variable which becomes constant at isothermal hydrostatic equilibrium state and propose a well-balanced gas kinetic scheme for the Navier-Stokes equations with a global reconstruction. Through reformulating the convection term and the force term via the auxiliary variable, zero numerical flux and zero numerical source term are enforced at the hydrostatic equilibrium state instead of the balance between hydrostatic pressure and external force. Several problems are tested numerically to demonstrate the accuracy and the stability of the new scheme, and the results confirm that, the new scheme can preserve the exact hydrostatic solution. The small perturbation riding on hydrostatic equilibria can be calculated accurately. The viscous effect is also illustrated through the propagation of small perturbation and the Rayleigh-Taylor instability. More importantly, the new scheme is capable of simulating the process of converging towards hydrostatic equilibrium state from a highly non-balanced initial condition. The ultimate state of zero velocity and constant temperature is achieved up to machine accuracy. As demonstrated by the numerical experiments, the current scheme is very suitable for small amplitude perturbation and long time running under gravitational potential.

preprint2019arXiv

Efficient training and design of photonic neural network through neuroevolution

Recently, optical neural networks (ONNs) integrated in photonic chips has received extensive attention because they are expected to implement the same pattern recognition tasks in the electronic platforms with high efficiency and low power consumption. However, the current lack of various learning algorithms to train the ONNs obstructs their further development. In this article, we propose a novel learning strategy based on neuroevolution to design and train the ONNs. Two typical neuroevolution algorithms are used to determine the hyper-parameters of the ONNs and to optimize the weights (phase shifters) in the connections. In order to demonstrate the effectiveness of the training algorithms, the trained ONNs are applied in the classification tasks for iris plants dataset, wine recognition dataset and modulation formats recognition. The calculated results exhibit that the training algorithms based on neuroevolution are competitive with other traditional learning algorithms on both accuracy and stability. Compared with previous works, we introduce an efficient training method for the ONNs and demonstrate their broad application prospects in pattern recognition, reinforcement learning and so on.

preprint2019arXiv

High-order ALE gas-kinetic scheme with unstructured WENO reconstruction

In this paper, a high-order multi-dimensional gas-kinetic scheme is presented for both inviscid and viscous flows in arbitrary Lagrangian-Eulerian (ALE) formulation. Compared with the traditional ALE method, the flow variables are updated in the finite volume framework, and the rezoning and remapping steps are not required. The two-stage fourth-order method is used for the temporal discretization, and the second-order gas-kinetic solver is applied for the flux evaluation. In the two-stage method, the spatial reconstruction is performed at both initial and intermediate stage, and the computational mesh at the corresponding stage is given by the specified mesh velocity. In the moving mesh procedure, the mesh may distort severely and the mesh quality is reduced. To achieve the accuracy and improve the robustness, the newly developed WENO method \cite{un-WENO3} on quadrilateral unstructured meshes is adopted at each stage. The Gaussian quadrature is used for flux calculation. For each Gaussian point, the reconstruction performed in the local moving coordinate, where the variation of mesh velocity is taken into account. Therefore, the accuracy and geometric conservation law can be well preserved by the current scheme even with the largely deforming mesh. Numerical examples are presented to validate the performance of current scheme, where the mesh adaptation method and cell centered Lagrangian method are used to provide mesh velocity.

preprint2019arXiv

Improved Decoding of Staircase Codes: The Soft-aided Bit-marking (SABM) Algorithm

Staircase codes (SCCs) are typically decoded using iterative bounded-distance decoding (BDD) and hard decisions. In this paper, a novel decoding algorithm is proposed, which partially uses soft information from the channel. The proposed algorithm is based on marking certain number of highly reliable and highly unreliable bits. These marked bits are used to improve the miscorrection-detection capability of the SCC decoder and the error-correcting capability of BDD. For SCCs with $2$-error-correcting Bose-Chaudhuri-Hocquenghem component codes, our algorithm improves upon standard SCC decoding by up to $0.30$~dB at a bit-error rate (BER) of $10^{-7}$. The proposed algorithm is shown to achieve almost half of the gain achievable by an idealized decoder with this structure. A complexity analysis based on the number of additional calls to the component BDD decoder shows that the relative complexity increase is only around $4\%$ at a BER of $10^{-4}$. This additional complexity is shown to decrease as the channel quality improves. Our algorithm is also extended (with minor modifications) to product codes. The simulation results show that in this case, the algorithm offers gains of up to $0.44$~dB at a BER of $10^{-8}$.

preprint2019arXiv

Machine learning and evolutionary algorithm studies of graphene metamaterials for optimized plasmon-induced transparency

Machine learning and optimization algorithms have been widely applied in the design and optimization for photonic devices. In this article, we briefly review recent progress of this field of research and show some data-driven applications (e.g. spectrum prediction, inverse design and performance optimization) for novel graphene metamaterials (GMs). The structure of the GMs is well-designed to achieve the wideband plasmon induced transparency effect, which is regarded as optimization object and can be theoretically demonstrated by using transfer matrix method. Some classical machine learning algorithms, including k nearest neighbour, decision tree, random forest and artificial neural networks, are utilized to equivalently substitute the numerical simulation in the forward spectrum prediction and complete the inverse design for the GMs. The calculated results demonstrate that all the algorithms are effective and the random forest has advantages in terms of accuracy and training speed. Moreover, the single-objective and multi-objective optimization algorithms are used to achieve steep transmission characteristics by synthetically taking many performance metrics into consideration. The maximum difference between the transmission peaks and dips in the optimized transmission spectrum can reach 0.97. In comparison to previous works, we provide a guidance for intelligent design of photonic devices and advanced materials based on machine learning and evolutionary algorithms.

preprint2019arXiv

Ray Effect in Rarefied Flow Simulation

Ray effect usually appears in the radiative transfer when using discrete ordinates method (DOM) in the simulations. The cause and remedy for the ray effect have been intensively investigated in the radiation community. For rarefied gas flow, the ray effect is also associated with the discrete velocity method (DVM). However, few studies have been carried out in the rarefied community. In this paper, we take a detailed investigation of the ray effect in the rarefied flow simulations. Starting from a few commonly used benchmark tests, the root of the ray effect has been analyzed theoretically and validated numerically. At the same time, the influence of the ray effect on the quality of the numerical results of rarefied flow is estimated quantitatively. After understanding the nature of the ray effect, the strategy to minimize the ray effect through the discretization of the particle velocity space is presented and applied in the numerical simulations. An optimal velocity discretization for DVM is problem dependent and can be hardly obtained in the complex flow simulations. Due to the intrinsic self-adaptation of particle velocity, the stochastic particle methods are free from the ray effect. In rarefied regimes, the particle method seems more appropriate in the capturing of highly non-equilibrium flow behavior.

preprint2019arXiv

Unified Gas-kinetic Wave-Particle Methods II: Multiscale Simulation on Unstructured Mesh

In this paper, we present a unified gas-kinetic wave-particle (UGKWP) method on unstructured mesh for multiscale simulation of continuum and rarefied flow. Inheriting from the multicale transport in the unified gas-kinetic scheme (UGKS), the integral solution of kinetic model equation is employed in the construction of UGKWP method to model the flow physics in the cell size and time step scales. A novel wave-particle adaptive formulation is introduced in the UGKWP method to describe the flow dynamics in each control volume. The local gas evolution is constructed through the dynamical interaction of the deterministic hydrodynamic wave and the stochastic kinetic particle. Within the resolution of cell size and time step, the decomposition, interaction, and evolution of the hydrodynamic wave and the kinetic particle depend on the ratio of the time step to the local particle collision time. In the rarefied flow regime, the flow physics is mainly recovered by the discrete particles and the UGKWP method performs as a stochastic particle method. In the continuum flow regime, the flow behavior is solely followed by macroscopic variable evolution and the UGKWP method becomes a gas-kinetic hydrodynamic flow solver for the viscous and heat-conducting Navier--Stokes solutions. In different flow regimes, many numerical test cases are computed to validate the UGKWP method on unstructured mesh. The UGKWP method can get the same UGKS solutions in all Knudsen regimes without the requirement of the time step and mesh size being less than than the particle collision time and mean free path. With an automatic wave-particle decomposition, the UGKWP method becomes very efficient. For example, at Mach number 30 and Knudsen number 0.1, in comparison with UGKS several-order-of-magnitude reductions in computational cost and memory requirement have been achieved by UGKWP.

preprint2019arXiv

Unified Gas-kinetic Wave-Particle Methods III: Multiscale Photon Transport

In this paper, we extend the unified gas-kinetic wave-particle (UGKWP) method to the multiscale photon transport. In this method, the photon free streaming and scattering processes are treated in an un-splitting way. The duality descriptions, namely the simulation particle and distribution function, are utilized to describe the photon. By accurately recovering the governing equations of the unified gas-kinetic scheme (UGKS), the UGKWP preserves the multiscale dynamics of photon transport from optically thin to optically thick regime. In the optically thin regime, the UGKWP becomes a Monte Carlo type particle tracking method, while in the optically thick regime, the UGKWP becomes a diffusion equation solver. The local photon dynamics of the UGKWP, as well as the proportion of wave-described and particle-described photons are automatically adapted according to the numerical resolution and transport regime. Compared to the $S_n$ -type UGKS, the UGKWP requires less memory cost and does not suffer ray effect. Compared to the implicit Monte Carlo (IMC) method, the statistical noise of UGKWP is greatly reduced and computational efficiency is significantly improved in the optically thick regime. Several numerical examples covering all transport regimes from the optically thin to optically thick are computed to validate the accuracy and efficiency of the UGKWP method. In comparison to the $S_n$ -type UGKS and IMC method, the UGKWP method may have several-order-of-magnitude reduction in computational cost and memory requirement in solving some multsicale transport problems.

preprint2018arXiv

Decoding Staircase Codes with Marked Bits

Staircase codes (SCCs) are typically decoded using iterative bounded-distance decoding (BDD) and hard decisions. In this paper, a novel decoding algorithm is proposed, which partially uses soft information from the channel. The proposed algorithm is based on marking certain number of highly reliable and highly unreliable bits. These marked bits are used to improve the miscorrection-detection capability of the SCC decoder and the error-correcting capability of BDD. For SCCs with $2$-error-correcting BCH component codes, our algorithm improves upon standard SCC decoding by up to $0.30$~dB at a bit-error rate of $10^{-7}$. The proposed algorithm is shown to achieve almost half of the gain achievable by an idealized decoder with this structure.

preprint2017arXiv

A Compact Fourth-order Gas-kinetic Scheme for the Euler and Navier-Stokes Solutions

In this paper, a fourth-order compact gas-kinetic scheme (GKS) is developed for the compressible Euler and Navier-Stokes equations under the framework of two-stage fourth-order temporal discretization and Hermite WENO (HWENO) reconstruction. Due to the high-order gas evolution model, the GKS provides a time dependent gas distribution function at a cell interface. This time evolution solution can be used not only for the flux evaluation across a cell interface and its time derivative, but also time accurate evolution solution at a cell interface. As a result, besides updating the conservative flow variables inside each control volume, the GKS can get the cell averaged slopes inside each control volume as well through the differences of flow variables at the cell interfaces. So, with the updated flow variables and their slopes inside each cell, the HWENO reconstruction can be naturally implemented for the compact high-order reconstruction at the beginning of next step. Therefore, a compact higher-order GKS, such as the two-stages fourth-order compact scheme can be constructed. This scheme is as robust as second-order one, but more accurate solution can be obtained. In comparison with compact fourth-order DG method, the current scheme has only two stages instead of four within each time step for the fourth-order temporal accuracy, and the CFL number used here can be on the order of $0.5$ instead of $0.11$ for the DG method. Through this research, it concludes that the use of high-order time evolution model rather than the first order Riemann solution is extremely important for the design of robust, accurate, and efficient higher-order schemes for the compressible flows.