Source author record

Kun Xu

Kun Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

80works

27topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

BitLM: Unlocking Multi-Token Language Generation with Bitwise Continuous Diffusion

Autoregressive language models generate text one token at a time, yet natural language is inherently structured in multi-token units, including phrases, n-grams, and collocations that carry meaning jointly. This one-token bottleneck limits both the expressiveness of the model during pre-training and its throughput at inference time. Existing remedies such as speculative decoding or diffusion-based language models either leave the underlying bottleneck intact or sacrifice the causal structure essential to language modeling. We propose BitLM, a language model that represents each token as a fixed-length binary code and employs a lightweight diffusion head to denoise multiple tokens in parallel within each block. Crucially, BitLM preserves left-to-right causal attention across blocks while making joint lexical decisions within each block, combining the reliability of autoregressive modeling with the parallelism of iterative refinement. By replacing the large-vocabulary softmax with bitwise denoising, BitLM reframes token generation as iterative commitment in a compact binary space, enabling more efficient pre-training and substantially faster inference without altering the causal foundation that makes language models effective. Our results demonstrate that the one-token-at-a-time paradigm is not a fundamental requirement but an interface choice, and that changing it can yield a stronger and faster language model. We hope BitLM points toward a promising direction for next-generation language model architectures.

preprint2025arXiv

A Gas-Kinetic Scheme for Maxwell Equations

The Gas-Kinetic Scheme (GKS), widely used in computational fluid dynamics for simulating hypersonic and other complicated flow phenomena, is extended in this work to electromagnetic problems by solving Maxwell's equations. In contrast to the classical GKS formulation, the proposed scheme employs a discrete rather than a continuous velocity space. By evaluating a time-accurate numerical flux at cell interfaces, the proposed scheme attains second-order accuracy within a single step. Its kinetic formulation provides an inherently multidimensional framework, while the finite-volume formulation ensures straightforward extension to unstructured meshes. Through the incorporation of a collision process, the scheme exhibits lower numerical dissipation than classical flux-vector splitting (FVS) methods. Furthermore, the kinetic decomposition enables direct implementation of non-reflecting boundary conditions. The proposed scheme is validated against several benchmark problems and compared with established methods, including the Finite-Difference Time-Domain (FDTD) method and FVS. A lattice Boltzmann method (LBM) implementation is also included for comparative analysis. Finally, the technique is applied to simulate electromagnetic wave propagation in a realistic aircraft configuration, demonstrating its ability to model complex geometries.

preprint2024arXiv

Formation of PSR J1012+5307 with an extremely low-mass white dwarf: testing magnetic braking models

PSR J1012+5307 is a millisecond pulsar with an extremely low-mass (ELM) white dwarf (WD) companion in an orbit of 14.5 hours. Magnetic braking (MB) plays an important role in influencing the orbital evolution of binary systems with a low-mass ($\lt 1-2~M_{\odot}$) donor star. At present, there exist several different MB descriptions. In this paper, we investigate the formation of PSR J1012+5307 as a probe to test the plausible MB model. Employing a detailed stellar evolution model by the MESA code, we find that the Convection And Rotation Boosted MB and the 'Intermediate' MB models can reproduce the WD mass, WD radius, WD surface gravity, neutron-star mass, and orbital period observed in PSR J1012+5307. However, our simulated WD has higher effective temperature than the observation. Other three MB mechanisms including the standard MB model are too weak to account for the observed orbital period in a Hubble time. A long cooling timescale caused by H-shell flashes of the WD may alleviate the discrepancy between the simulated effective temperature and the observed value.

preprint2024arXiv

High-order compact gas-kinetic scheme in arbitrary Lagrangian-Eulerian formulation

This study proposes an extension of the high-order compact gas-kinetic scheme (CGKS) to compressible flow simulation in an arbitrary Lagrangian-Eulerian (ALE) formulation in unstructured mesh. The ALE method is achieved by subdividing arbitrary mesh into tetrahedrons and integrating flux function in a local coordinate system at the cell interface to ensure geometric conservation law. The scheme incorporates a compact reconstruction with third-order accuracy for updating both cell-averaged conservative flow variables and their gradients. HWENO-type nonlinear reconstruction and gradient compression factors are adopted to improve the accuracy and robustness of the scheme. A multi-stage multi-derivative (MSMD) time-stepping method is also implemented to achieve high-order time accuracy with fewer middle stages. The scheme is used to study problems involving moving boundaries. The numerical experiments demonstrate the effectiveness of the scheme in capturing the accurate solutions of both low-speed smooth flow and highly compressible ones with strong shock waves.

preprint2022arXiv

60-nm-span wavelength-tunable vortex fiber laser with intracavity plasmon metasurfaces

Wavelength-tunable vortex fiber lasers that could generate beams carrying orbital angular momentum (OAM) hold great interest in large-capacity optical communications. The wavelength tunability of conventional vortex fiber lasers is however limited by the range of 35 nm due to narrow bandwidth and/or insertion loss of mode conversion components. Optical metasurfaces apart from being compact planar components can flexibly manipulate light with high efficiency in a broad wavelength range. Here, we propose and demonstrate for the first time, to the best of our knowledge, a metasurface-assisted vortex fiber laser that can directly generate OAM beams with changeable topological charges. Due to the designed broadband gap-surface plasmon metasurface, combined with an intracavity tunable filter, the laser enables OAM beam with center wavelength continuously tunable from 1015 nm to 1075 nm, nearly twice of other vortex fiber lasers ever reported. The metasurface can be designed at will to satisfy requirements for either low pump threshold or high slope efficiency of the laser. Furthermore, the cavity-metasurface configuration can be extended to generate higher-order OAM beams or more complex structured beams in different wavelength regions, which greatly broadens the possibilities for developing low-cost and high-quality structured-beam laser sources.

preprint2022arXiv

A Survey of Adversarial Learning on Graphs

Deep learning models on graphs have achieved remarkable performance in various graph analysis tasks, e.g., node classification, link prediction, and graph clustering. However, they expose uncertainty and unreliability against the well-designed inputs, i.e., adversarial examples. Accordingly, a line of studies has emerged for both attack and defense addressed in different graph analysis tasks, leading to the arms race in graph adversarial learning. Despite the booming works, there still lacks a unified problem definition and a comprehensive review. To bridge this gap, we investigate and summarize the existing works on graph adversarial learning tasks systemically. Specifically, we survey and unify the existing works w.r.t. attack and defense in graph analysis tasks, and give appropriate definitions and taxonomies at the same time. Besides, we emphasize the importance of related evaluation metrics, investigate and summarize them comprehensively. Hopefully, our works can provide a comprehensive overview and offer insights for the relevant researchers. Latest advances in graph adversarial learning are summarized in our GitHub repository https://github.com/EdisonLeeeee/Graph-Adversarial-Learning.

preprint2022arXiv

Artificial Cnoidal Wave Breathers in Optical Microresonators

Breathers are localized structures that undergo a periodic oscillation in their duration and amplitude. Optical microresonators, benefiting from their high quality factor, provide an ideal test bench for studying the breathing phenomena. In the monochromatically pumped microresonator system, intrinsic breathing instabilities are widely observed in the form of temporal dissipative Kerr solitons which only exist in the effectively red detuned regime. Here, we proposed a novel bichromatic pumping scheme to create compulsive breathing microcombs via respectively distributing two pump lasers at the effectively blue and red detuned side of a single resonance. We experimentally discover the artificial cnoidal wave breathers and molecular crystal-like breathers in a chip-based silicon nitride microresonator, and theoretically describe their intriguing temporal dynamics based on the bichromatic pumping Lugiato-Lefever equation. In particular, the corresponding breathing microcombs exhibit diverse comb line spacing ranging from 2 to 17 times of the free spectral range of the resonator. Our discovery not only provides a simple and robust method to produce microcombs with reconfigurable comb line spacing, but also reveals a new type of breathing waves in driven dissipative nonlinear systems.

preprint2022arXiv

Construct the emission line galaxy-host halo connection through auto and cross correlations

We investigate the [O\,II] emission line galaxy (ELG)-host halo connection via auto and cross correlations, and propose a concise and effective method to populate ELGs in dark matter halos without assuming a parameterized halo occupation distribution (HOD) model. Using the observational data from VIMOS Public Extragalactic Redshift Survey (VIPERS), we measure the auto and cross correlation functions between ELGs selected by [O\,II] luminosity and normal galaxies selected by stellar mass. Combining the stellar-halo mass relation (SHMR) derived for the normal galaxies and the fraction of ELGs observed in the normal galaxy population, we demonstrate that we can establish an accurate ELG-halo connection. With the ELG-halo connection, we can accurately reproduce the auto and cross correlation functions of ELGs and normal galaxies both in real-space and in redshift-space, once the satellite fraction is properly reduced. Our method provides a novel strategy to generate ELG mock catalogs for ongoing and upcoming galaxy redshift surveys. We also provide a simple description for the HOD of ELGs.

preprint2022arXiv

Distant finetuning with discourse relations for stance classification

Approaches for the stance classification task, an important task for understanding argumentation in debates and detecting fake news, have been relying on models which deal with individual debate topics. In this paper, in order to train a system independent from topics, we propose a new method to extract data with silver labels from raw text to finetune a model for stance classification. The extraction relies on specific discourse relation information, which is shown as a reliable and accurate source for providing stance information. We also propose a 3-stage training framework where the noisy level in the data used for finetuning decreases over different stages going from the most noisy to the least noisy. Detailed experiments show that the automatically annotated dataset as well as the 3-stage training help improve model performance in stance classification. Our approach ranks 1st among 26 competing teams in the stance classification track of the NLPCC 2021 shared task Argumentative Text Understanding for AI Debater, which confirms the effectiveness of our approach.

preprint2022arXiv

DyLex: Incorporating Dynamic Lexicons into BERT for Sequence Labeling

Incorporating lexical knowledge into deep learning models has been proved to be very effective for sequence labeling tasks. However, previous works commonly have difficulty dealing with large-scale dynamic lexicons which often cause excessive matching noise and problems of frequent updates. In this paper, we propose DyLex, a plug-in lexicon incorporation approach for BERT based sequence labeling tasks. Instead of leveraging embeddings of words in the lexicon as in conventional methods, we adopt word-agnostic tag embeddings to avoid re-training the representation while updating the lexicon. Moreover, we employ an effective supervised lexical knowledge denoising method to smooth out matching noise. Finally, we introduce a col-wise attention based knowledge fusion mechanism to guarantee the pluggability of the proposed framework. Experiments on ten datasets of three tasks show that the proposed framework achieves new SOTA, even with very large scale lexicons.

preprint2022arXiv

High-order Compact Gas-kinetic Schemes for Three-dimensional Flow Simulation on Tetrahedral Mesh

A general framework for the development of high-order compact schemes has been proposed recently. The core steps of the schemes are composed of the following. 1). Based on a kinetic model equation, from a generalized initial distribution of flow variables construct a time-accurate evolution solution of gas distribution function at a cell interface ; 2). Introduce the WENO-type weighting functions into the time-derivative of the cell interface flux function in the multistage multi-derivative time stepping scheme to cope with the possible impingement of a shock wave on a cell interface within a time step; 3). Take moments of interface gas distribution function to obtain the time-accurate flow variables and the corresponding fluxes at the cell interface, and update the cell-averaged flow variables and their gradients inside each control volume; 4). Within the physical domain of dependence of the reconstructed cell, based on the cell-averaged flow variables and their gradients develop compact initial data reconstruction to get initial flow distributions at the beginning of next time step. A compact gas-kinetic scheme (GKS) up to sixth-order accuracy in space and fourth-order in time has been constructed on 2D unstructured mesh before. In this paper, the compact GKS up to fourth-order accuracy on 3D tetrahedral mesh will be further constructed with the focus on the WENO-type initial data reconstruction. Nonlinear weights are designed to achieve high-order accuracy for the smooth Navier-Stokes solution and keep super robustness in 3D computation with strong shock interactions. The fourth-order compact GKS can use a large time step with CFL number $0.6$ in the simulations from subsonic to hypersonic flow. A series of test cases are used to validate the scheme. The high-order compact GKS is ready for 3D applications with complex geometry.

preprint2022arXiv

High-order Gas-kinetic Schemes with Non-compact and Compact Reconstruction for Implicit Large Eddy Simulation

High-order gas-kinetic scheme (HGKS) with 5th-order non-compact reconstruction has been well implemented for implicit large eddy simulation (ILES) in nearly incompressible turbulent channel flows. In this study, the HGKS with higher-order non-compact reconstruction and compact reconstruction will be validated in turbulence simulation. For higher-order non-compact reconstruction, 7th-order normal reconstruction and tangential reconstruction are implemented. In terms of compact reconstruction, 5th-order normal reconstruction is adopted. Current work aims to show the benefits of high-order non-compact reconstruction and compact reconstruction for ILES. The accuracy of HGKS is verified by numerical simulation of three-dimensional advection of density perturbation. For the non-compact 7th-order scheme, 16 Gaussian points are required on the cell interface to preserve the order of accuracy. Then, HGKS with non-compact and compact reconstruction is used in the three-dimensional Taylor-Green vortex (TGV) problem and turbulent channel flows. Accurate ILES solutions have been obtained from HGKS. In terms of the physical modeling underlying the numerical algorithms, the compact reconstruction has the consistent physical and numerical domains of dependence without employing additional information from cells which have no any direct physical connection with the targeted cell. The compact GKS shows a favorable performance for turbulence simulation in resolving the multi-scale structures.

preprint2022arXiv

Magnetism of QCD matter and pion mass from tensor-type spin polarization and anomalous magnetic moment of quarks

We investigate the magnetism of QCD matter and pion mass under magnetic field considering the contribution from the tensor-type spin polarization and the anomalous magnetic moment (AMM) of quarks. It is found that the tensor-type spin polarization (TSP) induces the magnetic catalysis of chiral condensate and diamagnetism (negative magnetic susceptibility) of quark matter at low temperature, both neutral and charged pion masses increase quickly with magnetic field in the case of TSP. The anomalous magnetic moment (AMM) of quarks induces magnetic inhibition and a magnetic dependent AMM causes inverse magnetic catalysis at finite temperature, and the neutral pion mass decreases with magnetic field while the charged pion mass shows nonmonotonic behavior with the magnetic field, which is qualitatively in agreement with lattice result. However, the magnetic susceptibility is positive at low temperature with AMM. In the current framework, our results show the irreconcilable contradiction between the diamagnetism and inverse magnetic catalysis.

preprint2022arXiv

Neuromorphic computing using wavelength-division multiplexing

Optical neural networks (ONNs), or optical neuromorphic hardware accelerators, have the potential to dramatically enhance the computing power and energy efficiency of mainstream electronic processors, due to their ultralarge bandwidths of up to 10s of terahertz together with their analog architecture that avoids the need for reading and writing data back and forth. Different multiplexing techniques have been employed to demonstrate ONNs, amongst which wavelength division multiplexing (WDM) techniques make sufficient use of the unique advantages of optics in terms of broad bandwidths. Here, we review recent advances in WDM based ONNs, focusing on methods that use integrated microcombs to implement ONNs. We present results for human image processing using an optical convolution accelerator operating at 11 Tera operations per second. The open challenges and limitations of ONNs that need to be addressed for future applications are also discussed.

preprint2022arXiv

Photometric Objects around Cosmic Webs (PAC) Delineated in a Spectroscopic Survey. I. Methods

We provide a method for estimating the projected density distribution $\bar{n}_2w_p(r_p)$ of photometric objects around spectroscopic objects in a redshift survey. This quantity describes the distribution of Photometric sources with certain physical properties (e.g. luminosity, mass, color etc) Around Cosmic webs (PAC) traced by the spectroscopic objects. The method can make full use of current and future deep and wide photometric surveys to explore the formation of galaxies up to medium redshift ($z_s < 2$), with the aid of cosmological redshift surveys that sample only a fairly limited species of objects (e.g. Emission Line Galaxies). As an example, we apply the PAC method to the CMASS spectroscopic and HSC-SSP PDR2 photometric samples to explore the distribution of galaxies for a wide range of stellar mass from $10^{9.0}{\rm M_\odot}$ to $10^{12.0}{\rm M_\odot}$ around massive ones at $z_s\approx 0.6$. Using the abundance matching method, we model $\bar{n}_2w_p(r_p)$ in N-body simulation using MCMC sampling, and accurately measure the stellar-halo mass relation (SHMR) and stellar mass function (SMF) for the whole mass range. We can also measure the conditional stellar mass function (CSMF) of satellites for central galaxies of different mass. The PAC method has many potential applications for studying the evolution of galaxies.

preprint2022arXiv

Satellite galaxies' drag on field stars in the Milky Way

With Gaia EDR3 data, velocity dispersion of Milky Way field stars around satellite galaxies have been investigated. We have fitted velocity dispersion against distance to satellite galaxy and found the gradient of velocity dispersion is related to the mass of satellite galaxy. With order-of-magnitude approximations, a linear correlation has been fitted between the mass of satellite galaxy and gradient of velocity dispersion caused by its gravitational drag. Though our result is an observational qualitative result, it shows better relation could be obtained with more observations in the future.

preprint2022arXiv

Semantic optical fiber communication system

The current optical communication systems minimize bit or symbol errors without considering the semantic meaning behind digital bits, thus transmitting a lot of unnecessary information. We propose and experimentally demonstrate a semantic optical fiber communication (SOFC) system. Instead of encoding information into bits for transmission, semantic information is extracted from the source using deep learning. The generated semantic symbols are then directly transmitted through an optical fiber. Compared with the bit-based structure, the SOFC system achieved higher information compression and a more stable performance, especially in the low received optical power regime, and enhanced the robustness against optical link impairments. This work introduces an intelligent optical communication system at the human analytical thinking level, which is a significant step toward a breakthrough in the current optical communication architecture.

preprint2022arXiv

Three-dimensional third-order gas-kinetic scheme on hybrid unstructured meshes for Euler and Navier-Stokes equations

In this paper, a third order gas kinetic scheme is developed on the three dimensional hybrid unstructured meshes for the numerical simulation of compressible inviscid and viscous flows. A third-order WENO reconstruction is developed on the hybrid unstructured meshes, including tetrahedron, pyramid, prism and hexahedron. A simple strategy is adopted for the selection of big stencil and sub-stencils. Incorporate with the two-stage fourth-order temporal discretization and lower-upper symmetric Gauss-Seidel methods, both explicit and implicit high-order gas-kinetic schemes are developed. A variety of numerical examples, from the subsonic to supersonic flows, are presented to validate the accuracy and robustness for both inviscid and viscous flows.

preprint2022arXiv

UGKWP for three-dimensional simulation of gas-particle fluidized bed

The gas-solid particle two-phase flow in a fluidized bed shows complex physics. Following our previous work, the multi-scale framework based on gas-kinetic scheme (GKS) and unified gas-kinetic wave-particle method (UGKWP) for the gas-particle system is firstly extended to the three-dimensional simulation of the fluidized bed. For the solid particle evolution, different from the widely-used Eulerian and Lagrangian approaches, the UGKWP unifies the wave (dense particle region) and discrete particle (dilute particle region) formulation seamlessly according to a continuous variation of particle cell's Kundsen number ($Kn$). The GKS-UGKWP for the coupled gas-particle evolution system can automatically become an Eulerian-Eulerian (EE) method in the high particle collision regime and Eulerian-Lagrangian (EL) formulation in the collisionless particle regime. In the transition regime, the UGKWP can achieve a smooth transition between the Eulerian and Lagrangian limiting formulation. More importantly, the weights of mass distributions from analytical wave and discrete particle are related to the local $Kn$ by $\exp(-1/Kn)$ for wave and $(1-\exp(-1/Kn))$ for discrete particle. As a result, the UGKWP provides an optimal modeling for capturing the particle phase in terms of physical accuracy and numerical efficiency. In the numerical simulation, the UGKWP does not need any prior division of dilute/dense regions, which makes it suitable for the fluidized bed problem, where the dilute/transition/dense regions instantaneously coexist and are dynamically interconvertible. In this paper, based on the GKS-UGKWP formulation two lab-scale fluidization cases are simulated in 3D and the simulation results are compared with the experimental measurements. The typical heterogeneous flow features of the fluidized bed are well captured and the statistics are in good agreement with experiment data.

preprint2022arXiv

Zero-shot Cross-lingual Conversational Semantic Role Labeling

While conversational semantic role labeling (CSRL) has shown its usefulness on Chinese conversational tasks, it is still under-explored in non-Chinese languages due to the lack of multilingual CSRL annotations for the parser training. To avoid expensive data collection and error-propagation of translation-based methods, we present a simple but effective approach to perform zero-shot cross-lingual CSRL. Our model implicitly learns language-agnostic, conversational structure-aware and semantically rich representations with the hierarchical encoders and elaborately designed pre-training objectives. Experimental results show that our model outperforms all baselines by large margins on two newly collected English CSRL test sets. More importantly, we confirm the usefulness of CSRL to non-Chinese conversational tasks such as the question-in-context rewriting task in English and the multi-turn dialogue response generation tasks in English, German and Japanese by incorporating the CSRL information into the downstream conversation-based models. We believe this finding is significant and will facilitate the research of non-Chinese dialogue tasks which suffer the problems of ellipsis and anaphora.

preprint2021arXiv

A giant central red disk galaxy at redshift $z=0.76$: challenge to theories of galaxy formation

We report a giant red central disk galaxy in the XMM-LSS north region. The region is covered with a rich variety of multiband photometric and spectroscopic observations. Using the photometric data of the Canada-France-Hawaii Telescope Legacy Survey (CFHTLS) and spectroscopic observation of the Baryon Oscillation Spectroscopic Survey (BOSS), we find that the galaxy has a stellar mass of $\sim 10^{11.6}$ times of the solar mass $M_\odot$. The galaxy has a red color and has an old stellar population, and thus its star formation has stopped. With the photometric image data of Hyper Suprime-Cam (HSC) Subaru Strategic Program, we demonstrate that its luminosity profile is perfectly described by a Sérsic form with $n=1.22$ indicating disk morphology. We also analyze its environment based on the VIMOS Public Extragalactic Redshift Survey (VIPERS) photometric catalog, and find that its close neighbors are all less massive, indicating that our observed galaxy is sitting at the center of its host halo. Existence of the giant red central disk galaxy seriously challenges the current standard paradigm of galaxy formation, as there is no known physical mechanism to explain the quenching of its star formation. This conclusion is supported by state-of-the-art hydrodynamical simulations of galaxy formation.

preprint2021arXiv

Are there magnetars in high-mass X-ray binaries?

Magnetars form a special population of neutron stars with strong magnetic fields and long spin periods. About 30 magnetars and magnetar candidates known currently are probably isolated. But the possibility that magnetars are in binaries hasn't been excluded. In this work, we perform spin evolution of neutron stars with different magnetic fields in wind-fed high-mass X-ray binaries and compare the spin period distribution with observations, aiming to find magnetars in binaries. Our simulation shows that some of the neutron stars, which have long spin periods or in wide-separation systems, need strong magnetic fields to explain their spin evolution. This implies that there are probably magnetars in high-mass X-ray binaries. Moreover, this can further provide a theoretical basis for some unclear astronomical phenomena, such as the possible origin of periodic fast radio bursts from magnetars in binary systems.

preprint2021arXiv

GraphGallery: A Platform for Fast Benchmarking and Easy Development of Graph Neural Networks Based Intelligent Software

Graph Neural Networks (GNNs) have recently shown to be powerful tools for representing and analyzing graph data. So far GNNs is becoming an increasingly critical role in software engineering including program analysis, type inference, and code representation. In this paper, we introduce GraphGallery, a platform for fast benchmarking and easy development of GNNs based software. GraphGallery is an easy-to-use platform that allows developers to automatically deploy GNNs even with less domain-specific knowledge. It offers a set of implementations of common GNN models based on mainstream deep learning frameworks. In addition, existing GNNs toolboxes such as PyG and DGL can be easily incorporated into the platform. Experiments demonstrate the reliability of implementations and superiority in fast coding. The official source code of GraphGallery is available at https://github.com/EdisonLeeeee/GraphGallery and a demo video can be found at https://youtu.be/mv7Zs1YeaYo.

preprint2021arXiv

Joint Coreference Resolution and Character Linking for Multiparty Conversation

Character linking, the task of linking mentioned people in conversations to the real world, is crucial for understanding the conversations. For the efficiency of communication, humans often choose to use pronouns (e.g., "she") or normal phrases (e.g., "that girl") rather than named entities (e.g., "Rachel") in the spoken language, which makes linking those mentions to real people a much more challenging than a regular entity linking task. To address this challenge, we propose to incorporate the richer context from the coreference relations among different mentions to help the linking. On the other hand, considering that finding coreference clusters itself is not a trivial task and could benefit from the global character information, we propose to jointly solve these two tasks. Specifically, we propose C$^2$, the joint learning model of Coreference resolution and Character linking. The experimental results demonstrate that C$^2$ can significantly outperform previous works on both tasks. Further analyses are conducted to analyze the contribution of all modules in the proposed model and the effect of all hyper-parameters.

preprint2021arXiv

Principle-driven Fiber Transmission Model based on PINN Neural Network

In this paper, a novel principle-driven fiber transmission model based on physical induced neural network (PINN) is proposed. Unlike data-driven models which regard fiber transmission problem as data regression tasks, this model views it as an equation solving problem. Instead of adopting input signals and output signals which are calculated by SSFM algorithm in advance before training, this principle-driven PINN based fiber model adopts frames of time and distance as its inputs and the corresponding real and imaginary parts of NLSE solutions as its outputs. By taking into account of pulses and signals before transmission as initial conditions and fiber physical principles as NLSE in the design of loss functions, this model will progressively learn the transmission rules. Therefore, it can be effectively trained without the data labels, referred as the pre-calculated signals after transmission in data-driven models. Due to this advantage, SSFM algorithm is no longer needed before the training of principle-driven fiber model which can save considerable time consumption. Through numerical demonstration, the results show that this principle-driven PINN based fiber model can handle the prediction tasks of pulse evolution, signal transmission and fiber birefringence for different transmission parameters of fiber telecommunications.

preprint2021arXiv

Programmable Multifunctional Plasmonic Waveguide System based on Coding Metamaterials and Inverse Design

In this article, we propose a programmable plasmonic waveguide system (PPWS) to achieve several different functions based on metal coding metamaterials (MCMs) and inverse design technology. There is no need to spend much time on considering the relation between the function and the structure because the MCMs in the PPWS are reprogrammable. In order to demonstrate the effectiveness of the PPWS, we utilize it to achieve several filtering functions, including bandstop and bandpass filters. The simulation results exhibit that the performance of filters is improved based on genetic algorithm, particle swarm optimization, multi-traversal direct-binary search and simulated annealing. Especially, the bandwidth and quality factor for the narrow-band filter can reach 6.5 nm and 200.5. In addition to the simple filtering functions, some relatively complex transmission characteristics can be obtained by using the PPWS, such as plasmon-induced transparency-like effects. In conclusion, genetic algorithm is considered as the most efficient inverse design method for our system due to its less optimization time and stable performance. In comparison with the previous works, our proposed PPWS not only provides a general framework for obtaining an effective, flexible and compact plasmonic device but also shows the applications of inverse design on photonics devices.

preprint2021arXiv

Structural Information Preserving for Graph-to-Text Generation

The task of graph-to-text generation aims at producing sentences that preserve the meaning of input graphs. As a crucial defect, the current state-of-the-art models may mess up or even drop the core structural information of input graphs when generating outputs. We propose to tackle this problem by leveraging richer training signals that can guide our model for preserving input information. In particular, we introduce two types of autoencoding losses, each individually focusing on different aspects (a.k.a. views) of input graphs. The losses are then back-propagated to better calibrate our model via multi-task training. Experiments on two benchmarks for graph-to-text generation show the effectiveness of our approach over a state-of-the-art baseline. Our code is available at \url{http://github.com/Soistesimmer/AMR-multiview}.

preprint2021arXiv

The first decade of unified gas kinetic scheme

In 2010, the unified gas kinetic scheme (UGKS) was proposed by Xu et al . (A unified gas-kinetic scheme for continuum and rarefied flows, Journal of Computational Physics, 2010). In the past decade, many numerical techniques have been developed to improve the capability of the UGKS in the aspects of efficiency increment, memory reduction, and physical modeling. The methodology of the direct modeling of the UGKS on discretization scale provides a general framework for construction of multiscale method for multiscale transport processes. This paper reviews the development and extension of the UGKS in its first decade.

preprint2021arXiv

Two-step multi-resolution reconstruction-based compact gas-kinetic scheme on tetrahedral mesh

In this paper, a third-order compact gas-kinetic scheme (GKS) on unstructured tetrahedral mesh is constructed for the compressible Euler and Navier-Stokes solutions. The time-dependent gas distribution function at a cell interface is used to calculate the fluxes for the updating the cell-averaged flow variables and to evaluate the time accurate cell-averaged flow variables as well for evolving the cell-averaged gradients of flow variables. With the accurate evolution model for both flow variables and their slopes, the quality of the scheme depends closely on the accuracy and reliability of the initial reconstruction of flow variables. The reconstruction scheme becomes more challenge on tetrahedral mesh, where the conventional second-order unlimited least-square reconstruction can make the scheme be linearly unstable when using cell-averaged conservative variables alone with von Neumann neighbors. Benefiting from the evolved cell-averaged slopes, on tetrahedral mesh the GKS is linearly stable from a compact third-order smooth reconstruction with a large CFL number. In order to further increase the robustness of the high-order compact GKS for capturing discontinuous solution, a new two-step multi-resolution weighted essentially non-oscillatory (WENO) reconstruction will be proposed. The novelty of the reconstruction includes the following. Firstly, it releases the stability issue from a second-order compact reconstruction through the introduction of a pre-reconstruction step. Secondly, in the third-order non-linear reconstruction, only one more large stencil is added beside those in the second-order one, which significantly simplifies the high-order reconstruction. The proposed third-order scheme shows good robustness in high speed flow computation and favorable mesh adaptability in cases with complex geometry.

preprint2021arXiv

Unified gas-kinetic wave-particle method for gas-particle two phase flow from dilute to dense solid-particle limit

In this paper, a unified framework for particulate two-phase flow will be presented with a wide range of solid-particle concentration from dilute to dense limit. The two phase flow is simulated by two coupled flow solvers, i.e., the gas-kinetic scheme (GKS) for the gas phase and unified gas-kinetic wave-particle method (UGKWP) for the particle phase. The GKS is a second-order Navier-Stokes flow solver for the continuum flow. The UGKWP is a multiscale method for all flow regimes. The wave and particle decomposition in UGKWP depends on the cell's Knudsen number (Kn). At a small Kn number, the high concentrated solid particle phase will be modeled by the Eulerian hydrodynamic wave due to the intensive particle-particle collisions. At a large Kn number, the dilute solid particle will be sampled and followed by the Lagrangian particle formulation to capture the non-equilibrium transport. In the transition regime, the distribution and evolution of particle and wave in UGKWP are controlled by the local Kn number with a smooth transition between the above limits. In the current scheme, the two phase model improves the previous one in all following aspects: drag force model for different solid particle concentrations; the frictional pressure in inter-particle contacts at high solid-particle concentration; a flux limiting model to avoid solid particles' over-packing; additional non-conservative nozzle and work terms for the gas phase. Besides, the inter-particle collisions have been refined numerically for the dense particle flow through the discretization of the collision term and numerical flux function. The numerical scheme is tested in a series of typical gas-particle problems. The results validate the accuracy and reliability of the proposed method for gas-particle flow.

preprint2021arXiv

Unified gas-kinetic wave-particle methods VI: Disperse dilute gas-particle multiphase flow

In this paper, a unified gas-kinetic wave-particle scheme (UGKWP) for the disperse dilute gas-particle multiphase flow is proposed. The gas phase is always in the hydrodynamic regime. However, the particle phase covers different flow regimes from particle trajectory crossing to the hydrodynamic wave interaction with the variation of local particle phase Knudsen number. The UGKWP is an appropriate method for the capturing of the multiscale transport mechanism in the particle phase through its coupled wave-particle formulation. In the regime with intensive particle collision, the evolution of solid particle will be followed by the analytic wave with quasi-equilibrium distribution; while in the rarefied regime the non-equilibrium particle phase will be captured through particle tracking and collision, which plays a decisive role in recovering particle trajectory crossing behavior. The gas-kinetic scheme (GKS) is employed for the simulation of gas flow. In the highly collision regime for the particles, no particles will be sampled in UGKWP and the wave formulation for solid particle with the hydrodynamic gas phase will reduce the system to the two-fluid Eulerian model. On the other hand, in the collisionless regime for the solid particle, the free transport of solid particle will be followed in UGKWP, and coupled system will return to the Eulerian-Lagrangian formulation for the gas and particle. The scheme will be tested for in all flow regimes, which include the non-equilibrium particle trajectory crossing, the particle concentration under different Knudsen number, and the dispersion of particle flow with the variation of Stokes number. A experiment of shock-induced particle bed fluidization is simulated and the results are compared with experimental measurements. These numerical solutions validate suitability of the proposed scheme for the simulation of gas-particle multiphase flow.

Kun Xu

What is connected

Connect this record

See the researcher in context

Building this map preview

80 published item(s)

BitLM: Unlocking Multi-Token Language Generation with Bitwise Continuous Diffusion

A Gas-Kinetic Scheme for Maxwell Equations

Formation of PSR J1012+5307 with an extremely low-mass white dwarf: testing magnetic braking models

High-order compact gas-kinetic scheme in arbitrary Lagrangian-Eulerian formulation

60-nm-span wavelength-tunable vortex fiber laser with intracavity plasmon metasurfaces

A Survey of Adversarial Learning on Graphs

Artificial Cnoidal Wave Breathers in Optical Microresonators

Construct the emission line galaxy-host halo connection through auto and cross correlations

Distant finetuning with discourse relations for stance classification

DyLex: Incorporating Dynamic Lexicons into BERT for Sequence Labeling

High-order Compact Gas-kinetic Schemes for Three-dimensional Flow Simulation on Tetrahedral Mesh

High-order Gas-kinetic Schemes with Non-compact and Compact Reconstruction for Implicit Large Eddy Simulation

Magnetism of QCD matter and pion mass from tensor-type spin polarization and anomalous magnetic moment of quarks

Neuromorphic computing using wavelength-division multiplexing

Photometric Objects around Cosmic Webs (PAC) Delineated in a Spectroscopic Survey. I. Methods

Satellite galaxies' drag on field stars in the Milky Way

Semantic optical fiber communication system

Three-dimensional third-order gas-kinetic scheme on hybrid unstructured meshes for Euler and Navier-Stokes equations

UGKWP for three-dimensional simulation of gas-particle fluidized bed

Zero-shot Cross-lingual Conversational Semantic Role Labeling

A giant central red disk galaxy at redshift $z=0.76$: challenge to theories of galaxy formation

Are there magnetars in high-mass X-ray binaries?

GraphGallery: A Platform for Fast Benchmarking and Easy Development of Graph Neural Networks Based Intelligent Software

Joint Coreference Resolution and Character Linking for Multiparty Conversation

Principle-driven Fiber Transmission Model based on PINN Neural Network

Programmable Multifunctional Plasmonic Waveguide System based on Coding Metamaterials and Inverse Design

Structural Information Preserving for Graph-to-Text Generation

The first decade of unified gas kinetic scheme

Two-step multi-resolution reconstruction-based compact gas-kinetic scheme on tetrahedral mesh

Unified gas-kinetic wave-particle method for gas-particle two phase flow from dilute to dense solid-particle limit

Unified gas-kinetic wave-particle methods VI: Disperse dilute gas-particle multiphase flow

A three-dimensional compact high-order gas-kinetic scheme on structured mesh

A Unified Gas-kinetic Scheme for Micro Flow Simulation Based on Linearized Kinetic Equation

Comparison of the performance of high-order schemes based on the gas-kinetic and HLLC fluxes

Coordinated Reasoning for Cross-Lingual Knowledge Graph Alignment

Extracting the magnitude of magnetic field at freeze-out in heavy-ion collisions

High-order gas-kinetic scheme with parallel computation for direct numerical simulation of turbulent flows

Learning Implicit Generative Models by Teaching Explicit Ones

Mixup Inference: Better Exploiting Mixup to Defend Adversarial Attacks

Modeling and computation for non-equilibrium gas dynamics: beyond kinetic relaxation model

Multiplex Word Embeddings for Selectional Preference Acquisition

On the Role of Conceptualization in Commonsense Knowledge Graph Construction

Rethinking Softmax Cross-Entropy Loss for Adversarial Robustness

Robust Dialogue Utterance Rewriting as Sequence Tagging

Star Formation in Massive Galaxies at Redshift $z \sim 0.5$

TexSmart: A Text Understanding System for Fine-Grained NER and Enhanced Semantic Analysis

Three dimensional high-order gas-kinetic scheme for supersonic isotropic turbulence II: coarse-grained analysis of compressible $K_{sgs}$ budget

To Relieve Your Headache of Training an MRF, Take AdVIL

Triple Generative Adversarial Networks

Understanding and Stabilizing GANs' Training Dynamics with Control Theory

Unified Gas-kinetic Wave-Particle Method IV: Multi-species Gas Mixture and Plasma Transport

A well-balanced gas kinetic scheme for Navier-Stokes equations with gravitational potential

Efficient training and design of photonic neural network through neuroevolution

High-order ALE gas-kinetic scheme with unstructured WENO reconstruction

Improved Decoding of Staircase Codes: The Soft-aided Bit-marking (SABM) Algorithm

Machine learning and evolutionary algorithm studies of graphene metamaterials for optimized plasmon-induced transparency

Ray Effect in Rarefied Flow Simulation

Unified Gas-kinetic Wave-Particle Methods II: Multiscale Simulation on Unstructured Mesh

Unified Gas-kinetic Wave-Particle Methods III: Multiscale Photon Transport

Decoding Staircase Codes with Marked Bits

A Compact Fourth-order Gas-kinetic Scheme for the Euler and Navier-Stokes Solutions

A Few Benchmark Test Cases for Higher-order Euler Solvers

A simplification of the unified gas kinetic scheme

A Third-order Compact Gas-kinetic Scheme on Unstructured Meshes for Compressible Navier-Stokes Solutions

An Alternative Analysis of Discontinuous Galerkin Method for Hyperbolic Conservation Law

An Efficient and Accurate Two-Stage Fourth-order Gas-kinetic Scheme for the Navier-Stokes Equations

Linear commuting maps and skew-symmertric biderivations of the deformative Schrodinger-Virasoro Lie algebras

Onsager's Cross Coupling Effects in Gas Flows Confined to Micro-channels

Phonon Boltzmann equation-based discrete unified gas kinetic scheme for multiscale heat transfer

Question Answering on Freebase via Relation Extraction and Textual Evidence

An indirect magnetic approach for determining entropy change in first-order magnetocaloric materials

Cartesian Grid Method for Gas Kinetic Scheme

Discrete unified gas kinetic scheme on unstructured meshes

Semantic Relation Classification via Convolutional Neural Networks with Simple Negative Sampling