Source author record

Yue Li

Yue Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

32works

27topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Draft Less, Retrieve More: Hybrid Tree Construction for Speculative Decoding

Speculative decoding (SD) accelerates large language model inference by leveraging a draft-then-verify paradigm. To maximize the acceptance rate, recent methods construct expansive draft trees, which unfortunately incur severe VRAM bandwidth and computational overheads that bottleneck end-to-end speedups. While dynamic-depth pruning can reduce this latency by removing marginal branches, it also discards potentially valid candidates, preventing the acceptance rate from reaching the upper bound of dense trees. In this paper, we identify a critical opportunity in resource allocation: the transition from dense to pruned drafting frees up significant computational budget. To break this Pareto tradeoff, we introduce Graft, a compensation framework that couples pruning and retrieval as mutually reinforcing operations. Pruning supplies sufficient budget for retrieval, while retrieval compensates for pruning-induced coverage loss and recovers accepted length. By employing a sequential `prune-then-graft' mechanism, Graft attaches highly predictive retrieved tokens into positions opened by pruning, filling the topological gaps with near-zero overhead. Graft is entirely training-free and lossless. Comprehensive evaluations show that Graft establishes a new Pareto frontier across practical deployment settings, including short-context generation, long-context generation, and large-scale models. On short-context benchmarks, it achieves up to 5.41$\times$ speedup and improves average speedup over EAGLE-3 by up to 21.8% on the large-scale Qwen3-235B. We also provide a preliminary exploration of applying Graft to the DFlash-style block drafting paradigm, offering initial evidence and insights for extending grafting beyond autoregressive draft trees.

preprint2026arXiv

When Hidden States Drift: Can KV Caches Rescue Long-Range Speculative Decoding?

Speculative decoding accelerates LLM inference, but SOTA hidden-state-based drafters suffer from long-range decay: draft accuracy degrades as the speculative step increases. Existing work attributes this decay to train-inference mismatch and proposes test-time training (TTT) as a remedy, yet we observe that long-range decay persists even in TTT-trained drafters. We revisit long-range decay from the perspective of context information preservation. In hidden-state reuse, we argue the target hidden state acts as a biased context compression: it aggregates historical token information according to the attention query at the current position, yielding a compact representation optimized for immediate next-token prediction. This compression can suppress information less relevant to the current query but important for later speculative steps. In contrast, the target model's KV cache serves as an explicit context, retaining the complete set of token-wise KV representations. We therefore posit the KV-Reuse Hypothesis: allowing the draft model to reuse the target KV cache can provide richer signals for long-horizon drafting. To test this hypothesis, we introduce KVShot, a diagnostic framework that compares three reuse paradigms: hidden-only, KV-only, and hybrid. Extensive evaluations on Qwen3-8B show that KV-Reuse improves long-range acceptance, although end-to-end speedups remain marginal under current training pipelines. Our analysis identifies two key structural bottlenecks: shallow drafters struggle to estimate target queries accurately, and draft-side KV projections receive sparse gradient signals. These findings suggest that realizing the full potential of KV-aware decoding requires moving beyond TTT toward block-wise training paradigms. By exposing these bottlenecks, KVShot provides a foundational diagnostic testbed and a clear roadmap for designing next-generation inference architectures.

preprint2023arXiv

Rigorous Derivation of the Degenerate Parabolic-Elliptic Keller-Segel System from a Moderately Interacting Stochastic Particle System. Part I Partial Differential Equation

The aim of this paper is to provide the analysis result for the partial differential equations arising from the rigorous derivation of the degenerate parabolic-elliptic Keller-Segel system from a moderately interacting stochastic particle system. The rigorous derivation is divided into two articles. In this paper, we establish the solution theory of the degenerate parabolic-elliptic Keller-Segel problem and its non-local version, which will be used in the second paper for the discussion of the mean-field limit. A parabolic regularized system is introduced to bridge the stochastic particle model and the degenerate Keller-Segel system. We derive the existence of the solution to this regularized system by constructing approximate solutions, giving uniform estimates and taking the limits, where a crucial step is to obtain the L infty Bernstein type estimate for the gradient of the approximate solution. Based on this, we obtain the well-posedness of the corresponding non-local equation through perturbation method. Finally, the weak solution of the degenerate Keller-Segel system is obtained by using a nonlinear version of Aubin-Lions lemma.

preprint2022arXiv

A six-point neuron-based ENO (NENO6) scheme for compressible fluid dynamics

In this work, we introduce a deep artificial neural network (ANN) that can detect locations of discontinuity and build a six-point ENO-type scheme based on a set of smooth and discontinuous training data. While a set of candidate stencils of incremental width is constructed, the ANN instead of a classical smoothness indicator is deployed for an ENO-like sub-stencil selection. A convex combination of the candidate fluxes with the re-normalized linear weights forms the six-point neuron-based ENO (NENO6) scheme. The present methodology is inspired by the work [Fu et al., Journal of Computational Physics 305 (2016): 333-359] where contributions of candidate stencils containing discontinuities are removed from the final reconstruction stencil. The binary candidate stencil classification is performed by a well-trained ANN with high fidelity. The proposed framework shows an improved generality and robustness compared with other ANN-based schemes. The generality and performance of the proposed NENO6 scheme are demonstrated by examining one- and two-dimensional benchmark cases with different governing conservation laws and comparing to those of WENO-CU6 and TENO6-opt schemes.

preprint2022arXiv

Coinduction in Uniform: Foundations for Corecursive Proof Search with Horn Clauses

We establish proof-theoretic, constructive and coalgebraic foundations for proof search in coinductive Horn clause theories. Operational semantics of coinductive Horn clause resolution is cast in terms of coinductive uniform proofs; its constructive content is exposed via soundness relative to an intuitionistic first-order logic with recursion controlled by the later modality; and soundness of both proof systems is proven relative to a novel coalgebraic description of complete Herbrand models.

preprint2022arXiv

Field-Dependent Magnetic Domain Behavior in van der Waals Fe$_3$GeTe$_2$

Two-dimensional magnetic van der Waals (vdW) materials can show a variety of topological nontrivial spin textures, such as Bloch- or Néel-type stripe, skyrmion or bubble domains under certain external stimuli. It is critical to understand the magnetic domain behavior in vdW materials in order to control their size, and density in response to external stimuli such as electric and magnetic fields. Here we examine the magnetic field dependence of topologically non-trivial magnetization spin textures in vdW Fe$_3$GeTe$_2$. Néel-type stripe domains and skyrmions are formed depending on the magnetic field-cooling protocol used during in-situ Lorentz transmission electron microscopy (LTEM) experiments. Use of quantitative reconstruction of magnetic induction maps, and micromagnetic simulations, allow for understanding the LTEM results of Néel-type stripe domains as well as skyrmions. In addition, the deformation of skyrmion contrast is observed as a result of the introduction of an in-plane magnetic field. We demonstrate the stability of the stripe domains and skyrmions in response to externally applied magnetic field due to energy barrier for domain wall annihilation. Our results establish an understanding of the energy landscape that governs the behavior of the topologically non-trivial spin textures in vdW materials which can be harnessed for spintronic applications.

preprint2022arXiv

GateNLP-UShef at SemEval-2022 Task 8: Entity-Enriched Siamese Transformer for Multilingual News Article Similarity

This paper describes the second-placed system on the leaderboard of SemEval-2022 Task 8: Multilingual News Article Similarity. We propose an entity-enriched Siamese Transformer which computes news article similarity based on different sub-dimensions, such as the shared narrative, entities, location and time of the event discussed in the news article. Our system exploits a Siamese network architecture using a Transformer encoder to learn document-level representations for the purpose of capturing the narrative together with the auxiliary entity-based features extracted from the news articles. The intuition behind using all these features together is to capture the similarity between news articles at different granularity levels and to assess the extent to which different news outlets write about "the same events". Our experimental results and detailed ablation study demonstrate the effectiveness and the validity of our proposed method.

preprint2022arXiv

Global weak solutions to the Vlasov-Poisson-Fokker-Planck-Navier-Stokes system

We consider the compressible Vlasov-Poisson-Fokker-Planck-Navier-Stokes system in a three dimensional bounded domain with nonhomogeneous Dirichlet boundary conditions. The system describes the evolution of charged particles ensemble dispersed in an isentropic fluid. For the adiabatic coefficient $γ>3/2$, we establish the global existence of weak solutions to this system with arbitrary large initial and boundary data.

preprint2022arXiv

Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation

The rapid development of intelligent tasks, e.g., segmentation, detection, classification, etc, has brought an urgent need for semantic compression, which aims to reduce the compression cost while maintaining the original semantic information. However, it is impractical to directly integrate the semantic metric into the traditional codecs since they cannot be optimized in an end-to-end manner. To solve this problem, some pioneering works have applied reinforcement learning to implement image-wise semantic compression. Nevertheless, video semantic compression has not been explored since its complex reference architectures and compression modes. In this paper, we take a step forward to video semantic compression and propose the Hierarchical Reinforcement Learning based task-driven Video Semantic Coding, named as HRLVSC. Specifically, to simplify the complex mode decision of video semantic coding, we divided the action space into frame-level and CTU-level spaces in a hierarchical manner, and then explore the best mode selection for them progressively with the cooperation of frame-level and CTU-level agents. Moreover, since the modes of video semantic coding will exponentially increase with the number of frames in a Group of Pictures (GOP), we carefully investigate the effects of different mode selections for video semantic coding and design a simple but effective mode simplification strategy for it. We have validated our HRLVSC on the video segmentation task with HEVC reference software HM16.19. Extensive experimental results demonstrated that our HRLVSC can achieve over 39% BD-rate saving for video semantic coding under the Low Delay P configuration.

preprint2022arXiv

Modeling electronic health record data using a knowledge-graph-embedded topic model

The rapid growth of electronic health record (EHR) datasets opens up promising opportunities to understand human diseases in a systematic way. However, effective extraction of clinical knowledge from the EHR data has been hindered by its sparsity and noisy information. We present KG-ETM, an end-to-end knowledge graph-based multimodal embedded topic model. KG-ETM distills latent disease topics from EHR data by learning the embedding from the medical knowledge graphs. We applied KG-ETM to a large-scale EHR dataset consisting of over 1 million patients. We evaluated its performance based on EHR reconstruction and drug imputation. KG-ETM demonstrated superior performance over the alternative methods on both tasks. Moreover, our model learned clinically meaningful graph-informed embedding of the EHR codes. In additional, our model is also able to discover interpretable and accurate patient representations for patient stratification and drug recommendations.

preprint2022arXiv

Multi-wavelength magnetic coding of helical luminescence in ferromagnetic 2D layered CrI3

Two-dimensional (2D) van der Waals (vdW) ferromagnets have opened new avenues for manipulating spin at the limits of single or few atomic layers, and for creating unique magneto-exciton devices through the coupling of long-range ferromagnetic (FM) orders and excitons. However, 2D vdW ferromagnets explored so far have rarely possessed exciton behaviors; to date, FM CrI3 have been recently revealed to show ligand-field photoluminescence correlated with FM ordering, but typically with a broad emission peak. Alternatively, many-body excitons have been observed in antiferromagnetic (AFM) NiPS3, but the coupling of excitons with AFM orders is exponentially more difficult, owing to extremely high coercivity. Here, we report a straightforward approach to realize strong coupling of narrow helical emission and FM orders at a low magnetic field in CrI3 through a relatively simple microsphere cavity. We show that the resonant whispering-gallery-modes (WGM) of SiO2 microspheres give rising to a series of strong oscillation helical emissions with a full width at half-maximum (FWHM) of ~5 nm under continuous wave excitation. Reversible magnetic control and coding of helical luminescence with multiwavelength is realized in the range of 950-1100 nm. This work enables plenty of opportunities for creating magnetic encoding lasing for photonic integrated chips.

preprint2022arXiv

Revisiting the transient coarsening kinetics: a new framework in the Lifshitz-Slyozov-Wagner space

Phase coarsening is a fundamental process of microstructure evolution in multiphase materials. A thorough understanding of its kinetics is of great significance for material processing and performance. Generally, coarsening can be divided into the transient stage and the steady stage. Compared with steady coarsening kinetics, the current understanding of transient coarsening is rather limited and contradictory. In the present work, a new framework in the dimensionless Lifshitz-Slyozov-Wagner space is developed to study transient coarsening kinetics co-controlled by interface migration/reaction and matrix diffusion, where the dynamic equation for individual particles is derived from the thermodynamic extremal principle.

preprint2022arXiv

Tai-e: A Static Analysis Framework for Java by Harnessing the Best Designs of Classics

Static analysis is a mature field with applications to bug detection, security analysis, and code optimization, etc. To facilitate these applications, static analysis frameworks play an essential role by providing a series of fundamental services such as program abstraction, control flow graph construction, and points-to/alias information computation, etc. However, despite impressive progress of static analysis, and this field has seen several popular frameworks in the last decades, it is still not clear how a static analysis framework should be designed in a way that analysis developers could benefit more: for example, what a good IR (for analysis) ought to look like? What functionalities should the module of fundamental analyses provide to ease client analyses? How to develop and integrate new analysis conveniently? How to manage multiple analyses? To answer these questions, in this work, we discuss the design trade-offs for the crucial components of a static analysis framework, and argue for the most appropriate design by following the HBDC (Harnessing the Best Designs of Classics) principle: for each crucial component, we compare the design choices made for it (possibly) by different classic frameworks such as Soot, WALA, SpotBugs and Doop, and choose arguably the best one, but if none is good enough, we then propose a better design. These selected or newly proposed designs finally constitute Tai-e, a new static analysis framework for Java. Specifically, Tai-e is novel in the designs of several aspects like IR, pointer analysis and development of new analyses, etc., leading to an easy-to-learn, easy-to-use and efficient system. To our knowledge, this is the first work that systematically explores the designs and implementations of various static analysis frameworks, and we believe it provides useful materials and viewpoints for building better static analysis infrastructures.

preprint2022arXiv

Thermal Hysteresis Behavior of Skyrmion Lattices in the van der Waals Ferromagnet Fe3GeTe2

Understanding the physics of phase transitions in two-dimensional (2D) systems underpins the research in diverse fields including statistical mechanics, quantum systems, nanomagnetism, and soft condensed matter. However, many fundamental aspects of 2D phase transitions are still not well understood, including the effects of interparticle potential, polydispersity, and particle shape. Magnetic skyrmions, which are non-trivial chiral spin structures, can be considered as quasi-particles that form two-dimensional lattices. Here we show, by real-space imaging using in situ cryo-Lorentz transmission electron microscopy coupled with machine learning, the ordering behavior of Néel skyrmion lattices in van der Waals Fe3GeTe2. We demonstrate a distinct change in the skyrmion size distribution during field-cooling, which leads to a loss of lattice order and an evolution of the skyrmion liquid phase. Remarkably, the lattice order is restored during field heating and demonstrates a thermal hysteresis. Our quantitative analysis explains this behavior based on the energy landscape of skyrmions and demonstrates the potential to control the lattice order in 2D phase transitions.

preprint2022arXiv

Xscope: Hunting for Cross-Chain Bridge Attacks

Cross-Chain bridges have become the most popular solution to support asset interoperability between heterogeneous blockchains. However, while providing efficient and flexible cross-chain asset transfer, the complex workflow involving both on-chain smart contracts and off-chain programs causes emerging security issues. In the past year, there have been more than ten severe attacks against cross-chain bridges, causing billions of loss. With few studies focusing on the security of cross-chain bridges, the community still lacks the knowledge and tools to mitigate this significant threat. To bridge the gap, we conduct the first study on the security of cross-chain bridges. We document three new classes of security bugs and propose a set of security properties and patterns to characterize them. Based on those patterns, we design Xscope, an automatic tool to find security violations in cross-chain bridges and detect real-world attacks. We evaluate Xscope on four popular cross-chain bridges. It successfully detects all known attacks and finds suspicious attacks unreported before. A video of Xscope is available at https://youtu.be/vMRO_qOqtXY.

preprint2021arXiv

Convolutional neural network-assisted recognition of nanoscale L12 ordered structures in face-centred cubic alloys

Nanoscale L12-type ordered structures are widely used in face-centred cubic (FCC) alloys to exploit their hardening capacity and thereby improve mechanical properties. These fine-scale particles are typically fully coherent with matrix with the same atomic configuration disregarding chemical species, which makes them challenging to be characterized. Spatial distribution maps (SDMs) are used to probe local order by interrogating the three-dimensional (3D) distribution of atoms within reconstructed atom probe tomography (APT) data. However, it is almost impossible to manually analyse the complete point cloud ($>10$ million) in search for the partial crystallographic information retained within the data. Here, we proposed an intelligent L12-ordered structure recognition method based on convolutional neural networks (CNNs). The SDMs of a simulated L12-ordered structure and the FCC matrix were firstly generated. These simulated images combined with a small amount of experimental data were used to train a CNN-based L12-ordered structure recognition model. Finally, the approach was successfully applied to reveal the 3D distribution of L12-type $δ^\prime$-Al3(LiMg) nanoparticles with an average radius of 2.54 nm in a FCC Al-Li-Mg system. The minimum radius of detectable nanodomain is even down to 5 Å. The proposed CNN-APT method is promising to be extended to recognize other nanoscale ordered structures and even more-challenging short-range ordered phenomena in the near future.

preprint2021arXiv

IRO: Integrity and Reliability Enhanced Ring ORAM

Memory security and reliability are two of the major design concerns in cloud computing systems. State-of-the-art memory security-reliability co-designs (e.g. Synergy) have achieved a good balance on performance, confidentiality, integrity, and reliability. However, these works merely rely on encryption to ensure data confidentiality, which has been proven unable to prevent information leakage from memory access patterns. Ring ORAM is an attractive confidential protection protocol to hide memory access patterns to the untrusted storage system. Unfortunately, it does not compatible with the security-reliability co-designs. A forced combination would result in more severe performance loss. In this paper, we propose IRO, an Integrity and Reliability enhanced Ring ORAM design. To reduce the overhead of integrity verification, we propose a low overhead integrity tree RIT and use a Minimum Update Subtree Tree (MUST) to reduce metadata update overhead. To improve memory reliability, we present Secure Replication to provide channel-level error resilience for the ORAM tree and use the mirrored channel technique to guarantee the reliability of the MUST. Last, we use the error correction pointer (ECP) to repair permanent memory cell fault to further improve device reliability and lifetime. A compact metadata design is used to reduce the storage and consulting overhead of the ECP. IRO provides strong security and reliability guarantees, while the resulting storage and performance overhead is very small. Our evaluation shows that IRO only increases 7.54% execution time on average over the Baseline under two channels four AES-GCM units setting. With enough AES-GCM units to perform concurrent MAC computing, IRO can reduce 2.14% execution time of the Baseline.

preprint2021arXiv

Reflections on the spatial performance of atom probe tomography in the analysis of atomic neighbourhoods

Atom probe tomography is often introduced as providing "atomic-scale" mapping of the composition of materials and as such is often exploited to analyse atomic neighbourhoods within a material. Yet quantifying the actual spatial performance of the technique in a general case remains challenging, as they depend on the material system being investigated as well as on the specimen's geometry. Here, by using comparisons with field-ion microscopy experiments and field-ion imaging and field evaporation simulations, we provide the basis for a critical reflection on the spatial performance of atom probe tomography in the analysis of pure metals, low alloyed systems and concentrated solid solutions (i.e. akin to high-entropy alloys). The spatial resolution imposes strong limitations on the possible interpretation of measured atomic neighbourhoods, and directional neighbourhood analyses restricted to the depth are expected to be more robust. We hope this work gets the community to reflect on its practices, in the same way, it got us to reflect on our work.

preprint2020arXiv

A low-dissipation shock-capturing framework with flexible nonlinear dissipation control

In this work, a framework to construct arbitrarily high-order low-dissipation shock-capturing schemes with flexible and controllable nonlinear dissipation for convection-dominated problems is proposed. While a set of candidate stencils of incremental width is constructed, each one is indicated as smooth or nonsmooth by the ENO-like stencil selection procedure proposed in the targeted essentially non-oscillatory (TENO) scheme [Fu et al., Journal of Computational Physics 305 (2016): 333-359]. Rather than being discarded directly as with TENO schemes, the nonsmooth candidates are filtered by an extra nonlinear limiter, such as a monotonicity-preserving (MP) limiter or a total variation diminishing (TVD) limiter. Consequently, high-order reconstruction is achieved by assembling candidate fluxes with optimal linear weights since they are either smooth reconstructions or filtered ones which feature good non-oscillation property. A weight renormalization procedure as with the standard TENO paradigm is not necessary. This new framework concatenates the concepts of TENO, WENO and other nonlinear limiters for shock-capturing, and provides a new insight to designing low-dissipation nonlinear schemes. Through the adaptation of nonlinear limiters, nonlinear dissipation in the newly proposed framework can be controlled separately without affecting the performance in smooth regions. Based on the proposed framework, a family of new six- and eight-point nonlinear schemes with controllable dissipation is proposed. A set of critical benchmark cases involving strong discontinuities and broadband fluctuations is simulated. Numerical results reveal that the proposed new schemes capture discontinuities sharply and resolve the high-wavenumber fluctuations with low dissipation, while maintaining the desired accuracy order in smooth regions.

preprint2020arXiv

Kaya: A Testing Framework for Blockchain-based Decentralized Applications

In recent years, many decentralized applications based on blockchain (DApp) have been developed. However, due to inadequate testing, DApps are easily exposed to serious vulnerabilities. We find three main challenges for DApp testing, i.e., the inherent complexity of DApp, inconvenient pre-state setting, and not-so-readable logs. In this paper, we propose a testing framework named Kaya to bridge these gaps. Kaya has three main functions. Firstly, Kaya proposes DApp behavior description language (DBDL) to make writing test cases easier. Test cases written in DBDL can also be automatically executed by Kaya. Secondly, Kaya supports a flexible and convenient way for test engineers to set the blockchain pre-states easily. Thirdly, Kaya transforms incomprehensible addresses into readable variables for easy comprehension. With these functions, Kaya can help test engineers test DApps more easily. Besides, to fit the various application environments, we provide two ways for test engineers to use Kaya, i.e., UI and command-line. Our experimental case demonstrates the potential of Kaya in helping test engineers to test DApps more easily.

preprint2020arXiv

Orthogonal electric control of the out-of-plane field-effect in two-dimensional ferroelectric alpha-In2Se3

Tuning the electric properties of crystalline solids is at the heart of material science and electronics. Generating the electric field-effect via an external voltage is a clean, continuous and systematic method. Here, utilizing the unique electric dipole locking in van der Waals (vdW) ferroelectric alpha-In2Se3, we report a new approach to establish the electric gating effect, where the electrostatic doping in the out-of-plane direction is induced and controlled by an in-plane voltage. With the vertical vdW heterostructure of ultrathin alpha-In2Se3 and MoS2, we validate an in-plane voltage gated coplanar field-effect transistor (CP-FET) with distinguished and retentive on/off ratio. Our results demonstrate unprecedented electric control of ferroelectricity, which paves the way for integrating two-dimensional (2D) ferroelectric into novel nanoelectronic devices with broad applications.

preprint2020arXiv

Spin-Valley Locking Effect in Defect States of Monolayer MoS$_2$

Valley pseudospin in two-dimensional (2D) transition-metal dichalcogenides (TMDs) allows optical control of spin-valley polarization and intervalley quantum coherence. Defect states in TMDs give rise to new exciton features and theoretically exhibit spin-valley polarization; however, experimental achievement of this phenomenon remains challenges. Here, we report unambiguous valley pseudospin of defect-bound localized excitons in CVD-grown monolayer MoS2; enhanced valley Zeeman splitting with an effective g-factor of -6.2 is observed. Our results reveal that all five d-orbitals and the increased effective electron mass contribute to the band shift of defect states, demonstrating a new physics of the magnetic responses of defect-bound localized excitons, strikingly different from that of A excitons. Our work paves the way for the manipulation of the spin-valley degrees of freedom through defects toward valleytronic devices.

preprint2019arXiv

Deep Learning-Based Video Coding: A Review and A Case Study

The past decade has witnessed great success of deep learning technology in many disciplines, especially in computer vision and image processing. However, deep learning-based video coding remains in its infancy. This paper reviews the representative works about using deep learning for image/video coding, which has been an actively developing research area since the year of 2015. We divide the related works into two categories: new coding schemes that are built primarily upon deep networks (deep schemes), and deep network-based coding tools (deep tools) that shall be used within traditional coding schemes or together with traditional coding tools. For deep schemes, pixel probability modeling and auto-encoder are the two approaches, that can be viewed as predictive coding scheme and transform coding scheme, respectively. For deep tools, there have been several proposed techniques using deep learning to perform intra-picture prediction, inter-picture prediction, cross-channel prediction, probability distribution prediction, transform, post- or in-loop filtering, down- and up-sampling, as well as encoding optimizations. In the hope of advocating the research of deep learning-based video coding, we present a case study of our developed prototype video codec, namely Deep Learning Video Coding (DLVC). DLVC features two deep tools that are both based on convolutional neural network (CNN), namely CNN-based in-loop filter (CNN-ILF) and CNN-based block adaptive resolution coding (CNN-BARC). Both tools help improve the compression efficiency by a significant margin. With the two deep tools as well as other non-deep coding tools, DLVC is able to achieve on average 39.6\% and 33.0\% bits saving than HEVC, under random-access and low-delay configurations, respectively. The source code of DLVC has been released for future researches.

preprint2019arXiv

Direct visualization of the magnetostructural phase transition in nano-scale FeRh thin films using differential phase contrast imaging

To advance the use of thermally-activated magnetic materials in device applications it is necessary to examine their behaviour on the localised scale in operando conditions. Equi-atomic FeRh undergoes a magnetostructural transition from an antiferromagnetic (AF) to a ferromagnetic (FM) phase above room temperature (~ 75 to 105 °C) and hence is considered a very desirable material for the next generation of novel nanomagnetic or spintronic devices. For this to be realised, we must fully understand the intricate details of AF to FM transition and associated FM domain growth on the scale of their operation. Here we combine in-situ heating with a comprehensive suite of advanced transmission electron microscopy techniques to investigate directly the magnetostructural transition in nano-scale FeRh thin films. Differential phase contrast imaging visualizes the stages of FM domain growth in both cross-sectional and planar FeRh thin films as a function of temperature. Small surface FM signals are also detected due to interfacial strain with the MgO substrate and Fe deficiency after HF etching of the substrate, providing a directional bias for FM domain growth. Our work provides high resolution imaging and quantitative measurements throughout the transition, which were previously inaccessible, and offers new fundamental insight into their potential use in magnetic devices.

preprint2019arXiv

Ice-rule made manifold: phase transitions, topological defects and manifold restoration in two-dimensional artificial spin systems

Artificial spin ices are arrays of correlated nano-scale magnetic islands that prove an excellent playground in which to study the role of topology in critical phenomena. Here, we investigate a continuum of spin ice geometries, parameterised by rotation of the islands. In doing so, we morph from the classic square ice to the recently studied pinwheel geometry, with the rotation angle acting as a proxy for controlling inter-island interactions. We experimentally observe a change in ground state magnetic order from antiferromagnetic to ferromagnetic across this class of geometries using Lorentz transmission electron microscopy on thermally annealed cobalt arrays. The change in ordering leads to an apparent change in the nature of the defects supported: from one-dimensional strings in the antiferromagnetic phase to two-dimensional vortex-like structures in the ferromagnetic one, consistent with the scaling predicted by the Kibble-Zurek mechanism. Our results show how magnetic order in artificial spin ices can be tuned by changes in geometry so that a truly frustrated ice-rule phase is possible in two-dimensional systems. Furthermore, we demonstrate this system as a testbed to investigate out-of-equilibrium dynamics across phases.

preprint2019arXiv

Multiple and concentration of nontrivial nonnegative solutions for a fractional Choquard equation with critical exponent

In present paper, we study the fractional Choquard equation $$\varepsilon^{2s}(-Δ)^s u+V(x)u=\varepsilon^{μ-N}(\frac{1}{|x|^μ}\ast F(u))f(u)+|u|^{2^\ast_s-2}u$$ where $\varepsilon>0$ is a parameter, $s\in(0,1),$ $N>2s,$ $2^*_s=\frac{2N}{N-2s}$ and $0<μ<\min\{2s,N-2s\}$. Under suitable assumption on $V$ and $f$, we prove this problem has a nontrivial nonnegative ground state solution. Moreover, we relate the number of nontrivial nonnegative solutions with the topology of the set where the potential attains its minimum values and their's concentration behavior.

preprint2019arXiv

Study of Constrained Network Structures for WGANs on Numeric Data Generation

Some recent studies have suggested using GANs for numeric data generation such as to generate data for completing the imbalanced numeric data. Considering the significant difference between the dimensions of the numeric data and images, as well as the strong correlations between features of numeric data, the conventional GANs normally face an overfitting problem, consequently leads to an ill-conditioning problem in generating numeric and structured data. This paper studies the constrained network structures between generator G and discriminator D in WGAN, designs several structures including isomorphic, mirror and self-symmetric structures. We evaluates the performances of the constrained WGANs in data augmentations, taking the non-constrained GANs and WGANs as the baselines. Experiments prove the constrained structures have been improved in 17/20 groups of experiments. In twenty experiments on four UCI Machine Learning Repository datasets, Australian Credit Approval data, German Credit data, Pima Indians Diabetes data and SPECT heart data facing five conventional classifiers. Especially, Isomorphic WGAN is the best in 15/20 experiments. Finally, we theoretically proves that the effectiveness of constrained structures by the directed graphic model (DGM) analysis.

preprint2016arXiv

Ripple: Reflection Analysis for Android Apps in Incomplete Information Environments

Despite its widespread use in Android apps, reflection poses graving problems for static security analysis. Currently, string inference is applied to handle reflection, resulting in significantly missed security vulnerabilities. In this paper, we bring forward the ubiquity of incomplete information environments (IIEs) for Android apps, where some critical data-flows are missing during static analysis, and the need for resolving reflective calls under IIEs. We present Ripple, the first IIE-aware static reflection analysis for Android apps that resolves reflective calls more soundly than string inference. Validation with 17 popular Android apps from Google Play demonstrates the effectiveness of Ripple in discovering reflective targets with a low false positive rate. As a result, Ripple enables FlowDroid, a taint analysis for Android apps, to find hundreds of sensitive data leakages that would otherwise be missed. As a fundamental analysis, Ripple will be valuable for many security analysis clients, since more program behaviors can now be analyzed under IIEs.

preprint2015arXiv

Asymmetric Error Correction and Flash-Memory Rewriting using Polar Codes

We propose efficient coding schemes for two communication settings: 1. asymmetric channels, and 2. channels with an informed encoder. These settings are important in non-volatile memories, as well as optical and broadcast communication. The schemes are based on non-linear polar codes, and they build on and improve recent work on these settings. In asymmetric channels, we tackle the exponential storage requirement of previously known schemes, that resulted from the use of large Boolean functions. We propose an improved scheme, that achieves the capacity of asymmetric channels with polynomial computational complexity and storage requirement. The proposed non-linear scheme is then generalized to the setting of channel coding with an informed encoder, using a multicoding technique. We consider specific instances of the scheme for flash memories, that incorporate error-correction capabilities together with rewriting. Since the considered codes are non-linear, they eliminate the requirement of previously known schemes (called polar write-once-memory codes) for shared randomness between the encoder and the decoder. Finally, we mention that the multicoding scheme is also useful for broadcast communication in Marton's region, improving upon previous schemes for this setting.

preprint2015arXiv

Rewriting Flash Memories by Message Passing

This paper constructs WOM codes that combine rewriting and error correction for mitigating the reliability and the endurance problems in flash memory. We consider a rewriting model that is of practical interest to flash applications where only the second write uses WOM codes. Our WOM code construction is based on binary erasure quantization with LDGM codes, where the rewriting uses message passing and has potential to share the efficient hardware implementations with LDPC codes in practice. We show that the coding scheme achieves the capacity of the rewriting model. Extensive simulations show that the rewriting performance of our scheme compares favorably with that of polar WOM code in the rate region where high rewriting success probability is desired. We further augment our coding schemes with error correction capability. By drawing a connection to the conjugate code pairs studied in the context of quantum error correction, we develop a general framework for constructing error-correction WOM codes. Under this framework, we give an explicit construction of WOM codes whose codewords are contained in BCH codes.

preprint2015arXiv

Unsupervised Learning in Genome Informatics

With different genomes available, unsupervised learning algorithms are essential in learning genome-wide biological insights. Especially, the functional characterization of different genomes is essential for us to understand lives. In this book chapter, we review the state-of-the-art unsupervised learning algorithms for genome informatics from DNA to MicroRNA. DNA (DeoxyriboNucleic Acid) is the basic component of genomes. A significant fraction of DNA regions (transcription factor binding sites) are bound by proteins (transcription factors) to regulate gene expression at different development stages in different tissues. To fully understand genetics, it is necessary of us to apply unsupervised learning algorithms to learn and infer those DNA regions. Here we review several unsupervised learning methods for deciphering the genome-wide patterns of those DNA regions. MicroRNA (miRNA), a class of small endogenous non-coding RNA (RiboNucleic acid) species, regulate gene expression post-transcriptionally by forming imperfect base-pair with the target sites primarily at the 3$'$ untranslated regions of the messenger RNAs. Since the 1993 discovery of the first miRNA \emph{let-7} in worms, a vast amount of studies have been dedicated to functionally characterizing the functional impacts of miRNA in a network context to understand complex diseases such as cancer. Here we review several representative unsupervised learning frameworks on inferring miRNA regulatory network by exploiting the static sequence-based information pertinent to the prior knowledge of miRNA targeting and the dynamic information of miRNA activities implicated by the recently available large data compendia, which interrogate genome-wide expression profiles of miRNAs and/or mRNAs across various cell conditions.

preprint2013arXiv

Enhancing Composition Window of Bicontinuous Structures by Designed Polydispersity Distribution of ABA Triblock Copolymers

The phase behavior of polydisperse ABA triblock copolymers is studied using dissipative particle dynamics simulations, focusing on the emergence and property of bicontinuous structures. Bicontinuous structures are characterized by two separate, intermeshed nanoscopic domains extending throughout the material. The connectivity of polymeric bicontinuous structures makes them highly desirable for many applications. For conventional monodisperse diblock and triblock copolymers, regular bicontinuous structures (i.e., gyroid and Fddd) can be formed over a narrow composition window of ~3%. We demonstrate that the composition window for the formation of bicontinuous structures can be regulated by designed polydispersity distributions of ABA triblock copolymers. In particular, introducing polydispersity in both A and B blocks can lead to a significant enhancement of the composition window of bicontinuous structures with both continuous A and B domains. The mechanism of the bicontinuous structure enhancement is elucidated from the distribution of the long and short blocks. Furthermore, it is shown that the polymeric bicontinuous structures from polydisperse ABA triblock copolymers possess good continuity throughout the sample, making them ideal candidates for advanced applications.

Yue Li

What is connected

Connect this record

See the researcher in context

Building this map preview

32 published item(s)

Draft Less, Retrieve More: Hybrid Tree Construction for Speculative Decoding

When Hidden States Drift: Can KV Caches Rescue Long-Range Speculative Decoding?

Rigorous Derivation of the Degenerate Parabolic-Elliptic Keller-Segel System from a Moderately Interacting Stochastic Particle System. Part I Partial Differential Equation

A six-point neuron-based ENO (NENO6) scheme for compressible fluid dynamics

Coinduction in Uniform: Foundations for Corecursive Proof Search with Horn Clauses

Field-Dependent Magnetic Domain Behavior in van der Waals Fe$_3$GeTe$_2$

GateNLP-UShef at SemEval-2022 Task 8: Entity-Enriched Siamese Transformer for Multilingual News Article Similarity

Global weak solutions to the Vlasov-Poisson-Fokker-Planck-Navier-Stokes system

Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation

Modeling electronic health record data using a knowledge-graph-embedded topic model

Multi-wavelength magnetic coding of helical luminescence in ferromagnetic 2D layered CrI3

Revisiting the transient coarsening kinetics: a new framework in the Lifshitz-Slyozov-Wagner space

Tai-e: A Static Analysis Framework for Java by Harnessing the Best Designs of Classics

Thermal Hysteresis Behavior of Skyrmion Lattices in the van der Waals Ferromagnet Fe3GeTe2

Xscope: Hunting for Cross-Chain Bridge Attacks

Convolutional neural network-assisted recognition of nanoscale L12 ordered structures in face-centred cubic alloys

IRO: Integrity and Reliability Enhanced Ring ORAM

Reflections on the spatial performance of atom probe tomography in the analysis of atomic neighbourhoods

A low-dissipation shock-capturing framework with flexible nonlinear dissipation control

Kaya: A Testing Framework for Blockchain-based Decentralized Applications

Orthogonal electric control of the out-of-plane field-effect in two-dimensional ferroelectric alpha-In2Se3

Spin-Valley Locking Effect in Defect States of Monolayer MoS$_2$

Deep Learning-Based Video Coding: A Review and A Case Study

Direct visualization of the magnetostructural phase transition in nano-scale FeRh thin films using differential phase contrast imaging

Ice-rule made manifold: phase transitions, topological defects and manifold restoration in two-dimensional artificial spin systems

Multiple and concentration of nontrivial nonnegative solutions for a fractional Choquard equation with critical exponent

Study of Constrained Network Structures for WGANs on Numeric Data Generation

Ripple: Reflection Analysis for Android Apps in Incomplete Information Environments

Asymmetric Error Correction and Flash-Memory Rewriting using Polar Codes

Rewriting Flash Memories by Message Passing

Unsupervised Learning in Genome Informatics

Enhancing Composition Window of Bicontinuous Structures by Designed Polydispersity Distribution of ABA Triblock Copolymers