Source author record

Lan Li

Lan Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning physics.optics Computation and Language Computer Vision cond-mat.mtrl-sci Databases

Catalog footprint

What is connected

9works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

ParaRNN: An Interpretable and Parallelizable Recurrent Neural Network for Time-Dependent Data

The proliferation of large-scale and structurally complex data has spurred the integration of machine learning methods into statistical modeling. Recurrent neural networks (RNNs), a foundational class of models for time-dependent data, can be viewed as nonlinear extensions of classical autoregressive moving average models. Despite their flexibility and empirical success in machine learning, RNNs often suffer from limited interpretability and slow training, which hinders their use in statistics. This paper proposes the Parallelized RNN (ParaRNN), a novel model composed of multiple small recurrent units. ParaRNN admits an additive representation that decouples recurrent dynamics into interpretable components, whose behavior can be characterized through recurrence features. This interpretability enables its applications in nonparametric regression for time-dependent data, while the design also allows efficient parallelization. The approximation capacity and non-asymptotic prediction error bounds in a nonparametric regression setting are established for ParaRNN. Empirical results on three sequential modeling tasks further demonstrate that ParaRNN achieves performance comparable to vanilla RNNs while offering improved interpretability and efficiency.

preprint2023arXiv

KAER: A Knowledge Augmented Pre-Trained Language Model for Entity Resolution

Entity resolution has been an essential and well-studied task in data cleaning research for decades. Existing work has discussed the feasibility of utilizing pre-trained language models to perform entity resolution and achieved promising results. However, few works have discussed injecting domain knowledge to improve the performance of pre-trained language models on entity resolution tasks. In this study, we propose Knowledge Augmented Entity Resolution (KAER), a novel framework named for augmenting pre-trained language models with external knowledge for entity resolution. We discuss the results of utilizing different knowledge augmentation and prompting methods to improve entity resolution performance. Our model improves on Ditto, the existing state-of-the-art entity resolution method. In particular, 1) KAER performs more robustly and achieves better results on "dirty data", and 2) with more general knowledge injection, KAER outperforms the existing baseline models on the textual dataset and dataset from the online product domain. 3) KAER achieves competitive results on highly domain-specific datasets, such as citation datasets, requiring the injection of expert knowledge in future work.

preprint2022arXiv

Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning

Text detection and recognition are essential components of a modern OCR system. Most OCR approaches attempt to obtain accurate bounding boxes of text at the detection stage, which is used as the input of the text recognition stage. We observe that when using tight text bounding boxes as input, a text recognizer frequently fails to achieve optimal performance due to the inconsistency between bounding boxes and deep representations of text recognition. In this paper, we propose Box Adjuster, a reinforcement learning-based method for adjusting the shape of each text bounding box to make it more compatible with text recognition models. Additionally, when dealing with cross-domain problems such as synthetic-to-real, the proposed method significantly reduces mismatches in domain distribution between the source and target domains. Experiments demonstrate that the performance of end-to-end text recognition systems can be improved when using the adjusted bounding boxes as the ground truths for training. Specifically, on several benchmark datasets for scene text understanding, the proposed method outperforms state-of-the-art text spotters by an average of 2.0% F-Score on end-to-end text recognition tasks and 4.6% F-Score on domain adaptation tasks.

preprint2022arXiv

Preliminary Steps Towards Federated Sentiment Classification

Automatically mining sentiment tendency contained in natural language is a fundamental research to some artificial intelligent applications, where solutions alternate with challenges. Transfer learning and multi-task learning techniques have been leveraged to mitigate the supervision sparsity and collaborate multiple heterogeneous domains correspondingly. Recent years, the sensitive nature of users' private data raises another challenge for sentiment classification, i.e., data privacy protection. In this paper, we resort to federated learning for multiple domain sentiment classification under the constraint that the corpora must be stored on decentralized devices. In view of the heterogeneous semantics across multiple parties and the peculiarities of word embedding, we pertinently provide corresponding solutions. First, we propose a Knowledge Transfer Enhanced Private-Shared (KTEPS) framework for better model aggregation and personalization in federated sentiment classification. Second, we propose KTEPS$^\star$ with the consideration of the rich semantic and huge embedding size properties of word vectors, utilizing Projection-based Dimension Reduction (PDR) methods for privacy protection and efficient transmission simultaneously. We propose two federated sentiment classification scenes based on public benchmarks, and verify the superiorities of our proposed methods with abundant experimental investigations.

preprint2020arXiv

I-BERT: Inductive Generalization of Transformer to Arbitrary Context Lengths

Self-attention has emerged as a vital component of state-of-the-art sequence-to-sequence models for natural language processing in recent years, brought to the forefront by pre-trained bi-directional Transformer models. Its effectiveness is partly due to its non-sequential architecture, which promotes scalability and parallelism but limits the model to inputs of a bounded length. In particular, such architectures perform poorly on algorithmic tasks, where the model must learn a procedure which generalizes to input lengths unseen in training, a capability we refer to as inductive generalization. Identifying the computational limits of existing self-attention mechanisms, we propose I-BERT, a bi-directional Transformer that replaces positional encodings with a recurrent layer. The model inductively generalizes on a variety of algorithmic tasks where state-of-the-art Transformer models fail to do so. We also test our method on masked language modeling tasks where training and validation sets are partitioned to verify inductive generalization. Out of three algorithmic and two natural language inductive generalization tasks, I-BERT achieves state-of-the-art results on four tasks.

preprint2013arXiv

3-D Integrated Flexible Glass Photonics

Photonic integration on plastic substrates enables emerging applications ranging from flexible interconnects to conformal sensors on biological tissues. Such devices are traditionally fabricated using pattern transfer, which is complicated and has limited integration capacity. Here we pioneered a monolithic approach to realize flexible, high-index-contrast glass photonics with significantly improved processing throughput and yield. Noting that the conventional multilayer bending theory fails when laminates have large elastic mismatch, we derived a mechanics theory accounting for multiple neutral axes in one laminated structure to accurately predict its strain-optical coupling behavior. Through combining monolithic fabrication and local neutral axis designs, we fabricated devices that boast record optical performance (Q=460,000) and excellent mechanical flexibility enabling repeated bending down to sub-millimeter radius without measurable performance degradation, both of which represent major improvements over state-of-the-art. Further, we demonstrate that our technology offers a facile fabrication route for 3-D high-index-contrast photonics difficult to process using traditional methods.

preprint2013arXiv

Demonstration of mid-infrared waveguide photonic crystal cavities

We have demonstrated what we believe to be the first waveguide photonic crystal cavity operating in the mid-infrared. The devices were fabricated from Ge23Sb7S70 chalcogenide glass on CaF2 substrates by combing photolithographic patterning and focus ion beam milling. The waveguide-coupled cavities were characterized using a fiber end fire coupling method at 5.2 μm wavelength, and a loaded quality factor of ~ 2,000 was measured near the critical coupling regime.

preprint2013arXiv

High-Performance, High-Index-Contrast Chalcogenide Glass Photonics on Silicon and Unconventional Non-planar Substrates

This paper reports a versatile, roll-to-roll and backend compatible technique for the fabrication of high-index-contrast photonic structures on both silicon and plastic substrates. The fabrication technique combines low-temperature chalcogenide glass film deposition and resist-free single-step thermal nanoimprint to process low-loss (1.6 dB/cm), sub-micron single-mode waveguides with a smooth surface finish using simple contact photolithography. Using this approach, the first chalcogenide glass micro-ring resonators are fabricated by thermal nanoimprint. The devices exhibit an ultra-high quality-factor of 400,000 near 1550 nm wavelength, which represents the highest value reported in chalcogenide glass micro-ring resonators. Furthermore, sub-micron nanoimprint of chalcogenide glass films on non-planar plastic substrates is demonstrated, which establishes the method as a facile route for monolithic fabrication of high-index-contrast devices on a wide array of unconventional substrates.

preprint2012arXiv

First-Principles Studies of the Atomic, Electronic, and Magnetic Structure of a-MnO2 (Cryptomelane)

Density functional theory calculations are used to investigate a-MnO2, a structure containing a framework of corner and edge sharing MnO6 octahedra with tunnels in between. Placing K+ ions into the tunnels stabilizes a-MnO2 with respect to the rutile-structure b-MnO2 phase, in agreement with experiment. The computed magnetic structure has antiferromagnetic (ferromagnetic) Mn-Mn interactions between corner-sharing (edge-sharing) octahedra. Pure a-MnO2 is found to be a semiconductor with an indirect band gap of 1.3 eV. Water and related hydrides (OH-; H3O+) can also be accommodated in the tunnels; the equilibrium K-O distance increases with increasing oxygen hydride charge.

Lan Li

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

ParaRNN: An Interpretable and Parallelizable Recurrent Neural Network for Time-Dependent Data

KAER: A Knowledge Augmented Pre-Trained Language Model for Entity Resolution

Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning

Preliminary Steps Towards Federated Sentiment Classification

I-BERT: Inductive Generalization of Transformer to Arbitrary Context Lengths

3-D Integrated Flexible Glass Photonics

Demonstration of mid-infrared waveguide photonic crystal cavities

High-Performance, High-Index-Contrast Chalcogenide Glass Photonics on Silicon and Unconventional Non-planar Substrates

First-Principles Studies of the Atomic, Electronic, and Magnetic Structure of a-MnO2 (Cryptomelane)