Source author record

Shaochun Li

Shaochun Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Topics: Computation and Language, Machine Learning, cond-mat.mtrl-sci, Information Retrieval

Catalog footprint

4 works, 4 topics, 4 close collaborators


Preprint, 2020, arXiv

Automatic Business Process Structure Discovery using Ordered Neurons LSTM: A Preliminary Study

Automatic process discovery from textual process documentation is highly desirable to reduce the time and cost of Business Process Management (BPM) implementation in organizations. However, existing automatic process discovery approaches mainly focus on identifying activities in the documentation. Deriving the structural relationships between activities, which is important to process discovery as a whole, is still a challenge. In fact, a business process has a latent semantic hierarchical structure that defines different levels of detail to reflect the complex business logic. Recent findings in neural machine learning show that meaningful linguistic structure can be induced by joint language modeling and structure learning. Inspired by these findings, we propose to retrieve the latent hierarchical structure present in textual business process documents by building a neural network that leverages a novel recurrent architecture, Ordered Neurons LSTM (ON-LSTM), with a process-level language model objective. We tested the proposed approach on a data set of Process Description Documents (PDD) from our practical Robotic Process Automation (RPA) projects. Preliminary experiments showed promising results.

Preprint, 2020, arXiv

Dynamic Knowledge Distillation for Black-box Hypothesis Transfer Learning

In real-world applications like healthcare, it is usually difficult to build a machine learning prediction model that works universally well across different institutions. At the same time, the available model is often proprietary, i.e., neither the model parameters nor the data set used for model training is accessible. In consequence, leveraging the knowledge hidden in the available model (a.k.a. the hypothesis) and adapting it to a local data set becomes extremely challenging. Motivated by this situation, in this paper we aim to address such a specific case within the hypothesis transfer learning framework, in which 1) the source hypothesis is a black-box model and 2) the source domain data is unavailable. In particular, we introduce a novel algorithm called dynamic knowledge distillation for hypothesis transfer learning (dkdHTL). In this method, we use knowledge distillation with an instance-wise weighting mechanism to adaptively transfer the "dark" knowledge from the source hypothesis to the target domain. The weighting coefficients of the distillation loss and the standard loss are determined by the consistency between the predicted probability of the source hypothesis and the target ground-truth label. Empirical results on both transfer learning benchmark datasets and a healthcare dataset demonstrate the effectiveness of our method.
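The instance-wise weighting described in this abstract can be illustrated with a minimal sketch. The abstract does not give the weighting formula, so the rule used here (the weight on the distillation term is the probability the black-box source model assigns to the true label) is an assumption chosen only for illustration, and the function names are hypothetical rather than taken from the paper.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(target_probs, predicted_probs, eps=1e-12):
    """Cross-entropy between a target distribution and a prediction."""
    return -sum(t * math.log(p + eps)
                for t, p in zip(target_probs, predicted_probs))


def instance_weighted_loss(student_logits, source_probs, label):
    """Blend distillation loss and standard loss for one target example.

    ASSUMPTION: the per-instance weight is the probability that the
    black-box source model assigns to the ground-truth label. The paper's
    actual weighting rule may differ.
    """
    student_probs = softmax(student_logits)
    one_hot = [1.0 if k == label else 0.0 for k in range(len(student_probs))]

    # High weight when the source agrees with the label, low otherwise.
    weight = source_probs[label]

    distill = cross_entropy(source_probs, student_probs)   # soft targets
    supervised = cross_entropy(one_hot, student_probs)     # hard label
    return weight * distill + (1.0 - weight) * supervised, weight
```

Under this assumed rule, an example on which the source model is confidently right leans on the source's soft predictions, while an example on which it is wrong falls back mostly on the ground-truth label. Only the source model's output probabilities are needed, which matches the black-box setting.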

Preprint, 2020, arXiv

Unlocking the Power of Deep PICO Extraction: Step-wise Medical NER Identification

The PICO framework (Population, Intervention, Comparison, and Outcome) is usually used to formulate evidence in the medical domain. The major task of PICO extraction is to extract sentences from medical literature and classify them into each class. However, in most circumstances, there will be more than one piece of evidence in an extracted sentence even if it has been categorized into a certain class. In order to address this problem, we propose a step-wise disease Named Entity Recognition (DNER) extraction and PICO identification method. With our method, sentences in the paper title and abstract are first classified into different classes of PICO, and medical entities are then identified and classified into P and O. Different kinds of deep learning frameworks are used, and experimental results show that our method achieves high performance and fine-grained extraction results compared with conventional PICO extraction work.

Preprint, 2014, arXiv

Aging the Cu-doped Bi2Te3 crystals for the topological transport and its atomic tunneling-clustering dynamics

We report on the observation of two-dimensional weak antilocalization in (Cu0.1Bi0.9)2Te3.06 crystals, relying on measurements of the magnetoresistance in a tilted field. The dephasing analysis and scanning tunneling spectroscopy corroborate the transport of the topological surface states (SSs). The SSs contribute 3.3% of the conductance in 30 μm-thick material and become dominant in 100 nm-thick flakes. Such optimized topological SS transport is achieved by an intense aging process, in which the bulk conductance is suppressed by four orders of magnitude over a long period. Scanning tunneling microscopy reveals that Cu atoms are initially inside the quintuple layers and migrate to the layer gaps to form Cu clusters during the aging. In combination with first-principles calculations, an atomic tunneling-clustering procedure across a diffusion barrier of 0.57 eV is proposed.
