Researcher profile

Hui Jiang

Hui Jiang contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging · Verification L1 · Unclaimed author
13 works
0 followers
12 topics
4 close collaborators

Published work

13 published items

preprint · 2023 · arXiv

Stochastic volatility modeling of high-frequency CSI 300 index and dynamic jump prediction driven by machine learning

This paper models the stochastic process of the price time series of the CSI 300 index in the Chinese financial market and analyzes the volatility characteristics of intraday high-frequency price data. The new generalized Barndorff-Nielsen and Shephard model accounts for the lag caused by asynchronous market information and addresses the lack of long-term dependence. To speed up the valuation process, several machine learning and deep learning algorithms are used to estimate parameters and evaluate forecast results. Tracking historical jumps of different magnitudes offers promising avenues for simulating dynamic price processes and predicting future jumps. Numerical results show that the deterministic component of stochastic volatility processes is always captured over both short and longer-term windows. The findings could be useful to investors and regulators interested in predicting market dynamics based on realized volatility.

preprint · 2022 · arXiv

Analysis of stock index with a generalized BN-S model: an approach based on machine learning and fuzzy parameters

In this paper we combine data science and fuzzy theory to improve the classical Barndorff-Nielsen and Shephard model, and apply the result to analyze the S&P 500 index. We pre-process the index data based on fuzzy theory. After that, S&P 500 stock index data for the past ten years are analyzed, and a deterministic parameter is extracted using various machine and deep learning methods. The results show that the new model, where fuzzy parameters are incorporated, can incorporate the long-term dependence in the classical Barndorff-Nielsen and Shephard model. The modification is based on only a few changes compared to the classical model. At the same time, the resulting analysis effectively captures the stochastic dynamics of the stock index time series.

preprint · 2022 · arXiv

DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding

This paper presents DavarOCR, an open-source toolbox for OCR and document understanding tasks. DavarOCR currently implements 19 advanced algorithms, covering 9 different task forms. DavarOCR provides detailed usage instructions and the trained models for each algorithm. Compared with previous open-source OCR toolboxes, DavarOCR has relatively more complete support for the sub-tasks of cutting-edge document understanding technology. In order to promote the development and application of OCR technology in academia and industry, we pay more attention to the use of modules that different sub-domains of technology can share. DavarOCR is publicly released at https://github.com/hikopensource/Davar-Lab-OCR.

preprint · 2022 · arXiv

Functional large deviations for Stroock's approximation to a class of Gaussian processes with application to small noise diffusions

Letting $N=\{N(t), t\geq 0\}$ be a standard Poisson process, Stroock \cite{Stroock-1981} constructed a family of continuous processes by $$\Theta_\varepsilon(t)=\int_0^t \theta_\varepsilon(r)\,dr, \qquad 0 \le t \le 1,$$ where $\theta_\varepsilon(r)=\frac{1}{\varepsilon}(-1)^{N(\varepsilon^{-2}r)}$, and proved that it converges weakly to a standard Brownian motion under the continuous function topology. We establish the functional large deviations principle (LDP) for the approximations of a class of Gaussian processes constructed by integrals over $\Theta_\varepsilon(t)$, and find the explicit form of the rate function. As an application, we consider the following (non-Markovian) stochastic differential equation $$X^\varepsilon(t) = x_0+\int_0^t b(X^\varepsilon(s))\,ds+\lambda(\varepsilon)\int_0^t \sigma(X^\varepsilon(s))\,d\Theta_\varepsilon(s),$$ where $b$ and $\sigma$ are both Lipschitz functions, and establish its Freidlin-Wentzell type LDP as $\varepsilon\rightarrow 0$. The rate function indicates a phase transition phenomenon as $\lambda(\varepsilon)$ moves from one region to the other.
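To make the construction in this abstract concrete, the following is a minimal simulation sketch of Stroock's approximation, not code from the paper. It assumes only what the abstract states: the integrand flips sign at the jump times of a rate-1 Poisson process run on the time scale ε^{-2}, so the integral is piecewise linear and can be computed exactly. The function name `stroock_path` and its parameters are hypothetical.

```python
import random

def stroock_path(eps, t_grid, rng):
    """Evaluate Theta_eps at the increasing, non-negative times in t_grid.

    theta_eps(r) = (1/eps) * (-1)^{N(r / eps^2)} is piecewise constant, so the
    integral is accumulated exactly between consecutive sign flips.
    """
    values = []
    theta = 1.0 / eps                      # N(0) = 0, so the initial sign is +1
    current_time = 0.0
    integral = 0.0
    # Jumps of N(r / eps^2) occur at eps^2 times the arrival times of N.
    next_jump = eps**2 * rng.expovariate(1.0)
    for t in t_grid:
        while next_jump <= t:
            integral += theta * (next_jump - current_time)
            current_time = next_jump
            theta = -theta                 # each Poisson jump flips the sign
            next_jump += eps**2 * rng.expovariate(1.0)
        integral += theta * (t - current_time)
        current_time = t
        values.append(integral)
    return values

if __name__ == "__main__":
    rng = random.Random(0)
    print(stroock_path(0.1, [0.25, 0.5, 0.75, 1.0], rng))
```

For small ε, the sample variance of Θ_ε(1) over many independent paths should be close to 1, which is consistent with the weak convergence to standard Brownian motion mentioned in the abstract.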

preprint · 2022 · arXiv

Kullback-Leibler-Based Discrete Failure Time Models for Integration of Published Prediction Models with New Time-To-Event Dataset

Prediction of time-to-event data often suffers from rare event rates, small sample sizes, high dimensionality and low signal-to-noise ratios. Incorporating published prediction models from large-scale studies is expected to improve the performance of prognosis prediction on internal individual-level time-to-event data. However, existing integration approaches typically assume that underlying distributions from the external and internal data sources are similar, which is often invalid. To account for challenges including heterogeneity, data sharing, and privacy constraints, we propose a discrete failure time modeling procedure, which utilizes a discrete hazard-based Kullback-Leibler discriminatory information measuring the discrepancy between the published models and the internal dataset. Simulations show the advantage of the proposed method compared with those solely based on the internal data or published models. We apply the proposed method to improve prediction performance on a kidney transplant dataset from a local hospital by integrating this small-scale dataset with published survival models obtained from the national transplant registry.

preprint · 2022 · arXiv

One-dimensional quasi bound states in the continuum in the ω~k space for nonlinear optical applications

The phenomenon of bound states in the continuum (BIC), with infinite quality factor and lifetime, has emerged in recent years in photonics as a new tool for manipulating light-matter interactions. However, most of the investigated structures only support BIC resonances at very few discrete points in the ω~k space. Even when the BIC is switched to a quasi-BIC (QBIC) resonance through perturbation, its frequency will still be located within a narrow spectral band close to that of the original BIC, restricting its applications in many fields where random or multiple input frequencies beyond the narrow band are required. In this work, we demonstrate that a new set of QBIC resonances can be supported by making use of a special binary grating consisting of two alternatingly aligned ridge arrays with the same period and zero-approaching ridge width difference on a slab waveguide. These QBIC resonances are distributed continuously over a broad band along a line in the ω~k space and can thus be considered as one-dimensional QBICs. With the Q factors generally affected by the ridge difference, it is now possible to choose any frequency on the dispersion line to achieve significantly enhanced light-matter interactions, facilitating many applications where multiple input wavelengths are required, e.g. sum or difference frequency generation in nonlinear optics.

preprint · 2021 · arXiv

Enhanced Aspect-Based Sentiment Analysis Models with Progressive Self-supervised Attention Learning

In aspect-based sentiment analysis (ABSA), many neural models are equipped with an attention mechanism to quantify the contribution of each context word to sentiment prediction. However, such a mechanism suffers from one drawback: only a few frequent words with sentiment polarities tend to be taken into consideration for the final sentiment decision, while abundant infrequent sentiment words are ignored by models. To deal with this issue, we propose a progressive self-supervised attention learning approach for attentional ABSA models. In this approach, we iteratively perform sentiment prediction on all training instances, and continually learn useful attention supervision information in the meantime. During training, at each iteration, context words with the highest impact on sentiment prediction, identified based on their attention weights or gradients, are extracted as words with active/misleading influence on the correct/incorrect prediction for each instance. Words extracted in this way are masked for subsequent iterations. To exploit these extracted words for refining ABSA models, we augment the conventional training objective with a regularization term that encourages ABSA models to not only take full advantage of the extracted active context words but also decrease the weights of those misleading words. We integrate the proposed approach into three state-of-the-art neural ABSA models. Experimental results and in-depth analyses show that our approach yields better attention results and significantly enhances the performance of all three models. We release the source code and trained models at https://github.com/DeepLearnXMU/PSSAttention.

preprint · 2021 · arXiv

Filling up complex spectral regions through non-Hermitian disordered chains

Eigenspectra that fill regions in the complex plane have been intriguing to many, inspiring research from random matrix theory to esoteric semi-infinite bounded non-Hermitian lattices. In this work, we propose a simple and robust ansatz for constructing models whose eigenspectra fill up generic prescribed regions. Our approach utilizes specially designed non-Hermitian random couplings that allow the co-existence of eigenstates with a continuum of localization lengths, mathematically emulating the effects of semi-infinite boundaries. While some of these couplings are necessarily long-ranged, they are still far more local than what is possible with known random matrix ensembles. Our ansatz can be feasibly implemented in physical platforms such as classical and quantum circuits, and harbors very high tolerance to imperfections due to its stochastic nature.

preprint · 2020 · arXiv

Match$^2$: A Matching over Matching Model for Similar Question Identification

Community Question Answering (CQA) has become a primary means for people to acquire knowledge, where people are free to ask questions or submit answers. To enhance the efficiency of the service, similar question identification becomes a core task in CQA which aims to find a similar question from the archived repository whenever a new question is asked. However, it has long been a challenge to properly measure the similarity between two questions due to the inherent variation of natural language, i.e., there could be different ways to ask the same question or different questions sharing similar expressions. To alleviate this problem, it is natural to involve the existing answers for the enrichment of the archived questions. Traditional methods typically take a one-side usage, which leverages the answer as some expanded representation of the corresponding question. Unfortunately, this may introduce unexpected noise into the similarity computation since answers are often long and diverse, leading to inferior performance. In this work, we propose a two-side usage, which leverages the answer as a bridge between the two questions. The key idea is based on our observation that similar questions could be addressed by similar parts of the answer while different questions may not. In other words, we can compare the matching patterns of the two questions over the same answer to measure their similarity. In this way, we propose a novel matching over matching model, namely Match$^2$, which compares the matching patterns between two question-answer pairs for similar question identification. Empirical experiments on two benchmark datasets demonstrate that our model can significantly outperform previous state-of-the-art methods on the similar question identification task.

preprint · 2020 · arXiv

On Approximation Capabilities of ReLU Activation and Softmax Output Layer in Neural Networks

In this paper, we have extended the well-established universal approximator theory to neural networks that use the unbounded ReLU activation function and a nonlinear softmax output layer. We have proved that a sufficiently large neural network using the ReLU activation function can approximate any function in $L^1$ up to any arbitrary precision. Moreover, our theoretical results have shown that a large enough neural network using a nonlinear softmax output layer can also approximate any indicator function in $L^1$, which is equivalent to mutually-exclusive class labels in any realistic multiple-class pattern classification problems. To the best of our knowledge, this work is the first theoretical justification for using the softmax output layers in neural networks for pattern classification.
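As a small illustration of the kind of statement made in this abstract, the sketch below shows a standard textbook construction, not the paper's proof: four ReLU units form a trapezoid that approximates the indicator function of an interval [a, b], with an L^1 error equal to the ramp width `delta`. The names `relu_bump` and `l1_error` are hypothetical helpers introduced here.

```python
def relu(x):
    return max(0.0, x)

def relu_bump(x, a, b, delta):
    """Four-ReLU trapezoid: equals 1 on [a, b] and 0 outside [a - delta, b + delta]."""
    return (relu(x - a + delta) - relu(x - a)
            - relu(x - b) + relu(x - b - delta)) / delta

def l1_error(a, b, delta, lo, hi, n=30000):
    """Midpoint-rule estimate of the L^1 distance to the indicator of [a, b] on [lo, hi]."""
    h = (hi - lo) / n
    total = 0.0
    for i in range(n):
        x = lo + (i + 0.5) * h
        indicator = 1.0 if a <= x <= b else 0.0
        total += abs(relu_bump(x, a, b, delta) - indicator) * h
    return total

if __name__ == "__main__":
    # The error is the area of the two ramps (delta / 2 each), so it shrinks with delta.
    for delta in (0.2, 0.1, 0.05):
        print(delta, l1_error(0.0, 1.0, delta, -1.0, 2.0))
```

Shrinking `delta` makes the L^1 error as small as desired, which is the one-dimensional intuition behind approximating indicator functions with ReLU units.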

preprint · 2020 · arXiv

Topological invariants, zero mode edge states and finite size effect for a generalized non-reciprocal Su-Schrieffer-Heeger model

Intriguing issues in one-dimensional non-reciprocal topological systems include the breakdown of usual bulk-edge correspondence and the occurrence of half-integer topological invariants. In order to understand these unusual topological properties, we investigate the topological phase diagrams and the zero-mode edge states of a generalized non-reciprocal Su-Schrieffer-Heeger model, based on some analytical results. Meanwhile, we provide a concise geometrical interpretation of the bulk topological invariants in terms of two independent winding numbers and also give an alternative interpretation related to the linking properties of curves in three-dimensional space. For the system under the open boundary condition, we construct analytically the wavefunctions of zero-mode edge states by properly considering a hidden symmetry of the system and the normalization condition with the use of biorthogonal eigenvectors. Our analytical results directly give the phase boundary for the existence of zero-mode edge states and unveil clearly the evolution behavior of edge states. In comparison with results via exact diagonalization of finite-size systems, we find our analytical results agree with the numerical results very well.

preprint · 2017 · arXiv

Enhanced LSTM for Natural Language Inference

Reasoning and inference are central to human and artificial intelligence. Modeling inference in human language is very challenging. With the availability of large annotated data (Bowman et al., 2015), it has recently become feasible to train neural network based inference models, which have been shown to be very effective. In this paper, we present a new state-of-the-art result, achieving an accuracy of 88.6% on the Stanford Natural Language Inference Dataset. Unlike the previous top models that use very complicated network architectures, we first demonstrate that carefully designing sequential inference models based on chain LSTMs can outperform all previous models. Based on this, we further show that by explicitly considering recursive architectures in both local inference modeling and inference composition, we achieve additional improvement. Particularly, incorporating syntactic parsing information contributes to our best result; it further improves the performance even when added to the already very strong model.

preprint · 2011 · arXiv

Thermodynamic properties and phase diagrams of spin-1 quantum Ising systems with three-spin interactions

The spin-1 quantum Ising systems with three-spin interactions on two-dimensional triangular lattices are studied by the mean-field method. The thermal variations of order parameters and phase diagrams are investigated in detail. The stable, metastable and unstable branches of the order parameters are obtained. According to the stability conditions at the critical point, we find that the systems exhibit tricritical points. With crystal field and biquadratic interactions, the system has rich phase diagrams with single reentrant or double reentrant phase transitions for appropriate ranges of both parameters.