Source author record

Wenqing Zhang

Wenqing Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher: unclaimed source record

cond-mat.mtrl-sci Computer Vision physics.comp-ph cond-mat.dis-nn physics.data-an q-fin.MF

Catalog footprint

What is connected

14 works

6 topics

4 close collaborators

preprint 2023 arXiv

Fixed-point iterative algorithm for SVI model

The stochastic volatility inspired (SVI) model is widely used to fit the implied variance smile. Presently, most optimization algorithms for the SVI model depend strongly on the input starting point. In this study, we develop an efficient iterative algorithm for the SVI model based on a fixed-point and least-squares optimizer. Furthermore, we present convergence results for this novel iterative algorithm in certain situations. Compared with the quasi-explicit SVI method, we demonstrate the advantages of the fixed-point iterative algorithm using simulation and market data.
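As background for the abstract above, the following is a minimal sketch of why least squares fits naturally into SVI calibration. It assumes the widely used raw SVI parameterization w(k) = a + b(ρ(k − m) + sqrt((k − m)² + σ²)), which the abstract itself does not spell out. With m and σ held fixed, the model is linear in (a, bρ, b), so that part can be solved by ordinary least squares. This is not the authors' fixed-point algorithm; the function names `svi_total_variance` and `fit_linear_part` and all parameter values are hypothetical, and the no-arbitrage constraints are ignored.

```python
import numpy as np

def svi_total_variance(k, a, b, rho, m, sigma):
    """Raw SVI total implied variance at log-moneyness k (assumed parameterization)."""
    return a + b * (rho * (k - m) + np.sqrt((k - m) ** 2 + sigma ** 2))

def fit_linear_part(k, w, m, sigma):
    """With (m, sigma) fixed, solve for (a, b, rho) by unconstrained least squares.

    The model is linear in the coefficients (a, b*rho, b), so one lstsq call
    suffices. Constraints such as b >= 0 or |rho| <= 1 are NOT enforced here.
    """
    design = np.column_stack([
        np.ones_like(k),                       # multiplies a
        k - m,                                 # multiplies b * rho
        np.sqrt((k - m) ** 2 + sigma ** 2),    # multiplies b
    ])
    coef, *_ = np.linalg.lstsq(design, w, rcond=None)
    a, b_rho, b = coef
    return a, b, b_rho / b

# Synthetic smile generated from illustrative (made-up) parameters.
k_grid = np.linspace(-1.0, 1.0, 41)
true_params = dict(a=0.04, b=0.4, rho=-0.3, m=0.1, sigma=0.2)
w_grid = svi_total_variance(k_grid, **true_params)

a_hat, b_hat, rho_hat = fit_linear_part(k_grid, w_grid, m=0.1, sigma=0.2)
```

On noise-free synthetic data with the true (m, σ) supplied, the linear solve recovers a, b and ρ. The hard part of calibration, searching over (m, σ) without depending on the starting point, is what the paper addresses and is not shown here.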

preprint 2022 arXiv

1st Place Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: End-to-End Recognition of Out of Vocabulary Words

Scene text recognition has attracted increasing interest in recent years due to its wide range of applications in multilingual translation, autonomous driving, etc. In this report, we describe our solution to the Out of Vocabulary Scene Text Understanding (OOV-ST) Challenge, which aims to extract out-of-vocabulary (OOV) words from natural scene images. Our oCLIP-based model achieves an h-mean of 28.59%, which ranks 1st in the end-to-end OOV word recognition track of the OOV Challenge at the ECCV 2022 TiE Workshop.

preprint 2022 arXiv

FedOCR: Communication-Efficient Federated Learning for Scene Text Recognition

While scene text recognition techniques have been widely used in commercial applications, data privacy has rarely been taken into account by this research community. Most existing algorithms assume a set of shared or centralized training data. In practice, however, data may be distributed across local devices and cannot be centralized or shared because of privacy restrictions. In this paper, we study how to use decentralized datasets to train a robust scene text recognizer while keeping the data on local devices. To the best of our knowledge, we propose the first framework leveraging federated learning for scene text recognition, which is trained collaboratively with decentralized datasets. Hence we name it FedOCR. To make FedOCR suitable for deployment on end devices, we make two improvements: using lightweight models and hashing techniques. We argue that both are crucial for FedOCR in terms of the communication efficiency of federated learning. Simulations on decentralized datasets show that the proposed FedOCR achieves results competitive with models trained on centralized data, with lower communication costs and stronger privacy preservation.
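To make the federated setting in the abstract above concrete, here is a minimal sketch of one generic aggregation round, assuming plain federated averaging (weighting each client by its local sample count). The abstract does not say which aggregation rule FedOCR uses, and its lightweight models and hashing techniques are not reproduced; the function name `federated_average` and the toy weights are hypothetical.

```python
import numpy as np

def federated_average(client_weights, client_sizes):
    """Average per-client parameter dicts, weighted by local dataset size.

    client_weights: list of dicts mapping parameter name -> np.ndarray
    client_sizes:   list of local sample counts, one per client
    Only parameters travel to the server; raw training data stays on each client.
    """
    total = float(sum(client_sizes))
    names = client_weights[0].keys()
    return {
        name: sum(w[name] * (n / total) for w, n in zip(client_weights, client_sizes))
        for name in names
    }

# Two hypothetical clients holding 1 and 3 local samples respectively.
clients = [
    {"w": np.array([0.0, 0.0])},
    {"w": np.array([4.0, 8.0])},
]
global_weights = federated_average(clients, client_sizes=[1, 3])
```

Communication cost in this scheme scales with the size of the parameter dict sent each round, which is one reason a smaller model reduces traffic.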

preprint 2022 arXiv

Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection

Recently, transformer-based methods have achieved promising progress in object detection, as they can eliminate post-processing steps such as NMS and enrich the deep representations. However, these methods cannot cope well with scene text because of its extreme variance in scale and aspect ratio. In this paper, we present a simple yet effective transformer-based architecture for scene text detection. Unlike previous approaches that learn robust deep representations of scene text in a holistic manner, our method performs scene text detection based on a few representative features, which avoids disturbance from the background and reduces the computational cost. Specifically, we first select a few representative features at all scales that are highly relevant to foreground text. Then, we adopt a transformer to model the relationships among the sampled features, which effectively divides them into reasonable groups. As each feature group corresponds to a text instance, its bounding box can be easily obtained without any post-processing operation. Using the basic feature pyramid network for feature extraction, our method consistently achieves state-of-the-art results on several popular datasets for scene text detection.

preprint 2022 arXiv

Giant transverse and longitudinal magneto-thermoelectric effect in polycrystalline nodal-line semimetal Mg3Bi2

Topological semimetals provide new opportunities for exploring thermoelectric phenomena because of their exotic and nontrivial electronic structure topology around the Fermi surface. In this study, we report the discovery of giant transverse and longitudinal magneto-thermoelectric (MTE) effects in Mg3Bi2, which is predicted to be a type-II nodal-line semimetal in the absence of spin-orbit coupling (SOC). The maximum transverse power factor is 2182 μW m$^{-1}$ K$^{-2}$ at 13.5 K and 6 Tesla. The longitudinal power factor reaches up to 3043 μW m$^{-1}$ K$^{-2}$ at 15 K and 13 Tesla, which is 20 times higher than in zero magnetic field and is comparable to state-of-the-art MTE materials. By compensating for Mg loss under Mg-rich conditions to tune the carrier concentration, the sample obtained in this work shows a large linear non-saturating magnetoresistance of 940% under a field of 14 Tesla. This is a two-orders-of-magnitude increase with respect to the normal Mg-deficient Mg3Bi2 sample. Using density functional calculations, we attribute the underlying mechanism to the parent nodal-line electronic structure without SOC and the anisotropic Fermi surface shape with SOC, highlighting the essential role of high carrier mobility and open electron orbits in momentum space. Our work offers a new avenue toward highly efficient thermoelectric materials through the design of Fermi surfaces with special topological electronic structures in novel quantum materials.

preprint 2022 arXiv

Prediction of protected band edge states and dielectric tunable quasiparticle and excitonic properties of monolayer MoSi$_2$N$_4$

The electronic structure of two-dimensional (2D) materials is inherently prone to environmental perturbations, which may pose significant challenges to their applications in electronic or optoelectronic devices. A 2D material couples with its environment through two mechanisms: local chemical coupling and nonlocal dielectric screening effects. The local chemical coupling is often difficult to predict or control experimentally. Nonlocal dielectric screening, on the other hand, can be tuned by choosing the substrates or layer thickness in a controllable manner. Therefore, a compelling 2D electronic material should offer band edge states that are robust against local chemical coupling effects. Here it is demonstrated that the recently synthesized MoSi$_2$N$_4$ is an ideal 2D semiconductor with robust band edge states protected from capricious environmental chemical coupling effects. Detailed many-body perturbation theory calculations are carried out to illustrate how the band edge states of MoSi$_2$N$_4$ are shielded from the direct chemical coupling effects, while its quasiparticle and excitonic properties can be modulated through the nonlocal dielectric screening effects. This unique property, together with the moderate band gap and the thermodynamic and mechanical stability of this material, paves the way for a range of applications of MoSi$_2$N$_4$ in areas including energy, 2D electronics, and optoelectronics.

preprint 2022 arXiv

Runner-Up Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: Cropped Word Recognition

This report presents our 2nd place solution to the ECCV 2022 challenge on Out-of-Vocabulary Scene Text Understanding (OOV-ST): Cropped Word Recognition. This challenge is held in the context of the ECCV 2022 workshop on Text in Everything (TiE) and aims to extract out-of-vocabulary words from natural scene images. In the competition, we first pre-train SCATTER on synthetic datasets, then fine-tune the model on the training set with data augmentations. Meanwhile, two additional models are trained specifically for long and vertical texts. Finally, we combine the outputs of models with different layers, different backbones, and different seeds as the final results. Our solution achieves a word accuracy of 59.45% when considering out-of-vocabulary words only.

preprint 2022 arXiv

SeqFormer: Sequential Transformer for Video Instance Segmentation

In this work, we present SeqFormer for video instance segmentation. SeqFormer follows the principle of the vision transformer, which models instance relationships among video frames. Nevertheless, we observe that a stand-alone instance query suffices for capturing a time sequence of instances in a video, but attention should be performed on each frame independently. To achieve this, SeqFormer locates an instance in each frame and aggregates temporal information to learn a powerful representation of a video-level instance, which is used to predict the mask sequences on each frame dynamically. Instance tracking is achieved naturally without tracking branches or post-processing. On YouTube-VIS, SeqFormer achieves 47.4 AP with a ResNet-50 backbone and 49.0 AP with a ResNet-101 backbone without bells and whistles. These results significantly exceed the previous state-of-the-art performance by 4.6 and 4.4 AP, respectively. In addition, integrated with the recently proposed Swin transformer, SeqFormer achieves a much higher AP of 59.3. We hope SeqFormer can be a strong baseline that fosters future research in video instance segmentation and, in the meantime, advances this field with a more robust, accurate, and neat model. The code is available at https://github.com/wjf5203/SeqFormer.

preprint 2022 arXiv

Solid-state Janus nanoprecipitation enables amorphous-like heat conduction in crystalline Mg3Sb2-based thermoelectric materials

Solid-state precipitation can be used to tailor materials properties, ranging from ferromagnets and catalysts to mechanical strengthening and energy storage. Thermoelectric properties can be modified by precipitation to enhance phonon scattering while retaining charge-carrier transmission. Here, we uncover unconventional dual Janus-type nanoprecipitates in Mg3Sb1.5Bi0.5 formed by side-by-side Bi- and Ge-rich appendages, in contrast to separate nanoprecipitate formation. These Janus nanoprecipitates result from local co-melting of Bi and Ge during sintering, enabling an amorphous-like lattice thermal conductivity. A precipitate size effect on phonon scattering is observed due to the balance between alloy-disorder and nanoprecipitate scattering. The thermoelectric figure-of-merit ZT reaches 0.6 near room temperature and 1.6 at 773 K. The Janus nanoprecipitation can be introduced into other materials and may act as a general property-tailoring mechanism.

preprint 2020 arXiv

Combined sub-sampling and analytical integration for efficient large-scale $GW$ calculations for 2D systems

Accurate and efficient predictions of the quasiparticle properties of complex materials remain a major challenge due to the convergence issue and the unfavorable scaling of the computational cost with respect to the system size. Quasiparticle $GW$ calculations for two-dimensional (2D) materials are especially difficult. The unusual analytical behaviors of the dielectric screening and the electron self-energy of 2D materials make the conventional Brillouin zone (BZ) integration approach rather inefficient and require an extremely dense $k$-grid to properly converge the calculated quasiparticle energies. In this work, we present a combined non-uniform sub-sampling and analytical integration method that can drastically improve the efficiency of the BZ integration in 2D $GW$ calculations. Our work is distinguished from previous work in that, instead of focusing on the intricate dielectric matrix or the screened Coulomb interaction matrix, we exploit the analytical behavior of various terms of the convolved self-energy $\Sigma(\mathbf{q})$ in the small $\mathbf{q}$ limit. This method, when combined with another accelerated $GW$ method that we developed recently, can drastically speed up (by over three orders of magnitude) $GW$ calculations for 2D materials. Our method allows fully converged $GW$ calculations for complex 2D systems at a fraction of the computational cost, facilitating future high-throughput screening of the quasiparticle properties of 2D semiconductors for various applications. To demonstrate the capability and performance of our new method, we have carried out fully converged $GW$ calculations for monolayer C$_2$N, a recently discovered 2D material with a large unit cell, and investigated its quasiparticle band structure in detail.

preprint 2020 arXiv

Remarkable band gap renormalization via dimensionality of the layered material C3B

Layer-dependent electronic and structural properties of the emerging graphitic carbon-boron compound C3B are investigated using both density functional theory and the GW approximation. We discover that, in contrast to a moderate quasiparticle band gap of 2.55 eV for monolayer C3B, the calculated quasiparticle band gap of perfectly stacked bulk-phase C3B is as small as 0.17 eV. Therefore, our results suggest that the layered material C3B exhibits a remarkably large band gap renormalization of over 2.3 eV due to interlayer coupling and screening effects, providing a single material with extraordinary band gap tunability. The quasiparticle band gap of monolayer C3B is also over 1.0 eV larger than that of C3N, a closely related two-dimensional semiconductor. Detailed inspections of the near-edge electronic states reveal that the conduction and valence band edges of C3B are formed by out-of-plane and in-plane electronic states, respectively, suggesting an interesting possibility of tuning the band edges of such a layered material separately by modulating the in-plane and out-of-plane interactions.
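The renormalization figure quoted in the abstract above follows directly from the two band gaps it reports. As a worked check, with the symbol $\Delta E_g$ introduced here only for illustration:

$$\Delta E_g = E_g^{\text{monolayer}} - E_g^{\text{bulk}} = 2.55\ \text{eV} - 0.17\ \text{eV} = 2.38\ \text{eV} > 2.3\ \text{eV}$$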

preprint 2013 arXiv

Hole-lattice Coupling and Photo-induced Insulator-Metal Transition in VO$_2$

Photo-induced insulator-metal transition in VO$_2$ and the related transient and multi-timescale structural dynamics upon photoexcitation are explained within a unified framework. Holes created by photoexcitation weaken the V-V bonds and eventually break V-V dimers in the M$_1$ phase of VO$_2$ when the laser fluence reaches a critical value. The breaking of the V-V bonds in turn leads to an immediate electronic phase transition from an insulating to a metallic state while the crystal lattice remains monoclinic in shape. The coupling between excited electrons and the 6.0 THz phonon mode is found to be responsible for the observed zig-zag motion of V atoms upon photoexcitation and is consistent with coherent phonon experiments.

preprint 2012 arXiv

Evaluation of Scale-Invariance In Physiological Signals By Means Of Balanced Estimation Of Diffusion Entropy

Using the concept of balanced estimation of diffusion entropy, we reliably evaluate the scale-invariance embedded in different sleep stages and stride records. Segments corresponding to wake, light sleep, REM, and deep sleep stages are extracted from long-term EEG signals. For each stage the scaling value is distributed over a considerably wide range, which tells us that the scaling behavior is subject- and sleep-cycle-dependent. The average of the scaling exponent values for wake segments is almost the same as that for REM segments ($\sim 0.8$). Wake and REM stages have a significantly higher average scaling exponent than light sleep stages ($\sim 0.7$). For the stride series, the original diffusion entropy (DE) and balanced estimation of diffusion entropy (BEDE) give almost the same results for de-trended series. The evolution of local scale-invariance shows that the physiological states change abruptly, even though great efforts were made in the experiments to keep conditions unchanged. The global behavior of a single physiological signal may lose rich information on physiological states. Methodologically, BEDE can evaluate scale-invariance with considerable precision in very short time series ($\sim 10^2$), while the original DE method may sometimes underestimate scale-invariance exponents or even fail to detect scale-invariant behavior. The BEDE method is sensitive to trends in time series. The existence of a trend may lead to an unreasonably high value of the scaling exponent and, consequently, to mistaken conclusions.
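For readers unfamiliar with the method named in the abstract above, the following is a minimal sketch of the original diffusion entropy (DE) analysis as it is commonly formulated: sum the series over overlapping windows of length t, estimate the Shannon entropy S(t) of those displacements from a histogram, and read the scaling exponent δ from the slope of S(t) against ln t. It does not implement the balanced estimator (BEDE) studied in the paper. The function names, the fixed bin width, and the window lengths are illustrative assumptions.

```python
import numpy as np

def diffusion_entropy(series, window_lengths, bin_width):
    """Histogram estimate of the entropy S(t) of window sums, for each length t."""
    csum = np.concatenate(([0.0], np.cumsum(series)))
    entropies = []
    for t in window_lengths:
        # Displacements over all overlapping windows of length t.
        x = csum[t:] - csum[:-t]
        # Fixed-width bins so that entropies are comparable across t.
        lo = np.floor(x.min() / bin_width) * bin_width
        hi = np.ceil(x.max() / bin_width) * bin_width + bin_width
        edges = np.arange(lo, hi + bin_width / 2, bin_width)
        counts, _ = np.histogram(x, bins=edges)
        p = counts[counts > 0] / counts.sum()
        # Discrete entropy plus ln(bin width) approximates the differential entropy.
        entropies.append(-np.sum(p * np.log(p)) + np.log(bin_width))
    return np.array(entropies)

def scaling_exponent(series, window_lengths, bin_width):
    """Slope delta of S(t) = A + delta * ln(t), fitted by least squares."""
    s = diffusion_entropy(series, window_lengths, bin_width)
    delta, _ = np.polyfit(np.log(window_lengths), s, 1)
    return delta

# Uncorrelated Gaussian noise is the reference case, with delta expected near 0.5.
rng = np.random.default_rng(0)
white_noise = rng.standard_normal(50_000)
delta_white = scaling_exponent(white_noise, [1, 2, 4, 8, 16, 32], bin_width=0.5)
```

The example uses a long series on purpose. With only about 10² points the histogram estimate becomes unreliable, which is the regime the abstract says BEDE is designed for.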

preprint 2011 arXiv

Comparative Study of Structural and Electronic Properties of Cu-based Multinary Semiconductors

We present a systematic and comparative study of the structural and electronic properties of Cu-based ternary and quaternary semiconductors using first-principles electronic structure approaches. The important role that Cu d electrons play in determining their properties is illustrated by comparing results calculated with different exchange-correlation energy functionals. We show that systematic improvement of the calculated anion displacement can be achieved by using the Heyd-Scuseria-Ernzerhof (HSE06) functional compared with the Perdew-Burke-Ernzerhof (PBE) functional. Quasiparticle band structures are then calculated within the G0W0 approximation using the crystal structures optimized within the HSE06 functional and starting from the PBE+U mean-field solution. Both the calculated quasiparticle band gaps and their systematic variation with chemical constituents agree very well with experiments. We also predict that the quasiparticle band gap of the prototypical semiconductor Cu2ZnSnS4 is 1.65 eV in the kesterite (KS) phase and 1.40 eV in the stannite (ST) phase. These results are also consistent with available experimental values, which vary from 1.45 to 1.6 eV.
