Researcher profile

Wenqing Zhang

Wenqing Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
11works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

11 published item(s)

preprint2023arXiv

Fixed-point iterative algorithm for SVI model

The stochastic volatility inspired (SVI) model is widely used to fit the implied variance smile. Presently, most optimizer algorithms for the SVI model have a strong dependence on the input starting point. In this study, we develop an efficient iterative algorithm for the SVI model based on a fixed-point and least-square optimizer. Furthermore, we present the convergence results in certain situations for this novel iterative algorithm. Compared with the quasi-explicit SVI method, we demonstrate the advantages of the fixed-point iterative algorithm using simulation and market data.

preprint2022arXiv

1st Place Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: End-to-End Recognition of Out of Vocabulary Words

Scene text recognition has attracted increasing interest in recent years due to its wide range of applications in multilingual translation, autonomous driving, etc. In this report, we describe our solution to the Out of Vocabulary Scene Text Understanding (OOV-ST) Challenge, which aims to extract out-of-vocabulary (OOV) words from natural scene images. Our oCLIP-based model achieves 28.59\% in h-mean which ranks 1st in end-to-end OOV word recognition track of OOV Challenge in ECCV2022 TiE Workshop.

preprint2022arXiv

FedOCR: Communication-Efficient Federated Learning for Scene Text Recognition

While scene text recognition techniques have been widely used in commercial applications, data privacy has rarely been taken into account by this research community. Most existing algorithms have assumed a set of shared or centralized training data. However, in practice, data may be distributed on different local devices that can not be centralized to share due to the privacy restrictions. In this paper, we study how to make use of decentralized datasets for training a robust scene text recognizer while keeping them stay on local devices. To the best of our knowledge, we propose the first framework leveraging federated learning for scene text recognition, which is trained with decentralized datasets collaboratively. Hence we name it FedOCR. To make FedCOR fairly suitable to be deployed on end devices, we make two improvements including using lightweight models and hashing techniques. We argue that both are crucial for FedOCR in terms of the communication efficiency of federated learning. The simulations on decentralized datasets show that the proposed FedOCR achieves competitive results to the models that are trained with centralized data, with fewer communication costs and higher-level privacy-preserving.

preprint2022arXiv

Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection

Recently, transformer-based methods have achieved promising progresses in object detection, as they can eliminate the post-processes like NMS and enrich the deep representations. However, these methods cannot well cope with scene text due to its extreme variance of scales and aspect ratios. In this paper, we present a simple yet effective transformer-based architecture for scene text detection. Different from previous approaches that learn robust deep representations of scene text in a holistic manner, our method performs scene text detection based on a few representative features, which avoids the disturbance by background and reduces the computational cost. Specifically, we first select a few representative features at all scales that are highly relevant to foreground text. Then, we adopt a transformer for modeling the relationship of the sampled features, which effectively divides them into reasonable groups. As each feature group corresponds to a text instance, its bounding box can be easily obtained without any post-processing operation. Using the basic feature pyramid network for feature extraction, our method consistently achieves state-of-the-art results on several popular datasets for scene text detection.

preprint2022arXiv

Giant transverse and longitudinal magneto-thermoelectric effect in polycrystalline nodal-line semimetal Mg3Bi2

Topological semimetals provide new opportunities for exploring new thermoelectric phenomena, because of their exotic and nontrivial electronic structure topology around the Fermi surface. In this study, we report on the discovery of giant transverse and longitudinal magneto-thermoelectric (MTE) effects in Mg3Bi2, which is predicted to be a type-II nodal-line semimetal in the absence of spin-orbit coupling (SOC). The maximum transverse power factor is 2182 μWm^{-1}K^{-2} at 13.5 K and 6 Tesla. The longitudinal power factor reaches up to 3043μWm^{-1}K^{-2} at 15 K and 13 Tesla, which is 20 times higher than in a zero-strength magnetic field and is also comparable to state-of-the-art MTE materials. By compensating Mg loss in the Mg-rich conditions for turning carrier concentration, the sample obtained in this work shows a large linear non-saturating magnetoresistance of 940% under a field of 14 Tesla. This is a two-orders-of-magnitude increase with respect to the normal Mg-deficiency Mg3Bi2 sample. Using density functional calculations, we attribute the underlying mechanism to the parent nodal-line electronic structure without SOC and the anisotropic Fermi surface shape with SOC, highlighting the essential role of high carrier mobility and open electron orbits in moment space. Our work offers a new avenue toward highly efficient thermoelectric materials through the design of Fermi surfaces with special topological electronic structures in novel quantum materials.

preprint2022arXiv

Prediction of protected band edge states and dielectric tunable quasiparticle and excitonic properties of monolayer MoSi$_2$N$_4$

The electronic structure of two-dimensional (2D) materials are inherently prone to environmental perturbations, which may pose significant challenges to their applications in electronic or optoelectronic devices. A 2D material couples with its environment through two mechanisms: local chemical coupling and nonlocal dielectric screening effects. The local chemical coupling is often difficult to predict or control experimentally. Nonlocal dielectric screening, on the other hand, can be tuned by choosing the substrates or layer thickness in a controllable manner. Therefore, a compelling 2D electronic material should offer band edge states that are robust against local chemical coupling effects. Here it is demonstrated that the recently synthesized MoSi$_2$N$_4$ is an ideal 2D semiconductor with robust band edge states protected from capricious environmental chemical coupling effects. Detailed many-body perturbation theory calculations are carried out to illustrate how the band edge states of MoSi$_2$N$_4$ are shielded from the direct chemical coupling effects, but its quasiparticle and excitonic properties can be modulated through the nonlocal dielectric screening effects. This unique property, together with the moderate band gap and the thermodynamic and mechanical stability of this material, paves the way for a range of applications of MoSi$_2$N$_4$ in areas including energy, 2D electronics, and optoelectronics.

preprint2022arXiv

Runner-Up Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: Cropped Word Recognition

This report presents our 2nd place solution to ECCV 2022 challenge on Out-of-Vocabulary Scene Text Understanding (OOV-ST) : Cropped Word Recognition. This challenge is held in the context of ECCV 2022 workshop on Text in Everything (TiE), which aims to extract out-of-vocabulary words from natural scene images. In the competition, we first pre-train SCATTER on the synthetic datasets, then fine-tune the model on the training set with data augmentations. Meanwhile, two additional models are trained specifically for long and vertical texts. Finally, we combine the output from different models with different layers, different backbones, and different seeds as the final results. Our solution achieves a word accuracy of 59.45\% when considering out-of-vocabulary words only.

preprint2022arXiv

SeqFormer: Sequential Transformer for Video Instance Segmentation

In this work, we present SeqFormer for video instance segmentation. SeqFormer follows the principle of vision transformer that models instance relationships among video frames. Nevertheless, we observe that a stand-alone instance query suffices for capturing a time sequence of instances in a video, but attention mechanisms shall be done with each frame independently. To achieve this, SeqFormer locates an instance in each frame and aggregates temporal information to learn a powerful representation of a video-level instance, which is used to predict the mask sequences on each frame dynamically. Instance tracking is achieved naturally without tracking branches or post-processing. On YouTube-VIS, SeqFormer achieves 47.4 AP with a ResNet-50 backbone and 49.0 AP with a ResNet-101 backbone without bells and whistles. Such achievement significantly exceeds the previous state-of-the-art performance by 4.6 and 4.4, respectively. In addition, integrated with the recently-proposed Swin transformer, SeqFormer achieves a much higher AP of 59.3. We hope SeqFormer could be a strong baseline that fosters future research in video instance segmentation, and in the meantime, advances this field with a more robust, accurate, neat model. The code is available at https://github.com/wjf5203/SeqFormer.

preprint2022arXiv

Solid-state Janus nanoprecipitation enables amorphous-like heat conduction in crystalline Mg3Sb2-based thermoelectric materials

Solid-state precipitation can be used to tailor materials properties, ranging from ferromagnets and catalysts to mechanical strengthening and energy storage. Thermoelectric properties can be modified by precipitation to enhance phonon scattering while retaining charge-carrier transmission. Here, we uncover unconventional dual Janus-type nanoprecipitates in Mg3Sb1.5Bi0.5 formed by side-by-side Bi- and Ge-rich appendages, in contrast to separate nanoprecipitate formation. These Janus nanoprecipitates result from local co-melting of Bi and Ge during sintering, enabling an amorphous-like lattice thermal conductivity. A precipitate size effect on phonon scattering is observed due to the balance between alloy-disorder and nanoprecipitate scattering. The thermoelectric figure-of-merit ZT reaches 0.6 near room temperature and 1.6 at 773 K. The Janus nanoprecipitation can be introduced into other materials and may act as a general property-tailoring mechanism.

preprint2020arXiv

Combined sub-sampling and analytical integration for efficient large-scale $GW$ calculations for 2D systems

Accurate and efficient predictions of the quasiparticle properties of complex materials remain a major challenge due to the convergence issue and the unfavorable scaling of the computational cost with respect to the system size. Quasiparticle $GW$ calculations for two dimensional (2D) materials are especially difficult. The unusual analytical behaviors of the dielectric screening and the electron self-energy of 2D materials make the conventional Brillouin zone (BZ) integration approach rather inefficient and require an extremely dense $k$-grid to properly converge the calculated quasiparticle energies. In this work, we present a combined non-uniform sub-sampling and analytical integration method that can drastically improve the efficiency of the BZ integration in 2D $GW$ calculations. Our work is distinguished from previous work in that, instead of focusing on the intricate dielectric matrix or the screened Coulomb interaction matrix, we exploit the analytical behavior of various terms of the convolved self-energy $Σ(\mathbf{q})$ in the small $\mathbf{q}$ limit. This method, when combined with another accelerated $GW$ method that we developed recently, can drastically speed-up (by over three orders of magnitude) $GW$ calculations for 2D materials. Our method allows fully converged $GW$ calculations for complex 2D systems at a fraction of computational cost, facilitating future high throughput screening of the quasiparticle properties of 2D semiconductors for various applications. To demonstrate the capability and performance of our new method, we have carried out fully converged $GW$ calculations for monolayer C$_2$N, a recently discovered 2D material with a large unit cell, and investigate its quasiparticle band structure in detail.

preprint2020arXiv

Remarkable band gap renormalization via dimensionality of the layered material C3B

Layer-dependent electronic and structural properties of emerging graphitic carbon boron compound C3B are investigated using both density functional theory and the GW approximation. We discover that, in contrast to a moderate quasiparticle band gap of 2.55 eV for monolayer C3B, the calculated quasiparticle band gap of perfectly stacked bulk phase C3B is as small as 0.17 eV. Therefore, our results suggest that layered material C3B exhibits a remarkably large band gap renormalization of over 2.3 eV due to the interlayer coupling and screening effects, providing a single material with an extraordinary band gap tunability. The quasiparticle band gap of monolayer C3B is also over 1.0 eV larger than that of C3N, a closely related two-dimensional semiconductor. Detailed inspections of the near-edge electronic states reveal that the conduction and valence band edges of C3B are formed by out-of-plane and in-plane electronic states, respectively, suggesting an interesting possibility of tuning the band edges of such layered material separately by modulating the in-plane and out-of-plane interactions.