Source author record

Qiong Wang

Qiong Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Computer Vision, cond-mat.mtrl-sci, Multimedia, cond-mat.mes-hall, math.PR, physics.optics, quant-ph, Tissues and Organs

Catalog footprint

What is connected

10 works

8 topics

4 close collaborators

preprint · 2023 · arXiv

FECANet: Boosting Few-Shot Semantic Segmentation with Feature-Enhanced Context-Aware Network

Few-shot semantic segmentation is the task of learning to locate each pixel of the novel class in the query image with only a few annotated support images. The current correlation-based methods construct pair-wise feature correlations to establish the many-to-many matching because the typical prototype-based approaches cannot learn fine-grained correspondence relations. However, the existing methods still suffer from the noise contained in naive correlations and the lack of context semantic information in correlations. To alleviate these problems, we propose a Feature-Enhanced Context-Aware Network (FECANet). Specifically, a feature enhancement module is proposed to suppress the matching noise caused by inter-class local similarity and enhance the intra-class relevance in the naive correlation. In addition, we propose a novel correlation reconstruction module that encodes extra correspondence relations between foreground and background and multi-scale context semantic features, significantly helping the encoder capture a reliable matching pattern. Experiments on the PASCAL-$5^i$ and COCO-$20^i$ datasets demonstrate that the proposed FECANet achieves a remarkable improvement over previous state-of-the-art methods.
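As a rough illustration of the pair-wise feature correlation that correlation-based methods start from, the sketch below builds a similarity matrix between query and support feature vectors in plain Python. The function names `cosine` and `correlation_matrix`, the toy feature values, and the choice of cosine similarity as the "naive correlation" are assumptions made for illustration; nothing here is taken from the FECANet implementation.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat an all-zero feature as uncorrelated with everything
    return dot / (norm_u * norm_v)

def correlation_matrix(query_feats, support_feats):
    """Return corr[i][j], the similarity of query position i and support position j."""
    return [[cosine(q, s) for s in support_feats] for q in query_feats]

# Toy example (hypothetical values): 2 query positions, 3 support positions, 2-D features.
query = [[1.0, 0.0], [0.0, 2.0]]
support = [[2.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
corr = correlation_matrix(query, support)
```

Every query position is compared with every support position, which is the many-to-many matching the abstract refers to. The entry for the third support vector shows how a partly similar feature still produces a nonzero response, the kind of matching noise the abstract says a naive correlation contains.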

preprint · 2022 · arXiv

Intra-Modal Constraint Loss For Image-Text Retrieval

Cross-modal retrieval has drawn much attention in both the computer vision and natural language processing domains. With the development of convolutional and recurrent neural networks, the bottleneck of retrieval across image-text modalities is no longer the extraction of image and text features but the learning of an efficient loss function in the embedding space. Many loss functions try to pull pairwise features from heterogeneous modalities closer together. This paper proposes a method for learning a joint embedding of images and texts using an intra-modal constraint loss function to reduce the violation of negative pairs from the same homogeneous modality. Experimental results show that our approach outperforms state-of-the-art bi-directional image-text retrieval methods on the Flickr30K and Microsoft COCO datasets. Our code is publicly available: https://github.com/CanonChen/IMC.

preprint · 2022 · arXiv

Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic Segmentation

Weakly supervised semantic segmentation with only image-level labels aims to reduce annotation costs for the segmentation task. Existing approaches generally leverage class activation maps (CAMs) to locate the object regions for pseudo-label generation. However, CAMs can only discover the most discriminative parts of objects, thus leading to inferior pixel-level pseudo-labels. To address this issue, we propose a saliency guided Inter- and Intra-Class Relation Constrained (I$^2$CRC) framework to assist the expansion of the activated object regions in CAMs. Specifically, we propose a saliency guided class-agnostic distance module to pull the intra-category features closer by aligning features to their class prototypes. Further, we propose a class-specific distance module to push the inter-class features apart and encourage the object region to have a higher activation than the background. Besides strengthening the capability of the classification network to activate more integral object regions in CAMs, we also introduce an object guided label refinement module to make full use of both the segmentation prediction and the initial labels for obtaining superior pseudo-labels. Extensive experiments on the PASCAL VOC 2012 and COCO datasets demonstrate the effectiveness of I$^2$CRC over other state-of-the-art counterparts. The source code, models, and data have been made available at https://github.com/NUST-Machine-Intelligence-Laboratory/I2CRC.

preprint · 2021 · arXiv

Exploiting random lead times for significant inventory cost savings

We study the classical single-item inventory system in which unsatisfied demands are backlogged. Replenishment lead times are random, independent and identically distributed, causing orders to cross in time. We develop a new inventory policy to exploit the implications of lead time randomness and order crossover, and evaluate its performance by asymptotic analysis and simulations. Our policy does not follow the basic principle of the Constant Base Stock (CBS) policy, or more generally, (s,S) and (r,Q) policies, which is to keep the inventory position within a fixed range. Instead, it uses the current inventory level (= inventory-on-hand minus backlog) to set a dynamic target for inventory in-transit, and places orders to follow this target. Our policy includes the CBS policy as a special case, under a particular choice of a policy parameter. We show that our policy can significantly reduce the average inventory cost compared with the CBS policy. Specifically, we prove that if the lead time is exponentially distributed, then under our policy, with properly chosen policy parameters, the expected (absolute) inventory level scales as $o(\sqrt{r})$, as the demand rate $r\to\infty$. In comparison, it is known to scale as $\Theta(\sqrt{r})$ under the CBS policy. In particular, this means that, as $r\to\infty$, the average inventory cost under our policy vanishes in comparison with that under the CBS policy. Furthermore, our simulations show that the advantage of our policy remains substantial under non-exponential lead time distributions, and may even be greater than under the exponential distribution. We also use simulations to compare our policy (GBS) to an optimal policy for some cases where computing the optimal cost is tractable. The results show that our policy removes a majority of the excess costs of the CBS policy over the minimum cost, leading to much smaller optimality gaps.
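The step from the two scaling rates to the vanishing cost ratio can be spelled out as a short limit argument. The symbols $I_{\mathrm{new}}(r)$ and $I_{\mathrm{CBS}}(r)$ (expected absolute inventory level under each policy) and $C_{\mathrm{new}}(r)$, $C_{\mathrm{CBS}}(r)$ (average cost) are notation introduced here, and the linear cost structure is an assumption, not something stated in the abstract.

$$
I_{\mathrm{new}}(r) = o(\sqrt{r}), \qquad I_{\mathrm{CBS}}(r) = \Theta(\sqrt{r}).
$$

The lower bound contained in $\Theta(\sqrt{r})$ gives constants $c > 0$ and $r_0$ with $I_{\mathrm{CBS}}(r) \ge c\sqrt{r}$ for all $r \ge r_0$, so

$$
0 \le \frac{I_{\mathrm{new}}(r)}{I_{\mathrm{CBS}}(r)} \le \frac{I_{\mathrm{new}}(r)}{c\sqrt{r}} \xrightarrow[r\to\infty]{} 0 .
$$

Assuming linear per-unit holding cost $h > 0$ and backlog cost $b > 0$, each average cost satisfies $\min(h,b)\, I(r) \le C(r) \le \max(h,b)\, I(r)$, hence

$$
\frac{C_{\mathrm{new}}(r)}{C_{\mathrm{CBS}}(r)} \le \frac{\max(h,b)}{\min(h,b)} \cdot \frac{I_{\mathrm{new}}(r)}{I_{\mathrm{CBS}}(r)} \xrightarrow[r\to\infty]{} 0 .
$$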

preprint · 2021 · arXiv

Semantically Meaningful Class Prototype Learning for One-Shot Image Semantic Segmentation

One-shot semantic image segmentation aims to segment the object regions for the novel class with only one annotated image. Recent works adopt the episodic training strategy to mimic the expected situation at testing time. However, these existing approaches simulate the test conditions too strictly during the training process, and thus cannot make full use of the given label information. Besides, these approaches mainly focus on the foreground-background target class segmentation setting and only utilize binary mask labels for training. In this paper, we propose to leverage the multi-class label information during the episodic training, which encourages the network to generate more semantically meaningful features for each category. After integrating the target class cues into the query features, we then propose a pyramid feature fusion module to mine the fused features for the final classifier. Furthermore, to take more advantage of the support image-mask pair, we propose a self-prototype guidance branch to support image segmentation. It can constrain the network to generate more compact features and a robust prototype for each semantic class. For inference, we propose a fused prototype guidance branch for the segmentation of the query image. Specifically, we leverage the prediction of the query image to extract the pseudo-prototype and combine it with the initial prototype. Then we utilize the fused prototype to guide the final segmentation of the query image. Extensive experiments demonstrate the superiority of our proposed approach.
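The prototype fusion described for inference can be pictured with a small sketch. The abstract does not say how a prototype is extracted or how the two prototypes are combined, so the mask-weighted averaging in `masked_average`, the equal-weight blend in `fuse_prototypes`, and all toy values below are hypothetical choices, not the authors' method.

```python
def masked_average(features, mask):
    """Mask-weighted average of per-pixel feature vectors (assumed prototype extraction).

    features: list of feature vectors, one per pixel.
    mask: list of weights in [0, 1], one per pixel.
    """
    total = sum(mask)
    if total == 0:
        raise ValueError("mask selects no pixels")
    dim = len(features[0])
    return [sum(m * f[d] for f, m in zip(features, mask)) / total for d in range(dim)]

def fuse_prototypes(initial, pseudo, weight=0.5):
    """Convex combination of the support prototype and the query pseudo-prototype."""
    return [(1.0 - weight) * a + weight * b for a, b in zip(initial, pseudo)]

# Initial prototype from the annotated support image (toy values).
support_feats = [[1.0, 0.0], [3.0, 0.0], [0.0, 5.0]]
support_mask = [1, 1, 0]
initial = masked_average(support_feats, support_mask)

# Pseudo-prototype from a first-pass prediction on the query image (toy values).
query_feats = [[4.0, 2.0], [0.0, 9.0]]
query_pred = [1, 0]
pseudo = masked_average(query_feats, query_pred)

fused = fuse_prototypes(initial, pseudo)
```

With these toy inputs the fused prototype lies halfway between the support prototype and the query pseudo-prototype. The `weight` parameter is a hypothetical knob for how much the query prediction is trusted.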

preprint · 2021 · arXiv

Texture Formation in Polycrystalline Thin Films of All-Inorganic Lead Halide Perovskite

Controlling grain orientations within polycrystalline all-inorganic halide perovskite solar cells can help increase conversion efficiencies toward their thermodynamic limits; however, the forces governing texture formation are ambiguous. Using synchrotron X-ray diffraction, we report meso-structure formation within polycrystalline CsPbI2.85Br0.15 powders as they cool from a high-temperature cubic perovskite (α-phase). Tetragonal distortions (β-phase) trigger preferential crystallographic alignment within polycrystalline ensembles, a feature we suggest is coordinated across multiple neighboring grains via interfacial forces that select for certain lattice distortions over others. External anisotropy is then imposed on polycrystalline thin films of orthorhombic (γ-phase) CsPbI3-xBrx perovskite via substrate clamping, revealing two fundamental uniaxial texture formations: (i) I-rich films possess orthorhombic-like texture (<100> out-of-plane; <010> and <001> in-plane), while (ii) Br-rich films form tetragonal-like texture (<110> out-of-plane; <1-10> and <001> in-plane). In contrast to relatively uninfluential factors like the choice of substrate, film thickness and annealing temperature, Br incorporation modifies the γ-CsPbI3-xBrx crystal structure by reducing the orthorhombic lattice distortion (making it more tetragonal-like) and governs the formation of the different, energetically favored textures within polycrystalline thin films.

preprint · 2020 · arXiv

Globular structure of the hypermineralized tissue in human femoral neck

Bone becomes more fragile with ageing. Among many structural changes, a thin layer of highly mineralized and brittle tissue covers part of the external surface of the thin femoral neck cortex in older people and has been proposed to increase hip fragility. However, there have been very limited reports on this hypermineralized tissue in the femoral neck, especially on its ultrastructure. Such information is critical to understanding both the mineralization process and its contributions to hip fracture. Here, we use multiple advanced techniques to characterize the ultrastructure of the hypermineralized tissue in the neck across various length scales. Synchrotron radiation micro-CT found larger but less densely distributed cellular lacunae in hypermineralized tissue than in lamellar bone. When examined under FIB-SEM, the hypermineralized tissue was mainly composed of mineral globules with sizes varying from submicron to a few microns. Nano-sized channels were present within the mineral globules and oriented with the surrounding organic matrix. Transmission electron microscopy showed that the apatite inside the globules was poorly crystalline, while that at the boundaries between the globules had a well-defined lattice structure with crystallinity similar to the apatite mineral in lamellar bone. No preferred mineral orientation was observed either inside each globule or at the boundaries. Collectively, we conclude based on these new observations that the hypermineralized tissue is non-lamellar and has less organized mineral, which may contribute to the high brittleness of the tissue.

preprint · 2020 · arXiv

Instance Shadow Detection

Instance shadow detection is a brand new problem, aiming to find shadow instances paired with object instances. To approach it, we first prepare a new dataset called SOBA, named after Shadow-OBject Association, with 3,623 pairs of shadow and object instances in 1,000 photos, each with individual labeled masks. Second, we design LISA, named after Light-guided Instance Shadow-object Association, an end-to-end framework to automatically predict the shadow and object instances, together with the shadow-object associations and light direction. Then, we pair up the predicted shadow and object instances, and match them with the predicted shadow-object associations to generate the final results. In our evaluations, we formulate a new metric named the shadow-object average precision to measure the performance of our results. Further, we conduct various experiments and demonstrate our method's applicability to light direction estimation and photo editing.

preprint · 2017 · arXiv

Image denoising using group sparsity residual and external nonlocal self-similarity prior

Nonlocal image representation has been successfully used in many image-related inverse problems including denoising, deblurring and deblocking. However, a majority of reconstruction methods only exploit the nonlocal self-similarity (NSS) prior of the degraded observation image, so it is very challenging to reconstruct the latent clean image. In this paper we propose a novel model for image denoising via group sparsity residual and external NSS prior. To boost the performance of image denoising, the concept of group sparsity residual is proposed, and thus the problem of image denoising is transformed into one that reduces the group sparsity residual. Because the groups contain a large amount of NSS information of natural images, we obtain a good estimation of the group sparse coefficients of the original image by the external NSS prior based on Gaussian mixture model (GMM) learning, and the group sparse coefficients of the noisy image are used to approximate the estimation. Experimental results have demonstrated that the proposed method not only outperforms many state-of-the-art methods, but also delivers the best qualitative denoising results with finer details and fewer ringing artifacts.

preprint · 2015 · arXiv

Precision measurement of the environmental temperature by tunable double optomechanically induced transparency with a squeezed field

A tunable double optomechanically induced transparency (OMIT) with a squeezed field is investigated in a system consisting of an optomechanical cavity coupled to a charged nanomechanical resonator via Coulomb interaction. Such a double OMIT can be achieved by adjusting the strength of the Coulomb interaction, and observed even with a single-photon squeezed field at finite temperature. Since it is robust against the cavity decay, but very sensitive to some parameters, such as the environmental temperature, the model under our consideration can be applied as a quantum thermometer for precision measurement of the environmental temperature within the reach of current techniques.
