Source author record

Xiaohong Wang

Xiaohong Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision eess.IV Machine Learning Populations and Evolution

Catalog footprint

What is connected

5works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Cross-modal Local Shortest Path and Global Enhancement for Visible-Thermal Person Re-Identification

In addition to considering the recognition difficulty caused by human posture and occlusion, it is also necessary to solve the modal differences caused by different imaging systems in the Visible-Thermal cross-modal person re-identification (VT-ReID) task. In this paper,we propose the Cross-modal Local Shortest Path and Global Enhancement (CM-LSP-GE) modules,a two-stream network based on joint learning of local and global features. The core idea of our paper is to use local feature alignment to solve occlusion problem, and to solve modal difference by strengthening global feature. Firstly, Attention-based two-stream ResNet network is designed to extract dual-modality features and map to a unified feature space. Then, to solve the cross-modal person pose and occlusion problems, the image are cut horizontally into several equal parts to obtain local features and the shortest path in local features between two graphs is used to achieve the fine-grained local feature alignment. Thirdly, a batch normalization enhancement module applies global features to enhance strategy, resulting in difference enhancement between different classes. The multi granularity loss fusion strategy further improves the performance of the algorithm. Finally, joint learning mechanism of local and global features is used to improve cross-modal person re-identification accuracy. The experimental results on two typical datasets show that our model is obviously superior to the most state-of-the-art methods. Especially, on SYSU-MM01 datasets, our model can achieve a gain of 2.89%and 7.96% in all search term of Rank-1 and mAP. The source code will be released soon.

preprint2020arXiv

Bi-directional Dermoscopic Feature Learning and Multi-scale Consistent Decision Fusion for Skin Lesion Segmentation

Accurate segmentation of skin lesion from dermoscopic images is a crucial part of computer-aided diagnosis of melanoma. It is challenging due to the fact that dermoscopic images from different patients have non-negligible lesion variation, which causes difficulties in anatomical structure learning and consistent skin lesion delineation. In this paper, we propose a novel bi-directional dermoscopic feature learning (biDFL) framework to model the complex correlation between skin lesions and their informative context. By controlling feature information passing through two complementary directions, a substantially rich and discriminative feature representation is achieved. Specifically, we place biDFL module on the top of a CNN network to enhance high-level parsing performance. Furthermore, we propose a multi-scale consistent decision fusion (mCDF) that is capable of selectively focusing on the informative decisions generated from multiple classification layers. By analysis of the consistency of the decision at each position, mCDF automatically adjusts the reliability of decisions and thus allows a more insightful skin lesion delineation. The comprehensive experimental results show the effectiveness of the proposed method on skin lesion segmentation, achieving state-of-the-art performance consistently on two publicly available dermoscopic image databases.

preprint2020arXiv

The Role of Nonlinear Relapse on Contagion Amongst Drinking Communities

Relapse, the recurrence of a disorder following a symptomatic remission, is a frequent outcome in substance abuse disorders. Some of our prior results suggested that relapse, in the context of abusive drinking, is likely an "unbeatable" force as long as recovered individuals continue to interact in the environments that lead to and/or reinforce the persistence of abusive drinking behaviors. Our earlier results were obtained via a deterministic model that ignored differences between individuals, that is, in a rather simple "social" setting. In this paper, we address the role of relapse on drinking dynamics but use models that incorporate the role of "chance", or a high degree of "social" heterogeneity, or both. Our focus is primarily on situations where relapse rates are high. We first use a Markov chain model to simulate the effect of relapse on drinking dynamics. These simulations reinforce the conclusions obtained before, with the usual caveats that arise when the outcomes of deterministic and stochastic models are compared. However, the simulation results generated from stochastic realizations of an "equivalent" drinking process in populations "living" in small world networks, parameterized via a disorder parameter $p$, show that there is no social structure within this family capable of reducing the impact of high relapse rates on drinking prevalence, even if we drastically limit the interactions between individuals ($p\approx 0$).

preprint2020arXiv

U-Net Using Stacked Dilated Convolutions for Medical Image Segmentation

This paper proposes a novel U-Net variant using stacked dilated convolutions for medical image segmentation (SDU-Net). SDU-Net adopts the architecture of vanilla U-Net with modifications in the encoder and decoder operations (an operation indicates all the processing for feature maps of the same resolution). Unlike vanilla U-Net which incorporates two standard convolutions in each encoder/decoder operation, SDU-Net uses one standard convolution followed by multiple dilated convolutions and concatenates all dilated convolution outputs as input to the next operation. Experiments showed that SDU-Net outperformed vanilla U-Net, attention U-Net (AttU-Net), and recurrent residual U-Net (R2U-Net) in all four tested segmentation tasks while using parameters around 40% of vanilla U-Net's, 17% of AttU-Net's, and 15% of R2U-Net's.

preprint2020arXiv

Weakly Supervised Context Encoder using DICOM metadata in Ultrasound Imaging

Modern deep learning algorithms geared towards clinical adaption rely on a significant amount of high fidelity labeled data. Low-resource settings pose challenges like acquiring high fidelity data and becomes the bottleneck for developing artificial intelligence applications. Ultrasound images, stored in Digital Imaging and Communication in Medicine (DICOM) format, have additional metadata data corresponding to ultrasound image parameters and medical exams. In this work, we leverage DICOM metadata from ultrasound images to help learn representations of the ultrasound image. We demonstrate that the proposed method outperforms the non-metadata based approaches across different downstream tasks.

Xiaohong Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

5 published item(s)

Cross-modal Local Shortest Path and Global Enhancement for Visible-Thermal Person Re-Identification

Bi-directional Dermoscopic Feature Learning and Multi-scale Consistent Decision Fusion for Skin Lesion Segmentation

The Role of Nonlinear Relapse on Contagion Amongst Drinking Communities

U-Net Using Stacked Dilated Convolutions for Medical Image Segmentation

Weakly Supervised Context Encoder using DICOM metadata in Ultrasound Imaging