Source author record

Shiyi Wang

Shiyi Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

physics.optics Computer Vision Machine Learning Artificial Intelligence eess.IV Social and Information Networks

Catalog footprint

What is connected

8works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

A Novel Automated Classification and Segmentation for COVID-19 using 3D CT Scans

Medical image classification and segmentation based on deep learning (DL) are emergency research topics for diagnosing variant viruses of the current COVID-19 situation. In COVID-19 computed tomography (CT) images of the lungs, ground glass turbidity is the most common finding that requires specialist diagnosis. Based on this situation, some researchers propose the relevant DL models which can replace professional diagnostic specialists in clinics when lacking expertise. However, although DL methods have a stunning performance in medical image processing, the limited datasets can be a challenge in developing the accuracy of diagnosis at the human level. In addition, deep learning algorithms face the challenge of classifying and segmenting medical images in three or even multiple dimensions and maintaining high accuracy rates. Consequently, with a guaranteed high level of accuracy, our model can classify the patients' CT images into three types: Normal, Pneumonia and COVID. Subsequently, two datasets are used for segmentation, one of the datasets even has only a limited amount of data (20 cases). Our system combined the classification model and the segmentation model together, a fully integrated diagnostic model was built on the basis of ResNet50 and 3D U-Net algorithm. By feeding with different datasets, the COVID image segmentation of the infected area will be carried out according to classification results. Our model achieves 94.52% accuracy in the classification of lung lesions by 3 types: COVID, Pneumonia and Normal. For future medical use, embedding the model into the medical facilities might be an efficient way of assisting or substituting doctors with diagnoses, therefore, a broader range of the problem of variant viruses in the COVID-19 situation may also be successfully solved.

preprint2022arXiv

LP-UIT: A Multimodal Framework for Link Prediction in Social Networks

With the rapid information explosion on online social network sites (SNSs), it becomes difficult for users to seek new friends or broaden their social networks in an efficient way. Link prediction, which can effectively conquer this problem, has thus attracted wide attention. Previous methods on link prediction fail to comprehensively capture the factors leading to new link formation: 1) few models have considered the varied impacts of users' short-term and long-term interests on link prediction. Besides, they fail to jointly model the influence from social influence and "weak links"; 2) considering that different factors should be derived from information sources of different modalities, there is a lack of effective multi-modal framework for link prediction. In this view, we propose a novel multi-modal framework for link prediction (referred as LP-UIT) which fuses a comprehensive set of features (i.e., user information and topological features) extracted from multi-modal information (i.e., textual information, graph information, and numerical information). Specifically, we adopt graph convolutional network to process the network information to capture topological features, employ natural language processing techniques (i.e., TF-IDF and word2Vec) to model users' short-term and long-term interests, and identify social influence and "weak links" from numerical features. We further use an attention mechanism to model the relationship between textual and topological features. Finally, a multi-layer perceptron (MLP) is designed to combine the representations from three modalities for link prediction. Extensive experiments on two real-world datasets demonstrate the superiority of LP-UIT over the state-of-the-art methods.

preprint2020arXiv

Detailed 2D-3D Joint Representation for Human-Object Interaction

Human-Object Interaction (HOI) detection lies at the core of action understanding. Besides 2D information such as human/object appearance and locations, 3D pose is also usually utilized in HOI learning since its view-independence. However, rough 3D body joints just carry sparse body information and are not sufficient to understand complex interactions. Thus, we need detailed 3D body shape to go further. Meanwhile, the interacted object in 3D is also not fully studied in HOI learning. In light of these, we propose a detailed 2D-3D joint representation learning method. First, we utilize the single-view human body capture method to obtain detailed 3D body, face and hand shapes. Next, we estimate the 3D object location and size with reference to the 2D human-object spatial configuration and object category priors. Finally, a joint learning framework and cross-modal consistency tasks are proposed to learn the joint HOI representation. To better evaluate the 2D ambiguity processing capacity of models, we propose a new benchmark named Ambiguous-HOI consisting of hard ambiguous images. Extensive experiments in large-scale HOI benchmark and Ambiguous-HOI show impressive effectiveness of our method. Code and data are available at https://github.com/DirtyHarryLYL/DJ-RN.

preprint2020arXiv

PaStaNet: Toward Human Activity Knowledge Engine

Existing image-based activity understanding methods mainly adopt direct mapping, i.e. from image to activity concepts, which may encounter performance bottleneck since the huge gap. In light of this, we propose a new path: infer human part states first and then reason out the activities based on part-level semantics. Human Body Part States (PaSta) are fine-grained action semantic tokens, e.g. <hand, hold, something>, which can compose the activities and help us step toward human activity knowledge engine. To fully utilize the power of PaSta, we build a large-scale knowledge base PaStaNet, which contains 7M+ PaSta annotations. And two corresponding models are proposed: first, we design a model named Activity2Vec to extract PaSta features, which aim to be general representations for various activities. Second, we use a PaSta-based Reasoning method to infer activities. Promoted by PaStaNet, our method achieves significant improvements, e.g. 6.4 and 13.9 mAP on full and one-shot sets of HICO in supervised learning, and 3.2 and 4.2 mAP on V-COCO and images-based AVA in transfer learning. Code and data are available at http://hake-mvig.cn/.

preprint2016arXiv

Integrated Broadband Bowtie Antenna on Transparent Silica Substrate

The bowtie antenna is a topic of growing interest in recent years. In this paper, we design, fabricate, and characterize a modified gold bowtie antenna integrated on a transparent silica substrate. The bowtie antenna is designed with broad RF bandwidth to cover the X-band in the electromagnetic spectrum. We numerically investigate the antenna characteristics, specifically its resonant frequency and enhancement factor. Our designed bowtie antenna provides a strong broadband electric field enhancement in its feed gap. Taking advantage of the low-k silica substrate, high enhancement factor can be achieved without the unwanted reflection and scattering from the backside silicon handle which is the issue of using an SOI substrate. We simulate the dependence of resonance frequency on bowtie geometry, and verify the simulation results through experimental investigation, by fabricating different sets of bowtie antennas on silica substrates and then measuring their resonance frequencies. In addition, the far-field radiation pattern of the bowtie antenna is measured, and it shows dipole-like characteristics with large beam width. Such a broadband antenna will be useful for a myriad of applications, ranging from photonic electromagnetic wave sensing to wireless communications.

preprint2015arXiv

Antenna-coupled silicon-organic hybrid integrated photonic crystal modulator for broadband electromagnetic wave detection

In this work, we design, fabricate and characterize a compact, broadband and highly sensitive integrated photonic electromagnetic field sensor based on a silicon-organic hybrid modulator driven by a bowtie antenna. The large electro-optic (EO) coefficient of organic polymer, the slow-light effects in the silicon slot photonic crystal waveguide (PCW), and the broadband field enhancement provided by the bowtie antenna, are all combined to enhance the interaction of microwaves and optical waves, enabling a high EO modulation efficiency and thus a high sensitivity. The modulator is experimentally demonstrated with a record-high effective in-device EO modulation efficiency of r33=1230pm/V. Modulation response up to 40GHz is measured, with a 3-dB bandwidth of 11GHz. The slot PCW has an interaction length of 300um, and the bowtie antenna has an area smaller than 1cm2. The bowtie antenna in the device is experimentally demonstrated to have a broadband characteristics with a central resonance frequency of 10GHz, as well as a large beam width which enables the detection of electromagnetic waves from a large range of incident angles. The sensor is experimentally demonstrated with a minimum detectable electromagnetic power density of 8.4mW/m2 at 8.4GHz, corresponding to a minimum detectable electric field of 2.5V/m and an ultra-high sensitivity of 0.000027V/m Hz^-1/2 ever demonstrated. To the best of our knowledge, this is the first silicon-organic hybrid device and also the first PCW device used for the photonic detection of electromagnetic waves. Finally, we propose some future work, including a Teraherz wave sensor based on antenna-coupled electro-optic polymer filled plasmonic slot waveguide, as well as a fully packaged and tailgated device.

preprint2015arXiv

Integrated broadband bowtie antenna on transparent substrate

The bowtie antenna is a topic of growing interest in recent years. In this paper, we design, fabricate, and characterize a modified gold bowtie antenna integrated on a transparent glass substrate. We numerically investigate the antenna characteristics, specifically its resonant frequency and enhancement factor. We simulate the dependence of resonance frequency on bowtie geometry, and verify the simulation results through experimental investigation, by fabricating different sets of bowtie antennas on glass substrates utilizing CMOS compatible processes and measuring their resonance frequencies. Our designed bowtie antenna provides a strong broadband electric field enhancement in its feed gap. The far-field radiation pattern of the bowtie antenna is measured, and it shows dipole-like characteristics with large beam width. Such a broadband antenna will be useful for a myriad of applications, ranging from wireless communications to electromagnetic wave detection.

preprint2014arXiv

Electric field sensor based on electro-optic polymer refilled silicon slot photonic crystal waveguide coupled with bowtie antenna

We present the design of a compact and highly sensitive electric field sensor based on a bowtie antenna-coupled slot photonic crystal waveguide (PCW). An electro-optic (EO) polymer with a large EO coefficient, r33=100pm/V, is used to refill the PCW slot and air holes. Bowtie-shaped electrodes are used as both poling electrodes and as receiving antenna. The slow-light effect in the PCW is used to increase the effective in-device r33>1000pm/V. The slot PCW is designed for low-dispersion slow light propagation, maximum poling efficiency as well as optical mode confinement inside the EO polymer. The antenna is designed for operation at 10GHz.

Shiyi Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

8 published item(s)

A Novel Automated Classification and Segmentation for COVID-19 using 3D CT Scans

LP-UIT: A Multimodal Framework for Link Prediction in Social Networks

Detailed 2D-3D Joint Representation for Human-Object Interaction

PaStaNet: Toward Human Activity Knowledge Engine

Integrated Broadband Bowtie Antenna on Transparent Silica Substrate

Antenna-coupled silicon-organic hybrid integrated photonic crystal modulator for broadband electromagnetic wave detection

Integrated broadband bowtie antenna on transparent substrate

Electric field sensor based on electro-optic polymer refilled silicon slot photonic crystal waveguide coupled with bowtie antenna