Source author record

Zihao Yu

Zihao Yu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Computer Vision; Distributed, Parallel, and Cluster Computing; Networking and Internet Architecture; physics.app-ph; physics.optics

Catalog footprint

What is connected

4 works

5 topics

4 close collaborators


Preprint, 2024, arXiv

Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference

Due to the recent success of diffusion models, text-to-image generation is becoming increasingly popular and has found a wide range of applications. Among them, text-to-image editing, or continuous text-to-image generation, attracts considerable attention and can potentially improve the quality of generated images. Users often want to slightly edit a generated image by making minor modifications to their input textual descriptions over several rounds of diffusion inference. However, such an image editing process suffers from the low inference efficiency of many existing diffusion models, even with GPU accelerators. To solve this problem, we introduce Fast Image Semantically Edit (FISEdit), a cache-enabled sparse diffusion model inference engine for efficient text-to-image editing. The key intuition behind our approach is to utilize the semantic mapping between the minor modifications to the input text and the affected regions of the output image. For each text editing step, FISEdit can automatically identify the affected image regions and utilize the cached feature maps of the unchanged regions to accelerate the inference process. Extensive empirical results show that FISEdit can be $3.4\times$ and $4.4\times$ faster than existing methods on NVIDIA TITAN RTX and A100 GPUs, respectively, and even generates more satisfactory images.
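The caching idea in this abstract can be illustrated with a toy sketch. It is not FISEdit's implementation: the abstract does not describe how affected regions are detected or how the diffusion model is sparsified. Here the "feature map" is a small grid of numbers, the affected mask is simply the set of cells whose input changed, and the names `diff_mask`, `sparse_update` and `compute` are hypothetical.

```python
def diff_mask(old_inputs, new_inputs):
    """Mark cells whose input changed between two rounds (assumed stand-in
    for the automatic affected-region detection the abstract mentions)."""
    return [
        [old != new for old, new in zip(old_row, new_row)]
        for old_row, new_row in zip(old_inputs, new_inputs)
    ]


def sparse_update(cached, inputs, mask, compute):
    """Recompute features only where mask is True; reuse cached values elsewhere.

    Returns the updated feature grid and the number of cells recomputed.
    """
    out = [row[:] for row in cached]  # start from the cached feature map
    recomputed = 0
    for r, mask_row in enumerate(mask):
        for c, affected in enumerate(mask_row):
            if affected:
                out[r][c] = compute(inputs[r][c])
                recomputed += 1
    return out, recomputed


def compute(x):
    """Hypothetical per-cell feature function standing in for expensive inference."""
    return x * x


old_inputs = [[1, 2], [3, 4]]
cached = [[compute(x) for x in row] for row in old_inputs]

new_inputs = [[1, 2], [3, 5]]  # only one cell differs after the "edit"
mask = diff_mask(old_inputs, new_inputs)
features, recomputed = sparse_update(cached, new_inputs, mask, compute)
```

In this toy setting only one of the four cells is recomputed. The speedups reported in the abstract come from the authors' own engine and cannot be inferred from this sketch.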

Preprint, 2023, arXiv

Protected Transverse Electric Waves in Topological Dielectric Waveguides

Waveguides are fundamental components in communication systems. However, they suffer from reflection and scattering losses at sharp routes or defects. The breakthrough in developing topological photonic crystals (PhCs) provides promising solutions for robust signal transmission. In this work, we propose a new mechanism for protecting wave-guiding modes by decorating the boundaries of a conventional waveguide with valley-Hall PhCs. This special layout enables the robust propagation of conventional transverse electric waves against defects and bends. Moreover, the proposed waveguide is compatible with the substrate integrated waveguide (SIW). Highly efficient mode conversion from the SIW to the proposed waveguide is achievable. By applying the idea of topology to conventional waveguides, we provide a powerful and practical tool that can largely improve the performance of microwave and millimeter-wave integrated circuits while preserving the features of wave-guiding modes.

Preprint, 2022, arXiv

Cross-Technology Communication for the Internet of Things: A Survey

The ever-developing Internet of Things (IoT) has brought a boom in wireless sensing and control applications. In many scenarios, different wireless technologies coexist in the shared frequency medium as well as in the same physical space. Such wireless coexistence may lead to serious cross-technology interference (CTI) problems, e.g. channel competition, signal collision, and throughput degradation. Beyond traditional methods such as interference avoidance, tolerance, and concurrency mechanisms, direct and timely information exchange among heterogeneous devices is a fundamental requirement to ensure the usability, interoperability, and reliability of the IoT. Cross-Technology Communication (CTC), which aims at directly exchanging data among heterogeneous devices that follow different standards, has therefore become a hot topic in both academia and industry. This paper comprehensively summarizes the CTC techniques and reveals that the key challenge for CTC lies in the heterogeneity of IoT devices, including the incompatibility of technical standards and the asymmetry of connection capability. Based on this finding, we present a taxonomy of the existing CTC works (packet-level CTCs and physical-level CTCs) and compare the existing CTC techniques in terms of throughput, reliability, hardware modification, and concurrency.

Preprint, 2015, arXiv

Design and optimization of DBSCAN Algorithm based on CUDA

DBSCAN is a classic algorithm for data clustering that is widely used in many fields. However, as data scales grow much larger than before, the traditional serial algorithm cannot meet performance requirements. Recently, parallel computing based on CUDA has developed very quickly and offers great advantages for big data. This paper summarizes previously proposed algorithms and improves the performance of the original DBSCAN algorithm by using CUDA and parallel computing. Compared with other algorithms, the algorithm uses shared memory as much as possible, and it has very good scalability. The new version of DBSCAN is tested on a data set. Finally, we analyze the results and conclude that our algorithm is approximately 97 times faster than the serial version.
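For reference, the serial baseline that this abstract compares against can be sketched as textbook DBSCAN. This is a minimal illustration, not the authors' code: the parameter names `eps` and `min_pts`, the brute-force `region_query`, and the sample points are assumptions, and the CUDA shared-memory design is not reproduced here.

```python
from collections import deque
from math import dist

NOISE = -1
UNVISITED = None


def region_query(points, i, eps):
    """Return indices of all points within eps of points[i], including i itself."""
    return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]


def dbscan(points, eps, min_pts):
    """Label each point with a cluster id (0, 1, ...) or NOISE."""
    labels = [UNVISITED] * len(points)
    cluster = 0
    for i in range(len(points)):
        if labels[i] is not UNVISITED:
            continue
        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_pts:
            labels[i] = NOISE  # may later be relabeled as a border point
            continue
        labels[i] = cluster
        queue = deque(neighbors)
        while queue:
            j = queue.popleft()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point: joins the cluster, not expanded
            if labels[j] is not UNVISITED:
                continue
            labels[j] = cluster
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_pts:
                queue.extend(j_neighbors)  # core point: keep expanding the cluster
        cluster += 1
    return labels


points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10), (50, 50)]
labels = dbscan(points, eps=1.5, min_pts=3)
print(labels)  # [0, 0, 0, 1, 1, 1, -1]
```

Each call to `region_query` scans every point independently of the others, which makes the neighbor search a natural candidate for parallelization on a GPU. How the paper actually maps the work onto CUDA threads and shared memory is not stated in the abstract.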