Source author record

Xin Fu

Xin Fu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

math.AT; math.DG; Artificial Intelligence; Computer Vision; cond-mat.dis-nn; Distributed, Parallel, and Cluster Computing; eess.IV; Emerging Technologies; Graphics; Machine Learning; math.AP; math.CO; math.RT; physics.optics

Catalog footprint

What is connected

9 works

14 topics

4 close collaborators

preprint · 2026 · arXiv

Variation of Kahler-Einstein metrics with mixed singularities

In this short note, we consider a fibration f: (X, Delta) to Y between two compact Kahler manifolds whose generic fiber is a smooth log canonical pair with ample canonical divisor. We prove that the current induced by the variation of Kahler-Einstein metrics with mixed cone and Poincare singularities is positive, generalizing the result of Schumacher in the smooth case [22] and the result of Guenancia in the conic case [14]. As an application, we prove the surjectivity of the Albanese map for a smooth log canonical pair with -(K_X + Delta) nef.

preprint · 2022 · arXiv

Efficient Federated Learning for AIoT Applications Using Knowledge Distillation

As a promising distributed machine learning paradigm, Federated Learning (FL) trains a central model with decentralized data without compromising user privacy, which has made it widely used by Artificial Intelligence Internet of Things (AIoT) applications. However, the traditional FL suffers from model inaccuracy since it trains local models using hard labels of data and ignores useful information of incorrect predictions with small probabilities. Although various solutions try to tackle the bottleneck of the traditional FL, most of them introduce significant communication and memory overhead, making the deployment of large-scale AIoT devices a great challenge. To address the above problem, this paper presents a novel Distillation-based Federated Learning (DFL) architecture that enables efficient and accurate FL for AIoT applications. Inspired by Knowledge Distillation (KD) that can increase the model accuracy, our approach adds the soft targets used by KD to the FL model training, which occupies negligible network resources. The soft targets are generated by local sample predictions of each AIoT device after each round of local training and used for the next round of model training. During the local training of DFL, both soft targets and hard labels are used as approximation objectives of model predictions to improve model accuracy by supplementing the knowledge of soft targets. To further improve the performance of our DFL model, we design a dynamic adjustment strategy for tuning the ratio of two loss functions used in KD, which can maximize the use of both soft targets and hard labels. Comprehensive experimental results on well-known benchmarks show that our approach can significantly improve the model accuracy of FL with both Independent and Identically Distributed (IID) and non-IID data.
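The abstract above describes training on both hard labels and soft targets, with an adjustable ratio between the two loss terms. The following is a minimal sketch of such a combined loss, not the authors' implementation. The function names, the `temperature` parameter, and the linear `linear_alpha` schedule are assumptions introduced here for illustration; the abstract does not specify the adjustment strategy.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax with an optional temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(target_probs, predicted_probs, eps=1e-12):
    """Cross-entropy between a target distribution and a prediction."""
    return -sum(t * math.log(p + eps) for t, p in zip(target_probs, predicted_probs))


def distillation_loss(logits, hard_label, soft_targets, alpha, temperature=1.0):
    """Weighted sum of a hard-label term and a soft-target term.

    alpha weights the hard-label loss and (1 - alpha) the soft-target loss.
    soft_targets stands for the predictions saved from the previous round.
    """
    one_hot = [1.0 if i == hard_label else 0.0 for i in range(len(logits))]
    hard_term = cross_entropy(one_hot, softmax(logits))
    soft_term = cross_entropy(soft_targets, softmax(logits, temperature))
    return alpha * hard_term + (1.0 - alpha) * soft_term


def linear_alpha(round_index, total_rounds, start=0.9, end=0.5):
    """Hypothetical schedule: move alpha linearly from start to end over rounds."""
    fraction = round_index / max(total_rounds - 1, 1)
    return start + (end - start) * fraction


# Example: one sample, three classes, soft targets from an earlier round.
previous_round_logits = [1.5, 0.8, -0.3]
soft = softmax(previous_round_logits, temperature=2.0)
loss = distillation_loss([2.0, 0.5, -1.0], hard_label=0, soft_targets=soft,
                         alpha=linear_alpha(3, 10), temperature=2.0)
print(f"combined loss: {loss:.4f}")
```

Setting `alpha` to 1.0 recovers plain hard-label training, so the soft-target term can be read as an add-on to the usual local objective.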

preprint · 2022 · arXiv

Just Noticeable Difference for Deep Machine Vision

As an important perceptual characteristic of the Human Visual System (HVS), the Just Noticeable Difference (JND) has been studied for decades in image and video processing (e.g., perceptual visual signal compression). However, there has been little exploration of the existence of a JND for Deep Machine Vision (DMV), although DMV has made great strides in many machine vision tasks. In this paper, we make an initial attempt and demonstrate that DMV has a JND, termed the DMV-JND. We then propose a JND model for the image classification task in DMV. We find that DMV can tolerate distorted images with an average PSNR of only 9.56dB (the lower the better), by generating the JND via unsupervised learning with the proposed DMV-JND-NET. In particular, a semantic-guided redundancy assessment strategy is designed to restrain the magnitude and spatial distribution of the DMV-JND. Experimental results on image classification demonstrate that we successfully find the JND for deep machine vision. Our DMV-JND suggests a possible direction for DMV-oriented image and video compression, watermarking, quality assessment, deep neural network security, and so on.
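For readers unfamiliar with the PSNR figure quoted above, here is a minimal sketch of the standard peak signal-to-noise ratio, computed over flat lists of pixel values. The function name `psnr` and the 8-bit default `max_value=255.0` are assumptions for illustration; the abstract does not state how the authors computed it.

```python
import math


def psnr(reference, distorted, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists."""
    if not reference or len(reference) != len(distorted):
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error between the two signals.
    mse = sum((r - d) ** 2 for r, d in zip(reference, distorted)) / len(reference)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)


# Example: a uniform offset of 10 grey levels on a tiny 4-pixel "image".
original = [100, 100, 100, 100]
shifted = [110, 110, 110, 110]
print(f"PSNR: {psnr(original, shifted):.2f} dB")
```

A lower PSNR means a larger distortion, which is why the abstract treats a low value as evidence of how much perturbation the classifier tolerates.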

preprint · 2022 · arXiv

Uniform convergence for linear elastostatic systems with periodic high contrast inclusions

We consider the Lame system of linear elasticity with periodically distributed inclusions whose elastic parameters have high contrast compared to the background medium. We develop a unified method based on layer potential techniques to quantify three convergence results when some parameters of the elastic inclusions are sent to extreme values. More precisely, we study the incompressible inclusions limit where the bulk modulus of the inclusions tends to infinity, the soft inclusions limit where both the bulk modulus and the shear modulus tend to zero, and the hard inclusions limit where the shear modulus tends to infinity. Our method yields convergence rates that are independent of the periodicity of the inclusion array and are sharper than some earlier results of this type. A key ingredient of the proof is the establishment of uniform spectral gaps, independent of the periodicity, for the elastic Neumann-Poincare operator associated with the collection of periodic inclusions.

preprint · 2022 · arXiv

Uniqueness of Tangent Cone of Kahler Einstein Metrics on Singular Varieties with Crepant Singularities

Let $(X, L)$ be a polarized Calabi-Yau variety (or canonically polarized variety) with crepant singularities. Suppose $ω_{KE} \in c_1(L)$ (or $ω_{KE} \in c_1(K_X)$) is the unique Ricci-flat current (or Kahler-Einstein current with negative scalar curvature) with locally bounded potential constructed in [18]. We show that the local tangent cone of the metric $ω_{KE}$ at any point $p \in X$ is unique.

preprint · 2021 · arXiv

Model category structures on multicomplexes

We present a family of model structures on the category of multicomplexes. There is a cofibrantly generated model structure in which the weak equivalences are the morphisms inducing an isomorphism at a fixed stage of an associated spectral sequence. Corresponding model structures are given for truncated versions of multicomplexes, interpolating between bicomplexes and multicomplexes. For a fixed stage of the spectral sequence, the model structures on all these categories are shown to be Quillen equivalent.

preprint · 2020 · arXiv

OO-VR: NUMA Friendly Object-Oriented VR Rendering Framework For Future NUMA-Based Multi-GPU Systems

With their strong computational capability, NUMA-based multi-GPU systems are a promising candidate to provide sustainable and scalable performance for Virtual Reality. However, the entire multi-GPU system is viewed as a single GPU, which ignores the data locality in VR rendering during workload distribution and leads to a large number of remote memory accesses among GPU modules (GPMs). By conducting comprehensive characterizations of different kinds of parallel rendering frameworks, we observe that distributing each rendering object along with its required data per GPM can reduce inter-GPM memory accesses. However, this object-level rendering still faces two major challenges in NUMA-based multi-GPU systems: (1) the large data locality between the left and right views of the same object and the data sharing among different objects, and (2) the unbalanced workloads induced by the software-level distribution and composition mechanisms. To tackle these challenges, we propose an object-oriented VR rendering framework (OO-VR) that conducts software and hardware co-optimization to provide a NUMA-friendly solution for VR multi-view rendering in NUMA-based multi-GPU systems. We first propose an object-oriented VR programming model to exploit the data sharing between two views of the same object and group objects into batches based on their texture sharing levels. Then, we design an object-aware runtime batch distribution engine and a distributed hardware composition unit to achieve balanced workloads among GPMs. Finally, evaluations on our VR-featured simulator show that OO-VR provides a 1.58x overall performance improvement and a 76% inter-GPM memory traffic reduction over state-of-the-art multi-GPU systems. In addition, OO-VR provides NUMA-friendly performance scalability for future larger multi-GPU scenarios with ever-increasing asymmetric bandwidth between local and remote memory.

preprint · 2020 · arXiv

Parallel convolution processing using an integrated photonic tensor core

With the proliferation of ultra-high-speed mobile networks and internet-connected devices, along with the rise of artificial intelligence, the world is generating exponentially increasing amounts of data that need to be processed in a fast, efficient and smart way. These developments are pushing the limits of existing computing paradigms, and highly parallelized, fast and scalable hardware concepts are becoming progressively more important. Here, we demonstrate a computationally specific integrated photonic tensor core, the optical analog of an ASIC, capable of operating at Tera-Multiply-Accumulate per second (TMAC/s) speeds. The photonic core achieves parallelized photonic in-memory computing using phase-change memory arrays and photonic chip-based optical frequency combs (soliton microcombs). The computation is reduced to measuring the optical transmission of reconfigurable and non-resonant passive components and can operate at a bandwidth exceeding 14 GHz, limited only by the speed of the modulators and photodetectors. Given recent advances in hybrid integration of soliton microcombs at microwave line rates, ultra-low loss silicon nitride waveguides, and high speed on-chip detectors and modulators, our approach provides a path towards full CMOS wafer-scale integration of the photonic tensor core. While we focus on convolution processing, more generally our results indicate the major potential of integrated photonics for parallel, fast, and efficient computational hardware in demanding AI applications such as autonomous driving, live video processing, and next generation cloud computing services.

preprint · 2019 · arXiv

Simplicial $G$-complexes and representation stability of polyhedral products

Representation stability in the sense of Church-Farb is concerned with stable properties of representations of sequences of algebraic structures, in particular of groups. We study this notion on objects arising in toric topology. With a simplicial $G$-complex $K$ and a topological pair $(X, A)$, a $G$-polyhedral product $(X, A)^K$ is associated. We show that the homotopy decomposition [2] of $Σ(X, A)^K$ is then $G$-equivariant after suspension. In the case of $Σ_m$-polyhedral products, we give criteria on simplicial $Σ_m$-complexes which imply representation stability of $Σ_m$-representations $\{H_i((X, A)^{K_m})\}$.
