Source author record

Yang Sui

Yang Sui appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence Computer Vision cond-mat.mes-hall Machine Learning

Catalog footprint

What is connected

4works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Temporal Aware Pruning for Efficient Diffusion-based Video Generation

Video diffusion models have recently enabled high-quality video generation with ViT-based architectures, but remain computationally intensive because generation requires attention computation over long spatiotemporal sequences. Token pruning has proven effective for ViTs and VLMs. However, most prior pruning methods are attention-based and operate per frame, failing to ensure the vital temporal coherence across frames in video generation tasks. In practice, naively adopting attention-only pruning causes noticeable degradation due to worsened background consistency, flickering, and reduced image quality. To address this, we propose TAPE, a training-free Temporal Aware Pruning for Efficient diffusion-based video generation. TAPE (i) applies temporal smoothing to align token-importance across adjacent frames and suppress selection jitter; and (ii) performs token reselection in selected layers to align token pruning with layers' diverse semantic focus and avoid error accumulation in specific areas; it also (iii) adopt a timestep-level budget scheduling that prunes aggressively at early noisy steps and relaxes pruning during fidelity-critical refinement. The experimental results show that TAPE delivers significant speedups while preserving high visual fidelity, outperforming prior token reduction approaches.

preprint2022arXiv

CHIP: CHannel Independence-based Pruning for Compact Neural Networks

Filter pruning has been widely used for neural network compression because of its enabled practical acceleration. To date, most of the existing filter pruning works explore the importance of filters via using intra-channel information. In this paper, starting from an inter-channel perspective, we propose to perform efficient filter pruning using Channel Independence, a metric that measures the correlations among different feature maps. The less independent feature map is interpreted as containing less useful information$/$knowledge, and hence its corresponding filter can be pruned without affecting model capacity. We systematically investigate the quantification metric, measuring scheme and sensitiveness$/$reliability of channel independence in the context of filter pruning. Our evaluation results for different models on various datasets show the superior performance of our approach. Notably, on CIFAR-10 dataset our solution can bring $0.90\%$ and $0.94\%$ accuracy increase over baseline ResNet-56 and ResNet-110 models, respectively, and meanwhile the model size and FLOPs are reduced by $42.8\%$ and $47.4\%$ (for ResNet-56) and $48.3\%$ and $52.1\%$ (for ResNet-110), respectively. On ImageNet dataset, our approach can achieve $40.8\%$ and $44.8\%$ storage and computation reductions, respectively, with $0.15\%$ accuracy increase over the baseline ResNet-50 model. The code is available at https://github.com/Eclipsess/CHIP_NeurIPS2021.

preprint2011arXiv

Signatures of disorder in the minimum conductivity of graphene

Graphene has been proposed as a promising material for future nanoelectronics because of its unique electronic properties. Understanding the scaling behavior of this new nanomaterial under common experimental conditions is of critical importance for developing graphene-based nanoscale devices. We present a comprehensive experimental and theoretical study on the influence of edge disorder and bulk disorder on the minimum conductivity of graphene ribbons. For the first time, we discovered a strong non-monotonic size scaling behavior featuring a peak and saturation minimum conductivity. Through extensive numerical simulations and analysis, we are able to attribute these features to the amount of edge and bulk disorder in graphene devices. This study elucidates the quantum transport mechanisms in realistic experimental graphene systems, which can be used as a guideline for designing graphene-based nanoscale devices with improved performance.

preprint2011arXiv

Substrate Gating of Contact Resistance in Graphene Transistors

Metal contacts have been identified to be a key technological bottleneck for the realization of viable graphene electronics. Recently, it was observed that for structures that possess both a top and a bottom gate, the electron-hole conductance asymmetry can be modulated by the bottom gate. In this letter, we explain this observation by postulating the presence of an effective thin interfacial dielectric layer between the metal contact and the underlying graphene. Electrical results from quantum transport calculations accounting for this modified electrostatics corroborate well with the experimentally measured contact resistances. Our study indicates that the engineering of metal- graphene interface is a crucial step towards reducing the contact resistance for high performance graphene transistors.

Yang Sui

What is connected

Connect this record

See the researcher in context

Building this map preview

4 published item(s)

Temporal Aware Pruning for Efficient Diffusion-based Video Generation

CHIP: CHannel Independence-based Pruning for Compact Neural Networks

Signatures of disorder in the minimum conductivity of graphene

Substrate Gating of Contact Resistance in Graphene Transistors