Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
20 works
0 followers
13 topics
4 close collaborators

Published work

20 published items

preprint 2025 arXiv

AudioFab: Building A General and Intelligent Audio Factory through Tool Learning

Currently, artificial intelligence is profoundly transforming the audio domain; however, numerous advanced algorithms and tools remain fragmented, lacking a unified and efficient framework to unlock their full potential. Existing audio agent frameworks often suffer from complex environment configurations and inefficient tool collaboration. To address these limitations, we introduce AudioFab, an open-source agent framework aimed at establishing an open and intelligent audio-processing ecosystem. Compared to existing solutions, AudioFab's modular design resolves dependency conflicts, simplifying tool integration and extension. It also optimizes tool learning through intelligent selection and few-shot learning, improving efficiency and accuracy in complex audio tasks. Furthermore, AudioFab provides a user-friendly natural language interface tailored for non-expert users. As a foundational framework, AudioFab's core contribution lies in offering a stable and extensible platform for future research and development in audio and multimodal AI. The code is available at https://github.com/SmileHnu/AudioFab.

preprint 2024 arXiv

Free Lunch for Federated Remote Sensing Target Fine-Grained Classification: A Parameter-Efficient Framework

Remote Sensing Target Fine-grained Classification (TFGC) is of great significance in both military and civilian fields. Due to location differences, growth in data size, and centralized server storage constraints, these data are usually stored under different databases across regions/countries. However, privacy laws and national security concerns constrain researchers from accessing these sensitive remote sensing images for further analysis. Additionally, low-resource remote sensing devices encounter challenges in terms of communication overhead and efficiency when dealing with the ever-increasing data and model scales. To solve the above challenges, this paper proposes a novel Privacy-Reserving TFGC Framework based on Federated Learning, dubbed PRFL. The proposed framework allows each client to learn global and local knowledge to enhance the local representation of private data in environments with extreme statistical heterogeneity (non-Independent and Identically Distributed, non-IID). Thus, it provides highly customized models to clients with differentiated data distributions. Moreover, the framework minimizes communication overhead and improves efficiency while ensuring satisfactory performance, thereby enhancing robustness and practical applicability under resource-scarce conditions. We demonstrate the effectiveness of the proposed PRFL on the classical TFGC task by leveraging four public datasets.

preprint 2023 arXiv

Mask-then-Fill: A Flexible and Effective Data Augmentation Framework for Event Extraction

We present Mask-then-Fill, a flexible and effective data augmentation framework for event extraction. Our approach allows for more flexible manipulation of text and thus can generate more diverse data while keeping the original event structure unchanged as much as possible. Specifically, it first randomly masks out an adjunct sentence fragment and then infills a variable-length text span with a fine-tuned infilling model. The main advantage lies in that it can replace a fragment of arbitrary length in the text with another fragment of variable length, compared to the existing methods which can only replace a single word or a fixed-length fragment. On trigger and argument extraction tasks, the proposed framework is more effective than baseline methods and it demonstrates particularly strong results in the low-resource setting. Our further analysis shows that it achieves a good balance between diversity and distributional similarity.
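The mask-then-infill step described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes that trigger and argument tokens are given as a set of protected indices, that every other token counts as an adjunct, and that the fine-tuned infilling model is available as a callable. The names `MASK`, `unprotected_spans`, `mask_then_fill` and `infill` are all hypothetical.

```python
import random

MASK = "<mask>"  # hypothetical placeholder; a real infilling model may use another symbol


def unprotected_spans(tokens, protected):
    """Return (start, end) pairs for maximal runs of tokens whose index is not protected."""
    spans, start = [], None
    for i in range(len(tokens) + 1):
        free = i < len(tokens) and i not in protected
        if free and start is None:
            start = i
        elif not free and start is not None:
            spans.append((start, i))
            start = None
    return spans


def mask_then_fill(tokens, protected, infill, rng=None):
    """Mask one random unprotected span, then replace it with the infill output.

    `infill` is an assumed callable that takes the masked token list and
    returns a list of replacement tokens of any length.
    """
    rng = rng or random.Random()
    spans = unprotected_spans(tokens, protected)
    if not spans:
        return list(tokens)  # nothing can be masked safely
    start, end = rng.choice(spans)
    masked = tokens[:start] + [MASK] + tokens[end:]
    filler = infill(masked)
    return tokens[:start] + list(filler) + tokens[end:]
```

For example, with the tokens of "The army attacked the village early on Monday", protected indices {1, 2, 4} and a stub `infill` that returns ["at", "dawn"], the trigger and argument tokens survive while one adjunct span is replaced by a span of different length.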

preprint 2022 arXiv

A General End-to-end Diagnosis Framework for Manufacturing Systems

The manufacturing sector is envisioned to be heavily influenced by artificial intelligence-based technologies with the extraordinary increases in computational power and data volumes. A central challenge in the manufacturing sector is the need for a general framework that ensures satisfactory diagnosis and monitoring performance across different manufacturing applications. Here we propose a general data-driven, end-to-end framework for the monitoring of manufacturing systems. This framework, derived from deep learning techniques, evaluates fused sensory measurements to detect and even predict faults and wearing conditions. This work exploits the predictive power of deep learning to automatically extract hidden degradation features from noisy, time-course data. We have evaluated the proposed framework on ten representative datasets drawn from a wide variety of manufacturing applications. Results reveal that the framework performs well in the examined benchmark applications and can be applied in diverse contexts, indicating its potential use as a critical cornerstone in smart manufacturing.

preprint 2022 arXiv

Bridging the Gap of AutoGraph between Academia and Industry: Analysing AutoGraph Challenge at KDD Cup 2020

Graph structured data is ubiquitous in daily life and scientific areas and has attracted increasing attention. Graph Neural Networks (GNNs) have proven effective in modeling graph structured data, and many variants of GNN architectures have been proposed. However, much human effort is often needed to tune the architecture depending on different datasets. Researchers naturally adopt Automated Machine Learning on Graph Learning, aiming to reduce the human effort and achieve generally top-performing GNNs, but their methods focus more on the architecture search. To understand GNN practitioners' automated solutions, we organized the AutoGraph Challenge at KDD Cup 2020, emphasizing automated graph neural networks for node classification. We received top solutions, especially from industrial tech companies like Meituan, Alibaba and Twitter, which are already open sourced on GitHub. After detailed comparisons with solutions from academia, we quantify the gaps between academia and industry on modeling scope, effectiveness and efficiency, and show that (1) academic AutoML for Graph solutions focus on GNN architecture search, while industrial solutions, especially the winning ones in the KDD Cup, tend to obtain an overall solution; (2) by neural architecture search only, academic solutions achieve on average 97.3% of the accuracy of industrial solutions; (3) academic solutions are cheap to obtain with several GPU hours, while industrial solutions take a few months of labor. Academic solutions also contain far fewer parameters.

preprint 2022 arXiv

Codabench: Flexible, Easy-to-Use and Reproducible Benchmarking Platform

Obtaining a standardized crowdsourced benchmark of computational methods is a major issue in data science communities. Dedicated frameworks enabling fair benchmarking in a unified environment are yet to be developed. Here we introduce Codabench, an open-source, community-driven platform for benchmarking algorithms or software agents versus datasets or tasks. A public instance of Codabench (https://www.codabench.org/) is open to everyone, free of charge, and allows benchmark organizers to fairly compare submissions under the same setting (software, hardware, data, algorithms), with custom protocols and data formats. Codabench has unique features facilitating the organization of benchmarks flexibly, easily and reproducibly, such as the possibility of re-using templates of benchmarks, and supplying compute resources on-demand. Codabench has been used internally and externally on various applications, attracting more than 130 users and 2500 submissions. As illustrative use cases, we introduce 4 diverse benchmarks covering Graph Machine Learning, Cancer Heterogeneity, Clinical Diagnosis and Reinforcement Learning.

preprint 2022 arXiv

Designing the Topology of Graph Neural Networks: A Novel Feature Fusion Perspective

In recent years, Graph Neural Networks (GNNs) have shown superior performance on diverse real-world applications. To improve the model capacity, besides designing aggregation operations, GNN topology design is also very important. In general, there are two mainstream approaches to GNN topology design. The first is to stack aggregation operations to obtain higher-level features, but this easily leads to a performance drop as the network goes deeper. The second is to use multiple aggregation operations in each layer, which provides adequate and independent feature extraction on local neighbors but makes higher-level information costly to obtain. To enjoy the benefits while alleviating the corresponding deficiencies of these two approaches, we learn to design the topology of GNNs from a novel feature fusion perspective, which is dubbed F²GNN. To be specific, we provide a feature fusion perspective in designing GNN topology and propose a novel framework to unify the existing topology designs with feature selection and fusion strategies. Then we develop a neural architecture search method on top of the unified framework, which contains a set of selection and fusion operations in the search space and an improved differentiable search algorithm. The performance gains on eight real-world datasets demonstrate the effectiveness of F²GNN. We further conduct experiments to show that F²GNN can improve the model capacity while alleviating the deficiencies of existing GNN topology design approaches, especially the over-smoothing problem, by utilizing different levels of features adaptively.

preprint 2022 arXiv

Graph Neural Networks for Double-Strand DNA Breaks Prediction

Double-strand DNA breaks (DSBs) are a form of DNA damage that can cause abnormal chromosomal rearrangements. Recent technologies based on high-throughput experiments have high costs and technical challenges. Therefore, we design a graph neural network based method to predict DSBs (GraphDSB), using DNA sequence features and chromosome structure information. In order to improve the expressive ability of the model, we introduce the Jumping Knowledge architecture and several effective structural encoding methods. The contribution of structural information to the prediction of DSBs is verified by experiments on datasets from normal human epidermal keratinocytes (NHEK) and a chronic myeloid leukemia cell line (K562), and the ablation studies further demonstrate the effectiveness of the designed components in the proposed GraphDSB framework. Finally, we use GNNExplainer to analyze the contribution of node features and topology to DSB prediction, and show the high contribution of 5-mer DNA sequence features and two chromatin interaction modes.

preprint 2022 arXiv

Graph Property Prediction on Open Graph Benchmark: A Winning Solution by Graph Neural Architecture Search

Targeting two molecular graph datasets and one protein association subgraph dataset in the OGB graph classification task, we design a graph neural network framework for graph classification by introducing PAS (Pooling Architecture Search). We further improve it using the GNN topology design method F2GNN to design feature selection and fusion strategies, improving model performance on the graph property prediction task while overcoming the over-smoothing problem of deep GNN training. Finally, a performance breakthrough is achieved on these three datasets, significantly outperforming other methods with fixed aggregation functions. The results show that the NAS method generalizes well across multiple tasks and that our method is advantageous for graph property prediction tasks.

preprint 2022 arXiv

Hydrogen trapping and embrittlement in high-strength Al-alloys

Ever more stringent regulations on greenhouse gas emissions from transportation motivate efforts to revisit materials used for vehicles. High-strength Al-alloys often used in aircraft could help reduce the weight of automobiles, but are susceptible to environmental degradation. Hydrogen (H) "embrittlement" is often pointed to as the main culprit; however, the mechanisms underpinning failure are elusive: atomic-scale analysis of H inside an alloy remains a challenge, and this prevents deploying alloy design strategies to enhance the materials' durability. Here we successfully performed near-atomic scale analysis of H trapped in second-phase particles and at grain boundaries in a high-strength 7xxx Al-alloy. We used these observations to guide atomistic ab-initio calculations which show that the co-segregation of alloying elements and H favours grain boundary decohesion, while the strong partitioning of H into the second-phases removes solute H from the matrix, hence preventing H-embrittlement. Our insights further advance the mechanistic understanding of H-assisted embrittlement in Al-alloys, emphasizing the role of H-traps in retarding cracking and guiding new alloy design.

preprint 2022 arXiv

Improving Event Representation via Simultaneous Weakly Supervised Contrastive Learning and Clustering

Representations of events described in text are important for various tasks. In this work, we present SWCC: a Simultaneous Weakly supervised Contrastive learning and Clustering framework for event representation learning. SWCC learns event representations by making better use of co-occurrence information of events. Specifically, we introduce a weakly supervised contrastive learning method that allows us to consider multiple positives and multiple negatives, and a prototype-based clustering method that avoids semantically related events being pulled apart. For model training, SWCC learns representations by simultaneously performing weakly supervised contrastive learning and prototype-based clustering. Experimental results show that SWCC outperforms other baselines on Hard Similarity and Transitive Sentence Similarity tasks. In addition, a thorough analysis of the prototype-based clustering method demonstrates that the learned prototype vectors are able to implicitly capture various relations between events.
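A contrastive objective that accepts several positives and several negatives per anchor, as this abstract describes, is often written in an InfoNCE-style form averaged over the positives. The sketch below shows that generic form only; it is not claimed to be the exact SWCC loss, and the function names, the cosine similarity and the temperature value are illustrative assumptions.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def multi_positive_contrastive_loss(anchor, positives, negatives, temperature=0.1):
    """Average over positives of -log softmax(sim(anchor, positive) / temperature).

    The softmax denominator runs over all positives and negatives.
    Generic multi-positive form; an assumption, not the paper's exact objective.
    """
    pos_logits = [cosine(anchor, p) / temperature for p in positives]
    neg_logits = [cosine(anchor, n) / temperature for n in negatives]
    all_logits = pos_logits + neg_logits
    # Log-sum-exp with max subtraction for numerical stability.
    peak = max(all_logits)
    log_denominator = peak + math.log(sum(math.exp(x - peak) for x in all_logits))
    return -sum(x - log_denominator for x in pos_logits) / len(pos_logits)
```

The loss is small when every positive is more similar to the anchor than the negatives are, and it grows when the roles are swapped.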

preprint 2022 arXiv

Manipulating Interlayer Excitons for Ultra-pure Near-infrared Quantum Light Generation

Interlayer excitons (IXs) formed at the interface of atomically-thin semiconductors possess various novel properties. In a parallel development, nanoscale strain engineering has emerged as an effective means for creating 2D quantum light sources. Exploring the intersection of these two exciting areas, where strain and defects are exploited for the manipulation of IXs toward the emergence of new functionalities, is currently at a nascent stage. Here, using a MoS2/WSe2 heterostructure as a model system, we demonstrate how strain, defects, and layering can be utilized to create defect-bound IXs capable of bright, robust, and tunable quantum light emission in the technologically important near-infrared spectral range. We were able to achieve ultra-high single-photon purity with g(2)(0) = 0.01. Our strategy of creating site-controlled QEs from the defect-bound IXs represents a paradigm shift in 2D quantum photonics research, from engineering intralayer excitons in monolayer structures towards IXs at the interface of 2D heterostructures.

preprint 2022 arXiv

Proximity Induced Chiral Quantum Light Generation in Strain-Engineered WSe2/NiPS3 Heterostructures

Quantum light emitters (QEs) capable of generating single photons of well-defined circular polarization could enable non-reciprocal single photon devices and deterministic spin-photon interfaces critical for realizing complex quantum networks. To date, emission of such chiral quantum light has been achieved via the application of an intense external magnetic field, electrical/optical injection of spin-polarized carriers/excitons, or coupling with complex photonic/meta-structures. Here we report free-space generation of highly chiral single photons from QEs created in monolayer WSe2 - NiPS3 heterostructures at zero external magnetic field. These QEs emit in the 760-800 nm range with a degree of circular polarization and single photon purity as high as 0.71 and 80% respectively, independent of pump laser polarization. QEs are deterministically created by pressing a scanning probe microscope tip into a two-dimensional heterostructure comprising a WSe2 monolayer and a ~50 nm thick layer of the antiferromagnetic (AFM) insulator NiPS3. Temperature dependent magneto-photoluminescence studies indicate that the chiral quantum light emission arises from magnetic proximity interactions between localized excitons in the WSe2 monolayer and the out-of-plane magnetization of AFM defects in NiPS3, both of which are co-localized by the strain field arising from the nanoscale indentations.

preprint 2022 arXiv

Revisiting stress-corrosion cracking and hydrogen embrittlement in 7xxx-Al alloys at the near-atomic-scale

Hydrogen embrittlement (HE) affects all major high-strength structural materials and as such is a major impediment to lightweighting, e.g., vehicles, which would help reduce carbon emissions and reach net zero. The high-strength 7xxx series aluminium alloys can fulfil the need for light, high strength materials, and are already extensively used in aerospace for weight reduction purposes. However, depending on the thermomechanical and loading state, these alloys can be sensitive to stress-corrosion cracking (SCC) through anodic dissolution and hydrogen embrittlement. Here, we study at the near-atomic-scale the intra- and inter-granular microstructure ahead and in the wake of a propagating SCC crack. Moving away from model cases not strictly relevant to application, we performed an industry-standard test on an engineering Al-7XXX alloy. H is found segregated to planar arrays of dislocations and to grain boundaries that we can associate with the combined effects of hydrogen-enhanced localized plasticity (HELP) and hydrogen-enhanced decohesion (HEDE) mechanisms. We report on a Mg-rich H-rich amorphous oxide on the corroded crack surface and evidence of Mg-related diffusional processes leading to dissolution of the strengthening eta-phase precipitates ahead of the crack. We show ingress of up to 1 at% O, i.e. well above the solubility limit of O in Al, near the oxide-metal interface, while no increased level of H is found in the matrix. We provide an array of discussion points on the interplay of structural defects and the transport of solutes, which changes the resistance against crack propagation; these points have been overlooked across the SCC literature and prevent accurate service life predictions.

preprint 2022 arXiv

Space4HGNN: A Novel, Modularized and Reproducible Platform to Evaluate Heterogeneous Graph Neural Network

Heterogeneous Graph Neural Network (HGNN) has been successfully employed in various tasks, but we cannot accurately know the importance of different design dimensions of HGNNs due to diverse architectures and applied scenarios. Besides, in the research community of HGNNs, implementing and evaluating various tasks still need much human effort. To mitigate these issues, we first propose a unified framework covering most HGNNs, consisting of three components: heterogeneous linear transformation, heterogeneous graph transformation, and heterogeneous message passing layer. Then we build a platform Space4HGNN by defining a design space for HGNNs based on the unified framework, which offers modularized components, reproducible implementations, and standardized evaluation for HGNNs. Finally, we conduct experiments to analyze the effect of different designs. With the insights found, we distill a condensed design space and verify its effectiveness.

preprint 2022 arXiv

Understanding the degradation of a model Si-anode in Li-ion battery at the atomic-scale

Si-anodes have long been candidates thanks to an expected ten-fold increase in capacity compared to graphite. However, details of the mechanisms governing their degradation remain elusive, hindering science-guided development of long-lived Si-based anodes. Here we demonstrate how the latest developments in cryo-atom probe tomography enable the in-depth analysis of the electrode and electrolyte, and their interface at atomic-level.

preprint 2021 arXiv

Nondestructive Testing of Composite Fibre Materials with Hyperspectral Imaging: Evaluative Studies in the EU H2020 FibreEUse Project

Through capturing spectral data from a wide frequency range along with the spatial information, hyperspectral imaging (HSI) can detect minor differences in terms of temperature, moisture and chemical composition. Therefore, HSI has been successfully applied in various applications, including remote sensing for security and defense, precision agriculture for vegetation and crop monitoring, food/drink, and pharmaceuticals quality control. However, for condition monitoring and damage detection in carbon fibre reinforced polymer (CFRP), the use of HSI is a relatively untouched area, as existing non-destructive testing (NDT) techniques focus mainly on delivering information about physical integrity of structures but not on material composition. To this end, HSI can provide a unique way to tackle this challenge. In this paper, with the use of a near-infrared HSI camera, applications of HSI for the non-destructive inspection of CFRP products are introduced, taking the EU H2020 FibreEUse project as the background. Technical challenges and solutions on three case studies are presented in detail, including adhesive residues detection, surface damage detection and Cobot based automated inspection. Experimental results have fully demonstrated the great potential of HSI and related vision techniques for NDT of CFRP, especially the potential to satisfy the industrial manufacturing environment.

preprint 2021 arXiv

Site-Controlled Telecom Single-Photon Emitters in Atomically-thin MoTe2

Quantum emitters (QEs) in two-dimensional transition metal dichalcogenides (2D TMDCs) have advanced to the forefront of quantum communication and transduction research due to their unique potentials in accessing valley pseudo-spin degree of freedom (DOF) and facile integration into quantum-photonic, electronic and sensing platforms via the layer-by-layer-assembly approach. To date, QEs capable of operating in O-C telecommunication bands have not been demonstrated in TMDCs. Here we report a deterministic creation of such telecom QEs emitting over the 1080 to 1550 nm wavelength range via coupling of 2D molybdenum ditelluride (MoTe2) to strain inducing nano-pillar arrays. Our Hanbury Brown and Twiss experiment conducted at 10 K reveals clear photon antibunching with 90% single photon purity. Ultra-long lifetimes, 4-6 orders of magnitude longer than that of the 2D exciton, are also observed. Polarization analysis further reveals that while some QEs display cross-linearly polarized doublets with ~1 meV splitting resulting from the strain induced anisotropic exchange interaction, valley degeneracy is preserved in other QEs. Valley Zeeman splitting as well as restoring of valley symmetry in cross-polarized doublets are observed under 8T magnetic field. In contrast to other telecom QEs, our QEs which offer the potential to access valley DOF through single photons, could lead to unprecedented advantages in optical fiber-based quantum networks.

preprint 2020 arXiv

Simplifying Architecture Search for Graph Neural Network

Recent years have witnessed the popularity of Graph Neural Networks (GNNs) in various scenarios. To obtain optimal data-specific GNN architectures, researchers turn to neural architecture search (NAS) methods, which have made impressive progress in discovering effective architectures in convolutional neural networks. Two preliminary works, GraphNAS and Auto-GNN, have made a first attempt to apply NAS methods to GNNs. Despite the promising results, there are several drawbacks in the expressive capability and search efficiency of GraphNAS and Auto-GNN due to the designed search space. To overcome these drawbacks, we propose the SNAG framework (Simplified Neural Architecture search for Graph neural networks), consisting of a novel search space and a reinforcement learning based search algorithm. Extensive experiments on real-world datasets demonstrate the effectiveness of the SNAG framework compared to human-designed GNNs and NAS methods, including GraphNAS and Auto-GNN.

preprint 2020 arXiv

Vertex-reinforced Random Walk for Network Embedding

In this paper, we study the fundamental problem of random walk for network embedding. We propose to use non-Markovian random walks, namely variants of vertex-reinforced random walk (VRRW), to fully use the history of a random walk path. To solve the getting-stuck problem of VRRW, we introduce an exploitation-exploration mechanism to help the random walk jump out of the stuck set. The new random walk algorithms share the same convergence property as VRRW and thus can be used to learn stable network embeddings. Experimental results on two link prediction benchmark datasets and three node classification benchmark datasets show that our proposed approach, reinforce2vec, can outperform state-of-the-art random walk based embedding methods by a large margin.
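As a rough illustration of the idea in this abstract, a vertex-reinforced random walk picks the next vertex with probability proportional to how often that vertex has already been visited, so the step depends on the whole path history. The sketch below uses the common "1 + visit count" weighting. Its `explore_prob` parameter, which occasionally takes a uniform step to leave a stuck set, is an assumed epsilon-style stand-in and not the paper's exploitation-exploration mechanism; the function name `vrrw_walk` is also hypothetical.

```python
import random
from collections import Counter


def vrrw_walk(adj, start, length, explore_prob=0.0, rng=None):
    """Sample one vertex-reinforced random walk from an adjacency dict.

    adj maps each vertex to a list of neighbours. Each step weights a
    neighbour by 1 + (number of earlier visits to it). With probability
    explore_prob the step is uniform over neighbours instead; this
    exploration rule is an illustrative assumption.
    """
    rng = rng or random.Random()
    visits = Counter({start: 1})
    path = [start]
    for _ in range(length - 1):
        neighbours = adj[path[-1]]
        if not neighbours:
            break  # dead end: stop the walk early
        if rng.random() < explore_prob:
            nxt = rng.choice(neighbours)
        else:
            weights = [1 + visits[v] for v in neighbours]
            nxt = rng.choices(neighbours, weights=weights, k=1)[0]
        visits[nxt] += 1
        path.append(nxt)
    return path
```

Walks sampled this way could then be fed to a skip-gram style embedding trainer in the same manner as other random-walk embedding methods; that downstream step is not shown here.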