Source author record

Min Wu

Min Wu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

52works

36topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

CERSA: Cumulative Energy-Retaining Subspace Adaptation for Memory-Efficient Fine-Tuning

To mitigate the memory constraints associated with fine-tuning large pre-trained models, existing parameter-efficient fine-tuning (PEFT) methods, such as LoRA, rely on low-rank updates. However, such updates fail to fully capture the rank characteristics of the weight modifications observed in full-parameter fine-tuning, resulting in a performance gap. Furthermore, LoRA and other existing PEFT methods still require substantial memory to store the full set of frozen weights, limiting their efficiency in resource-constrained settings. To addres these limitations, we introduce Cumulative Energy-Retaining Subspace Adaptation (CERSA), a novel fine-tuning paradigm that leverages singular value decomposition (SVD) to retain only the principal components responsible for 90% to 95% of the spectral energy. By fine-tuning low-rank representations derived from this principal subspace, CERSA significantly reduces memory consumption. We conduct extensive evaluations of CERSA across models of varying scales and domains, including image recognition, text-to-image generation, and natural language understanding. Empirical results demonstrate that CERSA consistently outperforms state-of-the-art PEFT methods while achieving substantially lower memory requirements. The code will be publicly released.

preprint2026arXiv

Compress Then Adapt? No, Do It Together via Task-aware Union of Subspaces

Adapting large pretrained models to diverse tasks is now routine, yet the two dominant strategies of parameter-efficient fine-tuning (PEFT) and low-rank compression are typically composed in sequence. This decoupled practice first compresses and then fine-tunes adapters, potentially misaligning the compressed subspace with downstream objectives and squandering a global parameter budget. To overcome this limitation, we introduce JACTUS (Joint Adaptation and Compression with a Task-aware Union of Subspaces), a single framework that unifies compression and adaptation. From a small calibration set, JACTUS estimates input and pre-activation gradient covariances, forms their orthogonal union with the pretrained weight subspace, performs a projected low-rank approximation inside this union, allocates rank globally by marginal gain per parameter, and trains only a compact core matrix. This explicitly mitigates the potential misalignment between the compressed subspace and downstream objectives by coupling the directions preserved for compression with those required for adaptation, yielding a deployable low-rank model that avoids retaining full frozen weights while enabling fast and robust tuning. On vision, JACTUS attains an average 89.2% accuracy on ViT-Base across eight datasets at 80% retained parameters, surpassing strong 100% PEFT baselines (e.g., DoRA 87.9%). On language, JACTUS achieves an 80.9% average on Llama2-7B commonsense QA at the same 80% retained-parameter budget, outperforming 100% PEFT (e.g., DoRA 79.7%) and exceeding prior compress-then-finetune pipelines under the same ratained-parameter budget. We will release code.

preprint2022arXiv

A Generalized Probabilistic Monitoring Model with Both Random and Sequential Data

Many multivariate statistical analysis methods and their corresponding probabilistic counterparts have been adopted to develop process monitoring models in recent decades. However, the insightful connections between them have rarely been studied. In this study, a generalized probabilistic monitoring model (GPMM) is developed with both random and sequential data. Since GPMM can be reduced to various probabilistic linear models under specific restrictions, it is adopted to analyze the connections between different monitoring methods. Using expectation maximization (EM) algorithm, the parameters of GPMM are estimated for both random and sequential cases. Based on the obtained model parameters, statistics are designed for monitoring different aspects of the process system. Besides, the distributions of these statistics are rigorously derived and proved, so that the control limits can be calculated accordingly. After that, contribution analysis methods are presented for identifying faulty variables once the process anomalies are detected. Finally, the equivalence between monitoring models based on classical multivariate methods and their corresponding probabilistic graphic models is further investigated. The conclusions of this study are verified using a numerical example and the Tennessee Eastman (TE) process. Experimental results illustrate that the proposed monitoring statistics are subject to their corresponding distributions, and they are equivalent to statistics in classical deterministic models under specific restrictions.

preprint2022arXiv

A Survey on AI Sustainability: Emerging Trends on Learning Algorithms and Research Challenges

Artificial Intelligence (AI) is a fast-growing research and development (R&D) discipline which is attracting increasing attention because of its promises to bring vast benefits for consumers and businesses, with considerable benefits promised in productivity growth and innovation. To date it has reported significant accomplishments in many areas that have been deemed as challenging for machines, ranging from computer vision, natural language processing, audio analysis to smart sensing and many others. The technical trend in realizing the successes has been towards increasing complex and large size AI models so as to solve more complex problems at superior performance and robustness. This rapid progress, however, has taken place at the expense of substantial environmental costs and resources. Besides, debates on the societal impacts of AI, such as fairness, safety and privacy, have continued to grow in intensity. These issues have presented major concerns pertaining to the sustainable development of AI. In this work, we review major trends in machine learning approaches that can address the sustainability problem of AI. Specifically, we examine emerging AI methodologies and algorithms for addressing the sustainability issue of AI in two major aspects, i.e., environmental sustainability and social sustainability of AI. We will also highlight the major limitations of existing studies and propose potential research challenges and directions for the development of next generation of sustainable AI techniques. We believe that this technical review can help to promote a sustainable development of AI R&D activities for the research community.

preprint2022arXiv

ADAST: Attentive Cross-domain EEG-based Sleep Staging Framework with Iterative Self-Training

Sleep staging is of great importance in the diagnosis and treatment of sleep disorders. Recently, numerous data-driven deep learning models have been proposed for automatic sleep staging. They mainly train the model on a large public labeled sleep dataset and test it on a smaller one with subjects of interest. However, they usually assume that the train and test data are drawn from the same distribution, which may not hold in real-world scenarios. Unsupervised domain adaption (UDA) has been recently developed to handle this domain shift problem. However, previous UDA methods applied for sleep staging have two main limitations. First, they rely on a totally shared model for the domain alignment, which may lose the domain-specific information during feature extraction. Second, they only align the source and target distributions globally without considering the class information in the target domain, which hinders the classification performance of the model while testing. In this work, we propose a novel adversarial learning framework called ADAST to tackle the domain shift problem in the unlabeled target domain. First, we develop an unshared attention mechanism to preserve the domain-specific features in both domains. Second, we design an iterative self-training strategy to improve the classification performance on the target domain via target domain pseudo labels. We also propose dual distinct classifiers to increase the robustness and quality of the pseudo labels. The experimental results on six cross-domain scenarios validate the efficacy of our proposed framework and its advantage over state-of-the-art UDA methods. The source code is available at https://github.com/emadeldeen24/ADAST.

preprint2022arXiv

Dimensions of certain sets of continued fractions with non-decreasing partial quotients

Let $[a_1(x),a_2(x),a_3(x),\cdots]$ be the continued fraction expansion of $x\in (0,1)$. This paper is concerned with certain sets of continued fractions with non-decreasing partial quotients. As a main result, we obtain the Hausdorff dimension of the set \[\left\{x\in(0,1): a_1(x)\leq a_2(x)\leq \cdots,\ \limsup\limits_{n\to\infty}\frac{\log a_n(x)}{ψ(n)}=1\right\}\] for any $ψ:\mathbb{N}\rightarrow\mathbb{R}^+$ satisfying $ψ(n)\to\infty$ as $n\to\infty$.

preprint2022arXiv

FFConv: Fast Factorized Convolutional Neural Network Inference on Encrypted Data

Homomorphic Encryption (HE), allowing computations on encrypted data (ciphertext) without decrypting it first, enables secure but prohibitively slow Convolutional Neural Network (CNN) inference for privacy-preserving applications in clouds. To reduce the inference latency, one approach is to pack multiple messages into a single ciphertext in order to reduce the number of ciphertexts and support massive parallelism of Homomorphic Multiply-Accumulate (HMA) operations between ciphertexts. Despite the faster HECNN inference, the mainstream packing schemes Dense Packing (DensePack) and Convolution Packing (ConvPack) introduce expensive rotation overhead, which prolongs the inference latency of HECNN for deeper and wider CNN architectures. In this paper, we propose a low-rank factorization method named FFConv dedicated to efficient ciphertext packing for reducing both the rotation overhead and HMA operations. FFConv approximates a d x d convolution layer with low-rank factorized convolutions, in which a d x d low-rank convolution with fewer channels is followed by a 1 x 1 convolution to restore the channels. The d x d low-rank convolution with DensePack leads to significantly reduced rotation operations, while the rotation overhead of 1 x 1 convolution with ConvPack is close to zero. To our knowledge, FFConv is the first work that is capable of reducing the rotation overhead incurred by DensePack and ConvPack simultaneously, without introducing additional special blocks into the HECNN inference pipeline. Compared to prior art LoLa and Falcon, our method reduces the inference latency by up to 88% and 21%, respectively, with comparable accuracy on MNIST and CIFAR-10.

preprint2022arXiv

Full Poincaré polarimetry enabled through physical inference

While polarisation sensing is vital in many areas of research, with applications spanning from microscopy to aerospace, traditional approaches are limited by method-related error amplification or accumulation, placing fundamental limitations on precision and accuracy in single-shot polarimetry. Here, we put forward a new measurement paradigm to circumvent this, introducing the notion of a universal full Poincaré generator to map all polarisation analyser states into a single vectorially structured light field, allowing all vector components to be analysed in a single-shot with theoretically user-defined precision. To demonstrate the advantage of our approach, we use a common GRIN optic as our mapping device and show mean errors of <1% for each vector component, enhancing the sensitivity by around three times, allowing us to sense weak polarisation aberrations not measurable by traditional single-shot techniques. Our work paves the way for next-generation polarimetry, impacting a wide variety of applications relying on weak vector measurement.

preprint2022arXiv

Inferring relative surface elastic moduli in thin-wall models of single cells

There is a growing interest in measuring the cell wall mechanical property at different locations in single walled cells. We present an inference scheme that maps relative surface elastic modulus distributions along the cell wall based on tracking the location of material marker points along the turgid and relaxed cell wall outline. A primary scheme provides a step-function inference of surface elastic moduli by computing the tensions and elastic stretches between material marker points. We perform stability analysis for the primary scheme against perturbations on the marker-point locations, which may occur due to image acquisition and processing from experiments. The perturbation analysis shows that the primary scheme is more stable to noise when the spacing between the marker points is coarser, and has been confirmed by the numerical experiments where we apply the primary scheme to synthetic cell outlines from simulations of hyper-elastic membrane deformation with random noise on the marker-point locations. To improve the spatial resolution of elastic modulus distribution of the primary scheme with noise, we propose two optimization schemes that convert the step-function inferences of elastic moduli into smooth-curve inferences. The first scheme infers a canonical elastic modulus distribution based on marker-point locations from multiple cell samples of the same cell type. The second scheme is a simplified cost-effective version that infers the elastic moduli based on marker-point locations from a single cell. The numerical experiments show that the first scheme significantly improves the inference precision for the underlying canonical elastic modulus distributions and can even capture some degree of nonlinearity when the underlying elastic modulus gradients are nonlinear. The second cost-effective scheme can predict the trend of the elastic modulus gradients consistently.

preprint2022arXiv

Lumbar Bone Mineral Density Estimation from Chest X-ray Images: Anatomy-aware Attentive Multi-ROI Modeling

Osteoporosis is a common chronic metabolic bone disease often under-diagnosed and under-treated due to the limited access to bone mineral density (BMD) examinations, e.g. via Dual-energy X-ray Absorptiometry (DXA). This paper proposes a method to predict BMD from Chest X-ray (CXR), one of the most commonly accessible and low-cost medical imaging examinations. Our method first automatically detects Regions of Interest (ROIs) of local CXR bone structures. Then a multi-ROI deep model with transformer encoder is developed to exploit both local and global information in the chest X-ray image for accurate BMD estimation. Our method is evaluated on 13719 CXR patient cases with ground truth BMD measured by the gold standard DXA. The model predicted BMD has a strong correlation with the ground truth (Pearson correlation coefficient 0.894 on lumbar 1). When applied in osteoporosis screening, it achieves a high classification performance (average AUC of 0.968). As the first effort of using CXR scans to predict the BMD, the proposed algorithm holds strong potential for early osteoporosis screening and public health promotion.

preprint2022arXiv

Magnetotransport due to conductivity fluctuations in non-magnetic ZrTe2 nanoplates

Transition metal dichalcogenides with nontrivial band structures exhibit various fascinating physical properties and have sparked intensively research interest. Here, we performed systematic magnetotransport measurements on mechanical exfoliation prepared ZrTe2 nanoplates. We revealed that the negative longitudinal magnetoresistivity observed at high field region in the presence of parallel electric and magnetic fields could stem from the conductivity fluctuations due to the excess Zr in the nanoplates. In addition, the parametric plot, the planar Hall resistivity as function of the in-plane anisotropic magnetoresistivity, has an ellipse-shaped pattern with shifted orbital center, which further strengthen the evidence for the conductivity fluctuations. Our work provides some useful insights into transport phenomena in topological materials.

preprint2022arXiv

Multi-Omic Data Integration and Feature Selection for Survival-based Patient Stratification via Supervised Concrete Autoencoders

Cancer is a complex disease with significant social and economic impact. Advancements in high-throughput molecular assays and the reduced cost for performing high-quality multi-omics measurements have fuelled insights through machine learning . Previous studies have shown promise on using multiple omic layers to predict survival and stratify cancer patients. In this paper, we developed a Supervised Autoencoder (SAE) model for survival-based multi-omic integration which improves upon previous work, and report a Concrete Supervised Autoencoder model (CSAE), which uses feature selection to jointly reconstruct the input features as well as predict survival. Our experiments show that our models outperform or are on par with some of the most commonly used baselines, while either providing a better survival separation (SAE) or being more interpretable (CSAE). We also perform a feature selection stability analysis on our models and notice that there is a power-law relationship with features which are commonly associated with survival. The code for this project is available at: https://github.com/phcavelar/coxae

preprint2022arXiv

Nebula Graph: An open source distributed graph database

This paper introduces the recent work of Nebula Graph, an open-source, distributed, scalable, and native graph database. We present a system design trade-off and a comprehensive overview of Nebula Graph internals, including graph data models, partitioning strategies, secondary indexes, optimizer rules, storage-side transactions, graph query languages, observability, graph processing frameworks, and visualization tool-kits. In addition, three sets of large-scale graph b

preprint2022arXiv

RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System

Speech enhancement and separation have been a long-standing problem, especially with the recent advances using a single microphone. Although microphones perform well in constrained settings, their performance for speech separation decreases in noisy conditions. In this work, we propose RadioSES, an audioradio speech enhancement and separation system that overcomes inherent problems in audio-only systems. By fusing a complementary radio modality, RadioSES can estimate the number of speakers, solve source association problem, separate and enhance noisy mixture speeches, and improve both intelligibility and perceptual quality. We perform millimeter-wave sensing to detect and localize speakers, and introduce an audioradio deep learning framework to fuse the separate radio features with the mixed audio features. Extensive experiments using commercial off-the-shelf devices show that RadioSES outperforms a variety of state-of-the-art baselines, with consistent performance gains in different environmental settings. Compared with the audiovisual methods, RadioSES provides similar improvements (e.g., ~3 dB gains in SiSDR), along with the benefits of lower computational complexity and being less privacy concerning.

preprint2022arXiv

Root-aligned SMILES: A Tight Representation for Chemical Reaction Prediction

Chemical reaction prediction, involving forward synthesis and retrosynthesis prediction, is a fundamental problem in organic synthesis. A popular computational paradigm formulates synthesis prediction as a sequence-to-sequence translation problem, where the typical SMILES is adopted for molecule representations. However, the general-purpose SMILES neglects the characteristics of chemical reactions, where the molecular graph topology is largely unaltered from reactants to products, resulting in the suboptimal performance of SMILES if straightforwardly applied. In this article, we propose the root-aligned SMILES (R-SMILES), which specifies a tightly aligned one-to-one mapping between the product and the reactant SMILES for more efficient synthesis prediction. Due to the strict one-to-one mapping and reduced edit distance, the computational model is largely relieved from learning the complex syntax and dedicated to learning the chemical knowledge for reactions. We compare the proposed R-SMILES with various state-of-the-art baselines and show that it significantly outperforms them all, demonstrating the superiority of the proposed method.

preprint2022arXiv

Self-supervised Autoregressive Domain Adaptation for Time Series Data

Unsupervised domain adaptation (UDA) has successfully addressed the domain shift problem for visual applications. Yet, these approaches may have limited performance for time series data due to the following reasons. First, they mainly rely on large-scale dataset (i.e., ImageNet) for the source pretraining, which is not applicable for time-series data. Second, they ignore the temporal dimension on the feature space of the source and target domains during the domain alignment step. Last, most of prior UDA methods can only align the global features without considering the fine-grained class distribution of the target domain. To address these limitations, we propose a Self-supervised Autoregressive Domain Adaptation (SLARDA) framework. In particular, we first design a self-supervised learning module that utilizes forecasting as an auxiliary task to improve the transferability of the source features. Second, we propose a novel autoregressive domain adaptation technique that incorporates temporal dependency of both source and target features during domain alignment. Finally, we develop an ensemble teacher model to align the class-wise distribution in the target domain via a confident pseudo labeling approach. Extensive experiments have been conducted on three real-world time series applications with 30 cross-domain scenarios. Results demonstrate that our proposed SLARDA method significantly outperforms the state-of-the-art approaches for time series domain adaptation.

preprint2022arXiv

Separable-HoverNet and Instance-YOLO for Colon Nuclei Identification and Counting

Nuclear segmentation, classification and quantification within Haematoxylin & Eosin stained histology images enables the extraction of interpretable cell-based features that can be used in downstream explainable models in computational pathology (CPath). However, automatic recognition of different nuclei is faced with a major challenge in that there are several different types of nuclei, some of them exhibiting large intraclass variability. In this work, we propose an approach that combine Separable-HoverNet and Instance-YOLOv5 to indentify colon nuclei small and unbalanced. Our approach can achieve mPQ+ 0.389 on the Segmentation and Classification-Preliminary Test Dataset and r2 0.599 on the Cellular Composition-Preliminary Test Dataset on ISBI 2022 CoNIC Challenge.

preprint2022arXiv

SwiftPruner: Reinforced Evolutionary Pruning for Efficient Ad Relevance

Ad relevance modeling plays a critical role in online advertising systems including Microsoft Bing. To leverage powerful transformers like BERT in this low-latency setting, many existing approaches perform ad-side computations offline. While efficient, these approaches are unable to serve cold start ads, resulting in poor relevance predictions for such ads. This work aims to design a new, low-latency BERT via structured pruning to empower real-time online inference for cold start ads relevance on a CPU platform. Our challenge is that previous methods typically prune all layers of the transformer to a high, uniform sparsity, thereby producing models which cannot achieve satisfactory inference speed with an acceptable accuracy. In this paper, we propose SwiftPruner - an efficient framework that leverages evolution-based search to automatically find the best-performing layer-wise sparse BERT model under the desired latency constraint. Different from existing evolution algorithms that conduct random mutations, we propose a reinforced mutator with a latency-aware multi-objective reward to conduct better mutations for efficiently searching the large space of layer-wise sparse models. Extensive experiments demonstrate that our method consistently achieves higher ROC AUC and lower latency than the uniform sparse baseline and state-of-the-art search methods. Remarkably, under our latency requirement of 1900us on CPU, SwiftPruner achieves a 0.86% higher AUC than the state-of-the-art uniform sparse baseline for BERT-Mini on a large scale real-world dataset. Online A/B testing shows that our model also achieves a significant 11.7% cut in the ratio of defective cold start ads with satisfactory real-time serving latency.

preprint2022arXiv

Traceable Automatic Feature Transformation via Cascading Actor-Critic Agents

Feature transformation for AI is an essential task to boost the effectiveness and interpretability of machine learning (ML). Feature transformation aims to transform original data to identify an optimal feature space that enhances the performances of a downstream ML model. Existing studies either combines preprocessing, feature selection, and generation skills to empirically transform data, or automate feature transformation by machine intelligence, such as reinforcement learning. However, existing studies suffer from: 1) high-dimensional non-discriminative feature space; 2) inability to represent complex situational states; 3) inefficiency in integrating local and global feature information. To fill the research gap, we formulate the feature transformation task as an iterative, nested process of feature generation and selection, where feature generation is to generate and add new features based on original features, and feature selection is to remove redundant features to control the size of feature space. Finally, we present extensive experiments and case studies to illustrate 24.7\% improvements in F1 scores compared with SOTAs and robustness in high-dimensional data.

preprint2022arXiv

WSSS4LUAD: Grand Challenge on Weakly-supervised Tissue Semantic Segmentation for Lung Adenocarcinoma

Lung cancer is the leading cause of cancer death worldwide, and adenocarcinoma (LUAD) is the most common subtype. Exploiting the potential value of the histopathology images can promote precision medicine in oncology. Tissue segmentation is the basic upstream task of histopathology image analysis. Existing deep learning models have achieved superior segmentation performance but require sufficient pixel-level annotations, which is time-consuming and expensive. To enrich the label resources of LUAD and to alleviate the annotation efforts, we organize this challenge WSSS4LUAD to call for the outstanding weakly-supervised semantic segmentation (WSSS) techniques for histopathology images of LUAD. Participants have to design the algorithm to segment tumor epithelial, tumor-associated stroma and normal tissue with only patch-level labels. This challenge includes 10,091 patch-level annotations (the training set) and over 130 million labeled pixels (the validation and test sets), from 87 WSIs (67 from GDPH, 20 from TCGA). All the labels were generated by a pathologist-in-the-loop pipeline with the help of AI models and checked by the label review board. Among 532 registrations, 28 teams submitted the results in the test phase with over 1,000 submissions. Finally, the first place team achieved mIoU of 0.8413 (tumor: 0.8389, stroma: 0.7931, normal: 0.8919). According to the technical reports of the top-tier teams, CAM is still the most popular approach in WSSS. Cutmix data augmentation has been widely adopted to generate more reliable samples. With the success of this challenge, we believe that WSSS approaches with patch-level annotations can be a complement to the traditional pixel annotations while reducing the annotation efforts. The entire dataset has been released to encourage more researches on computational pathology in LUAD and more novel WSSS techniques.

preprint2021arXiv

Cross-domain Joint Dictionary Learning for ECG Inference from PPG

The inverse problem of inferring electrocardiogram (ECG) from photoplethysmogram (PPG) is an emerging research direction that combines the easy measurability of PPG and the rich clinical knowledge of ECG for long-term continuous cardiac monitoring. The prior art for reconstruction using a universal basis has limited fidelity for uncommon ECG waveform shapes due to the lack of rich representative power. In this paper, we design two dictionary learning frameworks, the cross-domain joint dictionary learning (XDJDL) and the label-consistent XDJDL (LC-XDJDL), to further improve the ECG inference quality and enrich the PPG-based diagnosis knowledge. Building on the K-SVD technique, our proposed joint dictionary learning frameworks aim to maximize the expressive power by optimizing simultaneously a pair of signal dictionaries for PPG and ECG with the transforms to relate their sparse codes and disease information. The proposed models are evaluated with 34,000+ ECG/PPG cycle pairs containing a variety of ECG morphologies and cardiovascular diseases. We demonstrate both visually and quantitatively that our proposed frameworks can achieve better inference performance than previous methods, suggesting an encouraging potential for ECG screening using PPG based on the proactive learned PPG-ECG relationship.

preprint2020arXiv

A Survey of Safety and Trustworthiness of Deep Neural Networks: Verification, Testing, Adversarial Attack and Defence, and Interpretability

In the past few years, significant progress has been made on deep neural networks (DNNs) in achieving human-level performance on several long-standing tasks. With the broader deployment of DNNs on various applications, the concerns over their safety and trustworthiness have been raised in public, especially after the widely reported fatal incidents involving self-driving cars. Research to address these concerns is particularly active, with a significant number of papers released in the past few years. This survey paper conducts a review of the current research effort into making DNNs safe and trustworthy, by focusing on four aspects: verification, testing, adversarial attack and defence, and interpretability. In total, we survey 202 papers, most of which were published after 2017.

preprint2020arXiv

Attention Sequence to Sequence Model for Machine Remaining Useful Life Prediction

Accurate estimation of remaining useful life (RUL) of industrial equipment can enable advanced maintenance schedules, increase equipment availability and reduce operational costs. However, existing deep learning methods for RUL prediction are not completely successful due to the following two reasons. First, relying on a single objective function to estimate the RUL will limit the learned representations and thus affect the prediction accuracy. Second, while longer sequences are more informative for modelling the sensor dynamics of equipment, existing methods are less effective to deal with very long sequences, as they mainly focus on the latest information. To address these two problems, we develop a novel attention-based sequence to sequence with auxiliary task (ATS2S) model. In particular, our model jointly optimizes both reconstruction loss to empower our model with predictive capabilities (by predicting next input sequence given current input sequence) and RUL prediction loss to minimize the difference between the predicted RUL and actual RUL. Furthermore, to better handle longer sequence, we employ the attention mechanism to focus on all the important input information during training process. Finally, we propose a new dual-latent feature representation to integrate the encoder features and decoder hidden states, to capture rich semantic information in data. We conduct extensive experiments on four real datasets to evaluate the efficacy of the proposed method. Experimental results show that our proposed method can achieve superior performance over 13 state-of-the-art methods consistently.

preprint2020arXiv

Contextualized Graph Attention Network for Recommendation with Item Knowledge Graph

Graph neural networks (GNN) have recently been applied to exploit knowledge graph (KG) for recommendation. Existing GNN-based methods explicitly model the dependency between an entity and its local graph context in KG (i.e., the set of its first-order neighbors), but may not be effective in capturing its non-local graph context (i.e., the set of most related high-order neighbors). In this paper, we propose a novel recommendation framework, named Contextualized Graph Attention Network (CGAT), which can explicitly exploit both local and non-local graph context information of an entity in KG. Specifically, CGAT captures the local context information by a user-specific graph attention mechanism, considering a user's personalized preferences on entities. Moreover, CGAT employs a biased random walk sampling process to extract the non-local context of an entity, and utilizes a Recurrent Neural Network (RNN) to model the dependency between the entity and its non-local contextual entities. To capture the user's personalized preferences on items, an item-specific attention mechanism is also developed to model the dependency between a target item and the contextual items extracted from the user's historical behaviors. Experimental results on real datasets demonstrate the effectiveness of CGAT, compared with state-of-the-art KG-based recommendation methods.

preprint2020arXiv

Lower Assouad type dimensions of uniformly perfect sets in doubling metric spaces

In this paper, we are concerned with the relationship among the lower Assouad type dimensions. For uniformly perfect sets in doubling metric spaces, we obtain a variational result between two different but closely related lower Assouad spectra. As an application, we show that the limit of the lower Assouad spectrum as $θ$ tends to 1 equals to the quasi-lower Assouad dimension, which provides an equivalent definition to the latter. On the other hand, although the limit of the lower Assouad spectrum as $θ$ tends to 0 exists, there exist uniformly perfect sets such that this limit is not equal to the lower box-counting dimension. Moreover, by the example of Cantor cut-out sets, we show that the new definition of quasi lower Assouad dimension is more accessible, and indicate that the lower Assouad dimension could be strictly smaller than the lower spectra and the quasi lower Assouad dimension.

preprint2020arXiv

Robustness Guarantees for Deep Neural Networks on Videos

The widespread adoption of deep learning models places demands on their robustness. In this paper, we consider the robustness of deep neural networks on videos, which comprise both the spatial features of individual frames extracted by a convolutional neural network and the temporal dynamics between adjacent frames captured by a recurrent neural network. To measure robustness, we study the maximum safe radius problem, which computes the minimum distance from the optical flow sequence obtained from a given input to that of an adversarial example in the neighbourhood of the input. We demonstrate that, under the assumption of Lipschitz continuity, the problem can be approximated using finite optimisation via discretising the optical flow space, and the approximation has provable guarantees. We then show that the finite optimisation problem can be solved by utilising a two-player turn-based game in a cooperative setting, where the first player selects the optical flows and the second player determines the dimensions to be manipulated in the chosen flow. We employ an anytime approach to solve the game, in the sense of approximating the value of the game by monotonically improving its upper and lower bounds. We exploit a gradient-based search algorithm to compute the upper bounds, and the admissible A* algorithm to update the lower bounds. Finally, we evaluate our framework on the UCF101 video dataset.

preprint2020arXiv

Three-dimensional topological semimetal phase in layered TaNiTe5 probed by de Haas-van Alphen effect

Layered three-dimensional (3D) topological semimetals have attracted intensively attention due to the exotic phenomena and abundantly tunable properties. Here we report the experimental evidence for the 3D topological semimetal phase in layered material TaNiTe5 single crystals through quantum oscillations. Strong quantum oscillations have been observed with diamagnetism background in TaNiTe5. By analyzing the de Haas-van Alphen oscillations, multi-periodic oscillations were extracted, in content with magnetotransport measurements. Moreover, nontrivial "π" Berry phase with 3D Fermi surface is identified, indicating the topologically nontrivial feature in TaNiTe5. Additionally, we demonstrated the thin-layer of TaNiTe5 crystals is highly feasible by the mechanical exfoliation, which offers a platform to explore exotic properties in low dimensional topological semimetal and paves the way for potential applications in nanodevices.

preprint2020arXiv

Towards Threshold Invariant Fair Classification

Effective machine learning models can automatically learn useful information from a large quantity of data and provide decisions in a high accuracy. These models may, however, lead to unfair predictions in certain sense among the population groups of interest, where the grouping is based on such sensitive attributes as race and gender. Various fairness definitions, such as demographic parity and equalized odds, were proposed in prior art to ensure that decisions guided by the machine learning models are equitable. Unfortunately, the "fair" model trained with these fairness definitions is threshold sensitive, i.e., the condition of fairness may no longer hold true when tuning the decision threshold. This paper introduces the notion of threshold invariant fairness, which enforces equitable performances across different groups independent of the decision threshold. To achieve this goal, this paper proposes to equalize the risk distributions among the groups via two approximation methods. Experimental results demonstrate that the proposed methodology is effective to alleviate the threshold sensitivity in machine learning models designed to achieve fairness.

preprint2020arXiv

Transparency Tools for Fairness in AI (Luskin)

We propose new tools for policy-makers to use when assessing and correcting fairness and bias in AI algorithms. The three tools are: - A new definition of fairness called "controlled fairness" with respect to choices of protected features and filters. The definition provides a simple test of fairness of an algorithm with respect to a dataset. This notion of fairness is suitable in cases where fairness is prioritized over accuracy, such as in cases where there is no "ground truth" data, only data labeled with past decisions (which may have been biased). - Algorithms for retraining a given classifier to achieve "controlled fairness" with respect to a choice of features and filters. Two algorithms are presented, implemented and tested. These algorithms require training two different models in two stages. We experiment with combinations of various types of models for the first and second stage and report on which combinations perform best in terms of fairness and accuracy. - Algorithms for adjusting model parameters to achieve a notion of fairness called "classification parity". This notion of fairness is suitable in cases where accuracy is prioritized. Two algorithms are presented, one which assumes that protected features are accessible to the model during testing, and one which assumes protected features are not accessible during testing. We evaluate our tools on three different publicly available datasets. We find that the tools are useful for understanding various dimensions of bias, and that in practice the algorithms are effective in starkly reducing a given observed bias when tested on new data.

preprint2019arXiv

A Game-Based Approximate Verification of Deep Neural Networks with Provable Guarantees

Despite the improved accuracy of deep neural networks, the discovery of adversarial examples has raised serious safety concerns. In this paper, we study two variants of pointwise robustness, the maximum safe radius problem, which for a given input sample computes the minimum distance to an adversarial example, and the feature robustness problem, which aims to quantify the robustness of individual features to adversarial perturbations. We demonstrate that, under the assumption of Lipschitz continuity, both problems can be approximated using finite optimisation by discretising the input space, and the approximation has provable guarantees, i.e., the error is bounded. We then show that the resulting optimisation problems can be reduced to the solution of two-player turn-based games, where the first player selects features and the second perturbs the image within the feature. While the second player aims to minimise the distance to an adversarial example, depending on the optimisation objective the first player can be cooperative or competitive. We employ an anytime approach to solve the games, in the sense of approximating the value of a game by monotonically improving its upper and lower bounds. The Monte Carlo tree search algorithm is applied to compute upper bounds for both games, and the Admissible A* and the Alpha-Beta Pruning algorithms are, respectively, used to compute lower bounds for the maximum safety radius and feature robustness games. When working on the upper bound of the maximum safe radius problem, our tool demonstrates competitive performance against existing adversarial example crafting algorithms. Furthermore, we show how our framework can be deployed to evaluate pointwise robustness of neural networks in safety-critical applications such as traffic sign recognition in self-driving cars.

preprint2019arXiv

Uniform recurrence properties for beta-transformation

For any $β> 1$, let $T_β: [0,1)\rightarrow [0,1)$ be the $β$-transformation defined by $T_βx=βx \mod 1$. We study the uniform recurrence properties of the orbit of a point under the $β$-transformation to the point itself. The size of the set of points with prescribed uniform recurrence rate is obtained. More precisely, for any $0\leq \hat{r}\leq +\infty$, the set $$\left\{x \in [0,1): \forall\ N\gg1, \exists\ 1\leq n \leq N, {\rm\ s.t.}\ |T^n_βx-x|\leq β^{-\hat{r}N}\right\}$$ is of Hausdorff dimension $\left(\frac{1-\hat{r}}{1+\hat{r}}\right)^2$ if $0\leq \hat{r}\leq 1$ and is countable if $\hat{r}>1$.

preprint2016arXiv

A note on Rényi's "record" problem and Engel's series

In 1973, Williams introduced two interesting discrete Markov processes, namely $C$-processes and $A$-processes, which are related to record times in statistics and Engel's series in number theory respectively. Moreover, he showed that these two processes share the same classical limit theorems, such as the law of large numbers, central limit theorem and law of the iterated logarithm. In this paper, we consider the large deviations for these two Markov processes, which indicate that there is a difference between $C$-processes and $A$-processes in the context of large deviations.

preprint2016arXiv

Approximation orders of real numbers by $β$-expansions

We prove that almost all real numbers (with respect to Lebesgue measure) are approximated by the convergents of their $β$-expansions with the exponential order $β^{-n}$. Moreover, the Hausdorff dimensions of sets of the real numbers which are approximated by all other orders, are determined. These results are also applied to investigate the orbits of real numbers under $β$-transformation, the shrinking target type problem, the Diophantine approximation and the run-length function of $β$-expansions.

preprint2016arXiv

Beta-expansion and continued fraction expansion of real numbers

Let $β> 1$ be a real number and $x \in [0,1)$ be an irrational number. We denote by $k_n(x)$ the exact number of partial quotients in the continued fraction expansion of $x$ given by the first $n$ digits in the $β$-expansion of $x$ ($n \in \mathbb{N}$). It is known that $k_n(x)/n$ converges to $(6\log2\logβ)/π^2$ almost everywhere in the sense of Lebesgue measure. In this paper, we improve this result by proving that the Lebesgue measure of the set of $x \in [0,1)$ for which $k_n(x)/n$ deviates away from $(6\log2\logβ)/π^2$ decays to 0 exponentially as $n$ tends to $\infty$, which generalizes the result of Faivre \cite{lesFai97} from $β= 10$ to any $β>1$. Moreover, we also discuss which of the $β$-expansion and continued fraction expansion yields the better approximations of real numbers.

preprint2016arXiv

Efficient Estimation of Compressible State-Space Models with Application to Calcium Signal Deconvolution

In this paper, we consider linear state-space models with compressible innovations and convergent transition matrices in order to model spatiotemporally sparse transient events. We perform parameter and state estimation using a dynamic compressed sensing framework and develop an efficient solution consisting of two nested Expectation-Maximization (EM) algorithms. Under suitable sparsity assumptions on the innovations, we prove recovery guarantees and derive confidence bounds for the state estimates. We provide simulation studies as well as application to spike deconvolution from calcium imaging data which verify our theoretical results and show significant improvement over existing algorithms.

preprint2016arXiv

Impact Analysis of Baseband Quantizer on Coding Efficiency for HDR Video

Digitally acquired high dynamic range (HDR) video baseband signal can take 10 to 12 bits per color channel. It is economically important to be able to reuse the legacy 8 or 10-bit video codecs to efficiently compress the HDR video. Linear or nonlinear mapping on the intensity can be applied to the baseband signal to reduce the dynamic range before the signal is sent to the codec, and we refer to this range reduction step as a baseband quantization. We show analytically and verify using test sequences that the use of the baseband quantizer lowers the coding efficiency. Experiments show that as the baseband quantizer is strengthened by 1.6 bits, the drop of PSNR at a high bitrate is up to 1.60dB. Our result suggests that in order to achieve high coding efficiency, information reduction of videos in terms of quantization error should be introduced in the video codec instead of on the baseband signal.

preprint2016arXiv

Learning Optimal Social Dependency for Recommendation

Social recommender systems exploit users' social relationships to improve the recommendation accuracy. Intuitively, a user tends to trust different subsets of her social friends, regarding with different scenarios. Therefore, the main challenge of social recommendation is to exploit the optimal social dependency between users for a specific recommendation task. In this paper, we propose a novel recommendation method, named probabilistic relational matrix factorization (PRMF), which aims to learn the optimal social dependency between users to improve the recommendation accuracy, with or without users' social relationships. Specifically, in PRMF, the latent features of users are assumed to follow a matrix variate normal (MVN) distribution. The positive and negative dependency between users are modeled by the row precision matrix of the MVN distribution. Moreover, we have also proposed an efficient alternating algorithm to solve the optimization problem of PRMF. The experimental results on real datasets demonstrate that the proposed PRMF method outperforms state-of-the-art social recommendation approaches, in terms of root mean square error (RMSE) and mean absolute error (MAE).

preprint2016arXiv

Limit theorems related to beta-expansion and continued fraction expansion

Let $β> 1$ be a real number and $x \in [0,1)$ be an irrational number. Denote by $k_n(x)$ the exact number of partial quotients in the continued fraction expansion of $x$ given by the first $n$ digits in the $β$-expansion of $x$ ($n \in \mathbb{N}$). In this paper, we show a central limit theorem and a law of the iterated logarithm for the random variables sequence $\{k_n, n \geq 1\}$, which generalize the results of Faivre and Wu respectively from $β=10$ to any $β>1$.

preprint2016arXiv

On exceptional sets in Erdős-Rényi limit theorem revisited

For $x\in [0,1],$ the run-length function $r_n(x)$ is defined as the length of the longest run of $1$'s amongst the first $n$ dyadic digits in the dyadic expansion of $x.$ Erdős and Rényi proved that $\lim\limits_{n\to\infty}\frac{r_n(x)}{\log_2n}=1$ for Lebesgue almost all $x\in[0,1]$. Let $H$ denote the set of monotonically increasing functions $φ:\mathbb{N}\to (0,+\infty)$ with $\lim\limits_{n\to\infty}φ(n)=+\infty$. For any $φ\in H$, we prove that the set \[ E_{\max}^φ=\left\{x\in [0,1]:\liminf\limits_{n\to\infty}\frac{r_n(x)}{φ(n)}=0, \limsup\limits_{n\to\infty}\frac{r_n(x)}{φ(n)}=+\infty\right\} \] either has Hausdorff dimension one and is residual in $[0,1]$ or empty. The result solves a conjecture posed in \cite{LW5} affirmatively.

preprint2016arXiv

Random Continued fractions: Lévy constant and Chernoff-type estimate

Given a stochastic process $\{A_n, n \geq 1\}$ taking values in natural numbers, the random continued fractions is defined as $[A_1, A_2, \cdots, A_n, \cdots]$ analogue to the continued fraction expansion of real numbers. Assume that $\{A_n, n \geq 1\}$ is ergodic and the expectation $E(\log A_1) < \infty$, we give a Lévy-type metric theorem which covers that of real case presented by Lévy in 1929. Moreover, a corresponding Chernoff-type estimate is obtained under the conditions $\{A_n, n \geq 1\}$ is $ψ$-mixing and for each $0< t< 1$, $E(A_1^t) < \infty$.

preprint2016arXiv

The denominators of convergents for continued fractions

For any real number $x \in [0,1)$, we denote by $q_n(x)$ the denominator of the $n$-th convergent of the continued fraction expansion of $x$ $(n \in \mathbb{N})$. It is well-known that the Lebesgue measure of the set of points $x \in [0,1)$ for which $\log q_n(x)/n$ deviates away from $π^2/(12\log2)$ decays to zero as $n$ tends to infinity. In this paper, we study the rate of this decay by giving an upper bound and a lower bound. What is interesting is that the upper bound is closely related to the Hausdorff dimensions of the level sets for $\log q_n(x)/n$. As a consequence, we obtain a large deviation type result for $\log q_n(x)/n$, which indicates that the rate of this decay is exponential.

preprint2016arXiv

The dimension of irregular set in parameter space

For any real number $β>1$. The $n$th cylinder of $β$ in the parameter space $\{β\in \mathbb{R}: β>1\}$ is a set of real numbers in $(1,\infty)$ having the same first $n$ digits in their $β$-expansion of $1$, denote by $I^P_n(β)$. We study the quantities which describe the growth of the length of $I^P_n(β)$. The Huasdorff dimension of the set of given growth rate of the length of $I^P_n(β)$ will be determined in this paper.

preprint2016arXiv

The topological property of the irregular sets on the lengths of basic intervals in beta-expansions

Let $β> 1$ be a real number and $(ε_1(x, β), ε_2(x, β), \ldots)$ be the $β$-expansion of a point $x \in (0, 1]$. For all $x \in (0,1]$, let $A(D(x))$ be the set of accumulation points of $\frac{-\log_β|I_n(x)|}{n}$ as $n \rightarrow \infty$, where $|I_n(x)|$ is the length of the basic interval of order $n$ containing $x \in (0, 1]$. In this paper, we prove that $A(D(x))$ is always a closed interval for any $x \in (0,1]$. Furthermore, if $λ(β)>0$, the extremely irregular set containing points $x \in [0, 1]$ whose upper limit of $\frac{-\log_β|I_n(x)|}{n}$ equals to $1+ł(β)$ is residual, where $1+ł(β)$ is a constant depending on $β$. As a consequence, the irregular set with $x\in [0, 1]$ whose limit of $\frac{-\log_β|I_n(x)|}{n}$ does not exist is residual for every $λ(β)>0$.

preprint2014arXiv

Multi-label ensemble based on variable pairwise constraint projection

Multi-label classification has attracted an increasing amount of attention in recent years. To this end, many algorithms have been developed to classify multi-label data in an effective manner. However, they usually do not consider the pairwise relations indicated by sample labels, which actually play important roles in multi-label classification. Inspired by this, we naturally extend the traditional pairwise constraints to the multi-label scenario via a flexible thresholding scheme. Moreover, to improve the generalization ability of the classifier, we adopt a boosting-like strategy to construct a multi-label ensemble from a group of base classifiers. To achieve these goals, this paper presents a novel multi-label classification framework named Variable Pairwise Constraint projection for Multi-label Ensemble (VPCME). Specifically, we take advantage of the variable pairwise constraint projection to learn a lower-dimensional data representation, which preserves the correlations between samples and labels. Thereafter, the base classifiers are trained in the new data space. For the boosting-like strategy, we employ both the variable pairwise constraints and the bootstrap steps to diversify the base classifiers. Empirical studies have shown the superiority of the proposed method in comparison with other approaches.

preprint2013arXiv

Domain-of-Attraction Estimation for Uncertain Non-polynomial Systems

In this paper, we consider the problem of computing estimates of the domain-of-attraction for non-polynomial systems. A polynomial approximation technique, based on multivariate polynomial interpolation and error analysis for remaining functions, is applied to compute an uncertain polynomial system, whose set of trajectories contains that of the original non-polynomial system. Experiments on the benchmark non-polynomial systems show that our approach gives better estimates of the domain-of-attraction.

preprint2013arXiv

Exact Safety Verification of Interval Hybrid Systems Based on Symbolic-Numeric Computation

In this paper, we address the problem of safety verification of interval hybrid systems in which the coefficients are intervals instead of explicit numbers. A hybrid symbolic-numeric method, based on SOS relaxation and interval arithmetic certification, is proposed to generate exact inequality invariants for safety verification of interval hybrid systems. As an application, an approach is provided to verify safety properties of non-polynomial hybrid systems. Experiments on the benchmark hybrid systems are given to illustrate the efficiency of our method.

preprint2013arXiv

In-situ EXAFS study on the thermal decomposition of TiH2

Thermal decomposition behaviors of TiH2 powder under a flowing helium atmosphere and in a low vacuum condition have been studied by using in-situ EXAFS technique. By an EXAFS analysis containing the multiple scattering paths including H atoms, the changes of hydrogen stoichiometric ratio and the phase transformation sequence are obtained. The results demonstrate that the initial decomposition temperature is dependent on experimental conditions, which occurs, respectively, at about 300 and 400 degree in a low vacuum condition and under a flowing helium atmosphere. During the decomposition process of TiH2 in a low vacuum condition, the sample experiences a phase change process: δ(TiH2) - δ(TiHx) - δ(TiHx)+β(TiHx) - δ(TiHx)+β(TiHx)+α(Ti) - β(TiHx)+α(Ti) - α(Ti)+β(Ti). This study offers a way to detect the structural information of hydrogen. A detailed discussion about the decomposition process of TiH2 is given in this paper.

preprint2013arXiv

Sparse Norm Filtering

Optimization-based filtering smoothes an image by minimizing a fidelity function and simultaneously preserves edges by exploiting a sparse norm penalty over gradients. It has obtained promising performance in practical problems, such as detail manipulation, HDR compression and deblurring, and thus has received increasing attentions in fields of graphics, computer vision and image processing. This paper derives a new type of image filter called sparse norm filter (SNF) from optimization-based filtering. SNF has a very simple form, introduces a general class of filtering techniques, and explains several classic filters as special implementations of SNF, e.g. the averaging filter and the median filter. It has advantages of being halo free, easy to implement, and low time and memory costs (comparable to those of the bilateral filter). Thus, it is more generic than a smoothing operator and can better adapt to different tasks. We validate the proposed SNF by a wide variety of applications including edge-preserving smoothing, outlier tolerant filtering, detail manipulation, HDR compression, non-blind deconvolution, image segmentation, and colorization.

preprint2012arXiv

Classification of Element Systems over Finite Commutative Groups

We obtain the formula computing the number of isomorphic classes of element systems with characters over finite commutative group $G$.

preprint2012arXiv

Exact Safety Verification of Hybrid Systems Based on Bilinear SOS Representation

In this paper, we address the problem of safety verification of nonlinear hybrid systems. A hybrid symbolic-numeric method is presented to compute exact inequality invariants of hybrid systems efficiently. Some numerical invariants of a hybrid system can be obtained by solving a bilinear SOS programming via PENBMI solver or iterative method, then the modified Newton refinement and rational vector recovery techniques are applied to obtain exact polynomial invariants with rational coefficients, which {\it exactly} satisfy the conditions of invariants. Experiments on some benchmarks are given to illustrate the efficiency of our algorithm.

preprint2011arXiv

Exact Safety Verification of Hybrid Systems Using Sums-Of-Squares Representation

In this paper we discuss how to generate inductive invariants for safety verification of hybrid systems. A hybrid symbolic-numeric method is presented to compute inequality inductive invariants of the given systems. A numerical invariant of the given system can be obtained by solving a parameterized polynomial optimization problem via sum-of-squares (SOS) relaxation. And a method based on Gauss-Newton refinement and rational vector recovery is deployed to obtain the invariants with rational coefficients, which exactly satisfy the conditions of invariants. Several examples are given to illustrate our algorithm.

preprint2011arXiv

Generating Loop Invariants by Computing Vanishing Ideals of Sample Points

Loop invariants play a very important role in proving correctness of programs. In this paper, we address the problem of generating invariants of polynomial loop programs. We present a new approach, for generating polynomial equation invariants of polynomial loop programs through computing vanishing ideals of sample points. We apply rational function interpolation, based on early termination technique, to generate invariants of loop programs with symbolic initial values. Our approach avoids first-order quantifier elimination and cylindrical algebraic decomposition(CAD). An algorithm for generating polynomial invariants is proposed and some examples are given to illustrate the algorithm. Furthermore, we demonstrate on a set of loop programs with symbolic initial values that our algorithm can yield polynomial invariants with degrees high up to 15.

Min Wu

What is connected

Connect this record

See the researcher in context

Building this map preview

52 published item(s)

CERSA: Cumulative Energy-Retaining Subspace Adaptation for Memory-Efficient Fine-Tuning

Compress Then Adapt? No, Do It Together via Task-aware Union of Subspaces

A Generalized Probabilistic Monitoring Model with Both Random and Sequential Data

A Survey on AI Sustainability: Emerging Trends on Learning Algorithms and Research Challenges

ADAST: Attentive Cross-domain EEG-based Sleep Staging Framework with Iterative Self-Training

Dimensions of certain sets of continued fractions with non-decreasing partial quotients

FFConv: Fast Factorized Convolutional Neural Network Inference on Encrypted Data

Full Poincaré polarimetry enabled through physical inference

Inferring relative surface elastic moduli in thin-wall models of single cells

Lumbar Bone Mineral Density Estimation from Chest X-ray Images: Anatomy-aware Attentive Multi-ROI Modeling

Magnetotransport due to conductivity fluctuations in non-magnetic ZrTe2 nanoplates

Multi-Omic Data Integration and Feature Selection for Survival-based Patient Stratification via Supervised Concrete Autoencoders

Nebula Graph: An open source distributed graph database

RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System

Root-aligned SMILES: A Tight Representation for Chemical Reaction Prediction

Self-supervised Autoregressive Domain Adaptation for Time Series Data

Separable-HoverNet and Instance-YOLO for Colon Nuclei Identification and Counting

SwiftPruner: Reinforced Evolutionary Pruning for Efficient Ad Relevance

Traceable Automatic Feature Transformation via Cascading Actor-Critic Agents

WSSS4LUAD: Grand Challenge on Weakly-supervised Tissue Semantic Segmentation for Lung Adenocarcinoma

Cross-domain Joint Dictionary Learning for ECG Inference from PPG

A Survey of Safety and Trustworthiness of Deep Neural Networks: Verification, Testing, Adversarial Attack and Defence, and Interpretability

Attention Sequence to Sequence Model for Machine Remaining Useful Life Prediction

Contextualized Graph Attention Network for Recommendation with Item Knowledge Graph

Lower Assouad type dimensions of uniformly perfect sets in doubling metric spaces

Robustness Guarantees for Deep Neural Networks on Videos

Three-dimensional topological semimetal phase in layered TaNiTe5 probed by de Haas-van Alphen effect

Towards Threshold Invariant Fair Classification

Transparency Tools for Fairness in AI (Luskin)

A Game-Based Approximate Verification of Deep Neural Networks with Provable Guarantees

Uniform recurrence properties for beta-transformation

A note on Rényi's "record" problem and Engel's series

Approximation orders of real numbers by $β$-expansions

Beta-expansion and continued fraction expansion of real numbers

Efficient Estimation of Compressible State-Space Models with Application to Calcium Signal Deconvolution

Impact Analysis of Baseband Quantizer on Coding Efficiency for HDR Video

Learning Optimal Social Dependency for Recommendation

Limit theorems related to beta-expansion and continued fraction expansion

On exceptional sets in Erdős-Rényi limit theorem revisited

Random Continued fractions: Lévy constant and Chernoff-type estimate

The denominators of convergents for continued fractions

The dimension of irregular set in parameter space

The topological property of the irregular sets on the lengths of basic intervals in beta-expansions

Multi-label ensemble based on variable pairwise constraint projection

Domain-of-Attraction Estimation for Uncertain Non-polynomial Systems

Exact Safety Verification of Interval Hybrid Systems Based on Symbolic-Numeric Computation

In-situ EXAFS study on the thermal decomposition of TiH2

Sparse Norm Filtering

Classification of Element Systems over Finite Commutative Groups

Exact Safety Verification of Hybrid Systems Based on Bilinear SOS Representation

Exact Safety Verification of Hybrid Systems Using Sums-Of-Squares Representation

Generating Loop Invariants by Computing Vanishing Ideals of Sample Points