Source author record

Tao Chen

Tao Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

94works

41topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

LLMSYS-HPOBench: Hyperparameter Optimization Benchmark Suite for Real-World LLM Systems

Large Language Model (LLM) systems have been the frontier of AI in many application domains, leading to new challenges and opportunities for hyperparameter optimization (HPO) for the AutoML community. However, this type of system exhibits an unprecedented compound space of hyperparameter configuration from both the AI and non-AI components; rich and nonlinear implications from the fidelity factors; and diverse costs of measuring hyperparameter configurations, none of which have been fully captured in existing benchmarks. This paper presents the first (live) benchmark suite and datasets for HPO of real-world LLM systems, dubbed LLMSYS-HPOBench, covering data related to the inference objective values of hyperparameter configurations profiled from running the LLM systems. Currently, LLMSYS-HPOBench contains 364,450 hyperparameter configurations with a dimensionality of 12-23, 3-5 dimensions of fidelity factor leading to 932 settings, 3-9 inference objective metrics, and 2-10 cost metrics, together with generated logs from measuring the LLM systems. What we seek to advocate is not only a revalidation of the existing HPO algorithms over the frontier LLM systems, but also to provide an evolving platform for the AutoML community to explore new directions of research in this regard. The benchmark suite has been made available at: https://github.com/ideas-labo/llmsys-hpobench

preprint2025arXiv

HY-MT1.5 Technical Report

In this report, we introduce our latest translation models, HY-MT1.5-1.8B and HY-MT1.5-7B, a new family of machine translation models developed through a holistic training framework tailored for high-performance translation. Our methodology orchestrates a multi-stage pipeline that integrates general and MT-oriented pre-training, supervised fine-tuning, on-policy distillation, and reinforcement learning. HY-MT1.5-1.8B, the 1.8B-parameter model demonstrates remarkable parameter efficiency, comprehensively outperforming significantly larger open-source baselines (e.g., Tower-Plus-72B, Qwen3-32B) and mainstream commercial APIs (e.g., Microsoft Translator, Doubao Translator) in standard Chinese-foreign and English-foreign tasks. It achieves approximately 90% of the performance of ultra-large proprietary models such as Gemini-3.0-Pro, while marginally trailing Gemini-3.0-Pro on WMT25 and Mandarin-minority language benchmarks, it maintains a substantial lead over other competing models. Furthermore, HY-MT1.5-7B establishes a new state-of-the-art for its size class, achieving 95% of Gemini-3.0-Pro's performance on Flores-200 and surpassing it on the challenging WMT25 and Mandarin-minority language test sets. Beyond standard translation, the HY-MT1.5 series supports advanced constraints, including terminology intervention, context-aware translation, and format preservation. Extensive empirical evaluations confirm that both models offer highly competitive, robust solutions for general and specialized translation tasks within their respective parameter scales.

preprint2024arXiv

MLN-net: A multi-source medical image segmentation method for clustered microcalcifications using multiple layer normalization

Accurate segmentation of clustered microcalcifications in mammography is crucial for the diagnosis and treatment of breast cancer. Despite exhibiting expert-level accuracy, recent deep learning advancements in medical image segmentation provide insufficient contribution to practical applications, due to the domain shift resulting from differences in patient postures, individual gland density, and imaging modalities of mammography etc. In this paper, a novel framework named MLN-net, which can accurately segment multi-source images using only single source images, is proposed for clustered microcalcification segmentation. We first propose a source domain image augmentation method to generate multi-source images, leading to improved generalization. And a structure of multiple layer normalization (LN) layers is used to construct the segmentation network, which can be found efficient for clustered microcalcification segmentation in different domains. Additionally, a branch selection strategy is designed for measuring the similarity of the source domain data and the target domain data. To validate the proposed MLN-net, extensive analyses including ablation experiments are performed, comparison of 12 baseline methods. Extensive experiments validate the effectiveness of MLN-net in segmenting clustered microcalcifications from different domains and the its segmentation accuracy surpasses state-of-the-art methods. Code will be available at https://github.com/yezanting/MLN-NET-VERSON1.

preprint2023arXiv

$β$-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture Search

Neural Architecture Search has attracted increasing attention in recent years. Among them, differential NAS approaches such as DARTS, have gained popularity for the search efficiency. However, they still suffer from three main issues, that are, the weak stability due to the performance collapse, the poor generalization ability of the searched architectures, and the inferior robustness to different kinds of proxies. To solve the stability and generalization problems, a simple-but-effective regularization method, termed as Beta-Decay, is proposed to regularize the DARTS-based NAS searching process (i.e., $β$-DARTS). Specifically, Beta-Decay regularization can impose constraints to keep the value and variance of activated architecture parameters from being too large, thereby ensuring fair competition among architecture parameters and making the supernet less sensitive to the impact of input on the operation set. In-depth theoretical analyses on how it works and why it works are provided. Comprehensive experiments validate that Beta-Decay regularization can help to stabilize the searching process and makes the searched network more transferable across different datasets. To address the robustness problem, we first benchmark different NAS methods under a wide range of proxy data, proxy channels, proxy layers and proxy epochs, since the robustness of NAS under different kinds of proxies has not been explored before. We then conclude some interesting findings and find that $β$-DARTS always achieves the best result among all compared NAS methods under almost all proxies. We further introduce the novel flooding regularization to the weight optimization of $β$-DARTS (i.e., Bi-level regularization), and experimentally and theoretically verify its effectiveness for improving the proxy robustness of differentiable NAS.

preprint2023arXiv

A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction

Neural Radiance Fields (NeRF) has achieved impressive results in single object scene reconstruction and novel view synthesis, which have been demonstrated on many single modality and single object focused indoor scene datasets like DTU, BMVS, and NeRF Synthetic.However, the study of NeRF on large-scale outdoor scene reconstruction is still limited, as there is no unified outdoor scene dataset for large-scale NeRF evaluation due to expensive data acquisition and calibration costs. In this paper, we propose a large-scale outdoor multi-modal dataset, OMMO dataset, containing complex land objects and scenes with calibrated images, point clouds and prompt annotations. Meanwhile, a new benchmark for several outdoor NeRF-based tasks is established, such as novel view synthesis, surface reconstruction, and multi-modal NeRF. To create the dataset, we capture and collect a large number of real fly-view videos and select high-quality and high-resolution clips from them. Then we design a quality review module to refine images, remove low-quality frames and fail-to-calibrate scenes through a learning-based automatic evaluation plus manual review. Finally, a number of volunteers are employed to add the text descriptions for each scene and key-frame to meet the potential multi-modal requirements in the future. Compared with existing NeRF datasets, our dataset contains abundant real-world urban and natural scenes with various scales, camera trajectories, and lighting conditions. Experiments show that our dataset can benchmark most state-of-the-art NeRF methods on different tasks. We will release the dataset and model weights very soon.

preprint2023arXiv

Do Performance Aspirations Matter for Guiding Software Configuration Tuning?

Configurable software systems can be tuned for better performance. Leveraging on some Pareto optimizers, recent work has shifted from tuning for a single, time-related performance objective to two intrinsically different objectives that assess distinct performance aspects of the system, each with varying aspirations. Before we design better optimizers, a crucial engineering decision to make therein is how to handle the performance requirements with clear aspirations in the tuning process. For this, the community takes two alternative optimization models: either quantifying and incorporating the aspirations into the search objectives that guide the tuning, or not considering the aspirations during the search but purely using them in the later decision-making process only. However, despite being a crucial decision that determines how an optimizer can be designed and tailored, there is a rather limited understanding of which optimization model should be chosen under what particular circumstance, and why. In this paper, we seek to close this gap. Firstly, we do that through a review of over 426 papers in the literature and 14 real-world requirements datasets. Drawing on these, we then conduct a comprehensive empirical study that covers 15 combinations of the state-of-the-art performance requirement patterns, four types of aspiration space, three Pareto optimizers, and eight real-world systems/environments, leading to 1,296 cases of investigation. We found that (1) the realism of aspirations is the key factor that determines whether they should be used to guide the tuning; (2) the given patterns and the position of the realistic aspirations in the objective landscape are less important for the choice, but they do matter to the extents of improvement; (3) the available tuning budget can also influence the choice for unrealistic aspirations but it is insignificant under realistic ones.

preprint2023arXiv

End-to-End 3D Dense Captioning with Vote2Cap-DETR

3D dense captioning aims to generate multiple captions localized with their associated object regions. Existing methods follow a sophisticated ``detect-then-describe'' pipeline equipped with numerous hand-crafted components. However, these hand-crafted components would yield suboptimal performance given cluttered object spatial and class distributions among different scenes. In this paper, we propose a simple-yet-effective transformer framework Vote2Cap-DETR based on recent popular \textbf{DE}tection \textbf{TR}ansformer (DETR). Compared with prior arts, our framework has several appealing advantages: 1) Without resorting to numerous hand-crafted components, our method is based on a full transformer encoder-decoder architecture with a learnable vote query driven object decoder, and a caption decoder that produces the dense captions in a set-prediction manner. 2) In contrast to the two-stage scheme, our method can perform detection and captioning in one-stage. 3) Without bells and whistles, extensive experiments on two commonly used datasets, ScanRefer and Nr3D, demonstrate that our Vote2Cap-DETR surpasses current state-of-the-arts by 11.13\% and 7.11\% in CIDEr@0.5IoU, respectively. Codes will be released soon.

preprint2023arXiv

FECANet: Boosting Few-Shot Semantic Segmentation with Feature-Enhanced Context-Aware Network

Few-shot semantic segmentation is the task of learning to locate each pixel of the novel class in the query image with only a few annotated support images. The current correlation-based methods construct pair-wise feature correlations to establish the many-to-many matching because the typical prototype-based approaches cannot learn fine-grained correspondence relations. However, the existing methods still suffer from the noise contained in naive correlations and the lack of context semantic information in correlations. To alleviate these problems mentioned above, we propose a Feature-Enhanced Context-Aware Network (FECANet). Specifically, a feature enhancement module is proposed to suppress the matching noise caused by inter-class local similarity and enhance the intra-class relevance in the naive correlation. In addition, we propose a novel correlation reconstruction module that encodes extra correspondence relations between foreground and background and multi-scale context semantic features, significantly boosting the encoder to capture a reliable matching pattern. Experiments on PASCAL-$5^i$ and COCO-$20^i$ datasets demonstrate that our proposed FECANet leads to remarkable improvement compared to previous state-of-the-arts, demonstrating its effectiveness.

preprint2023arXiv

Optimizing nonadiabatic geometric quantum gates against off-resonance error by dynamical correction in a silicon-based spin qubit

Geometric quantum gates are performed by using the geometric phase, making them particularly robust to the pulse amplitude error due to the intrinsic global property. However, in many systems, such as the silicon-based spin qubits, the off-resonance error is the dominant noise, which can cause dephasing and is always difficult to deal with for a geometric gate. Thus how to deal with the off-resonance error is very significant for the application of the geometric gates. A recent work in \emph{Phy. Rev. Appl. 16, 044005 (2021)} reveals that by inserting two $π$-pulse dynamically corrected sequences into the evolution paths, the holonomic quantum gate is effective to suppress the pulse amplitude error, however it is still useless for combating the off-resonance error. Inspired by this work, we combine using the techniques of dynamical correction and path design. Surprisingly, we find that by picking up a specific evolution path inserted by only a $π$-pulse dynamically corrected sequence, the obtained optimized geometric gate is robust to the off-resonance error, assuming the noise is static. Further, by calculating the filter function considering the realistic $1/f$-type noise in silicon, the related results show that the performance of the optimized geometric gate can also surpass both the conventional geometric gate and the naive dynamical gate constructed without using the geometric phase. Our results indicate dynamical correction is an powerful tool to improve the geometric gate.

preprint2022arXiv

$β$-DARTS: Beta-Decay Regularization for Differentiable Architecture Search

Neural Architecture Search~(NAS) has attracted increasingly more attention in recent years because of its capability to design deep neural networks automatically. Among them, differential NAS approaches such as DARTS, have gained popularity for the search efficiency. However, they suffer from two main issues, the weak robustness to the performance collapse and the poor generalization ability of the searched architectures. To solve these two problems, a simple-but-efficient regularization method, termed as Beta-Decay, is proposed to regularize the DARTS-based NAS searching process. Specifically, Beta-Decay regularization can impose constraints to keep the value and variance of activated architecture parameters from too large. Furthermore, we provide in-depth theoretical analysis on how it works and why it works. Experimental results on NAS-Bench-201 show that our proposed method can help to stabilize the searching process and makes the searched network more transferable across different datasets. In addition, our search scheme shows an outstanding property of being less dependent on training time and data. Comprehensive experiments on a variety of search spaces and datasets validate the effectiveness of the proposed method.

preprint2022arXiv

Asymmetric topological pumping in nonparaxial photonics

Topological photonics was initially inspired by the quantum-optical analogy between the Schrödinger equation for an electron wavefunction and the paraxial equation for a light beam. Here, we reveal an unexpected phenomenon in topological pumping observed in arrays of nonparaxial optical waveguides where the quantum-optical analogy becomes invalid. We predict theoretically and demonstrate experimentally an asymmetric topological pumping when the injected field transfers from one side of the waveguide array to the other side whereas the reverse process is unexpectedly forbidden. Our finding could open an avenue for exploring topological photonics that enables nontrivial topological phenomena and designs in photonics driven by nonparaxiality.

preprint2022arXiv

CO2Sum:Contrastive Learning for Factual-Consistent Abstractive Summarization

Generating factual-consistent summaries is a challenging task for abstractive summarization. Previous works mainly encode factual information or perform post-correct/rank after decoding. In this paper, we provide a factual-consistent solution from the perspective of contrastive learning, which is a natural extension of previous works. We propose CO2Sum (Contrastive for Consistency), a contrastive learning scheme that can be easily applied on sequence-to-sequence models for factual-consistent abstractive summarization, proving that the model can be fact-aware without modifying the architecture. CO2Sum applies contrastive learning on the encoder, which can help the model be aware of the factual information contained in the input article, or performs contrastive learning on the decoder, which makes the model to generate factual-correct output summary. What's more, these two schemes are orthogonal and can be combined to further improve faithfulness. Comprehensive experiments on public benchmarks demonstrate that CO2Sum improves the faithfulness on large pre-trained language models and reaches competitive results compared to other strong factual-consistent summarization baselines.

preprint2022arXiv

Composite Short-path Nonadiabatic Holonomic Quantum Gates

Nonadiabatic holonomic quantum computation (NHQC) has attracted significant attention due to its fast evolution and the geometric nature induced resilience to local noises. However, its long operation time and complex physical implementation make it hard to surpass the dynamical scheme, and thus hindering its wide application. Here, we present to implement NHQC with the shortest path under some conditions, through the inverse Hamiltonian engineering technique, which posseses higher fidelity and stronger robustness than previous NHQC schemes. Meanwhile, the gate performance in our scheme can be further improved by using the proposed composite dynamical decoupling pulses, which can efficiently improve both the gate fidelity and robustness, making our scheme outperform the optimal dynamical scheme in certain parameters range. Remarkably, our scheme can be readily implemented with Rydberg atoms, and a simplified implementation of the controlled-not gate in the Rydberg blockade regime can be achieved. Therefore, our scheme represents a promising progress towards future fault-tolerant quantum computation in atomic systems.

preprint2022arXiv

Data-Driven Nonparametric Robust Control under Dependence Uncertainty

We consider a multi-period stochastic control problem where the multivariate driving stochastic factor of the system has known marginal distributions but uncertain dependence structure. To solve the problem, we propose to implement the nonparametric adaptive robust control framework. We aim to find the optimal control against the worst-case copulae in a sequence of shrinking uncertainty sets which are generated from continuously observing the data. Then, we use a stochastic gradient descent ascent algorithm to numerically handle the corresponding high dimensional dynamic inf-sup optimization problem. We present the numerical results in the context of utility maximization and show that the controller benefits from knowing more information about the uncertain model.

preprint2022arXiv

Does Configuration Encoding Matter in Learning Software Performance? An Empirical Study on Encoding Schemes

Learning and predicting the performance of a configurable software system helps to provide better quality assurance. One important engineering decision therein is how to encode the configuration into the model built. Despite the presence of different encoding schemes, there is still little understanding of which is better and under what circumstances, as the community often relies on some general beliefs that inform the decision in an ad-hoc manner. To bridge this gap, in this paper, we empirically compared the widely used encoding schemes for software performance learning, namely label, scaled label, and one-hot encoding. The study covers five systems, seven models, and three encoding schemes, leading to 105 cases of investigation. Our key findings reveal that: (1) conducting trial-and-error to find the best encoding scheme in a case by case manner can be rather expensive, requiring up to 400+ hours on some models and systems; (2) the one-hot encoding often leads to the most accurate results while the scaled label encoding is generally weak on accuracy over different models; (3) conversely, the scaled label encoding tends to result in the fastest training time across the models/systems while the one-hot encoding is the slowest; (4) for all models studied, label and scaled label encoding often lead to relatively less biased outcomes between accuracy and training time, but the paired model varies according to the system. We discuss the actionable suggestions derived from our findings, hoping to provide a better understanding of this topic for the community. To promote open science, the data and code of this work can be publicly accessed at https://github.com/ideas-labo/MSR2022-encoding-study.

preprint2022arXiv

ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking Inference

State-of-the-art neural models typically encode document-query pairs using cross-attention for re-ranking. To this end, models generally utilize an encoder-only (like BERT) paradigm or an encoder-decoder (like T5) approach. These paradigms, however, are not without flaws, i.e., running the model on all query-document pairs at inference-time incurs a significant computational cost. This paper proposes a new training and inference paradigm for re-ranking. We propose to finetune a pretrained encoder-decoder model using in the form of document to query generation. Subsequently, we show that this encoder-decoder architecture can be decomposed into a decoder-only language model during inference. This results in significant inference time speedups since the decoder-only architecture only needs to learn to interpret static encoder embeddings during inference. Our experiments show that this new paradigm achieves results that are comparable to the more expensive cross-attention ranking approaches while being up to 6.8X faster. We believe this work paves the way for more efficient neural rankers that leverage large pretrained models.

preprint2022arXiv

Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation

Semantic segmentation is a popular research topic in computer vision, and many efforts have been made on it with impressive results. In this paper, we intend to search an optimal network structure that can run in real-time for this problem. Towards this goal, we jointly search the depth, channel, dilation rate and feature spatial resolution, which results in a search space consisting of about 2.78*10^324 possible choices. To handle such a large search space, we leverage differential architecture search methods. However, the architecture parameters searched using existing differential methods need to be discretized, which causes the discretization gap between the architecture parameters found by the differential methods and their discretized version as the final solution for the architecture search. Hence, we relieve the problem of discretization gap from the innovative perspective of solution space regularization. Specifically, a novel Solution Space Regularization (SSR) loss is first proposed to effectively encourage the supernet to converge to its discrete one. Then, a new Hierarchical and Progressive Solution Space Shrinking method is presented to further achieve high efficiency of searching. In addition, we theoretically show that the optimization of SSR loss is equivalent to the L_0-norm regularization, which accounts for the improved search-evaluation gap. Comprehensive experiments show that the proposed search scheme can efficiently find an optimal network structure that yields an extremely fast speed (175 FPS) of segmentation with a small model size (1 M) while maintaining comparable accuracy.

preprint2022arXiv

Generalized Global Ranking-Aware Neural Architecture Ranker for Efficient Image Classifier Search

Neural Architecture Search (NAS) is a powerful tool for automating effective image processing DNN designing. The ranking has been advocated to design an efficient performance predictor for NAS. The previous contrastive method solves the ranking problem by comparing pairs of architectures and predicting their relative performance. However, it only focuses on the rankings between two involved architectures and neglects the overall quality distributions of the search space, which may suffer generalization issues. A predictor, namely Neural Architecture Ranker (NAR) which concentrates on the global quality tier of specific architecture, is proposed to tackle such problems caused by the local perspective. The NAR explores the quality tiers of the search space globally and classifies each individual to the tier they belong to according to its global ranking. Thus, the predictor gains the knowledge of the performance distributions of the search space which helps to generalize its ranking ability to the datasets more easily. Meanwhile, the global quality distribution facilitates the search phase by directly sampling candidates according to the statistics of quality tiers, which is free of training a search algorithm, e.g., Reinforcement Learning (RL) or Evolutionary Algorithm (EA), thus it simplifies the NAS pipeline and saves the computational overheads. The proposed NAR achieves better performance than the state-of-the-art methods on two widely used datasets for NAS research. On the vast search space of NAS-Bench-101, the NAR easily finds the architecture with top 0.01$\unicode{x2030}$ performance only by sampling. It also generalizes well to different image datasets of NAS-Bench-201, i.e., CIFAR-10, CIFAR-100, and ImageNet-16-120 by identifying the optimal architectures for each of them.

preprint2022arXiv

HandoverSim: A Simulation Framework and Benchmark for Human-to-Robot Object Handovers

We introduce a new simulation benchmark "HandoverSim" for human-to-robot object handovers. To simulate the giver's motion, we leverage a recent motion capture dataset of hand grasping of objects. We create training and evaluation environments for the receiver with standardized protocols and metrics. We analyze the performance of a set of baselines and show a correlation with a real-world evaluation. Code is open sourced at https://handover-sim.github.io.

preprint2022arXiv

Joint Distribution Alignment via Adversarial Learning for Domain Adaptive Object Detection

Unsupervised domain adaptive object detection aims to adapt a well-trained detector from its original source domain with rich labeled data to a new target domain with unlabeled data. Recently, mainstream approaches perform this task through adversarial learning, yet still suffer from two limitations. First, they mainly align marginal distribution by unsupervised cross-domain feature matching, and ignore each feature's categorical and positional information that can be exploited for conditional alignment; Second, they treat all classes as equally important for transferring cross-domain knowledge and ignore that different classes usually have different transferability. In this paper, we propose a joint adaptive detection framework (JADF) to address the above challenges. First, an end-to-end joint adversarial adaptation framework for object detection is proposed, which aligns both marginal and conditional distributions between domains without introducing any extra hyperparameter. Next, to consider the transferability of each object class, a metric for class-wise transferability assessment is proposed, which is incorporated into the JADF objective for domain adaptation. Further, an extended study from unsupervised domain adaptation (UDA) to unsupervised few-shot domain adaptation (UFDA) is conducted, where only a few unlabeled training images are available in unlabeled target domain. Extensive experiments validate that JADF is effective in both the UDA and UFDA settings, achieving significant performance gains over existing state-of-the-art cross-domain detection methods.

preprint2022arXiv

Learning Cross-Image Object Semantic Relation in Transformer for Few-Shot Fine-Grained Image Classification

Few-shot fine-grained learning aims to classify a query image into one of a set of support categories with fine-grained differences. Although learning different objects' local differences via Deep Neural Networks has achieved success, how to exploit the query-support cross-image object semantic relations in Transformer-based architecture remains under-explored in the few-shot fine-grained scenario. In this work, we propose a Transformer-based double-helix model, namely HelixFormer, to achieve the cross-image object semantic relation mining in a bidirectional and symmetrical manner. The HelixFormer consists of two steps: 1) Relation Mining Process (RMP) across different branches, and 2) Representation Enhancement Process (REP) within each individual branch. By the designed RMP, each branch can extract fine-grained object-level Cross-image Semantic Relation Maps (CSRMs) using information from the other branch, ensuring better cross-image interaction in semantically related local object regions. Further, with the aid of CSRMs, the developed REP can strengthen the extracted features for those discovered semantically-related local regions in each branch, boosting the model's ability to distinguish subtle feature differences of fine-grained objects. Extensive experiments conducted on five public fine-grained benchmarks demonstrate that HelixFormer can effectively enhance the cross-image object semantic relation matching for recognizing fine-grained objects, achieving much better performance over most state-of-the-art methods under 1-shot and 5-shot scenarios. Our code is available at: https://github.com/JiakangYuan/HelixFormer

preprint2022arXiv

Lifelong Dynamic Optimization for Self-Adaptive Systems: Fact or Fiction?

When faced with changing environment, highly configurable software systems need to dynamically search for promising adaptation plan that keeps the best possible performance, e.g., higher throughput or smaller latency -- a typical planning problem for self-adaptive systems (SASs). However, given the rugged and complex search landscape with multiple local optima, such a SAS planning is challenging especially in dynamic environments. In this paper, we propose LiDOS, a lifelong dynamic optimization framework for SAS planning. What makes LiDOS unique is that to handle the "dynamic", we formulate the SAS planning as a multi-modal optimization problem, aiming to preserve the useful information for better dealing with the local optima issue under dynamic environment changes. This differs from existing planners in that the "dynamic" is not explicitly handled during the search process in planning. As such, the search and planning in LiDOS run continuously over the lifetime of SAS, terminating only when it is taken offline or the search space has been covered under an environment. Experimental results on three real-world SASs show that the concept of explicitly handling dynamic as part of the search in the SAS planning is effective, as LiDOS outperforms its stationary counterpart overall with up to 10x improvement. It also achieves better results in general over state-of-the-art planners and with 1.4x to 10x speedup on generating promising adaptation plans.

preprint2022arXiv

Multi-Forgery Detection Challenge 2022: Push the Frontier of Unconstrained and Diverse Forgery Detection

In this paper, we present the Multi-Forgery Detection Challenge held concurrently with the IEEE Computer Society Workshop on Biometrics at CVPR 2022. Our Multi-Forgery Detection Challenge aims to detect automatic image manipulations including but not limited to image editing, image synthesis, image generation, image photoshop, etc. Our challenge has attracted 674 teams from all over the world, with about 2000 valid result submission counts. We invited the Top 10 teams to present their solutions to the challenge, from which three teams are awarded prizes in the grand finale. In this paper, we present the solutions from the Top 3 teams, in order to boost the research work in the field of image forgery detection.

preprint2022arXiv

Nonparametric Adaptive Bayesian Stochastic Control Under Model Uncertainty

In this paper we propose a new methodology for solving a discrete time stochastic Markovian control problem under model uncertainty. By utilizing the Dirichlet process, we model the unknown distribution of the underlying stochastic process as a random probability measure and achieve online learning in a Bayesian manner. Our approach integrates optimizing and dynamic learning. When dealing with model uncertainty, the nonparametric framework allows us to avoid model misspecification that usually occurs in other classical control methods. Then, we develop a numerical algorithm to handle the infinitely dimensional state space in this setup and utilizes Gaussian process surrogates to obtain a functional representation of the value function in the Bellman recursion. We also build separate surrogates for optimal control to eliminate repeated optimizations on out-of-sample paths and bring computational speed-ups. Finally, we demonstrate the financial advantages of the nonparametric Bayesian framework compared to parametric approaches such as strong robust and time consistent adaptive.

preprint2022arXiv

Nonparametric Adaptive Robust Control Under Model Uncertainty

We consider a discrete time stochastic Markovian control problem under model uncertainty. Such uncertainty not only comes from the fact that the true probability law of the underlying stochastic process is unknown, but the parametric family of probability distributions which the true law belongs to is also unknown. We propose a nonparametric adaptive robust control methodology to deal with such problem. Our approach hinges on the following building concepts: first, using the adaptive robust paradigm to incorporate online learning and uncertainty reduction into the robust control problem; second, learning the unknown probability law through the empirical distribution, and representing uncertainty reduction in terms of a sequence of Wasserstein balls around the empirical distribution; third, using Lagrangian duality to convert the optimization over Wasserstein balls to a scalar optimization problem, and adopting a machine learning technique to achieve efficient computation of the optimal control. We illustrate our methodology by considering a utility maximization problem. Numerical comparisons show that the nonparametric adaptive robust control approach is preferable to the traditional robust frameworks.

preprint2022arXiv

On Reporting Performance and Accuracy Bugs for Deep Learning Frameworks: An Exploratory Study from GitHub

The tremendous success of Deep Learning (DL) has significantly boosted the number of open-sourced DL frameworks hosted on GitHub. Among others, performance and accuracy bugs are critical factors that affect the reputation of these DL frameworks, therefore understanding the practice of discovering and investigating them for DL is important. In this paper, we conduct an exploratory study on the nature of reporting performance and accuracy bugs bugs for DL frameworks, aiming to improve our knowledge on this topic. Our study covers 10 most popular open-sourced DL frameworks on GitHub (e.g., TensorFlow, Keras, and PyTorch), based on which we sample 664 representative performance and accuracy bugs bug reports out of a total population of 22,522. Through systematic analysis of these samples, our key findings are: (1) low speed is the primary reason that a performance bug related report is submitted but we see no consistent pattern for accuracy related ones; (2) most of the reports are about issues encountered in the training stage; (3) only a small proportion of the reports provide insufficient information to investigate; (4) the majority of the performance and accuracy bugs bug reports (from 69% to 100%) are not related to the actual bug or regarded as unclassified; (5) around 50% of the performance and accuracy bug reports, which indeed reveal bugs, are not resolved by direct patches. Deriving from the above, we discuss a set of actionable implications to the researchers, maintainers, and report submitters on this subject. To promote open science, the labeled dataset has been made publicly available at https://tinyurl.com/4x3tap9w.

preprint2022arXiv

Planning Landscape Analysis for Self-Adaptive Systems

To assure performance on the fly, planning is arguably one of the most important steps for self-adaptive systems (SASs), especially when they are highly configurable with a daunting number of adaptation options. However, there has been little understanding of the planning landscape or ways by which it can be analyzed. This inevitably creates barriers to the design of better and tailored planners for SASs. In this paper, we showcase how the planning landscapes of SASs can be quantified and reasoned, particularly with respect to the different environments. By studying four diverse real-world SASs and 14 environments, we found that (1) the SAS planning landscapes often provide strong guidance to the planner, but their ruggedness and multi-modality can be the major obstacle; (2) the extents of guidance and number of global/local optima are sensitive to the changing environment, but not the ruggedness of the surface; (3) the local optima are often closer to the global optimum than other random points; and (4) there are considerable (and useful) overlaps on the global/local optima between landscapes under different environments. We then discuss the potential implications to the future work of planner designs for SASs.

preprint2022arXiv

Point Cloud Instance Segmentation with Semi-supervised Bounding-Box Mining

Point cloud instance segmentation has achieved huge progress with the emergence of deep learning. However, these methods are usually data-hungry with expensive and time-consuming dense point cloud annotations. To alleviate the annotation cost, unlabeled or weakly labeled data is still less explored in the task. In this paper, we introduce the first semi-supervised point cloud instance segmentation framework (SPIB) using both labeled and unlabelled bounding boxes as supervision. To be specific, our SPIB architecture involves a two-stage learning procedure. For stage one, a bounding box proposal generation network is trained under a semi-supervised setting with perturbation consistency regularization (SPCR). The regularization works by enforcing an invariance of the bounding box predictions over different perturbations applied to the input point clouds, to provide self-supervision for network learning. For stage two, the bounding box proposals with SPCR are grouped into some subsets, and the instance masks are mined inside each subset with a novel semantic propagation module and a property consistency graph module. Moreover, we introduce a novel occupancy ratio guided refinement module to refine the instance masks. Extensive experiments on the challenging ScanNet v2 dataset demonstrate our method can achieve competitive performance compared with the recent fully-supervised methods.

preprint2022arXiv

Rapid Locomotion via Reinforcement Learning

Agile maneuvers such as sprinting and high-speed turning in the wild are challenging for legged robots. We present an end-to-end learned controller that achieves record agility for the MIT Mini Cheetah, sustaining speeds up to 3.9 m/s. This system runs and turns fast on natural terrains like grass, ice, and gravel and responds robustly to disturbances. Our controller is a neural network trained in simulation via reinforcement learning and transferred to the real world. The two key components are (i) an adaptive curriculum on velocity commands and (ii) an online system identification strategy for sim-to-real transfer leveraged from prior work. Videos of the robot's behaviors are available at: https://agility.csail.mit.edu/

preprint2022arXiv

Rarefaction effects in head-on collision of two identical droplets

The head-on collision of two identical droplets is investigated based on the BGK-Boltzmann equation. Gauss-Hermite quadratures with different degree of precision are used to solve the kinetic equation, so that the continuum (solution truncated at the Navier-Stokes order) and non-continuum (rarefied gas dynamics) solutions can be compared. When the kinetic equation is solved with adequate accuracy, prominent variations of the vertical velocity (the collision is in the horizontal direction), the viscous stress components, and droplet morphology are observed during the formation of liquid bridge, which demonstrates the importance of the rarefaction effects and the failure of the Navier-Stokes equation. The rarefaction effects change the topology of streamlines near the droplet surface, suppress the high-magnitude vorticity concentration inside the interdroplet region, and promote the vorticity diffusion around outer droplet surface. Two physical mechanisms responsible for the local energy conversion between the free and kinetic energies are identified, namely, the total pressure-dilatation coupling effect and the interaction between the density gradient and strain rate tensor. An energy conversion analysis is performed to show that the rarefaction effects can enhance the conversion from free energy to kinetic energy and facilitate the discharge of interdroplet gas film along the vertical direction, thereby boosting droplet coalescence.

preprint2022arXiv

Rethinking Saliency Map: An Context-aware Perturbation Method to Explain EEG-based Deep Learning Model

Deep learning is widely used to decode the electroencephalogram (EEG) signal. However, there are few attempts to specifically investigate how to explain the EEG-based deep learning models. We conduct a review to summarize the existing works explaining the EEG-based deep learning model. Unfortunately, we find that there is no appropriate method to explain them. Based on the characteristic of EEG data, we suggest a context-aware perturbation method to generate a saliency map from the perspective of the raw EEG signal. Moreover, we also justify that the context information can be used to suppress the artifacts in the EEG-based deep learning model. In practice, some users might want a simple version of the explanation, which only indicates a few features as salient points. To this end, we propose an optional area limitation strategy to restrict the highlighted region. To validate our idea and make a comparison with the other methods, we select three representative EEG-based models to implement experiments on the emotional EEG dataset DEAP. The results of the experiments support the advantages of our method.

preprint2022arXiv

Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic Segmentation

Weakly supervised semantic segmentation with only image-level labels aims to reduce annotation costs for the segmentation task. Existing approaches generally leverage class activation maps (CAMs) to locate the object regions for pseudo label generation. However, CAMs can only discover the most discriminative parts of objects, thus leading to inferior pixel-level pseudo labels. To address this issue, we propose a saliency guided Inter- and Intra-Class Relation Constrained (I$^2$CRC) framework to assist the expansion of the activated object regions in CAMs. Specifically, we propose a saliency guided class-agnostic distance module to pull the intra-category features closer by aligning features to their class prototypes. Further, we propose a class-specific distance module to push the inter-class features apart and encourage the object region to have a higher activation than the background. Besides strengthening the capability of the classification network to activate more integral object regions in CAMs, we also introduce an object guided label refinement module to take a full use of both the segmentation prediction and the initial labels for obtaining superior pseudo-labels. Extensive experiments on PASCAL VOC 2012 and COCO datasets demonstrate well the effectiveness of I$^2$CRC over other state-of-the-art counterparts. The source codes, models, and data have been made available at \url{https://github.com/NUST-Machine-Intelligence-Laboratory/I2CRC}.

preprint2022arXiv

Study of a droplet breakup process in decaying homogeneous isotropic turbulence based on the phase-field DUGKS approach

The breakup of a spherical droplet in a decaying homogeneous isotropic turbulence is studied by solving the Cahn-Hilliard-Navier-Stokes equations, using the discrete unified gas kinetic scheme combined with the free-energy-based phase-field model. We focus on the combined effects of turbulence and surface tension on the breakup process by assuming that the two fluid phases have the same density and same viscosity. The key physical parameters of the system include the volume fraction (6.54%), the initial Weber number (21.7), and the initial Taylor microscale Reynolds number (58). Three distinct stages of droplet evolution are identified, namely, the deformation stage when the initially spherical droplet evolves into an irregular geometric shape with complex structures, the breakup stage when many daughter droplets are formed, and the restoration stage when the droplets relax towards spherical shape. These three stages are analyzed systematically from several perspectives: (1) a geometric perspective concerning the maximum equivalent diameter, the total number of droplets, total interface area, and probability distribution of droplet diameters, (2) a dynamic perspective concerning the evolution of local velocity and vorticity at the fluid-fluid interface, (3) a global perspective concerning the evolution of average kinetic energy / dissipation rate and their Fourier spectra, (4) spherical harmonics based energetics concerning simultaneous transfer of kinetic energy across different length scales and different radii relative to initial droplet center, and (5) the time evolution of global kinetic energy and free energy of the system.

preprint2022arXiv

The Weights can be Harmful: Pareto Search versus Weighted Search in Multi-Objective Search-Based Software Engineering

In presence of multiple objectives to be optimized in Search-Based Software Engineering (SBSE), Pareto search has been commonly adopted. It searches for a good approximation of the problem's Pareto optimal solutions, from which the stakeholders choose the most preferred solution according to their preferences. However, when clear preferences of the stakeholders (e.g., a set of weights which reflect relative importance between objectives) are available prior to the search, weighted search is believed to be the first choice since it simplifies the search via converting the original multi-objective problem into a single-objective one and enable the search to focus on what only the stakeholders are interested in. This paper questions such a "weighted search first" belief. We show that the weights can, in fact, be harmful to the search process even in the presence of clear preferences. Specifically, we conduct a large scale empirical study which consists of 38 systems/projects from three representative SBSE problems, together with two types of search budget and nine sets of weights, leading to 604 cases of comparisons. Our key finding is that weighted search reaches a certain level of solution quality by consuming relatively less resources at the early stage of the search; however, Pareto search is at the majority of the time (up to 77% of the cases) significantly better than its weighted counterpart, as long as we allow a sufficient, but not unrealistic search budget. This, together with other findings and actionable suggestions in the paper, allows us to codify pragmatic and comprehensive guidance on choosing weighted and Pareto search for SBSE under the circumstance that clear preferences are available. All code and data can be accessed at: https://github.com/ideas-labo/pareto-vs-weight-for-sbse.

preprint2022arXiv

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation

Although vision transformers (ViTs) have achieved great success in computer vision, the heavy computational cost hampers their applications to dense prediction tasks such as semantic segmentation on mobile devices. In this paper, we present a mobile-friendly architecture named \textbf{To}ken \textbf{P}yramid Vision Trans\textbf{former} (\textbf{TopFormer}). The proposed \textbf{TopFormer} takes Tokens from various scales as input to produce scale-aware semantic features, which are then injected into the corresponding tokens to augment the representation. Experimental results demonstrate that our method significantly outperforms CNN- and ViT-based networks across several semantic segmentation datasets and achieves a good trade-off between accuracy and latency. On the ADE20K dataset, TopFormer achieves 5\% higher accuracy in mIoU than MobileNetV3 with lower latency on an ARM-based mobile device. Furthermore, the tiny version of TopFormer achieves real-time inference on an ARM-based mobile device with competitive results. The code and models are available at: https://github.com/hustvl/TopFormer

preprint2022arXiv

Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal

As a common security tool, visible watermarking has been widely applied to protect copyrights of digital images. However, recent works have shown that visible watermarks can be removed by DNNs without damaging their host images. Such watermark-removal techniques pose a great threat to the ownership of images. Inspired by the vulnerability of DNNs on adversarial perturbations, we propose a novel defence mechanism by adversarial machine learning for good. From the perspective of the adversary, blind watermark-removal networks can be posed as our target models; then we actually optimize an imperceptible adversarial perturbation on the host images to proactively attack against watermark-removal networks, dubbed Watermark Vaccine. Specifically, two types of vaccines are proposed. Disrupting Watermark Vaccine (DWV) induces to ruin the host image along with watermark after passing through watermark-removal networks. In contrast, Inerasable Watermark Vaccine (IWV) works in another fashion of trying to keep the watermark not removed and still noticeable. Extensive experiments demonstrate the effectiveness of our DWV/IWV in preventing watermark removal, especially on various watermark removal networks.

preprint2021arXiv

A new class of differential quasivariational inequalities with an application to a quasistatic viscoelastic frictional contact problem

The overarching goal of this paper is to introduce and investigate a new nonlinear system driven by a nonlinear differential equation, a history-dependent quasivariational inequality, and a parabolic variational inequality in Banach spaces. Such a system can be used to model quasistatic frictional contact problems for viscoelastic materials with long memory, damage and wear. By using the Banach fixed point theorem, we prove an existence and uniqueness theorem of solution for such a system under some mild conditions. As a novel application, we obtain a unique solvability of a quasistatic viscoelastic frictional contact problem with long memory, damage and wear.

preprint2021arXiv

Error-Tolerant Geometric Quantum Control for Logical Qubits with Minimal Resource

Geometric quantum computation offers a practical strategy toward robust quantum computation due to its inherently error tolerance. However, the rigorous geometric conditions lead to complex and/or error-disturbed quantum controls, especially for logical qubits that involve more physical qubits, whose error tolerance is effective in principle though, their experimental demonstration is still demanding. Thus, how to best simplify the needed control and manifest its full advantage has become the key to widespread applications of geometric quantum computation. Here we propose a new fast and robust geometric scheme, with the decoherence-free-subspace encoding, and present its physical implementation on superconducting quantum circuits, where we only utilize the experimentally demonstrated parametrically tunable coupling to achieve high-fidelity geometric control over logical qubits. Numerical simulation verifies that it can efficiently combine the error tolerance from both the geometric phase and logical-qubit encoding, displaying our gate-performance superiority over the conventional dynamical one without encoding, in terms of both gate fidelity and robustness. Therefore, our scheme can consolidate both error suppression methods for logical-qubit control, which sheds light on the future large-scale quantum computation.

preprint2021arXiv

Neural Hierarchical Factorization Machines for User's Event Sequence Analysis

Many prediction tasks of real-world applications need to model multi-order feature interactions in user's event sequence for better detection performance. However, existing popular solutions usually suffer two key issues: 1) only focusing on feature interactions and failing to capture the sequence influence; 2) only focusing on sequence information, but ignoring internal feature relations of each event, thus failing to extract a better event representation. In this paper, we consider a two-level structure for capturing the hierarchical information over user's event sequence: 1) learning effective feature interactions based event representation; 2) modeling the sequence representation of user's historical events. Experimental results on both industrial and public datasets clearly demonstrate that our model achieves significantly better performance compared with state-of-the-art baselines.

preprint2021arXiv

Semantically Meaningful Class Prototype Learning for One-Shot Image Semantic Segmentation

One-shot semantic image segmentation aims to segment the object regions for the novel class with only one annotated image. Recent works adopt the episodic training strategy to mimic the expected situation at testing time. However, these existing approaches simulate the test conditions too strictly during the training process, and thus cannot make full use of the given label information. Besides, these approaches mainly focus on the foreground-background target class segmentation setting. They only utilize binary mask labels for training. In this paper, we propose to leverage the multi-class label information during the episodic training. It will encourage the network to generate more semantically meaningful features for each category. After integrating the target class cues into the query features, we then propose a pyramid feature fusion module to mine the fused features for the final classifier. Furthermore, to take more advantage of the support image-mask pair, we propose a self-prototype guidance branch to support image segmentation. It can constrain the network for generating more compact features and a robust prototype for each semantic class. For inference, we propose a fused prototype guidance branch for the segmentation of the query image. Specifically, we leverage the prediction of the query image to extract the pseudo-prototype and combine it with the initial prototype. Then we utilize the fused prototype to guide the final segmentation of the query image. Extensive experiments demonstrate the superiority of our proposed approach.

preprint2021arXiv

Slices of Parameter Space for Meromorphic Maps with Two Asymptotic Values

This paper is part of a program to understand the parameter spaces of dynamical systems generated by meromorphic functions with finitely many singular values. We give a full description of the parameter space for a specific family based on the exponential function that has precisely two finite asymptotic values and one attracting fixed point. It represents a step beyond the previous work in [GK] on degree 2 rational functions with analogous constraints: two critical values and an attracting fixed point. What is interesting and promising for pushing the general program even further, is that, despite the presence of the essential singularity, our new functions exhibit a dynamic structure as similar as one could hope to the rational case, and that the philosophy of the techniques used in the rational case could be adapted.

preprint2020arXiv

A Design of Cooperative Overtaking Based on Complex Lane Detection and Collision Risk Estimation

Cooperative overtaking is believed to have the capability of improving road safety and traffic efficiency by means of the real-time information exchange between traffic participants, including road infrastructures, nearby vehicles and others. In this paper, we focused on the critical issues of modeling, computation, and analysis of cooperative overtaking and made it playing a key role in the road overtaking area. In detail, for the purpose of extending the awareness of the surrounding environment, the lane markings in front of ego vehicle were detected and modeled with Bezier curve using an onboard camera. While the nearby vehicle positions were obtained through the vehicle-to-vehicle communication scheme making assure of the accuracy of localization. Then, Gaussian-based conflict potential field was proposed to guarantee the overtaking safety, which can quantitatively estimate the oncoming collision danger. To support the proposed method, many experiments were conducted on the human-in-the-loop simulation platform. The results demonstrated that our proposed method achieves better performance, especially in some unpredictable nature road circumstances.

preprint2020arXiv

A Machine Learning Approach to Adaptive Robust Utility Maximization and Hedging

We investigate the adaptive robust control framework for portfolio optimization and loss-based hedging under drift and volatility uncertainty. Adaptive robust problems offer many advantages but require handling a double optimization problem (infimum over market measures, supremum over the control) at each instance. Moreover, the underlying Bellman equations are intrinsically multi-dimensional. We propose a novel machine learning approach that solves for the local saddle-point at a chosen set of inputs and then uses a nonparametric (Gaussian process) regression to obtain a functional representation of the value function. Our algorithm resembles control randomization and regression Monte Carlo techniques but also brings multiple innovations, including adaptive experimental design, separate surrogates for optimal control and the local worst-case measure, and computational speed-ups for the sup-inf optimization. Thanks to the new scheme we are able to consider settings that have been previously computationally intractable and provide several new financial insights about learning and optimal trading under unknown market parameters. In particular, we demonstrate the financial advantages of adaptive robust framework compared to adaptive and static robust alternatives.

preprint2020arXiv

APPCorp: A Corpus for Android Privacy Policy Document Structure Analysis

With the increasing popularity of mobile devices and the wide adoption of mobile Apps, an increasing concern of privacy issues is raised. Privacy policy is identified as a proper medium to indicate the legal terms, such as GDPR, and to bind legal agreement between service providers and users. However, privacy policies are usually long and vague for end users to read and understand. It is thus important to be able to automatically analyze the document structures of privacy policies to assist user understanding. In this work we create a manually labelled corpus containing $167$ privacy policies (of more than $447$K words and $5,276$ annotated paragraphs). We report the annotation process and details of the annotated corpus. We also benchmark our data corpus with $4$ document classification models, thoroughly analyze the results and discuss challenges and opportunities for the research committee to use the corpus. We release our labelled corpus as well as the classification models for public access.

preprint2020arXiv

Beyond Adiabatic Elimination in Topological Floquet Engineering

In quantum mechanics, adiabatic elimination is a standard tool that produces a low-lying reduced Hamiltonian for a relevant subspace of states, incorporating effects of its coupling to states with much higher energy. Suppose this powerful elimination approach is applied to quasi-energy states in periodically-driven systems, a critical question then arises that the violation of the adiabatic condition caused by driven forces challenges such a presence of spectral reduction in the non-equilibrium driven system. Here, both theoretically and experimentally, we newly reported two kinds of driven-induced eliminations universal in topologically-protected Floquet systems. We named them "quasi-adiabatic elimination" and "high-frequency-limited elimination", in terms of different driven frequencies that deny the underlying requirement for the adiabatic condition. Both two non-adiabatic eliminations are observed in our recently developed microwave Floquet simulator, a programmable test platform composed of periodically-bending ultrathin metallic coupled corrugated waveguides. Through the near-field imaging on our simulator, the mechanisms between the adiabatic and driven-induced eliminations are revealed, indicating the ubiquitous spectral decomposition for tailoring and manipulating Floquet states with quasi-energies. Finally, we hope our findings may open up profound and applicable possibilities for further developing Floquet engineering in periodically-driven systems, ranging from condensed matter physics to photonics.

preprint2020arXiv

BiLO-CPDP: Bi-Level Programming for Automated Model Discovery in Cross-Project Defect Prediction

Cross-Project Defect Prediction (CPDP), which borrows data from similar projects by combining a transfer learner with a classifier, have emerged as a promising way to predict software defects when the available data about the target project is insufficient. How-ever, developing such a model is challenge because it is difficult to determine the right combination of transfer learner and classifier along with their optimal hyper-parameter settings. In this paper, we propose a tool, dubbedBiLO-CPDP, which is the first of its kind to formulate the automated CPDP model discovery from the perspective of bi-level programming. In particular, the bi-level programming proceeds the optimization with two nested levels in a hierarchical manner. Specifically, the upper-level optimization routine is designed to search for the right combination of transfer learner and classifier while the nested lower-level optimization routine aims to optimize the corresponding hyper-parameter settings.To evaluateBiLO-CPDP, we conduct experiments on 20 projects to compare it with a total of 21 existing CPDP techniques, along with its single-level optimization variant and Auto-Sklearn, a state-of-the-art automated machine learning tool. Empirical results show that BiLO-CPDP champions better prediction performance than all other 21 existing CPDP techniques on 70% of the projects, while be-ing overwhelmingly superior to Auto-Sklearn and its single-level optimization variant on all cases. Furthermore, the unique bi-level formalization inBiLO-CPDP also permits to allocate more budget to the upper-level, which significantly boosts the performance.

preprint2020arXiv

Cascade EF-GAN: Progressive Facial Expression Editing with Local Focuses

Recent advances in Generative Adversarial Nets (GANs) have shown remarkable improvements for facial expression editing. However, current methods are still prone to generate artifacts and blurs around expression-intensive regions, and often introduce undesired overlapping artifacts while handling large-gap expression transformations such as transformation from furious to laughing. To address these limitations, we propose Cascade Expression Focal GAN (Cascade EF-GAN), a novel network that performs progressive facial expression editing with local expression focuses. The introduction of the local focus enables the Cascade EF-GAN to better preserve identity-related features and details around eyes, noses and mouths, which further helps reduce artifacts and blurs within the generated facial images. In addition, an innovative cascade transformation strategy is designed by dividing a large facial expression transformation into multiple small ones in cascade, which helps suppress overlapping artifacts and produce more realistic editing while dealing with large-gap expression transformations. Extensive experiments over two publicly available facial expression datasets show that our proposed Cascade EF-GAN achieves superior performance for facial expression editing.

preprint2020arXiv

Context-Based Quotation Recommendation

While composing a new document, anything from a news article to an email or essay, authors often utilize direct quotes from a variety of sources. Although an author may know what point they would like to make, selecting an appropriate quote for the specific context may be time-consuming and difficult. We therefore propose a novel context-aware quote recommendation system which utilizes the content an author has already written to generate a ranked list of quotable paragraphs and spans of tokens from a given source document. We approach quote recommendation as a variant of open-domain question answering and adapt the state-of-the-art BERT-based methods from open-QA to our task. We conduct experiments on a collection of speech transcripts and associated news articles, evaluating models' paragraph ranking and span prediction performances. Our experiments confirm the strong performance of BERT-based methods on this task, which outperform bag-of-words and neural ranking baselines by more than 30% relative across all ranking metrics. Qualitative analyses show the difficulty of the paragraph and span recommendation tasks and confirm the quotability of the best BERT model's predictions, even if they are not the true selected quotes from the original news articles.

preprint2020arXiv

DATESSO: Self-Adapting Service Composition with Debt-Aware Two Levels Constraint Reasoning

The rapidly changing workload of service-based systems can easily cause under-/over-utilization on the component services, which can consequently affect the overall Quality of Service (QoS), such as latency. Self-adaptive services composition rectifies this problem, but poses several challenges: (i) the effectiveness of adaptation can deteriorate due to over-optimistic assumptions on the latency and utilization constraints, at both local and global levels; and (ii) the benefits brought by each composition plan is often short term and is not often designed for long-term benefits -- a natural prerequisite for sustaining the system. To tackle these issues, we propose a two levels constraint reasoning framework for sustainable self-adaptive services composition, called DATESSO. In particular, DATESSO consists of a re ned formulation that differentiates the "strictness" for latency/utilization constraints in two levels. To strive for long-term benefits, DATESSO leverages the concept of technical debt and time-series prediction to model the utility contribution of the component services in the composition. The approach embeds a debt-aware two level constraint reasoning algorithm in DATESSO to improve the efficiency, effectiveness and sustainability of self-adaptive service composition. We evaluate DATESSO on a service-based system with real-world WS-DREAM dataset and comparing it with other state-of-the-art approaches. The results demonstrate the superiority of DATESSO over the others on the utilization, latency and running time whilst likely to be more sustainable.

preprint2020arXiv

DeepSQLi: Deep Semantic Learning for Testing SQL Injection

Security is unarguably the most serious concern for Web applications, to which SQL injection (SQLi) attack is one of the most devastating attacks. Automatically testing SQLi vulnerabilities is of ultimate importance, yet is unfortunately far from trivial to implement. This is because the existence of a huge, or potentially infinite, number of variants and semantic possibilities of SQL leading to SQLi attacks on various Web applications. In this paper, we propose a deep natural language processing based tool, dubbed DeepSQLi, to generate test cases for detecting SQLi vulnerabilities. Through adopting deep learning based neural language model and sequence of words prediction, DeepSQLi is equipped with the ability to learn the semantic knowledge embedded in SQLi attacks, allowing it to translate user inputs (or a test case) into a new test case, which is semantically related and potentially more sophisticated. Experiments are conducted to compare DeepSQLi with SQLmap, a state-of-the-art SQLi testing automation tool, on six real-world Web applications that are of different scales, characteristics and domains. Empirical results demonstrate the effectiveness and the remarkable superiority of DeepSQLi over SQLmap, such that more SQLi vulnerabilities can be identified by using a less number of test cases, whilst running much faster.

preprint2020arXiv

Do not forget interaction: Predicting fatality of COVID-19 patients using logistic regression

Amid the ongoing COVID-19 pandemic, whether COVID-19 patients with high risks can be recovered or not depends, to a large extent, on how early they will be treated appropriately before irreversible consequences are caused to the patients by the virus. In this research, we reported an explainable, intuitive, and accurate machine learning model based on logistic regression to predict the fatality rate of COVID-19 patients using only three important blood biomarkers, including lactic dehydrogenase, lymphocyte (%) and high-sensitivity C-reactive protein, and their interactions. We found that when the fatality probability produced by the logistic regression model was over 0.8, the model had the optimal performance in that it was able to predict patient fatalities more than 11.30 days on average with maximally 34.91 days in advance, an accumulative f1-score of 93.76% and and an accumulative accuracy score of 93.92%. Such a model can be used to identify COVID-19 patients with high risks with three blood biomarkers and help the medical systems around the world plan critical medical resources amid this pandemic.

preprint2020arXiv

Experimental implementation of universal nonadiabatic geometric quantum gates in a superconducting circuit

Using geometric phases to realize noise-resilient quantum computing is an important method to enhance the control fidelity. In this work, we experimentally realize a universal nonadiabatic geometric quantum gate set in a superconducting qubit chain. We characterize the realized single- and two-qubit geometric gates with both quantum process tomography and randomized benchmarking methods. The measured average fidelities for the single-qubit rotation gates and two-qubit controlled-Z gate are 0.9977(1) and 0.977(9), respectively. Besides, we also experimentally demonstrate the noise-resilient feature of the realized single-qubit geometric gates by comparing their performance with the conventional dynamical gates with different types of errors in the control field. Thus, our experiment proves a way to achieve high-fidelity geometric quantum gates for robust quantum computation.

preprint2020arXiv

Fast holonomic quantum computation on superconducting circuits with optimal control

Geometric phases induced in quantum evolutions have built-in noise-resilient characters, and thus can find applications in many robust quantum manipulation tasks. Here, we propose a feasible and fast scheme for universal quantum computation on superconducting circuits with nonadiabatic non-Abelian geometric phases, using resonant interaction of three-level quantum system. In our scheme, arbitrary single-qubit quantum gates can be implemented in a single-loop scenario by shaping both the amplitudes and phases of the two driving microwave fields resonantly coupled to a transmon device. Moreover, nontrivial two-qubit gates can also be realized with an auxiliary transmon simultaneously coupled to the two target transmons in an effective resonant way. In particular, our proposal can be compatible to various optimal control techniques, which further enhances the robustness of the quantum operations. Therefore, our proposal represents a promising way towards fault-tolerant quantum computation on solid-state quantum circuits.

preprint2020arXiv

High-fidelity geometric gate for silicon-based spin qubits

High-fidelity manipulation is the key for the physical realization of fault-tolerant quantum computation. Here, we present a protocol to realize universal nonadiabatic geometric gates for silicon-based spin qubits. We find that the advantage of geometric gates over dynamical gates depends crucially on the evolution loop for the construction of the geometric phase. Under appropriate evolution loops, both the geometric single-qubit gates and the CNOT gate can outperform their dynamical counterparts for both systematic and detuning noises. We also perform randomized benchmarking using noise amplitudes consistent with experiments in silicon. For the static noise model, the averaged fidelities of geometric gates are around 99.90\% or above, while for the time-dependent $1/f$-type noise, the fidelities are around 99.98\% when only the detuning noise is present. We also show that the improvement in fidelities of the geometric gates over dynamical ones typically increases with the exponent $α$ of the $1/f$ noise, and the ratio can be as high as 4 when $α\approx 3$. Our results suggest that geometric gates with judiciously chosen evolution loops can be a powerful way to realize high-fidelity quantum gates.

preprint2020arXiv

Information Freshness-Aware Task Offloading in Air-Ground Integrated Edge Computing Systems

This paper studies the problem of information freshness-aware task offloading in an air-ground integrated multi-access edge computing system, which is deployed by an infrastructure provider (InP). A third-party real-time application service provider provides computing services to the subscribed mobile users (MUs) with the limited communication and computation resources from the InP based on a long-term business agreement. Due to the dynamic characteristics, the interactions among the MUs are modelled by a non-cooperative stochastic game, in which the control policies are coupled and each MU aims to selfishly maximize its own expected long-term payoff. To address the Nash equilibrium solutions, we propose that each MU behaves in accordance with the local system states and conjectures, based on which the stochastic game is transformed into a single-agent Markov decision process. Moreover, we derive a novel online deep reinforcement learning (RL) scheme that adopts two separate double deep Q-networks for each MU to approximate the Q-factor and the post-decision Q-factor. Using the proposed deep RL scheme, each MU in the system is able to make decisions without a priori statistical knowledge of dynamics. Numerical experiments examine the potentials of the proposed scheme in balancing the age of information and the energy consumption.

preprint2020arXiv

Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition

Affective computing and cognitive theory are widely used in modern human-computer interaction scenarios. Human faces, as the most prominent and easily accessible features, have attracted great attention from researchers. Since humans have rich emotions and developed musculature, there exist a lot of fine-grained expressions in real-world applications. However, it is extremely time-consuming to collect and annotate a large number of facial images, of which may even require psychologists to correctly categorize them. To the best of our knowledge, the existing expression datasets are only limited to several basic facial expressions, which are not sufficient to support our ambitions in developing successful human-computer interaction systems. To this end, a novel Fine-grained Facial Expression Database - F2ED is contributed in this paper, and it includes more than 200k images with 54 facial expressions from 119 persons. Considering the phenomenon of uneven data distribution and lack of samples is common in real-world scenarios, we further evaluate several tasks of few-shot expression learning by virtue of our F2ED, which are to recognize the facial expressions given only few training instances. These tasks mimic human performance to learn robust and general representation from few examples. To address such few-shot tasks, we propose a unified task-driven framework - Compositional Generative Adversarial Network (Comp-GAN) learning to synthesize facial images and thus augmenting the instances of few-shot expression classes. Extensive experiments are conducted on F2ED and existing facial expression datasets, i.e., JAFFE and FER2013, to validate the efficacy of our F2ED in pre-training facial expression recognition network and the effectiveness of our proposed approach Comp-GAN to improve the performance of few-shot recognition tasks.

preprint2020arXiv

Nonadiabatic geometric quantum computation with optimal control on superconducting circuits

Quantum gates, which are the essential building blocks of quantum computers, are very fragile. Thus, to realize robust quantum gates with high fidelity is the ultimate goal of quantum manipulation. Here, we propose a nonadiabatic geometric quantum computation scheme on superconducting circuits to engineer arbitrary quantum gates, which share both the robust merit of geometric phases and the capacity to combine with optimal control technique to further enhance the gate robustness. Specifically, in our proposal, arbitrary geometric single-qubit gates can be realized on a transmon qubit, by a resonant microwave field driving, with both the amplitude and phase of the driving being time-dependent. Meanwhile, nontrivial two-qubit geometric gates can be implemented by two capacitively coupled transmon qubits, with one of the transmon qubits' frequency being modulated to obtain effective resonant coupling between them. Therefore, our scheme provides a promising step towards fault-tolerant solid-state quantum computation.

preprint2020arXiv

On Carbon Nanotubes in the Interstellar Medium

Since their discovery in 1991, carbon nanotubes (CNTs) -- a novel one-dimensional carbon allotrope -- have attracted considerable interest worldwide because of their potential technological applications such as electric and optical devices. In the astrophysical context, CNTs may be present in the interstellar space since many of the other allotropes of carbon (e.g., amorphous carbon, fullerenes, nanodiamonds, graphite, polycyclic aromatic hydrocarbons, and possibly graphene as well) are known to be widespread in the Universe, as revealed by presolar grains in carbonaceous primitive meteorites and/or by their fingerprint spectral features in astronomical spectra. In addition, there are also experimental and theoretical pathways to the formation of CNTs in the interstellar medium (ISM). In this work, we examine their possible presence in the ISM by comparing the observed interstellar extinction curve with the ultraviolet/optical absorption spectra experimentally obtained for single-walled CNTs of a wide range of diameters and chiralities. Based on the absence in the interstellar extinction curve of the ~4.5 and 5.25 eV $π$-plasmon absorption bands which are pronounced in the experimental spectra of CNTs, we place an upper limit of ~10 ppm of C/H (i.e., ~4% of the total interstellar C) on the interstellar CNT abundance.

preprint2020arXiv

PIDNet: An Efficient Network for Dynamic Pedestrian Intrusion Detection

Vision-based dynamic pedestrian intrusion detection (PID), judging whether pedestrians intrude an area-of-interest (AoI) by a moving camera, is an important task in mobile surveillance. The dynamically changing AoIs and a number of pedestrians in video frames increase the difficulty and computational complexity of determining whether pedestrians intrude the AoI, which makes previous algorithms incapable of this task. In this paper, we propose a novel and efficient multi-task deep neural network, PIDNet, to solve this problem. PIDNet is mainly designed by considering two factors: accurately segmenting the dynamically changing AoIs from a video frame captured by the moving camera and quickly detecting pedestrians from the generated AoI-contained areas. Three efficient network designs are proposed and incorporated into PIDNet to reduce the computational complexity: 1) a special PID task backbone for feature sharing, 2) a feature cropping module for feature cropping, and 3) a lighter detection branch network for feature compression. In addition, considering there are no public datasets and benchmarks in this field, we establish a benchmark dataset to evaluate the proposed network and give the corresponding evaluation metrics for the first time. Experimental results show that PIDNet can achieve 67.1% PID accuracy and 9.6 fps inference speed on the proposed dataset, which serves as a good baseline for the future vision-based dynamic PID study.

preprint2020arXiv

Robust and Fast Holonomic Quantum Gates with Encoding on Superconducting Circuits

High-fidelity and robust quantum manipulation is the key for scalable quantum computation. Therefore, due to the intrinsic operational robustness, quantum manipulation induced by geometric phases is one of the promising candidates. However, the longer gate time for geometric operations and more physical-implementation difficulties hinder its practical and wide applications. Here, we propose a simplified implementation of universal holonomic quantum gates on superconducting circuits with experimentally demonstrated techniques, which can remove the two main challenges by introducing the time-optimal control into the construction of quantum gates. Remarkably, our scheme is also based on a decoherence-free subspace encoding, with minimal physical qubit resource, which can further immune to error caused by qubit-frequency drift, which is regarded as the main error source for large scale superconducting circuits. Meanwhile, we deliberately design the quantum evolution to eliminate gate error caused by unwanted leakage sources. Therefore, our scheme is more robust than the conventional ones, and thus provides a promising alternative strategy for scalable fault-tolerant quantum computation.

preprint2020arXiv

Search-Based Software Engineering for Self-Adaptive Systems: Survey, Disappointments, Suggestions and Opportunities

Search-Based Software Engineering (SBSE) is a promising paradigm that exploits the computational search to optimize different processes when engineering complex software systems. Self-adaptive system (SAS) is one category of such complex systems that permits to optimize different functional and non-functional objectives/criteria under changing environments (e.g., requirements and workload), which involves problems that are subject to search. In this regard, over years, there has been a considerable amount of work that investigates SBSE for SASs. In this paper, we provide the first systematic and comprehensive survey exclusively on SBSE for SASs, covering papers in 27 venues from 7 repositories, which eventually leads to several key statistics from the most notable 74 primary studies in this particular field of research. Our results, surprisingly, have revealed five disappointments that are of utmost importance and can result in serve consequences but have been overwhelmingly ignored in existing studies. We provide theoretical and/or experimental evidence to justify our arguments against the disappointments, present suggestions, and highlight the promising research opportunities towards their mitigation. We also elaborate on three other emergent, but currently under-explored opportunities for future work on SBSE for SASs. By mitigating the disappointments revealed in this work, together with the highlighted opportunities, we hope to be able to excite a much more significant growth in this particular research direction.

preprint2020arXiv

Synergizing Domain Expertise with Self-Awareness in Software Systems: A Patternized Architecture Guideline

To promote engineering self-aware and self-adaptive software systems in a reusable manner, architectural patterns and the related methodology provide an unified solution to handle the recurring problems in the engineering process. However, in existing patterns and methods, domain knowledge and engineers' expertise that is built over time are not explicitly linked to the self-aware processes. This linkage is important, as the knowledge is a valuable asset for the related problems and its absence would cause unnecessary overhead, possibly misleading results and unwise waste of the tremendous benefit that could have been brought by the domain expertise. This paper highlights the importance of synergizing domain expertise and the self-awareness to enable better self-adaptation in software systems, relying on well-defined expertise representation, algorithms and techniques. In particular, we present a holistic framework of notions, enriched patterns and methodology, dubbed DBASES, that offers a principled guideline for the engineers to perform difficulty and benefit analysis on possible synergies, in an attempt to keep "engineers-in-the-loop". Through three tutorial case studies, we demonstrate how DBASES can be applied in different domains, within which a carefully selected set of candidates with different synergies can be used for quantitative investigation, providing more informed decisions of the design choices.

preprint2020arXiv

TexSmart: A Text Understanding System for Fine-Grained NER and Enhanced Semantic Analysis

This technique report introduces TexSmart, a text understanding system that supports fine-grained named entity recognition (NER) and enhanced semantic analysis functionalities. Compared to most previous publicly available text understanding systems and tools, TexSmart holds some unique features. First, the NER function of TexSmart supports over 1,000 entity types, while most other public tools typically support several to (at most) dozens of entity types. Second, TexSmart introduces new semantic analysis functions like semantic expansion and deep semantic representation, that are absent in most previous systems. Third, a spectrum of algorithms (from very fast algorithms to those that are relatively slow but more accurate) are implemented for one function in TexSmart, to fulfill the requirements of different academic and industrial applications. The adoption of unsupervised or weakly-supervised algorithms is especially emphasized, with the goal of easily updating our models to include fresh data with less human annotation efforts. The main contents of this report include major functions of TexSmart, algorithms for achieving these functions, how to use the TexSmart toolkit and Web APIs, and evaluation results of some key algorithms.

preprint2020arXiv

The Adaptability and Challenges of Autonomous Vehicles to Pedestrians in Urban China

China is the world's largest automotive market and is ambitious for autonomous vehicles (AVs) development. As one of the key goals of AVs, pedestrian safety is an important issue in China. Despite the rapid development of driverless technologies in recent years, there is a lack of researches on the adaptability of AVs to pedestrians. To fill the gap, this study would discuss the adaptability of current driverless technologies to China urban pedestrians by reviewing the latest researches. The paper firstly analyzed typical Chinese pedestrian behaviors and summarized the safety demands of pedestrians for AVs through articles and open database data, which are worked as the evaluation criteria. Then, corresponding driverless technologies are carefully reviewed. Finally, the adaptability would be given combining the above analyses. Our review found that autonomous vehicles have trouble in the occluded pedestrian environment and Chinese pedestrians do not accept AVs well. And more explorations should be conducted on standard human-machine interaction, interaction information overload avoidance, occluded pedestrians detection and nation-based receptivity research. The conclusions are very useful for motor corporations and driverless car researchers to place more attention on the complexity of the Chinese pedestrian environment, for transportation experts to protect pedestrian safety in the context of AVs, and for governors to think about making new pedestrians policies to welcome the upcoming driverless cars.

preprint2020arXiv

Time-inconsistent Markovian control problems under model uncertainty with application to the mean-variance portfolio selection

In this paper we study a class of time-inconsistent terminal Markovian control problems in discrete time subject to model uncertainty. We combine the concept of the sub-game perfect strategies with the adaptive robust stochastic to tackle the theoretical aspects of the considered stochastic control problem. Consequently, as an important application of the theoretical results, by applying a machine learning algorithm we solve numerically the mean-variance portfolio selection problem under the model uncertainty.

preprint2020arXiv

Topological quantum walks in momentum space with a Bose-Einstein condensate

We report the experimental implementation of discrete-time topological quantum walks of a Bose-Einstein condensate in momentum space. Introducing stroboscopic driving sequences to the generation of a momentum lattice, we show that the dynamics of atoms along the lattice is effectively governed by a periodically driven Su-Schrieffer-Heeger model, which is equivalent to a discrete-time topological quantum walk. We directly measure the underlying topological invariants through time-averaged mean chiral displacements, which are consistent with our experimental observation of topological phase transitions. We then observe interaction-induced localization in the quantum-walk dynamics, where atoms tend to populate a single momentum-lattice site under interactions that are non-local in momentum space. Our experiment opens up the avenue of investigating discrete-time topological quantum walks using cold atoms, where the many-body environment and tunable interactions offer exciting new possibilities.

preprint2020arXiv

Tunable non-reciprocal quantum transport through a dissipative Aharonov-Bohm ring in ultracold atoms

We report the experimental observation of tunable, non-reciprocal quantum transport of a Bose-Einstein condensate in a momentum lattice. By implementing a dissipative Aharonov-Bohm (AB) ring in momentum space and sending atoms through it, we demonstrate a directional atom flow by measuring the momentum distribution of the condensate at different times. While the dissipative AB ring is characterized by the synthetic magnetic flux through the ring and the laser-induced loss on it, both the propagation direction and transport rate of the atom flow sensitively depend on these highly tunable parameters. We demonstrate that the non-reciprocity originates from the interplay of the synthetic magnetic flux and the laser-induced loss, which simultaneously breaks the inversion and the time-reversal symmetries. Our results open up the avenue for investigating non-reciprocal dynamics in cold atoms, and highlight the dissipative AB ring as a flexible building element for applications in quantum simulation and quantum information.

preprint2020arXiv

Understanding the Automated Parameter Optimization on Transfer Learning for CPDP: An Empirical Study

Data-driven defect prediction has become increasingly important in software engineering process. Since it is not uncommon that data from a software project is insufficient for training a reliable defect prediction model, transfer learning that borrows data/knowledge from other projects to facilitate the model building at the current project, namely cross-project defect prediction (CPDP), is naturally plausible. Most CPDP techniques involve two major steps, i.e., transfer learning and classification, each of which has at least one parameter to be tuned to achieve their optimal performance. This practice fits well with the purpose of automated parameter optimization. However, there is a lack of thorough understanding about what are the impacts of automated parameter optimization on various CPDP techniques. In this paper, we present the first empirical study that looks into such impacts on 62 CPDP techniques, 13 of which are chosen from the existing CPDP literature while the other 49 ones have not been explored before. We build defect prediction models over 20 real-world software projects that are of different scales and characteristics. Our findings demonstrate that: (1) Automated parameter optimization substantially improves the defect prediction performance of 77\% CPDP techniques with a manageable computational cost. Thus more efforts on this aspect are required in future CPDP studies. (2) Transfer learning is of ultimate importance in CPDP. Given a tight computational budget, it is more cost-effective to focus on optimizing the parameter configuration of transfer learning algorithms (3) The research on CPDP is far from mature where it is "not difficult" to find a better alternative by making a combination of existing transfer learning and classification techniques. This finding provides important insights about the future design of CPDP techniques.

preprint2019arXiv

Fragmentation and isomerization of polycyclic aromatic hydrocarbons in the interstellar medium: coronene as a case study

Aims. Due to the limitations of current computational technology, the fragmentation and isomerization products of vibrationally-excited polycyclic aromatic hydrocarbon (PAH) molecules and their derivatives are poorly studied. In this work, we investigate the intermediate products of PAHs and their derivatives as well as the gas-phase reactions relevant to the interstellar medium, with coronene as a case study. Methods. Based on the semi-empirical method of PM3 as implemented in the CP2K program, molecular dynamics simulations are performed to model the major processes (e.g., vibrations, fragmentations, and isomerizations) of coronene and its derivatives (e.g., methylated coronene, hydrogenated coronene, dehydrogenated coronene, nitrogen-substituted coronene, and oxygen-substituted coronene) at temperatures of 3000 K and 4000 K. Results. We find that the anharmonic effects are crucial for the simulation of vibrational excitation. For the molecules studied here, H2, CO, HCN, and CH2 are the major fragments. Following the dissociation of these small units, most of the molecules could maintain their ring structures, but a few molecules would break completely into carbon chains. The transformation from hexagon to pentagon or heptagon may occur and the heteroatomic substitutions (e.g., N- or O-substitutions) facilitate the transformation.

preprint2016arXiv

Bounded Geometry and Characterization of Some Transcendental Entire and Meromorphic Maps

We define two classes of topological infinite degree covering maps modeled on two families of transcendental holomorphic maps. The first, which we call exponential maps of type $(p,q)$, are branched covers and is modeled on transcendental entire maps of the form $P e^{Q}$, where $P$ and $Q$ are polynomials of degrees $p$ and $q$. The second is the class of universal covering maps from the plane to the sphere with two removed points modeled on transcendental meromorphic maps with two asymptotic values. The problem we address is to give a combinatorial characterization of the holomorphic maps contained in these classes whose post-singular sets are finite. The main results in this paper are that a post-singularly finite topological exponential map of type $(0,1)$ or a certain post-singularly finite topological exponential map of type $(p,1)$ or a post-singularly finite universal covering map from the plane to the sphere with two points removed is combinatorially equivalent to a holomorphic same type map if and only if this map has bounded geometry.

preprint2016arXiv

Bridging Ecology and Cloud: Transposing Ecological Prespective to Enable Better Cloud Autoscaling

Elastic autoscaling is the fundamental mechanism that enables the cloud-based services to continually evolve themselves - through changing the related software configurations and hardware resource provisions - under time-varying workloads. However, given the increasingly complex dynamic, uncertainty and trade-offs related to the runtime QoS and cost/energy of services, cloud autoscaling system is becoming one of the most complex artifacts constructed by human and thus its effectiveness is difficult to be preserved. In this article, we present novel ideas for facilitating cloud autoscaling. Our hypothesis that cloud ecosystem, represented by a collection of cloud-based services, bears many similarities with the natural ecosystem. As such, we in- tend to investigate how ecological view can be adopted to better explain how the cloud-based services evolve, and to explore what are the key factors that drive stable and sustainable cloud-based services in the cloud. To achieve this goal, we aim to transpose ecological principles, theories and models into cloud autoscaling analogues and spontaneously improve long-term stability and sustainability of cloud ecosystem.

preprint2016arXiv

Determining layer number of two dimensional flakes of transition-metal dichalcogenides by the Raman intensity from substrate

Transition-metal dichalcogenide (TMD) semiconductors have been widely studied due to their distinctive electronic and optical properties. The property of TMD flakes is a function of its thickness, or layer number (N). How to determine N of ultrathin TMDs materials is of primary importance for fundamental study and practical applications. Raman mode intensity from substrates has been used to identify N of intrinsic and defective multilayer graphenes up to N=100. However, such analysis is not applicable for ultrathin TMD flakes due to the lack of a unified complex refractive index ($\tilde{n}$) from monolayer to bulk TMDs. Here, we discuss the N identification of TMD flakes on the SiO$_2$/Si substrate by the intensity ratio between the Si peak from 100-nm (or 89-nm) SiO$_2$/Si substrates underneath TMD flakes and that from bare SiO$_2$/Si substrates. We assume the real part of $\tilde{n}$ of TMD flakes as that of monolayer TMD and treat the imaginary part of $\tilde{n}$ as a fitting parameter to fit the experimental intensity ratio. An empirical $\tilde{n}$, namely, $\tilde{n}_{eff}$, of ultrathin MoS$_{2}$, WS$_{2}$ and WSe$_{2}$ flakes from monolayer to multilayer is obtained for typical laser excitations (2.54 eV, 2.34 eV, or 2.09 eV). The fitted $\tilde{n}_{eff}$ of MoS$_{2}$ has been used to identify N of MoS$_{2}$ flakes deposited on 302-nm SiO$_2$/Si substrate, which agrees well with that determined from their shear and layer-breathing modes. This technique by measuring Raman intensity from the substrate can be extended to identify N of ultrathin 2D flakes with N-dependent $\tilde{n}$ . For the application purpose, the intensity ratio excited by specific laser excitations has been provided for MoS$_{2}$, WS$_{2}$ and WSe$_{2}$ flakes and multilayer graphene flakes deposited on Si substrates covered by 80-110 nm or 280-310 nm SiO$_2$ layer.

preprint2016arXiv

Exact solution of the pairing problem for spherical and deformed systems

There has been increasing interest in studying the Richardson model from which one can derive the exact solution for certain pairing Hamiltonians. However, it is still a numerical challenge to solve the nonlinear equations involved. In this paper we tackle this problem by employing a simple hybrid polynomial approach. The method is found to be robust and is valid for both deformed and nearly spherical nuclei. It also provides important and convenient initial guesses for spherical systems with large degeneracy. As an example, we apply the method to study the shape coexistence in neutron-rich Ni isotopes.

preprint2016arXiv

Multilingual Visual Sentiment Concept Matching

The impact of culture in visual emotion perception has recently captured the attention of multimedia research. In this study, we pro- vide powerful computational linguistics tools to explore, retrieve and browse a dataset of 16K multilingual affective visual concepts and 7.3M Flickr images. First, we design an effective crowdsourc- ing experiment to collect human judgements of sentiment connected to the visual concepts. We then use word embeddings to repre- sent these concepts in a low dimensional vector space, allowing us to expand the meaning around concepts, and thus enabling insight about commonalities and differences among different languages. We compare a variety of concept representations through a novel evaluation task based on the notion of visual semantic relatedness. Based on these representations, we design clustering schemes to group multilingual visual concepts, and evaluate them with novel metrics based on the crowdsourced sentiment annotations as well as visual semantic relatedness. The proposed clustering framework enables us to analyze the full multilingual dataset in-depth and also show an application on a facial data subset, exploring cultural in- sights of portrait-related affective visual concepts.

preprint2016arXiv

Self-Adaptive Trade-off Decision Making for Autoscaling Cloud-Based Services

Elasticity in the cloud is often achieved by on-demand autoscaling. In such context, the goal is to optimize the Quality of Service (QoS) and cost objectives for the cloud-based services. However, the difficulty lies in the facts that these objectives, e.g., throughput and cost, can be naturally conflicted, and the QoS of cloud-based services often interfere due to the shared infrastructure in cloud. Consequently, dynamic and effective trade-off decision making of autoscaling in the cloud is necessary, yet challenging. In particular, it is even harder to achieve well-compromised trade-offs, where the decision largely improves the majority of the objectives, while causing relatively small degradations to others. In this paper, we present a self-adaptive decision making approach for autoscaling in the cloud. It is capable to adaptively produce autoscaling decisions that lead to well-compromised trade-offs without heavy human intervention. We leverage on ant colony inspired multi-objective optimization for searching and optimizing the trade-offs decisions, the result is then filtered by compromise-dominance, a mechanism that extracts the decisions with balanced improvements in the trade-offs. We experimentally compare our approach to four state-of-the-arts autoscaling approaches: rule, heuristic, randomized and multi-objective genetic algorithm based solutions. The results reveal the effectiveness of our approach over the others, including better quality of trade-offs and significantly smaller violation of the requirements.

preprint2016arXiv

Self-Aware and Self-Adaptive Autoscaling for Cloud Based Services

Modern Internet services are increasingly leveraging on cloud computing for flexible, elastic and on-demand provision. Typically, Quality of Service (QoS) of cloud-based services can be tuned using different underlying cloud configurations and resources, e.g., number of threads, CPU and memory etc., which are shared, leased and priced as utilities. This benefit is fundamentally grounded by autoscaling: an automatic and elastic process that adapts cloud configurations on-demand according to time-varying workloads. This thesis proposes a holistic cloud autoscaling framework to effectively and seamlessly address existing challenges related to different logical aspects of autoscaling, including architecting autoscaling system, modelling the QoS of cloud-based service, determining the granularity of control and deciding trade-off autoscaling decisions. The framework takes advantages of the principles of self-awareness and the related algorithms to adaptively handle the dynamics, uncertainties, QoS interference and trade-offs on objectives that are exhibited in the cloud. The major benefit is that, by leveraging the framework, cloud autoscaling can be effectively achieved without heavy human analysis and design time knowledge. Through conducting various experiments using RUBiS benchmark and realistic workload on real cloud setting, this thesis evaluates the effectiveness of the framework based on various quality indicators and compared with other state-of-the-art approaches.

preprint2016arXiv

Structures, Branching Ratios and Laser Cooling Scheme for 138BaF Molecule

For laser cooling considerations, we have theoretically investigated the electronic, rovibrational and hypefine structures of BaF molecule. The highly diagonal Franck-Condon factors and the branching ratios for all possible transitions within the lowest-lying four electronic states have also been calculated. Meanwhile, the mixing between metastable A'2Δ and A2Π states and further the lifetime of the Δ state have been estimated since the loss procedure via Δ state would like fatally destroy the main quasi-cycling Σ-Π transition for cooling and trapping. The resultant hyperfine splittings of each rovibrational states in X2Σ+ state provide benchmarksfor sideband modulations of cooling and repumping lasers and remixing microwaves to address all necessary levels. The calculated Zeeman shift and g-factors for both X and A states serve as benchmarks for selections of the trapping laser polarizations. Our study paves the way for future laser cooling and magneto-optical trapping of the BaF molecule.

preprint2015arXiv

#mytweet via Instagram: Exploring User Behaviour across Multiple Social Networks

We study how users of multiple online social networks (OSNs) employ and share information by studying a common user pool that use six OSNs - Flickr, Google+, Instagram, Tumblr, Twitter, and YouTube. We analyze the temporal and topical signature of users' sharing behaviour, showing how they exhibit distinct behaviorial patterns on different networks. We also examine cross-sharing (i.e., the act of user broadcasting their activity to multiple OSNs near-simultaneously), a previously-unstudied behaviour and demonstrate how certain OSNs play the roles of originating source and destination sinks.

preprint2015arXiv

Failure of hydrogenation in protecting polycyclic aromatic hydrocarbons from fragmentation

A recent study of soft X-ray absorption in native and hydrogenated coronene cations, C$_{24}$H$_{12+m}^+$ $m=0-7$, led to the conclusion that additional hydrogen atoms protect (interstellar) Polycyclic Aromatic Hydrocarbon (PAH) molecules from fragmentation [Reitsma et al., Phys. Rev. Lett. 113, 053002 (2014)]. The present experiment with collisions between fast (30-200 eV) He atoms and pyrene (C$_{16}$H$_{10+m}^+$, $m=0$, 6, and 16) and simulations without reference to the excitation method suggests the opposite. We find that the absolute carbon-backbone fragmentation cross section does not decrease but increases with the degree of hydrogenation for pyrene molecules.

preprint2015arXiv

Mechanism for femtosecond laser-induced periodic subwavelength structures on solid surface: surface two-plasmon resonance

We present that surface two-plasmon resonance (STPR) in electron plasma sheet produced by femtosecond laser irradiating metal surface is the self-formation mechanism of periodic subwavelength ripple structures. Peaks of overdense electrons formed by resonant two-plasmon wave pull bound ions out of the metal surface and thus the wave pattern of STPR is "carved" on the surface by Coulomb ablation (removal) resulting from the strong electrostatic field induced by charge separation. To confirm the STPR model, we have performed analogical carving experiments by two laser beams with perpendicular polarizations. The results explicitly show that two wave patterns of STPR are independently carved on the exposure area of target surface. The time-scale of ablation dynamics and the electron temperature in ultrafast interaction are also verified by time-resolved spectroscopy experiment and numerical simulation, respectively. The present model can self-consistently explain the formation of subwavelength ripple structures even with spatial periods shorter than half of the laser wavelength, shedding light on the understanding of ultrafast laser-solid interaction.

preprint2015arXiv

Online QoS Modeling in the Cloud: A Hybrid and Adaptive Multi-Learners Approach

Given the on-demand nature of cloud computing, managing cloud-based services requires accurate modeling for the correlation between their Quality of Service (QoS) and cloud configurations/resources. The resulted models need to cope with the dynamic fluctuation of QoS sensitivity and interference. However, existing QoS modeling in the cloud are limited in terms of both accuracy and applicability due to their static and semi- dynamic nature. In this paper, we present a fully dynamic multi- learners approach for automated and online QoS modeling in the cloud. We contribute to a hybrid learners solution, which improves accuracy while keeping model complexity adequate. To determine the inputs of QoS model at runtime, we partition the inputs space into two sub-spaces, each of which applies different symmetric uncertainty based selection techniques, and we then combine the sub-spaces results. The learners are also adaptive; they simultaneously allow several machine learning algorithms to model QoS function and dynamically select the best model for prediction on the fly. We experimentally evaluate our models using RUBiS benchmark and realistic FIFA 98 workload. The results show that our multi-learners approach is more accurate and effective in contrast to the other state-of-the-art approaches.

preprint2015arXiv

Polytypism and Unexpected Strong Interlayer Coupling of two-Dimensional Layered ReS2

The anisotropic two-dimensional (2D) van der Waals (vdW) layered materials, with both scientific interest and potential application, have one more dimension to tune the properties than the isotropic 2D materials. The interlayer vdW coupling determines the properties of 2D multi-layer materials by varying stacking orders. As an important representative anisotropic 2D materials, multilayer rhenium disulfide (ReS2) was expected to be random stacking and lack of interlayer coupling. Here, we demonstrate two stable stacking orders (aa and a-b) of N layer (NL, N>1) ReS2 from ultralow-frequency and high-frequency Raman spectroscopy, photoluminescence spectroscopy and first-principles density functional theory calculation. Two interlayer shear modes are observed in aa-stacked NL-ReS2 while only one interlayer shear mode appears in a-b-stacked NL-ReS2, suggesting anisotropic-like and isotropic-like stacking orders in aa- and a-b-stacked NL-ReS2, respectively. The frequency of the interlayer shear and breathing modes reveals unexpected strong interlayer coupling in aa- and a-b-NL-ReS2, the force constants of which are 55-90% to those of multilayer MoS2. The observation of strong interlayer coupling and polytypism in multi-layer ReS2 stimulate future studies on the structure, electronic and optical properties of other 2D anisotropic materials.

preprint2015arXiv

Substrate-free layer-number identification of two-dimensional materials: A case of Mo$_{0.5}$W$_{0.5}$S$_2$ alloy

Any of two or more two-dimensional (2D) materials with similar properties can be alloyed into a new layered material, namely, 2D alloy. Individual monolayer in 2D alloys are kept together by Van der Waals interactions. The property of multilayer alloys is a function of their layer number. Here, we studied the shear (C) and layer-breathing (LB) modes of Mo$_{0.5}$W$_{0.5}$S$_2$ alloy flakes and their link to the layer number of alloy flakes. The study reveals that the disorder effect is absent in the C and LB modes of 2D alloys, and the monatomic chain model can be used to estimate the frequencies of the C and LB modes. We demonstrated how to use the C and LB mode frequency to identify the layer number of alloy flakes deposited on different substrates. This technique is independent of the substrate, stoichiometry, monolayer thickness and complex refractive index of 2D materials, offering a robust and substrate-free approach for layer-number identification of 2D materials.

preprint2015arXiv

The Handbook of Engineering Self-Aware and Self-Expressive Systems

When faced with the task of designing and implementing a new self-aware and self-expressive computing system, researchers and practitioners need a set of guidelines on how to use the concepts and foundations developed in the Engineering Proprioception in Computing Systems (EPiCS) project. This report provides such guidelines on how to design self-aware and self-expressive computing systems in a principled way. We have documented different categories of self-awareness and self-expression level using architectural patterns. We have also documented common architectural primitives, their possible candidate techniques and attributes for architecting self-aware and self-expressive systems. Drawing on the knowledge obtained from the previous investigations, we proposed a pattern driven methodology for engineering self-aware and self-expressive systems to assist in utilising the patterns and primitives during design. The methodology contains detailed guidance to make decisions with respect to the possible design alternatives, providing a systematic way to build self-aware and self-expressive systems. Then, we qualitatively and quantitatively evaluated the methodology using two case studies. The results reveal that our pattern driven methodology covers the main aspects of engineering self-aware and self-expressive systems, and that the resulted systems perform significantly better than the non-self-aware systems.

preprint2015arXiv

Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology

Every culture and language is unique. Our work expressly focuses on the uniqueness of culture and language in relation to human affect, specifically sentiment and emotion semantics, and how they manifest in social multimedia. We develop sets of sentiment- and emotion-polarized visual concepts by adapting semantic structures called adjective-noun pairs, originally introduced by Borth et al. (2013), but in a multilingual context. We propose a new language-dependent method for automatic discovery of these adjective-noun constructs. We show how this pipeline can be applied on a social multimedia platform for the creation of a large-scale multilingual visual sentiment concept ontology (MVSO). Unlike the flat structure in Borth et al. (2013), our unified ontology is organized hierarchically by multilingual clusters of visually detectable nouns and subclusters of emotionally biased versions of these nouns. In addition, we present an image-based prediction task to show how generalizable language-specific models are in a multilingual context. A new, publicly available dataset of >15.6K sentiment-biased visual concepts across 12 languages with language-specific detector banks, >7.36M images and their metadata is also released.

preprint2014arXiv

DeepSentiBank: Visual Sentiment Concept Classification with Deep Convolutional Neural Networks

This paper introduces a visual sentiment concept classification method based on deep convolutional neural networks (CNNs). The visual sentiment concepts are adjective noun pairs (ANPs) automatically discovered from the tags of web photos, and can be utilized as effective statistical cues for detecting emotions depicted in the images. Nearly one million Flickr images tagged with these ANPs are downloaded to train the classifiers of the concepts. We adopt the popular model of deep convolutional neural networks which recently shows great performance improvement on classifying large-scale web-based image dataset such as ImageNet. Our deep CNNs model is trained based on Caffe, a newly developed deep learning framework. To deal with the biased training data which only contains images with strong sentiment and to prevent overfitting, we initialize the model with the model weights trained from ImageNet. Performance evaluation shows the newly trained deep CNNs model SentiBank 2.0 (or called DeepSentiBank) is significantly improved in both annotation accuracy and retrieval performance, compared to its predecessors which mainly use binary SVM classification models.

preprint2014arXiv

Dynamic Conic Finance via Backward Stochastic Difference Equations

We present an arbitrage free theoretical framework for modeling bid and ask prices of dividend paying securities in a discrete time setup using theory of dynamic acceptability indices. In the first part of the paper we develop the theory of dynamic subscale invariant performance measures, on a general probability space, and discrete time setup. We prove a representation theorem of such measures in terms of a family of dynamic convex risk measures, and provide a representation of dynamic risk measures in terms of g-expectations, and solutions of BS$Δ$Es with convex drivers. We study the existence and uniqueness of the solutions, and derive a comparison theorem for corresponding BS$Δ$Es. In the second part of the paper we discuss a market model for dividend paying securities by introducing the pricing operators that are defined in terms of dynamic acceptability indices, and find various properties of these operators. Using these pricing operators, we define the bid and ask prices for the underlying securities and then for derivatives in this market. We show that the obtained market model is arbitrage free, and we also prove a series of properties of these prices.

preprint2014arXiv

Latent Tree Models and Approximate Inference in Bayesian Networks

We propose a novel method for approximate inference in Bayesian networks (BNs). The idea is to sample data from a BN, learn a latent tree model (LTM) from the data offline, and when online, make inference with the LTM instead of the original BN. Because LTMs are tree-structured, inference takes linear time. In the meantime, they can represent complex relationship among leaf nodes and hence the approximation accuracy is often good. Empirical evidence shows that our method can achieve good approximation accuracy at low online computational cost.

preprint2013arXiv

Bounded Geometry and Characterization of post-singularly Finite $(p,q)$-Exponential Maps

In this paper we define a topological class of branched covering maps of the plane called {\em topological exponential maps of type $(p,q)$} and denoted by $\TE_{p,q}$, where $p\geq 0$ and $q\geq 1$. We follow the framework given in \cite{Ji} to study the problem of combinatorially characterizing an entire map $P e^{Q}$, where $P$ is a polynomial of degree $p$ and $Q$ is a polynomial of degree $q$ using an {\em iteration scheme defined by Thurston} and a {\em bounded geometry condition}. We first show that an element $f \in {\TE}_{p,q}$ with finite post-singular set is combinatorially equivalent to an entire map $P e^{Q}$ if and only if it has bounded geometry with compactness. Thus to complete the characterization, we only need to check that the bounded geometry actually implies compactness. We show this for some $f\in \TE_{p,1}$, $p\geq 1$. Our main result in this paper is that a post-singularly finite map $f$ in $\TE_{0,1}$ or a post-singularly finite map $f$ in $\TE_{p,1}$, $p\geq 1$, with only one non-zero simple breanch point $c$ such that either $c$ is periodic or $c$ and $f(c)$ are both not periodic, is combinatorially equivalent to a post-singularly finite entire map of either the form $e^{λz}$ or the form $ αz^{p}e^{λz}$, where $α=(-λ/p)^{p}e^{- λ(-p/λ)^{p}}$, respectively, if and only if it has bounded geometry. This is the first result in this direction for a family of transcendental holomorphic maps with critical points.

preprint2013arXiv

Bounded Geometry and Families of Meromorphic Functions with Two Asymptotic Values

In this paper we consider the topological class, denoted by AV2, of universal covering maps from the plane to the sphere with two removed points. We prove that an element f in AV2 with finite post-singular set is combinatorially equivalent to a meromorphic transcendental map g with constant Schwarzian derivative if any only if f satisfies a Bounded Geometry condition.

preprint2013arXiv

On Improving Energy Efficiency within Green Femtocell Networks: A Hierarchical Reinforcement Learning Approach

One of the efficient solutions of improving coverage and increasing capacity in cellular networks is the deployment of femtocells. As the cellular networks are becoming more complex, energy consumption of whole network infrastructure is becoming important in terms of both operational costs and environmental impacts. This paper investigates energy efficiency of two-tier femtocell networks through combining game theory and stochastic learning. With the Stackelberg game formulation, a hierarchical reinforcement learning framework is applied for studying the joint expected utility maximization of macrocells and femtocells subject to the minimum signal-to-interference-plus-noise-ratio requirements. In the learning procedure, the macrocells act as leaders and the femtocells are followers. At each time step, the leaders commit to dynamic strategies based on the best responses of the followers, while the followers compete against each other with no further information but the leaders' transmission parameters. In this paper, we propose two reinforcement learning based intelligent algorithms to schedule each cell's stochastic power levels. Numerical experiments are presented to validate the investigations. The results show that the two learning algorithms substantially improve the energy efficiency of the femtocell networks.

preprint2012arXiv

Improving Energy Efficiency in Femtocell Networks: A Hierarchical Reinforcement Learning Framework

This paper investigates energy efficiency for two-tier femtocell networks through combining game theory and stochastic learning. With the Stackelberg game formulation, a hierarchical reinforcement learning framework is applied to study the joint average utility maximization of macrocells and femtocells subject to the minimum signal-to-interference-plus-noise-ratio requirements. The macrocells behave as the leaders and the femtocells are followers during the learning procedure. At each time step, the leaders commit to dynamic strategies based on the best responses of the followers, while the followers compete against each other with no further information but the leaders' strategy information. In this paper, we propose two learning algorithms to schedule each cell's stochastic power levels, leading by the macrocells. Numerical experiments are presented to validate the proposed studies and show that the two learning algorithms substantially improve the energy efficiency of the femtocell networks.

preprint2011arXiv

Canonical Thurston Obstructions for Sub-Hyperbolic Semi-Rational Branched Coverings

We prove that the canonical Thurston obstruction for a sub-hyperbolic semi-rational branched covering exists if the branched covering is not CLH-equivalent to a rational map.

preprint2011arXiv

Creating a Live, Public Short Message Service Corpus: The NUS SMS Corpus

Short Message Service (SMS) messages are largely sent directly from one person to another from their mobile phones. They represent a means of personal communication that is an important communicative artifact in our current digital era. As most existing studies have used private access to SMS corpora, comparative studies using the same raw SMS data has not been possible up to now. We describe our efforts to collect a public SMS corpus to address this problem. We use a battery of methodologies to collect the corpus, paying particular attention to privacy issues to address contributors' concerns. Our live project collects new SMS message submissions, checks their quality and adds the valid messages, releasing the resultant corpus as XML and as SQL dumps, along with corpus statistics, every month. We opportunistically collect as much metadata about the messages and their sender as possible, so as to enable different types of analyses. To date, we have collected about 60,000 messages, focusing on English and Mandarin Chinese.

Tao Chen

What is connected

Connect this record

See the researcher in context

Building this map preview

94 published item(s)

LLMSYS-HPOBench: Hyperparameter Optimization Benchmark Suite for Real-World LLM Systems

HY-MT1.5 Technical Report

MLN-net: A multi-source medical image segmentation method for clustered microcalcifications using multiple layer normalization

$β$-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture Search

A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction

Do Performance Aspirations Matter for Guiding Software Configuration Tuning?

End-to-End 3D Dense Captioning with Vote2Cap-DETR

FECANet: Boosting Few-Shot Semantic Segmentation with Feature-Enhanced Context-Aware Network

Optimizing nonadiabatic geometric quantum gates against off-resonance error by dynamical correction in a silicon-based spin qubit

$β$-DARTS: Beta-Decay Regularization for Differentiable Architecture Search

Asymmetric topological pumping in nonparaxial photonics

CO2Sum:Contrastive Learning for Factual-Consistent Abstractive Summarization

Composite Short-path Nonadiabatic Holonomic Quantum Gates

Data-Driven Nonparametric Robust Control under Dependence Uncertainty

Does Configuration Encoding Matter in Learning Software Performance? An Empirical Study on Encoding Schemes

ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking Inference

Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation

Generalized Global Ranking-Aware Neural Architecture Ranker for Efficient Image Classifier Search

HandoverSim: A Simulation Framework and Benchmark for Human-to-Robot Object Handovers

Joint Distribution Alignment via Adversarial Learning for Domain Adaptive Object Detection

Learning Cross-Image Object Semantic Relation in Transformer for Few-Shot Fine-Grained Image Classification

Lifelong Dynamic Optimization for Self-Adaptive Systems: Fact or Fiction?

Multi-Forgery Detection Challenge 2022: Push the Frontier of Unconstrained and Diverse Forgery Detection

Nonparametric Adaptive Bayesian Stochastic Control Under Model Uncertainty

Nonparametric Adaptive Robust Control Under Model Uncertainty

On Reporting Performance and Accuracy Bugs for Deep Learning Frameworks: An Exploratory Study from GitHub

Planning Landscape Analysis for Self-Adaptive Systems

Point Cloud Instance Segmentation with Semi-supervised Bounding-Box Mining

Rapid Locomotion via Reinforcement Learning

Rarefaction effects in head-on collision of two identical droplets

Rethinking Saliency Map: An Context-aware Perturbation Method to Explain EEG-based Deep Learning Model

Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic Segmentation

Study of a droplet breakup process in decaying homogeneous isotropic turbulence based on the phase-field DUGKS approach

The Weights can be Harmful: Pareto Search versus Weighted Search in Multi-Objective Search-Based Software Engineering

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation

Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal

A new class of differential quasivariational inequalities with an application to a quasistatic viscoelastic frictional contact problem

Error-Tolerant Geometric Quantum Control for Logical Qubits with Minimal Resource

Neural Hierarchical Factorization Machines for User's Event Sequence Analysis

Semantically Meaningful Class Prototype Learning for One-Shot Image Semantic Segmentation

Slices of Parameter Space for Meromorphic Maps with Two Asymptotic Values

A Design of Cooperative Overtaking Based on Complex Lane Detection and Collision Risk Estimation

A Machine Learning Approach to Adaptive Robust Utility Maximization and Hedging

APPCorp: A Corpus for Android Privacy Policy Document Structure Analysis

Beyond Adiabatic Elimination in Topological Floquet Engineering

BiLO-CPDP: Bi-Level Programming for Automated Model Discovery in Cross-Project Defect Prediction

Cascade EF-GAN: Progressive Facial Expression Editing with Local Focuses

Context-Based Quotation Recommendation

DATESSO: Self-Adapting Service Composition with Debt-Aware Two Levels Constraint Reasoning

DeepSQLi: Deep Semantic Learning for Testing SQL Injection

Do not forget interaction: Predicting fatality of COVID-19 patients using logistic regression

Experimental implementation of universal nonadiabatic geometric quantum gates in a superconducting circuit

Fast holonomic quantum computation on superconducting circuits with optimal control

High-fidelity geometric gate for silicon-based spin qubits

Information Freshness-Aware Task Offloading in Air-Ground Integrated Edge Computing Systems

Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition

Nonadiabatic geometric quantum computation with optimal control on superconducting circuits

On Carbon Nanotubes in the Interstellar Medium

PIDNet: An Efficient Network for Dynamic Pedestrian Intrusion Detection

Robust and Fast Holonomic Quantum Gates with Encoding on Superconducting Circuits

Search-Based Software Engineering for Self-Adaptive Systems: Survey, Disappointments, Suggestions and Opportunities

Synergizing Domain Expertise with Self-Awareness in Software Systems: A Patternized Architecture Guideline

TexSmart: A Text Understanding System for Fine-Grained NER and Enhanced Semantic Analysis

The Adaptability and Challenges of Autonomous Vehicles to Pedestrians in Urban China

Time-inconsistent Markovian control problems under model uncertainty with application to the mean-variance portfolio selection

Topological quantum walks in momentum space with a Bose-Einstein condensate

Tunable non-reciprocal quantum transport through a dissipative Aharonov-Bohm ring in ultracold atoms

Understanding the Automated Parameter Optimization on Transfer Learning for CPDP: An Empirical Study

Fragmentation and isomerization of polycyclic aromatic hydrocarbons in the interstellar medium: coronene as a case study

Bounded Geometry and Characterization of Some Transcendental Entire and Meromorphic Maps

Bridging Ecology and Cloud: Transposing Ecological Prespective to Enable Better Cloud Autoscaling

Determining layer number of two dimensional flakes of transition-metal dichalcogenides by the Raman intensity from substrate

Exact solution of the pairing problem for spherical and deformed systems

Multilingual Visual Sentiment Concept Matching