Source author record

Wei Shi

Wei Shi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

39works

30topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Caracal: Causal Architecture via Spectral Mixing

The scalability of Large Language Models to long sequences is hindered by the quadratic cost of attention and the limitations of positional encodings. To address these, we introduce Caracal, a novel architecture that replaces attention with a parameter-efficient, O(L log(L)) Multi-Head Fourier (MHF) module. Our contributions are threefold: (1) We leverage the Fast Fourier Transform (FFT) for sequence mixing, inherently addressing both bottlenecks mentioned above. (2) We apply a frequency-domain causal masking technique that enforces autoregressive capabilities via asymmetric padding and truncation, overcoming a critical barrier for Fourier-based generative models. (3) Unlike efficient models relying on hardware-specific implementations (e.g., Mamba), we uses standard library operators. This ensures robust portability, eliminating common deployment barriers. Evaluations demonstrate that Caracal performs competitively with Transformer and SSM baselines, offering a scalable and simple pathway for efficient long-sequence modeling. Code is available in Appendix.

preprint2022arXiv

Identity-Sensitive Knowledge Propagation for Cloth-Changing Person Re-identification

Cloth-changing person re-identification (CC-ReID), which aims to match person identities under clothing changes, is a new rising research topic in recent years. However, typical biometrics-based CC-ReID methods often require cumbersome pose or body part estimators to learn cloth-irrelevant features from human biometric traits, which comes with high computational costs. Besides, the performance is significantly limited due to the resolution degradation of surveillance images. To address the above limitations, we propose an effective Identity-Sensitive Knowledge Propagation framework (DeSKPro) for CC-ReID. Specifically, a Cloth-irrelevant Spatial Attention module is introduced to eliminate the distraction of clothing appearance by acquiring knowledge from the human parsing module. To mitigate the resolution degradation issue and mine identity-sensitive cues from human faces, we propose to restore the missing facial details using prior facial knowledge, which is then propagated to a smaller network. After training, the extra computations for human parsing or face restoration are no longer required. Extensive experiments show that our framework outperforms state-of-the-art methods by a large margin. Our code is available at https://github.com/KimbingNg/DeskPro.

preprint2022arXiv

Implicit Channel Learning for Machine Learning Applications in 6G Wireless Networks

With the deployment of the fifth generation (5G) wireless systems gathering momentum across the world, possible technologies for 6G are under active research discussions. In particular, the role of machine learning (ML) in 6G is expected to enhance and aid emerging applications such as virtual and augmented reality, vehicular autonomy, and computer vision. This will result in large segments of wireless data traffic comprising image, video and speech. The ML algorithms process these for classification/recognition/estimation through the learning models located on cloud servers. This requires wireless transmission of data from edge devices to the cloud server. Channel estimation, handled separately from recognition step, is critical for accurate learning performance. Toward combining the learning for both channel and the ML data, we introduce implicit channel learning to perform the ML tasks without estimating the wireless channel. Here, the ML models are trained with channel-corrupted datasets in place of nominal data. Without channel estimation, the proposed approach exhibits approximately 60% improvement in image and speech classification tasks for diverse scenarios such as millimeter wave and IEEE 802.11p vehicular channels.

preprint2022arXiv

Preserving Dense Features for Ki67 Nuclei Detection

Nuclei detection is a key task in Ki67 proliferation index estimation in breast cancer images. Deep learning algorithms have shown strong potential in nuclei detection tasks. However, they face challenges when applied to pathology images with dense medium and overlapping nuclei since fine details are often diluted or completely lost by early maxpooling layers. This paper introduces an optimized UV-Net architecture, specifically developed to recover nuclear details with high-resolution through feature preservation for Ki67 proliferation index computation. UV-Net achieves an average F1-score of 0.83 on held-out test patch data, while other architectures obtain 0.74-0.79. On tissue microarrays (unseen) test data obtained from multiple centers, UV-Net's accuracy exceeds other architectures by a wide margin, including 9-42\% on Ontario Veterinary College, 7-35\% on Protein Atlas and 0.3-3\% on University Health Network.

preprint2022arXiv

RobustAnalog: Fast Variation-Aware Analog Circuit Design Via Multi-task RL

Analog/mixed-signal circuit design is one of the most complex and time-consuming stages in the whole chip design process. Due to various process, voltage, and temperature (PVT) variations from chip manufacturing, analog circuits inevitably suffer from performance degradation. Although there has been plenty of work on automating analog circuit design under the typical condition, limited research has been done on exploring robust designs under real and unpredictable silicon variations. Automatic analog design against variations requires prohibitive computation and time costs. To address the challenge, we present RobustAnalog, a robust circuit design framework that involves the variation information in the optimization process. Specifically, circuit optimizations under different variations are considered as a set of tasks. Similarities among tasks are leveraged and competitions are alleviated to realize a sample-efficient multi-task training. Moreover, RobustAnalog prunes the task space according to the current performance in each iteration, leading to a further simulation cost reduction. In this way, RobustAnalog can rapidly produce a set of circuit parameters that satisfies diverse constraints (e.g. gain, bandwidth, noise...) across variations. We compare RobustAnalog with Bayesian optimization, Evolutionary algorithm, and Deep Deterministic Policy Gradient (DDPG) and demonstrate that RobustAnalog can significantly reduce required optimization time by 14-30 times. Therefore, our study provides a feasible method to handle various real silicon conditions.

preprint2022arXiv

SpinQ Triangulum: a commercial three-qubit desktop quantum computer

SpinQ Triangulum is the second generation of the desktop quantum computers designed and manufactured by SpinQ Technology. SpinQ's desktop quantum computer series, based on room temperature NMR spectrometer, provide light-weighted, cost-effective and maintenance-free quantum computing platforms that aim to provide real-device experience for quantum computing education for K-12 and college level. These platforms also feature quantum control design capabilities for studying quantum control and quantum noise. Compared with the first generation product, the two-qubit SpinQ Gemini, Triangulum features a three-qubit QPU, smaller dimensions (61 * 33 * 56 cm^3) and lighter (40 kg). Furthermore, the magnetic field is more stable and the performance of quantum control is more accurate. This paper introduces the system design of Triangulum and its new features. As an example of performing quantum computing tasks, we present the implementation of the Harrow-Hassidim-Lloyd (HHL) algorithm on Triangulum, demonstrating Triangulum's capability of undertaking complex quantum computing tasks. SpinQ will continue to develop desktop quantum computing platform with more qubits. Meanwhile, a simplified version of SpinQ Gemini, namely Gemini Mini (https://www.spinq.cn/products#geminiMini-anchor) , has been recently realised. Gemini Mini is much more portable (20* 35 * 26 cm^3, 14 kg) and affordable for most K-12 schools around the world.

preprint2022arXiv

Supervised Contrastive CSI Representation Learning for Massive MIMO Positioning

Similarity metric is crucial for massive MIMO positioning utilizing channel state information~(CSI). In this letter, we propose a novel massive MIMO CSI similarity learning method via deep convolutional neural network~(DCNN) and contrastive learning. A contrastive loss function is designed considering multiple positive and negative CSI samples drawn from a training dataset. The DCNN encoder is trained using the loss so that positive samples are mapped to points close to the anchor's encoding, while encodings of negative samples are kept away from the anchor's in the representation space. Evaluation results of fingerprint-based positioning on a real-world CSI dataset show that the learned similarity metric improves positioning accuracy significantly compared with other known state-of-the-art methods.

preprint2021arXiv

Obfuscation of Images via Differential Privacy: From Facial Images to General Images

Due to the pervasiveness of image capturing devices in every-day life, images of individuals are routinely captured. Although this has enabled many benefits, it also infringes on personal privacy. A promising direction in research on obfuscation of facial images has been the work in the k-same family of methods which employ the concept of k-anonymity from database privacy. However, there are a number of deficiencies of k-anonymity that carry over to the k-same methods, detracting from their usefulness in practice. In this paper, we first outline several of these deficiencies and discuss their implications in the context of facial obfuscation. We then develop a framework through which we obtain a formal differentially private guarantee for the obfuscation of facial images in generative machine learning models. Our approach provides a provable privacy guarantee that is not susceptible to the outlined deficiencies of k-same obfuscation and produces photo-realistic obfuscated output. In addition, we demonstrate through experimental comparisons that our approach can achieve comparable utility to k-same obfuscation in terms of preservation of useful features in the images. Furthermore, we propose a method to achieve differential privacy for any image (i.e., without restriction to facial images) through the direct modification of pixel intensities. Although the addition of noise to pixel intensities does not provide the high visual quality obtained via generative machine learning models, it offers greater versatility by eliminating the need for a trained model. We demonstrate that our proposed use of the exponential mechanism in this context is able to provide superior visual quality to pixel-space obfuscation using the Laplace mechanism.

preprint2021arXiv

SpinQ Gemini: a desktop quantum computer for education and research

SpinQ Gemini is a commercial desktop quantum computer designed and manufactured by SpinQ Technology. It is an integrated hardware-software system. The first generation product with two qubits was launched in January 2020. The hardware is based on NMR spectrometer, with permanent magnets providing $\sim 1$ T magnetic field. SpinQ Gemini operates under room temperature ($0$-$30^{\circ}$C), highlighting its lightweight (55 kg with a volume of $70\times 40 \times 80$ cm$^3$), cost-effective (under $50$k USD), and maintenance-free. SpinQ Gemini aims to provide real-device experience for quantum computing education for K-12 and at the college level. It also features quantum control design capabilities that benefit the researchers studying quantum control and quantum noise. Since its first launch, SpinQ Gemini has been shipped to institutions in Canada, Taiwan and Mainland China. This paper introduces the system of design of SpinQ Gemini, from hardware to software. We also demonstrate examples for performing quantum computing tasks on SpinQ Gemini, including one task for a variational quantum eigensolver of a two-qubit Heisenberg model. The next generations of SpinQ quantum computing devices will adopt models of more qubits, advanced control functions for researchers with comparable cost, as well as simplified models for much lower cost (under $5$k USD) for K-12 education. We believe that low-cost portable quantum computer products will facilitate hands-on experience for teaching quantum computing at all levels, well-prepare younger generations of students and researchers for the future of quantum technologies.

preprint2020arXiv

Accelerating Incremental Gradient Optimization with Curvature Information

This paper studies an acceleration technique for incremental aggregated gradient ({\sf IAG}) method through the use of \emph{curvature} information for solving strongly convex finite sum optimization problems. These optimization problems of interest arise in large-scale learning applications. Our technique utilizes a curvature-aided gradient tracking step to produce accurate gradient estimates incrementally using Hessian information. We propose and analyze two methods utilizing the new technique, the curvature-aided IAG ({\sf CIAG}) method and the accelerated CIAG ({\sf A-CIAG}) method, which are analogous to gradient method and Nesterov's accelerated gradient method, respectively. Setting $κ$ to be the condition number of the objective function, we prove the $R$ linear convergence rates of $1 - \frac{4c_0 κ}{(κ+1)^2}$ for the {\sf CIAG} method, and $1 - \sqrt{\frac{c_1}{2κ}}$ for the {\sf A-CIAG} method, where $c_0,c_1 \leq 1$ are constants inversely proportional to the distance between the initial point and the optimal solution. When the initial iterate is close to the optimal solution, the $R$ linear convergence rates match with the gradient and accelerated gradient method, albeit {\sf CIAG} and {\sf A-CIAG} operate in an incremental setting with strictly lower computation complexity. Numerical experiments confirm our findings. The source codes used for this paper can be found on \url{http://github.com/hoitowai/ciag/}.

preprint2020arXiv

Chip-scale Full-Stokes Spectropolarimeter in Silicon Photonic Circuits

Wavelength-dependent polarization state of light carries crucial information about light-matter interactions. However, its measurement is limited to bulky, energy-consuming devices, which prohibits many modern, portable applications. Here, we propose and demonstrate a chip-scale spectropolarimeter implemented using a CMOS-compatible silicon photonics technology. Four compact Vernier microresonator spectrometers are monolithically integrated with a broadband polarimeter consisting of a 2D nanophotonic antenna and a polarimetric circuit to achieve full-Stokes spectropolarimetric analysis. The proposed device offers a solid-state spectropolarimetry solution with a small footprint of 1*0.6 mm2 and low power consumption of 360 mW}. Full-Stokes spectral detection across a broad spectral range of 50 nm with a resolution of 1~nm is demonstrated in characterizing a material possessing structural chirality. The proposed device may enable a broader application of spectropolarimetry in the fields ranging from biomedical diagnostics and chemical analysis to observational astronomy.

preprint2020arXiv

Differential Privacy Via a Truncated and Normalized Laplace Mechanism

When querying databases containing sensitive information, the privacy of individuals stored in the database has to be guaranteed. Such guarantees are provided by differentially private mechanisms which add controlled noise to the query responses. However, most such mechanisms do not take into consideration the valid range of the query being posed. Thus, noisy responses that fall outside of this range may potentially be produced. To rectify this and therefore improve the utility of the mechanism, the commonly used Laplace distribution can be truncated to the valid range of the query and then normalized. However, such a data-dependent operation of normalization leaks additional information about the true query response thereby violating the differential privacy guarantee. Here, we propose a new method which preserves the differential privacy guarantee through a careful determination of an appropriate scaling parameter for the Laplace distribution. We also generalize the privacy guarantee in the context of the Laplace distribution to account for data-dependent normalization factors and study this guarantee for different classes of range constraint configurations. We provide derivations of the optimal scaling parameter (i.e., the minimal value that preserves differential privacy) for each class or provide an approximation thereof. As a consequence of this work, one can use the Laplace distribution to answer queries in a range-adherent and differentially private manner.

preprint2020arXiv

Joint Embedding in Named Entity Linking on Sentence Level

Named entity linking is to map an ambiguous mention in documents to an entity in a knowledge base. The named entity linking is challenging, given the fact that there are multiple candidate entities for a mention in a document. It is difficult to link a mention when it appears multiple times in a document, since there are conflicts by the contexts around the appearances of the mention. In addition, it is difficult since the given training dataset is small due to the reason that it is done manually to link a mention to its mapping entity. In the literature, there are many reported studies among which the recent embedding methods learn vectors of entities from the training dataset at document level. To address these issues, we focus on how to link entity for mentions at a sentence level, which reduces the noises introduced by different appearances of the same mention in a document at the expense of insufficient information to be used. We propose a new unified embedding method by maximizing the relationships learned from knowledge graphs. We confirm the effectiveness of our method in our experimental studies.

preprint2020arXiv

Push-Pull Gradient Methods for Distributed Optimization in Networks

In this paper, we focus on solving a distributed convex optimization problem in a network, where each agent has its own convex cost function and the goal is to minimize the sum of the agents' cost functions while obeying the network connectivity structure. In order to minimize the sum of the cost functions, we consider new distributed gradient-based methods where each node maintains two estimates, namely, an estimate of the optimal decision variable and an estimate of the gradient for the average of the agents' objective functions. From the viewpoint of an agent, the information about the gradients is pushed to the neighbors, while the information about the decision variable is pulled from the neighbors hence giving the name "push-pull gradient methods". The methods utilize two different graphs for the information exchange among agents, and as such, unify the algorithms with different types of distributed architecture, including decentralized (peer-to-peer), centralized (master-slave), and semi-centralized (leader-follower) architecture. We show that the proposed algorithms and their many variants converge linearly for strongly convex and smooth objective functions over a network (possibly with unidirectional data links) in both synchronous and asynchronous random-gossip settings. In particular, under the random-gossip setting, "push-pull" is the first class of algorithms for distributed optimization over directed graphs. Moreover, we numerically evaluate our proposed algorithms in both scenarios, and show that they outperform other existing linearly convergent schemes, especially for ill-conditioned problems and networks that are not well balanced.

preprint2020arXiv

Self-Refining Deep Symmetry Enhanced Network for Rain Removal

Rain removal aims to remove the rain streaks on rain images. The state-of-the-art methods are mostly based on Convolutional Neural Network~(CNN). However, as CNN is not equivariant to object rotation, these methods are unsuitable for dealing with the tilted rain streaks. To tackle this problem, we propose Deep Symmetry Enhanced Network~(DSEN) that is able to explicitly extract the rotation equivariant features from rain images. In addition, we design a self-refining mechanism to remove the accumulated rain streaks in a coarse-to-fine manner. This mechanism reuses DSEN with a novel information link which passes the gradient flow to the higher stages. Extensive experiments on both synthetic and real-world rain images show that our self-refining DSEN yields the top performance.

preprint2020arXiv

SNEAP: A Fast and Efficient Toolchain for Mapping Large-Scale Spiking Neural Network onto NoC-based Neuromorphic Platform

Spiking neural network (SNN), as the third generation of artificial neural networks, has been widely adopted in vision and audio tasks. Nowadays, many neuromorphic platforms support SNN simulation and adopt Network-on-Chips (NoC) architecture for multi-cores interconnection. However, interconnection brings huge area overhead to the platform. Moreover, run-time communication on the interconnection has a significant effect on the total power consumption and performance of the platform. In this paper, we propose a toolchain called SNEAP for mapping SNNs to neuromorphic platforms with multi-cores, which aims to reduce the energy and latency brought by spike communication on the interconnection. SNEAP includes two key steps: partitioning the SNN to reduce the spikes communicated between partitions, and mapping the partitions of SNN to the NoC to reduce average hop of spikes under the constraint of hardware resources. SNEAP can reduce more spikes communicated on the interconnection of NoC and spend less time than other toolchains in the partitioning phase. Moreover, the average hop of spikes is reduced more by SNEAP within a time period, which effectively reduces the energy and latency on the NoC-based neuromorphic platform. The experimental results show that SNEAP can achieve 418x reduction in end-to-end execution time, and reduce energy consumption and spike latency, on average, by 23% and 51% respectively, compared with SpiNeMap.

preprint2020arXiv

The Unusual Eruption of the Extragalactic Classical Nova M31N 2017-09a

M31N 2017-09a is a classical nova and was observed for some 160 days following its initial eruption, during which time it underwent a number of bright secondary outbursts. The light-curve is characterized by continual variation with excursions of at least 0.5 magnitudes on a daily time-scale. The lower envelope of the eruption suggests that a single power-law can describe the decline rate. The eruption is relatively long with $t_2 = 111$, and $t_3 = 153$ days.

preprint2019arXiv

A decentralized proximal-gradient method with network independent step-sizes and separated convergence rates

This paper proposes a novel proximal-gradient algorithm for a decentralized optimization problem with a composite objective containing smooth and non-smooth terms. Specifically, the smooth and nonsmooth terms are dealt with by gradient and proximal updates, respectively. The proposed algorithm is closely related to a previous algorithm, PG-EXTRA \cite{shi2015proximal}, but has a few advantages. First of all, agents use uncoordinated step-sizes, and the stable upper bounds on step-sizes are independent of network topologies. The step-sizes depend on local objective functions, and they can be as large as those of the gradient descent. Secondly, for the special case without non-smooth terms, linear convergence can be achieved under the strong convexity assumption. The dependence of the convergence rate on the objective functions and the network are separated, and the convergence rate of the new algorithm is as good as one of the two convergence rates that match the typical rates for the general gradient descent and the consensus averaging. We provide numerical experiments to demonstrate the efficacy of the introduced algorithm and validate our theoretical discoveries.

preprint2019arXiv

Automatic image-domain Moire artifact reduction method in grating-based x-ray interferometry imaging

The aim of this study is to demonstrate the feasibility of removing the image Moire artifacts caused by system inaccuracies in grating-based x-ray interferometry imaging system via convolutional neural network (CNN) technique. Instead of minimizing these inconsistencies between the acquired phase stepping data via certain optimized signal retrieval algorithms, our newly proposed CNN-based method reduces the Moire artifacts in the image-domain via a learned image post-processing procedure. To ease the training data preparations, we propose to synthesize them with numerical natural images and experimentally obtained Moire artifact-only-images. Moreover, a fast signal processing method has also been developed to generate the needed large number of high quality Moire artifact-only images from finite number of acquired experimental phase stepping data. Experimental results show that the CNN method is able to remove Moire artifacts effectively, while maintaining the signal accuracy and image resolution.

preprint2016arXiv

A Decentralized Second-Order Method for Dynamic Optimization

This paper considers decentralized dynamic optimization problems where nodes of a network try to minimize a sequence of time-varying objective functions in a real-time scheme. At each time slot, nodes have access to different summands of an instantaneous global objective function and they are allowed to exchange information only with their neighbors. This paper develops the application of the Exact Second-Order Method (ESOM) to solve the dynamic optimization problem in a decentralized manner. The proposed dynamic ESOM algorithm operates by primal descending and dual ascending on a quadratic approximation of an augmented Lagrangian of the instantaneous consensus optimization problem. The convergence analysis of dynamic ESOM indicates that a Lyapunov function of the sequence of primal and dual errors converges linearly to an error bound when the local functions are strongly convex and have Lipschitz continuous gradients. Numerical results demonstrate the claim that the sequence of iterates generated by the proposed method is able to track the sequence of optimal arguments.

preprint2016arXiv

A Decentralized Second-Order Method with Exact Linear Convergence Rate for Consensus Optimization

This paper considers decentralized consensus optimization problems where different summands of a global objective function are available at nodes of a network that can communicate with neighbors only. The proximal method of multipliers is considered as a powerful tool that relies on proximal primal descent and dual ascent updates on a suitably defined augmented Lagrangian. The structure of the augmented Lagrangian makes this problem non-decomposable, which precludes distributed implementations. This problem is regularly addressed by the use of the alternating direction method of multipliers. The exact second order method (ESOM) is introduced here as an alternative that relies on: (i) The use of a separable quadratic approximation of the augmented Lagrangian. (ii) A truncated Taylor's series to estimate the solution of the first order condition imposed on the minimization of the quadratic approximation of the augmented Lagrangian. The sequences of primal and dual variables generated by ESOM are shown to converge linearly to their optimal arguments when the aggregate cost function is strongly convex and its gradients are Lipschitz continuous. Numerical results demonstrate advantages of ESOM relative to decentralized alternatives in solving least squares and logistic regression problems.

preprint2016arXiv

Decentralized Dynamic Optimization for Power Network Voltage Control

Voltage control in power distribution networks has been greatly challenged by the increasing penetration of volatile and intermittent devices. These devices can also provide limited reactive power resources that can be used to regulate the network-wide voltage. A decentralized voltage control strategy can be designed by minimizing a quadratic voltage mismatch error objective using gradient-projection (GP) updates. Coupled with the power network flow, the local voltage can provide the instantaneous gradient information. This paper aims to analyze the performance of this decentralized GP-based voltage control design under two dynamic scenarios: i) the nodes perform the decentralized update in an asynchronous fashion, and ii) the network operating condition is time-varying. For the asynchronous voltage control, we improve the existing convergence condition by recognizing that the voltage based gradient is always up-to-date. By modeling the network dynamics using an autoregressive process and considering time-varying resource constraints, we provide an error bound in tracking the instantaneous optimal solution to the quadratic error objective. This result can be extended to more general \textit{constrained dynamic optimization} problems with smooth strongly convex objective functions under stochastic processes that have bounded iterative changes. Extensive numerical tests have been performed to demonstrate and validate our analytical results for realistic power networks.

preprint2016arXiv

Expander Graph and Communication-Efficient Decentralized Optimization

In this paper, we discuss how to design the graph topology to reduce the communication complexity of certain algorithms for decentralized optimization. Our goal is to minimize the total communication needed to achieve a prescribed accuracy. We discover that the so-called expander graphs are near-optimal choices. We propose three approaches to construct expander graphs for different numbers of nodes and node degrees. Our numerical results show that the performance of decentralized optimization is significantly better on expander graphs than other regular graphs.

preprint2016arXiv

Geometrically Convergent Distributed Optimization with Uncoordinated Step-Sizes

A recent algorithmic family for distributed optimization, DIGing's, have been shown to have geometric convergence over time-varying undirected/directed graphs. Nevertheless, an identical step-size for all agents is needed. In this paper, we study the convergence rates of the Adapt-Then-Combine (ATC) variation of the DIGing algorithm under uncoordinated step-sizes. We show that the ATC variation of DIGing algorithm converges geometrically fast even if the step-sizes are different among the agents. In addition, our analysis implies that the ATC structure can accelerate convergence compared to the distributed gradient descent (DGD) structure which has been used in the original DIGing algorithm.

preprint2016arXiv

Hierarchy, dimension, attractor and self-organization -- dynamics of mode-locked fiber lasers

Mode-locked fiber lasers are one of the most important sources of ultra-short pulses. However, A unified description for the rich variety of states and the driving forces behind the complex and diverse nonlinear behavior of mode-locked fiber lasers have yet to be developed. Here we present a comprehensive theoretical framework based upon complexity science, thereby offering a fundamentally new way of thinking about the behavior of mode-locked fiber lasers. This hierarchically structured frame work provide a model with and changeable variable dimensionality resulting in a simple and elegant view, with which numerous complex states can be described systematically. The existence of a set of new mode-locked fiber laser states is proposed for the first time. Moreover, research into the attractors' basins reveals the origin of stochasticity, hysteresis and multistability in these systems. These findings pave the way for dynamics analysis and new system designs of mode-locked fiber lasers. The paradigm will have a wide range of potential applications in diverse research fields.

preprint2016arXiv

Interface coupling in twisted multilayer graphene by resonant Raman spectroscopy of layer breathing modes

Raman spectroscopy is the prime non-destructive characterization tool for graphene and related layered materials. The shear (C) and layer breathing modes (LBMs) are due to relative motions of the planes, either perpendicular or parallel to their normal. This allows one to directly probe the interlayer interactions in multilayer samples. Graphene and other two-dimensional (2d) crystals can be combined to form various hybrids and heterostructures, creating materials on demand with properties determined by the interlayer interaction. This is the case even for a single material, where multilayer stacks with different relative orientation have different optical and electronic properties. In twisted multilayer graphene samples there is a significant enhancement of the C modes due to resonance with new optically allowed electronic transitions, determined by the relative orientation of the layers. Here we show that this applies also to the LBMs, that can be now directly measured at room temperature. We find that twisting does not affect LBMs, quite different from the case of the C modes. This implies that the periodicity mismatch between two twisted layers mostly affects shear interactions. Our work shows that Raman spectroscopy is an ideal tool to uncover the interface coupling of 2d hybrids and heterostructures.

preprint2015arXiv

A Novel Geographic Partitioning System for Anonymizing Health Care Data

With large volumes of detailed health care data being collected, there is a high demand for the release of this data for research purposes. Hospitals and organizations are faced with conflicting interests of releasing this data and protecting the confidentiality of the individuals to whom the data pertains. Similarly, there is a conflict in the need to release precise geographic information for certain research applications and the requirement to censor or generalize the same information for the sake of confidentiality. Ultimately the challenge is to anonymize data in order to comply with government privacy policies while reducing the loss in geographic information as much as possible. In this paper, we present a novel geographic-based system for the anonymization of health care data. This system is broken up into major components for which different approaches may be supplied. We compare such approaches in order to make recommendations on which of them to select to best match user requirements.

preprint2015arXiv

Decentralized Quadratically Approximated Alternating Direction Method of Multipliers

This paper considers an optimization problem that components of the objective function are available at different nodes of a network and nodes are allowed to only exchange information with their neighbors. The decentralized alternating method of multipliers (DADMM) is a well-established iterative method for solving this category of problems; however, implementation of DADMM requires solving an optimization subproblem at each iteration for each node. This procedure is often computationally costly for the nodes. We introduce a decentralized quadratic approximation of ADMM (DQM) that reduces computational complexity of DADMM by minimizing a quadratic approximation of the objective function. Notwithstanding that DQM successively minimizes approximations of the cost, it converges to the optimal arguments at a linear rate which is identical to the convergence rate of DADMM. Further, we show that as time passes the coefficient of linear convergence for DQM approaches the one for DADMM. Numerical results demonstrate the effectiveness of DQM.

preprint2015arXiv

DQM: Decentralized Quadratically Approximated Alternating Direction Method of Multipliers

This paper considers decentralized consensus optimization problems where nodes of a network have access to different summands of a global objective function. Nodes cooperate to minimize the global objective by exchanging information with neighbors only. A decentralized version of the alternating directions method of multipliers (DADMM) is a common method for solving this category of problems. DADMM exhibits linear convergence rate to the optimal objective but its implementation requires solving a convex optimization problem at each iteration. This can be computationally costly and may result in large overall convergence times. The decentralized quadratically approximated ADMM algorithm (DQM), which minimizes a quadratic approximation of the objective function that DADMM minimizes at each iteration, is proposed here. The consequent reduction in computational time is shown to have minimal effect on convergence properties. Convergence still proceeds at a linear rate with a guaranteed constant that is asymptotically equivalent to the DADMM linear convergence rate constant. Numerical results demonstrate advantages of DQM relative to DADMM and other alternatives in a logistic regression problem.

preprint2015arXiv

Geographic Partitioning Techniques for the Anonymization of Health Care Data

Hospitals and health care organizations collect large amounts of detailed health care data that is in high demand by researchers. Thus, the possessors of such data are in need of methods that allow for this data to be released without compromising the confidentiality of the individuals to whom it pertains. As the geographic aspect of this data is becoming increasingly relevant for research being conducted, it is important for an \emph{anonymization} process to pay due attention to the geographic attributes of such data. In this paper, a novel system for health care data anonymization is presented. At the core of the system is the aggregation of an initial regionalization guided by the use of a Voronoi diagram. We conduct a comparison with another geographic-based system of anonymization, GeoLeader. We show that our system is capable of producing results of a comparable quality with a much faster running time.

preprint2015arXiv

Monolayer Molybdenum Disulfide Nanoribbons with High Optical Anisotropy

Two-dimensional Molybdenum Disulfide (MoS2) has shown promising prospects for the next generation electronics and optoelectronics devices. The monolayer MoS2 can be patterned into quasi-one-dimensional anisotropic MoS2 nanoribbons (MNRs), in which theoretical calculations have predicted novel properties. However, little work has been carried out in the experimental exploration of MNRs with a width of less than 20 nm where the geometrical confinement can lead to interesting phenomenon. Here, we prepared MNRs with width between 5 nm to 15 nm by direct helium ion beam milling. High optical anisotropy of these MNRs is revealed by the systematic study of optical contrast and Raman spectroscopy. The Raman modes in MNRs show strong polarization dependence. Besides that the E' and A'1 peaks are broadened by the phonon-confinement effect, the modes corresponding to singularities of vibrational density of states are activated by edges. The peculiar polarization behavior of Raman modes can be explained by the anisotropy of light absorption in MNRs, which is evidenced by the polarized optical contrast. The study opens the possibility to explore quasione-dimensional materials with high optical anisotropy from isotropic 2D family of transition metal dichalcogenides.

preprint2015arXiv

On Calibration of Three-axis Magnetometer

Magnetometer has received wide applications in attitude determination and scientific measurements. Calibration is an important step for any practical magnetometer use. The most popular three-axis magnetometer calibration methods are attitude-independent and have been founded on an approximate maximum likelihood (ML) estimation with a quartic subjective function, derived from the fact that the magnitude of the calibrated measurements should be constant in a homogeneous magnetic field. This paper highlights the shortcomings of those popular methods and proposes to use the quadratic optimal ML estimation instead for magnetometer calibration. Simulation and test results show that the optimal ML calibration is superior to the approximate ML methods for magnetometer calibration in both accuracy and stability, especially for those situations without sufficient attitude excitation. The significant benefits deserve the moderately increased computation burden. The main conclusion obtained in the context of magnetometer in this paper is potentially applicable to various kinds of three-axis sensors.

preprint2015arXiv

Phonon and Raman scattering of two-dimensional transition metal dichalcogenides from monolayer, multilayer to bulk material

Two-dimensional (2D) transition metal dichalcogenide (TMD) nanosheets exhibit remarkable electronic and optical properties. The 2D features, sizable bandgaps, and recent advances in the synthesis, characterization, and device fabrication of the representative MoS$_2$, WS$_2$, WSe$_2$, and MoSe$_2$ TMDs make TMDs very attractive in nanoelectronics and optoelectronics. Similar to graphite and graphene, the atoms within each layer in 2D TMDs are joined together by covalent bonds, while van der Waals interactions keep the layers together. This makes the physical and chemical properties of 2D TMDs layer dependent. In this review, we discuss the basic lattice vibrations of monolayer, multilayer, and bulk TMDs, including high-frequency optical phonons, interlayer shear and layer breathing phonons, the Raman selection rule, layer-number evolution of phonons, multiple phonon replica, and phonons at the edge of the Brillouin zone. The extensive capabilities of Raman spectroscopy in investigating the properties of TMDs are discussed, such as interlayer coupling, spin--orbit splitting, and external perturbations. The interlayer vibrational modes are used in rapid and substrate-free characterization of the layer number of multilayer TMDs and in probing interface coupling in TMD heterostructures. The success of Raman spectroscopy in investigating TMD nanosheets paves the way for experiments on other 2D crystals and related van der Waals heterostructures.

preprint2015arXiv

Polytypism and Unexpected Strong Interlayer Coupling of two-Dimensional Layered ReS2

The anisotropic two-dimensional (2D) van der Waals (vdW) layered materials, with both scientific interest and potential application, have one more dimension to tune the properties than the isotropic 2D materials. The interlayer vdW coupling determines the properties of 2D multi-layer materials by varying stacking orders. As an important representative anisotropic 2D materials, multilayer rhenium disulfide (ReS2) was expected to be random stacking and lack of interlayer coupling. Here, we demonstrate two stable stacking orders (aa and a-b) of N layer (NL, N>1) ReS2 from ultralow-frequency and high-frequency Raman spectroscopy, photoluminescence spectroscopy and first-principles density functional theory calculation. Two interlayer shear modes are observed in aa-stacked NL-ReS2 while only one interlayer shear mode appears in a-b-stacked NL-ReS2, suggesting anisotropic-like and isotropic-like stacking orders in aa- and a-b-stacked NL-ReS2, respectively. The frequency of the interlayer shear and breathing modes reveals unexpected strong interlayer coupling in aa- and a-b-NL-ReS2, the force constants of which are 55-90% to those of multilayer MoS2. The observation of strong interlayer coupling and polytypism in multi-layer ReS2 stimulate future studies on the structure, electronic and optical properties of other 2D anisotropic materials.

preprint2015arXiv

Substrate-free layer-number identification of two-dimensional materials: A case of Mo$_{0.5}$W$_{0.5}$S$_2$ alloy

Any of two or more two-dimensional (2D) materials with similar properties can be alloyed into a new layered material, namely, 2D alloy. Individual monolayer in 2D alloys are kept together by Van der Waals interactions. The property of multilayer alloys is a function of their layer number. Here, we studied the shear (C) and layer-breathing (LB) modes of Mo$_{0.5}$W$_{0.5}$S$_2$ alloy flakes and their link to the layer number of alloy flakes. The study reveals that the disorder effect is absent in the C and LB modes of 2D alloys, and the monatomic chain model can be used to estimate the frequencies of the C and LB modes. We demonstrated how to use the C and LB mode frequency to identify the layer number of alloy flakes deposited on different substrates. This technique is independent of the substrate, stoichiometry, monolayer thickness and complex refractive index of 2D materials, offering a robust and substrate-free approach for layer-number identification of 2D materials.

preprint2014arXiv

Assessing Technical Performance in Differential Gene Expression Experiments with External Spike-in RNA Control Ratio Mixtures

There is a critical need for standard approaches to assess, report, and compare the technical performance of genome-scale differential gene expression experiments. We assess technical performance with a proposed "standard" dashboard of metrics derived from analysis of external spike-in RNA control ratio mixtures. These control ratio mixtures with defined abundance ratios enable assessment of diagnostic performance of differentially expressed transcript lists, limit of detection of ratio (LODR) estimates, and expression ratio variability and measurement bias. The performance metrics suite is applicable to analysis of a typical experiment, and here we also apply these metrics to evaluate technical performance among laboratories. An interlaboratory study using identical samples shared amongst 12 laboratories with three different measurement processes demonstrated generally consistent diagnostic power across 11 laboratories. Ratio measurement variability and bias were also comparable amongst laboratories for the same measurement process. Different biases were observed for measurement processes using different mRNA enrichment protocols.

preprint2014arXiv

EXTRA: An Exact First-Order Algorithm for Decentralized Consensus Optimization

Recently, there have been growing interests in solving consensus optimization problems in a multi-agent network. In this paper, we develop a decentralized algorithm for the consensus optimization problem $$\min\limits_{x\in\mathbb{R}^p}~\bar{f}(x)=\frac{1}{n}\sum\limits_{i=1}^n f_i(x),$$ which is defined over a connected network of $n$ agents, where each function $f_i$ is held privately by agent $i$ and encodes the agent's data and objective. All the agents shall collaboratively find the minimizer while each agent can only communicate with its neighbors. Such a computation scheme avoids a data fusion center or long-distance communication and offers better load balance to the network. This paper proposes a novel decentralized EXact firsT-ordeR Algorithm (abbreviated as EXTRA) to solve the consensus optimization problem. "exact" means that it can converge to the exact solution. EXTRA can use a fixed large step size, {which is independent of the network size}, and has synchronized iterations. The local variable of every agent $i$ converges uniformly and consensually to an exact minimizer of $\bar{f}$. In contrast, the well-known decentralized gradient descent (DGD) method must use diminishing step sizes in order to converge to an exact minimizer. EXTRA and DGD have the same choice of mixing matrices and similar per-iteration complexity. EXTRA, however, uses the gradients of last two iterates, unlike DGD which uses just that of last iterate. EXTRA has the best known convergence rates among the existing first-order decentralized algorithms. Specifically, if $f_i$'s are convex and have Lipschitz continuous gradients, EXTRA has an ergodic convergence rate $O(\frac{1}{k})$ in terms of the first-order optimality residual. If $\bar{f}$ is also restricted strongly convex, EXTRA converges to an optimal solution at a linear rate $O(C^{-k})$ for some constant $C>1$.

preprint2014arXiv

On the Linear Convergence of the ADMM in Decentralized Consensus Optimization

In decentralized consensus optimization, a connected network of agents collaboratively minimize the sum of their local objective functions over a common decision variable, where their information exchange is restricted between the neighbors. To this end, one can first obtain a problem reformulation and then apply the alternating direction method of multipliers (ADMM). The method applies iterative computation at the individual agents and information exchange between the neighbors. This approach has been observed to converge quickly and deemed powerful. This paper establishes its linear convergence rate for decentralized consensus optimization problem with strongly convex local objective functions. The theoretical convergence rate is explicitly given in terms of the network topology, the properties of local objective functions, and the algorithm parameter. This result is not only a performance guarantee but also a guideline toward accelerating the ADMM convergence.

preprint2013arXiv

featureCounts: An efficient general-purpose program for assigning sequence reads to genomic features

Next-generation sequencing technologies generate millions of short sequence reads, which are usually aligned to a reference genome. In many applications, the key information required for downstream analysis is the number of reads mapping to each genomic feature, for example to each exon or each gene. The process of counting reads is called read summarization. Read summarization is required for a great variety of genomic analyses but has so far received relatively little attention in the literature. We present featureCounts, a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments. featureCounts implements highly efficient chromosome hashing and feature blocking techniques. It is considerably faster than existing methods (by an order of magnitude for gene-level summarization) and requires far less computer memory. It works with either single or paired-end reads and provides a wide range of options appropriate for different sequencing applications. featureCounts is available under GNU General Public License as part of the Subread (http://subread.sourceforge.net) or Rsubread (http://www.bioconductor.org) software packages.

Wei Shi

What is connected

Connect this record

See the researcher in context

Building this map preview

39 published item(s)

Caracal: Causal Architecture via Spectral Mixing

Identity-Sensitive Knowledge Propagation for Cloth-Changing Person Re-identification

Implicit Channel Learning for Machine Learning Applications in 6G Wireless Networks

Preserving Dense Features for Ki67 Nuclei Detection

RobustAnalog: Fast Variation-Aware Analog Circuit Design Via Multi-task RL

SpinQ Triangulum: a commercial three-qubit desktop quantum computer

Supervised Contrastive CSI Representation Learning for Massive MIMO Positioning

Obfuscation of Images via Differential Privacy: From Facial Images to General Images

SpinQ Gemini: a desktop quantum computer for education and research

Accelerating Incremental Gradient Optimization with Curvature Information

Chip-scale Full-Stokes Spectropolarimeter in Silicon Photonic Circuits

Differential Privacy Via a Truncated and Normalized Laplace Mechanism

Joint Embedding in Named Entity Linking on Sentence Level

Push-Pull Gradient Methods for Distributed Optimization in Networks

Self-Refining Deep Symmetry Enhanced Network for Rain Removal

SNEAP: A Fast and Efficient Toolchain for Mapping Large-Scale Spiking Neural Network onto NoC-based Neuromorphic Platform

The Unusual Eruption of the Extragalactic Classical Nova M31N 2017-09a

A decentralized proximal-gradient method with network independent step-sizes and separated convergence rates

Automatic image-domain Moire artifact reduction method in grating-based x-ray interferometry imaging

A Decentralized Second-Order Method for Dynamic Optimization

A Decentralized Second-Order Method with Exact Linear Convergence Rate for Consensus Optimization

Decentralized Dynamic Optimization for Power Network Voltage Control

Expander Graph and Communication-Efficient Decentralized Optimization

Geometrically Convergent Distributed Optimization with Uncoordinated Step-Sizes

Hierarchy, dimension, attractor and self-organization -- dynamics of mode-locked fiber lasers

Interface coupling in twisted multilayer graphene by resonant Raman spectroscopy of layer breathing modes

A Novel Geographic Partitioning System for Anonymizing Health Care Data

Decentralized Quadratically Approximated Alternating Direction Method of Multipliers

DQM: Decentralized Quadratically Approximated Alternating Direction Method of Multipliers

Geographic Partitioning Techniques for the Anonymization of Health Care Data

Monolayer Molybdenum Disulfide Nanoribbons with High Optical Anisotropy

On Calibration of Three-axis Magnetometer

Phonon and Raman scattering of two-dimensional transition metal dichalcogenides from monolayer, multilayer to bulk material

Polytypism and Unexpected Strong Interlayer Coupling of two-Dimensional Layered ReS2

Substrate-free layer-number identification of two-dimensional materials: A case of Mo$_{0.5}$W$_{0.5}$S$_2$ alloy

Assessing Technical Performance in Differential Gene Expression Experiments with External Spike-in RNA Control Ratio Mixtures

EXTRA: An Exact First-Order Algorithm for Decentralized Consensus Optimization

On the Linear Convergence of the ADMM in Decentralized Consensus Optimization

featureCounts: An efficient general-purpose program for assigning sequence reads to genomic features