Source author record

Bin Gu

Bin Gu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Artificial Intelligence eess.AS Sound astro-ph.SR Distributed, Parallel, and Cluster Computing eess.SP math.OC physics.chem-ph Software Engineering Systems and Control Computation and Language cond-mat.mtrl-sci cond-mat.soft eess.SY Logic in Computer Science physics.comp-ph physics.med-ph physics.space-ph

Catalog footprint

What is connected

24works

19topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A Stage-Wise Learning Strategy with Fixed Anchors for Robust Speaker Verification

Learning robust speaker representations under noisy conditions presents significant challenges, which requires careful handling of both discriminative and noise-invariant properties. In this work, we proposed an anchor-based stage-wise learning strategy for robust speaker representation learning. Specifically, our approach begins by training a base model to establish discriminative speaker boundaries, and then extract anchor embeddings from this model as stable references. Finally, a copy of the base model is fine-tuned on noisy inputs, regularized by enforcing proximity to their corresponding fixed anchor embeddings to preserve speaker identity under distortion. Experimental results suggest that this strategy offers advantages over conventional joint optimization, particularly in maintaining discrimination while improving noise robustness. The proposed method demonstrates consistent improvements across various noise conditions, potentially due to its ability to handle boundary stabilization and variation suppression separately.

preprint2026arXiv

New Insight of Variance reduce in Zero-Order Hard-Thresholding: Mitigating Gradient Error and Expansivity Contradictions

Hard-thresholding is an important type of algorithm in machine learning that is used to solve $\ell_0$ constrained optimization problems. However, the true gradient of the objective function can be difficult to access in certain scenarios, which normally can be approximated by zeroth-order (ZO) methods. The SZOHT algorithm is the only algorithm tackling $\ell_0$ sparsity constraints with ZO gradients so far. Unfortunately, SZOHT has a notable limitation on the number of random directions % in ZO gradients due to the inherent conflict between the deviation of ZO gradients and the expansivity of the hard-thresholding operator. This paper approaches this problem by considering the role of variance and provides a new insight into variance reduction: mitigating the unique conflicts between ZO gradients and hard-thresholding. Under this perspective, we propose a generalized variance reduced ZO hard-thresholding algorithm as well as the generalized convergence analysis under standard assumptions. The theoretical results demonstrate the new algorithm eliminates the restrictions on the number of random directions, leading to improved convergence rates and broader applicability compared with SZOHT. Finally, we illustrate the utility of our method on a ridge regression problem as well as black-box adversarial attacks.

preprint2022arXiv

Balanced Self-Paced Learning for AUC Maximization

Learning to improve AUC performance is an important topic in machine learning. However, AUC maximization algorithms may decrease generalization performance due to the noisy data. Self-paced learning is an effective method for handling noisy data. However, existing self-paced learning methods are limited to pointwise learning, while AUC maximization is a pairwise learning problem. To solve this challenging problem, we innovatively propose a balanced self-paced AUC maximization algorithm (BSPAUC). Specifically, we first provide a statistical objective for self-paced AUC. Based on this, we propose our self-paced AUC maximization formulation, where a novel balanced self-paced regularization term is embedded to ensure that the selected positive and negative samples have proper proportions. Specially, the sub-problem with respect to all weight variables may be non-convex in our formulation, while the one is normally convex in existing self-paced problems. To address this, we propose a doubly cyclic block coordinate descent method. More importantly, we prove that the sub-problem with respect to all weight variables converges to a stationary point on the basis of closed-form solutions, and our BSPAUC converges to a stationary point of our fixed optimization objective under a mild assumption. Considering both the deep learning and kernel-based implementations, experimental results on several large-scale datasets demonstrate that our BSPAUC has a better generalization performance than existing state-of-the-art AUC maximization methods.

preprint2022arXiv

Bragg's Additivity Rule and Core and Bond model studied by real-time TDDFT electronic stopping simulations: the case of water vapor

The electronic stopping power ($S_e$) of water vapor (H$_2$O), hydrogen (H$_2$) and oxygen (O$_2$) gases for protons in a broad range of energies, centered in the Bragg peak, was calculated using real-time time-dependent density functional theory (rt-TDDFT) simulations with Gaussian basis sets. This was done for a kinetic energy of incident protons ($E_k$) ranging from 1.56 keV/amu to 1.6 MeV/amu. $S_e$ was calculated as the average over geometrically pre-sampled short ion trajectories. The average $S_e(E_k)$ values were found to rapidly converge with 25-30 pre-sampled, 2 nm-long ion trajectories. The rt-TDDFT $S_e(E_k)$ curves were compared to experimental and SRIM data, and used to validate the Bragg's Additivity Rule (BAR). Discrepancies were analyzed in terms of basis set effects and omitted nuclear stopping at low energies. At variance with SRIM, we found that BAR is applicable to our rt-TDDFT simulations of $\mathrm{2H_2+O_2}$ $\mathrm{\rightarrow 2H_2O}$ without scaling for $E_k>40$ keV/amu. The hydrogen and oxygen Core and Bond (CAB) contributions to electronic stopping were calculated and found to be slightly smaller than SRIM values as a result of a red-shift in our rt-TDDFT $S_e(E_k)$ curves and a re-distribution of weights due to some bond contributions being neglected in SRIM.

preprint2022arXiv

Desirable Companion for Vertical Federated Learning: New Zeroth-Order Gradient Based Algorithm

Vertical federated learning (VFL) attracts increasing attention due to the emerging demands of multi-party collaborative modeling and concerns of privacy leakage. A complete list of metrics to evaluate VFL algorithms should include model applicability, privacy security, communication cost, and computation efficiency, where privacy security is especially important to VFL. However, to the best of our knowledge, there does not exist a VFL algorithm satisfying all these criteria very well. To address this challenging problem, in this paper, we reveal that zeroth-order optimization (ZOO) is a desirable companion for VFL. Specifically, ZOO can 1) improve the model applicability of VFL framework, 2) prevent VFL framework from privacy leakage under curious, colluding, and malicious threat models, 3) support inexpensive communication and efficient computation. Based on that, we propose a novel and practical VFL framework with black-box models, which is inseparably interconnected to the promising properties of ZOO. We believe that it takes one stride towards designing a practical VFL framework matching all the criteria. Under this framework, we raise two novel {\bf asy}nchronous ze{\bf r}oth-ord{\bf e}r algorithms for {\bf v}ertical f{\bf e}derated {\bf l}earning (AsyREVEL) with different smoothing techniques. We theoretically drive the convergence rates of AsyREVEL algorithms under nonconvex condition. More importantly, we prove the privacy security of our proposed framework under existing VFL attacks on different levels. Extensive experiments on benchmark datasets demonstrate the favorable model applicability, satisfied privacy security, inexpensive communication, efficient computation, scalability and losslessness of our framework.

preprint2022arXiv

Learning to Control under Time-Varying Environment

This paper investigates the problem of regret minimization in linear time-varying (LTV) dynamical systems. Due to the simultaneous presence of uncertainty and non-stationarity, designing online control algorithms for unknown LTV systems remains a challenging task. At a cost of NP-hard offline planning, prior works have introduced online convex optimization algorithms, although they suffer from nonparametric rate of regret. In this paper, we propose the first computationally tractable online algorithm with regret guarantees that avoids offline planning over the state linear feedback policies. Our algorithm is based on the optimism in the face of uncertainty (OFU) principle in which we optimistically select the best model in a high confidence region. Our algorithm is then more explorative when compared to previous approaches. To overcome non-stationarity, we propose either a restarting strategy (R-OFU) or a sliding window (SW-OFU) strategy. With proper configuration, our algorithm is attains sublinear regret $O(T^{2/3})$. These algorithms utilize data from the current phase for tracking variations on the system dynamics. We corroborate our theoretical findings with numerical experiments, which highlight the effectiveness of our methods. To the best of our knowledge, our study establishes the first model-based online algorithm with regret guarantees under LTV dynamical systems.

preprint2022arXiv

Multi-Level Contrastive Learning for Cross-Lingual Alignment

Cross-language pre-trained models such as multilingual BERT (mBERT) have achieved significant performance in various cross-lingual downstream NLP tasks. This paper proposes a multi-level contrastive learning (ML-CTL) framework to further improve the cross-lingual ability of pre-trained models. The proposed method uses translated parallel data to encourage the model to generate similar semantic embeddings for different languages. However, unlike the sentence-level alignment used in most previous studies, in this paper, we explicitly integrate the word-level information of each pair of parallel sentences into contrastive learning. Moreover, cross-zero noise contrastive estimation (CZ-NCE) loss is proposed to alleviate the impact of the floating-point error in the training process with a small batch size. The proposed method significantly improves the cross-lingual transfer ability of our basic model (mBERT) and outperforms on multiple zero-shot cross-lingual downstream tasks compared to the same-size models in the Xtreme benchmark.

preprint2021arXiv

Optimizing Large-Scale Hyperparameters via Automated Learning Algorithm

Modern machine learning algorithms usually involve tuning multiple (from one to thousands) hyperparameters which play a pivotal role in terms of model generalizability. Black-box optimization and gradient-based algorithms are two dominant approaches to hyperparameter optimization while they have totally distinct advantages. How to design a new hyperparameter optimization technique inheriting all benefits from both approaches is still an open problem. To address this challenging problem, in this paper, we propose a new hyperparameter optimization method with zeroth-order hyper-gradients (HOZOG). Specifically, we first exactly formulate hyperparameter optimization as an A-based constrained optimization problem, where A is a black-box optimization algorithm (such as deep neural network). Then, we use the average zeroth-order hyper-gradients to update hyperparameters. We provide the feasibility analysis of using HOZOG to achieve hyperparameter optimization. Finally, the experimental results on three representative hyperparameter (the size is from 1 to 1250) optimization tasks demonstrate the benefits of HOZOG in terms of simplicity, scalability, flexibility, effectiveness and efficiency compared with the state-of-the-art hyperparameter optimization methods.

preprint2021arXiv

Secure Bilevel Asynchronous Vertical Federated Learning with Backward Updating

Vertical federated learning (VFL) attracts increasing attention due to the emerging demands of multi-party collaborative modeling and concerns of privacy leakage. In the real VFL applications, usually only one or partial parties hold labels, which makes it challenging for all parties to collaboratively learn the model without privacy leakage. Meanwhile, most existing VFL algorithms are trapped in the synchronous computations, which leads to inefficiency in their real-world applications. To address these challenging problems, we propose a novel {\bf VF}L framework integrated with new {\bf b}ackward updating mechanism and {\bf b}ilevel asynchronous parallel architecture (VF{${\textbf{B}}^2$}), under which three new algorithms, including VF{${\textbf{B}}^2$}-SGD, -SVRG, and -SAGA, are proposed. We derive the theoretical results of the convergence rates of these three algorithms under both strongly convex and nonconvex conditions. We also prove the security of VF{${\textbf{B}}^2$} under semi-honest threat models. Extensive experiments on benchmark datasets demonstrate that our algorithms are efficient, scalable and lossless.

preprint2020arXiv

An Improved Deep Neural Network for Modeling Speaker Characteristics at Different Temporal Scales

This paper presents an improved deep embedding learning method based on convolutional neural network (CNN) for text-independent speaker verification. Two improvements are proposed for x-vector embedding learning: (1) Multi-scale convolution (MSCNN) is adopted in frame-level layers to capture complementary speaker information in different receptive fields. (2) A Baum-Welch statistics attention (BWSA) mechanism is applied in pooling-layer, which can integrate more useful long-term speaker characteristics in the temporal pooling layer. Experiments are carried out on the NIST SRE16 evaluation set. The results demonstrate the effectiveness of MSCNN and show the proposed BWSA can further improve the performance of the DNN embedding system

preprint2020arXiv

Efficient ab initio calculation of electronic stopping in disordered systems via geometry pre-sampling: application to liquid water

Knowledge of the electronic stopping curve for swift ions, $S_e(v)$, particularly around the Bragg peak, is important for understanding radiation damage. Experimentally, however, the determination of such feature for light ions is very challenging, especially in disordered systems such as liquid water and biological tissue. Recent developments in real-time time-dependent density functional theory (rt-TDDFT) have enabled the calculation of $S_e(v)$ along nm-sized trajectories. However, it is still a challenge to obtain a meaningful statistically averaged $S_e(v)$ that can be compared to observations. In this work, taking advantage of the correlation between the local electronic structure probed by the projectile and the distance from the projectile to the atoms in the target, we devise a trajectory pre-sampling scheme to select, geometrically, a small set of short trajectories to accelerate the convergence of the averaged $S_e(v)$ computed via rt-TDDFT. For protons in liquid water, we first calculate the reference probability distribution function (PDF) for the distance from the proton to the closest oxygen atom, $ϕ_R(r_{p{\rightarrow}O})$, for a trajectory of a length similar to those sampled experimentally. Then, short trajectories are sequentially selected so that the accumulated PDF reproduces $ϕ_R(r_{p{\rightarrow}O})$ to increasingly high accuracy. Using these pre-sampled trajectories, we demonstrate that the averaged $S_e(v_p)$ converges in the whole velocity range with less than eight trajectories, while other averaging methods using randomly and uniformly distributed trajectories require approximately ten times the computational effort. This allows us to compare the $S_e(v_p)$ curve to experimental data, and assess widely used empirical tables based on Bragg's rule.

preprint2020arXiv

Faster On-Device Training Using New Federated Momentum Algorithm

Mobile crowdsensing has gained significant attention in recent years and has become a critical paradigm for emerging Internet of Things applications. The sensing devices continuously generate a significant quantity of data, which provide tremendous opportunities to develop innovative intelligent applications. To utilize these data to train machine learning models while not compromising user privacy, federated learning has become a promising solution. However, there is little understanding of whether federated learning algorithms are guaranteed to converge. We reconsider model averaging in federated learning and formulate it as a gradient-based method with biased gradients. This novel perspective assists analysis of its convergence rate and provides a new direction for more acceleration. We prove for the first time that the federated averaging algorithm is guaranteed to converge for non-convex problems, without imposing additional assumptions. We further propose a novel accelerated federated learning algorithm and provide a convergence guarantee. Simulated federated learning experiments are conducted to train deep neural networks on benchmark datasets, and experimental results show that our proposed method converges faster than previous approaches.

preprint2020arXiv

Federated Doubly Stochastic Kernel Learning for Vertically Partitioned Data

In a lot of real-world data mining and machine learning applications, data are provided by multiple providers and each maintains private records of different feature sets about common entities. It is challenging to train these vertically partitioned data effectively and efficiently while keeping data privacy for traditional data mining and machine learning algorithms. In this paper, we focus on nonlinear learning with kernels, and propose a federated doubly stochastic kernel learning (FDSKL) algorithm for vertically partitioned data. Specifically, we use random features to approximate the kernel mapping function and use doubly stochastic gradients to update the solutions, which are all computed federatedly without the disclosure of data. Importantly, we prove that FDSKL has a sublinear convergence rate, and can guarantee the data security under the semi-honest assumption. Extensive experimental results on a variety of benchmark datasets show that FDSKL is significantly faster than state-of-the-art federated learning methods when dealing with kernels, while retaining the similar generalization performance.

preprint2020arXiv

Gaussian speaker embedding learning for text-independent speaker verification

The x-vector maps segments of arbitrary duration to vectors of fixed dimension using deep neural network. Combined with the probabilistic linear discriminant analysis (PLDA) backend, the x-vector/PLDA has become the dominant framework in text-independent speaker verification. Nevertheless, how to extract the x-vector appropriate for the PLDA backend is a key problem. In this paper, we propose a Gaussian noise constrained network (GNCN) to extract xvector, which adopts a multi-task learning strategy with the primary task classifying the speakers and the auxiliary task just fitting the Gaussian noises. Experiments are carried out using the SITW database. The results demonstrate the effectiveness of our proposed method

preprint2020arXiv

Improved Bilevel Model: Fast and Optimal Algorithm with Theoretical Guarantee

Due to the hierarchical structure of many machine learning problems, bilevel programming is becoming more and more important recently, however, the complicated correlation between the inner and outer problem makes it extremely challenging to solve. Although several intuitive algorithms based on the automatic differentiation have been proposed and obtained success in some applications, not much attention has been paid to finding the optimal formulation of the bilevel model. Whether there exists a better formulation is still an open problem. In this paper, we propose an improved bilevel model which converges faster and better compared to the current formulation. We provide theoretical guarantee and evaluation results over two tasks: Data Hyper-Cleaning and Hyper Representation Learning. The empirical results show that our model outperforms the current bilevel model with a great margin. \emph{This is a concurrent work with \citet{liu2020generic} and we submitted to ICML 2020. Now we put it on the arxiv for record.}

preprint2020arXiv

Large Batch Training Does Not Need Warmup

Training deep neural networks using a large batch size has shown promising results and benefits many real-world applications. However, the optimizer converges slowly at early epochs and there is a gap between large-batch deep learning optimization heuristics and theoretical underpinnings. In this paper, we propose a novel Complete Layer-wise Adaptive Rate Scaling (CLARS) algorithm for large-batch training. We also analyze the convergence rate of the proposed method by introducing a new fine-grained analysis of gradient-based methods. Based on our analysis, we bridge the gap and illustrate the theoretical insights for three popular large-batch training techniques, including linear learning rate scaling, gradual warmup, and layer-wise adaptive rate scaling. Extensive experiments demonstrate that the proposed algorithm outperforms gradual warmup technique by a large margin and defeats the convergence of the state-of-the-art large-batch optimizer in training advanced deep neural networks (ResNet, DenseNet, MobileNet) on ImageNet dataset.

preprint2020arXiv

Privacy-Preserving Asynchronous Federated Learning Algorithms for Multi-Party Vertically Collaborative Learning

The privacy-preserving federated learning for vertically partitioned data has shown promising results as the solution of the emerging multi-party joint modeling application, in which the data holders (such as government branches, private finance and e-business companies) collaborate throughout the learning process rather than relying on a trusted third party to hold data. However, existing federated learning algorithms for vertically partitioned data are limited to synchronous computation. To improve the efficiency when the unbalanced computation/communication resources are common among the parties in the federated learning system, it is essential to develop asynchronous training algorithms for vertically partitioned data while keeping the data privacy. In this paper, we propose an asynchronous federated SGD (AFSGD-VP) algorithm and its SVRG and SAGA variants on the vertically partitioned data. Moreover, we provide the convergence analyses of AFSGD-VP and its SVRG and SAGA variants under the condition of strong convexity. We also discuss their model privacy, data privacy, computational complexities and communication costs. To the best of our knowledge, AFSGD-VP and its SVRG and SAGA variants are the first asynchronous federated learning algorithms for vertically partitioned data. Extensive experimental results on a variety of vertically partitioned datasets not only verify the theoretical results of AFSGD-VP and its SVRG and SAGA variants, but also show that our algorithms have much higher efficiency than the corresponding synchronous algorithms.

preprint2016arXiv

Asynchronous Stochastic Block Coordinate Descent with Variance Reduction

Asynchronous parallel implementations for stochastic optimization have received huge successes in theory and practice recently. Asynchronous implementations with lock-free are more efficient than the one with writing or reading lock. In this paper, we focus on a composite objective function consisting of a smooth convex function $f$ and a block separable convex function, which widely exists in machine learning and computer vision. We propose an asynchronous stochastic block coordinate descent algorithm with the accelerated technology of variance reduction (AsySBCDVR), which are with lock-free in the implementation and analysis. AsySBCDVR is particularly important because it can scale well with the sample size and dimension simultaneously. We prove that AsySBCDVR achieves a linear convergence rate when the function $f$ is with the optimal strong convexity property, and a sublinear rate when $f$ is with the general convexity. More importantly, a near-linear speedup on a parallel system with shared memory can be obtained.

preprint2016arXiv

Decoupled Asynchronous Proximal Stochastic Gradient Descent with Variance Reduction

In the era of big data, optimizing large scale machine learning problems becomes a challenging task and draws significant attention. Asynchronous optimization algorithms come out as a promising solution. Recently, decoupled asynchronous proximal stochastic gradient descent (DAP-SGD) is proposed to minimize a composite function. It is claimed to be able to off-loads the computation bottleneck from server to workers by allowing workers to evaluate the proximal operators, therefore, server just need to do element-wise operations. However, it still suffers from slow convergence rate because of the variance of stochastic gradient is nonzero. In this paper, we propose a faster method, decoupled asynchronous proximal stochastic variance reduced gradient descent method (DAP-SVRG). We prove that our method has linear convergence for strongly convex problem. Large-scale experiments are also conducted in this paper, and results demonstrate our theoretical analysis.

preprint2016arXiv

On the identification of time interval threshold in the twin-CME scenario

Recently it has been suggested that the "twin-CME" scenario Li.etal2012 may be a very effective mechanism in causing extreme Solar Energetic Particle (SEP) events and in particular Ground Level Enhancement (GLE) events. Ding.etal2013 performed a statistical examination of the twin-CME scenario with a total of $126$ fast and wide western Coronal Mass Ejections (CMEs). They found that CMEs having a preceding CME with a speed $>$ 300 $km/s$ within $9$ hours from the same active region have larger probability of leading to large SEP events than CMEs that do not have preceding CMEs. The choice of $9$ hours being the time lag $τ$ between the preceding CME and the main CME was based on some crude estimates of the decay time of the turbulence downstream of the shock driven by the preceding CME. In this work, we examine this choice. For the $126$ fast wide CMEs examined in Ding.etal2013, we vary the time lag $τ$ from $1$ hour to $24$ hours with an increment of $1$ hour. By considering three quantities whose values depend on the choice of this time lag $τ$, we show that the choice of $13$ hours for $τ$ is more appropriate. Our study confirms our earlier result that twin CMEs are more likely to lead to large SEP events than single fast CMEs. The results shown here are of great relevance to space weather studies.

preprint2016arXiv

Seed population in large Solar Energetic Particle events and the twin-CME scenario

It has been recently suggested that large solar energetic particle (SEP) events are often caused by twin CMEs. In the twin-CME scenario, the preceding CME is to provide both an enhanced turbulence level and enhanced seed population at the main CME-driven shock. In this work, we study the effect of the preceding CMEs on the seed population. We examine event-integrated abundance of iron to oxygen ratio (Fe/O) at energies above 25 MeV/nuc for large SEP events in solar cycle 23. We find that the Fe/O ratio (normalized to the reference coronal value of $0.134$) $\leq2.0$ for almost all single-CME events and these events tend to have smaller peak intensities. In comparison, the Fe/O ratio of twin-CME events scatters in a larger range, reaching as high as $8$, suggesting the presence of flare material from perhaps preceding flares. For extremely large SEP events with peak intensity above $1000$ pfu, the Fe/O drop below $2$, indicating that in these extreme events the seed particles are dominated by coronal material than flare material. The Fe/O ratios of Ground level enhancement (GLE) events, all being twin-CME events, scatter in a broad range. For a given Fe/O ratio, GLE events tend to have larger peak intensities than non-GLE events. Using velocity dispersion analysis (VDA), we find that GLE events have lower solar particle release (SPR) heights than non-GLE events, \red{agreeing with earlier results by Reames 2009b.

preprint2016arXiv

Zeroth-order Asynchronous Doubly Stochastic Algorithm with Variance Reduction

Zeroth-order (derivative-free) optimization attracts a lot of attention in machine learning, because explicit gradient calculations may be computationally expensive or infeasible. To handle large scale problems both in volume and dimension, recently asynchronous doubly stochastic zeroth-order algorithms were proposed. The convergence rate of existing asynchronous doubly stochastic zeroth order algorithms is $O(\frac{1}{\sqrt{T}})$ (also for the sequential stochastic zeroth-order optimization algorithms). In this paper, we focus on the finite sums of smooth but not necessarily convex functions, and propose an asynchronous doubly stochastic zeroth-order optimization algorithm using the accelerated technology of variance reduction (AsyDSZOVR). Rigorous theoretical analysis show that the convergence rate can be improved from $O(\frac{1}{\sqrt{T}})$ the best result of existing algorithms to $O(\frac{1}{T})$. Also our theoretical results is an improvement to the ones of the sequential stochastic zeroth-order optimization algorithms.

preprint2013arXiv

MDM: A Mode Diagram Modeling Framework

Periodic control systems used in spacecrafts and automotives are usually period-driven and can be decomposed into different modes with each mode representing a system state observed from outside. Such systems may also involve intensive computing in their modes. Despite the fact that such control systems are widely used in the above-mentioned safety-critical embedded domains, there is lack of domain-specific formal modelling languages for such systems in the relevant industry. To address this problem, we propose a formal visual modeling framework called mode diagram as a concise and precise way to specify and analyze such systems. To capture the temporal properties of periodic control systems, we provide, along with mode diagram, a property specification language based on interval logic for the description of concrete temporal requirements the engineers are concerned with. The statistical model checking technique can then be used to verify the mode diagram models against desired properties. To demonstrate the viability of our approach, we have applied our modelling framework to some real life case studies from industry and helped detect two design defects for some spacecraft control systems.

preprint2012arXiv

MDM: A Mode Diagram Modeling Framework for Periodic Control Systems

Periodic control systems used in spacecrafts and automotives are usually period-driven and can be decomposed into different modes with each mode representing a system state observed from outside. Such systems may also involve intensive computing in their modes. Despite the fact that such control systems are widely used in the above-mentioned safety-critical embedded domains, there is lack of domain-specific formal modelling languages for such systems in the relevant industry. To address this problem, we propose a formal visual modeling framework called MDM as a concise and precise way to specify and analyze such systems. To capture the temporal properties of periodic control systems, we provide, along with MDM, a property specification language based on interval logic for the description of concrete temporal requirements the engineers are concerned with. The statistical model checking technique can then be used to verify the MDM models against desired properties. To demonstrate the viability of our approach, we have applied our modelling framework to some real life case studies from industry and helped detect two design defects for some spacecraft control systems.

Bin Gu

What is connected

Connect this record

See the researcher in context

Building this map preview

24 published item(s)

A Stage-Wise Learning Strategy with Fixed Anchors for Robust Speaker Verification

New Insight of Variance reduce in Zero-Order Hard-Thresholding: Mitigating Gradient Error and Expansivity Contradictions

Balanced Self-Paced Learning for AUC Maximization

Bragg's Additivity Rule and Core and Bond model studied by real-time TDDFT electronic stopping simulations: the case of water vapor

Desirable Companion for Vertical Federated Learning: New Zeroth-Order Gradient Based Algorithm

Learning to Control under Time-Varying Environment

Multi-Level Contrastive Learning for Cross-Lingual Alignment

Optimizing Large-Scale Hyperparameters via Automated Learning Algorithm

Secure Bilevel Asynchronous Vertical Federated Learning with Backward Updating

An Improved Deep Neural Network for Modeling Speaker Characteristics at Different Temporal Scales

Efficient ab initio calculation of electronic stopping in disordered systems via geometry pre-sampling: application to liquid water

Faster On-Device Training Using New Federated Momentum Algorithm

Federated Doubly Stochastic Kernel Learning for Vertically Partitioned Data

Gaussian speaker embedding learning for text-independent speaker verification

Improved Bilevel Model: Fast and Optimal Algorithm with Theoretical Guarantee

Large Batch Training Does Not Need Warmup

Privacy-Preserving Asynchronous Federated Learning Algorithms for Multi-Party Vertically Collaborative Learning

Asynchronous Stochastic Block Coordinate Descent with Variance Reduction

Decoupled Asynchronous Proximal Stochastic Gradient Descent with Variance Reduction

On the identification of time interval threshold in the twin-CME scenario

Seed population in large Solar Energetic Particle events and the twin-CME scenario

Zeroth-order Asynchronous Doubly Stochastic Algorithm with Variance Reduction

MDM: A Mode Diagram Modeling Framework

MDM: A Mode Diagram Modeling Framework for Periodic Control Systems