Source author record

Peng Lin

Peng Lin appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.OC Artificial Intelligence cond-mat.mtrl-sci Computer Science and Game Theory Emerging Technologies Hardware Architecture Information Theory math.IT Networking and Internet Architecture Neural and Evolutionary Computing physics.comp-ph physics.soc-ph

Catalog footprint

What is connected

12works

12topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2023arXiv

Darwin3: A large-scale neuromorphic chip with a Novel ISA and On-Chip Learning

Spiking Neural Networks (SNNs) are gaining increasing attention for their biological plausibility and potential for improved computational efficiency. To match the high spatial-temporal dynamics in SNNs, neuromorphic chips are highly desired to execute SNNs in hardware-based neuron and synapse circuits directly. This paper presents a large-scale neuromorphic chip named Darwin3 with a novel instruction set architecture(ISA), which comprises 10 primary instructions and a few extended instructions. It supports flexible neuron model programming and local learning rule designs. The Darwin3 chip architecture is designed in a mesh of computing nodes with an innovative routing algorithm. We used a compression mechanism to represent synaptic connections, significantly reducing memory usage. The Darwin3 chip supports up to 2.35 million neurons, making it the largest of its kind in neuron scale. The experimental results showed that code density was improved up to 28.3x in Darwin3, and neuron core fan-in and fan-out were improved up to 4096x and 3072x by connection compression compared to the physical memory depth. Our Darwin3 chip also provided memory saving between 6.8X and 200.8X when mapping convolutional spiking neural networks (CSNN) onto the chip, demonstrating state-of-the-art performance in accuracy and latency compared to other neuromorphic chips.

preprint2021arXiv

Echo state graph neural networks with analogue random resistor arrays

Recent years have witnessed an unprecedented surge of interest, from social networks to drug discovery, in learning representations of graph-structured data. However, graph neural networks, the machine learning models for handling graph-structured data, face significant challenges when running on conventional digital hardware, including von Neumann bottleneck incurred by physically separated memory and processing units, slowdown of Moore's law due to transistor scaling limit, and expensive training cost. Here we present a novel hardware-software co-design, the random resistor array-based echo state graph neural network, which addresses these challenges. The random resistor arrays not only harness low-cost, nanoscale and stackable resistors for highly efficient in-memory computing using simple physical laws, but also leverage the intrinsic stochasticity of dielectric breakdown to implement random projections in hardware for an echo state network that effectively minimizes the training cost thanks to its fixed and random weights. The system demonstrates state-of-the-art performance on both graph classification using the MUTAG and COLLAB datasets and node classification using the CORA dataset, achieving 34.2x, 93.2x, and 570.4x improvement of energy efficiency and 98.27%, 99.46%, and 95.12% reduction of training cost compared to conventional graph learning on digital hardware, respectively, which may pave the way for the next generation AI system for graph learning.

preprint2021arXiv

Implementation of annihilation and junction reactions in vector density-based continuum dislocation dynamics

In a continuum dislocation dynamics formulation by Xia and El-Azab, dislocations are represented by a set of vector density fields, one per crystallographic slip systems. The space-time evolution of these densities is obtained by solving a set of dislocation transport equations coupled with crystal mechanics. Here, we present an approach for incorporating dislocation annihilation and junction reactions into the dislocation transport equations. These reactions consume dislocations and result in nothing as in the annihilation reactions, or produce new dislocations of different types as in the case of junction reactions. Collinear annihilation, glissile junctions, and sessile junctions are particularly emphasized here. A generalized energy-based criterion for junction reactions is established in terms of the dislocation density and Burgers vectors of the reacting species, and the reaction rate terms for junction reactions are formulated in terms of the dislocation densities. In order to illustrate how the dislocation network changes as a result of junction formation and annihilation in a continuum dislocation dynamics setting, we present some numerical examples focusing on the reactions processes themselves. The results show that our modeling approach is able to capture the respective dislocation network changes associated with dislocation reactions in FCC crystals: dislocations of opposite line directions encountering each other on collinear slip systems annihilate to connect the dislocations on the two slip systems, glissile junctions form on new slip system behave like Frank-Read sources, and sessile junctions form and expand along the intersection of the slip planes of the reacting dislocation species. A collective-dynamics test showing the frequency of occurrence of junctions of different types relative to each other is also presented.

preprint2021arXiv

On the computational solution of vector-density based continuum dislocation dynamics models: a comparison of two plastic distortion and stress update algorithms

Continuum dislocation dynamics models of mesoscale plasticity consist of dislocation transport-reaction equations coupled with crystal mechanics equations. The coupling between these two sets of equations is such that dislocation transport gives rise to the evolution of plastic distortion (strain), while the evolution of the latter fixes the stress from which the dislocation velocity field is found via a mobility law. Earlier solutions of these equations employed a staggered solution scheme for the two sets of equations in which the plastic distortion was updated via time integration of its rate, as found from Orowan's law. In this work, we show that such a direct time integration scheme can suffer from accumulation of numerical errors. We introduce an alternative scheme based on field dislocation mechanics that ensures consistency between the plastic distortion and the dislocation content in the crystal. The new scheme is based on calculating the compatible and incompatible parts of the plastic distortion separately, and the incompatible part is calculated from the current dislocation density field. Stress field and dislocation transport calculations were implemented within a finite element based discretization of the governing equations, with the crystal mechanics part solved by a conventional Galerkin method and the dislocation transport equations by the least squares method. A simple test is first performed to show the accuracy of the two schemes for updating the plastic distortion, which shows that the solution method based on field dislocation mechanics is more accurate. This method then was used to simulate an austenitic steel crystal under uniaxial loading and multiple slip conditions.

preprint2020arXiv

Distributed Continuous-Time and Discrete-Time Optimization With Nonuniform Unbounded Convex Constraint Sets and Nonuniform Stepsizes

This paper is devoted to distributed continuous-time and discrete-time optimization problems with nonuniform convex constraint sets and nonuniform stepsizes for general differentiable convex objective functions. The communication graphs are not required to be strongly connected at any time, the gradients of the local objective functions are not required to be bounded when their independent variables tend to infinity, and the constraint sets are not required to be bounded. For continuous-time multi-agent systems, a distributed continuous algorithm is first introduced where the stepsizes and the convex constraint sets are both nonuniform. It is shown that all agents reach a consensus while minimizing the team objective function even when the constraint sets are unbounded. After that, the obtained results are extended to discrete-time multi-agent systems and then the case where each agent remains in a corresponding convex constraint set is studied. To ensure all agents to remain in a bounded region, a switching mechanism is introduced in the algorithms. It is shown that the distributed optimization problems can be solved, even though the discretization of the algorithms might deviate the convergence of the agents from the minimum of the objective functions. Finally, numerical examples are included to show the obtained theoretical results.

preprint2020arXiv

Distributed optimization with nonconvex velocity constraints, nonuniform position constraints and nonuniform stepsizes

This note is devoted to the distributed optimization problem of multi-agent systems with nonconvex velocity constraints, nonuniform position constraints and nonuniform stepsizes. Two distributed constrained algorithms with nonconvex velocity constraints and nonuniform stepsizes are proposed in the absence and the presence of nonuniform position constraints by introducing a switching mechanism to guarantee all agents' position states to remain in a bounded region. The algorithm gains need not to be predesigned and can be selected by each agent using its own and neighbours' information. By a model transformation, the original nonlinear time-varying system is converted into a linear time-varying one with a nonlinear error term. Based on the properties of stochastic matrices, it is shown that the optimization problem can be solved as long as the communication topologies are jointly strongly connected and balanced. Numerical examples are given to show the obtained theoretical results.

preprint2020arXiv

Distributed velocity-constrained consensus of discrete-time multi-agent systems with nonconvex constraints, switching topologies, and delays

In this paper, a distributed velocity-constrained consensus problem is studied for discrete-time multi-agent systems, where each agent's velocity is constrained to lie in a nonconvex set. A distributed constrained control algorithm is proposed to enable all agents to converge to a common point using only local information. {The gains of the algorithm for all agents need not to be the same or predesigned and can be adjusted by each agent itself based on its own and neighbors' information.} It is shown that the algorithm is robust to arbitrarily bounded communication delays and arbitrarily switching communication graphs provided that the union of the graphs has directed spanning trees among each certain time interval. The analysis approach is based on multiple novel model transformations, proper control parameter selections, boundedness analysis of state-dependent stochastic matrices, exploitation of the convexity of stochastic matrices, and the joint connectivity of the communication graphs. Numerical examples are included to illustrate the theoretical results.

preprint2020arXiv

On the implementation of dislocation reactions in continuum dislocation dynamics modeling of mesoscale plasticity

The continuum dislocation dynamics framework for mesoscale plasticity is intended to capture the dislocation density evolution and the deformation of crystals when subjected to mechanical loading. It does so by solving a set of transport equations for dislocations concurrently with crystal mechanics equations, with the latter being cast in the form of an eigenstrain problem. Incorporating dislocation reactions in the dislocation transport equations is essential for making such continuum dislocation dynamics predictive. A formulation is proposed to incorporate dislocation reactions in the transport equations of the vector density-based continuum dislocation dynamics. This formulation aims to rigorously enforce dislocation line continuity using the concept of virtual dislocations that close all dislocation loops involved in cross slip, annihilation, and glissile and sessile junction reactions. The addition of virtual dislocations enables us to accurately enforce the divergence free condition upon the numerical solution of the dislocation transport equations for all slip systems individually. A set of tests were performed to illustrate the accuracy of the formulation and the solution of the transport equations within the vector density-based continuum dislocation dynamics. Comparing the results from these tests with an earlier approach in which the divergence free constraint was enforced on the total dislocation density tensor or the sum of two densities when only cross slip is considered shows that the new approach yields highly accurate results. Bulk simulations were performed for a face centered cubic crystal based on the new formulation and the results were compared with discrete dislocation dynamics predictions of the same. The microstructural features obtained from continuum dislocation dynamics were also analyzed with reference to relevant experimental observations.

preprint2016arXiv

Region Based Approximation for High Dimensional Bayesian Network Models

Performing efficient inference on Bayesian Networks (BNs), with large numbers of densely connected variables is challenging. With exact inference methods, such as the Junction Tree algorithm, clustering complexity can grow exponentially with the number of nodes and so computation becomes intractable. This paper presents a general purpose approximate inference algorithm called Triplet Region Construction (TRC) that reduces the clustering complexity for factorized models from worst case exponential to polynomial. We employ graph factorization to reduce connection complexity and produce clusters of limited size. Unlike MCMC algorithms TRC is guaranteed to converge and we present experiments that show that TRC achieves accurate results when compared with exact solutions.

preprint2015arXiv

Discrete Element Crowd Model for Pedestrian Evacuation Through an Exit

A series of accidents caused by crowd within the last decades evoked a lot of scientific interest in modeling the movement of pedestrian crowds. Based on discrete element method, a granular dynamic model, in which human body is simplified as self-driven sphere, is proposed to simulate the characteristics of crowd flow through an exit. In this model, the repulsive force among people is considered to have an anisotropic feature, and the physical contact force due to body deformation is quantified by the Hertz contact model. The movement of human body is simulated by applying the second Newton's law. The crowd flow through an exit at different desired velocities is studied and simulation results indicated that crowd flow exhibits three distinct states, i.e., smooth state, transition state and phase separation state. In the simulation, clogging phenomenon occurs more easily when the velocity of desired is high and the exit may as a result be totally blocked at a desired velocity of 1.6m/s or above, leading to faster-to-frozen effect.

preprint2015arXiv

Performing Bayesian Risk Aggregation using Discrete Approximation Algorithms with Graph Factorization

Risk aggregation is a popular method used to estimate the sum of a collection of financial assets or events, where each asset or event is modelled as a random variable. Applications, in the financial services industry, include insurance, operational risk, stress testing, and sensitivity analysis, but the problem is widely encountered in many other application domains. This thesis has contributed two algorithms to perform Bayesian risk aggregation when model exhibit hybrid dependency and high dimensional inter-dependency. The first algorithm operates on a subset of the general problem, with an emphasis on convolution problems, in the presence of continuous and discrete variables (so called hybrid models) and the second algorithm offer a universal method for general purpose inference over much wider classes of Bayesian Network models.

preprint2014arXiv

FlexAuc: Serving Dynamic Demands in a Spectrum Trading Market with Flexible Auction

In secondary spectrum trading markets, auctions are widely used by spectrum holders (SHs) to redistribute their unused channels to secondary wireless service providers (WSPs). As sellers, the SHs design proper auction schemes to stimulate more participants and maximize the revenue from the auction. As buyers, the WSPs determine the bidding strategies in the auction to better serve their end users. In this paper, we consider a three-layered spectrum trading market consisting of the SH, the WSPs and the end users. We jointly study the strategies of the three parties. The SH determines the auction scheme and spectrum supplies to optimize its revenue. The WSPs have flexible bidding strategies in terms of both demands and valuations considering the strategies of the end users. We design FlexAuc, a novel auction mechanism for this market to enable dynamic supplies and demands in the auction. We prove theoretically that FlexAuc not only maximizes the social welfare but also preserves other nice properties such as truthfulness and computational tractability.

Peng Lin

What is connected

Connect this record

See the researcher in context

Building this map preview

12 published item(s)

Darwin3: A large-scale neuromorphic chip with a Novel ISA and On-Chip Learning

Echo state graph neural networks with analogue random resistor arrays

Implementation of annihilation and junction reactions in vector density-based continuum dislocation dynamics

On the computational solution of vector-density based continuum dislocation dynamics models: a comparison of two plastic distortion and stress update algorithms

Distributed Continuous-Time and Discrete-Time Optimization With Nonuniform Unbounded Convex Constraint Sets and Nonuniform Stepsizes

Distributed optimization with nonconvex velocity constraints, nonuniform position constraints and nonuniform stepsizes

Distributed velocity-constrained consensus of discrete-time multi-agent systems with nonconvex constraints, switching topologies, and delays

On the implementation of dislocation reactions in continuum dislocation dynamics modeling of mesoscale plasticity

Region Based Approximation for High Dimensional Bayesian Network Models

Discrete Element Crowd Model for Pedestrian Evacuation Through an Exit

Performing Bayesian Risk Aggregation using Discrete Approximation Algorithms with Graph Factorization

FlexAuc: Serving Dynamic Demands in a Spectrum Trading Market with Flexible Auction