Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
21works
0followers
18topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

21 published item(s)

preprint2022arXiv

Analyzing Deep Learning Based Brain Tumor Segmentation with Missing MRI Modalities

This technical report presents a comparative analysis of existing deep learning (DL) based approaches for brain tumor segmentation with missing MRI modalities. Approaches evaluated include the Adversarial Co-training Network (ACN) and a combination of mmGAN and DeepMedic. A more stable and easy-to-use version of mmGAN is also open-sourced at a GitHub repository. Using the BraTS2018 dataset, this work demonstrates that the state-of-the-art ACN performs better especially when T1c is missing. While a simple combination of mmGAN and DeepMedic also shows strong potentials when only one MRI modality is missing. Additionally, this work initiated discussions with future research directions for brain tumor segmentation with missing MRI modalities.

preprint2022arXiv

Debiasing Neural Retrieval via In-batch Balancing Regularization

People frequently interact with information retrieval (IR) systems, however, IR models exhibit biases and discrimination towards various demographics. The in-processing fair ranking methods provide a trade-offs between accuracy and fairness through adding a fairness-related regularization term in the loss function. However, there haven't been intuitive objective functions that depend on the click probability and user engagement to directly optimize towards this. In this work, we propose the In-Batch Balancing Regularization (IBBR) to mitigate the ranking disparity among subgroups. In particular, we develop a differentiable \textit{normed Pairwise Ranking Fairness} (nPRF) and leverage the T-statistics on top of nPRF over subgroups as a regularization to improve fairness. Empirical results with the BERT-based neural rankers on the MS MARCO Passage Retrieval dataset with the human-annotated non-gendered queries benchmark \citep{rekabsaz2020neural} show that our IBBR method with nPRF achieves significantly less bias with minimal degradation in ranking performance compared with the baseline.

preprint2022arXiv

Entailment Tree Explanations via Iterative Retrieval-Generation Reasoner

Large language models have achieved high performance on various question answering (QA) benchmarks, but the explainability of their output remains elusive. Structured explanations, called entailment trees, were recently suggested as a way to explain and inspect a QA system's answer. In order to better generate such entailment trees, we propose an architecture called Iterative Retrieval-Generation Reasoner (IRGR). Our model is able to explain a given hypothesis by systematically generating a step-by-step explanation from textual premises. The IRGR model iteratively searches for suitable premises, constructing a single entailment step at a time. Contrary to previous approaches, our method combines generation steps and retrieval of premises, allowing the model to leverage intermediate conclusions, and mitigating the input size limit of baseline encoder-decoder models. We conduct experiments using the EntailmentBank dataset, where we outperform existing benchmarks on premise retrieval and entailment tree generation, with around 300% gain in overall correctness.

preprint2022arXiv

Model Order Reduction for Water Quality Dynamics

A state-space representation of water quality dynamics describing disinfectant (e.g., chlorine) transport dynamics in drinking water distribution networks has been recently proposed. Such representation is a byproduct of space- and time-discretization of the PDE modeling transport dynamics. This results in a large state-space dimension even for small networks with tens of nodes. Although such a state-space model provides a model-driven approach to predict water quality dynamics, incorporating it into model-based control algorithms or state estimators for large networks is challenging and at times intractable. To that end, this paper investigates model order reduction (MOR) methods for water quality dynamics with the objective of performing post-reduction feedback control. The presented investigation focuses on reducing state-dimension by orders of magnitude, the stability of the MOR methods, and the application of these methods to model predictive control.

preprint2022arXiv

Optimal Pump Control for Water Distribution Networks via Data-based Distributional Robustness

In this paper, we propose a data-based methodology to solve a multi-period stochastic optimal water flow (OWF) problem for water distribution networks (WDNs). The framework explicitly considers the pump schedule and water network head level with limited information of demand forecast errors for an extended period simulation. The objective is to determine the optimal feedback decisions of network-connected components, such as nominal pump schedules and tank head levels and reserve policies, which specify device reactions to forecast errors for accommodation of fluctuating water demand. Instead of assuming the uncertainties across the water network are generated by a prescribed certain distribution, we consider ambiguity sets of distributions centered at an empirical distribution, which is based directly on a finite training data set. We use a distance-based ambiguity set with the Wasserstein metric to quantify the distance between the real unknown data-generating distribution and the empirical distribution. This allows our multi-period OWF framework to trade off system performance and inherent sampling errors in the training dataset. Case studies on a three-tank water distribution network systematically illustrate the tradeoff between pump operational cost, risks of constraint violation, and out-of-sample performance.

preprint2021arXiv

Transferable Graph Optimizers for ML Compilers

Most compilers for machine learning (ML) frameworks need to solve many correlated optimization problems to generate efficient machine code. Current ML compilers rely on heuristics based algorithms to solve these optimization problems one at a time. However, this approach is not only hard to maintain but often leads to sub-optimal solutions especially for newer model architectures. Existing learning based approaches in the literature are sample inefficient, tackle a single optimization problem, and do not generalize to unseen graphs making them infeasible to be deployed in practice. To address these limitations, we propose an end-to-end, transferable deep reinforcement learning method for computational graph optimization (GO), based on a scalable sequential attention mechanism over an inductive graph neural network. GO generates decisions on the entire graph rather than on each individual node autoregressively, drastically speeding up the search compared to prior methods. Moreover, we propose recurrent attention layers to jointly optimize dependent graph optimization tasks and demonstrate 33%-60% speedup on three graph optimization tasks compared to TensorFlow default optimization. On a diverse set of representative graphs consisting of up to 80,000 nodes, including Inception-v3, Transformer-XL, and WaveNet, GO achieves on average 21% improvement over human experts and 18% improvement over the prior state of the art with 15x faster convergence, on a device placement task evaluated in real systems.

preprint2020arXiv

A New Derivative-Free Linear Approximation for Solving the Network Water Flow Problem with Convergence Guarantees

Addressing challenges in urban water infrastructure systems including aging infrastructure, supply uncertainty, extreme events, and security threats, depend highly on water distribution networks modeling emphasizing the importance of realistic assumptions, modeling complexities, and scalable solutions. In this study, we propose a derivative-free, linear approximation for solving the network water flow problem (WFP). The proposed approach takes advantage of the special form of the nonlinear head loss equations and, after the transformation of variables and constraints, the WFP reduces to a linear optimization problem that can be efficiently solved by modern linear solvers. Ultimately, the proposed approach amounts to solving a series of linear optimization problems. We demonstrate the proposed approach through several case studies and show that the approach can model arbitrary network topologies and various types of valves and pumps, thus providing modeling flexibility. Under mild conditions, we show that the proposed linear approximation converges. We provide sensitivity analysis and discuss in detail the current limitations of our approach and suggest solutions to overcome these. All the codes, tested networks, and results are freely available on Github for research reproducibility.

preprint2020arXiv

Attentional Graph Convolutional Networks for Knowledge Concept Recommendation in MOOCs in a Heterogeneous View

Massive open online courses are becoming a modish way for education, which provides a large-scale and open-access learning opportunity for students to grasp the knowledge. To attract students' interest, the recommendation system is applied by MOOCs providers to recommend courses to students. However, as a course usually consists of a number of video lectures, with each one covering some specific knowledge concepts, directly recommending courses overlook students'interest to some specific knowledge concepts. To fill this gap, in this paper, we study the problem of knowledge concept recommendation. We propose an end-to-end graph neural network-based approach calledAttentionalHeterogeneous Graph Convolutional Deep Knowledge Recommender(ACKRec) for knowledge concept recommendation in MOOCs. Like other recommendation problems, it suffers from sparsity issues. To address this issue, we leverage both content information and context information to learn the representation of entities via graph convolution network. In addition to students and knowledge concepts, we consider other types of entities (e.g., courses, videos, teachers) and construct a heterogeneous information network to capture the corresponding fruitful semantic relationships among different types of entities and incorporate them into the representation learning process. Specifically, we use meta-path on the HIN to guide the propagation of students' preferences. With the help of these meta-paths, the students' preference distribution with respect to a candidate knowledge concept can be captured. Furthermore, we propose an attention mechanism to adaptively fuse the context information from different meta-paths, in order to capture the different interests of different students. The promising experiment results show that the proposedACKRecis able to effectively recommend knowledge concepts to students pursuing online learning in MOOCs.

preprint2020arXiv

BusTime: Which is the Right Prediction Model for My Bus Arrival Time?

With the rise of big data technologies, many smart transportation applications have been rapidly developed in recent years including bus arrival time predictions. This type of applications help passengers to plan trips more efficiently without wasting unpredictable amount of waiting time at bus stops. Many studies focus on improving the prediction accuracy of various machine learning and statistical models, while much less work demonstrate their applicability of being deployed and used in realistic urban settings. This paper tries to fill this gap by proposing a general and practical evaluation framework for analysing various widely used prediction models (i.e. delay, k-nearest-neighbour, kernel regression, additive model, and recurrent neural network using long short term memory) for bus arrival time. In particular, this framework contains a raw bus GPS data pre-processing method that needs much less number of input data points while still maintain satisfactory prediction results. This pre-processing method enables various models to predict arrival time at bus stops only, by using a KD-tree based nearest point search method. Based on this framework, using raw bus GPS dataset in different scales from the city of Dublin, Ireland, we also present preliminary results for city managers by analysing the practical strengths and weaknesses in both training and predicting stages of commonly used prediction models.

preprint2020arXiv

Chip Placement with Deep Reinforcement Learning

In this work, we present a learning-based approach to chip placement, one of the most complex and time-consuming stages of the chip design process. Unlike prior methods, our approach has the ability to learn from past experience and improve over time. In particular, as we train over a greater number of chip blocks, our method becomes better at rapidly generating optimized placements for previously unseen chip blocks. To achieve these results, we pose placement as a Reinforcement Learning (RL) problem and train an agent to place the nodes of a chip netlist onto a chip canvas. To enable our RL policy to generalize to unseen blocks, we ground representation learning in the supervised task of predicting placement quality. By designing a neural architecture that can accurately predict reward across a wide variety of netlists and their placements, we are able to generate rich feature embeddings of the input netlists. We then use this architecture as the encoder of our policy and value networks to enable transfer learning. Our objective is to minimize PPA (power, performance, and area), and we show that, in under 6 hours, our method can generate placements that are superhuman or comparable on modern accelerator netlists, whereas existing baselines require human experts in the loop and take several weeks.

preprint2020arXiv

Computing Lipschitz Constants for Hydraulic Models of Water Distribution Networks

Drinking water distribution networks (WDN) are large-scale, dynamic systems spanning large geographic areas. Water networks include various components such as junctions, reservoirs, tanks, pipes, pumps, and valves. Hydraulic models for these components depicting mass and energy balance form nonlinear algebraic differential equations (NDAE). While control theoretic studies have been thoroughly explored for other complex infrastructure such as power and transportation systems, little is understood or even investigated for feedback control and state estimation problems for the NDAE models of WDN. The objective of this letter is to showcase a complete NDAE model of WDN followed by computing Lipschitz constants of the vector-valued nonlinearity in that model. The computation of Lipschitz constants of hydraulic models is crucial as it paves the way to apply a plethora of control-theoretic studies for water system applications. In particular, the computation of Lipschitz constant is explored through closed-form, analytical expressions as well as via numerical methods. Case studies reveal how such computations fare against each other for various water networks.

preprint2020arXiv

Context-Aware Refinement Network Incorporating Structural Connectivity Prior for Brain Midline Delineation

Brain midline delineation can facilitate the clinical evaluation of brain midline shift, which plays an important role in the diagnosis and prognosis of various brain pathology. Nevertheless, there are still great challenges with brain midline delineation, such as the largely deformed midline caused by the mass effect and the possible morphological failure that the predicted midline is not a connected curve. To address these challenges, we propose a context-aware refinement network (CAR-Net) to refine and integrate the feature pyramid representation generated by the UNet. Consequently, the proposed CAR-Net explores more discriminative contextual features and a larger receptive field, which is of great importance to predict largely deformed midline. For keeping the structural connectivity of the brain midline, we introduce a novel connectivity regular loss (CRL) to punish the disconnectivity between adjacent coordinates. Moreover, we address the ignored prerequisite of previous regression-based methods that the brain CT image must be in the standard pose. A simple pose rectification network is presented to align the source input image to the standard pose image. Extensive experimental results on the CQ dataset and one inhouse dataset show that the proposed method requires fewer parameters and outperforms three state-of-the-art methods in terms of four evaluation metrics. Code is available at https://github.com/ShawnBIT/Brain-Midline-Detection.

preprint2020arXiv

Filament Intersections and Cold Dense Cores in Orion A North

We studied the filament structures and dense cores in OMC-2,3 region in Orion A North molecular cloud using the high-resolution N2H+ (1-0) spectral cube observed with the Atacama Large Millimeter/Submillimeter Array (ALMA). The filament network over a total length of 2 pc is found to contain 170 intersections and 128 candidate dense cores. The dense cores are all displaced from the infrared point sources (possible young stars), and the major fraction of cores (103) are located around the intersections. Towards the intersections, there is also an increasing trend for the total column density Ntot as well as the the power-law index of the column-density Probability Distribution Function (N-PDF), suggesting that the intersections would in general have more significant gas assembly than the other part of the filament paths. The virial analysis shows that the dense cores mostly have virial mass ratio of alpha_vir=M_vir/M_gas<1.0, suggesting them to be bounded by the self gravity. In the mean time, only about 23 percent of the cores have critical mass ratio of alpha_crit=M_crit/M_gas<1.0, suggesting them to be unstable against core collapse. Combining these results, it shows that the major fraction of the cold starless and possible prestellar cores in OMC-2,3 are being assembled around the intersections, and currently in a gravitationally bound state. But more extensive core collapse and star formation may still require continuous core-mass growth or other perturbatio

preprint2020arXiv

Generative Temporal Link Prediction via Self-tokenized Sequence Modeling

We formalize networks with evolving structures as temporal networks and propose a generative link prediction model, Generative Link Sequence Modeling (GLSM), to predict future links for temporal networks. GLSM captures the temporal link formation patterns from the observed links with a sequence modeling framework and has the ability to generate the emerging links by inferring from the probability distribution on the potential future links. To avoid overfitting caused by treating each link as a unique token, we propose a self-tokenization mechanism to transform each raw link in the network to an abstract aggregation token automatically. The self-tokenization is seamlessly integrated into the sequence modeling framework, which allows the proposed GLSM model to have the generalization capability to discover link formation patterns beyond raw link sequences. We compare GLSM with the existing state-of-art methods on five real-world datasets. The experimental results demonstrate that GLSM obtains future positive links effectively in a generative fashion while achieving the best performance (2-10\% improvements on AUC) among other alternatives.

preprint2020arXiv

New Insights on One-Sided Lipschitz and Quadratically Inner-Bounded Nonlinear Dynamic Systems

Nonlinear dynamic systems can be classified into various classes depending on the modeled nonlinearity. These classes include Lipschitz, bounded Jacobian, one-sided Lipschitz (OSL), and quadratically inner-bounded (QIB). Such classes essentially yield bounding constants characterizing the nonlinearity. This is then used to design observers and controllers through Riccati equations or matrix inequalities. While analytical expressions for bounding constants of Lipschitz and bounded Jacobian nonlinearity are studied in the literature, OSL and QIB classes are not thoroughly analyzed---computationally or analytically. In short, this paper develops analytical expressions of OSL and QIB bounding constants. These expressions are posed as constrained maximization problems, which can be solved via various optimization algorithms. This paper also presents a novel insight particularly on QIB function set: any function that is QIB turns out to be also Lipschitz continuous.

preprint2020arXiv

Segmentation-based Method combined with Dynamic Programming for Brain Midline Delineation

The midline related pathological image features are crucial for evaluating the severity of brain compression caused by stroke or traumatic brain injury (TBI). The automated midline delineation not only improves the assessment and clinical decision making for patients with stroke symptoms or head trauma but also reduces the time of diagnosis. Nevertheless, most of the previous methods model the midline by localizing the anatomical points, which are hard to detect or even missing in severe cases. In this paper, we formulate the brain midline delineation as a segmentation task and propose a three-stage framework. The proposed framework firstly aligns an input CT image into the standard space. Then, the aligned image is processed by a midline detection network (MD-Net) integrated with the CoordConv Layer and Cascade AtrousCconv Module to obtain the probability map. Finally, we formulate the optimal midline selection as a pathfinding problem to solve the problem of the discontinuity of midline delineation. Experimental results show that our proposed framework can achieve superior performance on one in-house dataset and one public dataset.

preprint2020arXiv

State Estimation in Water Distribution Networks through a New Successive Linear Approximation

State estimation (SE) of water distribution networks (WDNs) is difficult to solve due to nonlinearity/nonconvexity of water flow models, uncertainties from parameters and demands, lack of redundancy of measurements, and inaccurate flow and pressure measurements. This paper proposes a new, scalable successive linear approximation to solve the SE problem in WDNs. The approach amounts to solving either a sequence of linear or quadratic programs---depending on the operators&#39; objectives. The proposed successive linear approximation offers a seamless way of dealing with valve/pump model nonconvexities, is different than a first order Taylor series linearization, and can incorporate with robust uncertainty modeling. Two simple testcases are adopted to illustrate the effectiveness of proposed approach using head measurements at select nodes.

preprint2019arXiv

A PRESTO-based Parallel Pulsar Search Pipeline Used for FAST Drift Scan Data

We developed a pulsar search pipeline based on PRESTO (PulsaR Exploration and Search Toolkit). This pipeline simply runs dedispersion, FFT (Fast Fourier Transformation), and acceleration search in process-level parallel to shorten the processing time. With two parallel strategies, the pipeline can highly shorten the processing time in both the normal searches or acceleration searches. This pipeline was first tested with PMPS (Parkes Multibeam Pulsar Survery) data and discovered two new faint pulsars. Then, it was successfully used in processing the FAST (Five-hundred-meter Aperture Spherical radio Telescope) drift scan data with tens of new pulsar discoveries up to now. The pipeline is only CPU-based and can be easily and quickly deployed in computing nodes for testing purposes or data processes.

preprint2019arXiv

Probing the emission states of PSR J1107-5907

The emission from PSR J1107-5907 is erratic. Sometimes the radio pulse is undetectable, at other times the pulsed emission is weak, and for short durations the emission can be very bright. In order to improve our understanding of these state changes, we have identified archival data sets from the Parkes radio telescope in which the bright emission is present, and find that the emission never switches from the bright state to the weak state, but instead always transitions to the off state. Previous work had suggested the identification of the off state as an extreme manifestation of the weak state. However, the connection between the off and bright emission reported here suggests that the emission can be interpreted as undergoing only two emission states: a bursting state consisting of both bright pulses and nulls as well as the weak-emission state.

preprint2019arXiv

Receding Horizon Control for Drinking Water Networks: The Case for Geometric Programming

Optimal, network-driven control of Water Distribution Networks (WDN) is very difficult: valve and pump models form non-trivial, combinatorial logic; hydraulic models are nonconvex; water demand patterns are uncertain; and WDN are naturally large-scale. Prior research on control of WDN addressed major research challenges, yet either (i) adopted simplified hydraulic models, WDN topologies, and rudimentary valve/pump modeling or (ii) used mixed-integer, nonconvex optimization to solve WDN control problems. The objective of this paper is to develop tractable computational algorithms to manage WDN operation, while considering arbitrary topology, flow direction, an abundance of valve types, control objectives, hydraulic models, and operational constraints---all while only using convex, continuous optimization. Specifically, we propose new Geometric Programming (GP)-based Model Predictive Control (MPC) algorithms, designed to solve the water flow equations and obtain WDN controls, i.e., pump/valve schedules alongside heads and flows. The proposed approach amounts to solving a series of convex optimization problems that graciously scale to large networks. The proposed approach is tested using a 126-node network with many valves and pumps and shown to outperform traditional, rule-based control. The developed GP-based MPC algorithms, as well as the numerical test results are all included on Github.