Source author record

Feng Shi

Feng Shi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

37works

38topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta

Making deep learning recommendation model (DLRM) training and inference fast and efficient is important. However, this presents three key system challenges - model architecture diversity, kernel primitive diversity, and hardware generation and architecture heterogeneity. This paper presents KernelEvolve-an agentic kernel coding framework-to tackle heterogeneity at-scale for DLRM. KernelEvolve is designed to take kernel specifications as input and automate the process of kernel generation and optimization for recommendation model across heterogeneous hardware architectures. KernelEvolve does so by operating at multiple programming abstractions, from Triton and CuTe DSL to low-level hardware agnostic languages, spanning the full hardware-software optimization stack. The kernel optimization process is described as graph-based search with selection policy, universal operator, fitness function, and termination rule, dynamically adapts to runtime execution context through retrieval-augmented prompt synthesis. We designed, implemented, and deployed KernelEvolve to optimize a wide variety of production recommendation models across generations of NVIDIA and AMD GPUs, as well as Meta's AI accelerators. We validate KernelEvolve on the publicly-available KernelBench suite, achieving 100% pass rate on all 250 problems across three difficulty levels, and 160 PyTorch ATen operators across three heterogeneous hardware platforms, demonstrating 100% correctness. KernelEvolve reduces development time from weeks to hours and achieves substantial performance improvements over PyTorch baselines across diverse production use cases and for heterogeneous AI systems at-scale. Beyond performance efficiency improvements, KernelEvolve significantly mitigates the programmability barrier for new AI hardware by enabling automated kernel generation for in-house developed AI hardware.

preprint2022arXiv

An Extended Halo-based Group/Cluster finder: application to the DESI legacy imaging surveys DR8

We extend the halo-based group finder developed by \citet[][]{Yang2005a} to use data {\it simultaneously} with either photometric or spectroscopic redshifts. A mock galaxy redshift survey constructed from a high-resolution N-body simulation is used to evaluate the performance of this extended group finder. For galaxies with magnitude ${\rm z\le 21}$ and redshift $0<z\le 1.0$ in the DESI legacy imaging surveys (the Legacy Surveys), our group finder successfully identifies more than 60\% of the members in about $90\%$ of halos with mass $\ga 10^{12.5}\msunh$. Detected groups with mass $\ga 10^{12.0}\msunh$ have a purity (the fraction of true groups) greater than 90\%. The halo mass assigned to each group has an uncertainty of about 0.2 dex at the high mass end $\ga 10^{13.5}\msunh$ and 0.40 dex at the low mass end. Groups with more than 10 members have a redshift accuracy of $\sim 0.008$. We apply this group finder to the Legacy Surveys DR8 and find 5.2 Million groups with at least 3 members. About 387,000 of these groups have at least 10 members. The resulting catalog containing 3D coordinates, richness, halo masses, and total group luminosities, is made publicly available.

preprint2022arXiv

An improved approach to manufacture CNT reinforced magnesium AZ91 composites with increased strength and ductility

Multiwalled carbon nanotubes (MWCNTs) are decorated with Pt nanoparticles by a "layer-by-layer" approach using poly (sodium 4-styrene sulfonate) (PSS) and poly (diallyl dimethylammonium chloride) (PDDA). Transmission electron microscopy (TEM) images and Energy Dispersive X-Ray (EDX) analysis of the samples confirm Pt deposition on surfaces of CNTs. Dispersibility and dispersion stability of MWCNTs in the solvents are enhanced when MWCNTs are coated with Pt nanoparticles. Mg AZ91 composites reinforced with MWCNTs are then produced by a melt stirring process. Compression tests of the composites show that adding 0.05\% wt Pt-coated MWCNTs in AZ91 improves the composite's mechanical properties compared to the pure AZ91 and pristine MWCNT/AZ91. Fracture surface analysis of the composite using a scanning electron microscope (SEM) shows individuals pulled out MWCNTs in the case of the Pt-coated MWCNT/AZ91 composites. We attribute this finding to the uniform dispersion of Pt-coated MWCNTs in Mg due to the improved wettability of Pt-coated MWCNTs in Mg melts. Molecular dynamics (MD) simulations of the interaction between Pt-coated MWCNTs and Mg support this interpretation.

preprint2021arXiv

A novel multiple instance learning framework for COVID-19 severity assessment via data augmentation and self-supervised learning

How to fast and accurately assess the severity level of COVID-19 is an essential problem, when millions of people are suffering from the pandemic around the world. Currently, the chest CT is regarded as a popular and informative imaging tool for COVID-19 diagnosis. However, we observe that there are two issues -- weak annotation and insufficient data that may obstruct automatic COVID-19 severity assessment with CT images. To address these challenges, we propose a novel three-component method, i.e., 1) a deep multiple instance learning component with instance-level attention to jointly classify the bag and also weigh the instances, 2) a bag-level data augmentation component to generate virtual bags by reorganizing high confidential instances, and 3) a self-supervised pretext component to aid the learning process. We have systematically evaluated our method on the CT images of 229 COVID-19 cases, including 50 severe and 179 non-severe cases. Our method could obtain an average accuracy of 95.8%, with 93.6% sensitivity and 96.4% specificity, which outperformed previous works.

preprint2020arXiv

Adaptive Feature Selection Guided Deep Forest for COVID-19 Classification with Chest CT

Chest computed tomography (CT) becomes an effective tool to assist the diagnosis of coronavirus disease-19 (COVID-19). Due to the outbreak of COVID-19 worldwide, using the computed-aided diagnosis technique for COVID-19 classification based on CT images could largely alleviate the burden of clinicians. In this paper, we propose an Adaptive Feature Selection guided Deep Forest (AFS-DF) for COVID-19 classification based on chest CT images. Specifically, we first extract location-specific features from CT images. Then, in order to capture the high-level representation of these features with the relatively small-scale data, we leverage a deep forest model to learn high-level representation of the features. Moreover, we propose a feature selection method based on the trained deep forest model to reduce the redundancy of features, where the feature selection could be adaptively incorporated with the COVID-19 classification model. We evaluated our proposed AFS-DF on COVID-19 dataset with 1495 patients of COVID-19 and 1027 patients of community acquired pneumonia (CAP). The accuracy (ACC), sensitivity (SEN), specificity (SPE) and AUC achieved by our method are 91.79%, 93.05%, 89.95% and 96.35%, respectively. Experimental results on the COVID-19 dataset suggest that the proposed AFS-DF achieves superior performance in COVID-19 vs. CAP classification, compared with 4 widely used machine learning methods.

preprint2020arXiv

Community Detection for Hypergraph Networks via Regularized Tensor Power Iteration

To date, social network analysis has been largely focused on pairwise interactions. The study of higher-order interactions, via a hypergraph network, brings in new insights. We study community detection in a hypergraph network. A popular approach is to project the hypergraph to a graph and then apply community detection methods for graph networks, but we show that this approach may cause unwanted information loss. We propose a new method for community detection that operates directly on the hypergraph. At the heart of our method is a regularized higher-order orthogonal iteration (reg-HOOI) algorithm that computes an approximate low-rank decomposition of the network adjacency tensor. Compared with existing tensor decomposition methods such as HOSVD and vanilla HOOI, reg-HOOI yields better performance, especially when the hypergraph is sparse. Given the output of tensor decomposition, we then generalize the community detection method SCORE (Jin, 2015) from graph networks to hypergraph networks. We call our new method Tensor-SCORE. In theory, we introduce a degree-corrected block model for hypergraphs (hDCBM), and show that Tensor-SCORE yields consistent community detection for a wide range of network sparsity and degree heterogeneity. As a byproduct, we derive the rates of convergence on estimating the principal subspace by reg-HOOI, with different initializations, including the two new initialization methods we propose, a diagonal-removed HOSVD and a randomized graph projection. We apply our method to several real hypergraph networks which yields encouraging results. It suggests that exploring higher-order interactions provides additional information not seen in graph representations.

preprint2020arXiv

Dual-Sampling Attention Network for Diagnosis of COVID-19 from Community Acquired Pneumonia

The coronavirus disease (COVID-19) is rapidly spreading all over the world, and has infected more than 1,436,000 people in more than 200 countries and territories as of April 9, 2020. Detecting COVID-19 at early stage is essential to deliver proper healthcare to the patients and also to protect the uninfected population. To this end, we develop a dual-sampling attention network to automatically diagnose COVID- 19 from the community acquired pneumonia (CAP) in chest computed tomography (CT). In particular, we propose a novel online attention module with a 3D convolutional network (CNN) to focus on the infection regions in lungs when making decisions of diagnoses. Note that there exists imbalanced distribution of the sizes of the infection regions between COVID-19 and CAP, partially due to fast progress of COVID-19 after symptom onset. Therefore, we develop a dual-sampling strategy to mitigate the imbalanced learning. Our method is evaluated (to our best knowledge) upon the largest multi-center CT data for COVID-19 from 8 hospitals. In the training-validation stage, we collect 2186 CT scans from 1588 patients for a 5-fold cross-validation. In the testing stage, we employ another independent large-scale testing dataset including 2796 CT scans from 2057 patients. Results show that our algorithm can identify the COVID-19 images with the area under the receiver operating characteristic curve (AUC) value of 0.944, accuracy of 87.5%, sensitivity of 86.9%, specificity of 90.1%, and F1-score of 82.0%. With this performance, the proposed algorithm could potentially aid radiologists with COVID-19 diagnosis from CAP, especially in the early stage of the COVID-19 outbreak.

preprint2020arXiv

HIR4: cosmology from a simulated neutral hydrogen full sky using Horizon Run 4

The distribution of cosmological neutral hydrogen will provide a new window into the large-scale structure of the Universe with the next generation of radio telescopes and surveys. The observation of this material, through 21cm line emission, will be confused by foreground emission in the same frequencies. Even after these foregrounds are removed, the reconstructed map may not exactly match the original cosmological signal, which will introduce systematic errors and offset into the measured correlations. In this paper, we simulate future surveys of neutral hydrogen using the Horizon Run 4 (HR4) cosmological N-body simulation. We generate HI intensity maps from the HR4 halo catalogue, and combine with foreground radio emission maps from the Global Sky Model, to create accurate simulations over the entire sky. We simulate the HI sky for the frequency range 700-800 MHz, matching the sensitivity of the Tianlai pathfinder. We test the accuracy of the fastICA, PCA and log-polynomial fitting foreground removal methods to recover the input cosmological angular power spectrum and measure the parameters. We show the effect of survey noise levels and beam sizes on the recovered the cosmological constraints. We find that while the reconstruction removes power from the cosmological 21cm distribution on large-scales, we can correct for this and recover the input parameters in the noise-free case. However, the effect of noise and beam size of the Tianlai pathfinder prevents accurate recovery of the cosmological parameters when using only intensity mapping information.

preprint2020arXiv

Hypergraph Learning for Identification of COVID-19 with CT Imaging

The coronavirus disease, named COVID-19, has become the largest global public health crisis since it started in early 2020. CT imaging has been used as a complementary tool to assist early screening, especially for the rapid identification of COVID-19 cases from community acquired pneumonia (CAP) cases. The main challenge in early screening is how to model the confusing cases in the COVID-19 and CAP groups, with very similar clinical manifestations and imaging features. To tackle this challenge, we propose an Uncertainty Vertex-weighted Hypergraph Learning (UVHL) method to identify COVID-19 from CAP using CT images. In particular, multiple types of features (including regional features and radiomics features) are first extracted from CT image for each case. Then, the relationship among different cases is formulated by a hypergraph structure, with each case represented as a vertex in the hypergraph. The uncertainty of each vertex is further computed with an uncertainty score measurement and used as a weight in the hypergraph. Finally, a learning process of the vertex-weighted hypergraph is used to predict whether a new testing case belongs to COVID-19 or not. Experiments on a large multi-center pneumonia dataset, consisting of 2,148 COVID-19 cases and 1,182 CAP cases from five hospitals, are conducted to evaluate the performance of the proposed method. Results demonstrate the effectiveness and robustness of our proposed method on the identification of COVID-19 in comparison to state-of-the-art methods.

preprint2020arXiv

Large-Scale Screening of COVID-19 from Community Acquired Pneumonia using Infection Size-Aware Classification

The worldwide spread of coronavirus disease (COVID-19) has become a threatening risk for global public health. It is of great importance to rapidly and accurately screen patients with COVID-19 from community acquired pneumonia (CAP). In this study, a total of 1658 patients with COVID-19 and 1027 patients of CAP underwent thin-section CT. All images were preprocessed to obtain the segmentations of both infections and lung fields, which were used to extract location-specific features. An infection Size Aware Random Forest method (iSARF) was proposed, in which subjects were automated categorized into groups with different ranges of infected lesion sizes, followed by random forests in each group for classification. Experimental results show that the proposed method yielded sensitivity of 0.907, specificity of 0.833, and accuracy of 0.879 under five-fold cross-validation. Large performance margins against comparison methods were achieved especially for the cases with infection size in the medium range, from 0.01% to 10%. The further inclusion of Radiomics features show slightly improvement. It is anticipated that our proposed framework could assist clinical decision making.

preprint2020arXiv

MixPUL: Consistency-based Augmentation for Positive and Unlabeled Learning

Learning from positive and unlabeled data (PU learning) is prevalent in practical applications where only a couple of examples are positively labeled. Previous PU learning studies typically rely on existing samples such that the data distribution is not extensively explored. In this work, we propose a simple yet effective data augmentation method, coined~\algo, based on \emph{consistency regularization} which provides a new perspective of using PU data. In particular, the proposed~\algo~incorporates supervised and unsupervised consistency training to generate augmented data. To facilitate supervised consistency, reliable negative examples are mined from unlabeled data due to the absence of negative samples. Unsupervised consistency is further encouraged between unlabeled datapoints. In addition,~\algo~reduces margin loss between positive and unlabeled pairs, which explicitly optimizes AUC and yields faster convergence. Finally, we conduct a series of studies to demonstrate the effectiveness of consistency regularization. We examined three kinds of reliable negative mining methods. We show that~\algo~achieves an averaged improvement of classification error from 16.49 to 13.09 on the CIFAR-10 dataset across different positive data amount.

preprint2020arXiv

MRI Super-Resolution with GAN and 3D Multi-Level DenseNet: Smaller, Faster, and Better

High-resolution (HR) magnetic resonance imaging (MRI) provides detailed anatomical information that is critical for diagnosis in the clinical application. However, HR MRI typically comes at the cost of long scan time, small spatial coverage, and low signal-to-noise ratio (SNR). Recent studies showed that with a deep convolutional neural network (CNN), HR generic images could be recovered from low-resolution (LR) inputs via single image super-resolution (SISR) approaches. Additionally, previous works have shown that a deep 3D CNN can generate high-quality SR MRIs by using learned image priors. However, 3D CNN with deep structures, have a large number of parameters and are computationally expensive. In this paper, we propose a novel 3D CNN architecture, namely a multi-level densely connected super-resolution network (mDCSRN), which is light-weight, fast and accurate. We also show that with the generative adversarial network (GAN)-guided training, the mDCSRN-GAN provides appealing sharp SR images with rich texture details that are highly comparable with the referenced HR images. Our results from experiments on a large public dataset with 1,113 subjects showed that this new architecture outperformed other popular deep learning methods in recovering 4x resolution-downgraded images in both quality and speed.

preprint2020arXiv

Review of Artificial Intelligence Techniques in Imaging Data Acquisition, Segmentation and Diagnosis for COVID-19

(This paper was submitted as an invited paper to IEEE Reviews in Biomedical Engineering on April 6, 2020.) The pandemic of coronavirus disease 2019 (COVID-19) is spreading all over the world. Medical imaging such as X-ray and computed tomography (CT) plays an essential role in the global fight against COVID-19, whereas the recently emerging artificial intelligence (AI) technologies further strengthen the power of the imaging tools and help medical specialists. We hereby review the rapid responses in the community of medical imaging (empowered by AI) toward COVID-19. For example, AI-empowered image acquisition can significantly help automate the scanning procedure and also reshape the workflow with minimal contact to patients, providing the best protection to the imaging technicians. Also, AI can improve work efficiency by accurate delination of infections in X-ray and CT images, facilitating subsequent quantification. Moreover, the computer-aided platforms help radiologists make clinical decisions, i.e., for disease diagnosis, tracking, and prognosis. In this review paper, we thus cover the entire pipeline of medical imaging and analysis techniques involved with COVID-19, including image acquisition, segmentation, diagnosis, and follow-up. We particularly focus on the integration of AI with X-ray and CT, both of which are widely used in the frontline hospitals, in order to depict the latest progress of medical imaging and radiology fighting against COVID-19.

preprint2020arXiv

Runtime Performances of Randomized Search Heuristics for the Dynamic Weighted Vertex Cover Problem

Randomized search heuristics such as evolutionary algorithms are frequently applied to dynamic combinatorial optimization problems. Within this paper, we present a dynamic model of the classic Weighted Vertex Cover problem and analyze the runtime performances of the well-studied algorithms Randomized Local Search and (1+1) EA adapted to it, to contribute to the theoretical understanding of evolutionary computing for problems with dynamic changes. In our investigations, we use an edge-based representation based on the dual form of the Linear Programming formulation for the problem and study the expected runtime that the adapted algorithms require to maintain a 2-approximate solution when the given weighted graph is modified by an edge-editing or weight-editing operation. Considering the weights on the vertices may be exponentially large with respect to the size of the graph, the step size adaption strategy is incorporated, with or without the 1/5-th rule that is employed to control the increasing/decreasing rate of the step size. Our results show that three of the four algorithms presented in the paper can recompute 2-approximate solutions for the studied dynamic changes in polynomial expected runtime, but the (1+1) EA with 1/5-th Rule requires pseudo-polynomial expected runtime.

preprint2020arXiv

Science and Technology Advance through Surprise

Breakthrough discoveries and inventions involve unexpected combinations of contents including problems, methods, and natural entities, and also diverse contexts such as journals, subfields, and conferences. Drawing on data from tens of millions of research papers, patents, and researchers, we construct models that predict next year's content and context combinations with an AUC of 95% based on embeddings constructed from high-dimensional stochastic block models, where the improbability of new combinations itself predicts up to 50% of the likelihood that they will gain outsized citations and major awards. Most of these breakthroughs occur when problems in one field are unexpectedly solved by researchers from a distant other. These findings demonstrate the critical role of surprise in advance, and enable evaluation of scientific institutions ranging from education and peer review to awards in supporting it.

preprint2020arXiv

Severity Assessment of Coronavirus Disease 2019 (COVID-19) Using Quantitative Features from Chest CT Images

Background: Chest computed tomography (CT) is recognized as an important tool for COVID-19 severity assessment. As the number of affected patients increase rapidly, manual severity assessment becomes a labor-intensive task, and may lead to delayed treatment. Purpose: Using machine learning method to realize automatic severity assessment (non-severe or severe) of COVID-19 based on chest CT images, and to explore the severity-related features from the resulting assessment model. Materials and Method: Chest CT images of 176 patients (age 45.3$\pm$16.5 years, 96 male and 80 female) with confirmed COVID-19 are used, from which 63 quantitative features, e.g., the infection volume/ratio of the whole lung and the volume of ground-glass opacity (GGO) regions, are calculated. A random forest (RF) model is trained to assess the severity (non-severe or severe) based on quantitative features. Importance of each quantitative feature, which reflects the correlation to the severity of COVID-19, is calculated from the RF model. Results: Using three-fold cross validation, the RF model shows promising results, i.e., 0.933 of true positive rate, 0.745 of true negative rate, 0.875 of accuracy, and 0.91 of area under receiver operating characteristic curve (AUC). The resulting importance of quantitative features shows that the volume and its ratio (with respect to the whole lung volume) of ground glass opacity (GGO) regions are highly related to the severity of COVID-19, and the quantitative features calculated from the right lung are more related to the severity assessment than those of the left lung. Conclusion: The RF based model can achieve automatic severity assessment (non-severe or severe) of COVID-19 infection, and the performance is promising. Several quantitative features, which have the potential to reflect the severity of COVID-19, were revealed.

preprint2020arXiv

Synergistic Learning of Lung Lobe Segmentation and Hierarchical Multi-Instance Classification for Automated Severity Assessment of COVID-19 in CT Images

Understanding chest CT imaging of the coronavirus disease 2019 (COVID-19) will help detect infections early and assess the disease progression. Especially, automated severity assessment of COVID-19 in CT images plays an essential role in identifying cases that are in great need of intensive clinical care. However, it is often challenging to accurately assess the severity of this disease in CT images, due to variable infection regions in the lungs, similar imaging biomarkers, and large inter-case variations. To this end, we propose a synergistic learning framework for automated severity assessment of COVID-19 in 3D CT images, by jointly performing lung lobe segmentation and multi-instance classification. Considering that only a few infection regions in a CT image are related to the severity assessment, we first represent each input image by a bag that contains a set of 2D image patches (with each cropped from a specific slice). A multi-task multi-instance deep network (called M$^2$UNet) is then developed to assess the severity of COVID-19 patients and also segment the lung lobe simultaneously. Our M$^2$UNet consists of a patch-level encoder, a segmentation sub-network for lung lobe segmentation, and a classification sub-network for severity assessment (with a unique hierarchical multi-instance learning strategy). Here, the context information provided by segmentation can be implicitly employed to improve the performance of severity assessment. Extensive experiments were performed on a real COVID-19 CT image dataset consisting of 666 chest CT images, with results suggesting the effectiveness of our proposed method compared to several state-of-the-art methods.

preprint2019arXiv

DS-GCNs: Connectome Classification Using Dynamic Spectral Graph Convolution Networks with Assistant Task Training

Functional Connectivity (FC) matrices measure the regional interactions in the brain and have been widely used in neurological brain disease classification. However, a FC matrix is neither a natural image which contains shape and texture information, nor a vector of independent features, which renders the extracting of efficient features from matrices as a challenging problem. A brain network, also named as connectome, could forma a graph structure naturally, the nodes of which are brain regions and the edges are interregional connectivity. Thus, in this study, we proposed novel graph convolutional networks (GCNs) to extract efficient disease-related features from FC matrices. Considering the time-dependent nature of brain activity, we computed dynamic FC matrices with sliding-windows and implemented a graph convolution based LSTM (long short term memory) layer to process dynamic graphs. Moreover, the demographics of patients were also used to guide the classification. However, unlike in conventional methods where personal information, i.e., gender and age were added as extra inputs, we argue that this kind of approach may not actually improve the classification performance, for such personal information given in dataset was usually balanced distributed. In this paper, we proposed to utilize the demographic information as extra outputs and to share parameters among three networks predicting subject status, gender and age, which serve as assistant tasks. We tested the performance of the proposed architecture in ADNI II dataset to classify Alzheimer's disease patients from normal controls. The classification accuracy, sensitivity and specificity reach 0.90, 0.92 and 0.89 on ADNI II dataset.

preprint2016arXiv

An empirical model to form and evolve galaxies in dark matter halos

Based on the star formation histories (SFH) of galaxies in halos of different masses, we develop an empirical model to grow galaxies in dark mattet halos. This model has very few ingredients, any of which can be associated to observational data and thus be efficiently assessed. By applying this model to a very high resolution cosmological $N$-body simulation, we predict a number of galaxy properties that are a very good match to relevant observational data. Namely, for both centrals and satellites, the galaxy stellar mass function (SMF) up to redshift $z\simeq4$ and the conditional stellar mass functions (CSMF) in the local universe are in good agreement with observations. In addition, the 2-point correlation is well predicted in the different stellar mass ranges explored by our model. Furthermore, after applying stellar population synthesis models to our stellar composition as a function of redshift, we find that the luminosity functions in $^{0.1}u$, $^{0.1}g$, $^{0.1}r$, $^{0.1}i$ and $^{0.1}z$ bands agree quite well with the SDSS observational results down to an absolute magnitude at about -17.0. The SDSS conditional luminosity functions (CLF) itself is predicted well. Finally, the cold gas is derived from the star formation rate (SFR) to predict the HI gas mass within each mock galaxy. We find a remarkably good match to observed HI-to-stellar mass ratios. These features ensure that such galaxy/gas catalogs can be used to generate reliable mock redshift surveys.

preprint2016arXiv

Galaxy groups in the 2MASS Redshift Survey

A galaxy group catalog is constructed from the 2MASS Redshift Survey (2MRS) with the use of a halo-based group finder. The halo mass associated with a group is estimated using a `GAP' method based on the luminosity of the central galaxy and its gap with other member galaxies. Tests using mock samples shows that this method is reliable, particularly for poor systems containing only a few members. On average 80% of all the groups have completeness >0.8, and about 65% of the groups have zero contamination. Halo masses are estimated with a typical uncertainty $\sim 0.35\,{\rm dex}$. The application of the group finder to the 2MRS gives 29,904 groups from a total of 43,246 galaxies at $z \leq 0.08$, with 5,286 groups having two or more members. Some basic properties of this group catalog is presented, and comparisons are made with other groups catalogs in overlap regions. With a depth to $z\sim 0.08$ and uniformly covering about 91% of the whole sky, this group catalog provides a useful data base to study galaxies in the local cosmic web, and to reconstruct the mass distribution in the local Universe.

preprint2016arXiv

Interception in Distance-Vector Routing Networks

Despite the large effort devoted to cybersecurity research over the last decades, cyber intrusions and attacks are still increasing. With respect to routing networks, route hijacking has highlighted the need to reexamine the existing protocols that govern traffic routing. In particular, our pri- mary question is how the topology of a network affects the susceptibility of a routing protocol to endogenous route misdirection. In this paper we define and analyze an abstract model of traffic interception (i.e. eavesdropping) in distance-vector routing networks. Specifically, we study al- gorithms that measure the potential of groups of dishonest agents to divert traffic through their infrastructure under the constraint that messages must reach their intended destinations. We relate two variants of our model based on the allowed kinds of lies, define strategies for colluding agents, and prove optimality in special cases. In our main theorem we derive a provably optimal monitoring strategy for subsets of agents in which no two are adjacent, and we extend this strategy to the general case. Finally, we use our results to analyze the susceptibility of real and synthetic networks to endogenous traffic interception. In the Autonomous Systems (AS) graph of the United States, we show that compromising only 18 random nodes in the AS graph surprisingly captures 10% of all traffic paths in the network in expectation when a distance-vector routing protocol is in use.

preprint2016arXiv

Mapping the real space distributions of galaxies in SDSS DR7: I. Two Point Correlation Functions

Using a method to correct redshift space distortion (RSD) for individual galaxies, we mapped the real space distributions of galaxies in the Sloan Digital Sky Survey (SDSS) Data Release 7 (DR7). We use an ensemble of mock catalogs to demonstrate the reliability of our method. Here as the first paper in a series, we mainly focus on the two point correlation function (2PCF) of galaxies. Overall the 2PCF measured in the reconstructed real space for galaxies brighter than $^{0.1}{\rm M}_r-5\log h=-19.0$ agrees with the direct measurement to an accuracy better than the measurement error due to cosmic variance, if the reconstruction uses the correct cosmology. Applying the method to the SDSS DR7, we construct a real space version of the main galaxy catalog, which contains 396,068 galaxies in the North Galactic Cap with redshifts in the range $0.01 \leq z \leq 0.12$. The Sloan Great Wall, the largest known structure in the nearby Universe, is not as dominant an over-dense structure as appears to be in redshift space. We measure the 2PCFs in reconstructed real space for galaxies of different luminosities and colors. All of them show clear deviations from single power-law forms, and reveal clear transitions from 1-halo to 2-halo terms. A comparison with the corresponding 2PCFs in redshift space nicely demonstrates how RSDs boost the clustering power on large scales (by about $40-50\%$ at scales $\sim 10 h^{-1}{\rm {Mpc}}$) and suppress it on small scales (by about $70-80\%$ at a scale of $0.3 h^{-1}{\rm {Mpc}}$).

preprint2016arXiv

Parameterized Algorithms for the Maximum Agreement Forest Problem on Multiple Rooted Multifurcating Trees

The Maximum Agreement Forest problem has been extensively studied in phylogenetics. Most previous work is on two binary phylogenetic trees. In this paper, we study a generalized version of the problem: the Maximum Agreement Forest problem on multiple rooted multifurcating phylogenetic trees, from the perspective of fixed-parameter algorithms. By taking advantage of a new branch-and-bound strategy, two parameterized algorithms, with running times $O(2.42^k m^3 n^4)$ and $O(2.74^k m^3 n^5)$, respectively, are presented for the hard version and the soft version of the problem, which correspond to two different biological meanings to the polytomies in multifurcating phylogenetic trees.

preprint2016arXiv

Parameterized Analysis of Multi-objective Evolutionary Algorithms and the Weighted Vertex Cover Problem

A rigorous runtime analysis of evolutionary multi-objective optimization for the classical vertex cover problem in the context of parameterized complexity analysis has been presented by Kratsch and Neumann (2013). In this paper, we extend the analysis to the weighted vertex cover problem and provide a fixed parameter evolutionary algorithm with respect to OPT, the cost of the the optimal solution for the problem. Moreover, using a diversity mechanisms, we present a multi-objective evolutionary algorithm that finds a 2-approximation in expected polynomial time and introduce a population-based evolutionary algorithm which finds a $(1+\varepsilon)$-approximation in expected time $O(n\cdot 2^{\min \{n,2(1- \varepsilon)OPT \}} + n^3)$.

preprint2016arXiv

Transitivity reinforcement in the coevolving voter model

One of the fundamental structural properties of many networks is triangle closure. Whereas the influence of this transitivity on a variety of contagion dynamics has been previously explored, existing models of coevolving or adaptive network systems use rewiring rules that randomize away this important property. In contrast, we study here a modified coevolving voter model dynamics that explicitly reinforces and maintains such clustering. Employing extensive numerical simulations, we establish that the transitions and dynamical states observed in coevolving voter model networks without clustering are altered by reinforcing transitivity in the model. We then use a semi-analytical framework in terms of approximate master equations to predict the dynamical behaviors of the model for a variety of parameter settings.

preprint2015arXiv

An Analytic Linear Accelerator Source Model for Monte Carlo Dose Calculations. I. Model Representation and Construction

Monte Carlo (MC) simulation is considered as the most accurate method for radiation dose calculations. Accuracy of a source model for a linear accelerator is critical for the overall dose calculation accuracy. In this paper, we presented an analytical source model that we recently developed for GPU-based MC dose calculations. A key concept called phase-space-ring (PSR) was proposed. It contained a group of particles that are of the same type and close in energy and radial distance to the center of the phase-space plane. The model parameterized probability densities of particle location, direction and energy for each primary photon PSR, scattered photon PSR and electron PSR. For a primary photon PSRs, the particle direction is assumed to be from the beam spot. A finite spot size is modeled with a 2D Gaussian distribution. For a scattered photon PSR, multiple Gaussian components were used to model the particle direction. The direction distribution of an electron PSRs was also modeled as a 2D Gaussian distribution with a large standard deviation. We also developed a method to analyze a phase-space file and derive corresponding model parameters. To test the accuracy of our linac source model, dose distributions of different open fields in a water phantom were calculated using our source model and compared to those directly calculated using the reference phase-space file. The average distance-to-agreement (DTA) was within 1 mm for the depth dose in the build-up region and beam penumbra regions. The root-mean-square (RMS) dose difference was within 1.1% for dose profiles at inner and outer beam regions. The maximal relative difference of output factors was within 0.5%. Good agreements were also found in an IMRT prostate patient case and an IMRT head-and-neck case. These results demonstrated the efficacy of our source model in terms of accurately representing a reference phase-space file.

preprint2015arXiv

An Analytic Linear Accelerator Source Model for Monte Carlo dose calculations. II. Model Utilization in a GPU-based Monte Carlo Package and Automatic Source Commissioning

We recently built an analytical source model for GPU-based MC dose engine. In this paper, we present a sampling strategy to efficiently utilize this source model in GPU-based dose calculation. Our source model was based on a concept of phase-space-ring (PSR). This ring structure makes it effective to account for beam rotational symmetry, but not suitable for dose calculations due to rectangular jaw settings. Hence, we first convert PSR source model to its phase-space let (PSL) representation. Then in dose calculation, different types of sub-sources were separately sampled. Source sampling and particle transport were iterated. So that the particles being sampled and transported simultaneously are of same type and close in energy to alleviate GPU thread divergence. We also present an automatic commissioning approach to adjust the model for a good representation of a clinical linear accelerator . Weighting factors were introduced to adjust relative weights of PSRs, determined by solving a quadratic minimization problem with a non-negativity constraint. We tested the efficiency gain of our model over a previous source model using PSL files. The efficiency was improved by 1.70 ~ 4.41, due to the avoidance of long data reading and transferring. The commissioning problem can be solved in ~20 sec. Its efficacy was tested by comparing the doses computed using the commissioned model and the uncommissioned one, with measurements in different open fields in a water phantom under a clinical Varian Truebeam 6MV beam. For the depth dose curves, the average distance-to-agreement was improved from 0.04~0.28 cm to 0.04~0.12 cm for build-up region and the root-mean-square (RMS) dose difference after build-up region was reduced from 0.32%~0.67% to 0.21%~0.48%. For lateral dose profiles, RMS difference was reduced from 0.31%~2.0% to 0.06%~0.78% at inner beam and from 0.20%~1.25% to 0.10%~0.51% at outer beam.

preprint2014arXiv

Algorithms for Maximum Agreement Forest of Multiple General Trees

The Maximum Agreement Forest (Maf) problem is a well-studied problem in evolutionary biology, which asks for a largest common subforest of a given collection of phylogenetic trees with identical leaf label-set. However, the previous work about the Maf problem are mainly on two binary phylogenetic trees or two general (i.e., binary and non-binary) phylogenetic trees. In this paper, we study the more general version of the problem: the Maf problem on multiple general phylogenetic trees. We present a parameterized algorithm of running time $O(3^k n^2m)$ and a 3-approximation algorithm for the Maf problem on multiple rooted general phylogenetic trees, and a parameterized algorithm of running time $O(4^k n^2m)$ and a 4-approximation algorithm for the Maf problem on multiple unrooted general phylogenetic trees. We also implement the parameterized algorithm and approximation algorithm for the Maf problem on multiple rooted general phylogenetic trees, and test them on simulated data and biological data.

preprint2014arXiv

An Enhanced Multiway Sorting Network Based on n-Sorters

Merging-based sorting networks are an important family of sorting networks. Most merge sorting networks are based on 2-way or multi-way merging algorithms using 2-sorters as basic building blocks. An alternative is to use n-sorters, instead of 2-sorters, as the basic building blocks so as to greatly reduce the number of sorters as well as the latency. Based on a modified Leighton's columnsort algorithm, an n-way merging algorithm, referred to as SS-Mk, that uses n-sorters as basic building blocks was proposed. In this work, we first propose a new multiway merging algorithm with n-sorters as basic building blocks that merges n sorted lists of m values each in 1 + ceil(m/2) stages (n <= m). Based on our merging algorithm, we also propose a sorting algorithm, which requires O(N log2 N) basic sorters to sort N inputs. While the asymptotic complexity (in terms of the required number of sorters) of our sorting algorithm is the same as the SS-Mk, for wide ranges of N, our algorithm requires fewer sorters than the SS-Mk. Finally, we consider a binary sorting network, where the basic sorter is implemented in threshold logic and scales linearly with the number of inputs, and compare the complexity in terms of the required number of gates. For wide ranges of N, our algorithm requires fewer gates than the SS-Mk.

preprint2014arXiv

Gradient Boundary Histograms for Action Recognition

This paper introduces a high efficient local spatiotemporal descriptor, called gradient boundary histograms (GBH). The proposed GBH descriptor is built on simple spatio-temporal gradients, which are fast to compute. We demonstrate that it can better represent local structure and motion than other gradient-based descriptors, and significantly outperforms them on large realistic datasets. A comprehensive evaluation shows that the recognition accuracy is preserved while the spatial resolution is greatly reduced, which yields both high efficiency and low memory usage.

preprint2013arXiv

A multi-opinion evolving voter model with infinitely many phase transitions

We consider an idealized model in which individuals' changing opinions and their social network coevolve, with disagreements between neighbors in the network resolved either through one imitating the opinion of the other or by reassignment of the discordant edge. Specifically, an interaction between $x$ and one of its neighbors $y$ leads to $x$ imitating $y$ with probability $(1-α)$ and otherwise (i.e., with probability $α$) $x$ cutting its tie to $y$ in order to instead connect to a randomly chosen individual. Building on previous work about the two-opinion case, we study the multiple-opinion situation, finding that the model has infinitely many phase transitions. Moreover, the formulas describing the end states of these processes are remarkably simple when expressed as a function of $β= α/(1-α)$.

preprint2013arXiv

A New Splitting Method for Time-dependent Convection-dominated Diffusion Problems

We present a new splitting method for time-dependent convection-dominated diffusion problems. The original convection diffusion system is split into two sub-systems: a pure convection system and a diffusion system. At each time step, a convection problem and a diffusion problem are solved successively. The scheme has the following nice features: the convection subproblem is solved explicitly and a multistep technique is introduced to essentially enlarge the stability region so that the resulting scheme behaves like an unconditionally stable scheme; the diffusion subproblem is always self-adjoint and coercive so that it can be solved efficiently using many existing optimal preconditioned iterative solvers. The scheme is then extended for Navier-Stokes equations, where the nonlinear convection is resolved by a linear explicit multistep scheme at the convection step, and only a generalized Stokes problem is needed to solve at the diffusion step with the resulting stiffness matrix being invariant in the time marching process. The new schemes are all free from tuning some stabilization parameters for the convection-dominated diffusion problems. Numerical simulations are presented to demonstrate the stability, convergence and performance of the single-step and multistep variants of the new scheme.

preprint2013arXiv

Percolation-induced exponential scaling in the large current tails of random resistor networks

There is a renewed surge in percolation-induced transport properties of diverse nano-particle composites (cf. RSC Nanoscience & Nanotechnology Series, Paul O'Brien Editor-in-Chief). We note in particular a broad interest in nano-composites exhibiting sharp electrical property gains at and above percolation threshold, which motivated us to revisit the classical setting of percolation in random resistor networks but from a multiscale perspective. For each realization of random resistor networks above threshold, we use network graph representations and associated algorithms to identify and restrict to the percolating component, thereby preconditioning the network both in size and accuracy by filtering {\it a priori} zero current-carrying bonds. We then simulate many realizations per bond density and analyze scaling behavior of the complete current distribution supported on the percolating component. We first confirm the celebrated power-law distribution of small currents at the percolation threshold, and second we confirm results on scaling of the maximum current in the network that is associated with the backbone of the percolating cluster. These properties are then placed in context with global features of the current distribution, and in particular the dominant role of the large current tail that is most relevant for material science applications. We identify a robust, exponential large current tail that: 1. persists above threshold; 2. expands broadly over and dominates the current distribution at the expense of the vanishing power law scaling in the small current tail; and 3. by taking second moments, reproduces the experimentally observed power law scaling of bulk conductivity above threshold.

preprint2012arXiv

Common dependence on earthquake magnitudes for the trapped particles bursts approaching the earthquake

Trapped particles bursts have long been observed to be frequently occurred several hours before earthquakes, especially for strong earthquakes, from several space experiments during past decades. However, the validity of earthquake origin of particles bursts events is still unsolved. In this paper, we firstly reported the frequency distribution and time evolution of particles bursts within different time windows centered around earthquakes for various magnitudes. The results showed nearly the same systematic dependence of particle bursts frequency on earthquake magnitude and characteristic time decay behavior of average number of particles bursts for various magnitudes. These findings should strengthen the validity of earthquake origin of particles bursts and further understanding of particles bursts as possible precursor of earthquake.

preprint2012arXiv

New Crosstalk Avoidance Codes Based on a Novel Pattern Classification

The crosstalk delay associated with global on-chip interconnects becomes more severe in deep submicron technology, and hence can greatly affect the overall system performance. Based on a delay model proposed by Sotiriadis et al., transition patterns over a bus can be classified according to their delays. Using this classification, crosstalk avoidance codes (CACs) have been proposed to alleviate the crosstalk delays by restricting the transition patterns on a bus. In this paper, we first propose a new classification of transition patterns, and then devise a new family of CACs based on this classification. In comparison to the previous classification, our classification has more classes and the delays of its classes do not overlap, both leading to more accurate control of delays. Our new family of CACs includes some previously proposed codes as well as new codes with reduced delays and improved throughput. Thus, this new family of crosstalk avoidance codes provides a wider variety of tradeoffs between bus delay and efficiency. Finally, since our analytical approach to the classification and CACs treats the technology-dependent parameters as variables, our approach can be easily adapted to a wide variety of technology.

preprint2011arXiv

Experimental study of THGEM detector with mini-rim

The gas gain and energy resolution of single and double THGEM detectors (5{\times}5cm2 effective area) with mini-rims (rim is less than 10μm) were studied. The maximum gain can reach 5{\times}103 and 2{\times}105 for single and double THGEM respectively, while the energy resolution of 5.9 keV X-ray varied from 18% to 28% for both single and double THGEM detectors of different hole sizes and thicknesses.All the experiments were investigated in mixture of noble gases(argon,neon) and small content of other gases(iso-butane,methane) at atmospheric pressure.

preprint2011arXiv

Study of the NWC electrons belt observed on DEMETER Satellite

We analyzed observation data collected by the Instrument for the Detection of Particles (IDP) on board of DEMETER satellite during the period of total seventeen months in 2007 and 2008. In the meantime, the VLF transmitter located at NWC ground station was shutdown for seven months and working for total ten months. Our analysis, for the first time, revealed in details the transient properties of the space electrons induced by the man-made VLF wave emitted by the transmitter at NWC. First, we mapped the electron flux distribution and figured out the special range what the NWC belt covered. Then we investigated the NWC electron spectrograms in a wide range of McIlwain parameter (up to L=3.0). Finally, we obtained the averaged energy spectrum of the NWC electrons within the drift loss-cone, and compared the difference during the observations between daytime and nighttime. Our results proved the fact that the VLF emissions from NWC transmitter created momentary electron enhancement with fluxes up to 3 orders of magnitude. These electrons are distributed in the region of 180 degree in longitude and 1.6 ~ 1.9 of L shell. In addition, the VLF emission induced either enhancement or loss of electrons in higher magnetic shells up to L=3, and the maximum loss was up to 60% of the original value. The energy spectra of these electrons revealed that the enhancement during the NWC daytime are more attenuated than those in the NWC nighttime, and that the shape and the cutoff energy of the spectra are also quite different. We will present the results of our analysis, compare it with previous studies, and discuss the agreement of our results with the theory of wave-particle interaction.

Feng Shi

What is connected

Connect this record

See the researcher in context

Building this map preview

37 published item(s)

KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta

An Extended Halo-based Group/Cluster finder: application to the DESI legacy imaging surveys DR8

An improved approach to manufacture CNT reinforced magnesium AZ91 composites with increased strength and ductility

A novel multiple instance learning framework for COVID-19 severity assessment via data augmentation and self-supervised learning

Adaptive Feature Selection Guided Deep Forest for COVID-19 Classification with Chest CT

Community Detection for Hypergraph Networks via Regularized Tensor Power Iteration

Dual-Sampling Attention Network for Diagnosis of COVID-19 from Community Acquired Pneumonia

HIR4: cosmology from a simulated neutral hydrogen full sky using Horizon Run 4

Hypergraph Learning for Identification of COVID-19 with CT Imaging

Large-Scale Screening of COVID-19 from Community Acquired Pneumonia using Infection Size-Aware Classification

MixPUL: Consistency-based Augmentation for Positive and Unlabeled Learning

MRI Super-Resolution with GAN and 3D Multi-Level DenseNet: Smaller, Faster, and Better

Review of Artificial Intelligence Techniques in Imaging Data Acquisition, Segmentation and Diagnosis for COVID-19

Runtime Performances of Randomized Search Heuristics for the Dynamic Weighted Vertex Cover Problem

Science and Technology Advance through Surprise

Severity Assessment of Coronavirus Disease 2019 (COVID-19) Using Quantitative Features from Chest CT Images

Synergistic Learning of Lung Lobe Segmentation and Hierarchical Multi-Instance Classification for Automated Severity Assessment of COVID-19 in CT Images

DS-GCNs: Connectome Classification Using Dynamic Spectral Graph Convolution Networks with Assistant Task Training

An empirical model to form and evolve galaxies in dark matter halos

Galaxy groups in the 2MASS Redshift Survey

Interception in Distance-Vector Routing Networks

Mapping the real space distributions of galaxies in SDSS DR7: I. Two Point Correlation Functions

Parameterized Algorithms for the Maximum Agreement Forest Problem on Multiple Rooted Multifurcating Trees

Parameterized Analysis of Multi-objective Evolutionary Algorithms and the Weighted Vertex Cover Problem

Transitivity reinforcement in the coevolving voter model

An Analytic Linear Accelerator Source Model for Monte Carlo Dose Calculations. I. Model Representation and Construction

An Analytic Linear Accelerator Source Model for Monte Carlo dose calculations. II. Model Utilization in a GPU-based Monte Carlo Package and Automatic Source Commissioning

Algorithms for Maximum Agreement Forest of Multiple General Trees

An Enhanced Multiway Sorting Network Based on n-Sorters

Gradient Boundary Histograms for Action Recognition

A multi-opinion evolving voter model with infinitely many phase transitions

A New Splitting Method for Time-dependent Convection-dominated Diffusion Problems

Percolation-induced exponential scaling in the large current tails of random resistor networks

Common dependence on earthquake magnitudes for the trapped particles bursts approaching the earthquake

New Crosstalk Avoidance Codes Based on a Novel Pattern Classification

Experimental study of THGEM detector with mini-rim

Study of the NWC electrons belt observed on DEMETER Satellite