Source author record

Boyu Zhang

Boyu Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision math.GT Machine Learning math.DS math.SG Neural and Evolutionary Computing Artificial Intelligence Biomolecules Computational Engineering, Finance, and Science cs.CY Distributed, Parallel, and Cluster Computing Information Retrieval math.DG math.ST Statistics Theory

Catalog footprint

What is connected

15works

15topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2023arXiv

Data-aware customization of activation functions reduces neural network error

Activation functions play critical roles in neural networks, yet current off-the-shelf neural networks pay little attention to the specific choice of activation functions used. Here we show that data-aware customization of activation functions can result in striking reductions in neural network error. We first give a simple linear algebraic explanation of the role of activation functions in neural networks; then, through connection with the Diaconis-Shahshahani Approximation Theorem, we propose a set of criteria for good activation functions. As a case study, we consider regression tasks with a partially exchangeable target function, \emph{i.e.} $f(u,v,w)=f(v,u,w)$ for $u,v\in \mathbb{R}^d$ and $w\in \mathbb{R}^k$, and prove that for such a target function, using an even activation function in at least one of the layers guarantees that the prediction preserves partial exchangeability for best performance. Since even activation functions are seldom used in practice, we designed the ``seagull'' even activation function $\log(1+x^2)$ according to our criteria. Empirical testing on over two dozen 9-25 dimensional examples with different local smoothness, curvature, and degree of exchangeability revealed that a simple substitution with the ``seagull'' activation function in an already-refined neural network can lead to an order-of-magnitude reduction in error. This improvement was most pronounced when the activation function substitution was applied to the layer in which the exchangeable variables are connected for the first time. While the improvement is greatest for low-dimensional data, experiments on the CIFAR10 image classification dataset showed that use of ``seagull'' can reduce error even for high-dimensional cases. These results collectively highlight the potential of customizing activation functions as a general approach to improve neural network performance.

preprint2023arXiv

Detachable Novel Views Synthesis of Dynamic Scenes Using Distribution-Driven Neural Radiance Fields

Representing and synthesizing novel views in real-world dynamic scenes from casual monocular videos is a long-standing problem. Existing solutions typically approach dynamic scenes by applying geometry techniques or utilizing temporal information between several adjacent frames without considering the underlying background distribution in the entire scene or the transmittance over the ray dimension, limiting their performance on static and occlusion areas. Our approach $\textbf{D}$istribution-$\textbf{D}$riven neural radiance fields offers high-quality view synthesis and a 3D solution to $\textbf{D}$etach the background from the entire $\textbf{D}$ynamic scene, which is called $\text{D}^4$NeRF. Specifically, it employs a neural representation to capture the scene distribution in the static background and a 6D-input NeRF to represent dynamic objects, respectively. Each ray sample is given an additional occlusion weight to indicate the transmittance lying in the static and dynamic components. We evaluate $\text{D}^4$NeRF on public dynamic scenes and our urban driving scenes acquired from an autonomous-driving dataset. Extensive experiments demonstrate that our approach outperforms previous methods in rendering texture details and motion areas while also producing a clean static background. Our code will be released at https://github.com/Luciferbobo/D4NeRF.

preprint2022arXiv

A note on the existence of U-cyclic elements in periodic Floer homology

Edtmair-Hutchings have recently defined, using periodic Floer homology, a U-cycle property for Hamiltonian isotopy classes of area-preserving diffeomorphisms of closed surfaces. They show that every Hamiltonian isotopy class satisfying the U-cycle property satisfies the smooth closing lemma and also satisfies a kind of Weyl law involving the actions of certain periodic points; they show that every rational isotopy class on the two-torus satisfies the U-cycle property. It seems that in general, not much is known about the U-module structure on PFH. Here we consider a version of Seiberg-Witten-Floer cohomology which is known by the work of Lee-Taubes to be isomorphic, as a U-module, to the periodic Floer homology in sufficiently high degree. We show that the analogous U-cycle property holds for every rational Hamiltonian isotopy class on any closed surface and, more generally, for any non-torsion spin-c structure. On the other hand, we also show that a rational isotopy class may contain elements that are not U-cyclic. By the Lee-Taubes isomorphism, the same results hold for PFH. Our results are some of the first computations concerning the U-module structure on these theories.

preprint2022arXiv

Auto-Encoding Score Distribution Regression for Action Quality Assessment

The action quality assessment (AQA) of videos is a challenging vision task since the relation between videos and action scores is difficult to model. Thus, AQA has been widely studied in the literature. Traditionally, AQA is treated as a regression problem to learn the underlying mappings between videos and action scores. But previous methods ignored data uncertainty in AQA dataset. To address aleatoric uncertainty, we further develop a plug-and-play module Distribution Auto-Encoder (DAE). Specifically, it encodes videos into distributions and uses the reparameterization trick in variational auto-encoders (VAE) to sample scores, which establishes a more accurate mapping between videos and scores. Meanwhile, a likelihood loss is used to learn the uncertainty parameters. We plug our DAE approach into MUSDL and CoRe. Experimental results on public datasets demonstrate that our method achieves state-of-the-art on AQA-7, MTL-AQA, and JIGSAWS datasets. Our code is available at https://github.com/InfoX-SEU/DAE-AQA.

preprint2022arXiv

Instanton homology and knot detection on thickened surfaces

Suppose $Σ$ is a compact oriented surface (possibly with boundary) that has genus zero, and L is a link in the interior of $(-1,1)\timesΣ$. We prove that the Asaeda-Przytycki-Sikora (APS) homology of L has rank 2 if and only if L is isotopic to an embedded knot in $\{0\}\timesΣ$. As a consequence, the APS homology detects the unknot in $(-1,1)\timesΣ$. This is the first detection result for generalized Khovanov homology that is valid on an infinite family of manifolds, and it partially solves a conjecture in arxiv:2005.12863. Our proof is different from the previous detection results obtained by instanton homology because in this case, the second page of Kronheimer-Mrowka's spectral sequence is not isomorphic to the APS homology. We also characterize all links in product manifolds that have minimal sutured instanton homology, which may be of independent interest.

preprint2022arXiv

Periodic Floer homology and the smooth closing lemma for area-preserving surface diffeomorphisms

We prove a very general Weyl-type law for Periodic Floer Homology, estimating the action of twisted Periodic Floer Homology classes over essentially any coefficient ring in terms of the grading and the degree, and recovering the Calabi invariant of Hamiltonians in the limit. We also prove a strong non-vanishing result, showing that under a monotonicity assumption which holds for a dense set of maps, the Periodic Floer Homology has infinite rank. An application of these results yields that a $C^{\infty}$-generic area-preserving diffeomorphism of a closed surface has a dense set of periodic points. This settles Smale's tenth problem in the special case of area-preserving diffeomorphisms of closed surfaces.

preprint2021arXiv

On the compactness problem for a family of generalized Seiberg-Witten equations in dimension three

We prove an abstract compactness theorem for a family of generalized Seiberg-Witten equations in dimension three. This result recovers Taubes' compactness theorem for stable flat $\mathbf{P}\mathrm{SL}_2(\mathbf{C})$-connections as well as the compactness theorem for Seiberg-Witten equations with multiple spinors. Furthermore, this result implies a compactness theorem for the ADHM$_{1,2}$ Seiberg-Witten equation, which partially verifies a conjecture by Doan and Walpuski.

preprint2020arXiv

A Novel DNN Training Framework via Data Sampling and Multi-Task Optimization

Conventional DNN training paradigms typically rely on one training set and one validation set, obtained by partitioning an annotated dataset used for training, namely gross training set, in a certain way. The training set is used for training the model while the validation set is used to estimate the generalization performance of the trained model as the training proceeds to avoid over-fitting. There exist two major issues in this paradigm. Firstly, the validation set may hardly guarantee an unbiased estimate of generalization performance due to potential mismatching with test data. Secondly, training a DNN corresponds to solve a complex optimization problem, which is prone to getting trapped into inferior local optima and thus leads to undesired training results. To address these issues, we propose a novel DNN training framework. It generates multiple pairs of training and validation sets from the gross training set via random splitting, trains a DNN model of a pre-specified structure on each pair while making the useful knowledge (e.g., promising network parameters) obtained from one model training process to be transferred to other model training processes via multi-task optimization, and outputs the best, among all trained models, which has the overall best performance across the validation sets from all pairs. The knowledge transfer mechanism featured in this new framework can not only enhance training effectiveness by helping the model training process to escape from local optima but also improve on generalization performance via implicit regularization imposed on one model training process from other model training processes. We implement the proposed framework, parallelize the implementation on a GPU cluster, and apply it to train several widely used DNN models. Experimental results demonstrate the superiority of the proposed framework over the conventional training paradigm.

preprint2020arXiv

A Use of Even Activation Functions in Neural Networks

Despite broad interest in applying deep learning techniques to scientific discovery, learning interpretable formulas that accurately describe scientific data is very challenging because of the vast landscape of possible functions and the "black box" nature of deep neural networks. The key to success is to effectively integrate existing knowledge or hypotheses about the underlying structure of the data into the architecture of deep learning models to guide machine learning. Currently, such integration is commonly done through customization of the loss functions. Here we propose an alternative approach to integrate existing knowledge or hypotheses of data structure by constructing custom activation functions that reflect this structure. Specifically, we study a common case when the multivariate target function $f$ to be learned from the data is partially exchangeable, \emph{i.e.} $f(u,v,w)=f(v,u,w)$ for $u,v\in \mathbb{R}^d$. For instance, these conditions are satisfied for the classification of images that is invariant under left-right flipping. Through theoretical proof and experimental verification, we show that using an even activation function in one of the fully connected layers improves neural network performance. In our experimental 9-dimensional regression problems, replacing one of the non-symmetric activation functions with the designated "Seagull" activation function $\log(1+x^2)$ results in substantial improvement in network performance. Surprisingly, even activation functions are seldom used in neural networks. Our results suggest that customized activation functions have great potential in neural networks.

preprint2020arXiv

Instantons and Khovanov skein homology on $I\times T^2$

Asaeda, Przytycki and Sikora defined a generalization of Khovanov homology for links in $I$-bundles over compact surfaces. We prove that for a link $L\subset (-1,1)\times T^2$, the Asaeda-Przytycki-Sikora homology of $L$ has rank $2$ with $\mathbb{Z}/2$-coefficients if and only if $L$ is isotopic to an embedded knot in $\{0\}\times T^2$.

preprint2020arXiv

On links with Khovanov homology of small ranks

We classify all links whose Khovanov homology have ranks no greater than 8, and all three-component links whose Khovanov homology have ranks no greater than 12, where the coefficient ring is Z/2. The classification is based on the previous results of Kronheimer-Mrowka, Batson-Seed, Baldwin-Sivek, and the authors.

preprint2020arXiv

The Relationship between Deteriorating Mental Health Conditions and Longitudinal Behavioral Changes in Google and YouTube Usages among College Students in the United States during COVID-19: Observational Study

Mental health problems among the global population are worsened during the coronavirus disease (COVID-19). How individuals engage with online platforms such as Google Search and YouTube undergoes drastic shifts due to pandemic and subsequent lockdowns. Such ubiquitous daily behaviors on online platforms have the potential to capture and correlate with clinically alarming deteriorations in mental health profiles in a non-invasive manner. The goal of this study is to examine, among college students, the relationship between deteriorating mental health conditions and changes in user behaviors when engaging with Google Search and YouTube during COVID-19. This study recruited a cohort of 49 students from a U.S. college campus during January 2020 (prior to the pandemic) and measured the anxiety and depression levels of each participant. This study followed up with the same cohort during May 2020 (during the pandemic), and the anxiety and depression levels were assessed again. The longitudinal Google Search and YouTube history data were anonymized and collected. From individual-level Google Search and YouTube histories, we developed 5 signals that can quantify shifts in online behaviors during the pandemic. We then assessed the differences between groups with and without deteriorating mental health profiles in terms of these features. Significant features included late-night online activities, continuous usages, and time away from the internet, porn consumptions, and keywords associated with negative emotions, social activities, and personal affairs. Though further studies are required, our results demonstrated the feasibility of utilizing pervasive online data to establish non-invasive surveillance systems for mental health conditions that bypasses many disadvantages of existing screening methods.

preprint2020arXiv

Two detection results of Khovanov homology on links

We prove that Khovanov homology with Z/2-coefficients detects the link L7n1, and the union of a trefoil and its meridian.

preprint2018arXiv

Computer-Aided Knee Joint Magnetic Resonance Image Segmentation - A Survey

Osteoarthritis (OA) is one of the major health issues among the elderly population. MRI is the most popular technology to observe and evaluate the progress of OA course. However, the extreme labor cost of MRI analysis makes the process inefficient and expensive. Also, due to human error and subjective nature, the inter- and intra-observer variability is rather high. Computer-aided knee MRI segmentation is currently an active research field because it can alleviate doctors and radiologists from the time consuming and tedious job, and improve the diagnosis performance which has immense potential for both clinic and scientific research. In the past decades, researchers have investigated automatic/semi-automatic knee MRI segmentation methods extensively. However, to the best of our knowledge, there is no comprehensive survey paper in this field yet. In this survey paper, we classify the existing methods by their principles and discuss the current research status and point out the future research trend in-depth.

preprint2015arXiv

In-Situ Data Analysis of Protein Folding Trajectories

The transition from petascale to exascale computers is characterized by substantial changes in the computer architectures and technologies. The research community relying on computational simulations is being forced to revisit the algorithms for data generation and analysis due to various concerns, such as higher degrees of concurrency, deeper memory hierarchies, substantial I/O and communication constraints. Simulations today typically save all data to analyze later. Simulations at the exascale will require us to analyze data as it is generated and save only what is really needed for analysis, which must be performed predominately in-situ, i.e., executed sufficiently fast locally, limiting memory and disk usage, and avoiding the need to move large data across nodes. In this paper, we present a distributed method that enables in-situ data analysis for large protein folding trajectory datasets. Traditional trajectory analysis methods currently follow a centralized approach that moves the trajectory datasets to a centralized node and processes the data only after simulations have been completed. Our method, on the other hand, captures conformational information in-situ using local data only while reducing the storage space needed for the part of the trajectory under consideration. This method processes the input trajectory data in one pass, breaks from the centralized approach of traditional analysis, avoids the movement of trajectory data, and still builds the global knowledge on the formation of individual $α$-helices or $β$-strands as trajectory frames are generated.

Boyu Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

15 published item(s)

Data-aware customization of activation functions reduces neural network error

Detachable Novel Views Synthesis of Dynamic Scenes Using Distribution-Driven Neural Radiance Fields

A note on the existence of U-cyclic elements in periodic Floer homology

Auto-Encoding Score Distribution Regression for Action Quality Assessment

Instanton homology and knot detection on thickened surfaces

Periodic Floer homology and the smooth closing lemma for area-preserving surface diffeomorphisms

On the compactness problem for a family of generalized Seiberg-Witten equations in dimension three

A Novel DNN Training Framework via Data Sampling and Multi-Task Optimization

A Use of Even Activation Functions in Neural Networks

Instantons and Khovanov skein homology on $I\times T^2$

On links with Khovanov homology of small ranks

The Relationship between Deteriorating Mental Health Conditions and Longitudinal Behavioral Changes in Google and YouTube Usages among College Students in the United States during COVID-19: Observational Study

Two detection results of Khovanov homology on links

Computer-Aided Knee Joint Magnetic Resonance Image Segmentation - A Survey

In-Situ Data Analysis of Protein Folding Trajectories