Source author record

Jinhuan Wang

Jinhuan Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Social and Information Networks math-ph math.MP Biomolecules Cryptography and Security Data Structures and Algorithms Machine Learning math.AP math.OC Molecular Networks physics.soc-ph

Catalog footprint

What is connected

9works

11topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2024arXiv

Multi-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, Geometry

Molecular property prediction refers to the task of labeling molecules with some biochemical properties, playing a pivotal role in the drug discovery and design process. Recently, with the advancement of machine learning, deep learning-based molecular property prediction has emerged as a solution to the resource-intensive nature of traditional methods, garnering significant attention. Among them, molecular representation learning is the key factor for molecular property prediction performance. And there are lots of sequence-based, graph-based, and geometry-based methods that have been proposed. However, the majority of existing studies focus solely on one modality for learning molecular representations, failing to comprehensively capture molecular characteristics and information. In this paper, a novel multi-modal representation learning model, which integrates the sequence, graph, and geometry characteristics, is proposed for molecular property prediction, called SGGRL. Specifically, we design a fusion layer to fusion the representation of different modalities. Furthermore, to ensure consistency across modalities, SGGRL is trained to maximize the similarity of representations for the same molecule while minimizing similarity for different molecules. To verify the effectiveness of SGGRL, seven molecular datasets, and several baselines are used for evaluation and comparison. The experimental results demonstrate that SGGRL consistently outperforms the baselines in most cases. This further underscores the capability of SGGRL to comprehensively capture molecular information. Overall, the proposed SGGRL model showcases its potential to revolutionize molecular property prediction by leveraging multi-modal representation learning to extract diverse and comprehensive molecular insights. Our code is released at https://github.com/Vencent-Won/SGGRL.

preprint2022arXiv

Null Model-Based Data Augmentation for Graph Classification

In network science, the null model is typically used to generate a series of graphs based on randomization as a term of comparison to verify whether a network in question displays some non-trivial features such as community structure. Since such non-trivial features play a significant role in graph classification, the null model could be useful for network data augmentation to enhance classification performance. In this paper, we propose a novel technique that combines the null model with data augmentation for graph classification. Moreover, we propose four standard null model-based augmentation methods and four approximate null model-based augmentation methods to verify and improve the performance of our graph classification technique. Our experiments demonstrate that the proposed augmentation technique has significantly achieved general improvement on the tested datasets. In addition, we find that the standard null model-based augmentation methods always outperform the approximate ones, depending on the design mechanisms of the null models. Our results indicate that the choice of non-trivial features is significant for increasing the performance of augmentation models for different network structures, which also provides a new perspective of data augmentation for studying various graph classification methods.

preprint2022arXiv

Targeted k-node Collapse Problem: Towards Understanding the Robustness of Local k-core Structure

The concept of k-core, which indicates the largest induced subgraph where each node has k or more neighbors, plays a significant role in measuring the cohesiveness and the engagement of a network, and it is exploited in diverse applications, e.g., network analysis, anomaly detection, community detection, etc. Recent works have demonstrated the vulnerability of k-core under malicious perturbations which focuses on removing the minimal number of edges to make a whole k-core structure collapse. However, to the best of our knowledge, there is no existing research concentrating on how many edges should be removed at least to make an arbitrary node in k-core collapse. Therefore, in this paper, we make the first attempt to study the Targeted k-node Collapse Problem (TNCP) with four novel contributions. Firstly, we offer the general definition of TNCP problem with the proof of its NP-hardness. Secondly, in order to address the TNCP problem, we propose a heuristic algorithm named TNC and its improved version named ATNC for implementations on large-scale networks. After that, the experiments on 16 real-world networks across various domains verify the superiority of our proposed algorithms over 4 baseline methods along with detailed comparisons and analyses. Finally, the significance of TNCP problem for precisely evaluating the resilience of k-core structures in networks is validated.

preprint2022arXiv

TSGN: Transaction Subgraph Networks Assisting Phishing Detection in Ethereum

Due to the decentralized and public nature of the Blockchain ecosystem, the malicious activities on the Ethereum platform impose immeasurable losses for the users. Existing phishing scam detection methods mostly rely only on the analysis of original transaction networks, which is difficult to dig deeply into the transaction patterns hidden in the network structure of transaction interaction. In this paper, we propose a \underline{T}ransaction \underline{S}ub\underline{G}raph \underline{N}etwork (TSGN) based phishing accounts identification framework for Ethereum. We first extract transaction subgraphs for target accounts and then expand these subgraphs into corresponding TSGNs based on the different mapping mechanisms. In order to make our model incorporate more important information about real transactions, we encode the transaction attributes into the modeling process of TSGNs, yielding two variants of TSGN, i.e., Directed-TSGN and Temporal-TSGN, which can be applied to the different attributed networks. Especially, by introducing TSGN into multi-edge transaction networks, the Multiple-TSGN model proposed is able to preserve the temporal transaction flow information and capture the significant topological pattern of phishing scams, while reducing the time complexity of modeling large-scale networks. Extensive experimental results show that TSGN models can provide more potential information to improve the performance of phishing detection by incorporating graph representation learning.

preprint2021arXiv

Sampling Subgraph Network with Application to Graph Classification

Graphs are naturally used to describe the structures of various real-world systems in biology, society, computer science etc., where subgraphs or motifs as basic blocks play an important role in function expression and information processing. However, existing research focuses on the basic statistics of certain motifs, largely ignoring the connection patterns among them. Recently, a subgraph network (SGN) model is proposed to study the potential structure among motifs, and it was found that the integration of SGN can enhance a series of graph classification methods. However, SGN model lacks diversity and is of quite high time complexity, making it difficult to widely apply in practice. In this paper, we introduce sampling strategies into SGN, and design a novel sampling subgraph network model, which is scale-controllable and of higher diversity. We also present a hierarchical feature fusion framework to integrate the structural features of diverse sampling SGNs, so as to improve the performance of graph classification. Extensive experiments demonstrate that, by comparing with the SGN model, our new model indeed has much lower time complexity (reduced by two orders of magnitude) and can better enhance a series of graph classification methods (doubling the performance enhancement).

preprint2020arXiv

Adversarial Attacks to Scale-Free Networks: Testing the Robustness of Physical Criteria

Adversarial attacks have been alerting the artificial intelligence community recently, since many machine learning algorithms were found vulnerable to malicious attacks. This paper studies adversarial attacks to scale-free networks to test their robustness in terms of statistical measures. In addition to the well-known random link rewiring (RLR) attack, two heuristic attacks are formulated and simulated: degree-addition-based link rewiring (DALR) and degree-interval-based link rewiring (DILR). These three strategies are applied to attack a number of strong scale-free networks of various sizes generated from the Barabási-Albert model. It is found that both DALR and DILR are more effective than RLR, in the sense that rewiring a smaller number of links can succeed in the same attack. However, DILR is as concealed as RLR in the sense that they both are constructed by introducing a relatively small number of changes on several typical structural properties such as average shortest path-length, average clustering coefficient, and average diagonal distance. The results of this paper suggest that to classify a network to be scale-free has to be very careful from the viewpoint of adversarial attack effects.

preprint2016arXiv

Game Theoretic Control of Multi-Agent Systems

Control of multi-agent systems via game theory is investigated. Assume a system level object is given, the utility functions for individual agents are designed to convert a multi-agent system into a potential game. First, for fixed topology, a necessary and sufficient condition is given to assure the existence of local information based utility functions. Then using local information the system can converge to a maximum point of the system object, which is a Nash equilibrium. It is also proved that a networked evolutionary potential game is a special case of this multi-agent system. Second, for time-varying topology, the state based potential game is utilized to design the optimal control. A strategy based Markov state transition process is proposed to assure the existence of state based potential function. As an extension of the fixed topology case, a necessary and sufficient condition for the existence of state depending utility functions using local information is also presented. It is also proved that using better reply with inertia strategy, the system converges to a maximum strategy of the state based system object, which is called the recurrent state equilibrium.

preprint2014arXiv

Vanishing Pressure Limit of Solutions to the Aw-Rascle Model for Modified Chaplygin gas

This paper analyzes the vanishing pressure limit of solutions to the Aw-Rascle model and the perturbed Aw-Rascle model for modified Chaplygin gas. Firstly, the Riemann problem of the Aw-Rascle model is solved constructively. A special delta shock wave in the limit of Riemann solutions is obtained. Secondly, the Riemann problem of the perturbed Aw-Rascle model is solved analytically. It is proved that, as the pressure tends to zero, any Riemann solution containing two shock wave tends to a delta shock solution to the transport equations; any Riemann solution containing two rarefaction wave tends to a two-contact-discontinuity solution to the transport equations and the nonvacuum intermediate state in between tends to a vacuum state.

preprint2013arXiv

Exact criterion for global existence and blow up to a degenerate Keller-Segel system

A degenerate Keller-Segel system with diffusion exponent $2n/(n+2)<m<2-\frac{2}{n}$ in multi dimension is studied. An exact criterion for global existence and blow up of solution is obtained. The estimates on $L^{2n/(n+2)}$ norm of the solution play important roles in our analysis. These estimates are closely related to the optimal constant in Haddy- Littlewood- Sobolev inequality. In the case of initial free energy less than a universal constant which depends on the inverse of total mass, there exists a constant such that if the $L^{2n/(n+2)}$ norm of initial data is less than this constant, then the weak solution exists globally; if the $L^{2n/(n+2)}$ norm of initial data is larger than the same constant, then the solution must blow-up in finite time. Our result shows that the total mass, which plays the deterministic role in two dimension case, might not be an appropriate criterion for existence and blow up discussion in multi-dimension, while the $L^{\frac{2n}{n+2}}$ norm of the initial data and the relation between initial free energy and initial mass are more important.

Jinhuan Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

Multi-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, Geometry

Null Model-Based Data Augmentation for Graph Classification

Targeted k-node Collapse Problem: Towards Understanding the Robustness of Local k-core Structure

TSGN: Transaction Subgraph Networks Assisting Phishing Detection in Ethereum

Sampling Subgraph Network with Application to Graph Classification

Adversarial Attacks to Scale-Free Networks: Testing the Robustness of Physical Criteria

Game Theoretic Control of Multi-Agent Systems

Vanishing Pressure Limit of Solutions to the Aw-Rascle Model for Modified Chaplygin gas

Exact criterion for global existence and blow up to a degenerate Keller-Segel system