Source author record

Menghan Wang

Menghan Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computational Geometry Machine Learning Information Retrieval Computer Vision Artificial Intelligence Computation and Language cond-mat.soft math.CO

Catalog footprint

What is connected

13works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2024arXiv

Explainable Recommender with Geometric Information Bottleneck

Explainable recommender systems can explain their recommendation decisions, enhancing user trust in the systems. Most explainable recommender systems either rely on human-annotated rationales to train models for explanation generation or leverage the attention mechanism to extract important text spans from reviews as explanations. The extracted rationales are often confined to an individual review and may fail to identify the implicit features beyond the review text. To avoid the expensive human annotation process and to generate explanations beyond individual reviews, we propose to incorporate a geometric prior learnt from user-item interactions into a variational network which infers latent factors from user-item reviews. The latent factors from an individual user-item pair can be used for both recommendation and explanation generation, which naturally inherit the global characteristics encoded in the prior knowledge. Experimental results on three e-commerce datasets show that our model significantly improves the interpretability of a variational recommender using the Wasserstein distance while achieving performance comparable to existing content-based recommender systems in terms of recommendation behaviours.

preprint2022arXiv

Learning to Hash Naturally Sorts

Learning to hash pictures a list-wise sorting problem. Its testing metrics, e.g., mean-average precision, count on a sorted candidate list ordered by pair-wise code similarity. However, scarcely does one train a deep hashing model with the sorted results end-to-end because of the non-differentiable nature of the sorting operation. This inconsistency in the objectives of training and test may lead to sub-optimal performance since the training loss often fails to reflect the actual retrieval metric. In this paper, we tackle this problem by introducing Naturally-Sorted Hashing (NSH). We sort the Hamming distances of samples' hash codes and accordingly gather their latent representations for self-supervised training. Thanks to the recent advances in differentiable sorting approximations, the hash head receives gradients from the sorter so that the hash encoder can be optimized along with the training procedure. Additionally, we describe a novel Sorted Noise-Contrastive Estimation (SortedNCE) loss that selectively picks positive and negative samples for contrastive learning, which allows NSH to mine data semantic relations during training in an unsupervised manner. Our extensive experiments show the proposed NSH model significantly outperforms the existing unsupervised hashing methods on three benchmarked datasets.

preprint2022arXiv

MP2: A Momentum Contrast Approach for Recommendation with Pointwise and Pairwise Learning

Binary pointwise labels (aka implicit feedback) are heavily leveraged by deep learning based recommendation algorithms nowadays. In this paper we discuss the limited expressiveness of these labels may fail to accommodate varying degrees of user preference, and thus lead to conflicts during model training, which we call annotation bias. To solve this issue, we find the soft-labeling property of pairwise labels could be utilized to alleviate the bias of pointwise labels. To this end, we propose a momentum contrast framework (MP2) that combines pointwise and pairwise learning for recommendation. MP2 has a three-tower network structure: one user network and two item networks. The two item networks are used for computing pointwise and pairwise loss respectively. To alleviate the influence of the annotation bias, we perform a momentum update to ensure a consistent item representation. Extensive experiments on real-world datasets demonstrate the superiority of our method against state-of-the-art recommendation algorithms.

preprint2022arXiv

You Never Cluster Alone

Recent advances in self-supervised learning with instance-level contrastive objectives facilitate unsupervised clustering. However, a standalone datum is not perceiving the context of the holistic cluster, and may undergo sub-optimal assignment. In this paper, we extend the mainstream contrastive learning paradigm to a cluster-level scheme, where all the data subjected to the same cluster contribute to a unified representation that encodes the context of each data group. Contrastive learning with this representation then rewards the assignment of each datum. To implement this vision, we propose twin-contrast clustering (TCC). We define a set of categorical variables as clustering assignment confidence, which links the instance-level learning track with the cluster-level one. On one hand, with the corresponding assignment variables being the weight, a weighted aggregation along the data points implements the set representation of a cluster. We further propose heuristic cluster augmentation equivalents to enable cluster-level contrastive learning. On the other hand, we derive the evidence lower-bound of the instance-level contrastive objective with the assignments. By reparametrizing the assignment variables, TCC is trained end-to-end, requiring no alternating steps. Extensive experiments show that TCC outperforms the state-of-the-art on challenging benchmarks.

preprint2020arXiv

M2GRL: A Multi-task Multi-view Graph Representation Learning Framework for Web-scale Recommender Systems

Combining graph representation learning with multi-view data (side information) for recommendation is a trend in industry. Most existing methods can be categorized as \emph{multi-view representation fusion}; they first build one graph and then integrate multi-view data into a single compact representation for each node in the graph. However, these methods are raising concerns in both engineering and algorithm aspects: 1) multi-view data are abundant and informative in industry and may exceed the capacity of one single vector, and 2) inductive bias may be introduced as multi-view data are often from different distributions. In this paper, we use a \emph{multi-view representation alignment} approach to address this issue. Particularly, we propose a multi-task multi-view graph representation learning framework (M2GRL) to learn node representations from multi-view graphs for web-scale recommender systems. M2GRL constructs one graph for each single-view data, learns multiple separate representations from multiple graphs, and performs alignment to model cross-view relations. M2GRL chooses a multi-task learning paradigm to learn intra-view representations and cross-view relations jointly. Besides, M2GRL applies homoscedastic uncertainty to adaptively tune the loss weights of tasks during training. We deploy M2GRL at Taobao and train it on 57 billion examples. According to offline metrics and online A/B tests, M2GRL significantly outperforms other state-of-the-art algorithms. Further exploration on diversity recommendation in Taobao shows the effectiveness of utilizing multiple representations produced by \method{}, which we argue is a promising direction for various industrial recommendation tasks of different focus.

preprint2016arXiv

Combinatorial rigidity of Incidence systems and Application to Dictionary learning

Given a hypergraph $H$ with $m$ hyperedges and a set $Q$ of $m$ \emph{pinning subspaces}, i.e.\ globally fixed subspaces in Euclidean space $\mathbb{R}^d$, a \emph{pinned subspace-incidence system} is the pair $(H, Q)$, with the constraint that each pinning subspace in $Q$ is contained in the subspace spanned by the point realizations in $\mathbb{R}^d$ of vertices of the corresponding hyperedge of $H$. This paper provides a combinatorial characterization of pinned subspace-incidence systems that are \emph{minimally rigid}, i.e.\ those systems that are guaranteed to generically yield a locally unique realization. Pinned subspace-incidence systems have applications in the \emph{Dictionary Learning (aka sparse coding)} problem, i.e.\ the problem of obtaining a sparse representation of a given set of data vectors by learning \emph{dictionary vectors} upon which the data vectors can be written as sparse linear combinations. Viewing the dictionary vectors from a geometry perspective as the spanning set of a subspace arrangement, the result gives a tight bound on the number of dictionary vectors for sufficiently randomly chosen data vectors, and gives a way of constructing a dictionary that meets the bound. For less stringent restrictions on data, but a natural modification of the dictionary learning problem, a further dictionary learning algorithm is provided. Although there are recent rigidity based approaches for low rank matrix completion, we are unaware of prior application of combinatorial rigidity techniques in the setting of Dictionary Learning. We also provide a systematic classification of problems related to dictionary learning together with various algorithms, their assumptions and performance.

preprint2016arXiv

Symmetry in Sphere-based Assembly Configuration Spaces

Many remarkably robust, rapid and spontaneous self-assembly phenomena in nature can be modeled geometrically starting from a collection of rigid bunches of spheres. This paper highlights the role of symmetry in sphere-based assembly processes. Since spheres within bunches could be identical and bunches could be identical as well, the underlying symmetry groups could be of large order that grows with the number of participating spheres and bunches. Thus, understanding symmetries and associated isomorphism classes of microstates correspond to various types of macrostates can significantly reduce the complexity of computing entropy and free energy, as well as paths and kinetics, in high dimensional configuration spaces. In addition, a precise understanding of symmetries is crucial for giving provable guarantees of algorithmic accuracy and efficiency in such computations. In particular, this may aid in predicting crucial assembly-driving interactions. This is a primarily expository paper that develops a novel, original framework for dealing with symmetries in configuration spaces of assembling spheres with the following goals. (1) We give new, formal definitions of various concepts relevant to sphere-based assembly that occur in previous work, and in turn, formal definitions of their relevant symmetry groups leading to the main theorem concerning their symmetries. These previously developed concepts include, for example, (a) assembly configuration spaces, (b) stratification of assembly configuration space into regions defined by active constraint graphs, (c) paths through the configurational regions, and (d) coarse assembly pathways. (2) We demonstrate the new symmetry concepts to compute sizes and numbers of orbits in two example settings appearing in previous work. (3) We give formal statements of a variety of open problems and challenges using the new conceptual definitions.

preprint2015arXiv

An Incidence Geometry approach to Dictionary Learning

We study the Dictionary Learning (aka Sparse Coding) problem of obtaining a sparse representation of data points, by learning \emph{dictionary vectors} upon which the data points can be written as sparse linear combinations. We view this problem from a geometry perspective as the spanning set of a subspace arrangement, and focus on understanding the case when the underlying hypergraph of the subspace arrangement is specified. For this Fitted Dictionary Learning problem, we completely characterize the combinatorics of the associated subspace arrangements (i.e.\ their underlying hypergraphs). Specifically, a combinatorial rigidity-type theorem is proven for a type of geometric incidence system. The theorem characterizes the hypergraphs of subspace arrangements that generically yield (a) at least one dictionary (b) a locally unique dictionary (i.e.\ at most a finite number of isolated dictionaries) of the specified size. We are unaware of prior application of combinatorial rigidity techniques in the setting of Dictionary Learning, or even in machine learning. We also provide a systematic classification of problems related to Dictionary Learning together with various algorithms, their assumptions and performance.

preprint2015arXiv

Combinatorial rigidity and independence of generalized pinned subspace-incidence constraint systems

Given a hypergraph $H$ with $m$ hyperedges and a set $X$ of $m$ \emph{pins}, i.e.\ globally fixed subspaces in Euclidean space $\mathbb{R}^d$, a \emph{pinned subspace-incidence system} is the pair $(H, X)$, with the constraint that each pin in $X$ lies on the subspace spanned by the point realizations in $\mathbb{R}^d$ of vertices of the corresponding hyperedge of $H$. We are interested in combinatorial characterization of pinned subspace-incidence systems that are \emph{minimally rigid}, i.e.\ those systems that are guaranteed to generically yield a locally unique realization. As is customary, this is accompanied by a characterization of generic independence as well as rigidity. In a previous paper \cite{sitharam2014incidence}, we used pinned subspace-incidence systems towards solving the \emph{fitted dictionary learning} problem, i.e.\ dictionary learning with specified underlying hypergraph, and gave a combinatorial characterization of minimal rigidity for a more restricted version of pinned subspace-incidence system, with $H$ being a uniform hypergraph and pins in $X$ being 1-dimension subspaces. Moreover in a recent paper \cite{Baker2015}, the special case of pinned line incidence systems was used to model biomaterials such as cellulose and collagen fibrils in cell walls. In this paper, we extend the combinatorial characterization to general pinned subspace-incidence systems, with $H$ being a non-uniform hypergraph and pins in $X$ being subspaces with arbitrary dimension. As there are generally many data points per subspace in a dictionary learning problem, which can only be modeled with pins of dimension larger than $1$, such an extension enables application to a much larger class of fitted dictionary learning problems.

preprint2015arXiv

Optimal Decomposition and Recombination of Isostatic Geometric Constraint Systems for Designing Layered Materials

Optimal recursive decomposition (or DR-planning) is crucial for analyzing, designing, solving or finding realizations of geometric constraint sytems. While the optimal DR-planning problem is NP-hard even for general 2D bar-joint constraint systems, we describe an O(n^3) algorithm for a broad class of constraint systems that are isostatic or underconstrained. The algorithm achieves optimality by using the new notion of a canonical DR-plan that also meets various desirable, previously studied criteria. In addition, we leverage recent results on Cayley configuration spaces to show that the indecomposable systems---that are solved at the nodes of the optimal DR-plan by recombining solutions to child systems---can be minimally modified to become decomposable and have a small DR-plan, leading to efficient realization algorithms. We show formal connections to well-known problems such as completion of underconstrained systems. Well suited to these methods are classes of constraint systems that can be used to efficiently model, design and analyze quasi-uniform (aperiodic) and self-similar, layered material structures. We formally illustrate by modeling silica bilayers as body-hyperpin systems and cross-linking microfibrils as pinned line-incidence systems. A software implementation of our algorithms and videos demonstrating the software are publicly available online (visit http://cise.ufl.edu/~tbaker/drp/index.html.)

preprint2014arXiv

Cayley Analysis of Mechanism Configuration Spaces using CayMos: Software Functionalities and Architecture

For a common class of 2D mechanisms called 1-dof tree decomposable linkages, we present a software CayMos which uses new theoretical results to implement efficient algorithmic solutions for: (a) meaningfully representing and visualizing the connected components in the Euclidean realization space; (b) finding a path of continuous motion between two realizations in the same connected component, with or without restricting the realization type (sometimes called orientation type); (c) finding two ``closest'' realizations in different connected components.

preprint2014arXiv

Dynamic Decision Process Modeling and Relation-line Handling in Distributed Cooperative Modeling System

The Distributed Cooperative Modeling System (DCMS) solves complex decision problems involving a lot of participants with different viewpoints by network based distributed modeling and multi-template aggregation. This thesis aims at extending the system with support for dynamic decision making process. First, the thesis presents a discussion of characteristics and optimal policy finding Markov Decision Process as well as a brief introduction to dynamic Bayesian decision network, which is inherently equal to MDP. After that, discussion and implementation of prediction in Markov process for both discrete and continuous random variable are given, as well as several different kinds of correlation analysis among multiple indices which could help decision-makers to realize the interaction of indices and design appropriate policy. Appending history data of Macau industry, as the foundation of extending DCMS, is introduced. Additional works include rearrangement of graphical class hierarchy in DCMS, which in turn allows convenient implementation of curve relation-line, which makes template modeling clearer and friendlier.

preprint2012arXiv

Cayley Configuration Spaces of 1-dof Tree-decomposable Linkages, Part II: Combinatorial Characterization of Complexity

We continue to study Cayley configuration spaces of 1-dof linkages in 2D begun in Part I of this paper, i.e. the set of attainable lengths for a non-edge. In Part II, we focus on the algebraic complexity of describing endpoints of the intervals in the set, i.e., the Cayley complexity. Specifically, We focus on Cayley configuration spaces of a natural class of 1-dof linkages, called 1-dof tree-decomposable linkages. The underlying graphs G satisfy the following: for some base non-edge f, G \cup f is quadratic-radically solvable (QRS), meaning that G \cup f is minimally rigid, and given lengths \bar{l} of all edges, the corresponding linkage (G \cup f, \bar{l}) can be simply realized by ruler and compass starting from f. It is clear that the Cayley complexity only depends on the graph G and possibly the non-edge f. Here we ask whether the Cayley complexity depends on the choice of a base non-edge f. We answer this question in the negative, thereby showing that low Cayley complexity is a property of the graph G (independent of the non-edge f). Then, we give a simple characterization of graphs with low Cayley complexity, leading to an efficient algorithmic characterization, i.e. an efficient algorithm for recognizing such graphs. Next, we show a surprising result that (graph) planarity is equivalent to low Cayley complexity for a natural subclass of 1-dof triangle-decomposable graphs. While this is a finite forbidden minor graph characterization of low Cayley complexity, we provide counterexamples showing impossibility of such finite forbidden minor characterizations when the above subclass is enlarged.

Menghan Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

Explainable Recommender with Geometric Information Bottleneck

Learning to Hash Naturally Sorts

MP2: A Momentum Contrast Approach for Recommendation with Pointwise and Pairwise Learning

You Never Cluster Alone

M2GRL: A Multi-task Multi-view Graph Representation Learning Framework for Web-scale Recommender Systems

Combinatorial rigidity of Incidence systems and Application to Dictionary learning

Symmetry in Sphere-based Assembly Configuration Spaces

An Incidence Geometry approach to Dictionary Learning

Combinatorial rigidity and independence of generalized pinned subspace-incidence constraint systems

Optimal Decomposition and Recombination of Isostatic Geometric Constraint Systems for Designing Layered Materials

Cayley Analysis of Mechanism Configuration Spaces using CayMos: Software Functionalities and Architecture

Dynamic Decision Process Modeling and Relation-line Handling in Distributed Cooperative Modeling System

Cayley Configuration Spaces of 1-dof Tree-decomposable Linkages, Part II: Combinatorial Characterization of Complexity