Source author record

Jian Shen

Jian Shen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.mtrl-sci Artificial Intelligence cond-mat.str-el Machine Learning cond-mat.mes-hall math.CO Information Theory math.IT Computation and Language Computer Vision cs.CY eess.SP Multiagent Systems physics.ao-ph Social and Information Networks

Catalog footprint

What is connected

23works

15topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Curriculum Offline Imitation Learning

Offline reinforcement learning (RL) tasks require the agent to learn from a pre-collected dataset with no further interactions with the environment. Despite the potential to surpass the behavioral policies, RL-based methods are generally impractical due to the training instability and bootstrapping the extrapolation errors, which always require careful hyperparameter tuning via online evaluation. In contrast, offline imitation learning (IL) has no such issues since it learns the policy directly without estimating the value function by bootstrapping. However, IL is usually limited in the capability of the behavioral policy and tends to learn a mediocre behavior from the dataset collected by the mixture of policies. In this paper, we aim to take advantage of IL but mitigate such a drawback. Observing that behavior cloning is able to imitate neighboring policies with less data, we propose \textit{Curriculum Offline Imitation Learning (COIL)}, which utilizes an experience picking strategy for imitating from adaptive neighboring policies with a higher return, and improves the current policy along curriculum stages. On continuous control benchmarks, we compare COIL against both imitation-based and RL-based methods, showing that it not only avoids just learning a mediocre behavior on mixed datasets but is also even competitive with state-of-the-art offline RL methods.

preprint2022arXiv

Direct visualization of percolating metal-insulator transition in V2O3 using scanning microwave impedance microscopy

Using the extensively studied V2O3 as a prototype system, we investigate the role of percolation in metal-insulator transition (MIT). We apply scanning microwave impedance microscopy to directly determine the metallic phase fraction p and relate it to the macroscopic conductance G, which shows a sudden jump when p reaches the percolation threshold. Interestingly, the conductance G exhibits a hysteretic behavior against p, suggesting two different percolating processes upon cooling and warming. Based on our image analysis and model simulation, we ascribe such hysteretic behavior to different domain nucleation and growth processes between cooling and warming, which is likely caused by the decoupled structural and electronic transitions in V2O3 during MIT. Our work provides a microscopic view of how the interplay of structural and electronic degrees of freedom affects MIT in strongly correlated systems.

preprint2022arXiv

Influences of the dissipative topological edge state on quantized transport in MnBi2Te4

The beauty of quantum Hall (QH) effect is the metrological precision of Hall resistance quantization that originates from the topological edge states. Understanding the factors that lead to quantization breakdown not only provides important insights on the nature of the topological protection of these edge states, but is beneficial for device applications involving such quantized transport. In this work, we combine conventional transport and real space conductivity mapping to investigate whether the quantization breakdown is tied to the disappearance of edge state in the hotly studied MnBi2Te4 system. Our experimental results unambiguously show that topological edge state does exist when quantization breakdown occurs. Such edge state is dissipative in nature and could lead to a quantization breakdown due to its diffusive character causing overlapping with bulk and other edge states in real devices. Our findings bring attentions to issues that are generally inaccessible in the transport study of QH, but can play important roles in practical measurements and device applications.

preprint2022arXiv

Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts

This paper investigates the model-based methods in multi-agent reinforcement learning (MARL). We specify the dynamics sample complexity and the opponent sample complexity in MARL, and conduct a theoretic analysis of return discrepancy upper bound. To reduce the upper bound with the intention of low sample complexity during the whole learning process, we propose a novel decentralized model-based MARL method, named Adaptive Opponent-wise Rollout Policy Optimization (AORPO). In AORPO, each agent builds its multi-agent environment model, consisting of a dynamics model and multiple opponent models, and trains its policy with the adaptive opponent-wise rollout. We further prove the theoretic convergence of AORPO under reasonable assumptions. Empirical experiments on competitive and cooperative tasks demonstrate that AORPO can achieve improved sample efficiency with comparable asymptotic performance over the compared MARL methods.

preprint2022arXiv

On Effective Scheduling of Model-based Reinforcement Learning

Model-based reinforcement learning has attracted wide attention due to its superior sample efficiency. Despite its impressive success so far, it is still unclear how to appropriately schedule the important hyperparameters to achieve adequate performance, such as the real data ratio for policy optimization in Dyna-style model-based algorithms. In this paper, we first theoretically analyze the role of real data in policy training, which suggests that gradually increasing the ratio of real data yields better performance. Inspired by the analysis, we propose a framework named AutoMBPO to automatically schedule the real data ratio as well as other hyperparameters in training model-based policy optimization (MBPO) algorithm, a representative running case of model-based methods. On several continuous control tasks, the MBPO instance trained with hyperparameters scheduled by AutoMBPO can significantly surpass the original one, and the real data ratio schedule found by AutoMBPO shows consistency with our theoretical analysis.

preprint2022arXiv

Towards Return Parity in Markov Decision Processes

Algorithmic decisions made by machine learning models in high-stakes domains may have lasting impacts over time. However, naive applications of standard fairness criterion in static settings over temporal domains may lead to delayed and adverse effects. To understand the dynamics of performance disparity, we study a fairness problem in Markov decision processes (MDPs). Specifically, we propose return parity, a fairness notion that requires MDPs from different demographic groups that share the same state and action spaces to achieve approximately the same expected time-discounted rewards. We first provide a decomposition theorem for return disparity, which decomposes the return disparity of any two MDPs sharing the same state and action spaces into the distance between group-wise reward functions, the discrepancy of group policies, and the discrepancy between state visitation distributions induced by the group policies. Motivated by our decomposition theorem, we propose algorithms to mitigate return disparity via learning a shared group policy with state visitation distributional alignment using integral probability metrics. We conduct experiments to corroborate our results, showing that the proposed algorithm can successfully close the disparity gap while maintaining the performance of policies on two real-world recommender system benchmark datasets.

preprint2020arXiv

Depression Detection using Resting State Three-channel EEG Signal

In universal environment, a patient-friendly inexpensive method is needed to realize the early diagnosis of depression, which is believed to be an effective way to reduce the mortality of depression. The purpose of this study is only to collect EEG signal from three electrodes Fp1, Fpz and Fp2, then the linear and nonlinear features of EEG used to classify depression patients and healthy controls. The EEG recordings were carried out on a group of 18 medication-free depressive patients and 25 gender and age matched controls. In this paper, the selected features include three linear (maximum, mean and center values of the power) and three nonlinear features (correlation dimension, Renyi entropy and C0 complexity). The accuracy and effectiveness of classification model between depressive and control subjects were calculated using leave-one-out cross-validation. The experimental results indicate that selected three channel EEG and features can distinguish the subjects between depression and normal beings, the classification accuracy is 72.25%. It is hoped that the performed results can provide more choices for the early diagnosis of depression in a universal environment.

preprint2020arXiv

GIKT: A Graph-based Interaction Model for Knowledge Tracing

With the rapid development in online education, knowledge tracing (KT) has become a fundamental problem which traces students' knowledge status and predicts their performance on new questions. Questions are often numerous in online education systems, and are always associated with much fewer skills. However, the previous literature fails to involve question information together with high-order question-skill correlations, which is mostly limited by data sparsity and multi-skill problems. From the model perspective, previous models can hardly capture the long-term dependency of student exercise history, and cannot model the interactions between student-questions, and student-skills in a consistent way. In this paper, we propose a Graph-based Interaction model for Knowledge Tracing (GIKT) to tackle the above probems. More specifically, GIKT utilizes graph convolutional network (GCN) to substantially incorporate question-skill correlations via embedding propagation. Besides, considering that relevant questions are usually scattered throughout the exercise history, and that question and skill are just different instantiations of knowledge, GIKT generalizes the degree of students' master of the question to the interactions between the student's current state, the student's history related exercises, the target question, and related skills. Experiments on three datasets demonstrate that GIKT achieves the new state-of-the-art performance, with at least 1% absolute AUC improvement.

preprint2020arXiv

Large-Scale Optimal Transport via Adversarial Training with Cycle-Consistency

Recent advances in large-scale optimal transport have greatly extended its application scenarios in machine learning. However, existing methods either not explicitly learn the transport map or do not support general cost function. In this paper, we propose an end-to-end approach for large-scale optimal transport, which directly solves the transport map and is compatible with general cost function. It models the transport map via stochastic neural networks and enforces the constraint on the marginal distributions via adversarial training. The proposed framework can be further extended towards learning Monge map or optimal bijection via adopting cycle-consistency constraint(s). We verify the effectiveness of the proposed method and demonstrate its superior performance against existing methods with large-scale real-world applications, including domain adaptation, image-to-image translation, and color transfer.

preprint2020arXiv

New upper bounds for the bondage number of a graph in terms of its maximum degree and Euler characteristic

The bondage number $b(G)$ of a graph $G$ is the smallest number of edges whose removal from $G$ results in a graph with larger domination number. Let $G$ be embeddable on a surface whose Euler characteristic $χ$ is as large as possible, and assume $χ\leq0$. Gagarin-Zverovich and Huang have recently found upper bounds of $b(G)$ in terms of the maximum degree $Δ(G)$ and the Euler characteristic $χ(G)=χ$. In this paper we prove a better upper bound $b(G)\leqΔ(G)+\lfloor t\rfloor$ where $t$ is the largest real root of the cubic equation $z^3 + z^2 + (3χ- 8)z + 9χ- 12=0$; this upper bound is asymptotically equivalent to $b(G)\leqΔ(G)+1+\lfloor \sqrt{4-3χ} \rfloor$. We also establish further improved upper bounds for $b(G)$ when the girth, order, or size of the graph $G$ is large compared with its Euler characteristic $χ$.

preprint2020arXiv

Using observed bacteria concentration and modeled transit time under an analytical framework to estimate overall removal rate of fecal coliform in an estuary

Abundance of fecal coliform (FC) is widely used to indicate the potential presence of pathogens, the No.1 cause of water impairments in the U.S. Despite extensive monitoring efforts, assessing and modeling FC pollution still faces challenges, largely owing to the uncertainties in estimation of overall removal rate (K). This study proposes an alternative method to estimate in situ K by combining observational data, hydrodynamic simulation, and analytical solution. The method requires the observed spatial distribution of FC concentration along an estuarine channel and the numerically-simulated transit time, and converts the K estimation from a temporal problem into a spatial problem, potentially reducing survey duration, effort, and cost. Application of the method gave an estimation of K = 0.5 d-1 on average for the Nassawadox Creek in Chesapeake Bay. The numerical and analytical model results with the estimated K agreed well with the observation, demonstrating the credibility of the method.

preprint2014arXiv

Active control of magnetoresistance of organic spin valves using ferroelectricity

Organic spintronic devices have been appealing because of the long spin life time of the charge carriers in the organic materials and their low cost, flexibility and chemical diversity. In previous studies, the control of resistance of organic spin valves is generally achieved by the alignment of the magnetization directions of the two ferromagnetic electrodes, generating magnetoresistance.1 Here we employ a new knob to tune the resistance of organic spin valves by adding a thin ferroelectric interfacial layer between the ferromagnetic electrode and the organic spacer. We show that the resistance can be controlled by not only the spin alignment of the two ferromagnetic electrodes, but also by the electric polarization of the interfacial ferroelectric layer: the MR of the spin valve depends strongly on the history of the bias voltage which is correlated with the polarization of the ferroelectric layer; the MR even changes sign when the electric polarization of the ferroelectric layer is reversed. This new tunability can be understood in terms of the change of relative energy level alignment between ferromagnetic electrode and the organic spacer caused by the electric dipole moment of the ferroelectric layer. These findings enable active control of resistance using both electric and magnetic fields, opening up possibility for multi-state organic spin valves and shed light on the mechanism of the spin transport in organic spin valves.

preprint2014arXiv

Persistent metal-insulator transition at the surface of an oxygen-deficient, epitaxial manganite film

The oxygen stoichiometry has a large influence on the physical and chemical properties of complex oxides. Most of the functionality in e.g. catalysis and electrochemistry depends in particular on control of the oxygen stoichiometry. In order to understand the fundamental properties of intrinsic surfaces of oxygen-deficient complex oxides, we report on in situ temperature dependent scanning tunnelling spectroscopy experiments on pristine oxygen deficient, epitaxial manganite films. Although these films are insulating in subsequent ex situ in-plane electronic transport experiments at all temperatures, in situ scanning tunnelling spectroscopic data reveal that the surface of these films exhibits a metal-insulator transition (MIT) at 120 K, coincident with the onset of ferromagnetic ordering of small clusters in the bulk of the oxygen-deficient film. The surprising proximity of the surface MIT transition temperature of nonstoichiometric films with that of the fully oxygenated bulk suggests that the electronic properties in the surface region are not significantly affected by oxygen deficiency in the bulk. This carries important implications for the understanding and functional design of complex oxides and their interfaces with specific electronic properties for catalysis, oxide electronics and electrochemistry.

preprint2014arXiv

Structural and electronic origin of the magnetic structures in hexagonal LuFeO$_3$

Using combined theoretical and experimental approaches, we studied the structural and electronic origin of the magnetic structure in hexagonal LuFeO$_3$. Besides showing the strong exchange coupling that is consistent with the high magnetic ordering temperature, the previously observed spin reorientation transition is explained by the theoretically calculated magnetic phase diagram. The structural origin of this spin reorientation that is responsible for the appearance of spontaneous magnetization, is identified by theory and verified by x-ray diffraction and absorption experiments.

preprint2013arXiv

Bounds on the Number of Huffman and Binary-Ternary Trees

Huffman coding is a widely used method for lossless data compression because it optimally stores data based on how often the characters occur in Huffman trees. An $n$-ary Huffman tree is a connected, cycle-lacking graph where each vertex can have either $n$ "children" vertices connecting to it, or 0 children. Vertices with 0 children are called \textit{leaves}. We let $h_n(q)$ represent the total number of $n$-ary Huffman trees with $q$ leaves. In this paper, we use a recursive method to generate upper and lower bounds on $h_n(q)$ and get $h_2(q) \approx (0.1418532)(1.7941471)^q+(0.0612410)(1.2795491)^q$ for $n=2$. This matches the best results achieved by Elsholtz, Heuberger, and Prodinger in August 2011. Our approach reveals patterns in Huffman trees that we used in our analysis of the Binary-Ternary (BT) trees we created. Our research opens a completely new door in data compression by extending the study of Huffman trees to BT trees. Our study of BT trees paves the way for designing data-specific trees, minimizing possible wasted storage space from Huffman coding. We prove a recursive formula for the number of BT trees with $q$ leaves. Furthermore, we provide analysis and further proofs to reach numeric bounds. Our discoveries have broad applications in computer data compression. These results also improve graphical representations of protein sequences that facilitate in-depth genome analysis used in researching evolutionary patterns.

preprint2013arXiv

Electrophoretic-like gating used to control metal-insulator transitions in electronically phase separated manganite wires

Electronically phase separated manganite wires are found to exhibit controllable metal-insulator transitions under local electric fields. The switching characteristics are shown to be fully reversible, polarity independent, and highly resistant to thermal breakdown caused by repeated cycling. It is further demonstrated that multiple discrete resistive states can be accessed in a single wire. The results conform to a phenomenological model in which the inherent nanoscale insulating and metallic domains are rearranged through electrophoretic-like processes to open and close percolation channels.

preprint2013arXiv

Growth diagram of La0.7Sr0.3MnO3 thin films using pulsed laser deposition

An experimental study was conducted on controlling the growth mode of La0.7Sr0.3MnO3 thin films on SrTiO3 substrates using pulsed laser deposition (PLD) by tuning growth temperature, pressure and laser fluence. Different thin film morphology, crystallinity and stoichiometry have been observed depending on growth parameters. To understand the microscopic origin, the adatom nucleation, step advance processes and their relationship to film growth were theoretically analyzed and a growth diagram was constructed. Three boundaries between highly and poorly crystallized growth, 2D and 3D growth, stoichiometric and non-stoichiometric growth were identified in the growth diagram. A good fit of our experimental observation with the growth diagram was found. This case study demonstrates that a more comprehensive understanding of the growth mode in PLD is possible.

preprint2013arXiv

Room-temperature multiferroic hexagonal LuFeO$_3$ films

The crystal and magnetic structures of single-crystalline hexagonal LuFeO$_3$ films have been studied using x-ray, electron and neutron diffraction methods. The polar structure of these films are found to persist up to 1050 K; and the switchability of the polar behavior is observed at room temperature, indicating ferroelectricity. An antiferromagnetic order was shown to occur below 440 K, followed by a spin reorientation resulting in a weak ferromagnetic order below 130 K. This observation of coexisting multiple ferroic orders demonstrates that hexagonal LuFeO$_3$ films are room-temperature multiferroics.

preprint2012arXiv

Cyrstal field splitting and optical band gap of hexagonal LuFeO$_3$ films

Hexagonal LuFeO$_3$ films have been studied using x-ray absorption and optical spectroscopy. The crystal splittings of Fe$^{3+}$ are extracted as $E_{e'}-E_{e"}$=0.7 eV and $E_{a_1'}-E_{e'}$=0.9 eV and a 2.0 eV optical band gap is determined assuming a direct gap. First-principles calculations confirm the experiments that the relative energies of crystal field splitting states do follow $E_{a_1'}>E_{e'}>E_{e"}$ with slightly underestimated values and a band gap of 1.35 eV.

preprint2012arXiv

Growth diagram and magnetic properties of hexagonal LuFe$_2$O$_4$ thin films

A growth diagram of Lu-Fe-O compounds on MgO (111) substrates using pulsed laser deposition is constructed based on extensive growth experiments. The LuFe$_2$O$_4$ phase can only be grown in a small range of temperature and O$_2$ pressure conditions. An understanding of the growth mechanism of Lu-Fe-O compound films is offered in terms of the thermochemistry at the surface. Superparamagnetism is observed in LuFe$_2$O$_4$ film and is explained in terms of the effect of the impurity h-LuFeO$_3$ phase and structural defects .

preprint2011arXiv

Acyclic Subgraphs in $k$-Majority Tournaments

A $k$-majority digraph is a directed graph created by combining $k$ individual rankings on the same ground set to form a consensus where edges point in the direction indicated by a strict majority of the rankings. The $k$-majority digraph is used to model voting scenarios, where the vertices correspond to options ranked by $k$ voters. When $k$ is odd, the resulting digraph is always a tournament, called $k$-majority tournament. Let $f_k(n)$ be the minimum, over all $k$-majority tournaments with $n$ vertices, of the maximum order of an induced transitive sub-tournament. Recently, Milans, Schreiber, and West proved that $\sqrt n \le f_3(n) \le 2 \sqrt n +1 $. In this paper, we improve the upper bound of $f_3(n)$ by showing that $f_3(n) < \sqrt {2n} +\frac 12 $.

preprint2011arXiv

Thermodynamics of Information Retrieval

In this work, we suggest a parameterized statistical model (the gamma distribution) for the frequency of word occurrences in long strings of English text and use this model to build a corresponding thermodynamic picture by constructing the partition function. We then use our partition function to compute thermodynamic quantities such as the free energy and the specific heat. In this approach, the parameters of the word frequency model vary from word to word so that each word has a different corresponding thermodynamics and we suggest that differences in the specific heat reflect differences in how the words are used in language, differentiating keywords from common and function words. Finally, we apply our thermodynamic picture to the problem of retrieval of texts based on keywords and suggest some advantages over traditional information retrieval methods.

preprint2011arXiv

Three layer $Q_2$-free families in the Boolean lattice

We prove that the largest $Q_2$-free family of subsets of $[n]$ which contains sets of at most three different sizes has at most $(3 + 2\sqrt {3})N/3 + o(N) \approx 2.1547N + o(N)$ members, where $N = {n \choose {\lfloor n/2 \rfloor}}$. This improves an earlier bound of $2.207N + o(N)$ by Axenovich, Manske, and Martin.

Jian Shen

What is connected

Connect this record

See the researcher in context

Building this map preview

23 published item(s)

Curriculum Offline Imitation Learning

Direct visualization of percolating metal-insulator transition in V2O3 using scanning microwave impedance microscopy

Influences of the dissipative topological edge state on quantized transport in MnBi2Te4

Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts

On Effective Scheduling of Model-based Reinforcement Learning

Towards Return Parity in Markov Decision Processes

Depression Detection using Resting State Three-channel EEG Signal

GIKT: A Graph-based Interaction Model for Knowledge Tracing

Large-Scale Optimal Transport via Adversarial Training with Cycle-Consistency

New upper bounds for the bondage number of a graph in terms of its maximum degree and Euler characteristic

Using observed bacteria concentration and modeled transit time under an analytical framework to estimate overall removal rate of fecal coliform in an estuary

Active control of magnetoresistance of organic spin valves using ferroelectricity

Persistent metal-insulator transition at the surface of an oxygen-deficient, epitaxial manganite film

Structural and electronic origin of the magnetic structures in hexagonal LuFeO$_3$

Bounds on the Number of Huffman and Binary-Ternary Trees

Electrophoretic-like gating used to control metal-insulator transitions in electronically phase separated manganite wires

Growth diagram of La0.7Sr0.3MnO3 thin films using pulsed laser deposition

Room-temperature multiferroic hexagonal LuFeO$_3$ films

Cyrstal field splitting and optical band gap of hexagonal LuFeO$_3$ films

Growth diagram and magnetic properties of hexagonal LuFe$_2$O$_4$ thin films

Acyclic Subgraphs in $k$-Majority Tournaments

Thermodynamics of Information Retrieval

Three layer $Q_2$-free families in the Boolean lattice