Source author record

Tian Xie

Tian Xie appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

cond-mat.mtrl-sci, Artificial Intelligence, Machine Learning, physics.comp-ph, Information Theory, math.IT, Computation and Language, econ.EM, Information Retrieval, math.OC, physics.atom-ph, physics.optics, quant-ph

Catalog footprint

What is connected

15 works

13 topics

4 close collaborators

preprint · 2022 · arXiv

A cloud platform for automating and sharing analysis of raw simulation data from high throughput polymer molecular dynamics simulations

Open material databases storing hundreds of thousands of material structures and their corresponding properties have become the cornerstone of modern computational materials science. Yet, the raw outputs of the simulations, such as the trajectories from molecular dynamics simulations and charge densities from density functional theory calculations, are generally not shared due to their huge size. In this work, we describe a cloud-based platform to facilitate the sharing of raw data and enable the fast post-processing in the cloud to extract new properties defined by the user. As an initial demonstration, our database currently includes 6286 molecular dynamics trajectories for amorphous polymer electrolytes and 5.7 terabytes of data. We create a public analysis library at https://github.com/TRI-AMDD/htp_md to extract multiple properties from the raw data, using both expert designed functions and machine learning models. The analysis is run automatically with computation in the cloud, and results then populate a database that can be accessed publicly. Our platform encourages users to contribute both new trajectory data and analysis functions via public interfaces. Newly analyzed properties will be incorporated into the database. Finally, we create a front-end user interface at https://www.htpmd.matr.io for browsing and visualization of our data. We envision the platform to be a new way of sharing raw data and new insights for the computational materials science community.

preprint · 2022 · arXiv

Accelerating amorphous polymer electrolyte screening by learning to reduce errors in molecular dynamics simulated properties

Polymer electrolytes are promising candidates for the next generation lithium-ion battery technology. Large scale screening of polymer electrolytes is hindered by the significant cost of molecular dynamics (MD) simulation in amorphous systems: the amorphous structure of polymers requires multiple, repeated sampling to reduce noise and the slow relaxation requires long simulation time for convergence. Here, we accelerate the screening with a multi-task graph neural network that learns from a large amount of noisy, unconverged, short MD data and a small number of converged, long MD data. We achieve accurate predictions of 4 different converged properties and screen a space of 6247 polymers that is orders of magnitude larger than previous computational studies. Further, we extract several design principles for polymer electrolytes and provide an open dataset for the community. Our approach could be applicable to a broad class of material discovery problems that involve the simulation of complex, amorphous materials.

preprint · 2022 · arXiv

Calibrating DFT formation enthalpy calculations by multi-fidelity machine learning

Machine learning materials properties measured by experiments is valuable yet difficult due to the limited amount of experimental data. In this work, we use a multi-fidelity random forest model to learn the experimental formation enthalpy of materials with prediction accuracy higher than the empirically corrected PBE functional (PBEfe) and meta-GGA functional (SCAN), and it outperforms the hotly studied deep neural-network based representation learning and transfer learning. We then use the model to calibrate the DFT formation enthalpy in the Materials Project database, and discover materials with underestimated stability. The multi-fidelity model is also used as a data-mining approach to find how DFT deviates from experiments by explaining the model output.

preprint · 2022 · arXiv

Converse: A Tree-Based Modular Task-Oriented Dialogue System

Creating a system that can have meaningful conversations with humans to help accomplish tasks is one of the ultimate goals of Artificial Intelligence (AI). It has defined the meaning of AI since the beginning. A lot has been accomplished in this area recently, with voice assistant products entering our daily lives and chat bot systems becoming commonplace in customer service. At first glance there seems to be no shortage of options for dialogue systems. However, the frequently deployed dialogue systems today seem to all struggle with a critical weakness - they are hard to build and harder to maintain. At the core of the struggle is the need to script every single turn of interactions between the bot and the human user. This makes the dialogue systems more difficult to maintain as the tasks become more complex and more tasks are added to the system. In this paper, we propose Converse, a flexible tree-based modular task-oriented dialogue system. Converse uses an and-or tree structure to represent tasks and offers powerful multi-task dialogue management. Converse supports task dependency and task switching, which are unique features compared to other open-source dialogue frameworks. At the same time, Converse aims to make the bot building process easy and simple, for both professional and non-professional software developers. The code is available at https://github.com/salesforce/Converse.
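The and-or tree idea in this abstract can be illustrated with a small sketch. This is not Converse's actual API: the `TaskNode` class, its fields, and the example task names are hypothetical, chosen only to show how an "and" node needs all of its children completed while an "or" node needs just one.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class TaskNode:
    """Hypothetical node of an and-or task tree (illustrative only)."""
    name: str
    kind: str = "leaf"   # one of "and", "or", "leaf"
    done: bool = False   # only meaningful for leaves
    children: List["TaskNode"] = field(default_factory=list)

    def is_complete(self) -> bool:
        # A leaf is complete once its piece of information has been collected.
        if self.kind == "leaf":
            return self.done
        results = [child.is_complete() for child in self.children]
        # "and" needs every child; "or" needs at least one.
        return all(results) if self.kind == "and" else any(results)

    def next_leaf(self) -> Optional["TaskNode"]:
        """Return the first unfinished leaf the bot could ask about next."""
        if self.is_complete():
            return None
        if self.kind == "leaf":
            return self
        for child in self.children:
            leaf = child.next_leaf()
            if leaf is not None:
                return leaf
        return None


# Example task: check an order = verify the user (email OR phone) AND get the order id.
email = TaskNode("email")
phone = TaskNode("phone")
order_id = TaskNode("order_id")
verify = TaskNode("verify_user", "or", children=[email, phone])
root = TaskNode("check_order", "and", children=[verify, order_id])
```

In this sketch the dialogue manager would repeatedly call `root.next_leaf()` to decide what to ask; once the user supplies a phone number, the "or" branch is satisfied and the next question moves on to the order id.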

preprint · 2022 · arXiv

Crystal Diffusion Variational Autoencoder for Periodic Material Generation

Generating the periodic structure of stable materials is a long-standing challenge for the material design community. This task is difficult because stable materials only exist in a low-dimensional subspace of all possible periodic arrangements of atoms: 1) the coordinates must lie in the local energy minimum defined by quantum mechanics, and 2) global stability also requires the structure to follow the complex, yet specific bonding preferences between different atom types. Existing methods fail to incorporate these factors and often lack proper invariances. We propose a Crystal Diffusion Variational Autoencoder (CDVAE) that captures the physical inductive bias of material stability. By learning from the data distribution of stable materials, the decoder generates materials in a diffusion process that moves atomic coordinates towards a lower energy state and updates atom types to satisfy bonding preferences between neighbors. Our model also explicitly encodes interactions across periodic boundaries and respects permutation, translation, rotation, and periodic invariances. We significantly outperform past methods in three tasks: 1) reconstructing the input structure, 2) generating valid, diverse, and realistic materials, and 3) generating materials that optimize a specific property. We also provide several standard datasets and evaluation metrics for the broader machine learning community.

preprint · 2022 · arXiv

High-throughput calculations combining machine learning to investigate the corrosion properties of binary Mg alloys

Magnesium (Mg) alloys have shown great prospects as both structural and biomedical materials, while poor corrosion resistance limits their further application. In this work, to avoid time-consuming and laborious experimental trials, a high-throughput computational strategy based on first-principles calculations is designed for screening corrosion-resistant binary Mg alloys with intermetallics, from both the thermodynamic and kinetic perspectives. The stable binary Mg intermetallics with low equilibrium potential difference with respect to the Mg matrix are first identified. Then, the hydrogen adsorption energies on the surfaces of these Mg intermetallics are calculated, and the corrosion exchange current density is further calculated by a hydrogen evolution reaction (HER) kinetic model. Several intermetallics, e.g. Y3Mg, Y2Mg and La5Mg, are identified as promising intermetallics which might effectively hinder the cathodic HER. Furthermore, machine learning (ML) models are developed to predict Mg intermetallics with proper hydrogen adsorption energy employing work function (W_f) and weighted first ionization energy (WFIE). The generalization of the ML models is tested on five new binary Mg intermetallics with an average root mean square error (RMSE) of 0.11 eV. This study not only predicts some promising binary Mg intermetallics which may suppress galvanic corrosion, but also provides a high-throughput screening strategy and ML models for the design of corrosion-resistant alloys, which can be extended to ternary Mg alloys or other alloy systems.

preprint · 2022 · arXiv

L2-Relaxation: With Applications to Forecast Combination and Portfolio Analysis

This paper tackles forecast combination with many forecasts or minimum variance portfolio selection with many assets. A novel convex problem called L2-relaxation is proposed. In contrast to standard formulations, L2-relaxation minimizes the squared Euclidean norm of the weight vector subject to a set of relaxed linear inequality constraints. The magnitude of relaxation, controlled by a tuning parameter, balances the bias and variance. When the variance-covariance (VC) matrix of the individual forecast errors or financial assets exhibits latent group structures -- a block equicorrelation matrix plus a VC for idiosyncratic noises, the solution to L2-relaxation delivers roughly equal within-group weights. Optimality of the new method is established under the asymptotic framework when the number of the cross-sectional units $N$ potentially grows much faster than the time dimension $T$. Excellent finite sample performance of our method is demonstrated in Monte Carlo simulations. Its wide applicability is highlighted in three real data examples concerning empirical applications of microeconomics, macroeconomics, and finance.
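The verbal description above can be written out as an optimization problem. The following is a sketch under assumptions: the symbols $w$ (weight vector), $\widehat{\Sigma}$ (estimated VC matrix), $\gamma$ (a scalar multiplier), $\tau$ (the tuning parameter) and $\mathbf{1}_N$ (a vector of ones) are notation chosen here, and the exact form of the constraints should be checked against the paper itself. The starting point is the standard minimum-variance problem and its first-order conditions; relaxing those conditions in the sup-norm gives a problem of the kind the abstract describes.

```latex
% Standard formulation (minimum-variance weights) and its first-order conditions:
\min_{w}\ \tfrac{1}{2}\, w^{\top}\widehat{\Sigma}\, w
\quad \text{s.t.}\quad \mathbf{1}_N^{\top} w = 1
\qquad\Longrightarrow\qquad
\widehat{\Sigma}\, w + \gamma\, \mathbf{1}_N = 0,\quad \mathbf{1}_N^{\top} w = 1 .

% Relaxed version: minimize the squared Euclidean norm of w,
% allowing the first-order conditions to hold only up to a tolerance tau:
\min_{w,\ \gamma}\ \tfrac{1}{2}\,\lVert w \rVert_2^{2}
\quad \text{s.t.}\quad
\mathbf{1}_N^{\top} w = 1,\qquad
\bigl\lVert \widehat{\Sigma}\, w + \gamma\, \mathbf{1}_N \bigr\rVert_{\infty} \le \tau .
```

Under this reading, $\tau = 0$ with an invertible $\widehat{\Sigma}$ recovers the standard weights, while a sufficiently large $\tau$ makes the equal weights $w = \mathbf{1}_N / N$ feasible, which is the minimum-norm choice under the sum-to-one constraint. Intermediate values of $\tau$ trade off between the two, which matches the bias-variance role the abstract assigns to the tuning parameter.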

preprint · 2021 · arXiv

GraphHop: An Enhanced Label Propagation Method for Node Classification

A scalable semi-supervised node classification method on graph-structured data, called GraphHop, is proposed in this work. The graph contains attributes of all nodes but labels of only a few nodes. The classical label propagation (LP) method and the emerging graph convolutional network (GCN) are two popular semi-supervised solutions to this problem. The LP method is not effective in modeling node attributes and labels jointly, and it suffers from a slow convergence rate on large-scale graphs. GraphHop is proposed to address these shortcomings. With proper initial label vector embeddings, each iteration of GraphHop contains two steps: 1) label aggregation and 2) label update. In Step 1, each node aggregates its neighbors' label vectors obtained in the previous iteration. In Step 2, a new label vector is predicted for each node based on the label of the node itself and the aggregated label information obtained in Step 1. This iterative procedure exploits the neighborhood information and enables GraphHop to perform well in an extremely small label rate setting and scale well for very large graphs. Experimental results show that GraphHop outperforms state-of-the-art graph learning methods on a wide range of tasks (e.g., multi-label and multi-class classification on citation networks, social graphs, and commodity consumption graphs) in graphs of various sizes. Our code is publicly available on GitHub (https://github.com/TianXieUSC/GraphHop).
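The two-step iteration described in this abstract (aggregate neighbors' label vectors, then update each node's label vector) can be sketched in a few lines. This is a deliberately simplified stand-in, not the GraphHop implementation: the update step here is a fixed convex mix of a node's own vector and the aggregated one, whereas the abstract says a new label vector is predicted, and the function name, the `alpha` parameter and the toy graph are all assumptions made for illustration.

```python
def propagate_labels(adj, init, labeled, alpha=0.5, iters=20):
    """Simplified aggregate-then-update label iteration (illustrative only).

    adj:     dict mapping node -> list of neighbour nodes
    init:    dict mapping node -> initial label vector (list of class scores)
    labeled: set of nodes whose known labels are held fixed
    alpha:   weight on a node's own vector in the update step (assumed value)
    """
    labels = {node: list(vec) for node, vec in init.items()}
    for _ in range(iters):
        updated = {}
        for node, neighbours in adj.items():
            n_classes = len(labels[node])
            # Step 1: aggregate neighbours' label vectors from the previous iteration.
            if neighbours:
                aggregated = [
                    sum(labels[m][c] for m in neighbours) / len(neighbours)
                    for c in range(n_classes)
                ]
            else:
                aggregated = labels[node]
            # Step 2: update from the node's own vector and the aggregated one.
            mixed = [
                alpha * labels[node][c] + (1 - alpha) * aggregated[c]
                for c in range(n_classes)
            ]
            # Nodes with known labels keep them.
            updated[node] = list(init[node]) if node in labeled else mixed
        labels = updated
    return labels


# Toy path graph 0 - 1 - 2 - 3 with only the two end nodes labelled.
adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
init = {0: [1.0, 0.0], 1: [0.5, 0.5], 2: [0.5, 0.5], 3: [0.0, 1.0]}
result = propagate_labels(adj, init, labeled={0, 3})
```

On the toy graph, the unlabelled node next to the class-0 end leans toward class 0 and the one next to the class-1 end leans toward class 1, which is the basic behaviour the aggregation step is meant to produce.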

preprint · 2020 · arXiv

Boosting Retailer Revenue by Generated Optimized Combined Multiple Digital Marketing Campaigns

Campaigns are a frequently employed instrument for lifting the GMV (Gross Merchandise Volume) of retailers in traditional marketing. As their counterpart in the online context, digital marketing campaigns (DMCs) have been trending in recent years with the rapid development of e-commerce. However, how to give the many sellers on an online retailing platform the capacity to apply combined multiple digital marketing campaigns to boost their shops' revenue is still a novel topic. In this work, a comprehensive solution for generating optimized combined multiple DMCs is presented. First, a pool of potential personalized DMCs is generated for every retailer by a newly proposed neural network model, DMCNet (Digital-Marketing-Campaign Net). Second, based on sub-modular optimization theory and the DMC pool from DMCNet, the generated combined multiple DMCs are ranked by their revenue generation strength, and the top three ranked campaigns are returned to the sellers' back-end management system, so that retailers can set combined multiple DMCs for their online shops in one shot. A real online A/B test shows that with the integrated solution, sellers on the online retailing platform increase their shops' GMV by approximately 6%.

preprint · 2020 · arXiv

Charting Lattice Thermal Conductivity of Inorganic Crystals

Thermal conductivity is a fundamental material property but challenging to predict, with fewer than 5% of about $10^5$ synthesized inorganic materials being documented. In this work, we extract the structural chemistry that governs lattice thermal conductivity, by combining graph neural networks and random forest approaches. We show that both mean and variation of unit-cell configurational properties, such as atomic volume and bond length, are the most important features, followed by mass and elemental electronegativity. We chart the structural chemistry of lattice thermal conductivity into extended van-Arkel triangles, and predict the thermal conductivity of all known inorganic materials in the Inorganic Crystal Structure Database. For the latter, we develop a transfer learning framework extendable to other applications.

preprint · 2020 · arXiv

On the Power Leakage Problem in Millimeter-Wave Massive MIMO with Lens Antenna Arrays

The emerging millimeter-wave (mmWave) massive multiple-input multiple-output (MIMO) with lens antenna arrays, which is also known as "beamspace MIMO", can effectively reduce the required number of power-hungry radio frequency (RF) chains. Therefore, it has been considered as a promising technique for the upcoming 5G communications and beyond. However, most current studies on beamspace MIMO have not taken into account the important power leakage problem in beamspace channels, which possibly leads to a significant degradation in the signal-to-noise ratio (SNR) and the system sum-rate. To this end, we propose a beam aligning precoding method to handle the power leakage problem in this paper. Firstly, a phase shifter network (PSN) structure is proposed, which enables each RF chain in beamspace MIMO to select multiple beams to collect the leakage power. Then, a rotation-based precoding algorithm is designed based on the proposed PSN structure, which aligns the channel gains of the selected beams towards the same direction for maximizing the received SNR at each user. Furthermore, we reveal some system design insights by analyzing the sum-rate and energy efficiency (EE) of the proposed beam aligning precoding method. In simulations, the proposed approach is found to achieve the near-optimal sum-rate performance compared with the ideal case of no power leakage, and obtains a higher EE than the existing schemes with either a linear or planar array.
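The core idea of aligning the channel gains of several selected beams "towards the same direction" can be shown with a toy calculation. This is only an idealized sketch, assuming continuous-valued phase shifters and made-up gain values; the function names are hypothetical and the paper's PSN structure and rotation-based precoding algorithm are not reproduced here.

```python
import cmath


def align_phases(gains):
    """Phase shifts (radians) that rotate each complex gain onto the positive real axis."""
    return [-cmath.phase(g) for g in gains]


def combined_gain(gains, shifts):
    """Magnitude of the sum of the gains after applying one phase shift per beam."""
    return abs(sum(g * cmath.exp(1j * s) for g, s in zip(gains, shifts)))


# Made-up complex gains of three beams that carry leaked power for one user.
gains = [1 + 1j, -2j, -0.5 + 0j]

# Summing the beams without alignment lets them partly cancel.
unaligned = combined_gain(gains, [0.0, 0.0, 0.0])

# With aligned phases, the magnitudes add up coherently.
aligned = combined_gain(gains, align_phases(gains))
```

After alignment the combined gain equals the sum of the individual magnitudes, which is the largest value any choice of phase shifts can reach; without alignment, beams with opposing phases reduce the collected power.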

preprint · 2019 · arXiv

On-chip coherent microwave-to-optical transduction mediated by ytterbium in YVO$_4$

Optical networks that distribute entanglement among quantum technologies will form a powerful backbone for quantum science but are yet to interface with leading quantum hardware such as superconducting qubits. Consequently, these systems remain isolated because microwave links at room temperature are noisy and lossy. Building connectivity requires interfaces that map quantum information between microwave and optical fields. While preliminary microwave-to-optical (M2O) transducers have been realized, developing efficient, low-noise devices that match superconducting qubit frequencies (gigahertz) and bandwidths (10 kHz - 1 MHz) remains a challenge. Here we demonstrate a proof-of-concept on-chip M2O transducer using $^{171}\mathrm{Yb}^{3+}$-ions in yttrium orthovanadate (YVO) coupled to a nanophotonic waveguide and a microwave transmission line. The device's miniaturization, material, and zero-magnetic-field operation are important advances for rare-earth ion magneto-optical devices. Further integration with high quality factor microwave and optical resonators will enable efficient transduction and create opportunities toward multi-platform quantum networks.

preprint · 2018 · arXiv

Frequency stabilization of a 650 nm laser to I$_{2}$ spectrum for trapped $^{138}$Ba$^{+}$ ions

The optical manipulation of Ba$^{+}$ ions is mainly performed by a 493 nm laser for the S$_{1/2}$-P$_{1/2}$ transition and a 650 nm laser for the P$_{1/2}$-D$_{3/2}$ transition. Since the branching ratio between the 493 nm and 650 nm transitions of a single Ba$^{+}$ ion is comparable, stabilization systems of both lasers are equally important for Doppler cooling, sub-Doppler cooling, optical pumping and state detection. The stabilization system of a 493 nm laser to an absolute Te$_2$ reference has been well established. However, the stabilization of a 650 nm laser has not been presented before. Here we report twenty spectral lines of I$_{2}$ in the range of 0.9 GHz above the resonance of the P$_{1/2}$-D$_{3/2}$ transition. We stabilize the 650 nm laser through the optical cavity to the lowest one among these lines, which is about 350 MHz apart, as the absolute frequency reference. Furthermore, we measure the frequency differences between these iodine lines and the Ba$^+$ resonance through fluorescence excitation spectrum with well-resolved dark states, which is in agreement with the theoretical expectation. The presented stabilization scheme enables us to perform precise experiments with Ba$^{+}$ ions.

preprint · 2016 · arXiv

A combinatorial algorithm for constrained assortment optimization under nested logit model

We consider the assortment optimization problem with disjoint-cardinality constraints under the two-level nested logit model. To solve this problem, we first identify a candidate set with $O(mn^2)$ assortments and show that at least one optimal assortment is included in this set. Based on this observation, a fast algorithm, which runs in $O(m n^2 \log mn)$ time, is proposed to find an optimal assortment.
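To make the problem statement concrete, here is a brute-force baseline, not the paper's combinatorial algorithm. It assumes the commonly used two-level nested logit revenue formula with a no-purchase weight `v0`, and it reads "disjoint-cardinality constraints" as a separate cap on the number of products offered in each nest; the function names, parameters and test values are hypothetical. Enumeration is exponential in the number of products and is useful only for checking a faster method on tiny instances.

```python
from itertools import combinations, product


def expected_revenue(assortment, prefs, revenues, gammas, v0=1.0):
    """Expected revenue of an assortment under a two-level nested logit model.

    assortment: one tuple of offered product indices per nest
    prefs:      prefs[i][j] is the preference weight of product j in nest i
    revenues:   revenues[i][j] is the revenue of product j in nest i
    gammas:     dissimilarity parameter of each nest
    v0:         weight of the no-purchase option (assumed)
    """
    numerator, denominator = 0.0, v0
    for i, items in enumerate(assortment):
        if not items:
            continue  # an empty nest attracts no customers
        total_pref = sum(prefs[i][j] for j in items)
        nest_revenue = sum(prefs[i][j] * revenues[i][j] for j in items) / total_pref
        nest_weight = total_pref ** gammas[i]
        numerator += nest_weight * nest_revenue
        denominator += nest_weight
    return numerator / denominator


def brute_force_assortment(prefs, revenues, gammas, caps, v0=1.0):
    """Try every assortment with at most caps[i] products in nest i."""
    per_nest = [
        [c for k in range(cap + 1) for c in combinations(range(len(prefs[i])), k)]
        for i, cap in enumerate(caps)
    ]
    best = max(
        product(*per_nest),
        key=lambda a: expected_revenue(a, prefs, revenues, gammas, v0),
    )
    return best, expected_revenue(best, prefs, revenues, gammas, v0)
```

The candidate-set result in the abstract is what removes the need for this enumeration: instead of all subsets, only $O(mn^2)$ assortments have to be examined.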

preprint · 2016 · arXiv

GMD-Based Hybrid Precoding For Millimeter-Wave Massive MIMO Systems

Hybrid precoding can significantly reduce the number of required radio frequency (RF) chains and relieve the huge energy consumption in mmWave massive MIMO systems, thus attracting much interest from academia and industry. However, most existing hybrid precoding schemes are based on singular value decomposition (SVD). Due to the very different sub-channel signal-to-noise ratios (SNRs) after SVD, complicated bit allocation is usually required to match the sub-channel SNRs. To solve this problem, we propose a geometric mean decomposition (GMD)-based hybrid precoding scheme to avoid the complicated bit allocation. Its basic idea is to seek a pair of analog and digital precoding matrices that are sufficiently close to the optimal unconstrained GMD precoding matrix. Specifically, we design the analog (digital) precoding matrix while keeping the digital (analog) precoding matrix fixed. Further, the principle of basis pursuit is utilized in the design of the analog precoding matrix, while we obtain the digital precoding matrix by projecting the GMD operation on the digital precoding matrix. Simulation results verify that the proposed GMD-based hybrid precoding scheme outperforms conventional SVD-based hybrid precoding schemes and achieves much better bit error rate (BER) performance with low complexity.
