Researcher profile

Daoyi Dong

Daoyi Dong contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
27works
0followers
12topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

27 published item(s)

preprint2026arXiv

Beyond the Dirac Delta: Mitigating Diversity Collapse in Reinforcement Fine-Tuning for Versatile Image Generation

Reinforcement learning (RL) has emerged as a powerful paradigm for fine-tuning large-scale generative models, such as diffusion and flow models, to align with complex human preferences and user-specified tasks. A fundamental limitation remains \textit{the curse of diversity collapse}, where the objective formulation and optimization landscape inherently collapse the policy to a Dirac delta distribution. To address this challenge, we propose \textbf{DRIFT} (\textbf{D}ive\textbf{R}sity-\textbf{I}ncentivized Reinforcement \textbf{F}ine-\textbf{T}uning for Versatile Image Generation), an innovative framework that systematically incentivizes output diversity throughout the on-policy fine-tuning process, reconciling strong task alignment with high generation diversity to enhance versatility essential for applications that demand diverse candidate generations. We approach the problem across three representative perspectives: i) \textbf{sampling} a reward-concentrated subset that filters out reward outliers to prevent premature collapse; ii) \textbf{prompting} with stochastic variations to expand the conditioning space, and iii) \textbf{optimization} of the intra-group diversity with a potential-based reward shaping mechanism. Experimental results show that DRIFT achieves superior Pareto dominance regarding task alignment and generation diversity, yielding a $ 9.08\%\!\sim\! 43.46\%$ increase in diversity at equivalent alignment levels and a $ 59.65\% \!\sim\! 65.86\%$ increase in alignment at equivalent levels of diversity.

preprint2026arXiv

Noise-Resistant Feature-Aware Attack Detection Using Quantum Machine Learning

Continuous-variable quantum key distribution (CV-QKD) is a quantum communication technology that offers an unconditional security guarantee. However, the practical deployment of CV-QKD systems remains vulnerable to various quantum attacks. In this paper, we propose a quantum machine learning (QML)-based attack detection framework (QML-ADF) that safeguards the security of high-rate CV-QKD systems. In particular, two alternative QML models -- quantum support vector machines (QSVM) and quantum neural networks (QNN) -- are developed to perform noise-resistant and feature-aware attack detection before conventional data postprocessing. Leveraging feature-rich quantum data from Gaussian modulation and homodyne detection, the QML-ADF effectively detects quantum attacks, including both known and unknown types defined by these distinctive features. The results indicate that all twelve distinct QML variants for both QSVM and QNN exhibit remarkable performance in detecting both known and previously undiscovered quantum attacks, with the best-performing QSVM variant outperforming the top QNN counterpart. Furthermore, we systematically evaluate the performance of the QML-ADF under various physically interpretable noise backends, demonstrating its strong robustness and superior detection performance. We anticipate that the QML-ADF will not only enable robust detection of quantum attacks under realistic deployment conditions but also strengthen the practical security of quantum communication systems.

preprint2024arXiv

Real-time parameter estimation for two-qubit systems based on hybrid control

In this paper, we consider the real-time parameter estimation problem for a ZZ-coupled system composed of two qubits in the presence of spontaneous emission. To enhance the estimation precision of the coupling coefficient, we first propose two different control schemes, where the first one is feedback control based on quantum-jump detection, and the second one is hybrid control combining Markovian feedback and Hamiltonian control. The simulation results show that compared with free evolution, both control schemes can improve parameter precision and extend system coherence time. Next, on the basis of the two control schemes, we propose a practical single-parameter quantum recovery protocol based on Bayesian estimation theory. In this protocol, by employing batch-style adaptive measurement rules, parameter recovery is conducted to verify the effectiveness of both control schemes.

preprint2023arXiv

Depthwise Convolution for Multi-Agent Communication with Enhanced Mean-Field Approximation

Multi-agent settings remain a fundamental challenge in the reinforcement learning (RL) domain due to the partial observability and the lack of accurate real-time interactions across agents. In this paper, we propose a new method based on local communication learning to tackle the multi-agent RL (MARL) challenge within a large number of agents coexisting. First, we design a new communication protocol that exploits the ability of depthwise convolution to efficiently extract local relations and learn local communication between neighboring agents. To facilitate multi-agent coordination, we explicitly learn the effect of joint actions by taking the policies of neighboring agents as inputs. Second, we introduce the mean-field approximation into our method to reduce the scale of agent interactions. To more effectively coordinate behaviors of neighboring agents, we enhance the mean-field approximation by a supervised policy rectification network (PRN) for rectifying real-time agent interactions and by a learnable compensation term for correcting the approximation bias. The proposed method enables efficient coordination as well as outperforms several baseline approaches on the adaptive traffic signal control (ATSC) task and the StarCraft II multi-agent challenge (SMAC).

preprint2022arXiv

A Dirichlet Process Mixture of Robust Task Models for Scalable Lifelong Reinforcement Learning

While reinforcement learning (RL) algorithms are achieving state-of-the-art performance in various challenging tasks, they can easily encounter catastrophic forgetting or interference when faced with lifelong streaming information. In the paper, we propose a scalable lifelong RL method that dynamically expands the network capacity to accommodate new knowledge while preventing past memories from being perturbed. We use a Dirichlet process mixture to model the non-stationary task distribution, which captures task relatedness by estimating the likelihood of task-to-cluster assignments and clusters the task models in a latent space. We formulate the prior distribution of the mixture as a Chinese restaurant process (CRP) that instantiates new mixture components as needed. The update and expansion of the mixture are governed by the Bayesian non-parametric framework with an expectation maximization (EM) procedure, which dynamically adapts the model complexity without explicit task boundaries or heuristics. Moreover, we use the domain randomization technique to train robust prior parameters for the initialization of each task model in the mixture, thus the resulting model can better generalize and adapt to unseen tasks. With extensive experiments conducted on robot navigation and locomotion domains, we show that our method successfully facilitates scalable lifelong RL and outperforms relevant existing methods.

preprint2022arXiv

Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments

Evolution strategies (ES), as a family of black-box optimization algorithms, recently emerge as a scalable alternative to reinforcement learning (RL) approaches such as Q-learning or policy gradient, and are much faster when many central processing units (CPUs) are available due to better parallelization. In this paper, we propose a systematic incremental learning method for ES in dynamic environments. The goal is to adjust previously learned policy to a new one incrementally whenever the environment changes. We incorporate an instance weighting mechanism with ES to facilitate its learning adaptation, while retaining scalability of ES. During parameter updating, higher weights are assigned to instances that contain more new knowledge, thus encouraging the search distribution to move towards new promising areas of parameter space. We propose two easy-to-implement metrics to calculate the weights: instance novelty and instance quality. Instance novelty measures an instance's difference from the previous optimum in the original environment, while instance quality corresponds to how well an instance performs in the new environment. The resulting algorithm, Instance Weighted Incremental Evolution Strategies (IW-IES), is verified to achieve significantly improved performance on challenging RL tasks ranging from robot navigation to locomotion. This paper thus introduces a family of scalable ES algorithms for RL domains that enables rapid learning adaptation to dynamic environments.

preprint2022arXiv

On compression rate of quantum autoencoders: Control design, numerical and experimental realization

Quantum autoencoders which aim at compressing quantum information in a low-dimensional latent space lie in the heart of automatic data compression in the field of quantum information. In this paper, we establish an upper bound of the compression rate for a given quantum autoencoder and present a learning control approach for training the autoencoder to achieve the maximal compression rate. The upper bound of the compression rate is theoretically proven using eigen-decomposition and matrix differentiation, which is determined by the eigenvalues of the density matrix representation of the input states. Numerical results on 2-qubit and 3-qubit systems are presented to demonstrate how to train the quantum autoencoder to achieve the theoretically maximal compression, and the training performance using different machine learning algorithms is compared. Experimental results of a quantum autoencoder using quantum optical systems are illustrated for compressing two 2-qubit states into two 1-qubit states.

preprint2022arXiv

Optimal and two-step adaptive quantum detector tomography

Quantum detector tomography is a fundamental technique for calibrating quantum devices and performing quantum engineering tasks. In this paper, we design optimal probe states for detector estimation based on the minimum upper bound of the mean squared error (UMSE) and the maximum robustness. We establish the minimum UMSE and the minimum condition number for quantum detectors and provide concrete examples that can achieve optimal detector tomography. In order to enhance the estimation precision, we also propose a two-step adaptive detector tomography algorithm to optimize the probe states adaptively based on a modified fidelity index. We present a sufficient condition on when the estimation error of our two-step strategy scales inversely proportional to the number of state copies. Moreover, the superposition of coherent states is used as probe states for quantum detector tomography and the estimation error is analyzed. Numerical results demonstrate the effectiveness of both the proposed optimal and adaptive quantum detector tomography methods.

preprint2022arXiv

Quantum estimation, control and learning: opportunities and challenges

The development of estimation and control theories for quantum systems is a fundamental task for practical quantum technology. This vision article presents a brief introduction to challenging problems and potential opportunities in the emerging areas of quantum estimation, control and learning. The topics cover quantum state estimation, quantum parameter identification, quantum filtering, quantum open-loop control, quantum feedback control, machine learning for estimation and control of quantum systems, and quantum machine learning.

preprint2022arXiv

Residual Tensor Train: A Quantum-inspired Approach for Learning Multiple Multilinear Correlations

States of quantum many-body systems are defined in a high-dimensional Hilbert space, where rich and complex interactions among subsystems can be modelled. In machine learning, complex multiple multilinear correlations may also exist within input features. In this paper, we present a quantum-inspired multilinear model, named Residual Tensor Train (ResTT), to capture the multiple multilinear correlations of features, from low to high orders, within a single model. ResTT is able to build a robust decision boundary in a high-dimensional space for solving fitting and classification tasks. In particular, we prove that the fully-connected layer and the Volterra series can be taken as special cases of ResTT. Furthermore, we derive the rule for weight initialization that stabilizes the training of ResTT based on a mean-field analysis. We prove that such a rule is much more relaxed than that of TT, which means ResTT can easily address the vanishing and exploding gradient problem that exists in the existing TT models. Numerical experiments demonstrate that ResTT outperforms the state-of-the-art tensor network and benchmark deep learning models on MNIST and Fashion-MNIST datasets. Moreover, ResTT achieves better performance than other statistical methods on two practical examples with limited data which are known to have complex feature interactions.

preprint2022arXiv

Robust optimization for quantum reinforcement learning control using partial observations

The current quantum reinforcement learning control models often assume that the quantum states are known a priori for control optimization. However, full observation of quantum state is experimentally infeasible due to the exponential scaling of the number of required quantum measurements on the number of qubits. In this paper, we investigate a robust reinforcement learning method using partial observations to overcome this difficulty. This control scheme is compatible with near-term quantum devices, where the noise is prevalent and predetermining the dynamics of quantum state is practically impossible. We show that this simplified control scheme can achieve similar or even better performance when compared to the conventional methods relying on full observation. We demonstrate the effectiveness of this scheme on examples of quantum state control and quantum approximate optimization algorithm. It has been shown that high-fidelity state control can be achieved even if the noise amplitude is at the same level as the control amplitude. Besides, an acceptable level of optimization accuracy can be achieved for QAOA with noisy control Hamiltonian. This robust control optimization model can be trained to compensate the uncertainties in practical quantum computing.

preprint2021arXiv

Curriculum-based Deep Reinforcement Learning for Quantum Control

Deep reinforcement learning has been recognized as an efficient technique to design optimal strategies for different complex systems without prior knowledge of the control landscape. To achieve a fast and precise control for quantum systems, we propose a novel deep reinforcement learning approach by constructing a curriculum consisting of a set of intermediate tasks defined by a fidelity threshold. Tasks among a curriculum can be statically determined using empirical knowledge or adaptively generated with the learning process. By transferring knowledge between two successive tasks and sequencing tasks according to their difficulties, the proposed curriculum-based deep reinforcement learning (CDRL) method enables the agent to focus on easy tasks in the early stage, then move onto difficult tasks, and eventually approaches the final task. Numerical simulations on closed quantum systems and open quantum systems demonstrate that the proposed method exhibits improved control performance for quantum systems and also provides an efficient way to identify optimal strategies with fewer control pulses.

preprint2021arXiv

Deep Reinforcement Learning with Quantum-inspired Experience Replay

In this paper, a novel training paradigm inspired by quantum computation is proposed for deep reinforcement learning (DRL) with experience replay. In contrast to traditional experience replay mechanism in DRL, the proposed deep reinforcement learning with quantum-inspired experience replay (DRL-QER) adaptively chooses experiences from the replay buffer according to the complexity and the replayed times of each experience (also called transition), to achieve a balance between exploration and exploitation. In DRL-QER, transitions are first formulated in quantum representations, and then the preparation operation and the depreciation operation are performed on the transitions. In this progress, the preparation operation reflects the relationship between the temporal difference errors (TD-errors) and the importance of the experiences, while the depreciation operation is taken into account to ensure the diversity of the transitions. The experimental results on Atari 2600 games show that DRL-QER outperforms state-of-the-art algorithms such as DRL-PER and DCRL on most of these games with improved training efficiency, and is also applicable to such memory-based DRL approaches as double network and dueling network.

preprint2021arXiv

Expectation Synchronization Synthesis in Non-Markovian Open Quantum Systems

In this article, we investigate the problem of engineering synchronization in non-Markovian quantum systems. First, a time-convoluted linear quantum stochastic differential equation is derived which describes the Heisenberg evolution of a localized quantum system driven by multiple colored noise inputs. Then, we define quantum expectation synchronization in an augmented system consisting of two subsystems. We prove that, for two homogenous subsystems, synchronization can always be synthesized without designing direct Hamiltonian coupling given that the degree of non-Markovianity is below a certain threshold. System parameters are explicitly designed to achieve quantum synchronization. Also, a numerical example is presented to illustrate our results.

preprint2021arXiv

Intelligent Trajectory Planning in UAV-mounted Wireless Networks: A Quantum-Inspired Reinforcement Learning Perspective

In this paper, we consider a wireless uplink transmission scenario in which an unmanned aerial vehicle (UAV) serves as an aerial base station collecting data from ground users. To optimize the expected sum uplink transmit rate without any prior knowledge of ground users (e.g., locations, channel state information and transmit power), the trajectory planning problem is optimized via the quantum-inspired reinforcement learning (QiRL) approach. Specifically, the QiRL method adopts novel probabilistic action selection policy and new reinforcement strategy, which are inspired by the collapse phenomenon and amplitude amplification in quantum computation theory, respectively. Numerical results demonstrate that the proposed QiRL solution can offer natural balancing between exploration and exploitation via ranking collapse probabilities of possible actions, compared to the traditional reinforcement learning approaches which are highly dependent on tuned exploration parameters.

preprint2021arXiv

Lifelong Incremental Reinforcement Learning with Online Bayesian Inference

A central capability of a long-lived reinforcement learning (RL) agent is to incrementally adapt its behavior as its environment changes, and to incrementally build upon previous experiences to facilitate future learning in real-world scenarios. In this paper, we propose LifeLong Incremental Reinforcement Learning (LLIRL), a new incremental algorithm for efficient lifelong adaptation to dynamic environments. We develop and maintain a library that contains an infinite mixture of parameterized environment models, which is equivalent to clustering environment parameters in a latent space. The prior distribution over the mixture is formulated as a Chinese restaurant process (CRP), which incrementally instantiates new environment models without any external information to signal environmental changes in advance. During lifelong learning, we employ the expectation maximization (EM) algorithm with online Bayesian inference to update the mixture in a fully incremental manner. In EM, the E-step involves estimating the posterior expectation of environment-to-cluster assignments, while the M-step updates the environment parameters for future learning. This method allows for all environment models to be adapted as necessary, with new models instantiated for environmental changes and old models retrieved when previously seen environments are encountered again. Experiments demonstrate that LLIRL outperforms relevant existing methods, and enables effective incremental adaptation to various dynamic environments for lifelong learning.

preprint2020arXiv

Derivative-free global minimization for a class of multiple minima problems

We prove that the finite-difference based derivative-free descent (FD-DFD) methods have a capability to find the global minima for a class of multiple minima problems. Our main result shows that, for a class of multiple minima objectives that is extended from strongly convex functions with Lipschitz-continuous gradients, the iterates of FD-DFD converge to the global minimizer $x_*$ with the linear convergence $\|x_{k+1}-x_*\|_2^2\leqslantρ^k \|x_1-x_*\|_2^2$ for a fixed $0<ρ<1$ and any initial iteration $x_1\in\mathbb{R}^d$ when the parameters are properly selected. Since the per-iteration cost, i.e., the number of function evaluations, is fixed and almost independent of the dimension $d$, the FD-DFD algorithm has a complexity bound $\mathcal{O}(\log\frac{1}ε)$ for finding a point $x$ such that the optimality gap $\|x-x_*\|_2^2$ is less than $ε>0$. Numerical experiments in various dimensions from $5$ to $500$ demonstrate the benefits of the FD-DFD method.

preprint2020arXiv

Fault-tolerant Coherent H-infinity Control for Linear Quantum Systems

Robustness and reliability are two key requirements for developing practical quantum control systems. The purpose of this paper is to design a coherent feedback controller for a class of linear quantum systems suffering from Markovian jumping faults so that the closed-loop quantum system has both fault tolerance and H-infinity disturbance attenuation performance. This paper first extends the physical realization conditions from the time-invariant case to the time-varying case for linear stochastic quantum systems. By relating the fault tolerant H-infinity control problem to the dissipation properties and the solutions of Riccati differential equations, an H-infinity controller for the quantum system is then designed by solving a set of linear matrix inequalities (LMIs). In particular, an algorithm is employed to introduce additional noises and to construct the corresponding input matrices to ensure the physical realizability of the quantum controller. For real applications of the developed fault-tolerant control strategy, we present a linear quantum system example from quantum optics, where the amplitude of the pumping field randomly jumps among different values. It is demonstrated that a quantum H-infinity controller can be designed and implemented using some basic optical components to achieve the desired control goal.

preprint2020arXiv

Finite-time stabilization control of quantum systems

The finite-time control problem of quantum systems is investigated in this paper. We first define finite-time stability and present a finite-time Lyapunov stability criterion for finite-dimensional quantum systems in coherence vector representation. Then, for two-level quantum systems, we design a continuous non-smooth control law with a state-dependent fractional power and prove the uniqueness of solutions of the system dynamics with the controller via the concept of transversality. By combining the finite-time Lyapunov stability criterion with the homogeneity theory, the finite-time convergence of the system to an eigenstate of its internal Hamiltonian is proved. Numerical results on a spin-1/2 system demonstrate the effectiveness of the proposed finite-time stabilization control scheme.

preprint2020arXiv

Generation of accessible sets in the dynamical modelling of quantum network systems

In this paper, we consider the dynamical modeling of a class of quantum network systems consisting of qubits. Qubit probes are employed to measure a set of selected nodes of the quantum network systems. For a variety of applications, a state space model is a useful way to model the system dynamics. To construct a state space model for a quantum network system, the major task is to find an accessible set containing all of the operators coupled to the measurement operators. This paper focuses on the generation of a proper accessible set for a given system and measurement scheme. We provide analytic results on simplifying the process of generating accessible sets for systems with a time-independent Hamiltonian. Since the order of elements in the accessible set determines the form of state space matrices, guidance is provided to effectively arrange the ordering of elements in the state vector. Defining a system state according to the accessible set, one can develop a state space model with a special pattern inherited from the system structure. As a demonstration, we specifically consider a typical 1D-chain system with several common measurements, and employ the proposed method to determine its accessible set.

preprint2020arXiv

Hybrid filtering for a class of nonlinear quantum systems subject to classical stochastic disturbances

A hybrid quantum-classical filtering problem, where a qubit system is disturbed by a classical stochastic process, is investigated. The strategy is to model the classical disturbance by using an optical cavity. Relations between classical disturbances and the cavity analog system are analyzed. The dynamics of the enlarged quantum network system, which includes a qubit system and a cavity system, are derived. A stochastic master equation for the qubit-cavity hybrid system is given, based on which estimates for the state of the cavity system and the classical signal are obtained. The quantum extended Kalman filter is employed to achieve efficient computation. Numerical results are presented to illustrate the effectiveness of our methods.

preprint2020arXiv

Learning control of quantum systems using frequency-domain optimization algorithms

We investigate two classes of quantum control problems by using frequency-domain optimization algorithms in the context of ultrafast laser control of quantum systems. In the first class, the system model is known and a frequency-domain gradient-based optimization algorithm is applied to searching for an optimal control field to selectively and robustly manipulate the population transfer in atomic Rubidium. The other class of quantum control problems involves an experimental system with an unknown model. In the case, we introduce a differential evolution algorithm with a mixed strategy to search for optimal control fields and demonstrate the capability in an ultrafast laser control experiment for the fragmentation of Pr(hfac)$_3$ molecules.

preprint2020arXiv

Learning-based Quantum Robust Control: Algorithm, Applications and Experiments

Robust control design for quantum systems has been recognized as a key task in quantum information technology, molecular chemistry and atomic physics. In this paper, an improved differential evolution algorithm, referred to as \emph{msMS}\_DE, is proposed to search robust fields for various quantum control problems. In \emph{msMS}\_DE, multiple samples are used for fitness evaluation and a mixed strategy is employed for the mutation operation. In particular, the \emph{msMS}\_DE algorithm is applied to the control problems of (i) open inhomogeneous quantum ensembles and (ii) the consensus goal of a quantum network with uncertainties. Numerical results are presented to demonstrate the excellent performance of the improved machine learning algorithm for these two classes of quantum robust control problems. Furthermore, \emph{msMS}\_DE is experimentally implemented on femtosecond laser control applications to optimize two-photon absorption and control fragmentation of the molecule $\text{CH}_2\text{BrI}$. Experimental results demonstrate excellent performance of \emph{msMS}\_DE in searching for effective femtosecond laser pulses for various tasks.

preprint2020arXiv

On the capability of a class of quantum sensors

Quantum sensors may provide extremely high sensitivity and precision to extract key information in a quantum or classical physical system. A fundamental question is whether a quantum sensor is capable of uniquely inferring unknown parameters in a system for a given structure of the quantum sensor and admissible measurement on the sensor. In this paper, we investigate the capability of a class of quantum sensors which consist of either a single qubit or two qubits. A quantum sensor is coupled to a spin chain system to extract information of unknown parameters in the system. With given initialisation and measurement schemes, we employ the similarity transformation approach and the Grobner basis method to prove that a single-qubit quantum sensor cannot effectively estimate the unknown parameters in the spin chain system while the two-qubit quantum sensor can. The work demonstrates that it is a feasible method to enhance the capability of quantum sensors by increasing the number of qubits in the quantum sensors for some practical applications.

preprint2020arXiv

Realization of a quantum autoencoder for lossless compression of quantum data

As a ubiquitous aspect of modern information technology, data compression has a wide range of applications. Therefore, a quantum autoencoder which can compress quantum information into a low-dimensional space is fundamentally important to achieve automatic data compression in the field of quantum information. Such a quantum autoencoder can be implemented through training the parameters of a quantum device using classical optimization algorithms. In this article, we analyze the condition of achieving a perfect quantum autoencoder and theoretically prove that a quantum autoencoder can losslessly compress high-dimensional quantum information into a low-dimensional space (also called latent space) if the number of maximum linearly independent vectors from input states is no more than the dimension of the latent space. Also, we experimentally realize a universal two-qubit unitary gate and design a quantum autoencoder device by applying machine learning method. Experimental results demonstrate that our quantum autoencoder is able to compress two two-qubit states into two one-qubit states. Besides compressing quantum information, the quantum autoencoder is used to experimentally discriminate two groups of nonorthogonal states.

preprint2019arXiv

Detecting Non-Markovianity via Quantified Coherence: Theory and Experiments

The dynamics of open quantum systems and manipulation of quantum resources are both of fundamental interest in quantum physics. Here, we investigate the relation between quantum Markovianity and coherence, providing an effective way for detecting non-Markovianity based on the \textit{quantum-incoherent relative entropy of coherence} ($\mathcal{QI}$ REC). We theoretically show the relation between completely positive (CP) divisibility and the monotonic behavior of the $\mathcal{QI}$ REC. Also we implement an all-optical experiment to demonstrate that the behavior of the $\mathcal{QI}$ REC is coincident with the entanglement shared between the system and the ancilla for both Markovian and non-Markovian evolution; while other coherence-based non-Markovian information carriers violate monotonicity, even in Markovian processes. Moreover, we experimentally observe that non-Markovianity enhances the ability of creating coherence on an ancilla. This is the first experimental study of the relation between dynamical behavior of the $\mathcal{QI}$ REC and the phenomenon of information backflow. Moreover, our method for detecting non-Markovianity is applicable to general quantum evolutions.