Source author record

Yao Fu

Yao Fu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

quant-ph Machine Learning Computation and Language hep-ex hep-ph Artificial Intelligence Cryptography and Security Computational Complexity Computer Vision Distributed, Parallel, and Cluster Computing Neural and Evolutionary Computing physics.optics

Catalog footprint

What is connected

19works

12topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

RAM-H1200: A Unified Evaluation and Dataset on Hand Radiographs for Rheumatoid Arthritis

Rheumatoid arthritis (RA) assessment from hand radiographs requires multi-level analysis and modeling of anatomical structures and fine-grained local pathological changes. However, existing public resources do not support such unified multi-level analysis, often lacking full-hand coverage, fine-grained annotations, and consistent integration with clinical scoring systems. In particular, annotations that enable quantitative analysis of bone erosion (BE) remain scarce. RAM-H1200 contains 1,200 hand radiographs collected from six medical centers, with multi-level annotations including (i) whole-hand bone structure instance segmentation, (ii) pixel-level BE masks, (iii) SvdH-defined joint regions of interest, and (iv) joint-level SvdH scores for both BE and joint space narrowing (JSN). It is designed to evaluate whether models can jointly capture anatomical structure, localized erosive pathology, and clinically standardized RA severity from hand radiographs. The proposed BE masks enable, for the first time, quantitative BE analysis beyond coarse categorical grading by providing explicit spatial supervision for lesion extent and morphology. To our knowledge, RAM-H1200 is the first public large-scale benchmark that jointly supports whole-hand bone structure instance segmentation, pixel-level BE delineation, and clinically grounded joint-level SvdH scoring for both BE and JSN. Results across benchmark tasks show that anatomical modeling is substantially more mature than quantitative BE analysis: whole-hand bone segmentation achieves strong performance, whereas BE segmentation remains a major open challenge. By unifying anatomical structure modeling, quantitative lesion analysis, and clinically grounded SvdH scoring, RAM-H1200 provides a single benchmark for comprehensive RA analysis on hand radiographs.

preprint2022arXiv

Breaking the Rate-Loss Bound of Quantum Key Distribution with Asynchronous Two-Photon Interference

Twin-field quantum key distribution can overcome the secret key capacity of repeaterless quantum key distribution via single-photon interference. However, to compensate for the channel fluctuations and lock the laser fluctuations, the techniques of phase tracking and phase locking are indispensable in experiment, which drastically increase experimental complexity and hinder free-space realization. Inspired by the duality in entanglement, we herein present an asynchronous measurement-device-independent quantum key distribution protocol that can surpass the secret key capacity even without phase tracking and phase locking. Leveraging the concept of time multiplexing, asynchronous two-photon Bell-state measurement is realized by postmatching two interference detection events. For a 1 GHz system, the new protocol reaches a transmission distance of 450 km without phase tracking. After further removing phase locking, our protocol is still capable of breaking the capacity at 270 km. Intriguingly, when using the same experimental techniques, our protocol has a higher key rate than the phase-matching-type twin-field protocol. In the presence of imperfect intensity modulation, it also has a significant advantage in terms of the transmission distance over the sending-or-not-sending type twin-field protocol. With high key rates and accessible technology, our work provides a promising candidate for practical scalable quantum communication networks.

preprint2022arXiv

Data-to-text Generation with Variational Sequential Planning

We consider the task of data-to-text generation, which aims to create textual output from non-linguistic input. We focus on generating long-form text, i.e., documents with multiple paragraphs, and propose a neural model enhanced with a planning component responsible for organizing high-level information in a coherent and meaningful way. We infer latent plans sequentially with a structured variational model, while interleaving the steps of planning and generation. Text is generated by conditioning on previous variational decisions and previously generated text. Experiments on two data-to-text benchmarks (RotoWire and MLB) show that our model outperforms strong baselines and is sample efficient in the face of limited training data (e.g., a few hundred instances).

preprint2022arXiv

Experimental quantum advantage with quantum coupon collector

An increasing number of communication and computational schemes with quantum advantages have recently been proposed, which implies that quantum technology has fertile application prospects. However, demonstrating these schemes experimentally continues to be a central challenge because of the difficulty in preparing high-dimensional states or highly entangled states. In this study, we introduce and analyse a quantum coupon collector protocol by employing coherent states and simple linear optical elements, which was successfully demonstrated using realistic experimental equipment. We showed that our protocol can significantly reduce the number of samples needed to learn a specific set compared with the classical limit of the coupon collector problem. We also discuss the potential values and expansions of the quantum coupon collector by constructing a quantum blind box game. The information transmitted by the proposed game also broke the classical limit. These results strongly prove the advantages of quantum mechanics in machine learning and communication complexity.

preprint2022arXiv

Factorization of the forward-backward charge asymmetry and measurements of the weak mixing angle and proton structure at hadron colliders

The forward-backward charge asymmetry (AFB) at hadron colliders is sensitive to both the electroweak (EW) symmetry breaking represented by the effective weak mixing angle, and the proton structure information in the initial state modeled by the parton distribution functions (PDFs). Due to their strong correlation, the precisions of the determination on the weak mixing angle and PDFs using the measured AFB spectrum are limited. In this paper, we define a set of structure parameters which factorize the unique proton information of the relative difference between quarks and antiquarks in the AFB observation. Other than the conventional way of extracting the weak mixing angle fro the convolution of PDF and EW calculations, we propose a new method to simultaneously determine the value of the weak mixing angle and the proton structure terms by fitting to the observed AFB distribution, and point out the necessity of specifying additional observations to further reduce the uncertainties on the proton structure terms respectively, so that the model-independent high precision measurements can be achieved at the future LHC experiments.

preprint2022arXiv

Few-shot Subgoal Planning with Language Models

Pre-trained large language models have shown successful progress in many language understanding benchmarks. This work explores the capability of these models to predict actionable plans in real-world environments. Given a text instruction, we show that language priors encoded in pre-trained language models allow us to infer fine-grained subgoal sequences. In contrast to recent methods which make strong assumptions about subgoal supervision, our experiments show that language models can infer detailed subgoal sequences from few training sequences without any fine-tuning. We further propose a simple strategy to re-rank language model predictions based on interaction and feedback from the environment. Combined with pre-trained navigation and visual reasoning components, our approach demonstrates competitive performance on subgoal prediction and task completion in the ALFRED benchmark compared to prior methods that assume more subgoal supervision.

preprint2022arXiv

Latent Topology Induction for Understanding Contextualized Representations

In this work, we study the representation space of contextualized embeddings and gain insight into the hidden topology of large language models. We show there exists a network of latent states that summarize linguistic properties of contextualized representations. Instead of seeking alignments to existing well-defined annotations, we infer this latent network in a fully unsupervised way using a structured variational autoencoder. The induced states not only serve as anchors that mark the topology (neighbors and connectivity) of the representation manifold but also reveal the internal mechanism of encoding sentences. With the induced network, we: (1). decompose the representation space into a spectrum of latent states which encode fine-grained word meanings with lexical, morphological, syntactic and semantic information; (2). show state-state transitions encode rich phrase constructions and serve as the backbones of the latent space. Putting the two together, we show that sentences are represented as a traversal over the latent network where state-state transition chains encode syntactic templates and state-word emissions fill in the content. We demonstrate these insights with extensive experiments and visualizations.

preprint2022arXiv

Neural network-based prediction of the secret-key rate of quantum key distribution

Numerical methods are widely used to calculate the secure key rate of many quantum key distribution protocols in practice, but they consume many computing resources and are too time-consuming. In this work, we take the homodyne detection discrete-modulated continuous-variable quantum key distribution (CV-QKD) as an example, and construct a neural network that can quickly predict the secure key rate based on the experimental parameters and experimental results. Compared to traditional numerical methods, the speed of the neural network is improved by several orders of magnitude. Importantly, the predicted key rates are not only highly accurate but also highly likely to be secure. This allows the secure key rate of discrete-modulated CV-QKD to be extracted in real time on a low-power platform. Furthermore, our method is versatile and can be extended to quickly calculate the complex secure key rates of various other unstructured quantum key distribution protocols.

preprint2022arXiv

ResBos2 and the CDF W Mass Measurement

The recent CDF $W$ mass measurement of 80,433 $\pm$ 9 MeV is the most precise direct measurement. However, this result deviates from the Standard Model predicted mass of 80,359.1 $\pm$ 5.2 MeV by $7σ$. The CDF experiment used an older version of the ResBos code that was only accurate at NNLL+NLO, while the ResBos2 code is able to make predictions at N${}^3$LL+NNLO accuracy. We determine that the data-driven techniques used by CDF capture most of the higher order corrections, and using higher order corrections would result in a decrease in the value reported by CDF by at most 10 MeV.

preprint2021arXiv

On the Practicality of Differential Privacy in Federated Learning by Tuning Iteration Times

In spite that Federated Learning (FL) is well known for its privacy protection when training machine learning models among distributed clients collaboratively, recent studies have pointed out that the naive FL is susceptible to gradient leakage attacks. In the meanwhile, Differential Privacy (DP) emerges as a promising countermeasure to defend against gradient leakage attacks. However, the adoption of DP by clients in FL may significantly jeopardize the model accuracy. It is still an open problem to understand the practicality of DP from a theoretic perspective. In this paper, we make the first attempt to understand the practicality of DP in FL through tuning the number of conducted iterations. Based on the FedAvg algorithm, we formally derive the convergence rate with DP noises in FL. Then, we theoretically derive: 1) the conditions for the DP based FedAvg to converge as the number of global iterations (GI) approaches infinity; 2) the method to set the number of local iterations (LI) to minimize the negative influence of DP noises. By further substituting the Laplace and Gaussian mechanisms into the derived convergence rate respectively, we show that: 3) The DP based FedAvg with the Laplace mechanism cannot converge, but the divergence rate can be effectively prohibited by setting the number of LIs with our method; 4) The learning error of the DP based FedAvg with the Gaussian mechanism can converge to a constant number finally if we use a fixed number of LIs per GI. To verify our theoretical findings, we conduct extensive experiments using two real-world datasets. The results not only validate our analysis results, but also provide useful guidelines on how to optimize model accuracy when incorporating DP into FL

preprint2021arXiv

Reduction of the electroweak correlation in the PDF updating by using the forward-backward asymmetry of Drell-Yan process

We propose a new observable for the measurement of the forward-backward asymmetry $(A_{FB})$ in Drell-Yan lepton production. At hadron colliders, the $A_{FB}$ distribution is sensitive to both the electroweak (EW) fundamental parameter $\sin^2 θ_{W}$, the weak mixing angle, and the parton distribution functions (PDFs). Hence, the determination of $\sin^2 θ_{W}$ and the updating of PDFs by directly using the same $A_{FB}$ spectrum are strongly correlated. This correlation would introduce large bias or uncertainty into both precise measurements of EW and PDF sectors. In this article, we show that the sensitivity of $A_{FB}$ on $\sin^2 θ_{W}$ is dominated by its average value around the $Z$ pole region, while the shape (or gradient) of the $A_{FB}$ spectrum is insensitive to $\sin^2 θ_{W}$ and contains important information on the PDF modeling. Accordingly, a new observable related to the gradient of the spectrum is introduced, and demonstrated to be able to significantly reduce the potential bias on the determination of $\sin^2 θ_{W}$ when updating the PDFs using the same $A_{FB}$ data.

preprint2020arXiv

Paraphrase Generation with Latent Bag of Words

Paraphrase generation is a longstanding important problem in natural language processing. In addition, recent progress in deep generative models has shown promising results on discrete latent variables for text generation. Inspired by variational autoencoders with discrete latent structures, in this work, we propose a latent bag of words (BOW) model for paraphrase generation. We ground the semantics of a discrete latent variable by the BOW from the target sentences. We use this latent variable to build a fully differentiable content planning and surface realization model. Specifically, we use source words to predict their neighbors and model the target BOW with a mixture of softmax. We use Gumbel top-k reparameterization to perform differentiable subset sampling from the predicted BOW distribution. We retrieve the sampled word embeddings and use them to augment the decoder and guide its generation search space. Our latent BOW model not only enhances the decoder, but also exhibits clear interpretability. We show the model interpretability with regard to \emph{(i)} unsupervised learning of word neighbors \emph{(ii)} the step-by-step generation procedure. Extensive experiments demonstrate the transparent and effective generation process of this model.\footnote{Our code can be found at \url{https://github.com/FranxYao/dgm_latent_bow}}

preprint2016arXiv

Detector-decoy quantum key distribution without monitoring signal disturbance

The round-robin differential phase-shift quantum key distribution protocol provides a secure way to exchange private information without monitoring conventional disturbances and still maintains a high tolerance of noise, making it desirable for practical implementations of quantum key distribution. However, photon number resolving detectors are required to ensure that the detected signals are single photons in the original protocol. Here, we adopt the detector-decoy method and give the bounds to the fraction of detected events from single photons. Utilizing the advantages of the protocol, we provide a practical method of performing the protocol with desirable performances requiring only threshold single-photon detectors.

preprint2016arXiv

Practical Quantum Digital Signature

Guaranteeing nonrepudiation, unforgeability as well as transferability of a signature is one of the most vital safeguards in today's e-commerce era. Based on fundamental laws of quantum physics, quantum digital signature (QDS) aims to provide information-theoretic security for this cryptographic task. However, up to date, the previously proposed QDS protocols are impractical due to various challenging problems and most importantly, the requirement of authenticated (secure) quantum channels between participants. Here, we present the first quantum digital signature protocol that removes the assumption of authenticated quantum channels while remaining secure against the collective attacks. Besides, our QDS protocol can be practically implemented over more than 100 km under current mature technology as used in quantum key distribution.

preprint2016arXiv

Security of quantum key distribution with multiphoton components

Most qubit-based quantum key distribution (QKD) protocols extract the secure key merely from single-photon component of the attenuated lasers. However, with the Scarani-Acin-Ribordy-Gisin 2004 (SARG04) QKD protocol, the unconditionally secure key can be extracted from the two-photon component by modifying the classical post-processing procedure in the BB84 protocol. Employing the merits of SARG04 QKD protocol and six-state preparation, one can extract secure key from the components of single photon up to four photons. In this paper, we provide the exact relations between the secure key rate and the bit error rate in a six-state SARG04 protocol with single-photon, two-photon, three-photon, and four-photon sources. By restricting the mutual information between the phase error and bit error, we obtain a higher secure bit error rate threshold of the multiphoton components than previous works. Besides, we compare the performances of the six-state SARG04 with other prepare-and-measure QKD protocols using decoy states.

preprint2015arXiv

Long-Distance Measurement-Device-Independent Multiparty Quantum Communication

The Greenberger-Horne-Zeilinger (GHZ) entanglement, originally introduced to uncover the extreme violation of local realism against quantum mechanics, is an important resource for multiparty quantum communication tasks. But the low intensity and fragility of the GHZ entanglement source in current conditions have made the practical applications of these multiparty tasks an experimental challenge. Here we propose a feasible scheme for practically distributing the post-selected GHZ entanglement over a distance of more than 100 km for experimentally accessible parameter regimes. Combining the decoy-state and measurement-device-independent protocols for quantum key distribution, we anticipate that our proposal suggests an important avenue for practical multiparty quantum communication.

preprint2014arXiv

Long distance measurement-device-independent quantum key distribution with coherent-state superpositions

Measurement-device-independent quantum key distribution (MDI-QKD) with decoy-state method is believed to be securely applied to defeat various hacking attacks in practical quantum key distribution systems. Recently, the coherent-state superpositions (CSS) have emerged as an alternative to single-photon qubits for quantum information processing and metrology. Here, in this Letter, CSS are exploited as the source in MDI-QKD. We present an analytical method which gives two tight formulas to estimate the lower bound of yield and the upper bound of bit error rate. We exploit the standard statistical analysis and Chernoff bound to perform the parameter estimation. Chernoff bound can provide good bounds in the long distance MDI-QKD. Our results show that with CSS, both the security transmission distance and secure key rate are significantly improved compared with those of the weak coherent states in the finite-data case.

preprint2014arXiv

Measurement-device-independent quantum key distribution based on Bell's inequality

We propose two quantum key distribution (QKD) protocols based on Bell's inequality, which can be considered as modified time-reversed E91 protocol. Similar to the measurement-device-independent quantum key distribution (MDI-QKD) protocol, the first scheme requires the assumption that Alice and Bob perfectly characterize the encoded quantum states. However, our second protocol does not require this assumption, which can defeat more known and unknown source-side attacks compared with the MDI-QKD. The two protocols are naturally immune to all hacking attacks with respect to detections. Therefore, the security of the two protocols can be proven based on the violation of Bell's inequality with measurement data under fair-sampling assumption. In our simulation, the results of both protocols show that long-distance quantum key distribution over 200 km remains secure with conventional lasers in the asymptotic-data case. We present a new technique to estimate the Bell's inequality violation, which can also be applied to other fields of quantum information processing.

preprint2014arXiv

Violations of entropic Bell inequalities with coarse-grained quadrature measurements for continuous-variable states

It is a long-standing belief, as pointed out by Bell in 1986, that it is impossible to use a two-mode Gaussian state possessing a positive-definite Wigner function to demonstrate nonlocality as the Wigner function itself provides a local hidden-variable model. In particular, when one performs continuous-variable (CV) quadrature measurements upon a routinely generated CV entanglement, namely, the two-mode squeezed vacuum (TMSV) state, the resulting Wigner function is positive-definite and as such, the TMSV state cannot violate any Bell inequality using CV quadrature measurements. We show here, however, that a Bell inequality for CV states in terms of entropies can be quantum mechanically violated by the TMSV state with two coarse-grained quadrature measurements per site within experimentally accessible parameter regime. The proposed CV entropic Bell inequality is advantageous for an experimental test, especially for a possible loophole-free test of nonlocality, as the quadrature measurements can be implemented with homodyne detections of nearly 100\% detection efficiency under current technology.

Yao Fu

What is connected

Connect this record

See the researcher in context

Building this map preview

19 published item(s)

RAM-H1200: A Unified Evaluation and Dataset on Hand Radiographs for Rheumatoid Arthritis

Breaking the Rate-Loss Bound of Quantum Key Distribution with Asynchronous Two-Photon Interference

Data-to-text Generation with Variational Sequential Planning

Experimental quantum advantage with quantum coupon collector

Factorization of the forward-backward charge asymmetry and measurements of the weak mixing angle and proton structure at hadron colliders

Few-shot Subgoal Planning with Language Models

Latent Topology Induction for Understanding Contextualized Representations

Neural network-based prediction of the secret-key rate of quantum key distribution

ResBos2 and the CDF W Mass Measurement

On the Practicality of Differential Privacy in Federated Learning by Tuning Iteration Times

Reduction of the electroweak correlation in the PDF updating by using the forward-backward asymmetry of Drell-Yan process

Paraphrase Generation with Latent Bag of Words

Detector-decoy quantum key distribution without monitoring signal disturbance

Practical Quantum Digital Signature

Security of quantum key distribution with multiphoton components

Long-Distance Measurement-Device-Independent Multiparty Quantum Communication

Long distance measurement-device-independent quantum key distribution with coherent-state superpositions

Measurement-device-independent quantum key distribution based on Bell's inequality

Violations of entropic Bell inequalities with coarse-grained quadrature measurements for continuous-variable states