Source author record

Yao Liu

Yao Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher

Unclaimed source record

Machine Learning; astro-ph.GA; astro-ph.SR; astro-ph.EP; Artificial Intelligence; Computer Vision; Cryptography and Security; Networking and Internet Architecture; astro-ph.IM; Distributed, Parallel, and Cluster Computing; eess.AS; Information Theory; Logic in Computer Science; math.IT; physics.ins-det; Robotics; Sound

Catalog footprint

What is connected

32 works

17 topics

4 close collaborators

preprint 2026 arXiv

Disciplined Diffusion: Text-to-Image Diffusion Model against NSFW Generation

Text-to-image (T2I) diffusion models have the ability to build high-quality pictures from text prompts, but they pose safety concerns because they can generate offensive or disturbing imagery when provided with harmful inputs. Existing safety filters typically rely on text-based classifiers or image-based checkers that completely block the output upon detecting a threat, issuing an explicit allow/block feedback signal to the user. This binary strategy leaves models vulnerable to adversarial attacks that alter keywords to bypass detection, and it causes high false-alarm rates that degrade the experience for benign users. To address such vulnerabilities, we propose Disciplined Diffusion (DDiffusion), a novel robust text-to-image diffusion that counters Not Safe For Work (NSFW) generation by uncovering implicit malicious semantics in prompt embeddings. DDiffusion leverages a semantic retrieval mechanism to evaluate prompts against concept distributions rather than relying on brittle pairwise similarity. Furthermore, it employs a localization method during the diffusion process to selectively edit only the harmful regions of the generated image. By returning locally sanitized images instead of applying uniform blocking, DDiffusion suppresses malicious content while preserving generation fidelity for benign prompts and avoiding the binary allow-deny signal on which existing probing attacks rely.

preprint 2022 arXiv

Evolution of Galaxy Types and HI Gas in Hickson Compact Groups

Compact groups have high galaxy densities and low velocity dispersions, and their group members have experienced numerous and frequent interactions during their lifetimes. They provide a unique environment to study the evolution of galaxies. We examined the galaxy types and HI contents in groups to study galaxy evolution in compact groups. We used the group crossing time as an age indicator for galaxy groups. Our sample is derived from the Hickson Compact Group catalog. We obtained group morphology data from the Hyper-Leda database and the IR classification based on Wide-Field Infrared Survey Explorer (WISE) fluxes from Zucker et al. (2016). By cross-matching the latest released ALFALFA 100% HI source catalog, supplemented by data found in the literature, we obtained 40 galaxy groups with HI data available. We confirmed that the weak correlation between HI mass fraction and group crossing time found by Ai & Zhu (2018) in SDSS groups also exists in compact groups. We also found that the group spiral galaxy fraction is correlated with the group crossing time, but the actively star-forming galaxy fraction is not. These results seem to fit the hypothesis that the sequential acquisition of neighbors from surrounding larger-scale structures has affected the morphology transition and star formation efficiency in compact groups.

preprint 2022 arXiv

Gas Column Density Distribution of Molecular Clouds in the Third Quadrant of the Milky Way

We have obtained column density maps for an unbiased sample of 120 molecular clouds in the third quadrant of the Milky Way mid-plane ($|b| \le 5^{\circ}$) within the Galactic longitude range from 195$^{\circ}$ to 225$^{\circ}$, using the high-sensitivity $^{12}$CO and $^{13}$CO ($J=1-0$) data from the Milky Way Imaging Scroll Painting (MWISP) project. The probability density functions of the molecular hydrogen column density of the clouds, N-PDFs, are fitted with both a log-normal (LN) function and a log-normal plus power-law (LN+PL) function. The molecular clouds are classified into three categories according to the shapes of their N-PDFs: LN, LN+PL, and UN (unclear). About 72\% of the molecular clouds fall into the LN category, while 18\% and 10\% fall into the LN+PL and UN categories, respectively. A power-law scaling relation, $σ_s\propto N_{H_2}^{0.44}$, exists between the width of the N-PDF, $σ_s$, and the average column density, $N_{H_2}$, of the molecular clouds. However, $σ_s$ shows no correlation with the mass of the clouds. A correlation is found between the dispersion of normalized column density, $σ_{N/\langle N \rangle}$, and the sonic Mach number, $\mathcal{M}$, of molecular clouds. Overall, as predicted by numerical simulations, the molecular clouds with active star formation tend to have N-PDFs with power-law high-density tails.

preprint 2022 arXiv

Generalized Federated Learning via Sharpness Aware Minimization

Federated Learning (FL) is a promising framework for performing privacy-preserving, distributed learning with a set of clients. However, the data distribution among clients is often non-IID, i.e., subject to distribution shift, which makes efficient optimization difficult. To tackle this problem, many FL algorithms focus on mitigating the effects of data heterogeneity across clients by increasing the performance of the global model. However, almost all algorithms use Empirical Risk Minimization (ERM) as the local optimizer, which can easily drive the global model into a sharp valley and cause large deviations for some local clients. Therefore, in this paper, we revisit the solutions to the distribution shift problem in FL with a focus on local learning generality. To this end, we propose a general, effective algorithm, FedSAM, based on the Sharpness Aware Minimization (SAM) local optimizer, and develop a momentum FL algorithm, MoFedSAM, to bridge local and global models. Theoretically, we show the convergence analysis of these two algorithms and demonstrate the generalization bound of FedSAM. Empirically, our proposed algorithms substantially outperform existing FL studies and significantly decrease the learning deviation.
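As a rough illustration of the SAM local optimizer mentioned in this abstract, the sketch below performs the generic two-gradient SAM update on a toy quadratic loss. It is not the authors' FedSAM implementation: the function names, the learning rate `lr`, the perturbation radius `rho`, and the toy loss are all assumptions chosen for illustration, and no federated aggregation is shown.

```python
import numpy as np

def sam_local_step(w, grad_fn, lr=0.1, rho=0.05):
    """One generic SAM update (hypothetical helper, not the paper's code).

    Step 1: move to a nearby point in the gradient (ascent) direction.
    Step 2: descend from the original point using the gradient taken there.
    """
    g = grad_fn(w)
    norm = np.linalg.norm(g)
    if norm == 0.0:
        return w  # stationary point: nothing to do
    w_adv = w + rho * g / norm        # perturbed weights within radius rho
    return w - lr * grad_fn(w_adv)    # descend with the perturbed gradient

def quadratic_grad(w):
    # Gradient of the toy loss 0.5 * ||w||^2 (an assumption for this demo)
    return w

w = np.array([1.0, -2.0])
for _ in range(50):
    w = sam_local_step(w, quadratic_grad)
```

In a federated setting, each client would presumably run several such steps on its own data before the server aggregates the results; that part is omitted here.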

preprint 2022 arXiv

Information-Theoretic Limits of Integrated Sensing and Communication with Correlated Sensing and Channel States for Vehicular Networks

In connected vehicular networks, it is vital to have vehicular nodes that are capable of sensing their surrounding environments and exchanging messages with each other for automation and coordination purposes. Towards this end, integrated sensing and communication (ISAC), which combines sensing and communication systems to jointly utilize their resources and to pursue mutual benefits, emerges as a new cost-effective solution. In ISAC, the hardware and spectrum co-sharing leads to a fundamental tradeoff between sensing and communication performance, which is not well understood except for very simple cases with the same sensing and channel states, and perfect channel state information at the receiver (CSIR). In this paper, a general point-to-point ISAC model is proposed to account for scenarios in which the sensing state is different from but correlated with the channel state, and the CSIR is not necessarily perfect. For the model considered, the optimal tradeoff is characterized by a capacity-distortion function that quantifies the best communication rate for a given sensing distortion constraint. An iterative algorithm is proposed to compute this tradeoff, and a few non-trivial examples are constructed to demonstrate the benefits of ISAC as compared to the separation-based approach.

preprint 2022 arXiv

Learning to Guide Human Attention on Mobile Telepresence Robots with 360 Vision

Mobile telepresence robots (MTRs) allow people to navigate and interact with a remote environment that is in a place other than the person's true location. Thanks to the recent advances in 360-degree vision, many MTRs are now equipped with an all-degree visual perception capability. However, people's visual field horizontally spans only about 120 degrees of the visual field captured by the robot. To bridge this observability gap toward human-MTR shared autonomy, we have developed a framework, called GHAL360, to enable the MTR to learn a goal-oriented policy from reinforcements for guiding human attention using visual indicators. Three telepresence environments were constructed using datasets that are extracted from Matterport3D and collected from a real robot, respectively. Experimental results show that GHAL360 outperformed the baselines from the literature in the efficiency of a human-MTR team completing target search tasks.

preprint 2022 arXiv

LoMar: A Local Defense Against Poisoning Attack on Federated Learning

Federated learning (FL) provides a highly efficient decentralized machine learning framework, where the training data remains distributed at remote clients in a network. Though FL enables a privacy-preserving mobile edge computing framework using IoT devices, recent studies have shown that this approach is susceptible to poisoning attacks from the side of remote clients. To address the poisoning attacks on FL, we provide a two-phase defense algorithm called Local Malicious Factor (LoMar). In phase I, LoMar scores model updates from each remote client by measuring the relative distribution over their neighbors using a kernel density estimation method. In phase II, an optimal threshold is approximated to distinguish malicious and clean updates from a statistical perspective. Comprehensive experiments on four real-world datasets have been conducted, and the experimental results show that our defense strategy can effectively protect the FL system. Specifically, the defense performance on the Amazon dataset under a label-flipping attack indicates that, compared with FG+Krum, LoMar increases the target label testing accuracy from $96.0\%$ to $98.8\%$, and the overall averaged testing accuracy from $90.1\%$ to $97.0\%$.

preprint 2022 arXiv

Molecules with ALMA at Planet-forming Scales (MAPS) III: Characteristics of Radial Chemical Substructures

The Molecules with ALMA at Planet-forming Scales (MAPS) Large Program provides a detailed, high resolution (${\sim}$10-20 au) view of molecular line emission in five protoplanetary disks at spatial scales relevant for planet formation. Here, we present a systematic analysis of chemical substructures in 18 molecular lines toward the MAPS sources: IM Lup, GM Aur, AS 209, HD 163296, and MWC 480. We identify more than 200 chemical substructures, which are found at nearly all radii where line emission is detected. A wide diversity of radial morphologies - including rings, gaps, and plateaus - is observed both within each disk and across the MAPS sample. This diversity in line emission profiles is also present in the innermost 50 au. Overall, this suggests that planets form in varied chemical environments both across disks and at different radii within the same disk. Interior to 150 au, the majority of chemical substructures across the MAPS disks are spatially coincident with substructures in the millimeter continuum, indicative of physical and chemical links between the disk midplane and warm, elevated molecular emission layers. Some chemical substructures in the inner disk and most chemical substructures exterior to 150 au cannot be directly linked to dust substructure, however, which indicates that there are also other causes of chemical substructures, such as snowlines, gradients in UV photon fluxes, ionization, and radially-varying elemental ratios. This implies that chemical substructures could be developed into powerful probes of different disk characteristics, in addition to influencing the environments within which planets assemble. This paper is part of the MAPS special issue of the Astrophysical Journal Supplement.

preprint 2022 arXiv

Molecules with ALMA at Planet-forming Scales (MAPS). A Circumplanetary Disk Candidate in Molecular Line Emission in the AS 209 Disk

We report the discovery of a circumplanetary disk (CPD) candidate embedded in the circumstellar disk of the T Tauri star AS 209 at a radial distance of about 200 au (on-sky separation of 1."4 from the star at a position angle of $161^\circ$), isolated via $^{13}$CO $J=2-1$ emission. This is the first instance of CPD detection via gaseous emission capable of tracing the overall CPD mass. The CPD is spatially unresolved with a $117\times82$ mas beam and manifests as a point source in $^{13}$CO, indicating that its diameter is $\lesssim14$ au. The CPD is embedded within an annular gap in the circumstellar disk previously identified using $^{12}$CO and near-infrared scattered light observations, and is associated with localized velocity perturbations in $^{12}$CO. The coincidence of these features suggests that they have a common origin: an embedded giant planet. We use the $^{13}$CO intensity to constrain the CPD gas temperature and mass. We find that the CPD temperature is $\gtrsim35$ K, higher than the circumstellar disk temperature at the radial location of the CPD, 22 K, suggesting that heating sources localized to the CPD must be present. The CPD gas mass is $\gtrsim 0.095 M_{\rm Jup} \simeq 30 M_{\rm Earth}$ adopting a standard $^{13}$CO abundance. From the non-detection of millimeter continuum emission at the location of the CPD ($3σ$ flux density $\lesssim26.4~μ$Jy), we infer that the CPD dust mass is $\lesssim 0.027 M_{\rm Earth} \simeq 2.2$ lunar masses, indicating a low dust-to-gas mass ratio of $\lesssim9\times10^{-4}$. We discuss the formation mechanism of the CPD-hosting giant planet on a wide orbit in the framework of gravitational instability and pebble accretion.
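The unit conversions and the dust-to-gas ratio quoted in this abstract can be checked with a few lines of arithmetic. The sketch below uses rounded conversion factors (about 317.8 Earth masses per Jupiter mass and about 0.0123 Earth masses per lunar mass), which are standard values rather than numbers taken from the abstract; the variable names are illustrative.

```python
# Rounded conversion factors (standard values, not from the abstract)
M_JUP_IN_M_EARTH = 317.8    # Earth masses per Jupiter mass
M_MOON_IN_M_EARTH = 0.0123  # Earth masses per lunar mass

# Quoted limits: gas mass is a lower limit, dust mass an upper limit
gas_mass_mjup = 0.095
dust_mass_mearth = 0.027

gas_mass_mearth = gas_mass_mjup * M_JUP_IN_M_EARTH      # about 30 Earth masses
dust_mass_mmoon = dust_mass_mearth / M_MOON_IN_M_EARTH  # about 2.2 lunar masses

# Upper-limit dust over lower-limit gas gives an upper limit on the ratio
dust_to_gas = dust_mass_mearth / gas_mass_mearth        # about 9e-4
```

Because the dust mass is an upper limit and the gas mass a lower limit, their quotient is itself an upper limit, consistent with the quoted ratio.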

preprint 2022 arXiv

Offline Policy Optimization with Eligible Actions

Offline policy optimization could have a large impact on many real-world decision-making problems, as online learning may be infeasible in many applications. Importance sampling and its variants are a commonly used type of estimator in offline policy evaluation, and such estimators typically do not require assumptions on the properties and representational capabilities of value function or decision process model function classes. In this paper, we identify an important overfitting phenomenon in optimizing the importance weighted return, in which it may be possible for the learned policy to essentially avoid making aligned decisions for part of the initial state space. We propose an algorithm to avoid this overfitting through a new per-state-neighborhood normalization constraint, and provide a theoretical justification of the proposed algorithm. We also show the limitations of previous attempts to this approach. We test our algorithm in a healthcare-inspired simulator, a logged dataset collected from real hospitals and continuous control tasks. These experiments show the proposed method yields less overfitting and better test performance compared to state-of-the-art batch reinforcement learning algorithms.

preprint 2022 arXiv

On the Convergence of Multi-Server Federated Learning with Overlapping Area

Multi-server federated learning (FL) has been considered a promising solution to address the limited communication resource problem of single-server FL. We consider a typical multi-server FL architecture, where the coverage areas of regional servers may overlap. The key point of this architecture is that the clients located in the overlapping areas update their local models based on the average model of all accessible regional models, which enables indirect model sharing among different regional servers. Due to the complicated network topology, the convergence analysis is much more challenging than for single-server FL. In this paper, we first propose a novel MS-FedAvg algorithm for this multi-server FL architecture and analyze its convergence on non-iid datasets for general non-convex settings. Since the number of clients located in each regional server is much smaller than in single-server FL, the bandwidth of each client should be large enough to successfully communicate training models with the server, which indicates that full client participation can work in multi-server FL. Also, we provide the convergence analysis of the partial client participation scheme and develop a new biased partial participation strategy to further accelerate convergence. Our results indicate that the convergence results highly depend on the ratio of the number of clients in each area type to the total number of clients in all three strategies. Extensive experiments show remarkable performance and support our theoretical results.
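The overlap rule described in this abstract, where a client starts from the average of all regional models it can reach, can be sketched as follows. This is only a minimal illustration of that averaging step, not the MS-FedAvg algorithm itself; the helper name, the server labels, and the toy parameter vectors are assumptions.

```python
import numpy as np

def overlap_client_start(regional_models, accessible):
    """Average the regional models a client can reach (hypothetical helper).

    regional_models: dict mapping a server label to its parameter vector.
    accessible: labels of the servers whose coverage includes this client.
    """
    return np.mean([regional_models[s] for s in accessible], axis=0)

# Toy regional models for two servers, "A" and "B" (illustrative values)
regional_models = {"A": np.array([1.0, 3.0]), "B": np.array([3.0, 5.0])}

# A client in the overlap of A and B starts from the mean of both models
start_overlap = overlap_client_start(regional_models, ["A", "B"])

# A client covered only by A simply starts from A's model
start_single = overlap_client_start(regional_models, ["A"])
```

The client would then run its local training from this starting point and report its update back to the accessible servers, which is how information is shared indirectly between regions.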

preprint 2022 arXiv

Perception-Aware Attack: Creating Adversarial Music via Reverse-Engineering Human Perception

Recently, adversarial machine learning attacks have posed serious security threats against practical audio signal classification systems, including speech recognition, speaker recognition, and music copyright detection. Previous studies have mainly focused on ensuring the effectiveness of attacking an audio signal classifier via creating a small noise-like perturbation on the original signal. It is still unclear if an attacker is able to create audio signal perturbations that can be well perceived by human beings in addition to its attack effectiveness. This is particularly important for music signals as they are carefully crafted with human-enjoyable audio characteristics. In this work, we formulate the adversarial attack against music signals as a new perception-aware attack framework, which integrates human study into adversarial attack design. Specifically, we conduct a human study to quantify the human perception with respect to a change of a music signal. We invite human participants to rate their perceived deviation based on pairs of original and perturbed music signals, and reverse-engineer the human perception process by regression analysis to predict the human-perceived deviation given a perturbed signal. The perception-aware attack is then formulated as an optimization problem that finds an optimal perturbation signal to minimize the prediction of perceived deviation from the regressed human perception model. We use the perception-aware framework to design a realistic adversarial music attack against YouTube's copyright detector. Experiments show that the perception-aware attack produces adversarial music with significantly better perceptual quality than prior work.

preprint 2022 arXiv

Provably Sample-Efficient RL with Side Information about Latent Dynamics

We study reinforcement learning (RL) in settings where observations are high-dimensional, but where an RL agent has access to abstract knowledge about the structure of the state space, as is the case, for example, when a robot is tasked to go to a specific room in a building using observations from its own camera, while having access to the floor plan. We formalize this setting as transfer reinforcement learning from an abstract simulator, which we assume is deterministic (such as a simple model of moving around the floor plan), but which is only required to capture the target domain's latent-state dynamics approximately up to unknown (bounded) perturbations (to account for environment stochasticity). Crucially, we assume no prior knowledge about the structure of observations in the target domain except that they can be used to identify the latent states (but the decoding map is unknown). Under these assumptions, we present an algorithm, called TASID, that learns a robust policy in the target domain, with sample complexity that is polynomial in the horizon, and independent of the number of states, which is not possible without access to some prior knowledge. In synthetic experiments, we verify various properties of our algorithm and show that it empirically outperforms transfer RL algorithms that require access to "full simulators" (i.e., those that also simulate observations).

preprint 2022 arXiv

The Ages of Optically Bright Sub-Clusters in the Serpens Star-Forming Region

The Serpens Molecular Cloud is one of the most active star-forming regions within 500 pc, with over one thousand YSOs at different evolutionary stages. The ages of the member stars inform us about the star formation history of the cloud. In this paper, we develop a spectral energy distribution (SED) fitting method for nearby evolved (diskless) young stars from members of the Pleiades to estimate their ages, with a temperature scale adopted from APOGEE spectra. When compared with literature temperatures of selected YSOs in Orion, the SED fits to cool (<5000 K) stars have temperatures that differ by an average of <~ 50 K and have a scatter of ~ 210 K for both disk-hosting and diskless stars. We then apply this method to YSOs in the Serpens Molecular Cloud to estimate ages of optical members previously identified from Gaia DR2 astrometry data. The optical members in Serpens are concentrated in different subgroups with ages from ~4 Myr to ~22 Myr; the youngest clusters, W40 and Serpens South, are dusty regions that lack enough optical members to be included in this analysis. These ages establish that the Serpens Molecular Cloud has been forming stars for much longer than has been inferred from infrared surveys.

preprint 2022 arXiv

Tiny Object Tracking: A Large-scale Dataset and A Baseline

Tiny objects, frequently appearing in practical applications, have weak appearance and features, and receive increasing interest in many vision tasks, such as object detection and segmentation. To promote the research and development of tiny object tracking, we create a large-scale video dataset, which contains 434 sequences with a total of more than 217K frames. Each frame is carefully annotated with a high-quality bounding box. In data creation, we take 12 challenge attributes into account to cover a broad range of viewpoints and scene complexities, and annotate these attributes to facilitate attribute-based performance analysis. To provide a strong baseline in tiny object tracking, we propose a novel Multilevel Knowledge Distillation Network (MKDNet), which pursues three-level knowledge distillations in a unified framework to effectively enhance the feature representation, discrimination and localization abilities in tracking tiny objects. Extensive experiments are performed on the proposed dataset, and the results prove the superiority and effectiveness of MKDNet compared with state-of-the-art methods. The dataset, the algorithm code, and the evaluation code are available at https://github.com/mmic-lcl/Datasets-and-benchmark-code.

preprint 2022 arXiv

Towards Adaptive Unknown Authentication for Universal Domain Adaptation by Classifier Paradox

Universal domain adaptation (UniDA) is a general unsupervised domain adaptation setting, which addresses both domain and label shifts in adaptation. Its main challenge lies in how to identify target samples in unshared or unknown classes. Previous methods commonly strive to depict sample "confidence" along with a threshold for rejecting unknowns, and align feature distributions of shared classes across domains. However, it is still hard to pre-specify a "confidence" criterion and threshold which are adaptive to various real tasks, and a mis-prediction of unknowns further incurs misalignment of features in shared classes. In this paper, we propose a new UniDA method with adaptive Unknown Authentication by Classifier Paradox (UACP), considering that samples with paradoxical predictions are probably unknowns belonging to none of the source classes. In UACP, a composite classifier is jointly designed with two types of predictors. That is, a multi-class (MC) predictor classifies samples to one of the multiple source classes, while a binary one-vs-all (OVA) predictor further verifies the prediction by MC predictor. Samples with verification failure or paradox are identified as unknowns. Further, instead of feature alignment for shared classes, implicit domain alignment is conducted in output space such that samples across domains share the same decision boundary, though with feature discrepancy. Empirical results validate UACP under both open-set and universal UDA settings.
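The verification idea in this abstract, where a multi-class (MC) prediction is checked by a one-vs-all (OVA) predictor and disagreements are treated as unknowns, can be sketched roughly as below. This is an illustrative reading of the abstract, not the UACP implementation: the function name, the input layout, and in particular the fixed 0.5 decision boundary for the OVA output are assumptions.

```python
import numpy as np

def flag_unknowns(mc_probs, ova_probs, boundary=0.5):
    """Flag samples whose MC prediction is rejected by the OVA predictor.

    mc_probs:  (n, k) multi-class probabilities over the k source classes.
    ova_probs: (n, k) per-class "belongs to class j" probabilities.
    boundary:  assumed binary decision boundary for the OVA output.
    Returns (predicted_class, is_unknown), both arrays of length n.
    """
    mc_probs = np.asarray(mc_probs, dtype=float)
    ova_probs = np.asarray(ova_probs, dtype=float)
    pred = mc_probs.argmax(axis=1)                       # MC picks a class
    verified = ova_probs[np.arange(len(pred)), pred] >= boundary
    return pred, ~verified                               # paradox => unknown

mc = [[0.7, 0.3], [0.4, 0.6]]
ova = [[0.9, 0.1], [0.2, 0.3]]
pred, is_unknown = flag_unknowns(mc, ova)
```

In the second sample the MC predictor picks class 1, but the OVA predictor assigns that class a probability below the assumed boundary, so the sample is flagged as unknown.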

preprint 2020 arXiv

Bringing high spatial resolution to the Far-infrared -- A giant leap for astrophysics

The far-infrared (FIR) regime is one of the few wavelength ranges where no astronomical data with sub-arcsecond spatial resolution exist. None of the medium-term satellite projects such as SPICA, Millimetron, or O.S.T. will resolve this malady. For many research areas, however, information at high spatial and spectral resolution in the FIR, taken from atomic fine-structure lines, from highly excited carbon monoxide (CO), light hydrides, and especially from water lines would open the door for transformative science. A main theme will be to trace the role of water in proto-planetary disks, to observationally advance our understanding of the planet formation process and, intimately related to that, the pathways to habitable planets and the emergence of life. Furthermore, key observations will zoom into the physics and chemistry of the star-formation process in our own Galaxy, as well as in external galaxies. The FIR provides unique tools to investigate in particular the energetics of heating, cooling and shocks. The velocity-resolved data in these tracers will reveal the detailed dynamics ingrained in these processes in a spatially resolved fashion, and will deliver the perfect synergy with ground-based molecular line data for the colder dense gas.

preprint 2020 arXiv

Dual-Wavelength ALMA Observations of Dust Rings in Protoplanetary Disks

We present new Atacama Large Millimeter/submillimeter Array (ALMA) observations for three protoplanetary disks in Taurus at 2.9\,mm and comparisons with previous 1.3\,mm data both at an angular resolution of $\sim0.''1$ (15\,au for the distance of Taurus). In the single-ring disk DS Tau, double-ring disk GO Tau, and multiple-ring disk DL Tau, the same rings are detected at both wavelengths, with radial locations spanning from 50 to 120\,au. To quantify the dust emission morphology, the observed visibilities are modeled with a parametric prescription for the radial intensity profile. The disk outer radii, taken as 95\% of the total flux encircled in the model intensity profiles, are consistent at both wavelengths for the three disks. Dust evolution models show that dust trapping in local pressure maxima in the outer disk could explain the observed patterns. Dust rings are mostly unresolved. The marginally resolved ring in DS Tau shows a tentatively narrower ring at the longer wavelength, an observational feature expected from efficient dust trapping. The spectral index ($α_{\rm mm}$) increases outward and exhibits local minima that correspond to the peaks of dust rings, indicative of the changes in grain properties across the disks. The low optical depths ($τ\sim$0.1--0.2 at 2.9\,mm and 0.2--0.4 at 1.3\,mm) in the dust rings suggest that grains in the rings may have grown to millimeter sizes. The ubiquitous dust rings in protoplanetary disks modify the overall dynamics and evolution of dust grains, likely paving the way towards the new generation of planet formation.
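The outer-radius definition used in this abstract, the radius enclosing 95% of the total flux of a model intensity profile, can be sketched numerically as follows. The Gaussian profile, its 30 au width, the radial grid, and the helper name are assumptions for illustration only; the paper's own parametric prescription is not reproduced here.

```python
import numpy as np

def radius_enclosing_flux(r, intensity, fraction=0.95):
    """Radius enclosing `fraction` of the flux of an axisymmetric profile.

    The flux inside radius R is the integral of 2*pi*r*I(r) from 0 to R,
    evaluated here with a cumulative trapezoidal rule.
    """
    r = np.asarray(r, dtype=float)
    integrand = 2.0 * np.pi * r * np.asarray(intensity, dtype=float)
    steps = 0.5 * (integrand[1:] + integrand[:-1]) * np.diff(r)
    cumulative = np.concatenate([[0.0], np.cumsum(steps)])
    target = fraction * cumulative[-1]
    return float(np.interp(target, cumulative, r))

# Illustrative Gaussian profile with a 30 au width (an assumption)
sigma_au = 30.0
r_au = np.linspace(0.0, 150.0, 3001)
profile = np.exp(-r_au**2 / (2.0 * sigma_au**2))

r95 = radius_enclosing_flux(r_au, profile)
```

For a Gaussian profile the enclosed flux fraction is 1 - exp(-r^2 / (2 sigma^2)), so the 95% radius has the closed form sigma * sqrt(-2 ln 0.05), which gives a convenient check on the numerical result.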

preprint 2020 arXiv

Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions

Off-policy evaluation in reinforcement learning offers the chance of using observational data to improve future outcomes in domains such as healthcare and education, but safe deployment in high stakes settings requires ways of assessing its validity. Traditional measures such as confidence intervals may be insufficient due to noise, limited data and confounding. In this paper we develop a method that could serve as a hybrid human-AI system, to enable human experts to analyze the validity of policy evaluation estimates. This is accomplished by highlighting observations in the data whose removal will have a large effect on the OPE estimate, and formulating a set of rules for choosing which ones to present to domain experts for validation. We develop methods to compute exactly the influence functions for fitted Q-evaluation with two different function classes: kernel-based and linear least squares, as well as importance sampling methods. Experiments on medical simulations and real-world intensive care unit data demonstrate that our method can be used to identify limitations in the evaluation process and make evaluation more robust.

preprint 2020 arXiv

NetReduce: RDMA-Compatible In-Network Reduction for Distributed DNN Training Acceleration

We present NetReduce, a novel RDMA-compatible in-network reduction architecture to accelerate distributed DNN training. Compared to existing designs, NetReduce maintains a reliable connection between end-hosts in the Ethernet and does not terminate the connection in the network. The advantage of doing so is that we can fully reuse the designs of congestion control and reliability in RoCE. Meanwhile, we do not need to implement a high-cost network protocol processing stack in the switch, as IB does. The prototype, implemented using an FPGA, is an out-of-the-box solution that does not require modifying commodity devices such as NICs or switches. For the coordination between the end-host and the switch, NetReduce customizes the transport protocol only on the first packet in a data message to comply with RoCE v2. A special status monitoring module is designed to reuse the reliability mechanism of RoCE v2 for dealing with packet loss. A message-level credit-based flow control algorithm is also proposed to fully utilize bandwidth and avoid buffer overflow. We study the effects of intra-machine bandwidth on the training performance in multi-machine, multi-GPU scenarios and give sufficient conditions for hierarchical NetReduce to outperform other algorithms. We also extend the design from rack-level aggregation to a more general spine-leaf topology in the data center. NetReduce accelerates training by up to 1.7x and 1.5x for CNN-based CV and transformer-based NLP tasks, respectively. Simulations on large-scale systems indicate the superior scalability of NetReduce compared to the state-of-the-art ring all-reduce.

preprint 2020 arXiv

Pebbles in an Embedded Protostellar Disk: The Case of CB26

Planetary cores are thought to form in proto-planetary disks via the growth of dusty solid material. However, it is unclear how early this process begins. We study the physical structure and grain growth in the edge-on disk that surrounds the ~1 Myr old low-mass (~0.55 Msun) protostar embedded in the Bok Globule CB 26 to examine how much grain growth has already occurred in the protostellar phase. We combine the SED between 0.9 $μ$m and 6.4 cm with high angular resolution continuum maps at 1.3, 2.9, and 8.1 mm, and use the radiative transfer code RADMC-3D to conduct a detailed modelling of the dust emission from the disk and envelope of CB 26. We infer inner and outer disk radii of around 16 au and 172$\pm$22 au, respectively. The total gas mass in the disk is ~0.076 Msun, which amounts to ~14% of the mass of the central star. The inner disk contains a compact free-free emission region, which could be related to either a jet or a photoevaporation region. The thermal dust emission from the outer disk is optically thin at mm wavelengths, while the emission from the inner disk midplane is moderately optically thick. Our best-fit radiative transfer models indicate that the dust grains in the disk have already grown to pebbles with diameters of the order of 10 cm. Residual 8.1 mm emission suggests the presence of even larger particles in the inner disk. For the optically thin mm dust emission from the outer disk, we derive a mean opacity slope of 0.6$\pm$0.4, which is consistent with the presence of large dust grains. The presence of cm-sized bodies in the CB 26 disk indicates that solids grow rapidly already during the first million years in a protostellar disk. It is thus possible that Class II disks are already seeded with large particles and may even contain planetesimals.

preprint2020arXiv

Provably Good Batch Reinforcement Learning Without Great Exploration

Batch reinforcement learning (RL) is important for applying RL algorithms to many high-stakes tasks. Doing batch RL in a way that yields a reliable new policy in large domains is challenging: a new decision policy may visit states and actions outside the support of the batch data, and function approximation and optimization with limited samples can further increase the potential of learning policies with overly optimistic estimates of their future performance. Recent algorithms have shown promise but can still be overly optimistic in their expected outcomes. Theoretical work that provides strong guarantees on the performance of the output policy relies on a strong concentrability assumption, which makes it unsuitable for cases where the ratio between state-action distributions of behavior policy and some candidate policies is large. This is because in the traditional analysis, the error bound scales up with this ratio. We show that a small modification to Bellman optimality and evaluation back-up to take a more conservative update can have much stronger guarantees. In certain settings, the resulting algorithms can find the approximately best policy within the state-action space explored by the batch data, without requiring a priori assumptions of concentrability. We highlight the necessity of our conservative update and the limitations of previous algorithms and analyses by illustrative MDP examples, and demonstrate an empirical comparison of our algorithm and other state-of-the-art batch RL baselines in standard benchmarks.
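As a rough illustration of what a "more conservative" back-up could look like, the sketch below runs tabular Q-iteration in which the max over next actions only trusts state-action pairs that occur often enough in the batch. This is an assumption-laden toy, not the paper's algorithm: the count threshold, the function name `conservative_q_iteration`, and the choice to value unsupported pairs at 0 (pessimistic only when rewards are non-negative) are all hypothetical.

```python
from collections import defaultdict


def conservative_q_iteration(batch, n_actions, threshold, gamma=0.9, iters=100):
    """Tabular Q-iteration whose max only trusts well-supported (s, a) pairs.

    batch: list of (state, action, reward, next_state) transitions;
    next_state is None when the episode ends.
    Assumes non-negative rewards, so that a value of 0 is pessimistic.
    """
    # Count how often each state-action pair appears in the batch.
    counts = defaultdict(int)
    for s, a, _, _ in batch:
        counts[(s, a)] += 1

    def supported(s, a):
        return counts.get((s, a), 0) >= threshold

    q = defaultdict(float)
    for _ in range(iters):
        sums, ns = defaultdict(float), defaultdict(int)
        for s, a, r, s2 in batch:
            target = r
            if s2 is not None:
                # Unsupported next actions contribute 0 instead of an
                # extrapolated (possibly over-optimistic) value.
                target += gamma * max(
                    q[(s2, a2)] if supported(s2, a2) else 0.0
                    for a2 in range(n_actions)
                )
            sums[(s, a)] += target
            ns[(s, a)] += 1
        # Empirical average of the back-up targets for each observed pair.
        q = defaultdict(float, {k: sums[k] / ns[k] for k in sums})
    return q
```

With a batch in which a high-reward action was observed only once, raising `threshold` above 1 stops that single observation from propagating into the values of earlier states.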

preprint2020arXiv

Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling

Off-policy policy estimators that use importance sampling (IS) can suffer from high variance in long-horizon domains, and there has been particular excitement over new IS methods that leverage the structure of Markov decision processes. We analyze the variance of the most popular approaches through the viewpoint of conditional Monte Carlo. Surprisingly, we find that in finite horizon MDPs there is no strict variance reduction of per-decision importance sampling or stationary importance sampling, compared with vanilla importance sampling. We then provide sufficient conditions under which the per-decision or stationary estimators will provably reduce the variance over importance sampling with finite horizons. For the asymptotic (in terms of horizon $T$) case, we develop upper and lower bounds on the variance of those estimators which yield sufficient conditions under which there exists an exponential vs. polynomial gap between the variance of importance sampling and that of the per-decision or stationary estimators. These results help advance our understanding of if and when new types of IS estimators will improve the accuracy of off-policy estimation.
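For readers unfamiliar with the two estimators being compared, here is a minimal sketch of the standard textbook forms of vanilla (trajectory-wise) and per-decision importance sampling for a single trajectory. The data layout (a list of `(rho_t, r_t)` pairs, where `rho_t` is the per-step likelihood ratio between evaluation and behavior policy) and the function names are assumptions made for illustration; the stationary estimator discussed in the abstract is not shown.

```python
from typing import Callable, List, Tuple

# Each step is (rho_t, r_t): the per-step likelihood ratio
# pi_e(a_t | s_t) / pi_b(a_t | s_t) and the observed reward.
Step = Tuple[float, float]


def vanilla_is(trajectory: List[Step], gamma: float = 1.0) -> float:
    """Weight the whole discounted return by the product of all ratios."""
    weight = 1.0
    ret = 0.0
    for t, (rho, reward) in enumerate(trajectory):
        weight *= rho
        ret += (gamma ** t) * reward
    return weight * ret


def per_decision_is(trajectory: List[Step], gamma: float = 1.0) -> float:
    """Weight each reward only by the ratios up to and including its step."""
    weight = 1.0
    total = 0.0
    for t, (rho, reward) in enumerate(trajectory):
        weight *= rho
        total += (gamma ** t) * weight * reward
    return total


def estimate(trajectories: List[List[Step]],
             estimator: Callable[[List[Step], float], float],
             gamma: float = 1.0) -> float:
    """Average a single-trajectory estimator over a batch of trajectories."""
    return sum(estimator(tr, gamma) for tr in trajectories) / len(trajectories)
```

On any one trajectory the two estimators can disagree in either direction; which of them has lower variance over many trajectories depends on the problem, which is the question the abstract addresses.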

preprint2019arXiv

Combining Parametric and Nonparametric Models for Off-Policy Evaluation

We consider a model-based approach to perform batch off-policy evaluation in reinforcement learning. Our method takes a mixture-of-experts approach to combine parametric and non-parametric models of the environment such that the final value estimate has the least expected error. We do so by first estimating the local accuracy of each model and then using a planner to select which model to use at every time step so as to minimize the return error estimate along entire trajectories. Across a variety of domains, our mixture-based approach outperforms the individual models alone as well as state-of-the-art importance sampling-based estimators.

preprint2015arXiv

An Improved Decision Procedure for Linear Time Mu-Calculus

An improved Present Future form (PF form) for linear time $μ$-calculus ($ν$TL) is presented in this paper. In particular, the future part of the new version turns into the conjunction of elements in the closure of a formula. We show that every closed $ν$TL formula can be transformed into the new PF form. Additionally, based on the PF form, an algorithm for constructing the Present Future form Graph (PFG), which can be utilized to describe models of a formula, is given. Further, an intuitive and efficient decision procedure for checking satisfiability of the guarded fragment of $ν$TL formulas based on PFG is proposed and implemented in C++. The new decision procedure has the best time complexity among the existing ones, at the cost of exponential space. Finally, a PFG-based model checking approach for $ν$TL is discussed where a counterexample can be obtained visually when a model violates a property.

preprint2014arXiv

Herschel/PACS view of disks around low-mass stars and brown dwarfs in the TW Hya association

We conducted Herschel/PACS observations of five very low-mass stars or brown dwarfs located in the TW Hya association with the goal of characterizing the properties of disks in the low stellar mass regime. We detected all five targets at $70\,μ{\rm{m}}$ and $100\,μ{\rm{m}}$ and three targets at $160\,μ{\rm{m}}$. Our observations, combined with previous photometry from 2MASS, WISE, and SCUBA-2, enabled us to construct SEDs with extended wavelength coverage. Using sophisticated radiative transfer models, we analyzed the observed SEDs of the five detected objects with a hybrid fitting strategy that combines the model grids and the simulated annealing algorithm and evaluated the constraints on the disk properties via the Bayesian inference method. The modelling suggests that disks around low-mass stars and brown dwarfs are generally flatter than their higher mass counterparts, but the range of disk mass extends to well below the value found in T Tauri stars, and the disk scale heights are comparable in both groups. The inferred disk properties (i.e., disk mass, flaring, and scale height) in the low stellar mass regime are consistent with previous findings from large samples of brown dwarfs and very low-mass stars. We discuss the dependence of disk properties on their host stellar parameters and find a significant correlation between the Herschel far-IR fluxes and the stellar effective temperatures, probably indicating that the scaling between the stellar and disk masses (i.e., $M_{\rm{disk}} \propto M_{\star}$) observed mainly in low-mass stars may extend down to the brown dwarf regime.

preprint2013arXiv

CEN34 -- High-Mass YSO in M17 or Background Post-AGB Star?

We investigate the proposed high-mass young stellar object (YSO) candidate CEN34, thought to be associated with the star forming region M17. Its optical to near-infrared (550-2500 nm) spectrum reveals several photospheric absorption features, such as Hα, Ca triplet and CO bandheads but lacks any emission lines. The spectral features in the range 8375-8770 Å are used to constrain an effective temperature of 5250$\pm$250 (early-/mid-G) and a surface gravity of 2.0$\pm$0.3 (supergiant). The spectral energy distribution of CEN34 resembles the SED of a high-mass YSO or an evolved star. Moreover, the observed temperature and surface gravity are identical for high-mass YSOs and evolved stars. The radial velocity relative to LSR (V_LSR) of CEN34 as obtained from various photospheric lines is of the order of -60 km/s and thus distinct from the +25 km/s found for several OB stars in the cluster and for the associated molecular cloud. The SED modeling yields ~$10^{-4}$ M_sun of circumstellar material which contributes only a tiny fraction to the total visual extinction (11 mag). In the case of a YSO, a dynamical ejection process is proposed to explain the V_LSR difference between CEN34 and M17. Additionally, to match the temperature and luminosity, we speculate that CEN34 had accumulated the bulk of its mass with accretion rate > $4\times10^{-3}$ M_sun/yr in a very short time span (~$10^3$ yr), and currently undergoes a phase of gravitational contraction without any further mass gain. However, all the aforementioned characteristics of CEN34 are compatible with an evolved star of 5-7 M_sun and an age of 50-100 Myr, most likely a background post-AGB star with a distance between 2.0 kpc and 4.5 kpc. We consider the latter classification as the more likely interpretation. Further discrimination between the two possible scenarios should come from the more strict confinement of CEN34's distance.

preprint2013arXiv

CO observation of the Galactic bubble N4

We present a study of the Galactic bubble N4 using the 13.7 m millimeter telescope of Purple Mountain Observatory at the Qinghai Station. N4 is one of the science demonstration regions for the Milky Way Imaging Scroll Painting (MWISP). Simultaneous observations of $^{12}$CO (J = 1$-$0), $^{13}$CO (J = 1$-$0) and C$^{18}$O (J = 1$-$0) line emission towards N4 were carried out. We analyzed the spectral profile and the distribution of the molecular gas. Morphologically, the CO emission correlates well with Spitzer IRAC 8.0 $μ$m emission. The channel map and velocity-position diagram show that N4 is more likely an inclined expanding ring than a spherical bubble. We calculated the physical parameters of N4 including the mass, size, column density and optical depth. Some massive star candidates were discovered in the region of N4 using the (J, J$-$H) color-magnitude diagram. We found an energy source candidate for the expansion of N4, a massive star with a mass of ${\sim} 15\,M_{\odot}$ and an age of $\sim$ 1 Myr. There is an infall motion signature in N4, which makes it a good candidate infall region. Combining mm and infrared data, we think there may be triggered star formation in N4.

preprint2012arXiv

A Comparison of Approaches in Fitting Continuum SEDs

We present a detailed comparison of two approaches, the use of a pre-calculated database and simulated annealing (SA), for fitting the continuum spectral energy distribution (SED) of astrophysical objects whose appearance is dominated by surrounding dust. While pre-calculated databases are commonly used to model SED data, only a few studies to date have employed SA due to its unclear accuracy and convergence time for this specific problem. From a methodological point of view, different approaches lead to different fitting quality, demand on computational resources and calculation time. We compare the fitting quality and computational costs of these two approaches for the task of SED fitting to provide a guide to the practitioner to find a compromise between desired accuracy and available resources. To reduce uncertainties inherent to real datasets, we introduce a reference model resembling a typical circumstellar system with 10 free parameters. We derive the SED of the reference model with our code MC3D at 78 logarithmically distributed wavelengths in the range [0.3 μm, 1.3 mm] and use this setup to simulate SEDs for the database and SA. Our result directly shows the applicability of SA in the field of SED modeling, since the algorithm regularly finds better solutions to the optimization problem than a pre-calculated database. As both methods have advantages and shortcomings, a hybrid approach is preferable. While the database provides an approximate fit and overall probability distributions for all parameters deduced using Bayesian analysis, SA can be used to improve upon the results returned by the model grid.
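The hybrid idea in the last two sentences (coarse fit from a grid, then SA refinement) can be sketched on a toy problem. Everything below is an illustrative assumption rather than the authors' setup: the two-parameter power-law model stands in for a real radiative transfer model such as MC3D, and the function names, the linear cooling schedule and the Gaussian proposal width are hypothetical choices.

```python
import math
import random


def chi2(params, xs, ys):
    """Sum of squared residuals for a toy two-parameter model y = a * x**b."""
    a, b = params
    return sum((a * x ** b - y) ** 2 for x, y in zip(xs, ys))


def grid_fit(xs, ys, a_grid, b_grid):
    """Database-style step: pick the best model from a pre-computed grid."""
    return min(((a, b) for a in a_grid for b in b_grid),
               key=lambda p: chi2(p, xs, ys))


def anneal(start, xs, ys, steps=2000, t0=1.0, step_size=0.1, seed=0):
    """SA refinement starting at the grid solution; returns the best point seen."""
    rng = random.Random(seed)
    current, current_cost = start, chi2(start, xs, ys)
    best, best_cost = current, current_cost
    for k in range(steps):
        temp = t0 * (1.0 - k / steps) + 1e-9  # linear cooling schedule
        cand = (current[0] + rng.gauss(0, step_size),
                current[1] + rng.gauss(0, step_size))
        cost = chi2(cand, xs, ys)
        # Metropolis rule: always accept improvements, and accept worse
        # moves with a probability that shrinks as the temperature drops.
        if cost < current_cost or rng.random() < math.exp((current_cost - cost) / temp):
            current, current_cost = cand, cost
            if cost < best_cost:
                best, best_cost = cand, cost
    return best, best_cost
```

Because `anneal` tracks the best point seen and starts from the grid solution, the refined fit is never worse than the grid fit it was given; how much it improves depends on the grid spacing and the annealing settings.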

preprint2012arXiv

A Herschel Survey of Cold Dust in Disks Around Brown Dwarfs and Low-Mass Stars

We report the complete photometric results from our Herschel study, which is the first comprehensive program to search for far-infrared emission from cold dust around young brown dwarfs. We surveyed 50 fields containing 51 known or suspected brown dwarfs and very low mass stars that have evidence of circumstellar disks based on Spitzer photometry and/or spectroscopy. The objects with known spectral types range from M3 to M9.5. Four of the candidates were subsequently identified as extragalactic objects. Of the remaining 47 we have successfully detected 36 at 70 μm and 14 at 160 μm with S/N greater than 3, as well as several additional possible detections with low S/N. The objects exhibit a range of [24]-[70] μm colors suggesting a range in mass and/or structure of the outer disk. We present modeling of the spectral energy distributions of the sample and discuss trends visible in the data. Using two Monte Carlo radiative transfer codes we investigate disk masses and geometry. We find a very wide range in modeled total disk masses from less than 1e-6 solar masses up to 1e-3 solar masses with a median disk mass of order 3e-5 solar masses, suggesting that the median ratio of disk mass to central object mass may be lower than for T Tauri stars. The disk scale heights and flaring angles, however, cover a range consistent with those seen around T Tauri stars. The host clouds in which the young brown dwarfs and low-mass stars are located span a range in estimated age from ~1-3 Myr to ~10 Myr and represent a variety of star-forming environments. No obvious dependence on cloud location or age is seen in the disk properties, though the statistical significance of this conclusion is not strong.

preprint2012arXiv

A Non-Monetary Protocol for Peer-to-Peer Content Distribution in Wireless Broadcast Networks with Network Coding

This paper studies the problem of content distribution in wireless peer-to-peer networks where all nodes are selfish and non-cooperative. We propose a model that considers both the broadcast nature of wireless channels and the incentives of nodes, where each node aims to increase its own download rate and reduce its upload rate through the course of content distribution. We then propose a protocol for these selfish nodes to exchange contents. Our protocol is distributed and does not require the exchange of money, reputation, etc., and hence can be easily implemented without additional infrastructure. Moreover, we show that our protocol can be easily modified to employ network coding. The performance of our protocol is studied. We derive a closed-form expression of Nash equilibria when there are only two files in the system. The prices of anarchy, both from each node's perspective and the whole system's perspective, are also characterized. Moreover, we propose a distributed mechanism where each node adjusts its strategies only based on local information and show that the mechanism converges to a Nash equilibrium. We also introduce an approach for calculating Nash equilibria for systems that incorporate network coding when there are more than two files.

preprint2011arXiv

A Herschel Search For Cold Dust in Brown Dwarf Disks: First Results

We report initial results from a Herschel program to search for far-infrared emission from cold dust around a statistically significant sample of young brown dwarfs. The first three objects in our survey are all detected at 70 μm, and we report the first detection of a brown dwarf at 160 μm. The flux densities are consistent with the presence of substantial amounts of cold dust in the outer disks around these objects. We modeled the SEDs with two different radiative transfer codes. We find that a broad range of model parameters provides a reasonable fit to the SEDs, but that the addition of our 70 μm, and especially the 160 μm, detection enables strong lower limits to be placed on the disk masses since most of the mass is in the outer disk. We find likely disk masses in the range of a few $\times 10^{-6}$ to $10^{-4}$ $M_{\odot}$. Our models provide a good fit to the SEDs and do not require dust settling.

32 published item(s)

Disciplined Diffusion: Text-to-Image Diffusion Model against NSFW Generation

Evolution of Galaxy Types and HI Gas in Hickson Compact Groups

Gas Column Density Distribution of Molecular Clouds in the Third Quadrant of the Milky Way

Generalized Federated Learning via Sharpness Aware Minimization

Information-Theoretic Limits of Integrated Sensing and Communication with Correlated Sensing and Channel States for Vehicular Networks

Learning to Guide Human Attention on Mobile Telepresence Robots with 360 Vision

LoMar: A Local Defense Against Poisoning Attack on Federated Learning

Molecules with ALMA at Planet-forming Scales (MAPS) III: Characteristics of Radial Chemical Substructures

Molecules with ALMA at Planet-forming Scales (MAPS). A Circumplanetary Disk Candidate in Molecular Line Emission in the AS 209 Disk

Offline Policy Optimization with Eligible Actions

On the Convergence of Multi-Server Federated Learning with Overlapping Area

Perception-Aware Attack: Creating Adversarial Music via Reverse-Engineering Human Perception

Provably Sample-Efficient RL with Side Information about Latent Dynamics

The Ages of Optically Bright Sub-Clusters in the Serpens Star-Forming Region

Tiny Object Tracking: A Large-scale Dataset and A Baseline

Towards Adaptive Unknown Authentication for Universal Domain Adaptation by Classifier Paradox

Bringing high spatial resolution to the Far-infrared -- A giant leap for astrophysics

Dual-Wavelength ALMA Observations of Dust Rings in Protoplanetary Disks

Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions

NetReduce: RDMA-Compatible In-Network Reduction for Distributed DNN Training Acceleration

Pebbles in an Embedded Protostellar Disk: The Case of CB26

Provably Good Batch Reinforcement Learning Without Great Exploration

Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling

Combining Parametric and Nonparametric Models for Off-Policy Evaluation

An Improved Decision Procedure for Linear Time Mu-Calculus

Herschel/PACS view of disks around low-mass stars and brown dwarfs in the TW Hya association

CEN34 -- High-Mass YSO in M17 or Background Post-AGB Star?

CO observation of the Galactic bubble N4

A Comparison of Approaches in Fitting Continuum SEDs

A Herschel Survey of Cold Dust in Disks Around Brown Dwarfs and Low-Mass Stars

A Non-Monetary Protocol for Peer-to-Peer Content Distribution in Wireless Broadcast Networks with Network Coding

A Herschel Search For Cold Dust in Brown Dwarf Disks: First Results