Researcher profile

Guo Chen

Guo Chen contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
8 works
0 followers
9 topics
4 close collaborators

Published work

8 published items

preprint 2024 arXiv

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

This paper introduces InternVid, a large-scale video-centric multimodal dataset that enables learning powerful and transferable video-text representations for multimodal understanding and generation. The InternVid dataset contains over 7 million videos lasting nearly 760K hours, yielding 234M video clips accompanied by detailed descriptions totaling 4.1B words. Our core contribution is to develop a scalable approach to autonomously build a high-quality video-text dataset with large language models (LLMs), thereby showcasing its efficacy in learning video-language representations at scale. Specifically, we utilize a multi-scale approach to generate video-related descriptions. Furthermore, we introduce ViCLIP, a video-text representation learning model based on ViT-L. Trained on InternVid via contrastive learning, this model demonstrates leading zero-shot action recognition and competitive video retrieval performance. Beyond basic video understanding tasks like recognition and retrieval, our dataset and model have broad applications. They are particularly beneficial for generating interleaved video-text data for learning a video-centric dialogue system, and for advancing video-to-text and text-to-video generation research. These proposed resources provide a tool for researchers and practitioners interested in multimodal video understanding and generation.
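The abstract says only that ViCLIP is learned "via contrastive learning". As a rough illustration of what such an objective can look like, here is a minimal sketch that assumes a CLIP-style symmetric InfoNCE loss over a batch of paired video and text embeddings. The function names, the temperature value and the toy embeddings are all assumptions made for this example, not details taken from the paper.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def cross_entropy_diagonal(logits):
    """Mean cross-entropy where the correct class for row i is column i."""
    total = 0.0
    for i, row in enumerate(logits):
        row_max = max(row)  # subtract the max for numerical stability
        log_sum_exp = row_max + math.log(sum(math.exp(x - row_max) for x in row))
        total += log_sum_exp - row[i]
    return total / len(logits)


def symmetric_contrastive_loss(video_emb, text_emb, temperature=0.07):
    """Assumed CLIP-style loss: average of video-to-text and text-to-video terms.

    video_emb[i] and text_emb[i] are treated as a matching pair; every other
    combination in the batch acts as a negative.
    """
    videos = [l2_normalize(v) for v in video_emb]
    texts = [l2_normalize(t) for t in text_emb]
    # logits[i][j] = cosine similarity of video i and text j, scaled by temperature
    logits = [
        [sum(a * b for a, b in zip(v, t)) / temperature for t in texts]
        for v in videos
    ]
    logits_transposed = [list(col) for col in zip(*logits)]
    return 0.5 * (
        cross_entropy_diagonal(logits) + cross_entropy_diagonal(logits_transposed)
    )


if __name__ == "__main__":
    aligned = symmetric_contrastive_loss([[1.0, 0.0], [0.0, 1.0]],
                                         [[1.0, 0.0], [0.0, 1.0]])
    shuffled = symmetric_contrastive_loss([[1.0, 0.0], [0.0, 1.0]],
                                          [[0.0, 1.0], [1.0, 0.0]])
    print(f"aligned pairs: {aligned:.6f}, shuffled pairs: {shuffled:.6f}")
```

The loss is small when each video embedding is closest to its own caption and large when the pairs are shuffled, which is the property a contrastive objective of this kind rewards.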

preprint 2023 arXiv

A Two-limb Explanation for the Optical-to-infrared Transmission Spectrum of the Hot Jupiter HAT-P-32Ab

We present a new optical transmission spectrum of the hot Jupiter HAT-P-32Ab acquired with the Carnegie Observatories Spectrograph and Multiobject Imaging Camera (COSMIC) on the Palomar 200 inch Hale Telescope (P200). The P200/COSMIC transmission spectrum, covering a wavelength range of 3990–9390 Å, is composed of 25 spectrophotometric bins with widths ranging from 200 to 400 Å and is consistent with previous transit measurements obtained in the common wavelength range. We derive a combined optical transmission spectrum based on measurements from five independent instruments, which, along with the 1.1–1.7 μm spectrum acquired by the Hubble Space Telescope and two Spitzer measurements, exhibits an enhanced scattering slope blueward of a relatively flat optical continuum, a water absorption feature at 1.4 μm, and a carbon dioxide feature at 4.4 μm. We perform Bayesian spectral retrieval analyses on the 0.3–5.1 μm transmission spectrum and find that it can be well explained by a two-limb approximation of $134^{+45}_{-33}\times$ solar metallicity, with a strongly hazy morning limb of $1134^{+232}_{-194}$ K and a haze-free evening limb of $1516^{+33}_{-44}$ K. This makes HAT-P-32Ab a promising target for the James Webb Space Telescope to look for asymmetric signatures directly in the light curves.

preprint 2023 arXiv

Decoupling SQL Query Hardness Parsing for Text-to-SQL

The fundamental goal of the Text-to-SQL task is to translate a natural language question into an SQL query. Current research primarily emphasizes the information coupling between natural language questions and schemas, and significant progress has been made in this area. As the primary source of task requirements, the natural language question determines the hardness of the corresponding SQL query, yet the correlation between the two is usually ignored. Decoupling the correlation between questions and queries, however, may simplify the task. In this paper, we introduce an innovative framework for Text-to-SQL based on decoupling SQL query hardness parsing. This framework decouples the Text-to-SQL task based on query hardness by analyzing questions and schemas, simplifying the multi-hardness task into a single-hardness challenge. This greatly reduces the parsing pressure on the language model. We evaluate our proposed framework and achieve new state-of-the-art performance among fine-tuning methods on the Spider dev set.

preprint 2023 arXiv

FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation

We propose an accurate and efficient scene text detection framework, termed FAST (i.e., faster arbitrarily-shaped text detector). Unlike recent advanced text detectors, which use complicated post-processing and hand-crafted network architectures and therefore suffer from low inference speed, FAST has two new designs. (1) We design a minimalist kernel representation (with only a 1-channel output) to model text of arbitrary shape, as well as GPU-parallel post-processing to efficiently assemble text lines with negligible time overhead. (2) We search for a network architecture tailored to text detection, leading to more powerful features than most networks that are searched for image classification. Benefiting from these two designs, FAST achieves an excellent trade-off between accuracy and efficiency on several challenging datasets, including Total-Text, CTW1500, ICDAR 2015, and MSRA-TD500. For example, FAST-T yields 81.6% F-measure at 152 FPS on Total-Text, outperforming the previous fastest method by 1.7 points and 70 FPS in terms of accuracy and speed. With TensorRT optimization, the inference speed can be further accelerated to over 600 FPS. Code and models will be released at https://github.com/czczup/FAST.

preprint 2022 arXiv

Benchmark tests of transmission spectroscopy using transiting white dwarfs

Ground-based transit observations are affected by both telluric absorption and instrumental systematics. To account for these effects, a better understanding of the impact of different data analyses is needed to improve the accuracy of the retrieved transmission spectra. We propose validating ground-based low-resolution transmission spectroscopy using transiting white dwarfs. The advantage here is that we know beforehand what the final transmission spectrum should be: a featureless flat spectrum. We analyzed two transiting white dwarfs analogous to hot Jupiters. We used various noise models to account for the systematic noise in their spectroscopic light curves following common procedures of transmission spectroscopy analyses. We compared the derived transmission spectra with the broadband transit depth to determine whether there are any artificial offsets or spectral features arising from light-curve fitting. The results show a strong model dependence, and the transmission spectra exhibit considerable discrepancies when they are computed with different noise models, different reference stars, and different common-mode removal methods. Nonetheless, we can still derive relatively accurate transmission spectra based on a Bayesian model comparison. With current ground-based instrumentation, the systematics in transit light curves can easily contaminate a transmission spectrum, introducing a general offset or some spurious spectral features and thus leading to a biased interpretation of the planetary atmosphere. Therefore, it is necessary to determine the dependence of results on the adopted noise model through model comparison. The model inferences should be examined through multiple observations and different instruments. (Abridged)

preprint 2022 arXiv

Fitting Semiparametric Cumulative Probability Models for Big Data

Cumulative probability models (CPMs) are a robust alternative to linear models for continuous outcomes. However, they are not feasible for very large datasets due to elevated running time and memory usage, which depend on the sample size, the number of predictors, and the number of distinct outcomes. We describe three approaches to address this problem. In the divide-and-combine approach, we divide the data into subsets, fit a CPM to each subset, and then aggregate the information. In the binning and rounding approaches, the outcome variable is redefined to have a greatly reduced number of distinct values. We consider rounding to a decimal place and rounding to significant digits, both with a refinement step to help achieve the desired number of distinct outcomes. We show with simulations that these approaches perform well and their parameter estimates are consistent. We investigate how running time and peak memory usage are influenced by the sample size, the number of distinct outcomes, and the number of predictors. As an illustration, we apply the approaches to a large publicly available dataset investigating matrix multiplication runtime with nearly one million observations.
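The rounding idea in this abstract can be illustrated with a short sketch. It rounds outcomes to a number of significant digits and picks the largest number of digits that keeps the count of distinct outcomes at or below a target. The helper names and the simple downward search are assumptions made for this example; the paper's own refinement step is not described in the abstract and is not reproduced here.

```python
import math


def round_sig(value, digits):
    """Round a number to the given count of significant digits."""
    if value == 0:
        return 0.0
    exponent = math.floor(math.log10(abs(value)))
    return round(value, digits - 1 - exponent)


def reduce_distinct_outcomes(values, target, max_digits=10):
    """Return (digits, rounded_values) with at most `target` distinct outcomes.

    Hypothetical search: start from `max_digits` significant digits and drop
    one digit at a time until the number of distinct rounded values fits.
    Falls back to one significant digit if the target is never reached.
    """
    for digits in range(max_digits, 0, -1):
        rounded = [round_sig(v, digits) for v in values]
        if len(set(rounded)) <= target:
            return digits, rounded
    return 1, [round_sig(v, 1) for v in values]


if __name__ == "__main__":
    outcomes = [1.234, 1.236, 1.31, 2.52, 2.58]
    digits, rounded = reduce_distinct_outcomes(outcomes, target=4)
    print(digits, sorted(set(rounded)))
```

Fewer distinct outcomes means fewer intercepts for the model to estimate, which is why the abstract ties running time and memory usage to the number of distinct outcome values.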

preprint 2021 arXiv

A multi-wavelength look at the GJ 9827 system -- No evidence of extended atmospheres in GJ 9827 b and d from HST and CARMENES data

GJ9827 is a bright star hosting a planetary system with three transiting planets. As a multi-planet system with planets that sprawl within the boundaries of the radius gap between terrestrial and gaseous planets, GJ9827 is an optimal target to study the evolution of the atmospheres of close-in planets with a common evolutionary history and their dependence on stellar irradiation. Here, we report on the Hubble Space Telescope (HST) and CARMENES transit observations of GJ9827 planets b and d. We performed a stellar and interstellar medium characterization from the ultraviolet HST spectra, obtaining fluxes for Ly-alpha and MgII of $F(\mathrm{Ly}\alpha) = 5.42^{+0.96}_{-0.75} \times 10^{-13}$ erg cm$^{-2}$ s$^{-1}$ and $F(\mathrm{MgII}) = (5.64 \pm 0.24) \times 10^{-14}$ erg cm$^{-2}$ s$^{-1}$. We also investigated a possible absorption signature in Ly-alpha in the atmosphere of GJ9827b during a transit event from HST spectra, as well as H-alpha and HeI signatures for the atmospheres of GJ9827b and d from CARMENES spectra. We found no evidence of an extended atmosphere in either of the planets. This result is also supported by our analytical estimations of mass loss based on the measured radiation fields for all three planets of this system, which led to mass-loss rates of 0.4, 0.3, and 0.1 planetary masses per Gyr for GJ9827b, c, and d, respectively. These values indicate that the planets could have lost their volatiles quickly in their evolution and probably do not retain an atmosphere at the current stage.
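To see why the quoted mass-loss rates point to rapid loss of volatiles, a back-of-the-envelope sketch helps. It assumes the rate stays constant over time and that the volatile envelope is a small fraction of the planet's mass. The 1% envelope fraction and the function name are illustrative assumptions, not values from the paper; only the three rates come from the abstract.

```python
# Mass-loss rates quoted in the abstract, in planetary masses per Gyr.
MASS_LOSS_RATE_PER_GYR = {"GJ9827b": 0.4, "GJ9827c": 0.3, "GJ9827d": 0.1}


def gyr_to_lose(mass_fraction, rate_per_gyr):
    """Time in Gyr to lose `mass_fraction` of the planet's mass.

    Assumes a constant mass-loss rate, which is a simplification made for
    this example only.
    """
    if rate_per_gyr <= 0:
        raise ValueError("rate_per_gyr must be positive")
    return mass_fraction / rate_per_gyr


if __name__ == "__main__":
    envelope_fraction = 0.01  # illustrative assumption: envelope is 1% of the mass
    for planet, rate in MASS_LOSS_RATE_PER_GYR.items():
        myr = 1000.0 * gyr_to_lose(envelope_fraction, rate)
        print(f"{planet}: about {myr:.0f} Myr to lose a 1% envelope")
```

Under these assumptions a 1% envelope would be gone within roughly 25 to 100 Myr, which is short compared with typical stellar ages of several Gyr.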

preprint 2018 arXiv

Modeling Three-dimensional Invasive Solid Tumor Growth in Heterogeneous Microenvironment under Chemotherapy

A systematic understanding of the evolution and growth dynamics of invasive solid tumors in response to different chemotherapy strategies is crucial for the development of individually optimized oncotherapy. Here, we develop a hybrid three-dimensional (3D) computational model that integrates a pharmacokinetic model, a continuum diffusion-reaction model and a discrete cell automaton model to investigate 3D invasive solid tumor growth in a heterogeneous microenvironment under chemotherapy. Specifically, we consider the effects of the heterogeneous environment on drug diffusion, tumor growth, invasion and the drug-tumor interaction at the individual-cell level. We employ the hybrid model to investigate the evolution and growth dynamics of avascular invasive solid tumors under different chemotherapy strategies. Our simulations reproduce the well-established observation that constant dosing is generally more effective in suppressing primary tumor growth than periodic dosing, due to the resulting continuous high drug concentration. In a highly heterogeneous microenvironment, the malignancy of the tumor is significantly enhanced, leading to inefficiency of chemotherapies. The effects of a geometrically confined microenvironment and non-uniform drug dosing are also investigated. Our computational model, when supplemented with sufficient clinical data, could eventually lead to the development of efficient in silico tools for prognosis and treatment strategy optimization.