Source author record

Xinyi He

Xinyi He appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Computation and Language, Databases, Human-Computer Interaction, Information Retrieval, Machine Learning, math.NT

Catalog footprint

What is connected

3 works

6 topics

4 close collaborators

Preprint · 2022 · arXiv

ASTA: Learning Analytical Semantics over Tables for Intelligent Data Analysis and Visualization

Intelligent analysis and visualization of tables use techniques to automatically recommend useful knowledge from data, thus freeing users from tedious multi-dimensional data mining. While many studies have succeeded in automating recommendations through rules or machine learning, it is difficult to generalize expert knowledge and provide explainable recommendations. In this paper, we present the recommendation of conditional formatting for the first time, together with chart recommendation, to exemplify intelligent table analysis. We propose analytical semantics over tables to uncover common analysis patterns behind user-created analyses. Here, we design analytical semantics by separating data focus from user intent, which extract the user motivation from the data and human perspectives, respectively. Furthermore, we design the ASTA framework to apply analytical semantics to multiple automated recommendations. The ASTA framework extracts data features by designing signatures based on expert knowledge, and enables data referencing at the field level (charts) or cell level (conditional formatting) with pre-trained models. Experiments show that our framework achieves recall at top 1 of 62.86% on public chart corpora, outperforming the best baseline by about 14%, and achieves 72.31% on the collected corpus ConFormT, validating that the ASTA framework is effective in providing accurate and explainable recommendations.

Preprint · 2022 · arXiv

On an error term for the first moment of twisted $L$-functions

Let $f$ be a Hecke-Maass cusp form for the full modular group and let $χ$ be a primitive Dirichlet character modulo a prime $q$. Let $s_0=σ_0+it_0$ with $\frac{1}{2}\leqσ_0<1$. We improve the error term for the first moment of $L(s_0,f\otimesχ)\overline{L(s_0,χ)}$ over the family of even primitive Dirichlet characters. As an application, we show that for any $t\in\mathbb{R}$, there exists a primitive Dirichlet character $χ$ modulo $q$ for which $L(1/2+it,f\otimesχ)L(1/2+it,χ)\neq0$ if the prime $q$ satisfies $q\gg (1+|t|)^{\frac{543}{25}+\varepsilon}$.

Preprint · 2022 · arXiv

Table Pre-training: A Survey on Model Architectures, Pre-training Objectives, and Downstream Tasks

Since a vast number of tables can be easily collected from web pages, spreadsheets, PDFs, and various other document types, a flurry of table pre-training frameworks have been proposed following the success of pre-training on text and images, and they have achieved new state-of-the-art results on various tasks such as table question answering, table type recognition, column relation classification, table search, and formula prediction. To fully use the supervision signals in unlabeled tables, a variety of pre-training objectives have been designed and evaluated, for example, denoising cell values, predicting numerical relationships, and implicitly executing SQL queries. To best leverage the characteristics of (semi-)structured tables, various tabular language models, particularly those with specially designed attention mechanisms, have been explored. Since tables usually appear alongside and interact with free-form text, table pre-training usually takes the form of table-text joint pre-training, which attracts significant research interest from multiple domains. This survey aims to provide a comprehensive review of different model designs, pre-training objectives, and downstream tasks for table pre-training, and we further share our thoughts and vision on existing challenges and future opportunities.
