Source author record

Zhe Zheng

Zhe Zheng appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Applications Artificial Intelligence Computation and Language cond-mat.mtrl-sci Machine Learning

Catalog footprint

What is connected

3works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Pretrained Domain-Specific Language Model for General Information Retrieval Tasks in the AEC Domain

As an essential task for the architecture, engineering, and construction (AEC) industry, information retrieval (IR) from unstructured textual data based on natural language processing (NLP) is gaining increasing attention. Although various deep learning (DL) models for IR tasks have been investigated in the AEC domain, it is still unclear how domain corpora and domain-specific pretrained DL models can improve performance in various IR tasks. To this end, this work systematically explores the impacts of domain corpora and various transfer learning techniques on the performance of DL models for IR tasks and proposes a pretrained domain-specific language model for the AEC domain. First, both in-domain and close-domain corpora are developed. Then, two types of pretrained models, including traditional wording embedding models and BERT-based models, are pretrained based on various domain corpora and transfer learning strategies. Finally, several widely used DL models for IR tasks are further trained and tested based on various configurations and pretrained models. The result shows that domain corpora have opposite effects on traditional word embedding models for text classification and named entity recognition tasks but can further improve the performance of BERT-based models in all tasks. Meanwhile, BERT-based models dramatically outperform traditional methods in all IR tasks, with maximum improvements of 5.4% and 10.1% in the F1 score, respectively. This research contributes to the body of knowledge in two ways: 1) demonstrating the advantages of domain corpora and pretrained DL models and 2) opening the first domain-specific dataset and pretrained language model for the AEC domain, to the best of our knowledge. Thus, this work sheds light on the adoption and application of pretrained models in the AEC domain.

preprint2020arXiv

How Fast You Can Actually Fly: A Comparative Investigation of Flight Airborne Time in China and the U.S

Actual airborne time (AAT) is the time between wheels-off and wheels-on of a flight. Understanding the behavior of AAT is increasingly important given the ever growing demand for air travel and flight delays becoming more rampant. As no research on AAT exists, this paper performs the first empirical analysis of AAT behavior, comparatively for the U.S. and China. The focus is on how AAT is affected by scheduled block time (SBT), origin-destination (OD) distance, and the possible pressure to reduce AAT from other parts of flight operations. Multiple econometric models are developed. The estimation results show that in both countries AAT is highly correlated with SBT and OD distance. Flights in the U.S. are faster than in China. On the other hand, facing ground delay prior to takeoff, a flight has limited capability to speed up. The pressure from short turnaround time after landing to reduce AAT is immaterial. Sensitivity analysis of AAT to flight length and aircraft utilization is further conducted. Given the more abundant airspace, flexible routing networks, and efficient ATFM procedures, a counterfactual that the AAT behavior in the U.S. were adopted in China is examined. We find that by doing so significant efficiency gains could be achieved in the Chinese air traffic system. On average, 11.8 minutes of AAT per flight would be saved, coming from both reduction in SBT and reduction in AAT relative to the new SBT. Systemwide fuel saving would amount to over 300 million gallons with direct airline operating cost saving of nearly $1.3 billion nationwide in 2016.

preprint2009arXiv

G band Raman double resonance in twisted bilayer graphene: an evidence of band splitting and folding

The stacking faults (deviates from Bernal) will break the translational symmetry of multilayer graphenes and modify their electronic and optical behaviors to the extent depending on the interlayer coupling strength. This paper addresses the stacking-induced band splitting and folding effect on the electronic band structure of twisted bilayer graphene. Based on the first-principles density functional theory study, we predict that the band folding effect of graphene layers may enable the G band Raman double resonance in the visible excitation range. Such prediction is confirmed experimentally with our Raman observation that the resonant energies of the resonant G mode are strongly dependent on the stacking geometry of graphene layers.