Source author record

Jianping Wu

Jianping Wu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Artificial Intelligence; Computer Vision; Machine Learning; Computational Engineering, Finance, and Science; cs.CY; math.OC; Numerical Analysis; Other Computer Science; physics.ao-ph; physics.soc-ph; Robotics

Catalog footprint

What is connected

10 works

11 topics

4 close collaborators

preprint, 2022, arXiv

All in One: Exploring Unified Video-Language Pre-training

Mainstream Video-Language Pre-training models (e.g., ActBERT, ClipBERT, VIOLET) consist of three parts: a video encoder, a text encoder, and a video-text fusion Transformer. They pursue better performance by using heavier unimodal encoders or multimodal fusion Transformers, which increases parameter counts and lowers efficiency in downstream tasks. In this work, we introduce, for the first time, an end-to-end video-language model, namely the all-in-one Transformer, that embeds raw video and textual signals into joint representations using a unified backbone architecture. We argue that the unique temporal information of video data is a key barrier hindering the design of a modality-agnostic Transformer. To overcome this challenge, we introduce a novel and effective token rolling operation to encode temporal representations from video clips in a non-parametric manner. The careful design enables the representation learning of both video-text multimodal inputs and unimodal inputs using a unified backbone model. Our pre-trained all-in-one Transformer is transferred to various downstream video-text tasks after fine-tuning, including text-video retrieval, video question answering, multiple choice and visual commonsense reasoning. State-of-the-art performance with minimal model FLOPs on nine datasets demonstrates the superiority of our method over competitive counterparts. The code and pretrained model have been released at https://github.com/showlab/all-in-one.
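As a rough illustration of what a non-parametric token rolling step could look like, the sketch below cyclically shifts a fixed share of each frame's tokens to the next frame, so that every frame sees a few tokens from its temporal neighbour without any learned weights. The function name `roll_tokens`, the `roll_ratio` and `shift` parameters, and the choice to roll the leading tokens are assumptions made for this example; the released code may differ.

```python
from typing import List

Token = List[float]   # one token embedding
Frame = List[Token]   # all tokens of one video frame
Clip = List[Frame]    # all frames of one clip


def roll_tokens(clip: Clip, roll_ratio: float = 0.25, shift: int = 1) -> Clip:
    """Cyclically shift the leading share of each frame's tokens along time.

    Hypothetical sketch: frame t receives the first `roll_ratio` share of
    tokens from frame (t - shift), wrapping around, and keeps the rest of
    its own tokens. No parameters are learned.
    """
    num_frames = len(clip)
    if num_frames == 0:
        return []
    tokens_per_frame = len(clip[0])
    num_rolled = int(tokens_per_frame * roll_ratio)

    rolled: Clip = []
    for t in range(num_frames):
        source = clip[(t - shift) % num_frames]  # frame whose tokens move into frame t
        rolled.append(source[:num_rolled] + clip[t][num_rolled:])
    return rolled
```

With three frames of four tokens each and `roll_ratio=0.25`, one token per frame is replaced by the corresponding token of the previous frame, and the first frame wraps around to the last.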

preprint, 2022, arXiv

Cyclic Graph Attentive Match Encoder (CGAME): A Novel Neural Network For OD Estimation

Origin-Destination (OD) estimation plays an important role in the era of Intelligent Transportation. Nevertheless, as an under-determined problem, OD estimation confronts many challenges, from cross-space inference to non-convex, non-linear optimization. As a powerful nonlinear approximator, deep learning is an ideal data-driven method to provide a novel perspective for OD estimation. However, when multi-interval traffic counts are viewed as spatial-temporal inputs and the OD matrix as a heterogeneous graph-structured output, existing neural network architectures are not suitable for the cross-space inference problem, and thus a new deep learning architecture is needed. We propose CGAME, short for cyclic graph attentive matching encoder, which includes bi-directional encoder-decoder networks and a novel graph matcher in the hidden layer with a double-layer attention mechanism. It realizes effective information exchange between the forward and backward networks and establishes coupling relations across the underlying feature spaces. The proposed model achieves state-of-the-art performance compared with baselines in the designed experiments and offers a paradigm for inference tasks across representation spaces.

preprint, 2022, arXiv

MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval

Dominant pre-training works for video-text retrieval mainly adopt "dual-encoder" architectures to enable efficient retrieval, where two separate encoders are used to contrast global video and text representations, but they ignore detailed local semantics. The recent success of image BERT pre-training with masked visual modeling, which promotes the learning of local visual context, motivates a possible solution to this limitation. In this work, we investigate, for the first time, masked visual modeling in video-text pre-training with the "dual-encoder" architecture. We perform Masked visual modeling with Injected LanguagE Semantics (MILES) by employing an extra snapshot video encoder as an evolving "tokenizer" to produce reconstruction targets for masked video patch prediction. Given the corrupted video, the video encoder is trained to recover text-aligned features of the masked patches by reasoning with the visible regions along the spatial and temporal dimensions, which enhances the discriminativeness of local visual features and the fine-grained cross-modality alignment. Our method outperforms state-of-the-art methods for text-to-video retrieval on four datasets with both zero-shot and fine-tune evaluation protocols. Our approach also surpasses the baseline models significantly on zero-shot action recognition, which can be cast as video-to-text retrieval.

preprint, 2022, arXiv

Modeling Adaptive Platoon and Reservation Based Autonomous Intersection Control: A Deep Reinforcement Learning Approach

As a strategy to reduce travel delay and enhance energy efficiency, platooning of connected and autonomous vehicles (CAVs) at non-signalized intersections has become increasingly popular in academia. However, few studies have attempted to model the relation between the optimal platoon size and the traffic conditions around the intersection. To this end, this study proposes an adaptive platoon-based autonomous intersection control model powered by a deep reinforcement learning (DRL) technique. The model framework has the following two levels: the first level adopts a First Come First Serve (FCFS) reservation-based policy integrated with a nonconflicting lane selection mechanism to determine vehicles' passing priority; the second level applies a deep Q-network algorithm to identify the optimal platoon size based on the real-time traffic condition of an intersection. When tested on a traffic micro-simulator, our proposed model exhibits superior performance in travel efficiency and fuel conservation compared with state-of-the-art methods.
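The first level described above can be pictured with a minimal sketch of an FCFS reservation policy: requests are served in arrival order, and a vehicle may enter the intersection as soon as every earlier reservation on the same or a conflicting lane has cleared. The function name `fcfs_schedule`, the fixed `crossing_time`, and the lane-conflict set are assumptions for illustration only, and the deep Q-network level that picks the platoon size is not modeled here.

```python
from typing import Dict, FrozenSet, List, Set, Tuple

Request = Tuple[str, float, str]  # (vehicle_id, arrival_time, lane)


def fcfs_schedule(
    requests: List[Request],
    conflicts: Set[FrozenSet[str]],
    crossing_time: float = 2.0,
) -> Dict[str, float]:
    """Return the granted entry time for each vehicle under FCFS.

    Hypothetical sketch: `conflicts` holds unordered lane pairs whose paths
    cross; vehicles on non-conflicting lanes may occupy the intersection
    at the same time.
    """
    granted: List[Tuple[str, str, float, float]] = []  # (id, lane, start, end)
    for vehicle_id, arrival, lane in sorted(requests, key=lambda r: r[1]):
        start = arrival
        for _, other_lane, _, other_end in granted:
            blocked = other_lane == lane or frozenset((lane, other_lane)) in conflicts
            if blocked:
                # Wait until the earlier conflicting reservation has cleared.
                start = max(start, other_end)
        granted.append((vehicle_id, lane, start, start + crossing_time))
    return {vehicle_id: start for vehicle_id, _, start, _ in granted}
```

In this sketch, two vehicles on lanes that do not conflict enter at their own arrival times, while a later vehicle on a lane that crosses both waits for the later of the two to clear.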

preprint, 2016, arXiv

Major Maintenance Schedule Optimization for Electric Multiple Unit Considering Passenger Transport Demand

Reducing the major maintenance costs of electric multiple units (EMUs) is an important objective for a railway agency or company. The EMU major maintenance schedule decides when each train-set undergoes major maintenance or undertakes transportation tasks, based on practical requirements such as passenger transport demand, workshop inspection capacity, and maintenance requirements. Experienced railway practitioners can generally produce a feasible major maintenance schedule; however, this manual process is time-consuming, and an optimal solution is not guaranteed. This research constructs a time-space network that can display the train-set status transformation process between available and major maintenance status. On this basis, a 0-1 integer programming model is developed to reduce the major maintenance costs with consideration of all necessary regulations and practical constraints. A genetic algorithm with a simulated annealing survival mechanism is also developed to improve solution quality and efficiency compared with the manual process. Excluding infeasible solutions when constructing the model substantially reduces the complexity of the algorithm.
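One common way to combine a genetic algorithm with simulated annealing is to decide survival with a Metropolis-style rule: a child always replaces its parent when it is cheaper, and otherwise replaces it with a probability that shrinks as the temperature falls. The sketch below shows that rule under the assumption that this is what "simulated annealing survival mechanism" refers to; the names `accept_child` and `select_survivors` and the pairing of each child with one parent are hypothetical.

```python
import math
import random
from typing import Callable, List, TypeVar

Schedule = TypeVar("Schedule")


def accept_child(parent_cost: float, child_cost: float,
                 temperature: float, rng: random.Random) -> bool:
    """Metropolis-style survival test for a minimization problem."""
    if child_cost <= parent_cost:
        return True   # never reject an improvement
    if temperature <= 0:
        return False  # at zero temperature the rule is purely greedy
    # A worse child survives with probability exp(-delta / T).
    return rng.random() < math.exp(-(child_cost - parent_cost) / temperature)


def select_survivors(parents: List[Schedule], children: List[Schedule],
                     cost: Callable[[Schedule], float],
                     temperature: float, rng: random.Random) -> List[Schedule]:
    """Pair each child with one parent and keep whichever passes the test."""
    return [
        child if accept_child(cost(parent), cost(child), temperature, rng) else parent
        for parent, child in zip(parents, children)
    ]
```

Lowering `temperature` across generations makes the selection increasingly greedy, which is the usual annealing schedule; the cooling rate itself is left open here.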

preprint, 2015, arXiv

Impacts of rainfall weather on urban traffic in Beijing: analysis and modeling

Recently, an increasing number of studies have focused on the influence of rainfall intensity on traffic flow, and have concluded that inclement weather has negative impacts on key traffic parameters. However, due to a lack of data, limited work has been carried out in China. In this paper, the impacts of rainfall intensity on urban road traffic flow characteristics are quantified, based on historical traffic data and weather data from Beijing, the capital of China. The reductions in road capacity and operating speed are obtained by statistical estimation for different rainfall intensity categories against clear weather. Then the modified speed-density function and speed-flow function are calibrated at different rainfall levels, from which the reductions in free-flow speed can be calculated. Finally, a generalized continuous speed-flow-rainfall model is developed and calibrated. The validation results show good accuracy, indicating that the new model can be used for urban traffic management under various rainfall intensities.

preprint, 2014, arXiv

An improved car-following model considering variable safety headway distance

Considering high-speed following on expressways or highways, an improved car-following model is developed in this paper by introducing a variable safety headway distance. Stability analysis of the new model is carried out using the control theory method. Finally, numerical simulations are implemented, and the results show good consistency with the theoretical study.

preprint, 2014, arXiv

Car-following model on two lanes and stability analysis

Considering the lateral influence from the adjacent lane, an improved car-following model is developed in this paper. Then linear and non-linear stability analyses are carried out. The modified Korteweg-de Vries (MKdV) equation is derived with the kink-antikink soliton solution. Numerical simulations are implemented, and the results show good consistency with the theoretical study.
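For orientation, the MKdV equation is often written in the rescaled form below, for which a kink solution can be stated in closed form (its negative is the antikink). The symbols $R$, $X$, $T$ and the propagation speed $c$ are notation assumed here; the paper's own scaling and coefficients may differ.

```latex
\partial_T R = \partial_X^3 R - \partial_X R^3,
\qquad
R_0(X, T) = \sqrt{c}\,\tanh\!\left[\sqrt{\tfrac{c}{2}}\,\bigl(X - cT\bigr)\right]
```

Substituting $R_0 = A\tanh(k\xi)$ with $\xi = X - cT$ into the equation gives the conditions $A^2 = 2k^2$ and $c = 2k^2$, which yield the amplitude $\sqrt{c}$ and width parameter $\sqrt{c/2}$ shown above.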

preprint, 2014, arXiv

Study on FLOWSIM and its Application for Isolated Signalized Intersection Assessment

Recently, traffic-related problems have become strategically important due to the continuously increasing number of vehicles. As a result, microscopic simulation software has become an efficient tool in traffic engineering for its cost-effectiveness and safety characteristics. In this paper, a new fuzzy-logic-based simulation software (FLOWSIM) is introduced, which can better reflect the mixed traffic flow phenomenon in China. The fuzzy-logic-based car-following model and lane-changing model are explained in detail. Furthermore, its applications for mixed traffic flow management in mid-size cities and for signalized intersection management assessment in large cities are illustrated by examples in China. Finally, further study objectives are discussed.

preprint, 2013, arXiv

A Multi-stage Collaborative 3D GIS to Support Public Participation

This paper presents a collaborative 3D GIS to support public participation. Recognizing that public-involved decision making is often a multi-stage process, the proposed system is designed to provide coherent support for collaboration in the different stages. We differentiate ubiquitous participation from intensive participation and identify their suitable application stages. The proposed system then supports both types of participation by providing synchronous and asynchronous collaboration functionalities. Applying the concept of Digital Earth, the proposed system also features a virtual globe-based user interface. Such an interface integrates a variety of data, functions and services into a unified virtual environment, which is delivered to both experts and public participants through the Internet. The system has been designed as a general software framework and can be tailored for specific projects. In this study, we demonstrate it using a scene modeling case and provide a preliminary evaluation of its usability.
