Source author record

Xintian Shi

Xintian Shi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision physics.ins-det

Catalog footprint

What is connected

2works

2topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2020arXiv

TEA: Temporal Excitation and Aggregation for Action Recognition

Temporal modeling is key for action recognition in videos. It normally considers both short-range motions and long-range aggregations. In this paper, we propose a Temporal Excitation and Aggregation (TEA) block, including a motion excitation (ME) module and a multiple temporal aggregation (MTA) module, specifically designed to capture both short- and long-range temporal evolution. In particular, for short-range motion modeling, the ME module calculates the feature-level temporal differences from spatiotemporal features. It then utilizes the differences to excite the motion-sensitive channels of the features. The long-range temporal aggregations in previous works are typically achieved by stacking a large number of local temporal convolutions. Each convolution processes a local temporal window at a time. In contrast, the MTA module proposes to deform the local convolution to a group of sub-convolutions, forming a hierarchical residual architecture. Without introducing additional parameters, the features will be processed with a series of sub-convolutions, and each frame could complete multiple temporal aggregations with neighborhoods. The final equivalent receptive field of temporal dimension is accordingly enlarged, which is capable of modeling the long-range temporal relationship over distant frames. The two components of the TEA block are complementary in temporal modeling. Finally, our approach achieves impressive results at low FLOPs on several action recognition benchmarks, such as Kinetics, Something-Something, HMDB51, and UCF101, which confirms its effectiveness and efficiency.

preprint2012arXiv

The single photon sensitivity of the Adaptive Gain Integrating Pixel Detector

Single photon sensitivity is an important property of certain detection systems. This work investigated the single photon sensitivity of the Adaptive Gain Integrating Pixel Detector (AGIPD) and its dependence on possible detector noise values. Due to special requirements at the European X-ray Free Electron Laser (XFEL) the AGIPD finds the number of photons absorbed in each pixel by integrating the total signal. Photon counting is done off line on a thresholded data set. It was shown that AGIPD will be sensitive to single photons of 8 keV energy or more (detection efficiency $\gg$ 50%, less than 1 count due to noise per 10$^6$ pixels). Should the final noise be at the lower end of the possible range (200 - 400 electrons) single photon sensitivity can also be achieved at 5 keV beam energy. It was shown that charge summing schemes are beneficial when the noise is sufficiently low. The total detection rate of events is increased and the probability to count a single event multiple times in adjacent pixels is reduced by a factor of up to 40. The entry window of AGIPD allows 3 keV photons to reach the sensitive volume with approximately 70% probability. Therefore the low energy performance of AGIPD was explored, finding a maximum noise floor below 0.035 hits/pixel/frame at 3 keV beam energy. Depending on the noise level and selected threshold this value can be reduced by a factor of approximately 10. Even though single photon sensitivity, as defined in this work, is not given, imaging at this energy is still possible, allowing Poisson noise limited performance for signals significantly above the noise floor.