Source author record

Min Deng

Min Deng appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Tissues and Organs; Computer Vision; Distributed, Parallel, and Cluster Computing; math.DG; Other Quantitative Biology; physics.med-ph; physics.soc-ph; Robotics; Social and Information Networks

Catalog footprint

What is connected

8 works

9 topics

4 close collaborators

preprint, 2026, arXiv

From Perception to Symbolic Task Planning: Vision-Language Guided Human-Robot Collaborative Structured Assembly

Human-robot collaboration (HRC) in structured assembly requires reliable state estimation and adaptive task planning under noisy perception and human interventions. To address these challenges, we introduce a design-grounded, human-aware planning framework for human-robot collaborative structured assembly. The framework comprises two coupled modules. Module I, Perception-to-Symbolic State (PSS), employs agents based on vision-language models (VLMs) to align RGB-D observations with design specifications and domain knowledge, synthesizing verifiable symbolic assembly states. It outputs validated sets of installed and uninstalled components for online state tracking. Module II, Human-Aware Planning and Replanning (HPR), performs task-level multi-robot assignment and updates the plan only when the observed state deviates from the expected execution outcome. It applies a minimal-change replanning rule to selectively revise task assignments and preserve plan stability even under human interventions. We validate the framework on a 27-component timber-frame assembly. The PSS module achieves 97% state synthesis accuracy, and the HPR module maintains feasible task progression across diverse HRC scenarios. Results indicate that integrating VLM-based perception with knowledge-driven planning improves the robustness of state estimation and task planning under dynamic conditions.
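The abstract does not spell out the minimal-change replanning rule, so the following is only an illustrative sketch of the general idea: leave the plan untouched when the observed state matches the expected one, and otherwise revise only the assignments affected by the deviation. The function name, the dictionary-based plan, the component and robot names, and the least-loaded-robot tie-break are all assumptions made for illustration, not the authors' method.

```python
from collections import Counter


def minimal_change_replan(plan, robots, expected, observed):
    """Revise a task plan only where the observed state deviates.

    plan:     dict mapping a pending component to its assigned robot (hypothetical format)
    robots:   list of robot names
    expected: set of components that should be installed by now
    observed: set of components actually seen as installed
    """
    # No deviation: keep the existing plan object unchanged.
    if observed == expected:
        return plan

    # Drop pending tasks whose component is already installed
    # (for example, placed by a human ahead of schedule).
    revised = {c: r for c, r in plan.items() if c not in observed}

    # Re-queue components that were expected but are missing,
    # assigning each to the currently least-loaded robot (assumed rule).
    load = Counter(revised.values())
    for component in sorted(expected - observed):
        if component not in revised:
            robot = min(robots, key=lambda r: (load[r], r))
            revised[component] = robot
            load[robot] += 1
    return revised


if __name__ == "__main__":
    plan = {"beam_3": "r1", "beam_4": "r2", "post_5": "r1"}
    print(minimal_change_replan(
        plan, ["r1", "r2"],
        expected={"beam_1", "beam_2"},
        observed={"beam_1", "beam_4"},
    ))
```

In this toy run, a human has installed beam_4 early and beam_2 is missing, so only those two entries change while the assignments for beam_3 and post_5 are preserved.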

preprint, 2020, arXiv

RSI-CB: A Large Scale Remote Sensing Image Classification Benchmark via Crowdsource Data

In recent years, deep convolutional neural networks (DCNNs) have achieved breakthrough progress in natural image recognition for three reasons: the universal approximation ability of DCNNs, large-scale databases (such as ImageNet), and supercomputing ability powered by GPUs. The remote sensing field still lacks a large-scale benchmark comparable to ImageNet and Place2. In this paper, we propose a remote sensing image classification benchmark (RSI-CB) based on massive, scalable, and diverse crowdsource data. Using crowdsource data, such as OpenStreetMap (OSM) data, ground objects in remote sensing images can be annotated effectively by points of interest, vector data from OSM, or other crowdsource data. The annotated images can be used in remote sensing image classification tasks. Based on this method, we construct a worldwide large-scale benchmark for remote sensing image classification. This benchmark has two sub-datasets with image sizes of 256 by 256 and 128 by 128, because different DCNNs require different image sizes. The former contains 6 categories with 35 subclasses and more than 24,000 images. The latter contains 6 categories with 45 subclasses and more than 36,000 images. The classification system of ground objects is defined according to the national standard of land-use classification in China and is inspired by the hierarchy mechanism of ImageNet. Finally, we conduct experiments to compare RSI-CB with the SAT-4, SAT-6, and UC-Merced datasets on handcrafted features, such as scale-invariant feature transform, color histogram, local binary patterns, and GIST, and on classical DCNN models, such as AlexNet, VGGNet, GoogLeNet, and ResNet.

preprint, 2016, arXiv

Improved liver T1rho measurement precision with a breathhold black blood single shot fast spin echo acquisition: a validation study in healthy volunteers

Purpose: To explore the usability and normal T1rho value of liver parenchyma with a novel liver imaging sequence based on a single breathhold black blood single shot fast spin echo acquisition. Materials and Methods: In total, 19 healthy subjects (10 males, 9 females; mean age: 37.4 yrs; range: 23-54 yrs) participated in the study. Eleven subjects had the liver scanned twice in the same session to assess scan-rescan repeatability. Twelve subjects had the liver scanned twice in two sessions at an interval of 7-10 days to assess scan-rescan reproducibility. MR was performed on a 3.0 T scanner with a dual transmitter. The MR sequence allows simultaneous acquisition of 4 spin lock times (TSLs: 0 ms, 10 ms, 30 ms, 50 ms) in 10 seconds. The inherent black blood effect of fast spin echo and double inversion recovery were used to achieve blood signal suppression. Results: The technique demonstrated good image quality and minimal artifacts. For liver parenchyma, the Bland-Altman plot showed that the scan-rescan repeatability mean difference was 0.025 ms (95% limits of agreement: -1.163 to 1.213 ms), and the intraclass correlation coefficient (ICC) was 0.977. The scan-rescan reproducibility mean difference was -0.075 ms (95% limits of agreement: -3.280 to 3.310 ms), and the ICC was 0.820, which is better than the ICC of 0.764 of a previous bright blood multi-breathhold gradient echo acquisition technique. The liver T1rho value was 39.9 +/- 2.4 ms (range: 36.1 - 44.2 ms), which is lower than the value of 42.8 +/- 2.1 ms acquired with the previous bright blood technique. Conclusion: This study validated the application of a single breathhold black blood single shot fast spin echo acquisition for human liver T1rho imaging. The lower liver parenchyma T1rho value and higher scan-rescan reproducibility may improve the sensitivity of this technique.
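For readers unfamiliar with the Bland-Altman statistics quoted above, the sketch below shows the usual computation: the mean of the paired scan-rescan differences, and 95% limits of agreement taken as that mean plus or minus 1.96 sample standard deviations. The abstract does not state which multiplier or standard deviation convention the authors used, so both are assumptions here, and the T1rho values in the example are made up for illustration only.

```python
import statistics


def bland_altman(first_scan, second_scan, z=1.96):
    """Return (mean difference, lower limit, upper limit) for paired measurements.

    z=1.96 corresponds to the conventional 95% limits of agreement (assumed).
    """
    if len(first_scan) != len(second_scan):
        raise ValueError("paired measurements must have the same length")
    diffs = [a - b for a, b in zip(first_scan, second_scan)]
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample standard deviation (n - 1)
    return mean_diff, mean_diff - z * sd_diff, mean_diff + z * sd_diff


if __name__ == "__main__":
    # Hypothetical T1rho values in ms, not data from the study.
    scan_1 = [40.0, 41.0, 39.0, 42.0]
    scan_2 = [39.0, 41.0, 40.0, 41.0]
    mean_diff, lower, upper = bland_altman(scan_1, scan_2)
    print(f"mean difference {mean_diff:.3f} ms, limits {lower:.3f} to {upper:.3f} ms")
```

Under this convention the limits are symmetric about the mean difference, which is consistent with the repeatability figures reported in the abstract (0.025 ms lies midway between -1.163 and 1.213 ms).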

preprint, 2016, arXiv

Lumbar degenerative spondylolisthesis epidemiology: a systemic review with a focus on gender-specific and age-specific prevalence

The epidemiology of lumbar degenerative spondylolisthesis (DS) remains controversial. We performed a systematic review to better understand the prevalence of DS in the general population. The results showed that the prevalence of DS is highly gender-specific and age-specific. Both women and men rarely have DS before 50 years of age; after 50, both begin to develop DS, with women developing it at a faster rate than men. For elderly Chinese (>=65 yrs, mean age: 72.5 yrs), large population-based studies (MsOS (Hong Kong) and MrOS (Hong Kong), females n=2000 and males n=2000) showed that DS prevalence was 25.0% for women and 19.1% for men, and the prevalence F:M (women:men) ratio was 1.3:1. The published data (MsOS (USA) and MrOS (USA) studies) seem to show that elderly Caucasian Americans have a higher DS prevalence, approximately 60-70% higher than elderly Chinese; however, the prevalence F:M ratio was similar to that of the elderly Chinese population. Patient data showed that female patients received treatment more often than men, and preliminary data show that the ratio of female to male patients who received treatment did not differ between Northeast Asians (Chinese, Japanese, and Korean) and European and American Caucasians, being around 2:1 in the elderly population. The existing data also suggest that menopause may be a contributing factor for the accelerated DS development in post-menopausal

preprint, 2016, arXiv

On the Neuron Response Features of Convolutional Neural Networks for Remote Sensing Image

In this paper, some patterns of the Neuron Response of deep Convolutional Neural Networks were observed.

preprint, 2016, arXiv

Prevalence of algorithm-based qualitative (ABQ) method osteoporotic vertebral fracture in elderly Chinese men and women with reference to semi-quantitative (SQ) method: Mr. Os and Ms Os. (Hong Kong) studies

Introduction: This study evaluated the algorithm-based qualitative (ABQ) method for vertebral fracture (VF) evaluation with reference to the semi-quantitative (SQ) method and bone mineral density (BMD) measurement. Methods: Mr. OS (Hong Kong) and Ms. OS (Hong Kong) represent the first large-scale cohort studies on bone health in elderly Chinese men and women. The current study compared Genant's SQ method and the ABQ method in these two cohorts. Based on quantitative measurement, the severity of fractures detected by the ABQ method was additionally classified into grade-1, grade-2, and grade-3 according to SQ's deformity criteria. The radiographs of 1,954 elderly Chinese men (mean: 72.3 years) and 1,953 elderly Chinese women (mean: 72.5 years) were evaluated. Results: According to ABQ, grade-1, -2, and -3 VFs accounted for 1.89%, 1.74%, and 2.25% in men, and 3.33%, 3.07%, and 5.53% in women. In men and women, 15.7% (35/223) and 34.5% (48/139) of vertebrae with SQ grade-1 deformity were ABQ(+, with fracture), respectively. In men and women, 89.7% (35/39) and 66.7% (48/72) of vertebrae with ABQ grade-1 fracture had SQ grade-1 deformity. For grade-1 change, SQ(-, negative without fracture) & ABQ(+, positive with vertebral cortex line fracture) subjects tended to have a lower BMD than the SQ(+) & ABQ(-) subjects. Among subjects with SQ grade-2 deformity, those who were also ABQ(+) tended to have a lower BMD than those who were ABQ(-). In all grades, SQ(-) & ABQ(-) subjects tended to have the highest BMD, while SQ(+) & ABQ(+) subjects tended to have the lowest BMD. Conclusion: The ABQ method may be more sensitive than the SQ method to VF-associated mildly lower BMD.
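The agreement percentages in the abstract above follow directly from the quoted vertebra counts. As a quick arithmetic check, this short sketch recomputes them; the counts are taken from the abstract, while the dictionary labels are only descriptive names chosen here.

```python
# Counts (numerator, denominator) as quoted in the abstract.
counts = {
    "men: SQ grade-1 vertebrae that are ABQ(+)": (35, 223),
    "women: SQ grade-1 vertebrae that are ABQ(+)": (48, 139),
    "men: ABQ grade-1 vertebrae with SQ grade-1 deformity": (35, 39),
    "women: ABQ grade-1 vertebrae with SQ grade-1 deformity": (48, 72),
}


def percent(numerator, denominator):
    """Percentage rounded to one decimal place, as reported in the abstract."""
    return round(100.0 * numerator / denominator, 1)


percentages = {label: percent(n, d) for label, (n, d) in counts.items()}

if __name__ == "__main__":
    for label, value in percentages.items():
        print(f"{label}: {value}%")
```

The same 35 vertebrae in men (and 48 in women) appear in both ratios; only the denominator changes, which is why agreement looks low from the SQ side and higher from the ABQ side.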

preprint, 2016, arXiv

Queue Theory based Response Time Analyses for Geo-Information Processing Chain

Concurrent tasks, such as those found in rapid disaster response, are a typical characteristic of remote sensing applications. The existing composition approach for geographical information processing service chains searches for an optimal solution in what can be deemed a "selfish" way. This leads to conflicts among concurrent tasks and decreases the performance of all service chains. In this study, a non-cooperative game-based mathematical model is proposed to analyse the competitive relationships between tasks. A best response function is used to ensure that each task maintains utility optimisation by considering the composition strategies of other tasks and quantifying conflicts between tasks. Based on this, an iterative algorithm that converges to a Nash equilibrium is presented, with the aim of providing good convergence and maximising the utilisation of all tasks under concurrent task conditions. Theoretical analyses and experiments showed that the newly proposed method, when compared to existing service composition methods, has better practical utility in all tasks.
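The abstract does not give the utility function or the iteration details, so the following is a generic sketch of best-response dynamics rather than the paper's algorithm. It assumes a simple congestion model in which the response time of a service grows linearly with the number of tasks using it; the function name, the cost model, and the latency values are all illustrative assumptions. In congestion games of this kind, repeated best responses reach a state where no task can improve by switching alone, which is a Nash equilibrium.

```python
def best_response_dynamics(n_tasks, base_latency, max_rounds=100):
    """Iterate best responses until no task wants to switch services.

    base_latency[s] is the assumed response time of service s for a single task;
    with k tasks sharing it, each task is assumed to experience base_latency[s] * k.
    Returns a list giving the chosen service index for each task.
    """
    n_services = len(base_latency)
    choice = [0] * n_tasks  # every task starts on service 0 (the "selfish" default)

    for _ in range(max_rounds):
        changed = False
        for task in range(n_tasks):
            # Load on each service from all other tasks.
            load = [0] * n_services
            for other, service in enumerate(choice):
                if other != task:
                    load[service] += 1
            # Cost this task would experience on each service.
            costs = [base_latency[s] * (load[s] + 1) for s in range(n_services)]
            best = min(range(n_services), key=costs.__getitem__)
            # Switch only for a strict improvement, so the loop can terminate.
            if costs[best] < costs[choice[task]]:
                choice[task] = best
                changed = True
        if not changed:
            break  # no task can improve alone: a Nash equilibrium of this toy game
    return choice


if __name__ == "__main__":
    print(best_response_dynamics(n_tasks=3, base_latency=[1.0, 2.0]))
```

In the example, three tasks crowding the faster service is worse for each of them than having one task move to the slower service, so the dynamics settle with the tasks split across both.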

preprint, 2015, arXiv

Urban spatial-temporal activity structures: a New Approach to Inferring the Intra-urban Functional Regions via Social Media Check-In Data

Most existing literature focuses on the exterior temporal rhythm of human movement to infer the functional regions in a city, but it neglects the underlying interdependence between functional regions and human activities, which uncovers more detailed characteristics of regions. In this research, we propose a novel model based on low rank approximation (LRA) to detect functional regions using about 15 million check-in records collected over a yearlong period in Shanghai, China. We find a series of latent structures, called urban spatial-temporal activity structures (USTAS). Interpreting these structures reveals a series of notable underlying associations between the spatial and temporal activity patterns. Moreover, we can not only reproduce the observed data with a lower-dimensional representation but also simultaneously project both the spatial and temporal activity patterns into the same coordinate system. By applying the K-means clustering algorithm, we obtain five significant types of clusters, each directly annotated with a corresponding combination of temporal activities. This provides a clear picture of how groups of regions are associated with different activities at different times of day. Besides the commercial and transportation dominant areas, we also detect two kinds of residential areas: developed residential areas and developing residential areas. We further verify the spatial distribution of these clusters from the perspective of urban form analysis. The results show high consistency with government planning from the same period, indicating that our model is applicable for inferring functional regions via social media check-in data and can benefit a wide range of fields, such as urban planning, public services, and location-based recommender systems.
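The abstract does not specify which low rank approximation the authors use, so the sketch below only illustrates the general idea with the simplest case: a rank-1 approximation of a regions-by-time-slots check-in matrix, computed by power iteration. The left vector gives one weight per region and the right vector one weight per time slot, so both patterns come out of the same decomposition. The function names and the tiny matrix are hypothetical, and the subsequent K-means step is omitted.

```python
import math


def rank1_approximation(matrix, iterations=200):
    """Leading singular triplet (u, sigma, v) of a matrix via power iteration.

    matrix: list of rows, e.g. regions (rows) by time slots (columns).
    Assumes a non-zero matrix with non-negative entries, as for check-in counts.
    """
    n_rows, n_cols = len(matrix), len(matrix[0])
    v = [1.0 / math.sqrt(n_cols)] * n_cols  # positive start vector
    for _ in range(iterations):
        # u = A v
        u = [sum(row[j] * v[j] for j in range(n_cols)) for row in matrix]
        # v = A^T u, then normalise to unit length
        w = [sum(matrix[i][j] * u[i] for i in range(n_rows)) for j in range(n_cols)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    u = [sum(row[j] * v[j] for j in range(n_cols)) for row in matrix]
    sigma = math.sqrt(sum(x * x for x in u))
    u = [x / sigma for x in u]
    return u, sigma, v


def reconstruct(u, sigma, v):
    """Rebuild the rank-1 matrix sigma * u * v^T."""
    return [[sigma * ui * vj for vj in v] for ui in u]


if __name__ == "__main__":
    # Hypothetical counts: 2 regions by 2 time slots, not data from the paper.
    checkins = [[1.0, 2.0], [2.0, 4.0]]
    u, sigma, v = rank1_approximation(checkins)
    print(reconstruct(u, sigma, v))
```

A higher-rank approximation would keep several such region and time-slot vector pairs; clustering regions by their coordinates across those pairs is one plausible way to arrive at region groups like those described in the abstract.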
