Qiu W.-XJEN-YU HANALBERT CHEN2022-03-222022-03-22202108873801https://www.scopus.com/inward/record.uri?eid=2-s2.0-85109812879&doi=10.1061%2f%28ASCE%29CP.1943-5487.0000976&partnerID=40&md5=8de084c1250a951ebed6e97976e50ef6https://scholars.lib.ntu.edu.tw/handle/123456789/597889This research estimated the spatial-temporal distribution of humans in buildings through image sensing. Inputs were the in-building network, image sequences recording the movement of human, and camera parameters. Object detection and tracking models were utilized to discover humans in the images. Image depth estimation, clustering, and the camera model were integrated for the association of human and the in-building space in the image coordinates with the real world coordinates. The temporal human count for each in-building space was acquired. To validate the approach, two real cases in a school building, at a corridor and a hallway, were tested, and a synthesized case was carried out to exclude error from the detection and tracking steps. The proposed approach achieved results comparable to those of manual counting. ? 2021 American Society of Civil Engineers.Automated external defibrillator (AED)ClusteringCoordinates projectionDeep learningDemand estimationDepth estimationHuman countingImage analysisIndoor surveillanceObject trackingBuildingsCamerasImage recordingObject detectionObject recognitionCamera parameterDetection and trackingImage coordinatesIn-building networkMonocular imageObject detection and trackingSpatial temporalsSpatial-temporal distribution[SDGs]SDG4[SDGs]SDG7[SDGs]SDG11[SDGs]SDG13Measuring In-Building Spatial-Temporal Human Distribution through Monocular Image Data Considering Deep Learning-Based Image Depth Estimationjournal article10.1061/(ASCE)CP.1943-5487.00009762-s2.0-85109812879