ISDNet: AI-enabled Instance Segmentation of Aerial Scenes for Smart Cities
Author | Garg, P. |
Author | Chakravarthy, Anirudh Srinivasan |
Author | Mandal, Murari |
Author | Narang, Pratik |
Author | Chamola, Vinay |
Author | Guizani, Mohsen |
Available date | 2022-10-30T20:01:11Z |
Publication Date | 2021-08-01 |
Publication Name | ACM Transactions on Internet Technology |
Identifier | http://dx.doi.org/10.1145/3418205 |
Citation | Garg, P., Chakravarthy, A. S., Mandal, M., Narang, P., Chamola, V., & Guizani, M. (2021). Isdnet: Ai-enabled instance segmentation of aerial scenes for smart cities. ACM Transactions on Internet Technology (TOIT), 21(3), 1-18. |
ISSN | 15335399 |
Abstract | Aerial scenes captured by UAVs have immense potential in IoT applications related to urban surveillance, road and building segmentation, land cover classification, and so on, which are necessary for the evolution of smart cities. The advancements in deep learning have greatly enhanced visual understanding, but the domain of aerial vision remains largely unexplored. Aerial images pose many unique challenges for performing proper scene parsing such as high-resolution data, small-scaled objects, a large number of objects in the camera view, dense clustering of objects, background clutter, and so on, which greatly hinder the performance of the existing deep learning methods. In this work, we propose ISDNet (Instance Segmentation and Detection Network), a novel network to perform instance segmentation and object detection on visual data captured by UAVs. This work enables aerial image analytics for various needs in a smart city. In particular, we use dilated convolutions to generate improved spatial context, leading to better discrimination between foreground and background features. The proposed network efficiently reuses the segment-mask features by propagating them from early stages using residual connections. Furthermore, ISDNet makes use of effective anchors to accommodate varying object scales and sizes. The proposed method obtains state-of-the-art results in the aerial context. |
Sponsor | This work is supported by BITS Additional Competitive Research Grant funding under Project Grant File no. PLN/AD/2018-19/5 for the Project titled “Disaster Monitoring from Aerial Imagery using Deep Learning.” Authors’ addresses: P. Garg and V. Chamola, Dept. of EEE, BITS Pilani, India; emails: prateek.garg108@gmail.com, vinay.chamola@pilani.bits-pilani.ac.in; A. S. Chakravarthy and P. Narang, Dept. of CSIS, BITS Pilani, India; emails: anirudh.s.chakravarthy@gmail.com, pratik.narang@pilani.bits-pilani.ac.in; M. Mandal, Dept. of CSE, MNIT Jaipur, India; email: murarimandal.cv@gmail.com; M. Guizani, Dept. of CSE, Qatar University, Qatar; email: mguizani@ieee.org. Updated author affiliations: VINAY CHAMOLA, Dept. of EEE & APPCAIR, BITS Pilani, India; MURARI MANDAL, Dept. of CSE, IIIT Kota, India. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org. © 2021 Association for Computing Machinery. 1533-5399/2021/08-ART66 $15.00 https://doi.org/10.1145/3418205 |
Language | en |
Publisher | Association for Computing Machinery |
Subject | aerial scenes deep learning instance segmentation object detection Smart cities UAVs |
Type | Article |
Issue Number | 3 |
Volume Number | 21 |
Files in this item
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this item. |
This item appears in the following Collection(s)
-
Computer Science & Engineering [2402 items ]