Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/32020
Appears in Collections:Computing Science and Mathematics Conference Papers and Proceedings
Author(s): Swingler, Kevin
Bath, Mandy
Title: Learning Spatial Relations with a Standard Convolutional Neural Network
Editor(s): Merelo, Juan Julian
Garibaldi, Jonathan
Wagner, Christian
Bäck, Thomas
Madani, Kurosh
Warwick, Kevin
Citation: Swingler K & Bath M (2020) Learning Spatial Relations with a Standard Convolutional Neural Network. In: Merelo JJ, Garibaldi J, Wagner C, Bäck T, Madani K & Warwick K (eds.) Proceedings of the 12th International Joint Conference on Computational Intelligence - Volume 1: NCTA. 12th International Conference on Neural Computation Theory and Applications, Budapest, Hungary, 02.11.2020-04.11.2020. Setubal, Portugal: SCITEPRESS - Science and Technology Publications, pp. 464-470. https://doi.org/10.5220/0010170204640470
Issue Date: 2020
Date Deposited: 27-Nov-2020
Conference Name: 12th International Conference on Neural Computation Theory and Applications
Conference Dates: 2020-11-02 - 2020-11-04
Conference Location: Budapest, Hungary
Abstract: This paper shows how a standard convolutional neural network (CNN) without recurrent connections is able to learn general spatial relationships between different objects in an image. A dataset was constructed by placing objects from the Fashion-MNIST dataset onto a larger canvas in various relational locations (for example, trousers left of a shirt, both above a bag). CNNs were trained to name the objects and their spatial relationship. Models were trained to perform two different types of task. The first was to name the objects and their relationships and the second was to answer relational questions such as ``Where is the shoe in relation to the bag?". The models performed at above 80\% accuracy on test data. The models were also capable of generalising to spatial combinations that had been intentionally excluded from the training data.
Status: VoR - Version of Record
Rights: This article is published under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 licence (https://creativecommons.org/licenses/by-nc-nd/4.0/)
Licence URL(s): http://creativecommons.org/licenses/by-nc-nd/4.0/

Files in This Item:
File Description SizeFormat 
NCTA_2020_23.pdfFulltext - Published Version387.35 kBAdobe PDFView/Open



This item is protected by original copyright



A file in this item is licensed under a Creative Commons License Creative Commons

Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.