Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/24710
Full metadata record
DC FieldValueLanguage
dc.contributor.authorAbel, Andrewen_UK
dc.contributor.authorMarxer, Ricarden_UK
dc.contributor.authorHussain, Amiren_UK
dc.contributor.authorBarker, Jonen_UK
dc.contributor.authorWatt, Rogeren_UK
dc.contributor.authorWhitmer, Billen_UK
dc.contributor.authorDerleth, Peteren_UK
dc.contributor.editorLiu, CLen_UK
dc.contributor.editorHussain, Aen_UK
dc.contributor.editorLuo, Ben_UK
dc.contributor.editorTan, KCen_UK
dc.contributor.editorZeng, Yen_UK
dc.contributor.editorZhang, Zen_UK
dc.date.accessioned2017-08-26T07:37:52Z-
dc.date.available2017-08-26T07:37:52Z-
dc.date.issued2016-12en_UK
dc.identifier.urihttp://hdl.handle.net/1893/24710-
dc.description.abstractThe concept of using visual information as part of audio speech processing has been of significant recent interest. This paper presents a data driven approach that considers estimating audio speech acoustics using only temporal visual information without considering linguistic features such as phonemes and visemes. Audio (log filterbank) and visual (2D-DCT) features are extracted, and various configurations of MLP and datasets are used to identify optimal results, showing that given a sequence of prior visual frames an equivalent reasonably accurate audio frame estimation can be mapped.en_UK
dc.language.isoenen_UK
dc.publisherSpringeren_UK
dc.relationAbel A, Marxer R, Hussain A, Barker J, Watt R, Whitmer B & Derleth P (2016) A Data Driven Approach to Audiovisual Speech Mapping. In: Liu C, Hussain A, Luo B, Tan K, Zeng Y & Zhang Z (eds.) Advances in Brain Inspired Cognitive Systems. Lecture Notes in Computer Science, 10023. BICS 2016: International Conference on Brain Inspired Cognitive Systems, Beijing, China, 28.11.2016-30.11.2016. Cham, Switzerland: Springer, pp. 331-342. https://doi.org/10.1007/978-3-319-49685-6_30en_UK
dc.relation.ispartofseriesLecture Notes in Computer Science, 10023en_UK
dc.rightsPublisher policy allows this work to be made available in this repository. Published in Liu CL., Hussain A., Luo B., Tan K., Zeng Y., Zhang Z. (eds) Advances in Brain Inspired Cognitive Systems. BICS 2016. Lecture Notes in Computer Science, vol 10023, published by Springer. The original publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-49685-6_30en_UK
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_UK
dc.subjectAudiovisualen_UK
dc.subjectSpeech processingen_UK
dc.subjectSpeech mappingen_UK
dc.subjectANNsen_UK
dc.titleA Data Driven Approach to Audiovisual Speech Mappingen_UK
dc.typeConference Paperen_UK
dc.identifier.doi10.1007/978-3-319-49685-6_30en_UK
dc.citation.issn0302-9743en_UK
dc.citation.spage331en_UK
dc.citation.epage342en_UK
dc.citation.publicationstatusPublisheden_UK
dc.type.statusAM - Accepted Manuscripten_UK
dc.contributor.funderEngineering and Physical Sciences Research Councilen_UK
dc.author.emailr.j.watt@stir.ac.uken_UK
dc.citation.btitleAdvances in Brain Inspired Cognitive Systemsen_UK
dc.citation.conferencedates2016-11-28 - 2016-11-30en_UK
dc.citation.conferencelocationBeijing, Chinaen_UK
dc.citation.conferencenameBICS 2016: International Conference on Brain Inspired Cognitive Systemsen_UK
dc.citation.date30/11/2016en_UK
dc.citation.isbn978-3-319-49685-6en_UK
dc.publisher.addressCham, Switzerlanden_UK
dc.contributor.affiliationComputing Scienceen_UK
dc.contributor.affiliationUniversity of Sheffielden_UK
dc.contributor.affiliationComputing Scienceen_UK
dc.contributor.affiliationUniversity of Sheffielden_UK
dc.contributor.affiliationPsychologyen_UK
dc.contributor.affiliationMedical Research Council Institute of Hearing Researchen_UK
dc.contributor.affiliationSonova Internationalen_UK
dc.identifier.scopusid2-s2.0-84997282854en_UK
dc.identifier.wtid542655en_UK
dc.contributor.orcid0000-0002-8080-082Xen_UK
dc.contributor.orcid0000-0001-8660-1875en_UK
dc.date.accepted2016-08-24en_UK
dcterms.dateAccepted2016-08-24en_UK
dc.date.filedepositdate2016-12-13en_UK
dc.relation.funderprojectTowards visually-driven speech enhancement for cognitively-inspired multi-modal hearing-aid devicesen_UK
dc.relation.funderrefEP/M026981/1en_UK
rioxxterms.apcnot requireden_UK
rioxxterms.typeConference Paper/Proceeding/Abstracten_UK
rioxxterms.versionAMen_UK
local.rioxx.authorAbel, Andrew|en_UK
local.rioxx.authorMarxer, Ricard|en_UK
local.rioxx.authorHussain, Amir|0000-0002-8080-082Xen_UK
local.rioxx.authorBarker, Jon|en_UK
local.rioxx.authorWatt, Roger|0000-0001-8660-1875en_UK
local.rioxx.authorWhitmer, Bill|en_UK
local.rioxx.authorDerleth, Peter|en_UK
local.rioxx.projectEP/M026981/1|Engineering and Physical Sciences Research Council|http://dx.doi.org/10.13039/501100000266en_UK
local.rioxx.contributorLiu, CL|en_UK
local.rioxx.contributorHussain, A|en_UK
local.rioxx.contributorLuo, B|en_UK
local.rioxx.contributorTan, KC|en_UK
local.rioxx.contributorZeng, Y|en_UK
local.rioxx.contributorZhang, Z|en_UK
local.rioxx.freetoreaddate2016-12-16en_UK
local.rioxx.licencehttp://creativecommons.org/licenses/by-nc-sa/4.0/|2016-12-16|en_UK
local.rioxx.filenameabelBics2016Paper-final-submitted.pdfen_UK
local.rioxx.filecount1en_UK
local.rioxx.source978-3-319-49685-6en_UK
Appears in Collections:Psychology Book Chapters and Sections

Files in This Item:
File Description SizeFormat 
abelBics2016Paper-final-submitted.pdfFulltext - Accepted Version193.3 kBAdobe PDFView/Open


This item is protected by original copyright



A file in this item is licensed under a Creative Commons License Creative Commons

Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.