Performance Comparison of Ad-hoc Retrieval Models over Full-text vs. Titles of Documents

Saleh, Ahmed; Beck, Tilman; Galke, Lukas; Scherp, Ansgar

doi:10.1007/978-3-030-04257-8

Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/28014

Appears in Collections:	Computing Science and Mathematics Conference Papers and Proceedings
Author(s):	Saleh, Ahmed Beck, Tilman Galke, Lukas Scherp, Ansgar
Contact Email:	ansgar.scherp@stir.ac.uk
Title:	Performance Comparison of Ad-hoc Retrieval Models over Full-text vs. Titles of Documents
Editor(s):	Dobreva, M Hinze, A Žumer, M
Citation:	Saleh A, Beck T, Galke L & Scherp A (2018) Performance Comparison of Ad-hoc Retrieval Models over Full-text vs. Titles of Documents. In: Dobreva M, Hinze A & Žumer M (eds.) Maturity and Innovation in Digital Libraries. ICADL 2018. Lecture Notes in Computer Science, 11279. The 20th International Conference on Asia-Pacific Digital Libraries, Hamilton, New Zealand, 19.11.2018-22.11.2018. Cham, Switzerland: Springer, pp. 290-303. https://doi.org/10.1007/978-3-030-04257-8
Issue Date:	31-Dec-2018
Date Deposited:	22-Oct-2018
Series/Report no.:	Lecture Notes in Computer Science, 11279
Conference Name:	The 20th International Conference on Asia-Pacific Digital Libraries
Conference Dates:	2018-11-19 - 2018-11-22
Conference Location:	Hamilton, New Zealand
Abstract:	While there are many studies on information retrieval models using full-text, there are presently no comparison studies of full-text retrieval vs. retrieval only over the titles of documents. On the one hand, the full-text of documents like scientific papers is not always available due to, e. g., copyright policies of academic publishers. On the other hand, conducting a search based on titles alone has strong limitations. Titles are short and therefore may not contain enough information to yield satisfactory search results. In this paper, we compare different retrieval models regarding their search performance on the full-text vs. only titles of documents. We use different datasets, including the three digital library datasets: EconBiz, IREON, and PubMed. The results show that it is possible to build effective title-based retrieval models that provide competitive results comparable to full-text retrieval. The difference between the average evaluation results of the best title-based retrieval models is only % less than those of the best full-text-based retrieval models.
Status:	AM - Accepted Manuscript
Rights:	This is a post-peer-review, pre-copyedit version of a paper published in Dobreva M, Hinze A & Žumer M (eds.) Maturity and Innovation in Digital Libraries. ICADL 2018. The final authenticated version is available online at: https://doi.org/10.1007/978-3-030-04257-8_30

Files in This Item:

File	Description	Size	Format
C66-SalehEtAl-Performance Comparison of Ad-hoc Retrieval Models over Full-text vs. Titles of Documents.pdf	Fulltext - Accepted Version	407.4 kB	Adobe PDF	View/Open

This item is protected by original copyright

View License

Show full item record

Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.

STORRE

STORRE: Stirling Online Research Repository