1
|
Lennon M, Peirce DS, Tarry BD, Willett P. An evaluation of some conflation algorithms for information retrieval. J Inf Sci 2016. [DOI: 10.1177/016555158100300403] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The characteristics of conflation algorithms are discussed and examples given of some algorithms which have been used for information retrieval systems. Comparative experiments with a range of keyword dictionaries and with the Cranfield document test collection suggest that there is relatively little difference in the performance of the algorithms despite the widely disparate means by which they have been developed and by which they operate.
Collapse
Affiliation(s)
- Martin Lennon
- Postgraduate School of Librarianship and Information Science, University of Sheffield, Western Bank, Sheffield S10 2TN, UK
| | - David S. Peirce
- Postgraduate School of Librarianship and Information Science, University of Sheffield, Western Bank, Sheffield S10 2TN, UK
| | - Brian D. Tarry
- Postgraduate School of Librarianship and Information Science, University of Sheffield, Western Bank, Sheffield S10 2TN, UK
| | - Peter Willett
- Postgraduate School of Librarianship and Information Science, University of Sheffield, Western Bank, Sheffield S10 2TN, UK
| |
Collapse
|
2
|
Lynch MF, Willett P. Information retrieval research in the Department of Information Studies, University of Sheffield: 1965-1985. J Inf Sci 2016. [DOI: 10.1177/016555158701300405] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
This paper discusses research which was carried out at the Department of Information Studies, University of Sheffield in the period 1965 to 1985 into storage and retrieval techniques for databases of textual and chemical structure data. The research includes the development of methods for the auto matic production of printed subject indexes and for the inde xing and retrieval of chemical structures and chemical reac tions, the variety generation method for the analysis, characterization and storage of data in a range of types of textual database, the prediction of biological activity in chemical compounds, and the design of document retrieval systems.
Collapse
Affiliation(s)
- Michael F. Lynch
- Department of Information Studies, The University of Sheffield, Western Bank, Sheffield S10 2TN, United Kingdom
| | - P. Willett
- Department of Information Studies, The University of Sheffield, Western Bank, Sheffield S10 2TN, United Kingdom
| |
Collapse
|
3
|
Robertson AM, Willett P. Applications ofn‐grams in textual information systems. JOURNAL OF DOCUMENTATION 1998. [DOI: 10.1108/eum0000000007161] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
4
|
|
5
|
Yannakoudakis E, Goyal P, Huggill J. The generation and use of text fragments for data compression. Inf Process Manag 1982. [DOI: 10.1016/0306-4573(82)90047-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
6
|
Cooper D, Emly MA, Lynch MF, Yeates AR. Compression of continuous prose texts using variety generation. ACTA ACUST UNITED AC 1980. [DOI: 10.1002/asi.4630310312] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|