Bin-Hezam, R. and Stevenson, R. orcid.org/0000-0002-9483-6006 (2025) A generalised and adaptable reinforcement learning stopping method. In: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval. The 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025), 13-17 Jul 2025, Padua, Italy. ACM , pp. 761-770. ISBN 9798400715921
Abstract
This paper presents a Technology Assisted Review (TAR) stopping approach based on Reinforcement Learning (RL). Previous such approaches offered limited control over stopping behaviour, such as fixing the target recall and tradeoff between preferring to maximise recall or cost. These limitations are overcome by introducing a novel RL environment, GRLStop, that allows a single model to be applied to multiple target recalls, balances the recall/cost tradeoff and integrates a classifier. Experiments were carried out on six benchmark datasets (CLEF e-Health datasets 2017-9, TREC Total Recall, TREC Legal and Reuters RCV1) at multiple target recall levels. Results showed that the proposed approach to be effective compared to multiple baselines in addition to offering greater flexibility.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2025. This work is licensed under a Creative Commons Attribution International 4.0 License - https://creativecommons.org/licenses/by/4.0/ |
Keywords: | Reinforcement Learning; Deep Reinforcement Learning; Technology Assisted Review; TAR; Stopping Methods |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 13 May 2025 10:43 |
Last Modified: | 21 Jul 2025 09:00 |
Status: | Published |
Publisher: | ACM |
Refereed: | Yes |
Identification Number: | 10.1145/3726302.3729879 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:226220 |