A generalised and adaptable reinforcement learning stopping method

Bin-Hezam, R. and Stevenson, R. orcid.org/0000-0002-9483-6006 (2025) A generalised and adaptable reinforcement learning stopping method. In: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval. The 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025), 13-17 Jul 2025, Padua, Italy. ACM , pp. 761-770. ISBN 9798400715921

Abstract

This paper presents a Technology Assisted Review (TAR) stopping approach based on Reinforcement Learning (RL). Previous such approaches offered limited control over stopping behaviour, such as fixing the target recall and tradeoff between preferring to maximise recall or cost. These limitations are overcome by introducing a novel RL environment, GRLStop, that allows a single model to be applied to multiple target recalls, balances the recall/cost tradeoff and integrates a classifier. Experiments were carried out on six benchmark datasets (CLEF e-Health datasets 2017-9, TREC Total Recall, TREC Legal and Reuters RCV1) at multiple target recall levels. Results showed that the proposed approach to be effective compared to multiple baselines in addition to offering greater flexibility.

Metadata

Item Type:	Proceedings Paper
Authors/Creators:	Bin-Hezam, R. Stevenson, R. https://orcid.org/0000-0002-9483-6006
Copyright, Publisher and Additional Information:	© 2025. This work is licensed under a Creative Commons Attribution International 4.0 License - https://creativecommons.org/licenses/by/4.0/
Keywords:	Reinforcement Learning; Deep Reinforcement Learning; Technology Assisted Review; TAR; Stopping Methods
Dates:	Accepted: 4 April 2025 Published (online): 13 July 2025 Published: July 2025
Institution:	The University of Sheffield
Academic Units:	The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Depositing User:	Symplectic Sheffield
Date Deposited:	13 May 2025 10:43
Last Modified:	21 Jul 2025 09:00
Status:	Published
Publisher:	ACM
Refereed:	Yes
Identification Number:	10.1145/3726302.3729879
Related URLs:	arXiv URL Conference
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:226220

CORE (COnnecting REpositories)

A generalised and adaptable reinforcement learning stopping method

Abstract

Metadata

Download

Published Version

Export

Statistics