RegRL-KG: Learning an L1 regularized reinforcement agent for keyphrase generation

Yao, Yu; Yang, Peng; Zhao, Guangzhen; Leng, Juncheng

doi:10.3233/IDA-226561

RegRL-KG: Learning an L1 regularized reinforcement agent for keyphrase generation

Article type: Research Article

Authors: Yao, Yu | Yang, Peng^* | Zhao, Guangzhen | Leng, Juncheng

Affiliations: Key Laboratory of Computer Network and Information Integration (Southeast University), Ministry of Education, School of Computer Science and Engineering, Southeast University, Nanjing, Jiangsu, China

Correspondence: [*] Corresponding author: Peng Yang, Key Laboratory of Computer Network and Information Integration (Southeast University), Ministry of Education, School of Computer Science and Engineering, Southeast University, Nanjing, Jiangsu, China. E-mail: [email protected].

Abstract: Keyphrase generation (KG) aims at condensing the content from the source text to the target concise phrases. Though many KG algorithms have been proposed, most of them are tailored into deep learning settings with various specially designed strategies and may fail in solving the bias exposure problem. Reinforcement Learning (RL), a class of control optimization techniques, are well suited to compensate for some of the limitations of deep learning methods. Nevertheless, RL methods typically suffer from four core difficulties in keyphrase generation: environment interaction and effective exploration, complex action control, reward design, and task-specific obstacle. To tackle this difficult but significant task, we present RegRL-KG, including actor-critic based-reinforcement learning control and L1 policy regularization under the first principle of minimizing the maximum likelihood estimation (MLE) criterion by a sequence-to-sequence (Seq2Seq) deep learnining model, for efficient keyphrase generation. The agent utilizes an actor-critic network to control the generated probability distribution and employs L1 policy regularization to solve the bias exposure problem. Extensive experiments show that our method brings improvement in terms of the evaluation metrics on five scientific article benchmark datasets.

Keywords: Keyphrase generation, natural language processing (NLP), natural language generation (NLG), reinforcement learning, exposure bias

DOI: 10.3233/IDA-226561

Journal: Intelligent Data Analysis, vol. 27, no. 4, pp. 1003-1021, 2023

Published: 20 July 2023

Price: EUR 27.50

North America

IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA

Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]

For editorial issues, like the status of your submitted paper or proposals, write to [email protected]

Europe

IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands

Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]

For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]

Asia

Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China

Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]

For editorial issues, like the status of your submitted paper or proposals, write to [email protected]

如果您在出版方面需要帮助或有任何建, 件至: [email protected]

Share this:

North America

Europe

Asia