Legal text basic element identification based on the BERT model in the judicial field
Article type: Research Article
Authors: Li, Xiang
Affiliations: Guangxi Police College, Nanning, China | E-mail: [email protected]
Correspondence: [*] Corresponding author: Guangxi Police College, Nanning, China. E-mail: [email protected].
Abstract: In recent years, deep learning technology has developed significantly in the judicial field. More and more scholars are introducing deep learning to solve problems in the judicial field and improve the efficiency of judicial agencies in handling cases. An important task of judicial work is to identify the key elements of complex cases, establish the relationship between entities in the document, accurately grasp the development of the case, provide a basis for the understanding, analysis, and ruling of the case, and increase the interpretability of the case results. However, judging from the actual application situation, the application of deep learning technology in the judicial field is not yet mature and still faces huge challenges. On the one hand, the existing legal text element identification methods only consider the identification of core elements and ignore the basic elements. However, the basic elements contain necessary legal-related information, which is basic case information unique to the legal field and has important reference value for judicial staff to analyze cases. On the other hand, the traditional method of identifying key elements does not consider the contextual semantic relationship and ignores the element semantic information and global semantic information is lost, resulting in poor recognition of key elements of legal text. Therefore, in the identification of key elements of legal texts, not only the core elements and basic elements must be considered, but also the semantic features of sentences and the semantic features of elements must be comprehensively considered, thereby improving the effect of identifying key elements of legal texts and promoting the efficiency of judicial organs in handling cases. Aiming at the problem of identifying the basic elements of legal texts, this paper proposes a model for identifying the basic elements of legal texts based on dynamic representation. This algorithm uses the dynamic representation capability of the BERT model to vectorize text and considers global domain semantic information during the pre-training process to achieve a more comprehensive vector representation. Secondly, the memory component of the bidirectional long short-term memory network is used to integrate all the feature information between long-distance words in the legal text, effectively express the meaning of the word in the context, and realize the recognition of the basic elements of the legal text. In addition, the conditional random field model is used to learn the transfer rules of labels between entities and output the label sequence that best conforms to the actual rules. Finally, the basic element identification model proposed in this article has been significantly improved by comparing it with the current better methods.
Keywords: Judicial artificial intelligence, basic element recognition, bert model, bidirectional long short-term memory network
DOI: 10.3233/JCM-247453
Journal: Journal of Computational Methods in Sciences and Engineering, vol. 24, no. 4-5, pp. 2333-2342, 2024