TiTDet: A tiny text detector with scale-sensitive loss and effective fusion factor

Wei, Guangcun; Fu, Jihua; Pan, Zhifei; Fang, Qingge; Zhang, Zhi

doi:10.3233/JIFS-236317

TiTDet: A tiny text detector with scale-sensitive loss and effective fusion factor

Article type: Research Article

Authors: Wei, Guangcun^{a; b; *} | Fu, Jihua^a | Pan, Zhifei^a | Fang, Qingge^a | Zhang, Zhi^c

Affiliations: [a] College of Intelligent Equipment, Shandong University of Science and Technology, Taian, Shandong, China | [b] College of Computer Sicence and Engineering, Shandong University of Science and Technology, Qingdao, Shandong, China | [c] Taian Synergy Software Ltd., Taian, Shandong, China

Correspondence: [*] Corresponding author. Guangcun Wei, College of Intelligent Equipment, Shandong University of Science and Technology, No. 223 Daizong Street, Taian, 271019, Shandong, China. E-mail: [email protected].

Abstract: The text in natural scenes is often smaller compared to artificially designed text. Due to the small proportion of pixels, low resolution, less semantic information, and susceptibility to complex scenes, tiny text detection often results in many missed detections. To address this issue, this paper draws inspiration from small object detection methods and proposes TiTDet, a detection algorithm more suitable for tiny text. Due to the small proportion of pixels, low resolution, less semantic information, and susceptibility to complex scenes, tiny text detection often results in many missed detections. To address this issue, this paper draws inspiration from small object detection methods and proposes TiTDet, a detection algorithm more suitable for tiny text. Firstly, this paper incorporates a context extraction module and an attention-guided module. These modules guide contextual information learning through a self attention mechanism, while eliminating the possible negative impact caused by redundant information. Regarding multi-scale feature fusion, this paper proposes a fine-grained effective fusion factor, making the fusion process emphasize small object learning more and highlight the feature expression of tiny texts. In terms of post-processing, this paper proposes a differentiable binarization module, incorporating the binarization process into model training. Leveraging the implicit information in the data to drive model improvement can enhance the post-processing effect. Lastly, this paper proposes a scale-sensitive loss, which can handle tiny texts more fairly, fully considering the positional relationship between the predicted and real regions, and better guiding the model training. This paper proves that TiTDet exhibits high sensitivity and accuracy in detecting tiny texts, achieving an 86.0% F1-score on ICDAR2015. The paper also compares the superiority of the method on CTW1500 and Total-Text.

Keywords: Tiny text detection, context extraction module, attention-guided module, effective fusion factor, scale-sensitive loss

DOI: 10.3233/JIFS-236317

Journal: Journal of Intelligent & Fuzzy Systems, vol. 46, no. 5-6, pp. 11367-11379, 2024

Published: 24 October 2024

Price: EUR 27.50

North America

IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA

Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]

For editorial issues, like the status of your submitted paper or proposals, write to [email protected]

Europe

IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands

Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]

For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]

Asia

Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China

Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]

For editorial issues, like the status of your submitted paper or proposals, write to [email protected]

如果您在出版方面需要帮助或有任何建, 件至: [email protected]

Share this:

North America

Europe

Asia