Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Issue title: Special issue: Fuzzy Systems in Distributed Sensing Applications
Guest editors: Mohamed Elhoseny and X. Yuan
Article type: Research Article
Authors: Jin, Jiangang; *
Affiliations: School of Information Engineering, North China University of Water Resources and Electric Power, Zhengzhou, Henan, China
Correspondence: [*] Corresponding author. Jiangang Jin, School of Information Engineering, North China University of Water Resources and Electric Power, Zhengzhou, Henan, 450046, China. E-mail: [email protected].
Abstract: With the rapid development of the Internet, the current Web has become the main platform for people to publish and retrieve information. How to quickly and accurately find the information required by users in a large amount of network information resources has become an urgent need of the people. Web crawlers are research fields that appear to meet this demand. Based on this, the paper designs and implements a distributed web crawler system based on the existing research work, and its goal is to provide high quality data support for the network public opinion system. The web crawler system designed and implemented in this paper solves the problems of low efficiency, poor scalability and low automation of single-machine crawlers, which improves the speed of webpage collection and data extraction precision and expands the scale of webpage collection. At the end of the article, the system related interface screenshots and test results are displayed. It can be seen from the test results that the crawler system can effectively collect dynamic web pages, and the result of automatic extraction of web pages has high precision, and also realizes the entire crawling system.
Keywords: Big data, Baidu crawler technology, data retrieval, dynamic page capture, automatic navigation browsing
DOI: 10.3233/JIFS-179482
Journal: Journal of Intelligent & Fuzzy Systems, vol. 38, no. 2, pp. 1203-1213, 2020
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]