Journal of Textile Research ›› 2018, Vol. 39 ›› Issue (10): 156-161.doi: 10.13475/j.fzxb.20171010106

Previous Articles     Next Articles

Extracting method of household textile resources from Web

  

  • Received:2017-10-30 Revised:2018-05-28 Online:2018-10-15 Published:2018-10-17

Abstract:

Aiming at the of poor efficiency while processing a huge quantity of Web resources, particularly data reaources hidden in deep web by problem of current household textile resources from Web acquisition mode, an automatic approach to extract home textile resouces from Web was proposed. In this approach, a domain model was firstly proposed to identify deep Web query interfaces, then the identkfied query interfaces were filled automatically with domain keywords from household textiles, and the household textile resources from deep Web were extracted. In addition, in order to filter noises from response Web pages, pages were divided into different view blocks, a block importance model was proposed and trained by labeled blocks, and the model was utilized to filter the noise information independent from the subject. Experimental results show that in comparison with rule-based approaches, the domain model achieves 8% and 6% improvements in terms of positive predictive value and accuracy for query interface identification. Also, the block importance model achieves average 12.9% improvements at three levels in terms of harmonic average value for filtering noise information.

Key words: household textile, resource database, deep Web, information extraction

[1] 郭春花. 纺织“十三五”蓝图初绘 访中国纺织工业联合会副会长孙瑞哲[J]. 纺织服装周刊,2016,(02):16-17.
GUO Chunhua. Textile "13th five-year" blueprint: Inter-view with Sun Ruizhe, vice president of China Textile In-dustry Association [J]. Textiles and clothing week-ly,2016,(02):16-17.
[2] 战洪飞. 基于网格的家纺行业产品协同设计[J]. 纺织学报,2009,30(08):138-142.
ZHAN H F. Study on grid based product collaborative de-sign for home textile enterprises [J]. Journal of
Textile Research,2009,30(8):138-142.
[3] 曹飞. 家纺床品数据库查询系统的研究与实现[D]. 苏州大学, 2011.
CAO Fei. The Research and Implementation of Home Textile Bedding Database Query System[D]. Soochow University, 2011.
[4] ZHENG Q H, WU Z H, CHENG X C, et al. Learning to crawl deep web [J]. Information Systems, 38(6): 801-819.
[5] Jan Zeleny, Radek Burget, Jaroslav Zendulka. Box cluster-ing segmentation: A new method for vision-based web page preprocessing[J]. Information Processing & Man-agement, 2017, 53(3): 735-750.
[6] Fayzrakhmanov R R. Information Extraction from Web Pages Based on Their Visual Representation[M]Current Trends in Web Engineering. Springer Berlin Heidelberg, 2011:342-346.
[7] Seung Min Kim, Suk I. Yoo. DOM tree browsing of a very large XML document: Design and implementation [J]. Journal of Systems and Software, 82(11): 1843-1858.
[8] Maksim Lapin, Matthias Hein, Bernt Schiele. Learning using privileged information: SVM+ and weighted SVM[J]. Neural Networks, 53: 95-108.
[9] FU Y, YANG D Q, TANG S W. Using Xpath to discover informative content blocks of web pages[C]//Proceedings of the Third International Conference on Semantics, Knowledge and Grid, Shan Xi;2007:450-453.
No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] . [J]. JOURNAL OF TEXTILE RESEARCH, 2003, 24(05): 102 -103 .
[2] WAN Zhen-kai;LI Jing-dong. Feature of acoustic emission and failure analysis for three-dimensional braided composite material under compressive load[J]. JOURNAL OF TEXTILE RESEARCH, 2006, 27(2): 20 -24 .
[3] . [J]. JOURNAL OF TEXTILE RESEARCH, 2004, 25(04): 87 -88 .
[4] ZHANG Yiyu;WANG Lu;C.CAMPAGNE;R.ABDESSEMED. Fragrant finishing of cotton fabric with lavender oil via β-cyclodextrin technology[J]. JOURNAL OF TEXTILE RESEARCH, 2008, 29(9): 94 -97 .
[5] YOU Xiu-lan;LIU Zhao-feng;CAO Yu-tong;HU Zu-ming. Influence of thickness of PPTA gel on the properties of its pulp[J]. JOURNAL OF TEXTILE RESEARCH, 2006, 27(9): 22 -24 .
[6] . [J]. JOURNAL OF TEXTILE RESEARCH, 1982, 3(09): 4 .
[7] . [J]. JOURNAL OF TEXTILE RESEARCH, 1992, 13(07): 16 -19 .
[8] . [J]. JOURNAL OF TEXTILE RESEARCH, 2004, 25(04): 24 -25 .
[9] . [J]. JOURNAL OF TEXTILE RESEARCH, 1989, 10(06): 18 -20 .
[10] . [J]. JOURNAL OF TEXTILE RESEARCH, 1989, 10(08): 15 -18 .