Vision-Based Deep Web Data Extraction For Web Document Clustering
M. Lavanya
Broschiertes Buch

Vision-Based Deep Web Data Extraction For Web Document Clustering

Approach to vision-based deep web data extraction for the clustering of the web document (VDEC)

Versandkostenfrei!
Versandfertig in 6-10 Tagen
52,99 €
inkl. MwSt.
PAYBACK Punkte
26 °P sammeln!
The VDEC approach comprises of two phases: 1) Vision-based web data extraction, and 2) Web document clustering. In phase 1, the web page information is segmented into various chunks from which, surplus noise and duplicate chunks are removed using three parameters, such as hyperlink percentage, noise score and cosine similarity. To identify the relevant chunk, three parameters such as Title word Relevancy, Keyword frequency-based chunk selection, Position features are used and then, a set of keywords is extracted from those main chunks. Finally, the extracted keywords are subjected to web docum...