search engine on the main process of "analysis, is divided into clear functional blocks". As the text block, block links and telephone block, independent advertising block etc.. And it is to determine the ways such as: see how many words, see HTML code form, text content to Natural Language Processing to understand and so on.
search engine program is not in any case, may be perfect judgment on the Internet so many different web pages in different situations.
is the first article in the series, Shanghai dragon should be mentioned on the basis of data, and slightly expanded to write some data preparation. Although the data is very important, but its role is only auxiliary: find the problems and summarize the improvement, as a reference factor in decision-making, but can’t be separated from the existing Shanghai Longfeng method and exist independently.