Website construction companyShangpin ChinaSEO website optimizationIn this case, it is no longer practical, for example, a collection problem.In fact, in order to accurately understand the relationship between index, inclusion and site, we can consider some issues from the basic principles.
In terms of search principle, the spider first grabs the URL of a web page, then downloads and analyzes the content of the web page corresponding to the URL, indexes the web page that meets its quality standards or has a certain purpose, and puts the indexed web page into the index database.At this time, theMarketing website constructionSome have user retrieval value, and some have search engine's own retrieval value. Indexed pages that are valuable for users will be output, that is, included.The pages that only have the search engine's own retrieval value may not be output, but only have a certain number of indexes without the number of output results. Therefore, we can see that many times the number of indexes will be much lower than the number of indexes.
From the perspective of search, the number of pages of a website is sometimes greater than the number of pages currently owned.For example, there are 100 web pages in a website. For users or webmasters, there are 100 web pages. But these 100 web pages may have multiple operations such as data updates and page changes. Different versions may meet different needs. (Therefore, we can see that many times a web page has different snapshots.) From this perspective,In the eyes of search, the number of pages of a website can be larger than the number of output pages currently owned by the website, especially for frequently modified websites or websites with non-standard URLs.At the same time, from the perspective of search engine data, its data volume may consist of historical data and updated data, so the site related result value is also greater than the number of site results.
According to the above statement, we reorganize the relationship between the four:
Index volume and collection volume: Index volume is the collection of all valuable pages for search. Some of these pages are valuable to users. The output of these pages is the collection volume (different people may define different pages). Some pages are only valuable to the search engine itself. The number of these pages causes the index volume to be higher than the collection volume.
Number of site results and related result values: We often see the site results as follows:
We see a problem. The number of relevant results is 2790, while the site result is only about 100. There is a big gap between the two.The reason for the gap may be due to multiple factors. For example, some web pages may be recalculated, and some web pages are included (retrieval value exists), but the page quality is not high (web page value and retrieval value are not the same thing, web page retrieval value is only a foundation of web page value, and web page value is composed of multiple factors.)
At the same time, we also need to know that spiders are machines after all, and the number of pages on many websites on the Internet is changing in different ways. There are always new pages and old pages deleted. The value we see at one time is generally accurate, not 100% accurate.
In terms of inclusion, the relationship between the four is generally as follows:
The number of indexes is greater than the number of collections, the number of collections is greater than the number of site results, and the number of related results is greater than the number of site results.However, in general, we personally recommend the following methods to simplify these relationships:
1. The number of site direct results is of great significance and value to SEO. In addition to the number of site results to judge the value of some pages, we suggest to increase the ratio of site results to Baidu index volume, and the ratio of Baidu index volume to the number of pages on the entire website in terms of inclusion. SEO optimization and operation can be carried out from these two ratios.As for the concept of entanglement, just ignore it directly.
2. Baidu index amount=Baidu collection amount, because the collection amount is actually invisible, and the number of site results and related result values cannot represent the collection amount.