Google sees EVERYTHING. In the literal sense, sees

Детективное агентство ИКС-Инфо. · Dec 24, 2011

Sorry, the post was written by specials and therefore it will be a little complicated for a simple user ...
But no less interesting ... I think so ...

Further, the author’s text ...

Evidence
The Instant Preview feature is why we see annotation screenshots in SERP. These previews have an impressive opportunity: they not only display a screenshot of the page, but also visually highlight and underline the text that suits your request. This is simply impossible to achieve with a simple text spider. Screenshots of flash pages - you may already have noticed screenshots of flash sites in the Google Webmaster Tools. Wait ... I thought Google did not see the flash ... AJAX POST request confirmation - Matt Cutts confirmed that GoogleBot can handle AJAX POST requests, and, by chance, this happened a few hours after Rand posted the article “ GoogleBot is Chrome. ” By definition, AJAX is JavaScript-loaded content when an action occurs after the page loads. Therefore, it cannot be tracked with a text spider, because the text spider does not execute JavaScript, but only receives the existing code as provided at the initial download. Google tracks Flash - Matt Clayton also showed me some server logs in which GoogleBot accessed URLs that are accessible only through the built-in Flash modules on Mixcloud.com: 66.249.71.130 "13 / Nov / 2011: 11: 55: 41 + 0000 "" GET / config /? W = 300 & h = 300 & js = 1 & embed_type = widget_standard & feed = http% 3A // www.mixcloud.com / chrisreadsubstance / bbe-mixtape-competition-2010.json & tk = TlVMTA HTTP / 1.1 "200 695" - "" Mozilla / 5.0 (compatible; Googlebot / 2.1; + http: //www.google.com/bot.html) "66.249.71.116" 13 / Nov / 2011: 11: 51: 14 +0000 "" GET / config /?w=300&h=300&js=1&feed=http%3A//www.mixcloud.com/ZiMoN/electro-house-mix-16.json&embed_type=widget_standard&tk=TlVMTA HTTP / 1.1 "200 694" - "" Mozilla / 5.0 ( compatible; Googlebot / 2.1; + http: //www.google.com/bot.html) Let's say this is not news, but another post from 2008 explains that Google “treats Flash files in the same way as they would a person by entering data, and so on. ”And, you mean, how does a person work with a browser? Site - Although Google could get website load time from the toolbar and usage data from Chrome, it’s much more reliable for it to get this information by indexing the network itself. Without executing all the page code, it is almost impossible to accurately calculate the loading time of this page. Until now, all this may have sounded like Google is just a few steps from SkyNet. And optimizers and Google have been assuring us for many years that the search robot (spider) has a textual basis, so this may seem fantastic to you. I assure you, this is not so, and many of the things that I am talking about are accessible to programmers even with a much less strong team of engineers than Google. Meet PhantomJS PhantomJS is a headless Webkit browser that can be controlled through the JavaScript API. With a little script automation, you can easily turn your browser into a spider. It's funny that its logo is a ghost similar to the ghosts in Pacman, and the concept is quite simple: PhantomJS is used to load the page as the user sees it in Firefox, Chrome or Safari, extract materials and track links. PhantomJS has countless applications for parsing information and other types of website analysis, and I advise the SEO community to realize this before we move on. Josh used PhantomJS to prepare some evidence for the information I posted on SearchLove. Earlier, when I released GoFish, I already mentioned that I had difficulty collecting information about the growth in the number of requests from Google Insights using a text spider due to the fact that the list of these questions is provided through AJAX. Richard Baxter suggested that this data can be easily collected using the XPath string, and this convinces me that the search importXML architecture in Google Docs is also based on a headless browser. It is written in red on the diagram: "In the usual way, this data cannot be obtained, because it is AJAX." Anyway, here Josh takes this data off the page using PhantomJS. It is not possible to take screenshots with a text spider, but using the headless webkit browser is as simple as that. In this example, Josh shows how screenshots are taken using PhantomJS. Chromium is a public branch of the Webkit browser, and I strongly doubt that Google created the browser for purely altruistic reasons. The above study suggests that GoogleBot is a multi-threaded headless browser based on the same code. Why don't they tell us anything? Well, actually, they say, but claim that the "robot indexer for creating previews" is a completely separate object. Imagine this robot as "Mrs. Pacman." A member of the main forum of webmasters complained that as a user agent, they display in their logs "Mozilla / 5.0 (X11; U; Linux x86_64; en-US) AppleWebKit / 534.14 (KHTML, like Gecko) Chrome / 9.0.597 Safari / 534.14" and not "Mozilla / 5.0 (en-us) AppleWebKit / 525.13 (KHTML, like Gecko; Google Web Preview) Version / 3.1 Safari / 525.13". John Mu said: “As a tool for testing instant previews, we use a user agent similar to Chrome, so that we can compare what the browser will see (using this user agent) with what we see using Googlebot’s cached access preview. " While the headless browser and Googlebot, as we know, are different, it seems to me that they always browse the pages in parallel and collect information for indexing and ranking. In other words, it’s like a simultaneous two-user version of Pacman with Mrs. Pacman in 3D and regular Pacman, who play on the same level at the same time. In the end, it would not make sense for spiders to browse the entire network twice separately. So why isn’t everything so clear regarding these opportunities, because they are related to ranking? In a nutshell: search quality. Hiding behind the flaws of text spiders, search engines can continue to use them as a scapegoat to explain their imperfect results. They can continue to move in the direction of things like the alleged AuthorRank and rely on SEO to literally optimize their search engines. They can continue to say vague things, like “don't chase the algorithm”, “improve user experience” and “we take into account what is visible without scrolling”, which makes SEO experts make Google's job easier. The main products of Google (and their only products, if you ask Eric Schmidt in court), is search, and if you release information that their capabilities are much higher than declared, then they will have to improve the quality of the search. They don’t tell us about it, because as opportunities grow, so does responsibility. What does this mean for us? When Josh and I presented our research, many people asked me: “How should this change my actions in terms of SEO?” In my opinion, there are three points: 1. Javascript will not help you hide anything. If it seemed to you that with the help of JavaScript postloading you could hide some content - stop doing this. Luring and switching is now a 100% inefficient method. Pacman sees everything. 2. The user experience is extremely important. Google can literally see your site now! As Matt Cutts said, they look at what is above the scroll border, and therefore can take into account when ranking how much advertising is presented on the page. Google can use behavioral data along with site design to determine how useful the site is to people. This is both pleasing and scary, but it also means that every SEO specialist should buy the Circle of Do Not Make Me Think book. 3. SEO tools need to get smarter. Most SEO tools are based on text scrapers, and although many of them are quite complex (SEOmoz currently leads), they still look a lot like Pacman in the 80s. If we want to understand what Google really takes into account when ranking pages, we need to consider more aspects. - When discussing such things as Page Authority and the likelihood of spam, you need to visually check the pages from the point of view of the program, and not be limited to simple indicators, such as the distribution density of keywords and the link graph. In other words, we need a user perception quality indicator (UX Quality Score) that would be influenced by visual analysis and possible modifications to spam. - You should compare how much the page displayed differs from what can be assumed by the code. This can be called the Delta Score. - When assessing the distribution of the proportion of links on a page, one should also take into account dynamic transformation (dinamic transformations), since search engines are able to understand how many links are actually on the page. This factor can also be included in the Delta Score. - You should also include natural language processing in our analysis, as this, apparently, is also taken into account by the Google algorithm. This factor does not significantly affect the overall result, but helps to identify the key concepts with which the machine associates content, as well as fully understand what the link is worth, taking into account the desired result. In other words, contextual analysis of the link graph is necessary. In two things, I agree with Matt Kuts. The only constant parameter is change. However, we must also understand that Google will continue to misinform us about its capabilities or push us to certain conclusions, which we will then adhere to. Therefore, we should understand that Google is responsible for its technology. Simply put, if they can accurately prove that they are not doing anything, then from this moment they should start; after all, some of the most talented engineers on the planet work there. Google continues to complicate search engine marketing and cancel data that allows us to improve user experience, but the fact is that we have a symbiosis. Search engines need SEO specialists and webmasters to make the network faster, easier and more understandable, and we need search engines to promote high-quality content in more prominent places. The problem is that Google has all the cards in its hands, and I’m glad that I did my best to snatch one of them. Your move, Matt.

https://ne-onn.blog.ru/137298867.html

Частный детектив. Владивосток. · Dec 24, 2011

Детективное агентство ИКС-Инфо. · Dec 24, 2011

Частный детектив. Владивосток. said:

In-in ... and I also scratched turnips from all these protocols, etc. ...
Then he spat - read "through the line" and realized ...
Google is following, scum ... is following ...: roll:

Евгений СБ · Dec 25, 2011

Частный детектив. Владивосток. said:

Search

Search

Google sees EVERYTHING. In the literal sense, sees

Детективное агентство ИКС-Инфо.

Зарегистрированный

Частный детектив. Владивосток.

Зарегистрированный

Детективное агентство ИКС-Инфо.

Зарегистрированный

Евгений СБ

Зарегистрированный

Similar threads

Share this page