Massive Google Search document leak exposes inner workings of ranking algorithm

“You need to drive more successful clicks using a broader set of queries and earn more link diversity to maintain your ranking. It makes sense because strong content will naturally do this. Focusing on driving qualified traffic to a better user experience signals to Google that your page deserves to rank.”

Evidence and statements from the U.S. vs. Google antitrust trial affirmed that Google incorporates clicks into its ranking algorithms, notably through its Navboost system, described as “one of the significant signals” for ranking.

Brand matters. Fishkin’s big takeaway? Brand matters more than anything else:

“If there was one universal piece of advice I had for marketers seeking to broadly improve their organic search rankings and traffic, it would be: ‘Build a notable, popular, well-recognized brand in your space, outside of Google search.’”

Entities are significant

Authorship remains relevant as Google stores author information linked to content and seeks to attribute documents to specific entities.

SiteAuthority: Google employs a metric known as “siteAuthority”.

In 2011, after the Panda update, Google publicly stated that “low-quality content on part of a site can impact a site’s ranking as a whole.” Despite this, Google has in subsequent years denied having a website authority score.

Chrome data: A module named ChromeInTotal indicates Google utilizes data from its Chrome browser for ranking purposes.

Whitelists: Modules like isElectionAuthority and isCovidLocalAuthority indicate Google whitelists specific domains related to elections and COVID. This practice is known as having “exception lists” to mitigate unintended impacts from algorithms on websites.

Small sites: Another feature, smallPersonalSite, is designed for small personal sites or blogs. There is speculation that Google could adjust rankings for such sites using a Twiddler, but how much weight this feature carries, and how large any adjustment would be, remains unclear.

Other notable findings from Google’s internal documents include:

Freshness matters: Google considers dates in the byline (bylineDate), URL (syntacticDate), and on-page content (semanticDate) to assess content freshness.
Topic relevance: To determine if a document aligns with a website’s core topics, Google uses page and site embeddings. siteFocusScore indicates how focused a site is on a single topic, and siteRadius measures how far a page’s embedding deviates from the site embedding.
Domain registration information: Google stores domain registration details (RegistrationInfo).
Page titles: The feature titlematchScore is believed to gauge how well a page title matches a query.
Text analysis: Google measures the average weighted font size of terms in documents (avgTermWeight) and anchor text.
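To make the topic relevance item above more concrete, here is a minimal sketch of how a page embedding could be compared with a site-level embedding. It is not Google's implementation: the leaked documents name the features but not the formulas. The averaging step, the cosine-based deviation, the function names, and the three-dimensional toy vectors are all assumptions made for illustration.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def site_embedding(page_embeddings):
    """Assumption: represent the site as the mean of its page embeddings."""
    n = len(page_embeddings)
    return [sum(column) / n for column in zip(*page_embeddings)]

def page_deviation(page, site):
    """Assumption: deviation is 1 minus cosine similarity (0 means on-topic)."""
    return 1.0 - cosine_similarity(page, site)

# Hypothetical toy embeddings; real embeddings have hundreds of dimensions.
pages = {
    "/running-shoes": [0.9, 0.1, 0.0],
    "/trail-running": [0.8, 0.2, 0.0],
    "/crypto-news": [0.0, 0.1, 0.9],
}

site = site_embedding(list(pages.values()))
deviations = {url: page_deviation(vec, site) for url, vec in pages.items()}

# The page whose embedding sits farthest from the site embedding.
off_topic = max(deviations, key=deviations.get)
print(off_topic, round(deviations[off_topic], 3))
```

In this toy setup the page about an unrelated subject ends up farthest from the site vector, which is the kind of signal a per-page deviation measure would capture. How Google actually computes or weights these values is not stated in the leak.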
