Tag Archive for: Gary Ilyes

Every brand wants to put their best foot forward. If you want to do that online, you need to understand what canonical URLs are. So, today we are going to talk a bit about what canonical URLs are, why your pages may have a canonical version, and how Google chooses which page is the canonical page. 

What Are Canonical URLs?

A canonical URL or web page is the version of a page selected to be indexed by Google when there are multiple versions of the page. 

This version of the page is used by Google to rank the web page and be displayed in search results in order to prevent duplicate search listings. 

As the owner of the website, you have some control over which pages are chosen to be canonical URLs. As we will get into further down, though, Google doesn’t always select the page you believe should be the canonical version.

Before we get to that, let’s take a moment to talk about the legitimate reasons why you may have duplicate versions of a page.

5 Reasons For Having Duplicate Web Pages

According to Google’s official documentation and guidelines about canonical webpages, the search engine believes there are five legitimate reasons a webpage may have multiple versions. 

  1. Region variants: for example, a piece of content for the USA and the UK, accessible from different URLs, but essentially the same content in the same language
  2. Device variants: for example, a page with both a mobile and a desktop version
  3. Protocol variants: for example, the HTTP and HTTPS versions of a site
  4. Site functions: for example, the results of sorting and filtering functions of a category page
  5. Accidental variants: for example, the demo version of the site is accidentally left accessible to crawlers

How Google Chooses A Canonical Webpage 

Until very recently, it was unclear exactly how Google selected canonical pages. Website owners and managers could signal which version they wanted to appear in search results using the rel=”canonical” tag in the code of the page. 

However, this version wasn’t always the one that Google went with.

Gary Ilyes from Google cleared the mystery up (mostly) in a recent Google Search Central video. 

The process starts with finding the content and identifying the main content or “centerpiece of a page”. Then, it groups the pages with similar content in duplicate clusters. 

Then, Google uses a handful of pages to essentially rank each version of the page like it would a listing in search results. The page with the best ranking is selected as the canonical version and included in most search results. 

While he doesn’t list exactly what signals are used, Ilyes did say this:

“Some signals are very straightforward, such as site owner annotations in HTML like rel=”canonical”, while others, like the importance of an individual page on the internet, are less straightforward.”

Notably, this doesn’t mean that Google only indexes one version of the page to be used in all contexts. There are situations where Google may decide to show users a version of the page other than the canonical version.

“The other versions in the cluster become alternate versions that may be served in different contexts, like if the user is searching for a very specific page from the cluster.

To hear Gary Ilyes himself talk about the process, check out the full Google Search Central Video below:

The Google SEO Starter Guide is designed to help individuals and organizations quickly learn the most important steps necessary for getting their websites ranking within Google Search. 

While the guide reportedly maintains a 91% approval rating, it has largely gone without updates for several years but that will be changing soon.

In a recent episode of Google’s “Search Off The Record” podcast, the company’s Search Relations team discussed plans to update the SEO Starter Guide, including talking about what would and would not be included in the revised document. 

Discussions like this are great for seeing how SEO is talked about within the search engine and learning what the company prioritizes when ranking sites along with identifying SEO myths that might lead you astray when optimizing your own site. 

So, what’s changing in the revised SEO Starter Guide?

HTML Structure

One topic the group discussed was the importance (or lack thereof) of HTML structure when it comes to online rankings.

While the team agreed that using proper HTML structure can help with online rankings, they indicated the guide will clarify that these are not all that important in the grand scheme.

As Google’s Gary Ilyes said:

“Using headings and a good title element and having paragraphs, yeah, sure. It’s all great, but other than that it’s pretty futile to think about how the page… or the HTML is structured.”

Branded Domain Names vs Keyword Rich Domain Names

SEO experts have been increasingly debating whether it is better to focus on your existing branding when establishing a domain name, or if domains perform better when including specific keywords.

According to the Google team, the new guide will clarify this by indicating that brands should focus on including branding in their domains over using keywords. The thought process shared by those in the discussion was that establishing a memorable brand will have a more long-term impact than trying to optimize your domain specifically for search engines. 

Debunking SEO Myths

Lastly, the group said one thing they want to improve in the document was how it addressed widespread SEO myths and misconceptions. 

For example, everyone agreed that the SEO Starter Guide should specifically debunk the idea that using Google products while creating or optimizing your site will improve search rankings. 

They indicated they would address this myth and several others to prevent people from optimizing their site based on misinformation found elsewhere online. 

For more from the discussion, listen to the entire episode of “Search Off The Record” here.