After working through this guide, you should understand how to create and modify news article pages using design and layout choices that improve Google's ability to crawl and comprehend their content.
2.1.1 What Is Design and Layout?
The design and layout of your site determine how it appears to the end user. This matters because Google is ultimately driven by a user-first philosophy: web pages that satisfy the user's need first, fastest and in the simplest possible way are rewarded with higher SERP rankings.
Your site's design and layout also determine how easily web crawlers, such as Googlebot, can crawl and index it. A simple, optimized design and layout means fast and easy crawling, which in turn translates into better rankings.
So what stops publishers from implementing design and layout best practices? Most often, they are held back by a handful of common pain points, which this guide addresses in turn.
To see how design and layout play out in practice, we performed a simple Google search, entering the keywords “Charlie Puth News” into the search bar.
Here’s what the search results turned up:
At number two on the Top Stories SERP ranking, right above a story by NME, is the Daily Illini’s article on Charlie Puth’s latest release.
The fact that the Daily Illini, a university student newspaper, outranks the world's biggest standalone music website raises an important question: how is a student paper from a midwestern American town of some 40,000 inhabitants outranking the largest music news website on earth? Intrigued, we decided to dig around a little more.
First we checked out NME’s page on Charlie Puth.
Right off the bat, the first thing we notice is a pop-up video struggling to load in the bottom right-hand corner of the screen. The buffering video also hides part of the news headline and its body.
Next up, we notice that the initial viewport is occupied mostly by stuff that isn’t relevant to the news story. There’s a big banner ad covering about half of the page, and of course there’s the video.
In fact, scrolling down the page, we encounter more videos, more big, rich images, more pop-up ads and a lot of hyperlinks. Given how media-rich the page is, it unsurprisingly takes quite a while to load.
We next inspected the Daily Illini and here’s what we found.
The page is neat, clean and uncluttered. It has its share of ads and a big Donate button at the top, but there are no videos or pop-ups covering the viewport or obstructing the news headline. We can see the headline right away, and it is very likely that the same applies to Google’s web crawler.
On the whole, the page is light, minimalistic and lightning fast to load.
We decided to peek under the hood a little more at the underlying code. By right-clicking on the page and selecting View Source (while using Chrome), we can see the page’s code.
This is what we saw for the NME page:
Two things grabbed our attention here:
This is not the best thing for a page for two reasons:
When we looked at the code for the Daily Illini page on the other hand, we saw this:
This is very simple HTML code. Also, there are no scripts running within the <head> section.
How does this all add up to the Daily Illini outranking NME?
There are probably a number of factors at work here, and one among them is design and layout. The Daily Illini page deploys certain design and layout techniques that even small publishers can easily replicate to boost their overall SEO strategy.
These include using clean, simple HTML code, avoiding scripts in the header section, keeping the page light and fast to load, and not relying too heavily on pop-ups and interstitial ads.
The guide below digs into each one of these in detail, while explaining several other techniques you can implement to significantly improve your SERP rankings.
Semantics relates to the meaning of words. Semantic HTML tags are those whose names clearly convey their meaning to both the reader and a web crawler.
For example, when we use a tag like <header>, we know at a glance what it contains: the introductory content at the top of the page.
Similarly <h1> is a semantic tag that tells Googlebot that what follows is the most important heading in the article.
By contrast, when we use a tag like <div>, its meaning is not immediately apparent. In HTML <div> stands for division, and all it implies is that a new code section has begun, without necessarily revealing any information about the contents of this section.
Web crawlers like Googlebot are built on machine learning systems that attempt to approximate how the human brain processes language. This means they make sense of text in much the same way a human reader does.
HTML code that is easy for humans to understand should also be easy for Google’s web crawler to understand.
As an example, consider the two pieces of HTML code below:
Source: https://www.pluralsight.com/guides/semantic-html
This page uses the <div> tag for everything, from the header to the main content to the footer. Reading the tags alone, it is not apparent what any section contains.
By contrast, the page below uses semantic markup. The header is placed within the <header> tag, the footer within the <footer> tag, and the main body of the article goes within the <main> tag.
Source: https://www.pluralsight.com/guides/semantic-html
Since this is easy for Googlebot to read and understand, this page has a better chance of ranking higher than the previous one, all other things being equal.
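The contrast between the two pages can be sketched in a few lines of HTML. The page content below is invented for illustration; only the choice of tags matters:

```html
<!-- Non-semantic layout: every section is an anonymous <div> -->
<div id="top">My News Site</div>
<div id="story">
  <div class="big">Headline goes here</div>
  <div class="text">Story body goes here...</div>
</div>
<div id="bottom">Copyright 2024</div>

<!-- Semantic layout: the tags themselves describe each section -->
<header>My News Site</header>
<main>
  <article>
    <h1>Headline goes here</h1>
    <p>Story body goes here...</p>
  </article>
</main>
<footer>Copyright 2024</footer>
```

Both versions render similar content, but only the second tells a crawler which part is the site header, which is the article, and which is the footer.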
To check whether your page uses semantic markup in Google Chrome, simply right-click on the page and click Inspect to view the page's HTML source code. Common semantic elements include <article>, <video>, <form>, <header>, <footer> and <nav>.
We now know what semantic markup is and why it's important. But how do we use it to improve SEO?
It’s simple — always use semantic markup to mark out important information about your article’s design and layout. This includes the following article information:
Ensure that your page’s layout is well-ordered to improve crawling
You’re designing your site to be read by both humans and web crawlers and, as such, your design and layout should reflect this fact.
Below are a few tips to help you achieve measurable outcomes for your website.
You can use HTML, CSS, JavaScript or any other frontend language to create rich and interactive pages. However, remember that the more advanced the language, the greater its complexity, and the greater the chances that a web crawler will find it hard to read, interpret and render.
A page coded in plain HTML may not be the prettiest to look at, but it will both load faster and be better optimized for search engines, for the simple reason that search engines can read and understand it faster.
Think of plain HTML as the bare bones skeleton of your web page. You can add CSS and Javascript to flesh it out and make it look aesthetically pleasing and dynamic, but it would be better to keep the most important content within the skeleton rather than place it in the flesh.
So how do we implement plain HTML? One simple way of doing it is to place the main body of your content within <article> HTML tags.
This way, when web crawlers encounter the <article> tag, they know immediately that what follows is the most important content on your page — the news article. This helps the search engine understand that the content wrapped within this tag needs to be assigned greater weight.
Plain HTML’s <article> tag is a semantic marker that looks like this:
Source: https://en.wikipedia.org/wiki/Article_element
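A minimal news page built around the <article> tag might look like the sketch below. The headline, byline and body text are invented for illustration:

```html
<article>
  <h1>Local Band Releases New Single</h1>
  <p>By Jane Reporter, 9 October 2023</p>
  <p>The main body of the news story goes here, in plain HTML
     paragraphs that any crawler can read without executing scripts.</p>
</article>
```

When Googlebot hits the opening <article> tag, it knows the enclosed content is the self-contained story on this page.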
The next obvious question? If I’m using a CMS like WordPress, where do I insert these tags?
How to Do This: If you’re building a custom website using HTML, then you can check the source code to ensure it’s using plain HTML, especially in critical areas. We’d advise speaking in more detail with a developer to ensure you don’t hamstring functionality by accident.
If you’re using WordPress, then refer to this guide. You may also find this guide on how to insert HTML into posts and pages a useful reference source.
These instructions are for WordPress as WordPress remains the most popular CMS for publishers. If you’re using a different CMS such as Wix, please consult the support or documentation page for your CMS.
If you have access to a team of web developers, it is best to have them do it as editing HTML code can be time consuming.
Test to ensure that your content appears correctly in all browsers, devices and sizes. This one is more obvious but often overlooked. If your content does not appear the way you want it across all browsers and devices, it will affect user experience, and in the long run, your SERP rankings.
How to Do This: To test content across platforms, open your page on different devices and in different browsers to see how it is rendered.
At a minimum, you should test for the following:
HTML markup helps highlight the different elements of your page. Structured data helps search engines read what's inside those elements and better understand their content.
Structured data is simply a series of instructions written in a simple language, such as JSON-LD, that can be inserted within the existing HTML code of your webpage. Think of it like a meta description, but for individual pieces of content on your page.
In the example below, structured data helps Google identify five attributes of a DBpedia page about John Lennon:
Source: https://json-ld.org/
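The example, as published on json-ld.org, is a short JSON-LD document: a context, an identifier, and three facts about the person.

```json
{
  "@context": "https://json-ld.org/contexts/person.jsonld",
  "@id": "http://dbpedia.org/resource/John_Lennon",
  "name": "John Lennon",
  "born": "1940-10-09",
  "spouse": "http://dbpedia.org/resource/Cynthia_Lennon"
}
```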
As you can see, the code is in simple language that is easy for both a human reader and a web crawler to understand.
Here’s another example that shows how structured data can fit right into your web page’s existing HTML code. The structured data instructions are highlighted in green.
Source: developers.google.com
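A sketch of that example, reconstructed from Google's structured-data documentation (the exact fields shown in the screenshot may differ slightly), embeds the JSON-LD inside a script tag in the page's <head>:

```html
<html>
  <head>
    <title>Party Coffee Cake</title>
    <script type="application/ld+json">
    {
      "@context": "https://schema.org",
      "@type": "Recipe",
      "name": "Party Coffee Cake",
      "author": { "@type": "Person", "name": "Mary Stone" },
      "datePublished": "2018-03-10",
      "description": "This coffee cake is awesome and perfect for parties.",
      "prepTime": "PT20M"
    }
    </script>
  </head>
  <body>
    <h2>Party coffee cake recipe</h2>
    <p>This coffee cake is awesome and perfect for parties.</p>
  </body>
</html>
```

The visible page stays untouched; the structured data rides along invisibly for crawlers to read.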
In this example, structured data tells Googlebot that this is a recipe page about a coffee cake from somebody called Mary Stone.
Using structured data in your website’s layout delivers measurable outcomes. For instance, using structured data can increase a website’s click-through rate (CTR) by up to 30%.
Using structured data also helps your page rank better on Google’s carousels, featured snippets, videos and knowledge panel features.
For Google News SEO, it’s important to include the following elements when creating structured data to provide additional value:
How to Do This: You can add structured data/schema to your content either manually or by using a plugin for your particular CMS.
All the elements of your news article should be arranged in a specific order to allow faster and easier crawling. The order is as follows:
Page experience is a measure of how users experience your page. Google has a set of parameters to quantify page experience. We’ve dedicated an entire module to page experience factors, so we’ll only briefly look at each here.
How to Do This: You can test page experience both manually and by using plugins or third-party apps. For instance, PageSpeed Insights is a handy tool that analyzes your site's performance based on Core Web Vitals (CWV) and other parameters and assigns a score based on its analysis. It also tests both mobile and desktop responsiveness.
News publishers should not publish multiple news articles under the same URL, as this obstructs Google from indexing them. Each news article should have its own unique URL.
Furthermore, these URLs should be permanent: the same news story should stay at the same URL. If the story associated with a particular URL changes frequently, Google will not be able to crawl and index it reliably. Publishers should, however, update the news story itself as often as needed.
If redirects need to be used for news articles they should be implemented according to the following best practices:
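As a rule of thumb, a permanent move is served as a 301 redirect, while a temporary one is served as a 302 so that search engines keep the original URL indexed. In nginx, for example, that might look like the sketch below (the paths are hypothetical):

```nginx
# Article permanently moved to a new URL: 301 tells Google to index the new address
location = /2023/old-headline {
    return 301 /2023/updated-headline;
}

# Article temporarily unavailable: 302 tells Google to keep the original URL indexed
location = /live-coverage {
    return 302 /live-coverage-holding-page;
}
```

Other servers and CMSs expose the same 301/302 distinction through their own redirect settings.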
While the action items listed in this section aren’t as important as those above, we still recommend implementing as many of these as possible once the mission critical points listed above have been addressed.
The <head> element of a page contains important information about the page that is not actually displayed on it: metadata that helps Googlebot identify and classify the page's contents.
As a rule, the <head> element should include only the most important tags and nothing else, so a post can be crawled and rendered properly. These include:
Anything else contained within the <head> element is likely to confuse web crawlers.
For instance, it is common for novices to confuse the title tag with <h1> and place the latter within the <head> element. As previously explained, the <head> element can only contain metadata that is not displayed on the page.
Even though title and <h1> should contain essentially the same information, the former is metadata meant for web crawlers and to be displayed within the SERP results and browser tab, while the latter is information to be displayed on the page.
The code below shows how to place title within the <head> element.
Source: developer.mozilla.org
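The distinction can be sketched in a minimal page (the headline and site name are invented for illustration): the title lives in the <head> as metadata, while the <h1> lives in the <body> as visible content.

```html
<!DOCTYPE html>
<html lang="en">
  <head>
    <meta charset="utf-8">
    <!-- Metadata only: shown in the browser tab and SERP, not on the page -->
    <title>Local Band Releases New Single | My News Site</title>
    <meta name="description" content="A short summary shown in search results.">
  </head>
  <body>
    <!-- On-page heading: displayed to the reader -->
    <h1>Local Band Releases New Single</h1>
  </body>
</html>
```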
Using page elements that make it easy to scan content and make navigation a frictionless experience for the user also impacts SEO.
An easy to navigate page will contain these elements:
Unless you are a seasoned web developer, it is best to consult with an expert on the best way to implement a user-friendly UX.
Google wants publishers to display ads without disrupting the user’s experience. For this reason it may penalize websites that display too many intrusive ads. While user experience is a subjective metric, Google has certain guidelines and best practices when it comes to ads.
Some of them relate to:
For more on ads and popups, refer to our detailed module.
JavaScript is great for creating dynamic and interactive content, but web crawlers may have difficulty rendering it.
This is because:
With news articles, it’s good practice to avoid interruptions such as related article carousels or image galleries.
Many successful publishers become concerned when relaunching or redesigning their site, as it requires Google to recrawl it. Follow these best practices to ensure a smooth transition back to normal after a redesign or relaunch:
Keep your article pages as light as possible. We've already looked at avoiding JavaScript in articles, but it's also good practice to avoid heavy HTML content.
This is because when Googlebot crawls your page, it downloads a maximum of 15 MB of page data in the first crawl. For most pages this is not a major issue, as heavyweight items such as videos and images are referenced separately within the code and indexed by Googlebot later, so they fall outside this 15 MB limit.
However, this does once again point to the fact that the lighter your page, the easier it will be for Googlebot to crawl and index it.
Tip: If you want to check the size of your page, open your browser's developer tools, switch to the Network tab, then reload the page. This displays all the requests your browser made to fully render the page. The first request on the list shows the size of your page's HTML under the Size column. For most pages on the internet, this figure is in kilobytes.
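If you prefer the command line, the same check can be sketched with curl and wc. The URL in the comment is a placeholder; to keep the commands self-contained, the demonstration below measures a small locally created file instead of a live page:

```shell
# To measure a live page you would download it first, e.g.:
#   curl -s https://example.com/article -o page.html   # placeholder URL
# Here we create a tiny local file so the commands run anywhere:
printf '<!doctype html><title>Test</title><p>Hello' > page.html

# Raw HTML bytes downloaded; compare against Googlebot's 15 MB first-crawl limit
bytes=$(wc -c < page.html)
echo "$bytes bytes"
```

Note that this measures only the raw HTML, not the images, scripts and stylesheets the page references.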
Article snippets give readers a preview of the content on the page before they click on it. Google determines the snippet to go with each article by crawling the text in the main body of the article just below the title.
To avoid the possibility of Googlebot including incorrect snippets make sure that:
Sometimes, Googlebot may either fail to index your image or index a different image than the one you intended to feature with your article. To avoid this, follow these best practices:
Googlebot uses your article’s title to correctly identify and index it. Use these following best practices to ensure that it reads your title accurately:
Let’s look at two case studies of sites that have already implemented the steps discussed in this article.
Modern news websites are complex and rich, and it would be unrealistic to expect them to adhere to these guidelines rigorously. However, in this section we’re trying to demonstrate how following the guidelines can result in predictable, measurable outcomes.
The Manly Observer is a hyperlocal news website catering to audiences in a popular beach-side suburb of Sydney, Australia. Below is what a typical news article on the site looks like:
We see the following elements of design clear and present at first glance:
Looking next at the page’s HTML code, we can see the use of semantic markup.
This is code that’s easily readable by a human. It is safe to presume that a web crawler will be able to read and interpret this code with equal ease.
The website uses the https:// scheme and has no pop-up ads or interstitials loading within the initial viewport.
Entrepreneur is a popular magazine for entrepreneurs and businesses. This is how its homepage appears.
The website is lightning fast to load and there are no pop-up ads or videos on the homepage itself. Most of the ad placement occurs on individual news articles.
When we click to “view source”, we see the following HTML code:
At a glance, we can make out the following from this code:
As we scroll down, we see the following code element:
We discussed the use of schema.org and Open Graph (OG) markup for images earlier. To recap, schema.org and OG are types of structured data that help web crawlers identify specific elements of the code more easily. Both are implemented here.
Further down, we also see structured data tags as shown below:
As with our previous example, entrepreneur.com also uses the https:// scheme in its link, has no disruptive interstitials or pop-ups, and is fast to load. The news articles follow a set format of title, image, author attribution, date and main body of content. This results in a better page experience and hence improved technical SEO.
After working through this lesson, you should be able to review and update existing news pages, optimizing their design and layout to adhere to technical SEO best practices.