Last updated February 2020. Originally published May 2015.
So, I admit it: when we started looking at our own blog traffic, we realized this was historically one of the most popular posts on the Seer domain. After a brief moment of reflection (and a swell of enthusiasm for the Screaming Frog SEO Spider, a tool that has been a loyal companion throughout our technical SEO work), we realized we were doing a disservice--both to our readers and to the many leaps forward Screaming Frog has made since this guide was first written.
Though this original guide was published in 2015, in the years since, Screaming Frog has evolved to offer a whole suite of new features and simplified steps to conduct technical audits, check a site’s health, or simply get a quick glimpse of info on a selection of URLs.
Below, you’ll find an updated guide to how SEOs, PPC professionals, and digital marketing experts can use the tool to streamline their workflow.
To get started, simply select what it is that you are looking to do:
Basic Crawling
- I want to crawl my entire site
- I want to crawl a single subdirectory
- I want to crawl a specific set of subdomains or subdirectories
- I want a list of all of the pages on my site
- I want a list of all of the pages in a specific subdirectory
- I want to find all of the subdomains on a site and verify internal links
- I want to crawl an e-commerce site or other large site
- I want to crawl a site hosted on an older server
- I want to crawl a site that requires cookies
- I want to crawl using a different user agent
- I want to crawl pages that require authentication
Internal Links
- I want information about all of the internal and external links on my site (anchor text, directives, links per page etc.)
- I want to find broken internal links on a page or site
- I want to find broken outbound links on a page or site (or all outbound links in general)
- I want to find links that are being redirected
- I am looking for internal linking opportunities
Site Content
- I want to identify pages with thin content
- I want a list of the image links on a particular page
- I want to find images that are missing alt text or images that have lengthy alt text
- I want to find every CSS file on my site
- I want to find every JavaScript file on my site
- I want to identify all of the jQuery plugins used on the site and what pages they are being used on
- I want to find where flash is embedded on-site
- I want to find any internal PDFs that are linked on-site
- I want to understand content segmentation within a site or group of pages
- I want to find pages that have social sharing buttons
- I want to find pages that are using iframes
- I want to find pages that contain embedded video or audio content
Meta Data and Directives
- I want to identify pages with lengthy page titles, meta descriptions, or URLs
- I want to find duplicate page titles, meta descriptions, or URLs
- I want to find duplicate content and/or URLs that need to be rewritten/redirected/canonicalized
- I want to identify all of the pages that include meta directives e.g.: nofollow/noindex/noodp/canonical etc.
- I want to verify that my robots.txt file is functioning as desired
- I want to find or verify Schema markup or other microdata on my site
Sitemap
- I want to create an XML Sitemap
- I want to create an XML Sitemap by uploading URLs
- I want to check my existing XML Sitemap
General Troubleshooting
- I want to identify why certain sections of my site aren't being indexed or aren’t ranking
- I want to check if my site migration/redesign was successful
- I want to find slow loading pages on my site
- I want to find malware or spam on my site
PPC & Analytics
- I want to verify that my Google Analytics code is on every page, or on a specific set of pages on my site
- I want to validate a list of PPC URLs in bulk
Scraping
- I want to scrape the meta data for a list of pages
- I want to scrape a site for all of the pages that contain a specific footprint
URL Rewriting
- I want to find and remove session id or other parameters from my crawled URLs
- I want to rewrite the crawled URLs (e.g., replace .com with .co.uk, or write all URLs in lowercase)
Keyword Research
- I want to know which pages my competitors value most
- I want to know what anchor text my competitors are using for internal linking
Link Building
- I want to analyze a list of prospective link locations
- I want to find broken links for outreach opportunities
- I want to verify my backlinks and view the anchor text
- I want to make sure that I'm not part of a link network
- I am in the process of cleaning up my backlinks and need to verify that links are being removed as requested
Bonus Round
Basic Crawling
How to crawl an entire site
When starting a crawl, it’s a good idea to take a moment and evaluate what kind of information you’re looking to get, how big the site is, and how much of the site you’ll need to crawl in order to access it all. Sometimes, with larger sites, it’s best to restrict the crawler to a sub-section of URLs to get a good representative sample of data. This keeps file sizes and data exports a bit more manageable. We go over this in further detail below. For crawling your entire site, including all subdomains, you’ll need to make some slight adjustments to the spider configuration to get started.
By default, Screaming Frog only crawls the subdomain that you enter. Any additional subdomains that the spider encounters will be viewed as external links. In order to crawl additional subdomains, you must change the settings in the Spider Configuration menu. By checking ‘Crawl All Subdomains’, you will ensure that the spider crawls any links that it encounters to other subdomains on your site.
Step 1:
Step 2:
In addition, if you’re starting your crawl from a specific subfolder or subdirectory and still want Screaming Frog to crawl the whole site, check the box marked “Crawl Outside of Start Folder.”
By default, the SEO Spider is only set to crawl the subfolder or subdirectory you crawl from forwards. If you want to crawl the whole site and start from a specific subdirectory, be sure that the configuration is set to crawl outside the start folder.
Pro Tip:
To save time and disk space, be mindful of resources that you may not need in your crawl. Websites link to so much more than just pages. Uncheck Images, CSS, JavaScript, and SWF resources in order to reduce the size of the crawl.
How to crawl a single subdirectory
If you wish to limit your crawl to a single folder, simply enter the URL and press start without changing any of the default settings. If you've overwritten the original default settings, reset the default configuration within the 'File' menu.
If you wish to start your crawl in a specific folder, but want to continue crawling to the rest of the subdomain, be sure to select ‘Crawl Outside Of Start Folder’ in the Spider Configuration settings before entering your specific starting URL.
How to crawl a specific set of subdomains or subdirectories
If you wish to limit your crawl to a specific set of subdomains or subdirectories, you can use RegEx to set those rules in the Include or Exclude settings in the Configuration menu.
Exclusion:
In this example, we crawled every page on seerinteractive.com excluding the ‘about’ pages on every subdomain.
Step 1:
Go to Configuration > Exclude; use a wildcard regular expression to identify the URLs or parameters you want to exclude.
Step 2:
Test your regular expression to make sure it’s excluding the pages you expected to exclude before you start your crawl:
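As a sketch, an exclude pattern for this scenario might look like the following (the path is illustrative; adjust it to your own URL structure so it matches the 'about' section on any subdomain):

.*seerinteractive\.com/about/.*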
Inclusion:
In the example below, we only wanted to crawl the team subfolder on seerinteractive.com. Again, use the “Test” tab to test a few URLs and ensure the RegEx is appropriately configured for your inclusion rule.
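A sketch of an include pattern for that scenario (assuming the team pages live under the www host; swap in your own structure):

https://www.seerinteractive.com/team/.*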
This is a great way to crawl larger sites; in fact, Screaming Frog recommends this method if you need to divide and conquer a crawl for a bigger domain.
I want a list of all of the pages on my site
By default, Screaming Frog is set to crawl all images, JavaScript, CSS, and flash files that the spider encounters. To crawl HTML only, you’ll have to deselect ‘Check Images’, ‘Check CSS’, ‘Check JavaScript’ and ‘Check SWF’ in the Spider Configuration menu.
Running the spider with these settings unchecked will, in effect, provide you with a list of all of the pages on your site that have internal links pointing to them.
Once the crawl is finished, go to the ‘Internal’ tab and filter your results by ‘HTML’. Click ‘Export’, and you’ll have the full list in CSV format.
Pro Tip:
If you tend to use the same settings for each crawl, Screaming Frog now allows you to save your configuration settings:
I want a list of all of the pages in a specific subdirectory
In addition to de-selecting ‘Check Images’, ‘Check CSS’, ‘Check JavaScript’ and ‘Check SWF’, you’ll also want to de-select ‘Check Links Outside Folder’ in the Spider Configuration settings. Running the spider with these settings unchecked will, in effect, give you a list of all of the pages in your starting folder (as long as they are not orphaned pages).
How to find all of the subdomains on a site and verify internal links
There are several different ways to find all of the subdomains on a site.
Method 1:
Use Screaming Frog to identify all subdomains on a given site. Navigate to Configuration > Spider, and ensure that 'Crawl All Subdomains' is selected. Just like crawling your whole site above, this will crawl any subdomain that is linked to from within the site. However, it will not find subdomains that are orphaned or unlinked.
Method 2:
Use Google to identify all indexed subdomains.
By using the Scraper Chrome extension and some advanced search operators, we can find all of the indexed subdomains for a given domain.
Step 1:
Start by using a site: search operator in Google to restrict results to your specific domain. Then, use the -inurl search operator to narrow the search results by removing the main domain. You should begin to see a list of subdomains that have been indexed in Google that do not contain the main domain.
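As a sketch, the query might look something like this (assuming the main site lives on the www subdomain; swap in whichever host you want to filter out):

site:seerinteractive.com -inurl:www

Each additional -inurl: operator you append removes another subdomain you've already found, surfacing the less obvious ones.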
Step 2:
Use the Scraper extension to extract all of the results into a Google Sheet. Simply right-click the URL in the SERP, click “Scrape Similar” and export to a Google Doc.
Step 3:
In your spreadsheet, use the following formula to trim each URL down to its subdomain:
=LEFT(A2,SEARCH("/",A2,9))
The formula strips off any subdirectories, page paths, or file names at the end of the URL by telling Sheets or Excel to return everything to the left of the first slash that appears after the protocol. The start number of 9 matters because SEARCH is told to begin looking for a slash at the 9th character, which skips past the protocol 'https://' (8 characters long).
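For example, with a hypothetical URL in cell A2:

A2: https://blog.seerinteractive.com/some-post/
Result: https://blog.seerinteractive.com/

The result keeps the trailing slash, which makes de-duplicating the list in the next step a little cleaner.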
De-duplicate the list, and upload it into Screaming Frog in List Mode--you can paste the list of subdomains manually or upload a CSV.
Method 3:
Enter the root domain URL into tools that look for sites hosted on the same IP, or into search engines designed specifically to find subdomains. Create a free account to log in and export a list of subdomains, then upload the list to Screaming Frog using List Mode.
Once the spider has finished running, you’ll be able to see status codes, as well as any links on the subdomain homepages, anchor text, and duplicate page titles among other things.
How to crawl an e-commerce site or other large site
Screaming Frog was not originally built to crawl hundreds of thousands of pages, but thanks to some upgrades, it’s getting closer every day.
The newest version of Screaming Frog has been updated to rely on database storage for crawls. In version 11.0, Screaming Frog allowed users to opt to save all data to disk in a database rather than just keep it in RAM. This opened up the possibility of crawling very large sites for the first time.
In version 12.0, the crawler automatically saves crawls to the database. This allows them to be accessed and opened using “File > Crawls” in the top-level menu--in case you panic and wonder where the open command went!
While using database crawls helps Screaming Frog better manage larger crawls, it’s certainly not the only way to crawl a large site.
First, you can increase the memory allocation of the spider.
Second, you can break down the crawl by subdirectory or only crawl certain parts of the site using your Include/Exclude settings.
Third, you can choose not to crawl images, JavaScript, CSS, and flash. By deselecting these options in the Configuration menu, you can save memory by crawling HTML only.
Pro Tip:
Until recently, the Screaming Frog SEO Spider might have paused or crashed when crawling a large site. Now, with database storage as the default setting, you can recover crawls and pick up where you left off. You can also access queued URLs, which may give you insight into additional parameters or rules you'll want to exclude in order to crawl a large site.
How to crawl a site hosted on an older server -- or how to crawl a site without crashing it
In some cases, older servers may not be able to handle the default number of URL requests per second. We recommend setting a limit on the number of URLs crawled per second to be respectful of a site's server. It's also best to let a client know when you're planning to crawl a site, since they may have protections in place against unknown user agents and may need to whitelist your IP or user agent before you crawl. The worst-case scenario is that you send too many requests to the server and inadvertently crash the site.
To change your crawl speed, choose ‘Speed’ in the Configuration menu, and in the pop-up window, select the maximum number of threads that should run concurrently. From this menu, you can also choose the maximum number of URLs requested per second.
Pro Tip:
If you find that your crawl is resulting in a lot of server errors, go to the 'Advanced' tab in the Spider Configuration menu, and increase the value of the 'Response Timeout' and of the '5xx Response Retries' to get better results.
How to crawl a site that requires cookies
Although search bots don't accept cookies, if you are crawling a site and need to allow cookies, simply select 'Allow Cookies' in the 'Advanced' tab of the Spider Configuration menu.
How to crawl using a different user-agent
To crawl using a different user agent, select 'User Agent' in the 'Configuration' menu, then select a search bot from the drop-down or type in your desired user agent string.
As Google is now mobile-first, try crawling the site as Googlebot Smartphone, or modify the user agent to spoof Googlebot Smartphone (a sample string follows the list below). This is important for two different reasons:
- Crawling the site mimicking the Googlebot Smartphone user agent may help determine any issues that Google is having when crawling and rendering your site’s content.
- Using a modified version of the Googlebot Smartphone user agent will help you distinguish between your crawls and Google’s crawls when analyzing server logs.
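As a sketch, a modified string might look like the following. The base is the Googlebot Smartphone user agent as Google publishes it (with W.X.Y.Z standing in for the current Chrome version), and the '-Seer' token is purely illustrative; append whatever marker makes your crawls easy to spot in the logs:

Mozilla/5.0 (Linux; Android 6.0.1; Nexus 5X Build/MMB29P) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/W.X.Y.Z Mobile Safari/537.36 (compatible; Googlebot-Seer/2.1; +http://www.google.com/bot.html)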
How to crawl pages that require authentication
When the Screaming Frog spider comes across a page that is password-protected, a pop-up box will appear, in which you can enter the required username and password.
Forms-based authentication is a very powerful feature, but it may require JavaScript rendering in order to work effectively. Note: forms-based authentication should be used sparingly, and only by advanced users. The crawler is programmed to click every link on a page, so while logged in that can mean clicking links that log you out, create posts, or even delete data.
To manage authentication, navigate to Configuration > Authentication.
In order to turn off authentication requests, deselect ‘Standards Based Authentication’ in the ‘Authentication’ window from the Configuration menu.
Internal Links
I want information about all of the internal and external links on my site (anchor text, directives, links per page etc.)
If you do not need to check the images, JavaScript, flash or CSS on the site, de-select these options in the Spider Configuration menu to save processing time and memory.
Once the spider has finished crawling, use the Bulk Export menu to export a CSV of ‘All Links’. This will provide you with all of the link locations, as well as the corresponding anchor text, directives, etc.
All inlinks can be a big report. Be mindful of this when exporting. For a large site, this export can sometimes take minutes to run.
For a quick tally of the number of links on each page, go to the 'Internal' tab and sort by 'Outlinks'. Anything over 100 might need to be reviewed.
Need something a little more processed? Check out this tutorial on calculating the importance of internal linking spearheaded by Allison Hahn and Zaine Clark.
How to find broken internal links on a page or site
If you do not need to check the images, JavaScript, flash, or CSS of the site, de-select these options in the Spider Configuration menu to save processing time and memory.
Once the spider has finished crawling, sort the 'Internal' tab results by 'Status Code'. Any 404s, 301s, or other status codes will be easy to spot.
Upon clicking on any individual URL in the crawl results, you’ll see information change in the bottom window of the program. By clicking on the 'In Links' tab in the bottom window, you’ll find a list of pages that are linking to the selected URL, as well as anchor text and directives used on those links. You can use this feature to identify pages where internal links need to be updated.
To export the full list of pages that include broken or redirected links, visit the Bulk Export menu. Scroll down to 'Response Codes' and look at the following reports:
- No Response Inlinks
- Redirection (3xx) Inlinks
- Redirection (JavaScript) Inlinks
- Redirection (Meta Refresh) Inlinks
- Client Error (4xx) Inlinks
- Server Error (5xx) Inlinks
Reviewing all of these reports should give you an adequate picture of which internal links need to be updated so that they point to the canonical version of each URL and distribute link equity efficiently.
How to find broken outbound links on a page or site (or all outbound links in general)
After de-selecting ‘Check Images’, ‘Check CSS’, ‘Check JavaScript’ and ‘Check SWF’ in the Spider Configuration settings, make sure that ‘Check External Links’ remains selected.
After the spider is finished crawling, click on the ‘External’ tab in the top window, sort by ‘Status Code’ and you’ll easily be able to find URLs with status codes other than 200. Upon clicking on any individual URL in the crawl results and then clicking on the ‘In Links’ tab in the bottom window, you’ll find a list of pages that are pointing to the selected URL. You can use this feature to identify pages where outbound links need to be updated.
To export your full list of outbound links, click ‘External Links’ on the Bulk Export tab.
For a complete listing of all the locations and anchor text of outbound links, select ‘All Outlinks’ in the ‘Bulk Export’ menu. The All Outlinks report will include outbound links to your subdomains as well; if you want to exclude your domain, lean on the “External Links” report referenced above.
How to find links that are being redirected
After the spider has finished crawling, select the ‘Response Codes’ tab from the main UI, and filter by Status Code. Because Screaming Frog uses Regular Expressions for search, submit the following criteria as a filter: 301|302|307. This should give you a pretty solid list of all links that came back with some sort of redirect, whether the content was permanently moved, found and redirected, or temporarily redirected due to HSTS settings (this is the likely cause of 307 redirects in Screaming Frog). Sort by ‘Status Code’, and you’ll be able to break the results down by type. Click on the ‘In Links’ tab in the bottom window to view all of the pages where the redirecting link is used.
If you export directly from this tab, you will only see the data that is shown in the top window (original URL, status code, and where it redirects to).
To export the full list of pages that include redirected links, you will have to choose ‘Redirection (3xx) In Links’ in the ‘Advanced Export’ menu. This will return a CSV that includes the location of all your redirected links. To show internal redirects only, filter the ‘Destination’ column in the CSV to include only your domain.
ProTip:
Use a VLOOKUP between the 2 export files above to match the Source and Destination columns with the final URL location.
Sample formula:
=VLOOKUP([@Destination],'response_codes_redirection_(3xx).csv'!$A$3:$F$50,6,FALSE)
(Where ‘response_codes_redirection_(3xx).csv’ is the CSV file that contains the redirect URLs and ‘50’ is the number of rows in that file.)
Need to find and fix redirect chains? @dan_shure gives the breakdown on how to do it here.
I am looking for internal linking opportunities
Internal linking opportunities can yield massive ROI--especially when you’re being strategic about the distribution of PageRank & link equity, keyword rankings, and keyword-rich anchors.
Our go-to resource for internal linking opportunities comes down to the impressive Power BI dashboard created by our very own Allison Hahn and Zaine Clark. Learn more here.
Site Content
How to identify pages with thin content
After the spider has finished crawling, go to the ‘Internal’ tab, filter by HTML, then scroll to the right to the ‘Word Count’ column. Sort the ‘Word Count’ column from low to high to find pages with low text content. You can drag and drop the ‘Word Count’ column to the left to better match the low word count values to the appropriate URLs. Click ‘Export’ in the ‘Internal’ tab if you prefer to manipulate the data in a CSV instead.
Pro Tip for E-commerce Sites:
While the word count method above will quantify the actual text on the page, there's still no way to tell whether that text is just product names or part of a keyword-optimized copy block. To figure out the word count of your text blocks, use ImportXML2 by @iamchrisle to scrape the text blocks on any list of pages, then count the characters from there. If XPath queries aren't your strong suit, the XPath Helper or XPather Chrome extensions do a pretty solid job of figuring out the XPath for you. Obviously, you can also use these scraped text blocks to begin to understand the overall word usage on the site in question, but that, my friends, is another post…
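As a starting point, an XPath expression for pulling a copy block might look something like this, where the class name is entirely hypothetical and needs to be swapped for whatever the target pages actually use:

//div[@class="product-description"]//text()

Test the expression on a couple of pages in your browser's developer tools before scraping an entire list.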
I want a list of the image links on a particular page
If you've already crawled a whole site or subfolder, simply select the page in the top window, then click on the 'Image Info' tab in the bottom window to view all of the images that were found on that page. The images will be listed in the 'To' column.
Pro Tip:
Right-click on any entry in the bottom window to copy or open a URL.
Alternatively, you can also view the images on a single page by crawling just that URL. Make sure that your crawl depth is set to ‘1’ in the Spider Configuration settings, then once the page is crawled, click on the ‘Images’ tab, and you’ll see any images that the spider found.
How to find images that are missing alt text or images that have lengthy alt text
First, you’ll want to make sure that ‘Check Images’ is selected in the Spider Configuration menu. After the spider has finished crawling, go to the ‘Images’ tab and filter by ‘Missing Alt Text’ or ‘Alt Text Over 100 Characters’. You can find the pages where any image is located by clicking on the ‘Image Info’ tab in the bottom window. The pages will be listed in the ‘From’ column.
Finally, if you prefer a CSV, use the ‘Bulk Export’ menu to export ‘All Images’ or ‘Images Missing Alt Text Inlinks’ to see the full list of images, where they are located and any associated alt text or issues with alt text.
Additionally, use the right sidebar to navigate to the Images section of the crawl; here, you can easily export a list of all images missing alt text.
How to find every CSS file on my site
In the Spider Configuration menu, make sure both 'Crawl' and 'Store' are selected for CSS before crawling; when the crawl is finished, filter the results in the 'Internal' tab by 'CSS'.
How to find every JavaScript file on my site
In the Spider Configuration menu, select ‘Check JavaScript’ before crawling, then when the crawl is finished, filter the results in the ‘Internal’ tab by ‘JavaScript’.
How to identify all of the jQuery plugins used on the site and what pages they are being used on
First, make sure that ‘Check JavaScript’ is selected in the Spider Configuration menu. After the spider has finished crawling, filter the ‘Internal’ tab by ‘JavaScript’, then search for ‘jquery’. This will provide you with a list of plugin files. Sort the list by the ‘Address’ for easier viewing if needed, then view ‘InLinks’ in the bottom window or export the data into a CSV to find the pages where the file is used. These will be in the ‘From’ column.
Alternatively, you can use the ‘Advanced Export’ menu to export a CSV of ‘All Links’ and filter the ‘Destination’ column to show only URLs with ‘jquery’.
Pro Tip:
Not all jQuery plugins are bad for SEO. If you see that a site uses jQuery, the best practice is to make sure that the content that you want to be indexed is included in the page source and is served when the page is loaded, not afterward. If you are still unsure, Google the plugin for more information on how it works.
How to find where flash is embedded on-site
In the Spider Configuration menu, select ‘Check SWF’ before crawling, then when the crawl is finished, filter the results in the ‘Internal’ tab by ‘Flash’.
It's increasingly important to find and identify content that is delivered via Flash so you can suggest alternative code for it. Chrome is deprecating Flash across the board, so use this check to highlight any critical content on the site that still depends on Flash.
NB: This method will only find .SWF files that are linked on a page. If the flash is pulled in through JavaScript, you’ll need to use a custom filter.
How to find any internal PDFs that are linked on-site
After the spider has finished crawling, filter the results in the ‘Internal’ tab by ‘PDF’.
How to understand content segmentation within a site or group of pages
If you want to find pages on your site that contain a specific type of content, set a custom filter for an HTML footprint that is unique to that page. This needs to be set *before* running the spider.
How to find pages that have social sharing buttons
To find pages that contain social sharing buttons, you’ll need to set a custom filter before running the spider. To set a custom filter, go into the Configuration menu and click ‘Custom’. From there, enter any snippet of code from the page source.
In the example above, I wanted to find pages that contain a Facebook ‘like’ button, so I created a filter for facebook.com/plugins/like.php.
How to find pages that are using iframes
To find pages that use iframes, set a custom filter for <iframe.
How to find pages that contain embedded video or audio content
To find pages that contain embedded video or audio content, set a custom filter for a snippet of the embed code for YouTube, or for any other media player used on the site.
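For instance, a hedged example of a footprint for standard YouTube embeds (the exact markup varies by embed method, so verify it against your own page source first):

<iframe src="https://www.youtube.com/embed/

Any page whose source contains that string will show up under the corresponding custom filter once the crawl finishes.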
Meta Data and Directives
How to identify pages with lengthy page titles, meta descriptions, or URLs
After the spider has finished crawling, go to the ‘Page Titles’ tab and filter by ‘Over 60 Characters’ to see the page titles that are too long. You can do the same in the ‘Meta Description’ tab or in the ‘URI’ tab.
How to find duplicate page titles, meta descriptions, or URLs
After the spider has finished crawling, go to the ‘Page Titles’ tab, then filter by ‘Duplicate’. You can do the same thing in the ‘Meta Description’ or ‘URI’ tabs.
How to find duplicate content and/or URLs that need to be rewritten/redirected/canonicalized
After the spider has finished crawling, go to the ‘URI’ tab, then filter by ‘Underscores’, ‘Uppercase’ or ‘Non ASCII Characters’ to view URLs that could potentially be rewritten to a more standard structure. Filter by ‘Duplicate’ and you’ll see all pages that have multiple URL versions. Filter by ‘Parameters’ and you’ll see URLs that include parameters.
Additionally, if you go to the 'Internal' tab, filter by 'HTML', and scroll to the 'Hash' column on the far right, you'll see a unique series of letters and numbers for every page. If you click 'Export', you can use conditional formatting in Excel to highlight the duplicated values in this column, ultimately showing you pages that are identical and need to be addressed.
How to identify all of the pages that include meta directives e.g.: nofollow/noindex/noodp/canonical etc.
After the spider has finished crawling, click on the ‘Directives’ tab. To see the type of directive, simply scroll to the right to see which columns are filled, or use the filter to find any of the following tags:
- index
- noindex
- follow
- nofollow
- noarchive
- nosnippet
- noodp
- noydir
- noimageindex
- notranslate
- unavailable_after
- refresh
How to verify that my robots.txt file is functioning as desired
By default, Screaming Frog complies with robots.txt. As a priority, it follows directives made specifically for the Screaming Frog user agent. If there are none, the spider follows any directives for Googlebot, and if there are no specific directives for Googlebot, it follows the global directives for all user agents. The spider only follows one set of directives, so if rules are set specifically for Screaming Frog, it ignores the Googlebot rules and any global rules. If you wish to block certain parts of the site from the spider, use the regular robots.txt syntax with the user agent 'Screaming Frog SEO Spider'. If you wish to ignore robots.txt, simply select that option in the Spider Configuration settings.
Configuration > Robots.txt > Settings
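As a minimal sketch, a robots.txt block aimed only at the SEO Spider might look like this (the /staging/ path is purely illustrative):

User-agent: Screaming Frog SEO Spider
Disallow: /staging/

Because the spider only honors one set of directives, a block like this overrides any Googlebot or wildcard rules as far as Screaming Frog's crawl is concerned.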
How to find or verify Schema markup or other microdata on my site
To find every page that contains Schema markup or any other microdata, you need to use custom filters. Simply click on ‘Custom’ → ‘Search’ in the Configuration Menu and enter the footprint that you are looking for.
To find every page that contains Schema markup, simply add the following snippet of code to a custom filter: itemtype=http://schema.org
To find a specific type of markup, you'll have to be more specific. For example, using a custom filter for <span itemprop="ratingValue"> will get you all of the pages that contain Schema markup for ratings.
As of Screaming Frog 11.0, the SEO spider also offers us the ability to crawl, extract, and validate structured data directly from the crawl. Validate any JSON-LD, Microdata, or RDFa structured data against the guidelines from Schema.org and specifications from Google in real-time as you crawl. To access the structured data validation tools, select the options under “Config > Spider > Advanced.”
There is now a Structured Data tab within the main interface that will allow you to toggle between pages that contain structured data, that are missing structured data, and that may have validation errors or warnings:
You can also bulk export issues with structured data by visiting “Reports > Structured Data > Validation Errors & Warnings.”
Sitemap
How to create an XML Sitemap
After the spider has finished crawling your site, click on the 'Sitemaps' menu and select 'XML Sitemap'.
Once you have opened the XML sitemap configuration settings, you can include or exclude pages by response code, last modified date, priority, change frequency, images, etc. By default, Screaming Frog only includes 2xx URLs, but it's a good rule of thumb to double-check.
Ideally, your XML sitemap should only include a 200 status, single, preferred (canonical) version of each URL, without parameters or other duplicating factors. Once any changes have been made, hit OK. The XML sitemap file will download to your device and allow you to edit the naming convention however you’d like.
Creating an XML Sitemap By Uploading URLs
You can also create an XML sitemap by uploading URLs from an existing file or pasting manually into Screaming Frog.
Change the ‘Mode’ from Spider to List and click on the Upload dropdown to select either option.
Hit the Start button and Screaming Frog will crawl the uploaded URLs. Once the URLs are crawled, you will follow the same process that is listed above.
How to check my existing XML Sitemap
You can easily download your existing XML sitemap or sitemap index to check for any errors or crawl discrepancies.
Go to the 'Mode' menu in Screaming Frog and select 'List'. Then, click 'Upload' at the top of the screen, choose either Download Sitemap or Download Sitemap Index, enter the sitemap URL, and start the crawl. Once the spider has finished crawling, you'll be able to find any redirects, 404 errors, duplicated URLs and more. You can easily export any of the errors identified.
Identifying Missing Pages within XML Sitemap
You can configure your crawl settings to discover and compare the URLs within your XML sitemaps to the URLs within your site crawl.
Go to 'Configuration' -> 'Spider' in the main navigation, and at the bottom there are a few options for XML sitemaps: auto-discover XML sitemaps through your robots.txt file, or manually enter the XML sitemap link into the box. Important note: if your robots.txt file does not contain proper destination links to every XML sitemap you want crawled, you should enter them manually.
Once you’ve updated your XML Sitemap crawl settings, go to ‘Crawl Analysis’ in the navigation then click ‘Configure’ and ensure the Sitemaps button is ticked. You’ll want to run your full site crawl first, then navigate back to ‘Crawl Analysis’ and hit Start.
Once the Crawl Analysis is complete, you’ll be able to see any crawl discrepancies, such as URLs that were detected within the full site crawl that are missing from the XML sitemap.
General Troubleshooting
How to identify why certain sections of my site aren't being indexed or aren’t ranking
Wondering why certain pages aren’t being indexed? First, make sure that they weren’t accidentally put into the robots.txt or tagged as noindex. Next, you’ll want to make sure that spiders can reach the pages by checking your internal links. A page that is not internally linked somewhere on your site is often referred to as an Orphaned Page.
In order to identify any orphaned pages, complete the following steps:
- Go to 'Configuration' -> 'Spider' in the main navigation, and at the bottom there are a few options for XML sitemaps: auto-discover XML sitemaps through your robots.txt file, or manually enter the XML sitemap link into the box. Important note: if your robots.txt file does not contain proper destination links to every XML sitemap you want crawled, you should enter them manually.
- Go to ‘Configuration → API Access’ → ‘Google Analytics’ - using the API you can pull in analytics data for a specific account and view. To find orphan pages from organic search, make sure to segment by ‘Organic Traffic’
- You can also go to General → ‘Crawl New URLs Discovered In Google Analytics’ if you would like the URLs discovered in GA to be included within your full site crawl. If this is not enabled, you will only be able to view any new URLs pulled in from GA within the Orphaned Pages report.
- Go to ‘Configuration → API Access’ → ‘Google Search Console’ - using the API you can pull in GSC data for a specific account and view. To find orphan pages you can look for URLs receiving clicks and impressions that are not included in your crawl.
- You can also go to General → ‘Crawl New URLs Discovered In Google Search Console’ if you would like the URLs discovered in GSC to be included within your full site crawl. If this is not enabled, you will only be able to view any new URLs pulled in from GSC within the Orphaned Pages report.
- Crawl the entire website. Once the crawl is completed, go to ‘Crawl Analysis --> Start’ and wait for it to finish.
- View orphaned URLs within each of the tabs or bulk export all orphaned URLs by going to Reports → Orphan Pages
If you do not have access to Google Analytics or GSC you can export the list of internal URLs as a .CSV file, using the ‘HTML’ filter in the ‘Internal’ tab.
Open up the CSV file, and in a second sheet, paste the list of URLs that aren’t being indexed or aren’t ranking well. Use a VLOOKUP to see if the URLs in your list on the second sheet were found in the crawl.
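A minimal sketch of that lookup, assuming the crawl export lives on a sheet named Sheet1 with URLs in column A and your underperforming URLs start in cell A2 of the second sheet:

=IF(ISNA(VLOOKUP(A2,Sheet1!A:A,1,FALSE)),"Not found in crawl","Found in crawl")

Any URL flagged as 'Not found in crawl' has no internal links pointing to it and is a likely orphan.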
How to check if my site migration/redesign was successful
@ipullrank has an excellent Whiteboard Friday on this topic, but the general idea is that you can use Screaming Frog to check whether or not old URLs are being redirected by using 'List' mode to check their status codes. If the old URLs are throwing 404s, then you'll know which URLs still need to be redirected.
How to find slow-loading pages on my site
After the spider has finished crawling, go to the 'Response Codes' tab and sort by the 'Response Time' column from high to low to find pages that may be suffering from a slow loading speed.
How to find malware or spam on my site
First, you’ll need to identify the footprint of the malware or the spam. Next, in the Configuration menu, click on ‘Custom’ → ‘Search’ and enter the footprint that you are looking for.
You can enter up to 10 different footprints per crawl. Finally, press OK and proceed with crawling the site or list of pages.
When the spider has finished crawling, select the ‘Custom’ tab in the top window to view all of the pages that contain your footprint. If you entered more than one custom filter, you can view each one by changing the filter on the results.
PPC & Analytics
How to verify that my Google Analytics code is on every page, or on a specific set of pages on my site
SEER alum @RachaelGerson wrote a killer post on this subject: Use Screaming Frog to Verify Google Analytics Code. Check it out!
How to validate a list of PPC URLs in bulk
Save your list in .txt or .csv format, then change your ‘Mode’ settings to ‘List’.
Next, select your file to upload, and press ‘Start’ or paste your list manually into Screaming Frog. See the status code of each page by looking at the ‘Internal’ tab.
To check if your pages contain your GA code, check out this post on using custom filters to verify Google Analytics code by @RachaelGerson.
Scraping
How to scrape the metadata for a list of pages
So, you’ve harvested a bunch of URLs, but you need more information about them? Set your mode to ‘List’, then upload your list of URLs in .txt or .csv format. After the spider is done, you’ll be able to see status codes, outbound links, word counts, and of course, metadata for each page in your list.
How to scrape a site for all of the pages that contain a specific footprint
First, you’ll need to identify the footprint. Next, in the Configuration menu, click on ‘Custom’ → ‘Search’ or ‘Extraction’ and enter the footprint that you are looking for.
You can enter up to 10 different footprints per crawl. Finally, press OK and proceed with crawling the site or list of pages. In the example below, I wanted to find all of the pages that say ‘Please Call’ in the pricing section, so I found and copied the HTML code from the page source.
When the spider has finished crawling, select the ‘Custom’ tab in the top window to view all of the pages that contain your footprint. If you entered more than one custom filter, you can view each one by changing the filter on the results.
Below are some additional common footprints you can scrape from websites that may be useful for your SEO audits:
- http://schema\.org - Find pages containing schema.org
- youtube.com/embed/|youtu.be|<video|player.vimeo.com/video/|wistia.(com|net)/embed|sproutvideo.com/embed/|view.vzaar.com|dailymotion.com/embed/|players.brightcove.net/|play.vidyard.com/|kaltura.com/(p|kwidget)/ - Find pages containing video content
Pro Tip:
If you are pulling product data from a client site, you could save yourself some time by asking the client to pull the data directly from their database. The method above is meant for sites that you don’t have direct access to.
URL Rewriting
How to find and remove session id or other parameters from my crawled URLs
To identify URLs with session ids or other parameters, simply crawl your site with the default settings. When the spider is finished, click on the ‘URI’ tab and filter to ‘Parameters’ to view all of the URLs that include parameters.
To remove parameters from being shown for the URLs that you crawl, select ‘URL Rewriting’ in the configuration menu, then in the ‘Remove Parameters’ tab, click ‘Add’ to add any parameters that you want to be removed from the URLs, and press ‘OK.’ You’ll have to run the spider again with these settings in order for the rewriting to occur.
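For example, adding parameters such as sessionid or utm_source (these names are illustrative; use whatever shows up in your own 'Parameters' filter) would strip query strings like ?sessionid=12345 or ?utm_source=newsletter from the crawled URLs in your reports.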
How to rewrite the crawled URLs (e.g., replace .com with .co.uk, or write all URLs in lowercase)
To rewrite any URL that you crawl, select ‘URL Rewriting’ in the Configuration menu, then in the ‘Regex Replace’ tab, click ‘Add’ to add the RegEx for what you want to replace.
Once you’ve added all of the desired rules, you can test your rules in the ‘Test’ tab by entering a test URL in the space labeled ‘URL before rewriting’. The ‘URL after rewriting’ will be updated automatically according to your rules.
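For instance, a sketch of the .com to .co.uk swap from the heading above (check it in the 'Test' tab first, since this simple pattern will also match '.com' anywhere it appears in a path):

Regex: \.com
Replace: .co.uk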
If you wish to set a rule that all URLs are returned in lowercase, simply select ‘Lowercase discovered URLs’ in the ‘Options’ tab. This will remove any duplication by capitalized URLs in the crawl.
Remember that you’ll have to actually run the spider with these settings in order for the URL rewriting to occur.
Keyword Research
How to know which pages my competitors value most
Generally speaking, competitors will try to spread link popularity and drive traffic to their most valuable pages by linking to them internally. Any SEO-minded competitor will probably also link to important pages from their company blog. Find your competitor’s prized pages by crawling their site, then sorting the ‘Internal’ tab by the ‘Inlinks’ column from highest to lowest, to see which pages have the most internal links.
To view pages linked from your competitor’s blog, deselect ‘Check links outside folder’ in the Spider Configuration menu and crawl the blog folder/subdomain. Then, in the ‘External’ tab, filter your results using a search for the URL of the main domain. Scroll to the far right and sort the list by the ‘Inlinks’ column to see which pages are linked most often.
Pro Tip:
Drag and drop columns to the left or right to improve your view of the data.
How to know what anchor text my competitors are using for internal linking
In the ‘Bulk Export’ menu, select ‘All Anchor Text’ to export a CSV containing all of the anchor text on the site, where it is used and what it’s linked to.
How to know which meta keywords (if any) my competitors have added to their pages
After the spider has finished running, look at the ‘Meta Keywords’ tab to see any meta keywords found for each page. Sort by the ‘Meta Keyword 1’ column to alphabetize the list and visually separate the blank entries, or simply export the whole list.
Link Building
How to analyze a list of prospective link locations
If you’ve scraped or otherwise come up with a list of URLs that needs to be vetted, you can upload and crawl them in ‘List’ mode to gather more information about the pages. When the spider is finished crawling, check for status codes in the ‘Response Codes’ tab, and review outbound links, link types, anchor text, and nofollow directives in the ‘Outlinks’ tab in the bottom window. This will give you an idea of what kinds of sites those pages link to and how. To review the ‘Outlinks’ tab, be sure that your URL of interest is selected in the top window.
Of course, you’ll want to use a custom filter to determine whether or not those pages are linking to you already.
You can also export the full list of outbound links by clicking on 'All Outlinks' in the 'Bulk Export' menu. This will not only provide you with the links going to external sites, but it will also show all internal links on the individual pages in your list.
For more great ideas for link building, check out these two awesome posts on link reclamation and using Link Prospector with Screaming Frog by SEER’s own @EthanLyon and @JHTScherck.
How to find broken links for outreach opportunities
So, you found a site that you would like a link from? Use Screaming Frog to find broken links on the desired page or on the site as a whole, then contact the site owner, suggesting your site as a replacement for the broken link where applicable, or simply pointing out the broken link as a gesture of goodwill.
How to verify my backlinks and view the anchor text
Upload your list of backlinks and run the spider in 'List' mode. Then, export the full list of outbound links by clicking on 'All Outlinks' in the 'Bulk Export' menu. This will provide you with the URLs and anchor text/alt text for all links on those pages. You can then use a filter on the 'Destination' column of the CSV to determine if your site is linked and what anchor text/alt text is included.
@JustinRBriggs has a nice tidbit on checking infographic backlinks with Screaming Frog. Check out the other 17 link building tools that he mentioned, too.
How to make sure that I'm not part of a link network
Want to figure out if a group of sites are linking to each other? Check out this tutorial on visualizing link networks using Screaming Frog and Fusion Tables by @EthanLyon.
I am in the process of cleaning up my backlinks and need to verify that links are being removed as requested
Set a custom filter that contains your root domain URL, then upload your list of backlinks and run the spider in ‘List’ mode. When the spider has finished crawling, select the ‘Custom’ tab to view all of the pages that are still linking to you.
Bonus Round
Did you know that by right-clicking on any URL in the top window of your results, you could do any of the following?
- Copy or open the URL
- Re-crawl the URL or remove it from your crawl
- Export URL Info, In Links, Out Links, or Image Info for that page
- Check indexation of the page in Google, Bing and Yahoo
- Check backlinks of the page in Majestic, OSE, Ahrefs and Blekko
- Look at the cached version/cache date of the page
- See older versions of the page
- Validate the HTML of the page
- Open robots.txt for the domain where the page is located
- Search for other domains on the same IP
Likewise, in the bottom window, with a right-click, you can:
- Copy or open the URL in the 'To' or 'From' column for the selected row
How to Edit Meta Data
SERP Mode allows you to preview SERP snippets by device to visually show how your meta data will appear in search results.
- Upload URLs, titles and meta descriptions into Screaming Frog using a .CSV or Excel document
- If you already ran a crawl for your site you can export URLs by going to ‘Reports → SERP Summary’. This will easily format the URLs and meta you want to reupload and edit.
- Mode → SERP → Upload File
- Edit the meta data within Screaming Frog
- Bulk export updated meta data to send directly to developers to update
How to Crawl a JavaScript Site
It's becoming more common for websites to be built using JavaScript frameworks like Angular, React, etc. Google strongly recommends using a rendering solution, as Googlebot still struggles to crawl JavaScript content. If you've identified a website built with JavaScript, follow the instructions below to crawl it.
- 'Configuration' → 'Spider' → 'Rendering' → 'JavaScript'
- Change the rendering preferences depending on what you're looking for. You can adjust the timeout and the window size (mobile, tablet, desktop, etc.)
- Hit OK and crawl the website
Within the bottom navigation, click on the Rendered Page tab to view how the page is being rendered. If your page is not being rendered properly, check for blocked resources or extend the timeout limit within the configuration settings. If neither option helps resolve how your page is rendering, there may be a larger issue to uncover.
You can view and bulk export any blocked resources that may be impacting crawling and rendering of your website by going to ‘Bulk Export’ → ‘Response Codes’
View Original HTML and Rendered HTML
If you'd like to compare the raw HTML and rendered HTML to identify any discrepancies or to ensure important content is located within the DOM, go to 'Configuration' → 'Spider' → 'Advanced' and check both 'Store HTML' and 'Store Rendered HTML'.
Within the bottom window, you will be able to see the raw and rendered HTML. This can help identify issues with how your content is being rendered and viewed by crawlers.
Tell us what else you've discovered!
Final Remarks
In closing, I hope that this guide gives you a better idea of what Screaming Frog can do for you. It has saved me countless hours, so I hope that it helps you, too!
By the way, I am not affiliated with Screaming Frog; I just think that it's an awesome tool.
Still nerding out on technical SEO?
Check out our open positions.