What Is a Soft 404?
A soft 404 is a page that returns a success (HTTP 200) status code while showing content that tells the user the page does not exist, which confuses search engines.
The name describes the mismatch at the heart of the problem. A normal “page not found” response carries a 404 error in its header, so any program reading the page knows the content is missing. A soft 404 hides that signal. The server sends a 200 OK, which means “here is a real page,” but the visible content reads something like “Sorry, this product is no longer available” or “Nothing matches your search.” The header says one thing and the body says the opposite.
Search engines treat the status code as the authoritative signal, then check whether the content agrees with it. When it does not, they flag the URL as a soft 404 and usually keep it out of their index. The page wastes resources without earning visibility, which is why soft 404s are worth finding and fixing.
Soft 404 vs a Real (Hard) 404
The difference is the header, not the wording on the screen.
| Type | Status code | What the crawler does |
|---|---|---|
| Hard 404 | HTTP 404 (or 410) | The header itself reports that the resource is gone. Crawl systems trust this immediately and drop the URL from indexing queues. |
| Soft 404 | HTTP 200 | The header reports success even though the page tells humans the content is missing. Search engines have to guess that the page is empty by reading the body. |
A correct 404 is not an error in the SEO sense. It is the expected, honest response when a URL no longer maps to anything. A soft 404 is the broken case, because the machine-readable signal and the human-readable message disagree.
Why Soft 404s Hurt SEO
Soft 404s damage two things: how efficiently a site is crawled and how cleanly it is indexed.
Wasted Crawl Budget
Every site has a finite crawl budget, the number of URLs a search engine will fetch in a given window. Because a soft 404 returns 200, the crawler keeps treating it as a live page worth revisiting. It re-fetches empty URLs on a schedule, spending requests that should have gone to real content. On large sites with many out-of-stock products or empty filters, this adds up fast.
Polluted Indexing
Soft 404s confuse indexing decisions. The search engine has to spend evaluation effort deciding whether each “successful” page is genuinely empty. Pages that should have been cleanly removed instead sit in a gray zone, sometimes lingering in the index, sometimes triggering quality reviews of surrounding URLs. The result is a noisier site profile and slower recognition of the pages that actually matter.
Common Causes of Soft 404s
Most soft 404s come from a handful of recurring patterns.
Empty content served with 200
A template renders a “no results” or “product unavailable” message but the framework never changes the status code from 200.
Redirecting deleted pages home
A removed URL is sent via 301 redirect to the home page or a generic landing page. The destination loads fine, so search engines see an irrelevant 200 and label the original a soft 404.
Thin content
A page like an empty tag archive or a stub article technically loads but holds almost no useful text, so it can be judged as having no real content. See thin content.
Empty search and filter results
Faceted navigation and on-site search can generate countless URLs that match nothing yet still return 200.
Broken templates or queries
A failed database call or broken template swallows the error and renders a blank shell with a success header.
How Google Detects Soft 404s
Google does not rely on the status code alone for these pages. After fetching a URL that returns 200, it evaluates the rendered content and looks for signals that the page is effectively empty.
What the System Looks For
- Telltale phrases such as “page not found,” “no longer available,” “0 results,” or “sorry.”
- Near-empty main content where boilerplate (header, footer, navigation) dominates and the unique body is negligible.
- Redirects to unrelated pages, most often a blanket redirect of dead URLs to the homepage.
When these signals are strong, Google reclassifies the URL as a soft 404 regardless of the 200 header. A soft 404 is a classification, not a header your server sends. It is the search engine’s verdict that your “successful” page is really a not-found page in disguise.
How to Find and Fix Soft 404s
The fastest source is Google’s own reporting, backed up by a crawler for confirmation, then a status-code fix that matches the URL’s true state.
1
Read the Page Indexing report
Google Search Console reports soft 404s directly. Open the Page Indexing report and look for the “Soft 404” reason under URLs that are not indexed. Each entry is a page Google fetched as 200 but judged empty.
2
Crawl and verify
Run a site crawler and look for 200 pages with very low word counts or “not found” phrasing. Spot-check the response header to confirm 200 versus 404, and watch for a wave of empty filter, search, or pagination URLs that signals a systematic cause.
3
Return 404 or 410 for gone content
If the page should not exist, serve a real not-found response. Use a 404 when the URL might return later, or a 410 Gone when removal is permanent. Both lead to faster, cleaner de-indexing than a soft 404 ever will.
4
Use 301 only for true equivalents
If the content genuinely moved, 301 redirect the old URL to the closest matching page, not a generic homepage. A blanket redirect of every dead URL to the home page recreates the soft 404 problem.
5
Restore or block the rest
For pages flagged as empty that should exist, add real content and fix the broken template or failed query. For empty search and filter results, prevent them from being indexable using robots directives or canonical handling so they never reach the crawler.
The fix is always to make the header tell the truth. Once the status code matches what should happen to the URL, the soft 404 stops being a problem.
Which Status Code to Use
| Situation | Correct response | Why |
|---|---|---|
| Page gone, may return later | 404 Not Found | Honest not-found signal that allows the URL to come back. |
| Page gone permanently | 410 Gone | Stronger signal that prompts faster removal from the index. |
| Content moved to an equivalent | 301 redirect | Passes the request to the real replacement page, not a generic homepage. |
| Page should exist but looks empty | 200 with real content | A genuine page with substance will not be classified as a soft 404. |
Last Thoughts on Soft 404
Key Takeaways
- A soft 404 is a page that returns HTTP 200 while its content says the page does not exist, so the header and the body disagree.
- The only difference between a soft 404 and a hard 404 is the status code: 200 versus 404 or 410.
- Soft 404s waste crawl budget and pollute the index, because crawlers keep treating empty URLs as live pages.
- Common causes are empty pages served with 200, redirects of dead URLs to the homepage, and thin content.
- Google detects them by reading the rendered page, not just the header, and flags pages that are effectively empty.
- Find them in the Google Search Console Page Indexing report, then confirm with a crawler.
- Fix them by returning the correct status code: 404 or 410 for gone content, a relevant 301 for moved content, or real content for pages that should stay.
Soft 404s are a signaling problem, not a content problem in most cases. Once each URL returns a status code that matches its true state, crawlers spend their budget on pages that count and the index reflects what your site actually offers.
Frequently Asked Questions (FAQs)
What is the difference between a soft 404 and a regular 404?
A regular (hard) 404 returns an HTTP 404 status code in the header, so search engines know the page is missing. A soft 404 returns a 200 success code while showing not-found content, so the header and the content disagree. The body can look identical; only the status code differs.
Why is a soft 404 bad for SEO?
It returns 200, so search engines keep crawling and evaluating an empty URL as if it were a live page. That wastes crawl budget and clutters the index with pages that should have been removed, which slows recognition of your real content.
How do I fix a soft 404?
Make the status code match reality. Return a 404 or 410 if the page should be gone, a relevant 301 redirect if the content truly moved to an equivalent page, or add real content if the page should exist and was flagged for being empty.
What causes a soft 404?
Common causes are empty or “no results” pages served with a 200 code, redirecting deleted URLs to the homepage, thin content with almost no body text, and empty internal search or filter result pages.
Does Google Search Console report soft 404s?
Yes. The Page Indexing report lists “Soft 404” as a reason under URLs that were not indexed. Each listed URL is a page Google fetched as 200 but judged to be effectively empty.
Is redirecting a deleted page to the homepage a soft 404?
Usually yes. Search engines see the original URL resolve to an unrelated homepage and treat that as a soft 404. Redirect dead URLs only to a closely matching replacement; if there is no equivalent, return a 404 or 410 instead.
Do soft 404s hurt rankings?
Not directly as a ranking penalty, but they waste crawl budget and can keep useful pages from being indexed promptly. On large sites, many soft 404s add noise that makes it harder for search engines to focus on your valuable pages.
What is the difference between 404, 410, and a soft 404?
A 404 means not found and possibly temporary. A 410 means gone and permanent, prompting faster removal. A soft 404 is neither a real header response nor a deliberate choice; it is a 200 page that search engines judge to be empty, which is the case you want to avoid.
How does Google detect soft 404s?
After fetching a 200 page, Google evaluates the rendered content for signals such as “not found” phrasing, near-empty main content, or redirects to unrelated pages. If the page looks empty despite the success code, it is classified as a soft 404.
Can thin content be a soft 404?
Yes. A page that loads with a 200 but contains almost no meaningful content, such as an empty archive or a stub page, can be classified as a soft 404 because the search engine sees no real content behind the successful response.
Should empty category pages return a 404?
It depends. If a category is permanently empty and should not exist, return a 404 or 410. If it is temporarily empty but valid, keep it live with helpful content or guidance so it is not judged empty, and prevent throwaway filter combinations from being indexed.
How do I find soft 404s on my site?
Start with the Google Search Console Page Indexing report and filter for the “Soft 404” reason. Then run a site crawler to flag 200 pages with very low word counts or not-found phrasing, and confirm by checking each URL’s response header.
Want to Go Deeper into SEO?
Explore more from my SEO knowledge base:
▪️ SEO & Content Marketing Hub — Learn how content builds authority and visibility
▪️ Search Engine Semantics Hub — A resource on entities, meaning, and search intent
▪️ Join My SEO Academy — Step-by-step guidance for beginners to advanced learners
Whether you’re learning, growing, or scaling, you’ll find everything you need to build real SEO skills.
Feeling stuck with your SEO strategy?
If you’re unclear on next steps, I’m offering a free one-on-one audit session to help and let’s get you moving forward.
Table of Contents
Toggle