Robots.txt and my Core Concrete5 is getting indexed by Google!!! ... The problem is - it doesn't look like the robots.txt has the /updates folder ...
I get this message from Google Search Console "Coverage issues detected". ... txt: The page was indexed, despite being blocked by robots.
You can double-check this by going to Coverage > "Indexed, though blocked by robots.txt" and inspecting one of the URLs listed. Then under Crawl it'll say No: ...
Investigating, I was shocked to find the entire contents of my /updates folder (over a thousand files) had been indexed by various search ...
Google's John Mueller warns that pages blocked by robots.txt could still get indexed if there are links pointing to them.
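A minimal sketch of how to check what a robots.txt file actually blocks, using Python's standard `urllib.robotparser`. The robots.txt content, the `example.com` URLs, and the helper name `is_crawl_allowed` are illustrative assumptions, not taken from any of the sites discussed here. Note that a `False` result only means a compliant crawler will not fetch the URL; as described above, the URL can still end up indexed if other pages link to it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content (an assumption for illustration only).
ROBOTS_TXT = """\
User-agent: *
Disallow: /updates/
"""


def is_crawl_allowed(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt rules let user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in (
        "https://example.com/updates/some-file.php",
        "https://example.com/about",
    ):
        print(url, "crawl allowed:", is_crawl_allowed(ROBOTS_TXT, url))
```

If the folder in question is missing from the `Disallow` rules, the check returns `True` for its URLs, which is one quick way to confirm whether robots.txt covers it at all.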
Member DavidMIRV shows us how to improve SEO by redirecting all page requests to the exact same canonical URL, regardless of whether the URL contains a ...
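The general idea of redirecting to one canonical URL can be sketched as follows. The snippet above is cut off before it says which URL variants are involved, so the normalization rules here (lowercase host, no trailing slash except on the root, no fragment) are purely illustrative assumptions and not DavidMIRV's actual method. The function names `canonical_url` and `redirect_target` are hypothetical.

```python
from typing import Optional
from urllib.parse import urlsplit, urlunsplit


def canonical_url(url: str) -> str:
    """Normalize a URL under a few assumed rules (illustrative only)."""
    parts = urlsplit(url)  # urlsplit already lowercases the scheme
    host = parts.netloc.lower()
    # Assumed rule: drop trailing slashes, but keep "/" for the site root.
    path = parts.path.rstrip("/") or "/"
    # Assumed rule: keep the query string as-is, drop the fragment.
    return urlunsplit((parts.scheme, host, path, parts.query, ""))


def redirect_target(url: str) -> Optional[str]:
    """Return the URL to 301-redirect to, or None if url is already canonical."""
    target = canonical_url(url)
    return None if target == url else target


if __name__ == "__main__":
    for url in ("HTTPS://Example.com/About/", "https://example.com/About"):
        print(url, "->", redirect_target(url))
```

A web server or application would issue a permanent redirect whenever `redirect_target` returns a value, so that every variant of a page resolves to a single address.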
Page disallowed in robots.txt but indexed by Google. How is it possible? My website faces coverage issues in Google Search Console. Check the following message by ...
Definitely sounds like a problem with query parameters being indexed, though, and it's often good to ensure these are addressed in the search ...
Google says: A robotted page can still be indexed if linked to from other sites. While Google won't crawl or index the content blocked by ...
* Versions of concrete5 8.2.1 and higher don't need the header override anymore (since this got introduced into the core and is how we want it to be);. #v1 ...