I would like Google to ignore URLs like this:
http://www.mydomain.example/new-printers?dir=asc&order=price&p=3
In other words, all the URLs that have the parameters dir, order and price should be ignored. How do I do so with robots.txt?
Here's a solution if you want to disallow all query strings:
Disallow: /*?*
or if you want to be more precise on your query string:
Disallow: /*?dir=*&order=*&p=*
You can also specify in robots.txt which URL to allow:
Allow: /new-printer$
The $ makes sure that only /new-printer itself is allowed.
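To see how the * and $ wildcards behave, here is a minimal Python sketch that approximates Google-style pattern matching for a single rule. The helper name rule_matches is hypothetical, and the sketch ignores percent-encoding and Allow/Disallow precedence; it only shows which paths a given pattern would match.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Approximate Google-style robots.txt matching for one rule (hypothetical helper).

    '*' matches any run of characters, a trailing '$' anchors the end of
    the URL, and everything else is matched literally from the start.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape regex metacharacters, then turn the escaped '*' back into '.*'.
    regex = re.escape(pattern).replace(r"\*", ".*")
    if anchored:
        regex += "$"
    return re.match(regex, path) is not None


url = "/new-printers?dir=asc&order=price&p=3"
print(rule_matches("/*?*", url))                   # True
print(rule_matches("/*?dir=*&order=*&p=*", url))   # True
print(rule_matches("/new-printer$", "/new-printer"))  # True
print(rule_matches("/new-printer$", url))          # False
```

Under these assumptions the more precise pattern depends on parameter order: a URL such as /new-printers?order=price&dir=asc&p=3 would not match /*?dir=*&order=*&p=*.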
More info:
http://code.google.com/web/controlcrawlindex/docs/robots_txt.html
http://sanzon.wordpress.com/2008/04/29/advanced-usage-of-robotstxt-w-querystrings/
/new-printer but not /new-printers?dir=*&order=*&p=*? – Audriaaudrie
robots – Bordelaise
You can block those specific query string parameters with the following lines:
Disallow: /*?*dir=
Disallow: /*?*order=
Disallow: /*?*p=
So if any URL contains dir=, order=, or p= anywhere in the query string, it will be blocked.
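One thing to keep in mind is that these rules match substrings, not whole parameter names. The following rough Python sketch (not Google's implementation) illustrates this: blocked is a hypothetical helper that treats * as "any run of characters" and reports whether any Disallow pattern matches a URL path. LOOSE holds the rules above, and STRICT is an assumed variant that only matches a parameter name directly after ? or &.

```python
import re
from typing import Iterable


def blocked(path: str, disallow_patterns: Iterable[str]) -> bool:
    """Return True if any Disallow pattern matches the path (hypothetical helper)."""
    for pattern in disallow_patterns:
        # Escape regex metacharacters, then map the robots.txt '*' to '.*'.
        regex = re.escape(pattern).replace(r"\*", ".*")
        if re.match(regex, path):
            return True
    return False


# The rules from the answer: the parameter name may appear anywhere after '?'.
LOOSE = ["/*?*dir=", "/*?*order=", "/*?*p="]

# Assumed stricter variant: the name must directly follow '?' or '&'.
STRICT = [
    "/*?dir=*", "/*?order=*", "/*?p=*",
    "/*&dir=*", "/*&order=*", "/*&p=*",
]

for path in ["/new-printers?dir=asc&order=price&p=3",
             "/?top=test",
             "/list?reorder=1"]:
    print(path, blocked(path, LOOSE), blocked(path, STRICT))
```

In this sketch the first URL is blocked by both rule sets, while /?top=test and /list?reorder=1 are blocked only by the loose rules, because top= contains p= and reorder= contains order=.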
example.com?p=test but also example.com?top=test. – Saucy
Disallow: /*?dir=*
Disallow: /*?order=*
Disallow: /*?p=*
Disallow: /*&dir=*
Disallow: /*&order=*
Disallow: /*&p=*
– Kaffir
? be ignored? – Traceable
Disallow: /*?*order=, will it also disallow requests that contain reorder=? – Heraclea
Register your website with Google WebMaster Tools. There you can tell Google how to deal with your parameters.
Site Configuration -> URL Parameters
You should have the pages that contain those parameters indicate that they should be excluded from indexing via the robots meta tag, e.g. <meta name="robots" content="noindex">