Mobile app version of vmapp.org
Login or Join
Dunderdale272

: Bingbot trying to access lower case urls when i am using a mixture of uppercase and lowercase I have a website (dynamic real estate website) that has been up for nearly a year, all of a

@Dunderdale272

Posted in: #301Redirect #LocalSeo #Url

I have a website (dynamic real estate website) that has been up for nearly a year, all of a sudden Bingbots are trying to access completely lower case versions of the main pages resulting in 404 errors.

The base url is lowercase but subsequent pages are a mixture.
example.com/Houses-for-sale-in-Suburb-STATE-3345.php

I have read up on rel canonical and 301 redirects and since have learned that using uppercase is not a good idea - although I decided to use them because states in my country are expressed in UPPERCASE

I am not sure how to handle this, if I convert all of the uppercase URLs to lower case then the bots will still try to access the uppercase versions.

10.02% popularity Vote Up Vote Down


Login to follow query

More posts by @Dunderdale272

2 Comments

Sorted by latest first Latest Oldest Best

 

@Merenda212

Stephen Ostermiller gave me the answer to proceed with testing and to implement the following in my .htaccess

RewriteEngine On # Turn on the rewriting engine

Redirect 301 /houses-for-sale-in-suburb-state-3345.php example.com/Houses-for-sale-in-Suburb-STATE-3345.php


I wrote a redirect for every single page that had CAPS in the URL, except for the dynamically created URL's, which are driven by RewriteRules in the .htaccess & put the Redirects above the RewriteRules

10% popularity Vote Up Vote Down


 

@Megan663

When I worked for TripAdvisor, we saw this problem all the time. TripAdvisor has mixed case URLs like:
www.tripadvisor.com/Hotels-g60763-New_York_City_New_York-Hotels.html

Many crawlers including BingBot would try to lower case the entire URL before fetching it. To fix the issue we made sure that all lowercase URLs would redirect to the mixed case version. So the following gives a 301 redirect:
www.tripadvisor.com/hotels-g60763-new_york_city_new_york-hotels.html

We used the ID in the URL to figure out the correct URL and redirect. It looks like you have a numeric id in your URL that you could use as well.

Because of this experience, I recommend that the canonical version of URLs always be all lowercase. I don't recommend using mixed case URLs because there are so many badly programmed crawlers.

StackExchange does something similar. If you request a butchered URL with the id still intact, you will get a redirect to the correct thing. You can test this by fetching the following URL for this question:
webmasters.stackexchange.com/questions/100613/BINGBOT-IS-DUMB

I also recommend that numeric ids go near the front of the URL because of truncation issues. You can correct for

/Hotels-g60763-New_Yor...


but not as easily for:

/Houses-for-sale-in-Su...

10% popularity Vote Up Vote Down


Back to top | Use Dark Theme