Posted by randfish

It’s been a big month for false positives and getting caught with spam, and I’ve never been one to break up a theme. Short post, but an important one that every dev team should be aware of.

The story starts with a smart SEOmoz member, Per Svanström, getting stumped by a perfectly legitimate, white hat subdirectory, with plenty of PageRank, dropping out of Google’s index:

Birdstep Database out of Google

You can see from the image that the single URL was dropped, but a site:birdstep.com/database query reveals that in fact, all of those pages are out of the index. Time for some detective work.

Jane & I spent a few minutes trying to puzzle out if bad links were pointing in or if the pages were somehow cloaking or violating TOS. As we were digging through the backlink profile, we saw that, naturally, the birdstep.com domain was linking to the subdirectory on most every page. When we viewed the source code of those pages (for example, the homepage - www.birdstep.com), we saw something strange. Below is the tail end of the source code for their top nav bar:

<li class="menuObject"><a href="http://www.birdstep.com/Corporate/"><img src="/images/menu/Corporate.gif" border="0" alt="Corporate" /></a></li>
<li class="menuObject"><a href="http://www.birdstep.com/Contact-us/"><img src="/images/menu/Contact_us_active.gif" border="0" alt="Contact us" /></a></li>
<li class="menuObject"><a href="http://www.birdstep.com/database/"><img src="/images/menu/Database.gif" border="0" alt="Database" /></a></li>

Looks fine, right? Just a regular menu serving up images as the clickable link. Only problem is…

Notice the navbar? See the missing link? That’s where the "database" section should be linked-to, only the image is missing. Apparently, it was just a design mistake and so they used a 1×1 pixel gif until they could get it fixed. There are plenty of other visible links in the content body of many pages over to the database section, but that top link in the navbar is invisible - technically violating Google’s rules. Despite the fact that plenty of other sites and pages link to the database section legitimately, and Birdstep certainly has no reason or intention to hide that link (other than a miscalculation on pixel width), the whole subdirectory was removed from the index.

Luckily, we caught it, Birdstep has removed the link and they’ll hopefully have the subdirectory re-included in the near future. They also generously gave us permission to discuss the Q+A issue on the blog, which we very much appreciate. I think this serves as a wise warning to developers and designers everywhere - unintentional, white-hat spirited mistakes can be just as dangerous and have just as dire consequences as black hat manipulation. Watch your code!

One more point of interest - in seraching around on this issue, I noticed that a Google search for http://www.birdstep.com/database/. (with the added period at the end) brought up this result:

Birdstep database search with trailing period

I ran another query on a page I know was removed from the index, and it also yielded a result like the one above (unfortunately, I can’t share that page publicly). It’s possible that this might help diagnose future pages that are removed for bad behavior and exhibit similar symptoms - definitely not a bad query to have in your arsenal if it really does work consistently.

BTW - We had a meeting today with our investors, who asked if maybe, just maybe, I could do a little more to promote the PRO services in the blog. Usually, I try to keep them a bit church and state, but some good points were made about how it might actually do a disservice to SEOmoz readers not to expose the cool tools and content we offer. Thus, I’m making a point of noting here that Birdstep got their issue solved (or at least diagnosed) thanks to the Q+A section in SEOmoz PRO. I do think we offer a good service, and I really do believe in it; I think I’m just a bit shy about self-promotion.

Do you like this post? Yes No

Tags: Code, Content, Design, Google, Images, Links, PageRank, Plugins, Promotion, SPAM, Tools, Wordpress