Tag Archives: Search Engines

Cult of Mac Caught in Google Content Farm Crossfire

Google recently made some tweaks to their algorithm in order to penalize content farms, which create massive amounts of low-quality content tuned to rank well in Google. If you’ve ever run a search, looking for a solution to a problem, and found the SERP to be full of not-really-helpful results from places like eHow and Squidoo, you know what they’re trying to fix.

Unfortunately, Google’s changes have been affecting legit blogs. One noteworthy example is Cult of Mac, a blog that aims to provide “timely news, insightful analysis, helpful how-tos and honest product reviews about Apple and Apple-related products.”

Cult of Mac has experienced the opposite of Google’s goal: their content has largely disappeared from Google’s SERPs, while content farms and spam-blogs scraping Cult of Mac posts have been pushed to the top.

A lot of our traffic came from Google, which is why the changes are so serious. I’m already seeing a big drop-off in traffic. Over the weekend and today, the traffic is half what it normally would be.

I’m pissed because we’ve worked our asses off over the last two years to make this a successful site. Cult of Mac is an independently owned small business. We’re a startup. We have a small but talented team, and I’m the only full timer. We’re busting our chops to produce high-quality, original content on a shoestring budget.

Indeed, Cult of Mac does break a lot of stories. Along with the Boy Genius Report, Mac Rumors and 9 to 5 Mac, they together are the source of the lion’s share of Apple-related reporting. It’s strange that Google’s algorithm would red-flag them as a content farm. Perhaps it is a result of “splogs” scraping their content; maybe a glitch in Google’s secret algorithm is causing one of the spam blogs to be marked as the original source for some reason or another?

Google Instant: Search as You Type

This past Wednesday, Google started rolling out the latest evolution of their search engine. “Google Instant,” as they call it, enables you to execute your searches significantly faster. It’s like the existing Suggest feature, only more so. The search results appear as you type, updating as you go.

This video demonstrates it well:

It takes a little time to get used to, but it actually does seem to be quicker. There isn’t a noticeable lag while results load, even on a 1mbps DSL line. The feature does disable itself on slower connections, though.

Google Instant: It Searches Before You Type [TechCrunch]

Google and NoFollow

There has been some recent buzz about Google changing how they hande the NoFollow attribute, and how it’s the end of the world. SEOs are running around in circles, and there’s talk of moving to JavaScript and iframe commenting systems so Google can’t read them.

First of all, slow down. Google can read iframes and JavaScript. Google very likely is already programmatically diferentiating between comments and page content anyway, and tweaking their rankings accordingly. PageRank is not a cut-and-dry “links in minus links out equals rank.” It’s a complex system involving a lot of calculations on Google’s part. I imagine they value links differently depending on what sort of link it is.

Second, stop worrying about it. Google’s business is to provide good results. If everyone tries to cheat the system with a bunch of silly schemes (such as PageRank sculpting and JavaScript links) Google will just change things again. If everyone plays by the rules, and doesn’t worry about their ranking too much, it all works out in the end.

Microsoft’s “Bing” Search Engine Isn’t So New After All…

Microsoft recently launched their new search engine “Bing,” in an attempt to compete in the arena that Google has pretty much already won. There was a bit of talk leading up to the relatively quiet launch, which promptly disappointed me.

Microsoft Bing

Continue reading →

Google Adds Ranking Data to Referrer String

Google has been rolling out changes to the way their referrer strings are structured. They are moving from a simple URL that shows the search query to a more complex one with some extra information that may be valuable.

Starting this week, you may start seeing a new referring URL format for visitors coming from Google search result pages. Up to now, the usual referrer for clicks on search results for the term “flowers”, for example, would be something like this:

http://www.google.com/search?hl=en&q=flowers&btnG=Google+Search

Now you will start seeing some referrer strings that look like this:

http://www.google.com/url?sa=t&source=web&ct=res&cd=7&url=http%3A%2F%2Fwww.example.com%2Fmypage.htm&ei=0SjdSa-1N5O8M_qW8dQN&rct=j&q=flowers&usg=AFQjCNHJXSUh7Vw7oubPaO3tZOzz-F-u_w&sig2=X8uCFh6IoPtnwmvGMULQfw

Patrick Altoft of BlogStorm has noticed an interesting addition to the string. He thinks that the cd=7 part stands for “click detail 7,” and is the ranking for your page. So if someone clicked through from Google to your site, your analytics software could collect the referrer string, and determine not just what the user searched for when they found your site, but what the page ranked!

This is valuable information for Search Engine Optimization, and makers of traffic statistics software, certainly.

Yahoo Monetizes Yahoo BOSS

Yahoo is currently readying the next major update to it’s BOSS Search API. With it they will bring access to SearchMonkey data, optional longer abstracts, and greater flexibility for monetization. They will also be tracking API usage, and charging nominally for monthly usage greater than 10,000 queries.

Since launch, the BOSS API has been provided entirely for free. Now Yahoo is putting in place a freemium model where it’ll be free only for developers who generate fewer than 10,000 queries per day. After that, a tiered pricing model will kick in that charges for BOSS as if it were a utility (think AWS). Rates will vary depending on the type of query (web result vs. spelling correction, for example), how many results the developer wants returned per query (with a new maximum of 1000 results), and just how far the developer goes over the free queries cap. (Source: TechCrunch)

It seems reasonable, and Yahoo certainly is improving upon the service. Ten thousand queries for month is a pretty fair ceiling in my opinion. It’s plenty for experimentation or development, or even a web app of reasonably small size.

If Yahoo can leverage BOSS to save themselves from possible bankruptcy, and still have a unique and powerful service, that’s definitely fine by me.

Now pay attention Google: You dumped your API back in 2006, and it’s going to come back to haunt you unless you follow Yahoo’s lead. After a long draught, Yahoo’s getting their game back on, and they’re out to with the developers over with all of their tools.

Yahoo BOSS Search “Vertical Lense”

Remember Yahoo’s BOSS Search API? Well, they’re not done with it yet. A new expansion to the service has just launched, though it is currently available only to “certain Yahoo! partners,” with a promise that it will eventually become part of the main BOSS API.

The new service is called “Vertical Lense Technology.” It allows for topical search engines to be created. These search engines, as seen on TechCrunch, the only site to use the service so far, can

  • Index the partner site in real-time. Whenever a post or comment is added, it’s sent to Yahoo to be instantly indexed.
  • Allow ranking tweaks. When you search on TechCrunch, you are shown a carefully adjusted mix of web and Crunch Network results, ordered in a way that hopefully fits their audience.

TechCrunch describes their experiences with Vertical Lense on their website.

Since we publish using WordPress, supplying our data using the first API essentially required that we design and deploy a plugin that would send information to Yahoo’s servers every time there was a new post or comment on any of our blogs. We also needed to create a similar data indexing system for CrunchBase so that contributions there would show up in the results as well. To ensure that all of our archived content was incorporated in the search index, we supplied Yahoo with historical data dumps from all 10 sites. Perhaps needless to say, this took a considerable amount of time just to ensure that the data we indexed at Yahoo was accurate and complete.

You can read more about this on the Yahoo Search Blog.

Now, when can I get one of these? :D

Duplicate Content

We always hear about how Google doesn’t like duplicate content, and will penalize a page that has the same content as another. There are plenty of articles on optimizing sites to avoid having duplicate content internally, and articles ranting about scrapers.

What I want to know is what Google thinks about duplicate content cases such as Reference.com or the Associated Press.

Head over to Reference.com, the encyclopedia branch of the Ask.com network of reference sites. Enter a search term. Now go over to Wikipedia and enter the same search term. They’re the same! Reference.com is pulling Wikipedia articles onto their site and throwing in a few ads. (How are they doing this? Does Wikipedia have some sort of API?) What does Google think of this?

Continue reading →

Sikbox – Live Search Made Easy

“Live Search” is a term that people started using somewhere along the line to refer to AJAX-y search forms that display results as you type, rather than taking you to a results page. Kind of like Apple’s Spotlight search in OSX, which I have to say works great.

Wouldn’t it be cool to have something like that on your blog? A search form that, as you type, displays the results in a dropdown instead of a results page? That’s where Sikbox comes in.

Sikbox is a Yahoo BOSS app that allows you to create a cut and paste live search solution. It’s free, easy to install, and you can even apply your own CSS styles to it (or pick from one of the pre-made themes). You can search the entire web with it, or limit it to your website, like most of us would probably do.

Here’s a screenshot of it in action:

If you click on one results, you jump right to it. It works pretty good. It seems to work a lot better than the default WordPress search system too (providing your blog is in the Yahoo index).

Cuil, The New Search Engine That Isn’t… Yet

Over the past few days, the blogosphere has been, well, blogging, about a new search engine developed by a couple ex-Google employees. Known as “Cuil,” which is apparently a Gaelic word for “Knowledge,” the search engine is attempting to be “the next Google.” (Side note: In case you didn’t know, it’s pronounced “Cool,” not “Quill.”)

I think the site has potential, if they put some serious work into it, but I don’t think they’ll be able to topple Google from their high throne. Google is, and probably will always be, the king of the serch engines. No other search site has become a synonym for “search.” No other search site has ever become a common colloquial verb. (Have you ever heard someone say “I Yahoo-ed it to see what it meant”?) Then you have the Google muscle memory, where you automatically type “Google.com” when you need to search something, the tight integration in Firefox, and all the other services in the Google empire (I’m so used to GMail/Google Apps I can baarely stand “normal” mail clients). And that’s leaving out the fact that I’ve never found a search engine that works as well. Cuil may one day become a major player, but it’s not going to take on Google.

Cuil has an innovative new interface, featuring three columns of search result blocks, which seems to be an effective way to display results, and “related” images are displayed alongside the results. They also have tabs along the top, which are supposed to lead to results in similar topics, and there is also a “drill-down” box to help you refine your query.

Continue reading →