A Powerful Analytics Tip Every Website Should Employ

Posted by randfishHow many presentations do you see that show traffic stats like these? Or this: Or this: These charts aren’t wrong, per se. They’re not lying to you, but they are obscuring the truth, and they’re making it impossible to know what’s…

Posted by randfish

How many presentations do you see that show traffic stats like these?

New vs. Returning Visits

Or this:

Pageviews Over Time

Or this:

Referring Site Traffic

These charts aren't wrong, per se. They're not lying to you, but they are obscuring the truth, and they're making it impossible to know what's going right and wrong.

The problem isn't that the numbers are inaccurate, it's that no website is just ONE SITE. A website is a collection of pages, and oftentimes, a collection of lots of different KINDS of pages. Even the simplest of sites, built on blog CMS' like Wordpress or basic CMS' like Drupal have unique sections within them - the homepage, individual posts, static pages (about, contact, et al.), categories, search pages, posts by month, author, etc. - all of these have different formats, different functions and, almost certainly, different visitor stats.

Yet, for some reason, when we as marketers look at a site, we don't ask "how are the category pages doing this month?" or "how is the blog performing compared to the white paper articles?" We ask, "how's the site doing."

The singular answer to that question often obscures a more nuanced, but valuable truth: Different website sections perform differently.

If your car starts having trouble accelerating up hills, you don't blame the entire car for the subpar performance, you start to examine potential causes (electrical system, engine, tires, etc.) and break these components down until you find the cause. Likewise, with a website, every piece should be performance tested, tuned and monitored on a regular basis.

Don't do this:

SEOmoz Page Views July 2010

The total page views data is fine as an overview, but we need to monitor each individual section to really understand what's gaining vs. falling.

Do this:

SEOmoz Blog Traffic July 2010

YOUmoz Traffic July 2010

By segmenting out traffic to URLs that include */blog/* and those that include */ugc/* (YOUmoz), we can see when/where/how each section is rising or falling in traffic and contributing to the overall site's performance.

Even better, we should do this:

Stacked Chart of SEOmoz Traffic by Section July 2010

How did I make that chart?

Step 1: Separate the areas of your website by the words/characters in their URL string (or other identifying factors like keywords in their titles). For example, on SEOmoz, we've got:

  • The Blog - all URLs include /blog
  • YOUmoz - all URLs have /UGC
  • Guides - nearly all have /articles
  • Tools - most URLs are different, but there's only around 20 so I can lump them together
  • etc.

Once I have these segments, I'll use the URL structures to get data about pageviews (or any other metric I care about) separately through analytics.

Step 2: Use the content filter in Google Analytics to select only those pages that contain the URL string you're seeking:

 SEOmoz Traffic Filtered by "Article" in the URL

 By using the simple filter for URLs "containing" /article, I've got a segmented report I can now use to start seeing what's really happening on my site.

Google Analytics Filter

pretty simple, right?

Step 3: Filter on each report and grab out the relevant pageviews number on a weekly basis:

Tools Pages Traffic

I grab those numbers for each of the segments each week (well, actually, Joanna does - but she says it's less than an hour of work) and plug them into a spreadsheet.

Step 4: Create a spreadsheet and a stacked graph

SEOmoz Traffic in July by Section

This spreadsheet shows the number of pageviews to each section of the site

Stacked Chart of SEOmoz Traffic by Section July 2010

This stacked, area graph shows where traffic is falling (e.g. the Beginner's Guide) vs. Gaining (e.g. the Blog)

When you run these over long periods of time, you can really see the impact a new section is having, or where problems in traffic might exist. If you neglect to break things out in this fashion, you'll often find that traffic from one section's gain may overshadow the loss in another area. This over/under-compensation can hide the real issues for a site, especially in SEO (where indexation, rankings and keyword demand all play inter-connected roles).

Joanna, in her post on benchmarking,  shared this chart:

Traffic by Section on SEOmoz
Also see this larger, detailed version

This helped us to realize where things had gone awry and why (the problem stemmed from some poorly done redirects from Linkscape to Open Site Explorer). I can't recommend this practice enough - if more marketers managed their analytics in this fashion, we'd have a much easier time identifying potential problems, opportunities and understanding not just the quantity of traffic, but the "whys" behind it.

Anyone with some clever Google Analytics methodologies to build these faster/more efficiently than my Excel hack, please do share!


Do you like this post? Yes No

http://www.seomoz.org/blog

Three Ways to Create a Better Performing Website (Using One Sneaky Tactic)

Posted by RobOusbeyLet’s start with a sneaky tactic. I know that SEOmoz blog readers are an internet-savvy crowd, so many of you are probably familiar with the ‘browser history sniffing’ techniques that exist. (Bear with me, we’ll get to internet marke…

Posted by RobOusbey

Let's start with a sneaky tactic.

I know that SEOmoz blog readers are an internet-savvy crowd, so many of you are probably familiar with the 'browser history sniffing' techniques that exist. (Bear with me, we'll get to internet marketing advice in a moment.)

In case you've not come across the concept before, it's probably best exemplified by the site Start Panic - just hit the 'Let's start!' button to watch it trawl through your browser history, and start listing sites that even you forgot you'd visited.
 
StartPanic uses Javascript to do the dirty work, but it's also possible to do this completely using CSS, and without Javascript. (There's advice about implementing the technical side of this in a popular post by Niall Kennedy.)
 
I wanted to show how you can use this to help your website perform better - let's begin with the least controversial, and work on from there
 

1 - Customize the User Experience

Niall's post - linked above - suggests one very sensible use of this technique: offering your users links to the social sites they use, and hiding the ones they don't. In this bottom of this live example page, you'll see a 'Digg It' button if you've been to Digg, a 'Share on Facebook' button if you've been there, etc. By limiting the set of sharing buttons, you can remove that 'social clutter' that is prevalent on some sites - this doesn't just give a cleaner page to the user, but may have a much higher 'sharing' rate for your page.
 
Customization can also be made is to the content of your site: use the browser history sniffing technique to see the kinds of blogs and news sites your visitors are reading, and then adjust your content based on the results. For example: I might consider writing a weekly post about PPC for the Distilled blog. We could check to see how many of the Distilled visitors had looked at PPC Hero, the AdWords blog, and the AdWords support pages. If the number was high enough, we might consider adding content to satisfy that niche.
 
Likewise, if you find that a high proportion of your readers visit KittenWar, then you might consider adding a little more 'cute' to your posts.
 

2 - Retarget Your Publicity

 
Traditional ad-network retargeting works in the following way:
  • a visitor comes to your site, and leaves without making a purchase
  • your advertising network drops a cookie onto that user's computer
  • the user visits a different site which displays ads from that network
  • the network recognizes the user, and shows them an ad for your product
  • hopefully they're reminded of you, and come back to the site to make a purchase.
However, this retargeting only works when you can cookie people once they've visited your site. I'd propose using this technique to alter the copy on your site, based on what the user has already seen about you elsewhere.
 
For example: check for new posts about your brand each morning (or can I assume you do this already?) If your company had three product reviews on blogs and news sites today, then record these URLs, and check to see if each visitor to your site has already read one of them. You could then display a prominent content box on the front page with information about the exact product they saw reviewed, and a link to your page for that product. You might even acknowledge they'd seen the review: "Initech wants to offer you a 10% discount, as a reader of The Daily Bugle"
 
You could use the same technique for Reputation Management. If a site has published a negative article about you, there's a potential that people will come to your site to find out more. However, you may not want to simply have a message on your front page that reads "The Bluth Company has NOT committed treason - read more" - but you could choose display this headline only to people who've read about the story already.
 

3 - Find Your Competitors' Customers

This is where you could really up-the-ante with your CRO efforts.
 
I recently saw a bank who offered $100 to people who closed their account at a competitor's bank and switched over. This would be a perfect opportunity to sniff each visitor's browser history, to see if you should promote this offer to them on your site. You can even avoid showing it to people who have been shopping around (and looking at every bank's website homepage) by checking to see if they've visited the URLs for logging in and out of the competitor's online banking to see if they're actually a customer of that company.
 
For e-commerce sites, you could check to see if your visitor has visited your competitor's site, but could also check if they've looked at the competitor's product on Amazon or other retailers. Your product page could then include a comparison between the two products. That could increase conversions, but you'd avoid comparing your product to a competitor's for anyone who'd never seen the competing product.
 

To Conclude

So, the practice of checking to see if a visitor has already been to particular pages might seem a little shady at first - but this part of the way that the web and web browsers are designed, and people can block their browser history if they'd prefer.
 
Executed in the right way, it could be a very powerful technique for creating high performing, high converting websites. Use it wisely.
 
 

(Thanks for reading; you can follow me on Twitter: @RobOusbey, and I'm pleased to be speaking alongside some of the best SEO practitioners around at this year's Pro Training Seminar - tickets are still available.)


Do you like this post? Yes No

http://www.seomoz.org/blog

Launching the SEOmoz Free API and Enough Power to Build Open Site Explorer

Posted by Nick Gerner

The launch of Open Site Explorer last week opens up a lot of link data, filters, and anchor text to a much wider audience than we’ve ever had before.  In that same vein, today we’re announcing our new and improved SEOmoz Free API.

Any registered (it’s free) SEOmoz member can visit our API Portal and get an API key that gives you access to:

  • Data for any URL in our index including
    • Domain and Page Authority
    • mozRank
    • total link count
    • external, followed link count
  • The first 500 links to any page, sub domain or domain
  • Filtering on those links: 301s, Follows, External, etc.
  • The first 3 domains linking to any page, sub domain or domain
  • The first 3 anchor text terms or phrases in links to any page, sub domain or domain

You’re welcome to use this data for private or publicly-facing purposes. We already have a variety of partners integrating this data including:

Check out some sample code and applications on the wiki.

Our idea is that getting this data into the hands of webmasters makes everyone better off: we’re excited about our new authority scores, marketers are thirsty for metrics, and users of all kinds of tools are better off with a deeper look at real data.  The free package will keep you covered up to a million links per month that you’re free to use for any purpose from consulting to building an SEO campaign management suite.

API Cartoon

In addition to the free API (which I think is quite powerful already), we’re expanding our paid API offering. The paid API includes everything above, but also includes:

  • Additional metrics:
    • number of domains that link to you
    • mozTrust
    • number of links to all pages on your domain
    • and more
  • A deeper look at links, way beyond the first 500 (first 100k for each sort per page, domain or sub domain)
  • Plenty of sorts on links:
    • domain authority
    • page authority
    • linking root domains
  • Way more anchor text terms and phrases (up to 100k per page, domain or sub domain if you’ve got that many)

This is exactly the same API powering Open Site Explorer.  So if you think OSE missed a feature, or should include other data sources, you can build it over again and do an even better job :)   If you do, drop me a line and I’ll take a look. We’d love to share partner apps on our wiki, Twitter, the blog, and elsewhere.

We don’t even have an attribution requirement. Although, we have a tasty 15% discount if you do cite us as a source ;)

To sign up, just contact us, and we’ll start the process.

EDIT: The paid API is available outside of a PRO membership.  A PRO membership buys the tools, and content, and sweet sweet badge.  The paid API is extra.  Of course, the free API is both free and full of awesome.

Do you like this post? Yes No

Posted by Nick Gerner

The launch of Open Site Explorer last week opens up a lot of link data, filters, and anchor text to a much wider audience than we've ever had before.  In that same vein, today we're announcing our new and improved SEOmoz Free API.

Any registered (it's free) SEOmoz member can visit our API Portal and get an API key that gives you access to:
  • Data for any URL in our index including
    • Domain and Page Authority
    • mozRank
    • total link count
    • external, followed link count
  • The first 500 links to any page, sub domain or domain
  • Filtering on those links: 301s, Follows, External, etc.
  • The first 3 domains linking to any page, sub domain or domain
  • The first 3 anchor text terms or phrases in links to any page, sub domain or domain
You're welcome to use this data for private or publicly-facing purposes. We already have a variety of partners integrating this data including:
Check out some sample code and applications on the wiki.

Our idea is that getting this data into the hands of webmasters makes everyone better off: we're excited about our new authority scores, marketers are thirsty for metrics, and users of all kinds of tools are better off with a deeper look at real data.  The free package will keep you covered up to a million links per month that you're free to use for any purpose from consulting to building an SEO campaign management suite.

API Cartoon

In addition to the free API (which I think is quite powerful already), we're expanding our paid API offering. The paid API includes everything above, but also includes:
  • Additional metrics:
    • number of domains that link to you
    • mozTrust
    • number of links to all pages on your domain
    • and more
  • A deeper look at links, way beyond the first 500 (first 100k for each sort per page, domain or sub domain)
  • Plenty of sorts on links:
    • domain authority
    • page authority
    • linking root domains
  • Way more anchor text terms and phrases (up to 100k per page, domain or sub domain if you've got that many)
This is exactly the same API powering Open Site Explorer.  So if you think OSE missed a feature, or should include other data sources, you can build it over again and do an even better job :)  If you do, drop me a line and I'll take a look. We'd love to share partner apps on our wiki, Twitter, the blog, and elsewhere.

We don't even have an attribution requirement. Although, we have a tasty 15% discount if you do cite us as a source ;)

To sign up, just contact us, and we'll start the process.

EDIT: The paid API is available outside of a PRO membership.  A PRO membership buys the tools, and content, and sweet sweet badge.  The paid API is extra.  Of course, the free API is both free and full of awesome.

Do you like this post? Yes No

http://www.seomoz.org/blog

It’s Only A Clique If You’re Not In It

Posted by Dr. Pete

CliqueThis post started as a reaction to accusations in the SEO industry that Top X lists, awards, etc. are only going to people’s friends. As I was writing it over what ended up being 2 weeks, I realized just how broad this issue really is, from personal to professional to political. I hope you’ll indulge me as I try to do justice to a topic that goes well beyond SEO.

We all know how it feels to be on the outside looking in. You start out feeling awkward and a little envious, but slowly it turns into something worse – depression, resentment, even rage. Eventually, we find a group to belong to, and the tables turn. No matter how often we were excluded (and maybe because of it), we eventually start to exclude others. It’s a vicious, if all too human, cycle, and it extends to every corner of our social interactions.

My Friends Are The Best

Just ask them; I’m sure they’ll agree. Do we prefer our friends? Do we give them the best opportunities and accolades? Absolutely. This is more than bias, though; it’s the simple reality of relevance. If you ask me who the "best" expert is in some niche of my own field or what the best article is on Topic X, I’m going to immediately draw from what I already know. Stating the obvious, I can’t recommend someone or something that I don’t even know exists.

Of course, there are times when we have a responsibility to dig deeper and look for the best candidates outside of our own limited realm of experience. When I was a graduate student at the University of Iowa, I had the opportunity to be the first student in my department to serve on a faculty search committee. One aspect of that experience that stuck with me was Iowa’s affirmative action policy. It wasn’t about numbers and quotas so much as a core philosophy that we had a professional obligation to search far and wide for the best candidate. We had the duty to leave our comfortable world of people just like us and venture into the world of "them".

Confirmation Bias

Beyond simple relevance is something more powerful, and sometimes more insidious. We all have a natural tendency to take sides, and, once we do, to find reasons why our side is right and the other side is wrong. Psychologists call this "confirmation bias," the often unconscious need to find data that confirms what we already believe. If we like someone, we’ll find reasons to support them and give them the benefit of the doubt. If we dislike someone, we’ll find reasons to be suspicious of everything they say and do. If you think confirmation bias is something only other people have, you’re fooling yourself.

Choosing Sides

Beyond our friends, confirmation bias quickly begins to apply to all of our cliques and teams. If you’re a sports fan, then that team mentality is usually just harmless fun – associating with your team provides a shared emotional experience. I’m a Cubs fan – believe me when I say that I understand the thrill of victory and the agony of defeat, although not in quite the ratio I’d like. What happens, though, when that team mentality starts to apply to things like politics, as we’ve seen far too often over the past couple of decades (on both sides of the fence)? Suddenly, our clique is 50% of the population, and our enemies are the other 50%. At best, it’s divisive. At worst, it breeds hate, violence, and bigotry.

Where Do We Go From Here?

Of course, we all like to think that we’re free from bias, but the power of bias is that the flaws that are obvious in others are often hidden and unconscious in ourselves. If I mention that I do SEO, do you picture a savvy internet guru or spam-spewing snake-oil salesman? If you’re an SEO, and you hear that I work with SEOmoz, do you think I’m a paragon of white-hat virtue or part of Rand’s evil conspiracy to take over the industry? Reality is probably somewhere in between. If I tell you that I voted for Obama, do you see a beacon of liberal hope or a Communist bent on destroying our nation? I can assure you that I am neither. So, how do we get past these labels and start to understand people, whether personally or professionally?

Get to Know People

Social media has given us a difficult dichotomy. On the one hand, it’s never been easier to "friend" people in shallow and meaningless ways. On the other hand, we have the tools to get to know our peers and friends of friends in ways that were never before possible. The next time you friend someone, take a moment and find out something about them. Where are they from? What do they do? What kind of music do they like? Do they blog? If they do, read a post. If you see a label ("liberal", "conservative", "Twilight fan"), don’t jump to conclusions. Give that person a chance to speak for themselves.

Play In a Different Park

It’s easy to be self-righteous when you’re surrounded by your fan-boys and girls. It’s easy to get a standing ovation at your campaign rally when you only invite the people who gave you the most money. If you want perspective, you have to give up the home-field advantage. If you disagree with someone, comment on their post instead of running back home to write a rant. Try guest-blogging – even better, guest-blog in a different industry. Try to explain why SEO is worthwhile to an audience of small business owners, designers or UX professionals. It’ll be a tough sell, but you’ll learn a lot in the process.

When In Doubt, Ask

Social media is a mine field of misunderstanding – if you’re not sure what someone means in that 140-character Tweet, ask them. If they write a blog post that seems like a personal attack, call them. It’s not just about being nice – bad blood runs deep, and today’s simple misunderstanding could destroy relationships and opportunities tomorrow.

Open Your Circle

We all remember the people who excluded us, and we too often hold that fact against the universe. Let it go. When you finally get into that circle, especially your professional circle, try to remember that someone else is still outside looking in. Here are a few ways to give someone else a chance, because we can all use a little good karma:

  • Promote other people’s links and awards, even the competition.
  • If you’re at a conference talking to a group and you see someone standing outside the circle with that awkward look of faux participation, invite them in.
  • Make an introduction to help someone’s career along.
  • If someone is new to blogging, comment, subscribe, or even link to them.
  • When someone challenges you publicly, listen and think before you counterattack.
  • Don’t envy other people’s success – learn from it and improve.
  • Every once in a while, shut up and listen.

At the end of the day, those of us who have attained some measure of success need to remember that we all had a little help along the way. Try to return the favor once in a while.

Photo licensed from iStockPhoto.com (Photographer: Hélène Vallée)

Do you like this post? Yes No

Posted by Dr. Pete

CliqueThis post started as a reaction to accusations in the SEO industry that Top X lists, awards, etc. are only going to people's friends. As I was writing it over what ended up being 2 weeks, I realized just how broad this issue really is, from personal to professional to political. I hope you'll indulge me as I try to do justice to a topic that goes well beyond SEO.

We all know how it feels to be on the outside looking in. You start out feeling awkward and a little envious, but slowly it turns into something worse – depression, resentment, even rage. Eventually, we find a group to belong to, and the tables turn. No matter how often we were excluded (and maybe because of it), we eventually start to exclude others. It's a vicious, if all too human, cycle, and it extends to every corner of our social interactions.

My Friends Are The Best

Just ask them; I'm sure they'll agree. Do we prefer our friends? Do we give them the best opportunities and accolades? Absolutely. This is more than bias, though; it's the simple reality of relevance. If you ask me who the "best" expert is in some niche of my own field or what the best article is on Topic X, I'm going to immediately draw from what I already know. Stating the obvious, I can't recommend someone or something that I don't even know exists.

Of course, there are times when we have a responsibility to dig deeper and look for the best candidates outside of our own limited realm of experience. When I was a graduate student at the University of Iowa, I had the opportunity to be the first student in my department to serve on a faculty search committee. One aspect of that experience that stuck with me was Iowa's affirmative action policy. It wasn't about numbers and quotas so much as a core philosophy that we had a professional obligation to search far and wide for the best candidate. We had the duty to leave our comfortable world of people just like us and venture into the world of "them".

Confirmation Bias

Beyond simple relevance is something more powerful, and sometimes more insidious. We all have a natural tendency to take sides, and, once we do, to find reasons why our side is right and the other side is wrong. Psychologists call this "confirmation bias," the often unconscious need to find data that confirms what we already believe. If we like someone, we'll find reasons to support them and give them the benefit of the doubt. If we dislike someone, we'll find reasons to be suspicious of everything they say and do. If you think confirmation bias is something only other people have, you're fooling yourself.

Choosing Sides

Beyond our friends, confirmation bias quickly begins to apply to all of our cliques and teams. If you're a sports fan, then that team mentality is usually just harmless fun – associating with your team provides a shared emotional experience. I'm a Cubs fan – believe me when I say that I understand the thrill of victory and the agony of defeat, although not in quite the ratio I'd like. What happens, though, when that team mentality starts to apply to things like politics, as we've seen far too often over the past couple of decades (on both sides of the fence)? Suddenly, our clique is 50% of the population, and our enemies are the other 50%. At best, it's divisive. At worst, it breeds hate, violence, and bigotry.

Where Do We Go From Here?

Of course, we all like to think that we're free from bias, but the power of bias is that the flaws that are obvious in others are often hidden and unconscious in ourselves. If I mention that I do SEO, do you picture a savvy internet guru or spam-spewing snake-oil salesman? If you're an SEO, and you hear that I work with SEOmoz, do you think I'm a paragon of white-hat virtue or part of Rand's evil conspiracy to take over the industry? Reality is probably somewhere in between. If I tell you that I voted for Obama, do you see a beacon of liberal hope or a Communist bent on destroying our nation? I can assure you that I am neither. So, how do we get past these labels and start to understand people, whether personally or professionally?

Get to Know People

Social media has given us a difficult dichotomy. On the one hand, it's never been easier to "friend" people in shallow and meaningless ways. On the other hand, we have the tools to get to know our peers and friends of friends in ways that were never before possible. The next time you friend someone, take a moment and find out something about them. Where are they from? What do they do? What kind of music do they like? Do they blog? If they do, read a post. If you see a label ("liberal", "conservative", "Twilight fan"), don't jump to conclusions. Give that person a chance to speak for themselves.

Play In a Different Park

It's easy to be self-righteous when you're surrounded by your fan-boys and girls. It's easy to get a standing ovation at your campaign rally when you only invite the people who gave you the most money. If you want perspective, you have to give up the home-field advantage. If you disagree with someone, comment on their post instead of running back home to write a rant. Try guest-blogging – even better, guest-blog in a different industry. Try to explain why SEO is worthwhile to an audience of small business owners, designers or UX professionals. It'll be a tough sell, but you'll learn a lot in the process.

When In Doubt, Ask

Social media is a mine field of misunderstanding – if you're not sure what someone means in that 140-character Tweet, ask them. If they write a blog post that seems like a personal attack, call them. It's not just about being nice – bad blood runs deep, and today's simple misunderstanding could destroy relationships and opportunities tomorrow.

Open Your Circle

We all remember the people who excluded us, and we too often hold that fact against the universe. Let it go. When you finally get into that circle, especially your professional circle, try to remember that someone else is still outside looking in. Here are a few ways to give someone else a chance, because we can all use a little good karma:

  • Promote other people's links and awards, even the competition.
  • If you're at a conference talking to a group and you see someone standing outside the circle with that awkward look of faux participation, invite them in.
  • Make an introduction to help someone's career along.
  • If someone is new to blogging, comment, subscribe, or even link to them.
  • When someone challenges you publicly, listen and think before you counterattack.
  • Don't envy other people's success – learn from it and improve.
  • Every once in a while, shut up and listen.

At the end of the day, those of us who have attained some measure of success need to remember that we all had a little help along the way. Try to return the favor once in a while.

Photo licensed from iStockPhoto.com (Photographer: Hélène Vallée)


Do you like this post? Yes No

http://www.seomoz.org/blog

Free Webinar: Getting to Know Open Site Explorer

Posted by great scott!

Last week we unveiled our newest toy, Open Site Explorer, to the world and the response was phenomenal. Now we want to take some time and really show everyone just what this powerful link analysis tool is capable of and answer your questions, so we’re hosting not one, but two FREE Webinars this week (it’s the same content, run twice to help accomodate schedules and time zones).

The presentations will be 60 minutes each, 25 minutes of slides, followed by 35 minutes of Q+A on Wednesday, January 27th at 2:00PM (PST), and Thursday, January 28th at 10:00AM (PST)  In each live webinar, Rand will show you around Open Site Explorer, offer tips and strategies for getting the most out of it, explain our new Domain Authority & Page Authority metrics, and answer your questions.

Here’s the catch: each webinar is limited to 1,000 attendees. The last time we announced a webinar on the blog, we had over 3,000 people try to register in the first hour, so if you want to attend one of the live sessions, register quickly. If you can’t make it, we’ll have a recording of the presentation available in a couple of days on our webinars page.



Looooove Webinars and can’t get enough of ‘em? Then you should totally become a PRO Member! In the last couple of months we’ve started running regular webinars just for PRO Members and they’ve been really popular.

PRO Webinar Link Building Strategies
A slide from our December PRO Webinar on Link Building Strategies

PRO Webinar SEO Strategies for 2010
A slide from our January PRO Webinar on SEO Strategies for 2010

In February we’re stepping it up even more. In addition to our monthly educational webinar (February 4th on Analytics), we’re adding a second monthly webinar where we’ll be performing live site reviews of sites submitted by our PRO Members!

PRO Members can head over to the PRO Webinars page for more info on February’s webinars, as well as recordings and slide decks from past webinars. If you’d like to join us for the next PRO Webinar–and possibly even get a live site review–sign up for PRO to access the PRO Webinar page for registration details or just watch your inbox for an invite.

Do you like this post? Yes No

Posted by great scott!

Last week we unveiled our newest toy, Open Site Explorer, to the world and the response was phenomenal. Now we want to take some time and really show everyone just what this powerful link analysis tool is capable of and answer your questions, so we're hosting not one, but two FREE Webinars this week (it's the same content, run twice to help accomodate schedules and time zones).

The presentations will be 60 minutes each, 25 minutes of slides, followed by 35 minutes of Q+A on Wednesday, January 27th at 2:00PM (PST), and Thursday, January 28th at 10:00AM (PST)  In each live webinar, Rand will show you around Open Site Explorer, offer tips and strategies for getting the most out of it, explain our new Domain Authority & Page Authority metrics, and answer your questions.


Here's the catch: each webinar is limited to 1,000 attendees. The last time we announced a webinar on the blog, we had over 3,000 people try to register in the first hour, so if you want to attend one of the live sessions, register quickly. If you can't make it, we'll have a recording of the presentation available in a couple of days on our webinars page.


Looooove Webinars and can't get enough of 'em? Then you should totally become a PRO Member! In the last couple of months we've started running regular webinars just for PRO Members and they've been really popular.


PRO Webinar Link Building Strategies
A slide from our December PRO Webinar on Link Building Strategies


PRO Webinar SEO Strategies for 2010
A slide from our January PRO Webinar on SEO Strategies for 2010


In February we're stepping it up even more. In addition to our monthly educational webinar (February 4th on Analytics), we're adding a second monthly webinar where we'll be performing live site reviews of sites submitted by our PRO Members!

PRO Members can head over to the PRO Webinars page for more info on February's webinars, as well as recordings and slide decks from past webinars. If you'd like to join us for the next PRO Webinar--and possibly even get a live site review--sign up for PRO to access the PRO Webinar page for registration details or just watch your inbox for an invite.

Do you like this post? Yes No

http://www.seomoz.org/blog

Indexation for SEO: Real Numbers in 5 Easy Steps

Posted by randfish

How many pages has Google indexed?

This question and the problems surrounding it run rampant through the SEO world. It usually arises when someone starts doing searches like this:

Indexation of SEOmoz According to Google

Google claims to have 93,800 pages indexed on the root domain, seomoz.org. That sounds pretty good, but when I ran that search query last week, the number was closer to 75,000 and when I run it again from Google.co.uk 60 seconds later, the number changes even more dramatically:

Indexation of SEOmoz.org on Google.co.uk

How about if I hit refresh on my Google.com results again:

Indexation on Google.com 3 minutes later

Doh! Google just dropped 8,500 of my pages out of their index. That sucks – but not nearly as much as managers, marketing directors and CEOs who use these numbers as actual KPIs! Can you imagine? A number that means nothing, fluctuates 300% between data centers, can change at a moment’s notice and provides no actionable insight being used as a business metric?

And yet… It happens.

Fortunately, there’s an easy way to get much, much better data than what the search engines provide through "site:" queries and this post is here to walk you through that process step-by-step.

Step 1: Go to Traffic Sources in Your Analytics

Google Analytics Step 1

Click the "traffic sources" link in Google analytics or Omniture (it can also be called "referring sources" in other analytics packages).

Step 2: Head to the Search Engines Section

Step 2 of the Indexation Process

We want to find out how many pages the search engines have indexed, so the obvious next step is to go to the "search engines" sub-section.

Step 3: Choose an Engine

Step 3: Choose an Engine 

Choose the engine you want indexation data on and click. If you have both paid and organic traffic from this engine, you’ll want to display organic only at this step, too.

Step 4: Filter by Landing Pages

Step 4: Filter by Landing Page

The "Landing Page" filter in the dropdown will show you the traffic each individual page on your site received from the engine you’ve selected. This also produces the magical "total" number of pages that have received traffic, described in the last step.

Step 5: Record the Number at the Bottom

Step 5: Indexation Count Arrives

That count tells you the unique number of pages that received at least one visit from searches performed on Google. It’s the Holy Grail of indexation – a number you can accurately track over time to see how the search engine is indexing your site. On its own, it isn’t particularly useful, but over time (I usually recommend recording monthly, but for some sites, every 2-3 months can make more sense), it gives you insight into whether your pages are doing better or worse at drawing in traffic from the engine.

Now, technically I’m being a bit cheeky here. This number doesn’t tell you the full story – it’s not showing the actual number of pages a search engine has crawled or indexed on your site, but it does tell you the unique number of URLs that received at least 1 visit from the engine. In my opinion this data is far more accurate and more actionable. The first adjective – accurate – is hard to argue (particularly given the visual evidence atop this post), but the second requires a bit of an explanation.

Why is Number of Pages Receiving ≥1 Visit Actionable?

Indexation numbers alone are useless. Businesses and websites use them as KPIs because they want to know if, over time, more of their pages are making their way into the engines’ indices. I’d argue that actually, you don’t care if your pages are in the indices – you care if your pages have the opportunity to EARN TRAFFIC!

Being a row in a search index means nothing if your page is:

  • too low in PageRank/link juice to appear in any results
  • displaying content the engines can’t properly parse
  • devoid of keywords or content that could send traffic
  • broken, misdirected or unavailable
  • a duplicate of other pages that the engine will rank instead

Thus, the metric you want to count over time isn’t (in most cases) number of pages indexed, it’s number of pages that earned traffic. Over time, that’s the number you want to rise, the number you want marketers to concentrate on and the KPI that’s meaningful. It tells you whether the engine is crawling, indexing AND listing your pages in the results where someone might (has) actually click(ed) them.

If the number drops, you can investigate the actual pages that are no longer receiving traffic by exporting the data to Excel and doing a side-by-side with the previous month. If the number rises, you can see the new pages getting traffic. Those individual URLs will tell a story – of pages that broke, that stopped being linked-to, that fell too far down in paginated results or lost their unique content. It’s so much better than playing the mystery game that SEOs so often confront in the face of "lower indexation numbers" from the site: command.

Some Necessary Caveats

This methodology certainly isn’t perfect, and there are some important points to be aware of (thanks especially to some folks in the comments who brought these up):

  • Google Analytics (and many other analytics packages) use sampled data at times to make guesstimates. If you want to be sure you’re getting the absolute best number, export to CSV and do the side-by-side in Excel. You can even expunge similar results from two time period to see only those pages that uniquely did/didn’t receive traffic. In many of these cases, you might also only care about pages that gained/lost 5/10/20+ visits.
  • Greater accuracy can be found from shrinking the time period in the analytics, but it also reduces the liklihood that a page receiving very long tail query traffic once in a blue moon will be properly listed, so adjust accordingly, and plan for imperfect data. This method isn’t foolproof, but it is (in my opinion), better than the random roulette wheel of site: queries.
  • This technique isn’t going to help you catch other kinds of SEO issues like duplicate content (it can in some cases, but it’s not as good as something like GG WM Tools reporting) or 301s, 302s, etc. which can require a crawling solution.

I’d, of course, love your feedback. I know many SEOs are addicted to and supportive of the site: command numbers as a way to measure progress, so maybe there’s things I’m not considering or situations where it makes sense. I also know that many of you like the number reported in Google Webmaster tools under the Sitemaps crawl data (I’m skeptical of this too, for the record) and I’d like to hear how you find value with that data as well.

p.s. Tomorrow we’ll be announcing two webinars (open to all) about using Open Site Explorer to get ACTIONABLE data. Be sure to leave either Wednesday the 27th at 2pm Pacific or Thursday the 28th at 10am Pacific free :-)

Do you like this post? Yes No

Posted by randfish

How many pages has Google indexed?

This question and the problems surrounding it run rampant through the SEO world. It usually arises when someone starts doing searches like this:

Indexation of SEOmoz According to Google

Google claims to have 93,800 pages indexed on the root domain, seomoz.org. That sounds pretty good, but when I ran that search query last week, the number was closer to 75,000 and when I run it again from Google.co.uk 60 seconds later, the number changes even more dramatically:

Indexation of SEOmoz.org on Google.co.uk

How about if I hit refresh on my Google.com results again:

Indexation on Google.com 3 minutes later

Doh! Google just dropped 8,500 of my pages out of their index. That sucks - but not nearly as much as managers, marketing directors and CEOs who use these numbers as actual KPIs! Can you imagine? A number that means nothing, fluctuates 300% between data centers, can change at a moment's notice and provides no actionable insight being used as a business metric?

And yet... It happens.

Fortunately, there's an easy way to get much, much better data than what the search engines provide through "site:" queries and this post is here to walk you through that process step-by-step.

Step 1: Go to Traffic Sources in Your Analytics

Google Analytics Step 1

Click the "traffic sources" link in Google analytics or Omniture (it can also be called "referring sources" in other analytics packages).

Step 2: Head to the Search Engines Section

Step 2 of the Indexation Process

We want to find out how many pages the search engines have indexed, so the obvious next step is to go to the "search engines" sub-section.

Step 3: Choose an Engine

Step 3: Choose an Engine 

Choose the engine you want indexation data on and click. If you have both paid and organic traffic from this engine, you'll want to display organic only at this step, too.

Step 4: Filter by Landing Pages

Step 4: Filter by Landing Page

The "Landing Page" filter in the dropdown will show you the traffic each individual page on your site received from the engine you've selected. This also produces the magical "total" number of pages that have received traffic, described in the last step.

Step 5: Record the Number at the Bottom

Step 5: Indexation Count Arrives

That count tells you the unique number of pages that received at least one visit from searches performed on Google. It's the Holy Grail of indexation - a number you can accurately track over time to see how the search engine is indexing your site. On its own, it isn't particularly useful, but over time (I usually recommend recording monthly, but for some sites, every 2-3 months can make more sense), it gives you insight into whether your pages are doing better or worse at drawing in traffic from the engine.

Now, technically I'm being a bit cheeky here. This number doesn't tell you the full story - it's not showing the actual number of pages a search engine has crawled or indexed on your site, but it does tell you the unique number of URLs that received at least 1 visit from the engine. In my opinion this data is far more accurate and more actionable. The first adjective - accurate - is hard to argue (particularly given the visual evidence atop this post), but the second requires a bit of an explanation.

Why is Number of Pages Receiving ≥1 Visit Actionable?

Indexation numbers alone are useless. Businesses and websites use them as KPIs because they want to know if, over time, more of their pages are making their way into the engines' indices. I'd argue that actually, you don't care if your pages are in the indices - you care if your pages have the opportunity to EARN TRAFFIC!

Being a row in a search index means nothing if your page is:

  • too low in PageRank/link juice to appear in any results
  • displaying content the engines can't properly parse
  • devoid of keywords or content that could send traffic
  • broken, misdirected or unavailable
  • a duplicate of other pages that the engine will rank instead

Thus, the metric you want to count over time isn't (in most cases) number of pages indexed, it's number of pages that earned traffic. Over time, that's the number you want to rise, the number you want marketers to concentrate on and the KPI that's meaningful. It tells you whether the engine is crawling, indexing AND listing your pages in the results where someone might (has) actually click(ed) them.

If the number drops, you can investigate the actual pages that are no longer receiving traffic by exporting the data to Excel and doing a side-by-side with the previous month. If the number rises, you can see the new pages getting traffic. Those individual URLs will tell a story - of pages that broke, that stopped being linked-to, that fell too far down in paginated results or lost their unique content. It's so much better than playing the mystery game that SEOs so often confront in the face of "lower indexation numbers" from the site: command.

Some Necessary Caveats

This methodology certainly isn't perfect, and there are some important points to be aware of (thanks especially to some folks in the comments who brought these up):

  • Google Analytics (and many other analytics packages) use sampled data at times to make guesstimates. If you want to be sure you're getting the absolute best number, export to CSV and do the side-by-side in Excel. You can even expunge similar results from two time period to see only those pages that uniquely did/didn't receive traffic. In many of these cases, you might also only care about pages that gained/lost 5/10/20+ visits.
  • Greater accuracy can be found from shrinking the time period in the analytics, but it also reduces the liklihood that a page receiving very long tail query traffic once in a blue moon will be properly listed, so adjust accordingly, and plan for imperfect data. This method isn't foolproof, but it is (in my opinion), better than the random roulette wheel of site: queries.
  • This technique isn't going to help you catch other kinds of SEO issues like duplicate content (it can in some cases, but it's not as good as something like GG WM Tools reporting) or 301s, 302s, etc. which can require a crawling solution.

I'd, of course, love your feedback. I know many SEOs are addicted to and supportive of the site: command numbers as a way to measure progress, so maybe there's things I'm not considering or situations where it makes sense. I also know that many of you like the number reported in Google Webmaster tools under the Sitemaps crawl data (I'm skeptical of this too, for the record) and I'd like to hear how you find value with that data as well.

p.s. Tomorrow we'll be announcing two webinars (open to all) about using Open Site Explorer to get ACTIONABLE data. Be sure to leave either Wednesday the 27th at 2pm Pacific or Thursday the 28th at 10am Pacific free :-)


Do you like this post? Yes No

http://www.seomoz.org/blog

Peer Review: SEO Best Practices for Duplicate Content

Posted by Danny Dover

This post is part of an ongoing series where my co-workers and I are working to build a freely available resource center of up-to-date SEO best practices. As we write this content, we are submitting them for peer review so that everyone on the Internet can benefit from collective intelligence. You can read more about the SEO Knowledge Center here.


This weeks proposed SEO best practice deals with duplicate content. It is my belief that duplicate content is the single biggest SEO problem on the Internet. (Well that and Myspace layouts.) On the page linked below, Jen Lopez discusses what duplicate content is, how it gets created and how to get rid of it. Hopefully, this page will help all of you combat this problem.

Please let us know if there is something we should add, remove or modify. We are also open to suggestions on how to design better robots. As you will see on the duplicate content page below, Rand’s robot mock-up skills are like a mixture of Avatar CGI and Shakespearean writing but without any of the talent or impressiveness (or iambic pentameter for that matter).


Duplicate Content

Duplicate Content

Remember, this page is just a work in progress. I would love to hear your thoughts and suggestions on how to improve it. Please feel free to leave your comments below.


Danny Dover Twitter

Do you like this post? Yes No

Posted by Danny Dover

This post is part of an ongoing series where my co-workers and I are working to build a freely available resource center of up-to-date SEO best practices. As we write this content, we are submitting them for peer review so that everyone on the Internet can benefit from collective intelligence. You can read more about the SEO Knowledge Center here.


This weeks proposed SEO best practice deals with duplicate content. It is my belief that duplicate content is the single biggest SEO problem on the Internet. (Well that and Myspace layouts.) On the page linked below, Jen Lopez discusses what duplicate content is, how it gets created and how to get rid of it. Hopefully, this page will help all of you combat this problem.

Please let us know if there is something we should add, remove or modify. We are also open to suggestions on how to design better robots. As you will see on the duplicate content page below, Rand's robot mock-up skills are like a mixture of Avatar CGI and Shakespearean writing but without any of the talent or impressiveness (or iambic pentameter for that matter).


Duplicate Content

Duplicate Content

Remember, this page is just a work in progress. I would love to hear your thoughts and suggestions on how to improve it. Please feel free to leave your comments below.


Danny Dover Twitter


Do you like this post? Yes No

http://www.seomoz.org/blog

Offline Reading List: Magazines and Books for SEOs

Posted by RobOusbey

This week, I’d like to make suggestions for a ‘reading list‘ to help SEOs, and others who work online, particularly with website strategies.

But this list isn’t going to be blogs, post and online articles, oh no. These suggestions are entirely offline. We’re going into dead tree mode with eleven books and two magazines. Some of these suggestions you may want to flick through, some you may want to read cover to cover. Others will be suitable for suggesting to other people within your organisation.

There’s no intention that everybody should read all these books (they’re spread over many topics) and my list is far from exhaustive. I’ll welcome your feedback and further recommendations in the comments.

(NB: This post links to Amazon.com and Amazon.co.uk to help you find more about the books mentioned. I’ve used affiliate links, and any revenue generated will be donated to good causes through a general disaster/emergency fund.)

Analytics and Data

Web Analytics in an Hour a Day – Avinash Kaushik

This book is regarded as required reading for anyone who needs to understand the concepts behind web analytics and how to properly assess and understand them. Beyond the very basics about collecting analytics data, the book focuses on how to truly understand how it applies to your website’s goals, and using analytics to collect actionable insights that will improve your website.

(Buy online from: Amazon US or Amazon UK.)

 

Information Dashboard Design – Stephen Few

This book isn’t directly related to SEO or web strategy, but since reading it, I’ve already had two opportunities to use its advice on effectively presenting data. Even if you’re comfortable creating tables, graphs and charts, the hugely practical and highly actionable advice about combining data into ‘dashboards’ is worth your time to acquire.

Whether you work with sites that need to present data in a way that’s appealing to users, or if you need to produce a dashboard of analytics and search data for use internally (perhaps gleaned from Avinash’s book) then you’ll be able to communicate the information more effectively after taking advice from here. (You’ll also start spotting the terrible data presentation mistakes that others make, but I can’t help you there unfortunately.)
(Buy online from: Amazon US or Amazon UK.)

 

Usability & Testing

Don’t Make Me Think and Rocket Surgery Made Easy – Steve Krug

Krug’s famous book about design and usability is one of those classic texts that offers the whole premise within the four words of the title, and then goes on to spend the book showing you how to build that premise into your design philosophy. Get a flavour of the author’s style and keen understanding in the sample chapter, How we really use the web.

Krug describes the first book as being about how to think about usability, whereas Rocket Surgery Made Easy is about how to do it – covering the process of improving web site usability though user testing. It’s highly recommended that before you start designing test and recruiting users, you give this book a read; if you’re not planning any user testing just yet, then read it anyway to remind you why you should.

(Buy ‘Don’t Make Me Think’ online from: Amazon US or Amazon UK.)
(Buy ‘Rocket Surgery Made Easy’ online from: Amazon US or Amazon UK.)

 

Always Be Testing: The Complete Guide to Google Website Optimizer – Bryan Eisenberg & John Quarto-von Tivadar

Complementing Krug’s books, this text focuses on using Google Website Optimizer to set up tests for Conversion Rate Optimisation. Beyond the simple technical aspects of how to run a test with the tool, it teaches how to use an understanding of the buying process and creating strong offering to make websites more powerful.

One reviewer on Amazon was given a copy of the book at SES, and mentioned: "In one recent test, we used the principles learned from the book such as persuasion architecture to setup a test in only an hour that increased lead generation on a high volume ecommerce site by 51%"
(Buy online from: Amazon US or Amazon UK.)

 

Search Marketing

Information Architecture for the World Wide Web – Louis Rosenfeld & Peter Morville

How does a book originally written over a decade ago make it into this list? Because when O’Reilly publishes a book by these authors, on a topic so important to the way information is published online and understood / consumed by visitors, the text is going to stand the test of time.

Like many of these suggestions, the book doesn’t just float at a high level, but gets down to ‘brass tacks’ with detailed discussion about designing and implementing IA on websites, and dedicating a significant chapter to choosing whether and how to implement on-site search on a site. (Recommendation by Dr Pete.)
(Buy online from: Amazon US or Amazon UK.)

 

The Art of SEO – Eric Enge, Jessie Stricchiola, Rand Fishkin and Stephan Spencer

Despite the incredible ongoing changes in the field of SEO, an ‘all-star cast’ (including SEOmoz’s Rand Fishkin) has managed to put together this excellent reference book for search marketers. Before page 50, the authors have covered the basics of how search engines crawl & index the web and search ranking factors; it goes onto cover the technical aspects of SEO, keyword research, competitor analysis & benchmarking, linkbuilding, vertical search and monitoring results.

Most appealing about this book is the understanding that the authors bring from their experience managing SEO campaigns in the real world – such as in the chapter dedicated to building SEO teams, and knowing when or how to appoint a search agency.

The main reason I sound like I’m raving about the book is the same reason you should read it: flattery. Rand dedicated this book to you, the members of the SEO community.
(Buy online from: Amazon US or Amazon UK.)

 

Search Marketing Standard

Recommended by RobBothan, Search Marketing Standard published a quarterly magazine for the search industry. They promise: "Stop stressing out over the avalanche of marketing advice from online sources and let us filter the noise for you."

 

Influence: The Psychology of Persuasion – Robert B. Cialdini

This is a classic book, which chooses to pitch persuasion as a science, rather than an art. The author is a professor of psychology, so this is perhaps expected, but the rigour of explanation in the examples (many from Caldini’s own observations) will help you develop new, more persuasive ways of influencing the visitors to your sites.
(Buy online from: Amazon US or Amazon UK.)

 

Management & Implementation

Good to Great – Jim Collins

I’ve never before come across a book that is essentially a write up of a research project; it’s particularly special as the research conclusions are highly valuable, and can be actioned. The premise of the work was to: identify concepts which great companies had in common, but that were not implemented by any (or many) companies that were simply ‘good.’

You can read more about what these concepts turned out to be, and see how Rand tested their application within SEOmoz in his 2007 post, Asking the Tough Questions or a similar post by Will, from Distilled’s perspective.
(Buy online from: Amazon US or Amazon UK.)

 

Getting Things Done – David Allen

An outstanding book that proposes a workflow management system which would seem highly radical to many people with an established system, and terribly common sense to others. The book then leads you into a method for implementing the GTD setup.

From my perspective, the most important message (but one that plays second fiddle to much of the book’s other content) is that your mind is excellent at a certain type of work (creative thinking, problem solving, etc) and shouldn’t be fettered with other tasks (remember to call that client tomorrow, try to come up with some blog post ideas etc) which can be devolved to a trusted system.

You know when you put things by the door, so that you remember to take them with you when you next leave the house? This book provides a way of making sure that your whole life runs that way.
(Buy online from: Amazon US or Amazon UK.)

 

Agile Project Management with Scrum – Ken Schwaber

Ken Schwaber is one of the authors of the ‘Agile Manifesto’ which outlined the principles behind the methodology known as ‘agile software development’. His ‘Scrum’ process – described in detail in this book – uses a series of relatively fast iterations, typically month-long ’sprints’ between releasing product improvements.

For people who don’t like structures and systems that may introduce additional bureaucracy as a barrier to work, the system may sound terrifying (particularly the formal daily meetings) but trust me: once implemented, Scrum reduces almost every barrier between finding out what needs to be done and actually doing it.

Though designed for software development, the principles can be applied to any product or service that can benefit from incremental improvements (and with a bit of creativity, I think this could easily apply to the output of a great deal of organisations.)
(Buy online from: Amazon US or Amazon UK.)

 

Day to Day

Wired

Wired (at Wired.com and Wired.co.uk) is a monthly magazine, covering many facets of ‘technology’, from gadgets to online-strategy. Its blend of creativity and informity will help you keep on top of technological trends and can also spark ideas, inspire design themes and help as a seed for linkbait concepts.

That their staff have coined terms such as ‘crowdsourcing’ and ‘the long tail’ gives an idea of the impact the magazine has had on the internet marketing industry; reading it every month is the only way to make sure that you’re using their next bit of lingo, before it hits the big time.

Send: The Essential Guide to Email for Office and Home – by David Shipley & Will Schwalbe

Various people have written about how to manage email as part of your daily work life, but this book talks about the specifics of style and writing in the medium. It should help you create better understood, more expressive emails. Sam suggested this book; he said "It was recommended by an e-mail marketer friend and it changed the way I write (and read) e-mails. (…) Really useful."
(Buy online from: Amazon US or Amazon UK.)

 

Do you like this post? Yes No

Posted by RobOusbey

This week, I'd like to make suggestions for a 'reading list' to help SEOs, and others who work online, particularly with website strategies.

But this list isn't going to be blogs, post and online articles, oh no. These suggestions are entirely offline. We're going into dead tree mode with eleven books and two magazines. Some of these suggestions you may want to flick through, some you may want to read cover to cover. Others will be suitable for suggesting to other people within your organisation.

There's no intention that everybody should read all these books (they're spread over many topics) and my list is far from exhaustive. I'll welcome your feedback and further recommendations in the comments.

(NB: This post links to Amazon.com and Amazon.co.uk to help you find more about the books mentioned. I've used affiliate links, and any revenue generated will be donated to good causes through a general disaster/emergency fund.)

Analytics and Data

Web Analytics in an Hour a Day - Avinash Kaushik

This book is regarded as required reading for anyone who needs to understand the concepts behind web analytics and how to properly assess and understand them. Beyond the very basics about collecting analytics data, the book focuses on how to truly understand how it applies to your website's goals, and using analytics to collect actionable insights that will improve your website.

(Buy online from: Amazon US or Amazon UK.)
 

Information Dashboard Design - Stephen Few

This book isn't directly related to SEO or web strategy, but since reading it, I've already had two opportunities to use its advice on effectively presenting data. Even if you're comfortable creating tables, graphs and charts, the hugely practical and highly actionable advice about combining data into 'dashboards' is worth your time to acquire.

Whether you work with sites that need to present data in a way that's appealing to users, or if you need to produce a dashboard of analytics and search data for use internally (perhaps gleaned from Avinash's book) then you'll be able to communicate the information more effectively after taking advice from here. (You'll also start spotting the terrible data presentation mistakes that others make, but I can't help you there unfortunately.)
(Buy online from: Amazon US or Amazon UK.)
 

Usability & Testing

Don't Make Me Think and Rocket Surgery Made Easy - Steve Krug

Krug's famous book about design and usability is one of those classic texts that offers the whole premise within the four words of the title, and then goes on to spend the book showing you how to build that premise into your design philosophy. Get a flavour of the author's style and keen understanding in the sample chapter, How we really use the web.

Krug describes the first book as being about how to think about usability, whereas Rocket Surgery Made Easy is about how to do it - covering the process of improving web site usability though user testing. It's highly recommended that before you start designing test and recruiting users, you give this book a read; if you're not planning any user testing just yet, then read it anyway to remind you why you should.

(Buy 'Don't Make Me Think' online from: Amazon US or Amazon UK.)
(Buy 'Rocket Surgery Made Easy' online from: Amazon US or Amazon UK.)
 


Always Be Testing: The Complete Guide to Google Website Optimizer - Bryan Eisenberg & John Quarto-von Tivadar

Complementing Krug's books, this text focuses on using Google Website Optimizer to set up tests for Conversion Rate Optimisation. Beyond the simple technical aspects of how to run a test with the tool, it teaches how to use an understanding of the buying process and creating strong offering to make websites more powerful.

One reviewer on Amazon was given a copy of the book at SES, and mentioned: "In one recent test, we used the principles learned from the book such as persuasion architecture to setup a test in only an hour that increased lead generation on a high volume ecommerce site by 51%"
(Buy online from: Amazon US or Amazon UK.)
 

Search Marketing

Information Architecture for the World Wide Web - Louis Rosenfeld & Peter Morville

How does a book originally written over a decade ago make it into this list? Because when O'Reilly publishes a book by these authors, on a topic so important to the way information is published online and understood / consumed by visitors, the text is going to stand the test of time.

Like many of these suggestions, the book doesn't just float at a high level, but gets down to 'brass tacks' with detailed discussion about designing and implementing IA on websites, and dedicating a significant chapter to choosing whether and how to implement on-site search on a site. (Recommendation by Dr Pete.)
(Buy online from: Amazon US or Amazon UK.)
 


The Art of SEO - Eric Enge, Jessie Stricchiola, Rand Fishkin and Stephan Spencer

Despite the incredible ongoing changes in the field of SEO, an 'all-star cast' (including SEOmoz's Rand Fishkin) has managed to put together this excellent reference book for search marketers. Before page 50, the authors have covered the basics of how search engines crawl & index the web and search ranking factors; it goes onto cover the technical aspects of SEO, keyword research, competitor analysis & benchmarking, linkbuilding, vertical search and monitoring results.

Most appealing about this book is the understanding that the authors bring from their experience managing SEO campaigns in the real world - such as in the chapter dedicated to building SEO teams, and knowing when or how to appoint a search agency.

The main reason I sound like I'm raving about the book is the same reason you should read it: flattery. Rand dedicated this book to you, the members of the SEO community.
(Buy online from: Amazon US or Amazon UK.)
 


Search Marketing Standard

Recommended by RobBothan, Search Marketing Standard published a quarterly magazine for the search industry. They promise: "Stop stressing out over the avalanche of marketing advice from online sources and let us filter the noise for you."
 


Influence: The Psychology of Persuasion - Robert B. Cialdini

This is a classic book, which chooses to pitch persuasion as a science, rather than an art. The author is a professor of psychology, so this is perhaps expected, but the rigour of explanation in the examples (many from Caldini's own observations) will help you develop new, more persuasive ways of influencing the visitors to your sites.
(Buy online from: Amazon US or Amazon UK.)
 


Management & Implementation

Good to Great - Jim Collins

I've never before come across a book that is essentially a write up of a research project; it's particularly special as the research conclusions are highly valuable, and can be actioned. The premise of the work was to: identify concepts which great companies had in common, but that were not implemented by any (or many) companies that were simply 'good.'

You can read more about what these concepts turned out to be, and see how Rand tested their application within SEOmoz in his 2007 post, Asking the Tough Questions or a similar post by Will, from Distilled's perspective.
(Buy online from: Amazon US or Amazon UK.)
 


Getting Things Done - David Allen

An outstanding book that proposes a workflow management system which would seem highly radical to many people with an established system, and terribly common sense to others. The book then leads you into a method for implementing the GTD setup.

From my perspective, the most important message (but one that plays second fiddle to much of the book's other content) is that your mind is excellent at a certain type of work (creative thinking, problem solving, etc) and shouldn't be fettered with other tasks (remember to call that client tomorrow, try to come up with some blog post ideas etc) which can be devolved to a trusted system.

You know when you put things by the door, so that you remember to take them with you when you next leave the house? This book provides a way of making sure that your whole life runs that way.
(Buy online from: Amazon US or Amazon UK.)
 


Agile Project Management with Scrum - Ken Schwaber

Ken Schwaber is one of the authors of the 'Agile Manifesto' which outlined the principles behind the methodology known as 'agile software development'. His 'Scrum' process - described in detail in this book - uses a series of relatively fast iterations, typically month-long 'sprints' between releasing product improvements.

For people who don't like structures and systems that may introduce additional bureaucracy as a barrier to work, the system may sound terrifying (particularly the formal daily meetings) but trust me: once implemented, Scrum reduces almost every barrier between finding out what needs to be done and actually doing it.

Though designed for software development, the principles can be applied to any product or service that can benefit from incremental improvements (and with a bit of creativity, I think this could easily apply to the output of a great deal of organisations.)
(Buy online from: Amazon US or Amazon UK.)
 


Day to Day

Wired

Wired (at Wired.com and Wired.co.uk) is a monthly magazine, covering many facets of 'technology', from gadgets to online-strategy. Its blend of creativity and informity will help you keep on top of technological trends and can also spark ideas, inspire design themes and help as a seed for linkbait concepts.

That their staff have coined terms such as 'crowdsourcing' and 'the long tail' gives an idea of the impact the magazine has had on the internet marketing industry; reading it every month is the only way to make sure that you're using their next bit of lingo, before it hits the big time.

Send: The Essential Guide to Email for Office and Home - by David Shipley & Will Schwalbe

Various people have written about how to manage email as part of your daily work life, but this book talks about the specifics of style and writing in the medium. It should help you create better understood, more expressive emails. Sam suggested this book; he said "It was recommended by an e-mail marketer friend and it changed the way I write (and read) e-mails. (...) Really useful."
(Buy online from: Amazon US or Amazon UK.)
 

Do you like this post? Yes No

http://www.seomoz.org/blog

Whiteboard Friday – Domain Authority & Page Authority Metrics

Posted by great scott!

This week we’ve got a special Whiteboard Friday double feature! As you’ve probably heard, we launched our new link checker and backlink analysis tool, Open Site Explorer, this week and it makes use of some exciting new metrics: Domain Authority and Page Authority. We asked our old chum, Will Critchlow, to talk to Rand about these metrics to help everyone understand what they are, what goes into them, how to use them, and why we created them.

Domain and Page Authority Metrics Comparisons

In Part One, Will and Rand discuss how to use these metrics to gain insight and intelligence on your (and your competitors’) pages, domains, and link profiles, as well as why these metrics can be a better predictor of ranking success than others that you may have used in the past.

In Part Two, the guys dive into detail about what exactly goes into Domain Authority & Page Authority: how they were modeled, how they compare to actual search results, why your DA & PA scores may change over time, and lots of other details to help you better understand how these metrics work.

Both videos are viewable below, simply select the one you’d like to watch from the playlist on the right of the player. I’d recommend watching them in order, but it’s not necessary.

These new metrics have already been quite popular among users of Open Site Explorer, and one of the big questions is, "When can I get them in the SEOmoz Firefox Toolbar?!"  Well, surprise, surprise, we’re on top of it! They’ll be available in the new toolbar update coming out next month…here’s a sneak peek :)

 

mozBar February 2010 preview
New scores, new features and much more are on their way in the February version of the mozbar

If you’ve got questions about Domain or Page Authority, please leave us feedback below. We’re trying to make these metrics as useful and valuable as possible and would love your suggestions.

Do you like this post? Yes No

Posted by great scott!

This week we've got a special Whiteboard Friday double feature! As you've probably heard, we launched our new link checker and backlink analysis tool, Open Site Explorer, this week and it makes use of some exciting new metrics: Domain Authority and Page Authority. We asked our old chum, Will Critchlow, to talk to Rand about these metrics to help everyone understand what they are, what goes into them, how to use them, and why we created them.

Domain and Page Authority Metrics Comparisons

In Part One, Will and Rand discuss how to use these metrics to gain insight and intelligence on your (and your competitors') pages, domains, and link profiles, as well as why these metrics can be a better predictor of ranking success than others that you may have used in the past.

In Part Two, the guys dive into detail about what exactly goes into Domain Authority & Page Authority: how they were modeled, how they compare to actual search results, why your DA & PA scores may change over time, and lots of other details to help you better understand how these metrics work.

Both videos are viewable below, simply select the one you'd like to watch from the playlist on the right of the player. I'd recommend watching them in order, but it's not necessary.





These new metrics have already been quite popular among users of Open Site Explorer, and one of the big questions is, "When can I get them in the SEOmoz Firefox Toolbar?!"  Well, surprise, surprise, we're on top of it! They'll be available in the new toolbar update coming out next month...here's a sneak peek :)

 

mozBar February 2010 preview
New scores, new features and much more are on their way in the February version of the mozbar

If you've got questions about Domain or Page Authority, please leave us feedback below. We're trying to make these metrics as useful and valuable as possible and would love your suggestions.


Do you like this post? Yes No

http://www.seomoz.org/blog

One Giant Leap for Link Data: Announcing Open Site Explorer + Page/Domain Authority Metrics

Posted by randfish

For the past 15 months, we’ve been working hard to improve Linkscape, our index of the WWW. Today, we’re releasing an entirely new platform for Linkscape’s index with more accessible data than ever before. And, for the next 48 hours, full functionality is available entirely for free:

Open Site Explorer

The new tool, Open Site Explorer, makes gathering, sorting and exporting link data easier than ever. It’s built with speed and accessibilty at the forefront and provides a tremendous amount of information about the links to any page or site. Since there’s a lot to cover, let’s dive right into some of the features and functionality.

#1 – Fast Access to Top Level Metrics

OSE Metrics

At the top of every results page, you’ll find the key metrics we have on your page – the importance/ranking ability of that URL (Page Authority) and root domain (Domain Authority), the number of linking root domains and the total number of links.

#2 – See Up to 10,000 Links Alongside Anchor Text & Key Metrics

OSE Link List

You can browse through up to 10,000 links (this is restricted to 1,000 for non-PRO members normally, but will be completely free to everyone for the first 48 hours). We also offer CSV export functionality, but it won’t be available until the weekend (and then, only to PRO members – CSV takes up a LOT of bandwidth for 10K rows :-) ).

#3 – Filtering for the Links You Want to See

OSE Filtering Options

As you drill down in the list of links, you can exclude nofollowed links or see only the 301s that point to a page. You also have the ability to sort by the location from which you want to see links – internal vs. external – and links that point to a given page, all pages on a subdomain or an entire root domain.

#4 – Display Root Domains that Contain Links

OSE Linking Domains

The second tab in Open Site Explorer (OSE for short) is the linking root domains. We realized that a lot of people want to get a quick glance of the types of sites that are sending links to a given page or domain, and thus created this unique view. In the future (probably a couple months away), you’ll also be able to click an individual domain and see a list of pages from that site that link to the target of your choice.

#5 – Review Anchor Text Term & Phrase Distribution

OSE Anchor Text Distribution

Anchor text is often the missing link in a "why does that guy rank there?" puzzle. We’re opening up the anchor text distribution so you can learn more about your own sites and pages and those of the competition. You can also sort by both the number of root domains that contain a link with a particular anchor text term (single word) or phrase and the raw number of links containing that anchor text.

#6 – Pie Chart Displays of Link Data 

OSE Data Pie Charts

Many SEOs worry that, particularly on small sites, they may be seeing lots of numbers of links, but the sources aren’t ideal. In this view, we try to illustrate through pie charts the percentage of links that come from internal vs. external pages and are followed vs. nofollowed. This view is at the top of the "full metrics" tab.

#7 - Rejoice in Data Junkie Heaven 

OSE Full Metrics

Additionally in the "full metrics" tab, you’ll find a list of all the Linkscape data we’ve got including mozRank (an algorithm similar to Google’s PageRank), mozTrust (akin to TrustRank) and many more. You can also see the more refined link counts and data for an individual URL, the subdomain it’s on and the hosting root domain.

#8 – Compare Pages/Sites Link Metrics to One Another

OSE Comparison

A frequently requested feature is the ability to compare one site/page against another. OSE makes this quick and easy with a comparison view drop-down. If you click the "-" symbol again, you can return to the individual report view.

#9 – Graphical Views of Metric Comparisons

OSE Comparative Metrics

In the comparison view, we show nice visual charts that you can embed in a client report or send to your boss to help illustrate just how challenging it might be to take on a particular competitor. For example, you can see above that Fred Wilson has a long way to go to reach Guy Kawasaki’s stats on his blog (granted, Guy’s posts are designed for a much broader audience and he’s been blogging for longer).

#10 – Compare Links Side by Side

OSE Links Side by Side

At the bottom of this comparative view you’ll see links side-by-side. We noticed a lot of SEOs open two browser windows with lists of links to compare them against one another and thought "why not make that easier?!" With this feature, you can scroll through the links for two pages to get a fast sense for the quality and variety of sources that point to each.

New Metrics – Domain Authority & Page Authority

OSE Metrics for NinebyBlue

We’ve got much more information coming soon about these two metrics, but basically, we’re using our ranking models to build predictions about how well an individual page might perform in the search engines (Page Authority) or how well content on a root domain would do (Domain Authority). These aren’t like PageRank or mozRank at all – they’re much broader.

Authority scores take into account all the metrics we have about a page and hundreds of derivatives of those metrics. We’ve put the scores on a classic 0-100 scale that’s logarithmic (so moving from a 50 to a 60 is much harder than moving from a 10 to a 20). Over time, these metrics will change and evolve as we get better and better with our machine learning systems (and as the engines and the web itself changes). Watch for this week’s Whiteboard Friday with much more detail on this subject. For now Open Site Explorer is the only place to get Domain/Page Authority data, but we’ll be rolling it into the SEOmoz toolbar and other tools over the next few months.

Linkscape’s Index Update

Linkscape itself has also updated – growing to a whopping 65 billion URLs with 45 day minimum freshness. As Nick’s previous post on the Trillion+ URLs Linkscape has seen shows, freshness is one of the most critical metrics for those who care about accurate link data, and we’re working hard to keep our index as up-to-date as possible. Linkscape recrawls every page in the index each month, so no "old data" is stored or served. Our current metrics for this index are:

  • Pages: 64,180,990,434 (65 billion)
    • 301s: 293 million
    • 302s: 672 million (Marshall Simmonds calls this "job security")
    • 404s: 360 million (but we do try to exclude known 404s in crawls, so this may be low percentage wise)
  • Subdomains: 259,977,972 (260 million)
  • Root Domains: 63,264,651 (63 million)
    • .com – 49.4%
    • .net – 6.4%
    • .de – 5.8%
    • .org – 5.2%
    • .ru – 2.5%
    • .cn – 2.5%
  • Links: 701,881,850,733 (701 billion)
    • Nofollows: 13 billion (1.85%)
    • Internal Nofollows: 9.06 billion vs. External Nofollows: 4.11 billion
    • Meta Refreshes: 40.9 million
    • Internal Links: 638 billion vs. External Links: 63 billion (people link to their own stuff a lot more than they do to others)
    • Feed Autodiscovery (i.e. RSS/Atom feeds): 2.261 billion
    • Rel=canonical: 100 million
    • Links passed through 301s: 8.61 billion (just over 1% of all links go through a 301)
  • mozRank Correlations to Google Toolbar PageRank
    • Individual page mR: 0.42 (avg. error +/- 0.56 from PR)
    • Subdomain mR: 0.45 (avg. error +/- 0.35 from PR)
    • Root domain mR: 0.45 (avg error +/- 0.37 from PR
  • File Extensions
    • html: 26.5%
    • php: 21.7%
    • htm: 10.6%
    • asp: 5.7%
    • aspx: 2.9%
    • cgi: 0.89%

API Update

Finally, we’ve also updated the SEOmoz API – you can now get lists of links for any URL for FREE along with tons of other link data and metrics. Sarah & Nick have a blog post coming soon with more, but for now, check out the API page to get a developer key and the API Wiki for more details.

Answers to Common Questions About OSE

What’s the difference between OSE and Linkscape?

Open Site Explorer provides a fast, free, more basic view of link data while Linkscape provides power users the ability to refine by dozens of filters, search within link anchor text, URLs and domains. Linkscape will let you dig into significantly more metrics and details on a per link basis on things like mozRank passed, Domain mozTrust, juice per anchor text, links from particular TLDs, etc.

OSE is substantively faster than Linkscape, and not as metrics heavy. It’s designed to give the "500 foot view" vs. the deep, in-the-weeds look you can get in Linkscape. Certainly feel free to try both and use the one that suits you best.

Why is OSE on a separate domain?

Three big reasons, actually:

  1. We’ve haven’t tried the microsite strategy in a long time (since the first launch of the Web 2.0 Awards), and want to test and see lots of SEO and strategic/branding (we’ll have some cool data to report in the next few weeks/months)
  2. OSE is built entirely on the SEOmoz API platform – we wanted to show off just how much you can build using that service :-)
  3. SEOmoz engineers are very busy working on another exciting launch (scheduled for June) so we wanted to split resources without putting a load on folks focused on our site (PRO members may see some previews of that even earlier)

What will OSE continue to offer for free?

For the first 48 hours, registered members (anyone with a free SEOmoz account) will get the full PRO features (unlimited metrics, up to 10K links per report, full anchor text data, etc). After that, anyone can still get up to 1,000 links per search and a sampling of metrics. You can see a full breakdown in the bottom right-hand corner of the homepage.

Why Call it "Open" Site Explorer?

We’re aiming to give out more link data than anyone else on the web for free. Open Site Explorer not only gives out lots and lots of links (up to 1,000), but also metrics and link numbers for free (permanently). We also provide a free API that lets you use any of the data (including lists of links) in your applications, public or private. Our goal is to be transparent with this data – to show exactly how many pages/domains are in our index, show accuracy with freshness and canonicalize and re-crawl like a search engine. We’re trying to take the web’s link graph and make it as available as possible and use the revenue component of PRO membership to accelerate growth on index freshness, quality and size.

Please Give Us Feedback!

We’d love to hear from you. If you have suggestions, bug reports (this is a first launch, after all) or ideas for future iterations, please leave them in the comments or send them via the Open Site Explorer feedback form. We’re of course very excited for the launch of OSE and would certainly appreciate you sharing and helping us spread it around. The free period ends at 8am Pacific on Friday, January 22nd, but PRO members will continue to be able to access all the features and unlimited reports (and free reports will still provide up to 1,000 links).

p.s. Two great posts with more information on this topic appeared in the last 24 hours and are worth sharing:

If you have more to share, feel free to link in the comments.

Do you like this post? Yes No

Posted by randfish

For the past 15 months, we've been working hard to improve Linkscape, our index of the WWW. Today, we're releasing an entirely new platform for Linkscape's index with more accessible data than ever before. And, for the next 48 hours, full functionality is available entirely for free:

Open Site Explorer

The new tool, Open Site Explorer, makes gathering, sorting and exporting link data easier than ever. It's built with speed and accessibilty at the forefront and provides a tremendous amount of information about the links to any page or site. Since there's a lot to cover, let's dive right into some of the features and functionality.

#1 - Fast Access to Top Level Metrics

OSE Metrics

At the top of every results page, you'll find the key metrics we have on your page - the importance/ranking ability of that URL (Page Authority) and root domain (Domain Authority), the number of linking root domains and the total number of links.

#2 - See Up to 10,000 Links Alongside Anchor Text & Key Metrics

OSE Link List

You can browse through up to 10,000 links (this is restricted to 1,000 for non-PRO members normally, but will be completely free to everyone for the first 48 hours). We also offer CSV export functionality, but it won't be available until the weekend (and then, only to PRO members - CSV takes up a LOT of bandwidth for 10K rows :-)).

#3 - Filtering for the Links You Want to See

OSE Filtering Options

As you drill down in the list of links, you can exclude nofollowed links or see only the 301s that point to a page. You also have the ability to sort by the location from which you want to see links - internal vs. external - and links that point to a given page, all pages on a subdomain or an entire root domain.

#4 - Display Root Domains that Contain Links

OSE Linking Domains

The second tab in Open Site Explorer (OSE for short) is the linking root domains. We realized that a lot of people want to get a quick glance of the types of sites that are sending links to a given page or domain, and thus created this unique view. In the future (probably a couple months away), you'll also be able to click an individual domain and see a list of pages from that site that link to the target of your choice.

#5 - Review Anchor Text Term & Phrase Distribution

OSE Anchor Text Distribution

Anchor text is often the missing link in a "why does that guy rank there?" puzzle. We're opening up the anchor text distribution so you can learn more about your own sites and pages and those of the competition. You can also sort by both the number of root domains that contain a link with a particular anchor text term (single word) or phrase and the raw number of links containing that anchor text.

#6 - Pie Chart Displays of Link Data 

OSE Data Pie Charts

Many SEOs worry that, particularly on small sites, they may be seeing lots of numbers of links, but the sources aren't ideal. In this view, we try to illustrate through pie charts the percentage of links that come from internal vs. external pages and are followed vs. nofollowed. This view is at the top of the "full metrics" tab.

#7 - Rejoice in Data Junkie Heaven 

OSE Full Metrics

Additionally in the "full metrics" tab, you'll find a list of all the Linkscape data we've got including mozRank (an algorithm similar to Google's PageRank), mozTrust (akin to TrustRank) and many more. You can also see the more refined link counts and data for an individual URL, the subdomain it's on and the hosting root domain.

#8 - Compare Pages/Sites Link Metrics to One Another

OSE Comparison

A frequently requested feature is the ability to compare one site/page against another. OSE makes this quick and easy with a comparison view drop-down. If you click the "-" symbol again, you can return to the individual report view.

#9 - Graphical Views of Metric Comparisons

OSE Comparative Metrics

In the comparison view, we show nice visual charts that you can embed in a client report or send to your boss to help illustrate just how challenging it might be to take on a particular competitor. For example, you can see above that Fred Wilson has a long way to go to reach Guy Kawasaki's stats on his blog (granted, Guy's posts are designed for a much broader audience and he's been blogging for longer).

#10 - Compare Links Side by Side

OSE Links Side by Side

At the bottom of this comparative view you'll see links side-by-side. We noticed a lot of SEOs open two browser windows with lists of links to compare them against one another and thought "why not make that easier?!" With this feature, you can scroll through the links for two pages to get a fast sense for the quality and variety of sources that point to each.

New Metrics - Domain Authority & Page Authority

OSE Metrics for NinebyBlue

We've got much more information coming soon about these two metrics, but basically, we're using our ranking models to build predictions about how well an individual page might perform in the search engines (Page Authority) or how well content on a root domain would do (Domain Authority). These aren't like PageRank or mozRank at all - they're much broader.

Authority scores take into account all the metrics we have about a page and hundreds of derivatives of those metrics. We've put the scores on a classic 0-100 scale that's logarithmic (so moving from a 50 to a 60 is much harder than moving from a 10 to a 20). Over time, these metrics will change and evolve as we get better and better with our machine learning systems (and as the engines and the web itself changes). Watch for this week's Whiteboard Friday with much more detail on this subject. For now Open Site Explorer is the only place to get Domain/Page Authority data, but we'll be rolling it into the SEOmoz toolbar and other tools over the next few months.

Linkscape's Index Update

Linkscape itself has also updated - growing to a whopping 65 billion URLs with 45 day minimum freshness. As Nick's previous post on the Trillion+ URLs Linkscape has seen shows, freshness is one of the most critical metrics for those who care about accurate link data, and we're working hard to keep our index as up-to-date as possible. Linkscape recrawls every page in the index each month, so no "old data" is stored or served. Our current metrics for this index are:

  • Pages: 64,180,990,434 (65 billion)
    • 301s: 293 million
    • 302s: 672 million (Marshall Simmonds calls this "job security")
    • 404s: 360 million (but we do try to exclude known 404s in crawls, so this may be low percentage wise)
  • Subdomains: 259,977,972 (260 million)
  • Root Domains: 63,264,651 (63 million)
    • .com - 49.4%
    • .net - 6.4%
    • .de - 5.8%
    • .org - 5.2%
    • .ru - 2.5%
    • .cn - 2.5%
  • Links: 701,881,850,733 (701 billion)
    • Nofollows: 13 billion (1.85%)
    • Internal Nofollows: 9.06 billion vs. External Nofollows: 4.11 billion
    • Meta Refreshes: 40.9 million
    • Internal Links: 638 billion vs. External Links: 63 billion (people link to their own stuff a lot more than they do to others)
    • Feed Autodiscovery (i.e. RSS/Atom feeds): 2.261 billion
    • Rel=canonical: 100 million
    • Links passed through 301s: 8.61 billion (just over 1% of all links go through a 301)
  • mozRank Correlations to Google Toolbar PageRank
    • Individual page mR: 0.42 (avg. error +/- 0.56 from PR)
    • Subdomain mR: 0.45 (avg. error +/- 0.35 from PR)
    • Root domain mR: 0.45 (avg error +/- 0.37 from PR
  • File Extensions
    • html: 26.5%
    • php: 21.7%
    • htm: 10.6%
    • asp: 5.7%
    • aspx: 2.9%
    • cgi: 0.89%

API Update

Finally, we've also updated the SEOmoz API - you can now get lists of links for any URL for FREE along with tons of other link data and metrics. Sarah & Nick have a blog post coming soon with more, but for now, check out the API page to get a developer key and the API Wiki for more details.

Answers to Common Questions About OSE

What's the difference between OSE and Linkscape?

Open Site Explorer provides a fast, free, more basic view of link data while Linkscape provides power users the ability to refine by dozens of filters, search within link anchor text, URLs and domains. Linkscape will let you dig into significantly more metrics and details on a per link basis on things like mozRank passed, Domain mozTrust, juice per anchor text, links from particular TLDs, etc.

OSE is substantively faster than Linkscape, and not as metrics heavy. It's designed to give the "500 foot view" vs. the deep, in-the-weeds look you can get in Linkscape. Certainly feel free to try both and use the one that suits you best.

Why is OSE on a separate domain?

Three big reasons, actually:

  1. We've haven't tried the microsite strategy in a long time (since the first launch of the Web 2.0 Awards), and want to test and see lots of SEO and strategic/branding (we'll have some cool data to report in the next few weeks/months)
  2. OSE is built entirely on the SEOmoz API platform - we wanted to show off just how much you can build using that service :-)
  3. SEOmoz engineers are very busy working on another exciting launch (scheduled for June) so we wanted to split resources without putting a load on folks focused on our site (PRO members may see some previews of that even earlier)

What will OSE continue to offer for free?

For the first 48 hours, registered members (anyone with a free SEOmoz account) will get the full PRO features (unlimited metrics, up to 10K links per report, full anchor text data, etc). After that, anyone can still get up to 1,000 links per search and a sampling of metrics. You can see a full breakdown in the bottom right-hand corner of the homepage.

Why Call it "Open" Site Explorer?

We're aiming to give out more link data than anyone else on the web for free. Open Site Explorer not only gives out lots and lots of links (up to 1,000), but also metrics and link numbers for free (permanently). We also provide a free API that lets you use any of the data (including lists of links) in your applications, public or private. Our goal is to be transparent with this data - to show exactly how many pages/domains are in our index, show accuracy with freshness and canonicalize and re-crawl like a search engine. We're trying to take the web's link graph and make it as available as possible and use the revenue component of PRO membership to accelerate growth on index freshness, quality and size.

Please Give Us Feedback!

We'd love to hear from you. If you have suggestions, bug reports (this is a first launch, after all) or ideas for future iterations, please leave them in the comments or send them via the Open Site Explorer feedback form. We're of course very excited for the launch of OSE and would certainly appreciate you sharing and helping us spread it around. The free period ends at 8am Pacific on Friday, January 22nd, but PRO members will continue to be able to access all the features and unlimited reports (and free reports will still provide up to 1,000 links).

p.s. Two great posts with more information on this topic appeared in the last 24 hours and are worth sharing:

If you have more to share, feel free to link in the comments.


Do you like this post? Yes No

http://www.seomoz.org/blog

Page 1 of 2112345»1020...Last »

Seth Godin: Sliced Bread

Malcolm Gladwell: Outliers

Anthony Parinello: Your Price is Too High

© Kaboodle Ventures LLC 2009
Blog WebMastered by All in One Webmaster.