How to do Keyword Research
Google Keyword planner information
http://www.catalystsearchmarketing.com/how-to-the-keyword-planner-for-seo/
https://adwords.google.com/o/KeywordTool
Bing Keyword Planner
http://advertise.bingads.microsoft.com/en-us/bing-ads-intelligence
Magento and Robots.txt
For all new Magento sites be sure to include a robots.txt. There’s a major problem with robots spidering the search results. It creates a huge server resource issue as well as an SEO issue. Feel free to edit/comment on this as Magento evolves and things need to be added/removed.
## robots.txt for Magento Community and Enterprise
## http://turnkeye.com/blog/optimize-robots-txt-for-magento/
## GENERAL SETTINGS
## Enable robots.txt rules for all crawlers
User-agent: *
## Crawl-delay parameter: number of seconds to wait between successive requests to the same server.
## Set a custom crawl rate if you're experiencing traffic problems with your server.
# Crawl-delay: 30
## Magento sitemap: uncomment and replace the URL to your Magento sitemap file
# Sitemap: http://www.example.com/sitemap/sitemap.xml
## DEVELOPMENT RELATED SETTINGS
## Do not crawl development files and folders: CVS, svn directories and dump files
Disallow: /CVS
Disallow: /*.svn$
Disallow: /*.idea$
Disallow: /*.sql$
Disallow: /*.tgz$
## GENERAL MAGENTO SETTINGS
## Do not crawl Magento admin page
Disallow: /admin/
## Do not crawl common Magento technical folders
Disallow: /app/
Disallow: /downloader/
Disallow: /errors/
Disallow: /includes/
Disallow: /js/
Disallow: /lib/
Disallow: /media/css/
Disallow: /media/css_secure/
Disallow: /media/customer/
Disallow: /media/downloadable/
Disallow: /media/favicon/
Disallow: /media/import/
Disallow: /media/js/
Disallow: /media/sales/
Disallow: /media/tmp/
Disallow: /media/wysiwyg/
Disallow: /media/xmlconnect/
Disallow: /pkginfo/
Disallow: /shell/
Disallow: /skin/
Disallow: /var/
## Do not crawl common Magento files
Disallow: /api.php
Disallow: /cron.php
Disallow: /cron.sh
Disallow: /error_log
Disallow: /get.php
Disallow: /install.php
Disallow: /LICENSE.html
Disallow: /LICENSE.txt
Disallow: /LICENSE_AFL.txt
Disallow: /README.txt
Disallow: /RELEASE_NOTES.txt
## MAGENTO SEO IMPROVEMENTS
## Do not crawl sub category pages that are sorted or filtered.
Disallow: /*?dir*
Disallow: /*?dir=desc
Disallow: /*?dir=asc
Disallow: /*?limit=all
Disallow: /*?mode*
## Do not crawl 2-nd home page copy (example.com/index.php/). Uncomment it only if you activated Magento SEO URLs.
## Disallow: /index.php/
## Do not crawl links with session IDs
Disallow: /*?SID=
## Do not crawl checkout and user account pages
Disallow: /checkout/
Disallow: /onestepcheckout/
Disallow: /customer/
Disallow: /customer/account/
Disallow: /customer/account/login/
## Do not crawl seach pages and not-SEO optimized catalog links
Disallow: /catalogsearch/
Disallow: /catalog/product_compare/
Disallow: /catalog/category/view/
Disallow: /catalog/product/view/
## SERVER SETTINGS
## Do not crawl common server technical folders and files
Disallow: /cgi-bin/
Disallow: /cleanup.php
Disallow: /apc.php
Disallow: /memcache.php
Disallow: /phpinfo.php
## PROJECT SPECIFIC FILES AND FOLDERS
## Do not crawl these files and folders that are developer created
#Disallow: /staging-only/
#Disallow: /m13apc.php
## IMAGE CRAWLERS SETTINGS
## Extra: Uncomment if you do not wish Google and Bing to index your images
# User-agent: Googlebot-Image
# Disallow: /
# User-agent: msnbot-media
# Disallow: /
Ajax and SEO
Ajax and SEO. The article focuses on Bing adding some features but talks about techniques for all.
Nested Sitemaps
Ever need more than one site map. Don’t want to have to submit them separate? Nest that shit.
SEO Redirects 301, 302, 404, 410
301 – Permanent move of the content: For site launches to direct users to new urls of the new site structure.
302 – Temporary move of content: Say a product is out of inventory, page removed for a while. Use this to direct users to a different similar page.
404 – Content removed: Google looks at these as broken, though they’re just pages that don’t extist. Particularly problematic on incoming links. All other codes are better than this one for non-existent pages.
410 – Content Permanently removed. Better than a 404 for missing content.
Some info
http://en.wikipedia.org/wiki/List_of_HTTP_status_codes
http://www.seomoz.org/blog/how-should-you-handle-expired-content
Recent Comments