A Data Science Central Community
When you do a search for "career objectives" on Google India (www.google.in), the first result showing up is from a US-based job board specializing in data mining and analytical jobs. The Google link in question redirects to a page that does not even contain the string "career objective". In short, Google is pushing a US web site that has nothing to do with "career objectives" as the #1 web site for "career objectives" in India. In addition, Google totally failed to recognize that the web site in question is about analytics and data mining.
So here's an idea to improve search engine indexes, and to develop better search engine technology:
This feature could be implemented by having webmasters using special blocking meta tags in web pages, recognized by the search engines willing to implement them.
Comment
Regarding the idea to build a website that provides search result pages not just for keywords, but also for related links, I've found one that provides high quality search results when someone is searching for related links. Its name is similarsites.com, and you can check the results, if you search for websites similar to Analyticbridge, by clicking on www.similarsites.com/site/analyticbridge.com.
Clearly its strengths is to show related websites (which link to the target domain, in this case Analyticbridge), by ordering the results (related domains) using a combination of outgoing links and website traffic.
You can create a search engine like Similarsites by building a table with the top 1 million websites (available for download at www.quantcast.com/top-sites-1), and for each of these 1 million websites, have up to 100 related websites (also from the same list of 1 million domains). So you could have a great search engine with a database containing less than 100 x 1 million pair of (related) domains: that's a data set fairly easy to manage (not too big).
To protect your webpage from unwanted traffic you may just disable Alexa, Quantcast, etc. code for bad visits.
So visitor can see his content and measurement tools aren't affected (display measure code only for good visits).
If you block a crawler you may loose you pagerank and many good visitors with it. And GoogleBot is probably the same in India and in US too.
Good point Jozo. Not sure where you would block the traffic, I've been thinking to block google.in via robots.txt, as this would
Blocking can also be made via .htaccess. Here's an example of .htaccess file which blocks lots of undesirable traffic: http://multifinanceit.com/htaccess.txt.
If I add "career objective" in the block list, users visiting the website, following a search query with this keyword, would be redirected to an "access denied" page.
Vincent, can't you write set of rules what would handle a traffic from unwanted sources?
e.g. IF HTTP_REFERRER like "%google.in%q=%career%' THEN dont_count_a_visit
© 2021 TechTarget, Inc.
Powered by
Badges | Report an Issue | Privacy Policy | Terms of Service
Most Popular Content on DSC
To not miss this type of content in the future, subscribe to our newsletter.
Other popular resources
Archives: 2008-2014 | 2015-2016 | 2017-2019 | Book 1 | Book 2 | More
Most popular articles
You need to be a member of AnalyticBridge to add comments!
Join AnalyticBridge