We hear several stories on forums about bloggers and webmasters who are complaining about sneaky content duplicators. These people can be categorized as “very” lazy individuals who don’t want to think on writing creatively or brainstorm on the subject matter. What they want to do is search the subject keyword(s) on Google and then copy and paste the document on their website. Voila! an instant content. They then offer it to Googlebots so that it would be indexed along with the original one.
The worse thing webmasters thought was that, What if Google thought my site is the duplicate one? Many of them are scared of the fact that they might get penalize because of the wrong deed of another individual. Honestly, we don’t believe that this is 100% true but for the sake of this article we will discuss that on succeeding posts. Going back to our topic, I guess this news will make your Christmas holiday merrier.
Google filed at U.S. Patent and Trademark Office last December 1, 2009 for the Duplicate document detection in a web crawler system. The purpose of which is simple – to reduce content theft in the web. We are fully aware that this web crisis is proliferating heavily in cyberspace today. It’s alright to become lazy in a smart way i.e. thinking of a method to earn money without a strict 9-5 job. However, you must also understand that you have to exert effort in order to achieve that. There’s no such thing as “free” cashflow in cyber or in real world. Don’t believe others who tell you that it’s “very” easy to earn money online. Well, it becomes easy once you know ‘how to do it’.
Here’s an excerpt of the abstract from Google’s Duplicate content detection system:
Duplicate documents are detected in a web crawler system. Upon receiving a newly crawled document, a set of documents, if any, sharing the same content as the newly crawled document is identified. Information identifying the newly crawled document and the selected set of documents is merged into information identifying a new set of documents…
The inventors of this system are:
- Dulitz; Daniel
- Verstak; Alexandre A.
- Ghemawat; Sanjay
- Dean; Jeffrey A
Assignee: Google Inc. (Mountain View, CA)
You can read the rest of the document on the USPTO site. Thus, bloggers who abuse control “A”, “C” and “V” of their PC and laptops should really think twice of doing it nowadays. We sincerely believe that Google is preparing for something BIG come 2010. Google Caffeine is launching, several features added on Google search, Google hot trends is decreasing in numbers and this one. On the other hand, there’s nothing to be scared of and these changes are not threats at all. What we need is proper education on the way things work in the internet. We should be aware that some “Old hat” tricks and Old rules don’t work anymore.
You might also like
Story by pinoytutorial
Tags: Content theft, Duplicate content detection system, Duplicate content issue, Duplicate document detection, google, USPTO Google, Web Matters, Web news



