| string | same | % | diff. | % |
| kimcom | 61 | 96.8% | 2 | 3.2% |
| kimdot | 61 | 96.8% | 2 | 3.2% |
| megaracer | 61 | 96.8% | 2 | 3.2% |
| megauploads | 61 | 96.8% | 2 | 3.2% |
| beatz | 59 | 93.7% | 4 | 6.3% |
| megaupload | 57 | 90.5% | 6 | 9.5% |
| schmitz | 55 | 87.3% | 8 | 12.7% |
| swizz beatz | 55 | 87.3% | 8 | 12.7% |
| swizz | 53 | 84.1% | 10 | 15.9% |
| com | 49 | 77.8% | 14 | 22.2% |
| vestor | 48 | 76.2% | 15 | 23.8% |
| vestor vestor | 48 | 76.2% | 15 | 23.8% |
| dotcom | 47 | 74.6% | 16 | 25.4% |
| alicia | 45 | 71.4% | 18 | 28.6% |
| com schmitz | 42 | 66.7% | 21 | 33.3% |
| schmitz dotcom | 41 | 65.1% | 22 | 34.9% |
| tim | 41 | 65.1% | 22 | 34.9% |
| alicia keys | 40 | 63.5% | 23 | 36.5% |
| jim | 40 | 63.5% | 23 | 36.5% |
| kim | 40 | 63.5% | 23 | 36.5% |
| kim tim | 40 | 63.5% | 23 | 36.5% |
| tim schmitz | 40 | 63.5% | 23 | 36.5% |
| jim tim | 39 | 61.9% | 24 | 38.1% |
| kim schmitz | 39 | 61.9% | 24 | 38.1% |
| tim jim | 39 | 61.9% | 24 | 38.1% |
| tim kim | 39 | 61.9% | 24 | 38.1% |
| jim kim | 38 | 60.3% | 25 | 39.7% |
| jim schmitz | 38 | 60.3% | 25 | 39.7% |
| mega | 38 | 60.3% | 25 | 39.7% |
| mr. schmitz | 38 | 60.3% | 25 | 39.7% |
| dot com | 37 | 58.7% | 26 | 41.3% |
| mr. dotcom | 37 | 58.7% | 26 | 41.3% |
| tim dotcom | 37 | 58.7% | 26 | 41.3% |
| kim dotcom | 36 | 57.1% | 27 | 42.9% |
| kim jim | 36 | 57.1% | 27 | 42.9% |
| vestor dotcom | 36 | 57.1% | 27 | 42.9% |
| jim dotcom | 35 | 55.6% | 28 | 44.4% |
| mr dotcom | 35 | 55.6% | 28 | 44.4% |
| call of duty | 33 | 52.4% | 30 | 47.6% |
| tim vestor | 32 | 50.8% | 31 | 49.2% |
| vestor tim | 32 | 50.8% | 31 | 49.2% |
| kim vestor | 31 | 49.2% | 32 | 50.8% |
| mr schmitz | 31 | 49.2% | 32 | 50.8% |
| jim vestor | 30 | 47.6% | 33 | 52.4% |
| dot schmitz | 29 | 46.0% | 34 | 54.0% |
| vestor kim | 29 | 46.0% | 34 | 54.0% |
| vestor jim | 28 | 44.4% | 35 | 55.6% |
| mega racer | 19 | 30.2% | 44 | 69.8% |
| mega upload | 19 | 30.2% | 44 | 69.8% |
| bit | 16 | 25.4% | 47 | 74.6% |
| hit | 16 | 25.4% | 47 | 74.6% |
| racer | 15 | 23.8% | 48 | 76.2% |
| dot | 13 | 20.6% | 50 | 79.4% |
| upload | 10 | 15.9% | 53 | 84.1% |
| a | 9 | 14.3% | 54 | 85.7% |
| cod | 4 | 6.3% | 59 | 93.7% |
| uploads | 4 | 6.3% | 59 | 93.7% |
| its a | 3 | 4.8% | 60 | 95.2% |
| a hit | 2 | 3.2% | 61 | 96.8% |
| call of | 2 | 3.2% | 61 | 96.8% |
| bit by bit | 1 | 1.6% | 62 | 98.4% |
| by | 1 | 1.6% | 62 | 98.4% |
| call duty | 1 | 1.6% | 62 | 98.4% |
| its a hit | 1 | 1.6% | 62 | 98.4% |
| mega uploads | 1 | 1.6% | 62 | 98.4% |
| of | 1 | 1.6% | 62 | 98.4% |
| of duty | 1 | 1.6% | 62 | 98.4% |
| vestor limited | 1 | 1.6% | 62 | 98.4% |
| bit by | 0 | 0.0% | 63 | 100.0% |
| bit it’s a | 0 | 0.0% | 63 | 100.0% |
| by bit | 0 | 0.0% | 63 | 100.0% |
| by bit by | 0 | 0.0% | 63 | 100.0% |
| by bit its | 0 | 0.0% | 63 | 100.0% |
| call | 0 | 0.0% | 63 | 100.0% |
| duty | 0 | 0.0% | 63 | 100.0% |
| guilty | 0 | 0.0% | 63 | 100.0% |
| its | 0 | 0.0% | 63 | 100.0% |
| its hit | 0 | 0.0% | 63 | 100.0% |
| keys | 0 | 0.0% | 63 | 100.0% |
| megaupload limited | 0 | 0.0% | 63 | 100.0% |
With a different language translation demonstration, "The Lorem Ipsum Of It All" established in a different branch of The V Decision Tree Project, "getting lost in the translation" takes on a different dynamic far beyond a simple yes or no symbol match-up.
In my post Welcome to A Down In the Dumps Demo, I extract a significant decline in tally for the plural of the string derivative megaupload and use it to help explain one possible influence on the ranking of a site at any given moment.
This particular demonstration combine the two topics together, using the Google Translate service as the sample provider. In knowing there can be potential for significant mistranslation, this demonstration does not rely on a need to appear semantically and/or linguistically accurate.
Instead, it is relies on an exact match boundary providing clear yes or no matches between languages other than English and how this can be a force driving a spike in a string derivatvie market.
The chart above use a variety of core strings from the megaupload circumstances. Each string was translated into each of the 63 language options Google Translate offered to an end-user and then compared against its english version.
4 words stayed identical throughout the sampling except for 2 languages: kimcom, kimdot, megaracer and megauploads. It is important to note that the creation of megaracer (Call of Duty identity) and megauploads are credited to Kim Schmitz (Dotcom) while kimdot was a combo I threw in as a random string derivative not significantly present in existing online materials. This means that theoretically content can be written in 59 different languages and these 4 words will remain entirely intact as-is.
This is an important facet to absorb. Let's say a news outlet publishes one identical news story in 59 different languages. While some words are not guaranteed transference of credit for existing in an english format, these 4 words are theoretically guaranteed to pull all 59 pages in the 59 different languages as a part of a results array searching for one of those 4 words.
On the contrary, the slogan of "bit by bit, its a hit, its a hit" has variations with all 63 languages available with the Google Translate service. To have a chance at competing in these types of string derivative markets, search engine optimization techniques are typically applied on mass scales. This means despite the 63 different variations of the word "bit," the phrase is able to be duplicated in a viral manner more often than not undetected by automated means and methods, let alone recognizable by the human eye.
For example, in The Comment Factor Observatory, I have a demonstration displaying the ease of structuring generic sets of strings able to pass at-a-glance observation when in a comment setting. This application of generic sets of strings extends far beyond any designated comment section, which means thousands of pages generated from article generators could feasibly cause a temporary spike to these generic words currently in play. Throw into this mix the thousands of campaigns upon thousands of campaigns for people to quick comment in some pre-specified manner and this particular examination method provides an invaluable glimpse into what may be behind a spike or a drop.
Ultimately, once all other search engine indexes are thrown back into this mix, the percentages generated by use of the Google Translate service revert back into more of a variable position and are no more or less valuable than any other snapshot image of this type of statistical analysis. And yet, to have an idea of just how much someone (or something) is relying on when it comes to the task of translating knowledge from one language to another can give a string derivative competitor yet another component to work with...or against.