How to get billions of pages in Google


So Bobby pointed this out to me. A nice find on how to spam Google and get billions of pages indexed with a 3-week-old domain.

Here’s the 5,090,000,000 indexed pages search:

Search results for the domain

All the SERPs lead to a second domain, which by the way has 2,460,000,000 indexed pages too:

Search results for the domain

And if you haven’t noticed it yet, yes: each landing page for all the ads on the domain leads to another page with AdSense. Yeah baby. There’s the advertisers’ money.

Some Alexa data:

Alexa rank Alexa reach per million

And guess what, the final blow: the owner is Romanian, from Iasi:

Domain Name: T1PS2SEE.COM
Tips 2 see SRL
Brausova Elena ()
Grenoblea 10
Iasi,2983, RO
Tel. 40.98198289

How did these spammers accomplish such an immense number of pages in such a short time? Easy: millions of subdomains with a few hundred pages each.

Google will never allow a single domain to have billions of pages. Heck, look at Dmoz, Microsoft, Apple, Yahoo, Google and so on: none of them has anywhere near that many.

So they artificially generated millions of subdomains with a few hundred pages each.
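A quick back-of-the-envelope check of that claim, as a rough sketch only. The page count is the figure from the site: search quoted above; the pages-per-subdomain values are assumed stand-ins for "a few hundred", since no exact number is known.

```python
# Indexed-page count reported by the site: query quoted in the post.
INDEXED_PAGES = 5_090_000_000


def subdomains_needed(total_pages: int, pages_per_subdomain: int) -> int:
    """Return how many subdomains are needed to hold total_pages (rounded up)."""
    return -(-total_pages // pages_per_subdomain)


# Assumed values for "a few hundred pages each"; the real figure is unknown.
for pages_each in (200, 300, 500):
    print(pages_each, subdomains_needed(INDEXED_PAGES, pages_each))
```

With these assumed values the result lands somewhere between roughly 10 and 25 million subdomains, which fits the "millions of subdomains" explanation.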

Update: Adam Lasnik from Google commented on Threadwatch:

Hey there,

Regarding the “5 billion indexed”…

We have noticed that some site: queries are showing bizarre results and it’s turned out to be tied to a bad data push. We’re fixing it now.

C’mon man, that’s weird. 5 billion indexed pages with AdSense on them after only 3 weeks, and only once the whole Internet is discussing them does Google have a problem with the site: operator?

Update: CReed from Digitalpoint directed me to this info on how spammers are able to spam the search engines through exploits of Blogspot’s subdomains:

Many of these subdomains have large numbers of backlinks that may never get updated, providing the opportunistic with instant PageRank and traffic (human and bots). Google search results provide a regularly updated list of the available subdomains if you know which search terms to use.

Someone creative has grabbed these available subdomains and created what appears to be regular 404 error pages like one you would normally see if you visited a blog that had moved.

The difference is that these faked error pages are loaded with hundreds of hidden links to subdomains that have been flooding Google’s search results.

If you’d like to see what this guy’s been up to – view the source code of the 404 error pages listed in the above search results.
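If you would rather not read the page source by eye, here is a minimal sketch of how one might count hidden links in a saved copy of such a page. It assumes the links are hidden with inline CSS (`display:none` or `visibility:hidden`); how these particular pages actually hid them is not stated, and the class name, helper function and example.com URLs are hypothetical.

```python
from html.parser import HTMLParser

# Tags that never have a closing tag, so they must not be pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


def is_hidden_style(style: str) -> bool:
    """True if an inline style hides the element (assumed hiding technique)."""
    compact = style.lower().replace(" ", "")
    return "display:none" in compact or "visibility:hidden" in compact


class HiddenLinkCounter(HTMLParser):
    """Collects link targets, split by whether an ancestor is hidden."""

    def __init__(self):
        super().__init__()
        self.stack = []          # (tag, hidden) for each currently open element
        self.hidden_links = []   # hrefs of links inside hidden elements
        self.visible_links = []  # hrefs of all other links

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        parent_hidden = bool(self.stack) and self.stack[-1][1]
        hidden = parent_hidden or is_hidden_style(attrs.get("style") or "")
        if tag == "a" and attrs.get("href"):
            target = self.hidden_links if hidden else self.visible_links
            target.append(attrs["href"])
        if tag not in VOID_TAGS:
            self.stack.append((tag, hidden))

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating sloppy markup.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break


# Made-up page standing in for a faked 404 page.
sample = """
<html><body>
<h1>Not Found</h1>
<a href="http://example.com/home">Home</a>
<div style="display: none">
  <a href="http://a.example.com/">a</a><br>
  <a href="http://b.example.com/">b</a>
</div>
</body></html>
"""

counter = HiddenLinkCounter()
counter.feed(sample)
print(len(counter.hidden_links), len(counter.visible_links))
```

Links hidden through an external stylesheet, off-screen positioning or tiny fonts would not be caught by this check; it only looks at inline styles on the link or its ancestors.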

And apparently the guy had not only those two domains but TONS of others on the same IP.

Since the Google team handled the thing by hand and banned the IP, we can’t know what the total number of indexed pages for those domains was. But my guess: tens of billions of pages.

This comment on Threadwatch is interesting too.


7 Comments so far

TheHoff said:
June 18th, 2006

Lovely, isn’t it?

The guy changed the sites while everyone was talking about them. The first one with 5 billion used to have its own pages up with 3 blocks of Adsense. During the discussion he changed it all to redirect to the second domain.

June 18th, 2006

I can’t even imagine how the heck Adsense doesn’t have some automatic filters or sandbox.

How can one publisher generate billions of ad impressions with a 3-week-old website and not ring a bell or show up as a red blip on Adsense’s radar?

And it doesn’t matter if she switched the Adsense. The other domain is hers too.

Nice find man.

Mike said:
June 18th, 2006

What have you done? The poor guy is now banned from Google, and Google has half as many pages in its index... lol

June 19th, 2006

LOL .. That’s what spammers get ;)

Janos said:
June 19th, 2006

/* Some humor follows (translated from Romanian) */

And now the whole Iasi web mafia is on your trail… :)

On a more serious note, you still haven’t convinced me about that sitewide links thing; I’m still waiting for a comment on my blog.

June 19th, 2006

/* Some English black magic humor follows */

Please write in English :D

As for the site-wide links, I tried my best to explain it to you.

Janos said:
June 19th, 2006

All right, in English now:

And now all the web mafia from Iasi will be on your tail.

I’m still waiting for your comments on the issue. Maybe I’m dumb, but I still don’t get the logic…

Sorry for the offtopic comments.
