[Cascavel-pm] [OFF] Yahoo claims 2-petabyte database is world's biggest, busiest

Nelson Ferraz nferraz at gmail.com
Thu May 22 20:42:27 PDT 2008


I found this news item interesting because it demonstrates PostgreSQL's potential.

Do you know of other success stories?

"Take Yahoo Inc.'s 2-petabyte, specially built data warehouse, which
it uses to analyze the behavior of its half-billion Web visitors per
month. The Sunnyvale, Calif.-based company makes a strong claim that
it is not only the world's single-largest database, but also the
busiest.

Based on a heavily modified PostgreSQL engine, the year-old database
processes 24 billion events a day, according to Waqar Hasan, vice
president of engineering in Yahoo's data group.

And the data, all of it constantly accessed and all of it stored in a
structured, ready-to-crunch form, is expected to grow into the
multiple tens of petabytes by next year.

By comparison, large enterprise databases typically grow no larger
than the tens of terabytes. Large databases about which much is
publicly known include the Internal Revenue Service's data warehouse,
which weighs in at a svelte 150TB. (...)

The database requires fewer than 1,000 PC servers hosted at several
data centers, said Hasan, who declined to reveal the exact number. He
did claim that the number of servers used is one-tenth to
one-twentieth fewer than the number that would be needed if the
database were a conventional one such as Oracle, IBM's DB2 or NCR's
Teradata."

http://www.computerworld.com/action/article.do?command=viewArticleBasic&articleId=9087918

-- 
Nelson Ferraz

Free Software Foundation Associate Member #3203
Projeto Software Livre Brasil (www.softwarelivre.org.br)
Sociedade Perl do Brasil (www.perl.org.br)
Rede Livre de Compartilhamento de Cultura Digital

