I like angry techs

I wrote about Hadoop some time ago (for ā€œsome reasonsā€) and I did EXACTLY what Ted Dziubas says here. But I’m not offended at all: he is 100% right! And I strongly suggest to take a look at this article: in the middle of his swearing there is, actually, some good stuff about ā€œwhat is going on in the Web2.0 Yet Again Exploding Bubble!!!

August 12, 2008 Ā· 1 min Ā· 64 words

Apache Hadoop on Mac OS X

For some reasons I started to play with Apache Hadoop (Core): Hadoop is a software platform that lets one easily write and run applications that process vast amounts of data. Here’s what makes Hadoop especially useful: Scalable: Hadoop can reliably store and process petabytes. Economical: It distributes the data and processing across clusters of commonly available computers. These clusters can number into the thousands of nodes. Efficient: By distributing the data, Hadoop can process it in parallel on the nodes where the data is located. This makes it extremely rapid. Reliable: Hadoop automatically maintains multiple copies of data and automatically redeploys computing tasks based on failures. Hadoop implements MapReduce, using theHadoop Distributed File System (HDFS). MapReduce divides applications into many small blocks of work. HDFS creates multiple replicas of data blocks for reliability, placing them on compute nodes around the cluster. MapReduce can then process the data where it is located. Hadoop has been demonstrated on clusters with 2000 nodes. The current design target is 10,000 node clusters. I followed the Quickstart guide and I can confirm that it works on [en:Mac OS X] too, but I managed only to make it run in ā€œstandaloneā€ mode: usefull for first-stage development and debugging. ...

April 20, 2008 Ā· 2 min Ā· 298 words