Hadoop and Big Data Resources

6 reasons why 2012 could be the year of Hadoop
http://gigaom.com/cloud/six-reasons-why-2012-could-be-the-year-of-hadoop/

Defining Hadoop: the Players, Technologies and Challenges of 2011
http://pro.gigaom.com/2011/03/defining-hadoop-the-players-technologies-and-challenges-of-2011/

The Hadoop project includes these subprojects:

Other Hadoop-related projects at Apache include:

  • Avro™: A data serialization system.
  • Cassandra™: A scalable multi-master database with no single points of failure.
  • Chukwa™: A data collection system for managing large distributed systems.
  • HBase™: A scalable, distributed database that supports structured data storage for large tables.
  • Hive™: A data warehouse infrastructure that provides data summarization and ad hoc querying.
  • Mahout™: A Scalable machine learning and data mining library.
  • Pig™: A high-level data-flow language and execution framework for parallel computation.
  • ZooKeeper™: A high-performance coordination service for distributed applications.
Free On Line Course:
Other Resources:
Advertisements

2 thoughts on “Hadoop and Big Data Resources

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s