March 18, 2014
Learn more about writing MapReduce programs in the language of your choice using Hadoop Streaming.
February 14, 2014
Using ADF allows developers to minimize the coding effort to build an application's infrastructure and allows them to concentrate more on implementing the complex business logic of the application.
January 29, 2014
Apache HBase is a distributed, non-relational, open source database written in Java that runs on top of HDFS. HBase is a suitable candidate when you have hundreds of millions or billions of rows and enough hardware to support it. Learn more about its practical use and architectural concepts.
January 23, 2014
Cassandra is an ideal database for managing a large volume of unstructured, semi-structured and structured data across multiple data centers and the cloud environment. Explore how to get started.
January 17, 2014
Regardless of which movie you’re watching, Field of Dreams or Catch-22, customer demand and provider capacity aren’t lining up as efficiently as anyone would like.
January 14, 2014
Network Functions Virtualization (NFV) is a specification for abstracting network hardware in order to move the control of network functions to software.
January 10, 2014
Explore a basic vision for single-core and multicore approaches to indexing and querying multiple log file types in Apache Solr.
December 30, 2013
Apache Hive provides a mechanism to manage data in a distributed environment and query it using an SQL-like language called Hive Query Language, or HiveQL. This article will discuss Hive scripts and execution.
December 26, 2013
The whole idea of a Private Cloud is that it’s supposed to be, well, private. As in, nobody else’s business. SoftLayer's “Private” Cloud is actually a single-tenant VPC.
December 16, 2013
Kaushik Pal provides some samples and tips on how to use Apache Pig for efficient analysis of large data sets.
December 2, 2013
We're finally figuring out how to collect, store, analyze, and manage far larger and more diverse data sets across the enterprise. We can’t let Big Data be just another management fad.
November 27, 2013
Kaushik Pal explores the basics of the Hadoop Distributed File System (HDFS), the underlying file system of the Apache Hadoop framework.
November 26, 2013
Nobody really wants a Private Cloud.
November 15, 2013
The greatest bottleneck in any large-scale Hadoop deployment is the local network.
November 8, 2013
The better Hadoop gets, the less of a Big Data tool it becomes.
November 5, 2013
Simply writing the phrase Big Data strategy indicates that you either don’t understand Big Data, or even worse, you don’t understand the word strategy.
September 3, 2013
Learn all about OpenTSDB package installation and building client programs using HTTP APIs for loading and extracting time series data.
June 4, 2013
With Big Data analytics, bigger doesn't mean better. Apply liberal doses of common sense, and take any result with a mine full of salt.