About Hadoop Spark and the Cloud

The Hadoop ecosystem is composed of many different tools: Ambari, HBase, Hive, Sqoop, Pig, ZooKeeper, Oozie, Flume, etc. But one tool is better known than any other: Spark. When somebody speaks about Hadoop, 99% of the time they are talking about Spark. Spark is really the “heart” of the Hadoop
Data Vaulting

Data vaulting: from a bad idea to inefficient implementations

An efficient data management mechanism should have two main characteristics: operational efficiency (it must run faster and with fewer resources than the mechanisms it aims to replace) and structural clarity (it must be straightforward to access, understand, and query). As an IT data manager, you know you sometimes