Advanced Analytical Queries: tez.apache.org
Security of Sensitive Data: ranger.apache.org
Hadoop -> Hadoop can handle big batches of distributed information butmost often there’s a need for a real time processing.
Apache Storm / Apache Spark -> Storm and Spark are two other open source frameworks that handle such real time data generated at a fast rate. Both Storm and Spark can integrate data with any database or data storage technology.
NoSQL-> NoSQL Data technologies are based on non-relational concepts and provide data storage options typically on computing clouds beyond the traditional relational databases centered rate houses. The main advantage of using NoSQL solutions is their ability to organize the data for scalable access to fit the problem and objectives pertaining to how the data will be used.
– Neo4j is an example of a graph database.
– Cassandra is an example of a key value database.