Best Hadoop Books
Hadoop is an open source distributed processing framework that manages data processing and storage for big data applications running on clustered systems. It is at the center of a growing ecosystem of big data technologies that are primarily used to support advanced analytics initiatives, including predictive analytics, data mining, and machine learning applications. Hadoop can handle various forms of structured and unstructured data, giving users more flexibility to collect, process, and analyze data than relational databases and data warehouses.
If you are looking for Hadoop books to advance your knowledge, here is the best list in various formats available for free:
- Cloudera Impala – John Russel
- Programming Pig – Alan Gates
- Data-Intensive Text Processing with MapReduce (Jimmy Lin and Chris Dyer) (PDF)
- Hadoop Explained – Aravind Shenoy, Packt.
- Hadoop Illuminated – Mark Kerzner & Sujee Maniyam
Comments
Post a Comment