TY  - GEN
AU  - White, Tom
TI  - Hadoop, the definitive guide: storage and analysis at internet scale
SN  - 9789352130672
U1  - 005.74
PY  - 2015///
CY  - Mumbai
PB  - O'Reilly
KW  - File organization (Computer science)
KW  - Apache Hadoop
KW  - Cloud computing
KW  - Electronic data processing--Distributed processing
KW  - Electronic data processing
N1  - Includes index
N2  - Offers information on how to build and maintain reliable, scalable, distributed systems with Apache Hadoop covering such topics as MapReduce, HDFS, YARN, Avro for data serialization, Parquet for nested data, and data ingestion tools Flume and Sqoop
ER  - 