Practical Hive: A Guide to Hadoop's Data Warehouse System

AUTHOR Vermeulen, Andreas Franois; Vermeulen, Andreas Francois; Gupta, Ankur et al.
PUBLISHER Apress (08/28/2016)
PRODUCT TYPE Paperback

Description

Dive into the world of SQL on Hadoop and get the most out of your Hive data warehouses. This book is your go-to resource for using Hive: authors Scott Shaw, Ankur Gupta, David Kjerrumgaard, and Andreas Francois Vermeulen take you through learning HiveQL, the SQL-like language specific to Hive, to analyze, export, and massage the data stored across your Hadoop environment. From deploying Hive on your hardware or virtual machine and setting up its initial configuration to learning how Hive interacts with Hadoop, MapReduce, Tez, and other big data technologies, Practical Hive gives you a detailed treatment of the software.

In addition, this book discusses the value of open source software, Hive performance tuning, and how to leverage semi-structured and unstructured data.

What You Will Learn

  • Install and configure Hive for new and existing datasets
  • Perform DDL operations
  • Execute efficient DML operations
  • Use tables, partitions, buckets, and user-defined functions
  • Discover performance tuning tips and Hive best practices

Who This Book Is For

Developers, companies, and professionals who deal with large amounts of data and need software that can efficiently manage large volumes of input. Readers are assumed to be able to work with SQL.


Product Format
Product Details
ISBN-13: 9781484202722
ISBN-10: 1484202724
Binding: Paperback or Softback (Trade Paperback (US))
Content Language: English
More Product Details
Page Count: 265
Carton Quantity: 13
Product Dimensions: 7.00 x 0.61 x 10.00 inches
Weight: 1.12 pound(s)
Feature Codes: Index, Illustrated
Country of Origin: NL
Subject Information
BISAC Categories
Computers | Data Science - Data Modeling & Design
Computers | Computer Science
Computers | System Administration - Storage & Retrieval
Dewey Decimal: 005.74
Library of Congress Control Number: 2016951940
List Price $59.99
Your Price $59.39
Paperback