Back to Search

Practical Hadoop Migration: How to Integrate Your RDBMS with the Hadoop Ecosystem and Re-Architect Relational Applications to NoSQL

AUTHOR Lakhe, Bhushan
PUBLISHER Apress (08/11/2016)
PRODUCT TYPE Paperback (Paperback)

Description

Re-architect relational applications to NoSQL, integrate relational database management systems with the Hadoop ecosystem, and transform and migrate relational data to and from Hadoop components. This book covers the best-practice design approaches to re-architecting your relational applications and transforming your relational data to optimize concurrency, security, denormalization, and performance.

Winner of IBM's 2012 Gerstner Award for his implementation of big data and data warehouse initiatives and author of Practical Hadoop Security, author Bhushan Lakhe walks you through the entire transition process. First, he lays out the criteria for deciding what blend of re-architecting, migration, and integration between RDBMS and HDFS best meets your transition objectives. Then he demonstrates how to design your transition model.

Lakhe proceeds to cover the selection criteria for ETL tools, the implementation steps for migration with SQOOP- and Flume-based data transfers, and transition optimization techniques for tuning partitions, scheduling aggregations, and redesigning ETL. Finally, he assesses the pros and cons of data lakes and Lambda architecture as integrative solutions and illustrates their implementation with real-world case studies.

Hadoop/NoSQL solutions do not offer by default certain relational technology features such as role-based access control, locking for concurrent updates, and various tools for measuring and enhancing performance. Practical Hadoop Migration shows how to use open-source tools to emulate such relational functionalities in Hadoop ecosystem components.


What You'll Learn

  • Decide whether you should migrate your relational applications to big data technologies or integrate them
  • Transition your relational applications to Hadoop/NoSQL platforms in terms of logical design andphysical implementation
  • Discover RDBMS-to-HDFS integration, data transformation, and optimization techniques
  • Consider when to use Lambda architecture and data lake solutions
  • Select and implement Hadoop-based components and applications to speed transition, optimize integrated performance, and emulate relational functionalities
Who This Book Is For
Database developers, database administrators, enterprise architects, Hadoop/NoSQL developers, and IT leaders. Its secondary readership is project and program managers and advanced students of database and management information systems.
Show More
Product Format
Product Details
ISBN-13: 9781484212882
ISBN-10: 1484212886
Binding: Paperback or Softback (Trade Paperback (Us))
Content Language: English
More Product Details
Page Count: 305
Carton Quantity: 24
Product Dimensions: 6.14 x 0.69 x 9.21 inches
Weight: 1.03 pound(s)
Feature Codes: Illustrated
Country of Origin: NL
Subject Information
BISAC Categories
Computers | Computer Science
Computers | Database Administration & Management
Computers | Data Science - Data Modeling & Design
Dewey Decimal: 004
Descriptions, Reviews, Etc.
publisher marketing

Re-architect relational applications to NoSQL, integrate relational database management systems with the Hadoop ecosystem, and transform and migrate relational data to and from Hadoop components. This book covers the best-practice design approaches to re-architecting your relational applications and transforming your relational data to optimize concurrency, security, denormalization, and performance.

Winner of IBM's 2012 Gerstner Award for his implementation of big data and data warehouse initiatives and author of Practical Hadoop Security, author Bhushan Lakhe walks you through the entire transition process. First, he lays out the criteria for deciding what blend of re-architecting, migration, and integration between RDBMS and HDFS best meets your transition objectives. Then he demonstrates how to design your transition model.

Lakhe proceeds to cover the selection criteria for ETL tools, the implementation steps for migration with SQOOP- and Flume-based data transfers, and transition optimization techniques for tuning partitions, scheduling aggregations, and redesigning ETL. Finally, he assesses the pros and cons of data lakes and Lambda architecture as integrative solutions and illustrates their implementation with real-world case studies.

Hadoop/NoSQL solutions do not offer by default certain relational technology features such as role-based access control, locking for concurrent updates, and various tools for measuring and enhancing performance. Practical Hadoop Migration shows how to use open-source tools to emulate such relational functionalities in Hadoop ecosystem components.


What You'll Learn

  • Decide whether you should migrate your relational applications to big data technologies or integrate them
  • Transition your relational applications to Hadoop/NoSQL platforms in terms of logical design andphysical implementation
  • Discover RDBMS-to-HDFS integration, data transformation, and optimization techniques
  • Consider when to use Lambda architecture and data lake solutions
  • Select and implement Hadoop-based components and applications to speed transition, optimize integrated performance, and emulate relational functionalities
Who This Book Is For
Database developers, database administrators, enterprise architects, Hadoop/NoSQL developers, and IT leaders. Its secondary readership is project and program managers and advanced students of database and management information systems.
Show More

Author: Lakhe, Bhushan
Bhushan Lakhe is a Database Professional, Technology Evangelist and avid blogger residing in windy city of Chicago. After graduating in 1988 from one of India's leading universities (Birla Institute of Technology & Science, Pilani), he started his career with India's biggest software house Tata Consultancy Services. Shortly sent to UK on database assignment, he joined ICL, a British computer company and worked with prestigious British clients on various database assignments. Moving to Chicago in 1995, he worked as a Consultant with Fortune 50 companies in Chicago area including Leo Burnett, Blue Cross and Blue Shield of Illinois, CNA Insurance, ABN AMRO Bank, Abbott Laboratories, Motorola, JPMorgan Chase and British Petroleum, often in a critical and pioneering role. After a 7 year stint at IBM (recipient of prestigious Gerstner award for 2012) executing successful Big Data (as well as Data Warehouse) projects for their clients; Bhushan is currently working with Unisys Corporation, helping their clients with Data Warehouse & Big Data implementations. Bhushan is active in the Chicago Hadoop community and regularly answers queries on various Hadoop user forums.
Show More
List Price $44.99
Your Price  $44.54
Paperback