Login       My Wishlist
  My Cart
$0.00 / 0 items
 
EMS Linux
Utilizing the Best Tools With Linux
 
International Access
Global Shipping Options Available
Home About Us News Our Blog Our Catalog My Cart My Account Track Shippment Contact Us
  Our Catalog   Parallel Programming

Hadoop: The Definitive Guide: Storage and Analysis at Internet Scale


Blowout Sale! Save 63% on the Hadoop: The Definitive Guide: Storage and Analysis at Internet Scale by O'Reilly Media at EMS Linux. MPN: black & white illustrations. Hurry! Limited time offer. Offer valid only while supplies last. Get ready to unlock the power of your data. With the fourth edition of this comprehensive guide, you’ll learn how to build and maintain reliable,


Product Description

Get ready to unlock the power of your data. With the fourth edition of this comprehensive guide, you’ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.

Using Hadoop 2 exclusively, author Tom White presents new chapters on YARN and several Hadoop-related projects such as Parquet, Flume, Crunch, and Spark. You’ll learn about recent changes to Hadoop, and explore new case studies on Hadoop’s role in healthcare systems and genomics data processing.

  • Learn fundamental components such as MapReduce, HDFS, and YARN
  • Explore MapReduce in depth, including steps for developing applications with it
  • Set up and maintain a Hadoop cluster running HDFS and MapReduce on YARN
  • Learn two data formats: Avro for data serialization and Parquet for nested data
  • Use data ingestion tools such as Flume (for streaming data) and Sqoop (for bulk data transfer)
  • Understand how high-level data processing tools like Pig, Hive, Crunch, and Spark work with Hadoop
  • Learn the HBase distributed database and the ZooKeeper distributed configuration service

Additional Information

Manufacturer:O'Reilly Media
Brand:O'Reilly Media
Part Number:black & white illustrations
Publisher:O'Reilly Media
Studio:O'Reilly Media
MPN:black & white illustrations
EAN:9781491901632
Item Weight:2.78 pounds
Item Size:1.6 x 9.2 x 9.2 inches
Package Weight:2.35 pounds
Package Size:7 x 1.5 x 1.5 inches

Hadoop: The Definitive Guide: Storage and Analysis at Internet Scale by O'Reilly Media

Buy Now:
Hadoop: The Definitive Guide: Storage and Analysis at Internet Scale

Brand: O'Reilly Media
Condition: New
Lead Time: 1 - 2 Business Days
Availability: In Stock
$49.99
$18.89
You Save: 62%


Quantity:  

 


View More In Parallel Programming.

 


Have questions about this item, or would like to inquire about a custom or bulk order?


If you have any questions about this product by O'Reilly Media, contact us by completing and submitting the form below. If you are looking for a specif part number, please include it with your message.

First Name:
Last Last:
Email Address:
Your Message:

Related Best Sellers


ean: 9780387098449, isbn: 0387098445,
Containing over 300 entries in an A-Z format, the Encyclopedia of Parallel Computing provides easy, intuitive access to relevant information for professionals and researchers seeking access to any aspect within the broad field of parallel computing....

sku: 9780444506733, ean: 9780444506733, isbn: 044450673X,
Parallel CFD 2000, the Twelfth in an International series of meetings featuring computational fluid dynamics research on parallel computers, was held May 22-25, 2000 in Trondheim, Norway. Following the trend of the past conferences, areas such as num...

ean: 9780444828309, isbn: 0444828303,
Process Algebra is a formal description technique for complex computer systems, especially those involving communicating, concurrently executing components. It is a subject that concurrently touches many topic areas of computer science and discrete m...

ean: 9780321312839, isbn: 032131283X,
The latest edition of a classic text on concurrency and distributed programming – from a winner of the ACM/SIGCSE Award for Outstanding Contribution to Computer Science Education....

ean: 9781682854792, isbn: 1682854795,
As an important part of computer science and technology, parallel computing refers to the art of performing multiple different computations and calculations simultaneously. Parallel computing has many sub-fields namely task parallelism, bit-level par...

sku: R11457, mpn: Illustrations, ean: 9781906124144, isbn: 1906124140,
Software agents situated in the same environment typically need to interact with one another in order to fulfill their objectives or improve their performance. Coalition formation is a fundamental form of interaction that has proven to be useful in a...

sku: 9783540441397, mpn: biography, ean: 9783540441397, isbn: 3540441395,
We are proud to introduce the proceedings of the Seventh International C- ference on Parallel Problem Solving from Nature, PPSN VII, held in Granada, Spain, on 7–11 September 2002. PPSN VII was organized back-to-back with the Foundations of Genetic...

mpn: Illustrations, ean: 9783540442967, isbn: 3540442960,

ean: 9788122423877, isbn: 8122423876,

ean: 9780471358312, isbn: 0471358312,
An all-inclusive survey of the fundamentals of parallel and distributed computing. The use of parallel and distributed computing has increased dramatically over the past few years, giving rise to a variety of projects, implementations, and buzzwords ...



Privacy Policy / Terms of Service
© 2018 - emslinux.com. All Rights Reserved.