Hadoop: The Definitive Guide
Save 4% on Hadoop: The Definitive Guide by Yahoo Press at EMS Linux. MPN: 9781449311520.
Ready to unlock the power of your data? With this comprehensive guide, you’ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.
You’ll find illuminating case studies that demonstrate how Hadoop is used to solve specific problems. This third edition covers recent changes to Hadoop, including material on the new MapReduce API, as well as MapReduce 2 and its more flexible execution model (YARN).
- Store large datasets with the Hadoop Distributed File System (HDFS)
- Run distributed computations with MapReduce
- Use Hadoop’s data and I/O building blocks for compression, data integrity, serialization (including Avro), and persistence
- Discover common pitfalls and advanced features for writing real-world MapReduce programs
- Design, build, and administer a dedicated Hadoop cluster—or run Hadoop in the cloud
- Load data from relational databases into HDFS, using Sqoop
- Perform large-scale data processing with the Pig query language
- Analyze datasets with Hive, Hadoop’s data warehousing system
- Take advantage of HBase for structured and semi-structured data, and ZooKeeper for building distributed systems
| Item Weight | 0 pounds |
| Item Size | 1.5 x 9.19 x 9.19 inches |
| Package Weight | 2.29 pounds |
| Package Size | 6.9 x 1.7 x 1.7 inches |
Related Best Sellers
By World Scientific Pub Co Inc
mpn: illustrations, ean: 9789812775887, isbn: 9812775889,
Geosciences, particularly numerical weather prediction, are demanding the highest levels of computer power available. The European Centre for Medium-Range Weather Forecasts, with its experience in using supercomputers in this field, organizes a works...
ean: 9780387987170, isbn: 0387987177,
In recent years, model checking has become an essential technique for the formal verification of systems. With a clarity of presentation and its many illuminating examples, this book makes this technical material easy to grasp. It is perfectly suited...
ean: 9780471678069, isbn: 0471678066,
Solving complex optimization problems with parallel metaheuristics. Parallel Metaheuristics brings together an international group of experts in parallelism and metaheuristics to provide a much-needed synthesis of these two fields. Readers discover h...
mpn: black & white illustrations, ean: 9781466674615, isbn: 146667461X,
Rapidly generating and processing large amounts of data, supercomputers are currently at the leading edge of computing technologies. Supercomputers are employed in many different fields, establishing them as an integral part of the computational scie...
By Chapman and Hall/CRC
ean: 9781584886235, isbn: 1584886234,
The ability of parallel computing to process large data sets and handle time-consuming operations has resulted in unprecedented advances in biological and scientific computing, modeling, and simulations. Exploring these recent developments, the Handb...
By Ezhilchelvan Paul Romanovsky Alexander
ean: 9781441952783, isbn: 1441952780,
Concurrency in Dependable Computing focuses on concurrency-related issues in the area of dependable computing. Failures of system components, be they hardware units or software modules, can be viewed as undesirable events occurring concurrently with a set...
By ACM Books
ean: 9781970001914, isbn: 1970001917,
Parallelism is the key to achieving high performance in computing. However, writing efficient and scalable parallel programs is notoriously difficult, and often requires significant expertise. To address this challenge, it is crucial to provide progr...
mpn: colour illustrations, ean: 9780128021224, isbn: 0128021225,
The Smart Grid security ecosystem is complex and multi-disciplinary, and relatively under-researched compared to the traditional information and network security disciplines. While the Smart Grid has provided increased efficiencies in monitoring powe...
By Brand: Artech House
ean: 9781596930858, isbn: 1596930853,
Commonly used methods in computational electromagnetics include the Finite Element Method (FEM), the Finite Difference Time Domain (FDTD) method and the Method of Moments (MoM), and they all find applications to the solution of a wide variety of elect...
By Brand: Oxford University Press, USA
mpn: figures, ean: 9780198501787, isbn: 0198501781,
Domain decomposition methods are designed to allow the effective numerical solution of partial differential equations on parallel computer architectures. They comprise a relatively new field of study but have already found important applications in m...