Hadoop is an open source MapReduce platform designed to query and analyze data distributed across large clusters. Especially effective for big data systems, Hadoop powers mission-critical software at Apple, eBay, LinkedIn, Yahoo, and Facebook. It offers developers handy ways to store, manage, and analyze data.
Hadoop in Practice collects 85 battle-tested examples and presents them in a problem/solution format. It balances conceptual foundations with practical recipes for key problem areas like data ingress and egress, serialization, and LZO compression. You’ll explore each technique step by step, learning how to build a specific solution along with the thinking that went into it. As a bonus, the book’s examples create a well-structured and understandable codebase you can tweak to meet your own needs.
This book assumes the reader knows the basics of Hadoop.
What’s Inside
Conceptual overview of Hadoop and MapReduce
85 practical, tested techniques
Real problems, real solutions
How to integrate MapReduce and R
Table of Contents
Part 1: Background and Fundamentals
Chapter 1. Hadoop in a heartbeat
Part 2: Data Logistics
Chapter 2. Moving data in and out of Hadoop
Chapter 3. Data serialization—working with text and beyond
Part 3: Big Data Patterns
Chapter 4. Applying MapReduce patterns to big data
Chapter 5. Streamlining HDFS for big data
Chapter 6. Diagnosing and tuning performance problems
Part 4: Data Science
Chapter 7. Utilizing data structures and algorithms
Chapter 8. Integrating R and Hadoop for statistics and more
Chapter 9. Predictive analytics with Mahout
Part 5: Taming the Elephant
Chapter 10. Hacking with Hive
Chapter 11. Programming pipelines with Pig
Chapter 12. Crunch and other technologies
Chapter 13. Testing and debugging
Appendix A. Related technologies
Appendix B. Hadoop built-in ingress and egress tools
Appendix C. HDFS dissected
Appendix D. Optimized MapReduce join frameworks.
Hadoop in Practice collects 85 battle-tested examples and presents them in a problem/solution format. It balances conceptual foundations with practical recipes for key problem areas like data ingress and egress, serialization, and LZO compression. You’ll explore each technique step by step, learning how to build a specific solution along with the thinking that went into it. As a bonus, the book’s examples create a well-structured and understandable codebase you can tweak to meet your own needs.
This book assumes the reader knows the basics of Hadoop.
What’s Inside
Conceptual overview of Hadoop and MapReduce
85 practical, tested techniques
Real problems, real solutions
How to integrate MapReduce and R
Table of Contents
Part 1: Background and Fundamentals
Chapter 1. Hadoop in a heartbeat
Part 2: Data Logistics
Chapter 2. Moving data in and out of Hadoop
Chapter 3. Data serialization—working with text and beyond
Part 3: Big Data Patterns
Chapter 4. Applying MapReduce patterns to big data
Chapter 5. Streamlining HDFS for big data
Chapter 6. Diagnosing and tuning performance problems
Part 4: Data Science
Chapter 7. Utilizing data structures and algorithms
Chapter 8. Integrating R and Hadoop for statistics and more
Chapter 9. Predictive analytics with Mahout
Part 5: Taming the Elephant
Chapter 10. Hacking with Hive
Chapter 11. Programming pipelines with Pig
Chapter 12. Crunch and other technologies
Chapter 13. Testing and debugging
Appendix A. Related technologies
Appendix B. Hadoop built-in ingress and egress tools
Appendix C. HDFS dissected
Appendix D. Optimized MapReduce join frameworks.
or full course details please visit our website www.hadooponlinetraining.net
Duration for course is 30 days or 45 hours and special care will be taken. It is a one to one training with hands on experience.
* Resume preparation and Interview assistance will be provided.
For any further details please
contact India +91-9052666559
Usa : +1-678-693-3475.
visit www.hadooponlinetraining.net
please mail us all queries to info@magnifictraining.com
Mazenet Solution has been in the business of IT workforce development for over 19 years and has always believed in molding IT professionals to think and perform beyond expectations.
ReplyDeleteFor more details visit : Mazenet Corporate Training
Big Data Corporate Training or contact us at toll free: 1800212334646 or 7397723052.