Mastering Apache Storm by Ankit Jain

Author: Ankit Jain
Category: Engineering & IT
ISBN: 9781787120402
File Size: 3.83 MB
Format: EPUB (e-book)
DRM: Applied (Requires eSentral Reader App)

Synopsis

Key Features

Exploit the various real-time processing functionalities offered by Apache Storm, such as parallelism, data partitioning, and more
Integrate Storm with other Big Data technologies like Hadoop, HBase, and Apache Kafka
An easy-to-understand guide to effortlessly create distributed applications with Storm

Book Description

Apache Storm is a real-time Big Data processing framework that processes large amounts of data reliably, guaranteeing that every message will be processed. Storm lets you scale your processing as your data grows, making it an excellent platform for solving your big data problems. This extensive guide will take you from the basics to the advanced topics of Storm.

The book begins with a detailed introduction to real-time processing and where Storm fits in to solve these problems. You'll get an understanding of deploying Storm on clusters by writing a basic Storm Hello World example. Next, we'll introduce you to Trident, and you'll get a clear understanding of how you can develop and deploy a Trident topology. We cover topics such as monitoring, Storm parallelism, the scheduler, and log processing in a very easy-to-understand manner. You will also learn how to integrate Storm with other well-known Big Data technologies such as HBase, Redis, Kafka, and Hadoop to realize the full potential of Storm.

With real-world examples and clear explanations, this book will give you a thorough mastery of Apache Storm. You will be able to use this knowledge to develop efficient, distributed real-time applications that cater to your business needs.

What you will learn

Understand the core concepts of Apache Storm and real-time processing
Follow the steps to deploy a multi-node Storm cluster
Create Trident topologies to support various message-processing semantics
Share your cluster effectively using Storm scheduling
Integrate Apache Storm with other Big Data technologies such as Hadoop, HBase, Kafka, and more
Monitor the health of your Storm cluster

About the Author

Ankit Jain holds a bachelor's degree in computer science and engineering. He has 6 years of experience in designing and architecting solutions for the big data domain and has been involved with several complex engagements. His technical strengths include Hadoop, Storm, S4, HBase, Hive, Sqoop, Flume, Elasticsearch, machine learning, Kafka, Spring, Java, and J2EE.

He also shares his thoughts on his personal blog. You can follow him on Twitter at @mynameisanky. He spends most of his time reading books and playing with different technologies. When not at work, he spends time with his family and friends watching movies and playing games.

Table of Contents

Real-Time Processing and Storm Introduction
Deploying Storm in Cluster
Storm Parallelism and Data Partitioning
Trident Introduction
Trident Topology and Uses
Storm Scheduler
Monitoring of Storm Cluster
Integration of Storm and Kafka
Storm and Hadoop Integration
Storm Integration with Redis, Elasticsearch and HBase
Apache Log Processing
Twitter Tweets Collection and Machine Learning