Learn how to use Hadoop technologies like HBase, Storm, and Spark in Microsoft Azure HDInsight to create real-time analytical solutions.
In this four-week course, you’ll learn how to implement low-latency and streaming Big Data solutions using Hadoop technologies like HBase, Storm, and Spark on Microsoft Azure HDInsight.
Note: To complete the hands-on elements in this course, you will require an Azure subscription and a Windows, Linux, or Mac OS X client computer. You can sign up for a free Azure trial subscription (a valid credit card is required for verification, but you will not be charged for Azure services). Note that the free trial is not available in all regions. It is possible to complete the course and earn a certificate without completing the hands-on practices.
This course is the second in a series that explores big data and advanced analytics techniques with HDInsight. It builds on the batch processing techniques covered in DAT202.1x: Processing Big Data with Hadoop in Azure HDInsight.
Prerequisites
- Familiarity with Hadoop clusters and Hive in HDInsight
- Familiarity with database concepts and basic SQL query syntax
- Familiarity with basic programming constructs (for example, variables, loops, conditional logic). Experience with Java or C# is useful but not essential
- Willingness to learn actively and to persevere when troubleshooting technical problems
What you will learn
In this course, you’ll learn how to use:
- HBase to implement low-latency NoSQL data stores.
- Storm to implement real-time streaming analytics solutions.
- Spark for high-performance interactive data analysis.
Syllabus
- Module 1: Using HBase for NoSQL Data
- Module 2: Using Storm for Streaming Data
- Module 3: Using Spark for Interactive Analysis
- Module 4: Final Exam