Звезды и С - Главная Звезды и С - Citrix Звезды и С - Microsoft Звезды и С - О нас

Спец. предложения|Обучение|Вебинары|Сертификация|Тестирование|О нас|Работа с сайтом|Новости|Поиск
Обучение

Специальные предложения
Software Assurance - Бесплатные курсы обучения по ваучерам
CITRIX
MICROSOFT
Авторские курсы Microsoft
Microsoft Windows Server 2012 R2 / 2016
Microsoft Windows 10 / 8.1
Облачные технологии: Microsoft Windows Azure, Private Cloud, Office 365
Microsoft Exchange Server 2013 / 2016
Microsoft System Center
Microsoft Lync Server 2013 / Skype for business 2015
Microsoft SQL Server 2014 / 2016
Microsoft SharePoint 2013 / 2016
Microsoft Visual Studio 2013 / 2016
Microsoft Forefront
Microsoft BizTalk Server
Microsoft On-Demand
Microsoft Dynamics
Обучение корпоративных пользователей
Более ранние версии программных продуктов Microsoft
Расписание курсов Microsoft в графическом формате
VMware
Cisco
ITIL
Linux
Вечернее обучение
Условия обучения

Курс 20775: Performing Data Engineering on Microsoft HD Insight

Цена для физических лиц, р.: 29900
Цена для юридических лиц, р.: 30900
Цена вебинара для физических лиц, р.: 28900
Цена вебинара для юридических лиц, р.: 28900

Продолжительность курса (дней): 5

Даты (день):

Даты (вечер):

Курс готовит к тестам:

Цель:

Необходимая подготовка:

Предварительный тест:

Результат: After completing this course, students will be able to:

  • Explain how Microsoft R
  • Transform and clean big data sets
  • План курса:

    20775A

    Module 1: Getting Started with HDInsight

    • Big Data
    • Hadoop
    • MapReduce
    • HDInsight
    • Lab : Querying Big Data
      • Query data with Hive
      • Visualize data with Excel

    Module 2: Deploying HDInsight Clusters

    • HDInsight cluster types
    • Managing HDInsight Clusters
    • Managing HDInsight Clusters with PowerShell
    • Lab : Managing HDInsight clusters with the Azure Portal
      • Create an HDInsight Hadoop Cluster
      • Customise HDInsight using a script action
      • Customize HDInsight using Bootstrap
      • Delete an HDInsight cluster

    Module 3: Authorizing Users to Access Resources

    • Non-domain Joined clusters
    • Configuring domain-joined HDInsight clusters
    • Manage domain-joined HDInsight clusters
    • Lab : Authorizing Users to Access Resources
      • Configure a domain-joined HDInsight cluster
      • Configure Hive policies

    Module 4: Loading data into HDInsight

    • HDInsight Storage
    • Data loading tools
    • Performance and reliability
    • Lab : Loading Data into HDInsight
      • Loading data using Sqoop
      • Loading data using AZcopy
      • Loading data using ADLcopy
      • Use HDInsight to compress data

    Module 5: Troubleshooting HDInsight

    • Analyze HDInsight logs
    • YARN logs
    • Heap dumps
    • Operations management suite
    • Lab : Troubleshooting HDInsight
      • Analyze HDInsight logs
      • Analyze YARN logs
      • Monitor resources with Operations Management Suite

    Module 6: Implementing Batch Solutions

    • Apache Hive storage
    • Querying with Hive and Pig
    • Operationalize HDInsight
    • Lab : Backing Up SQL Server Databases
      • Load data into a hive table
      • Query data with Hive and Pig

    Module 7: Design Batch ETL solutions for big data with Spark

    • What is Spark?
    • ETL with Spark
    • Spark performance
    • Lab : Design Batch ETL solutions for big data with Spark.
      • Create a HDInsight Cluster with access to Data Lake Store
      • Use HDInsight Spark cluster to analyze data in Data Lake Store
      • Analyzing website logs using a custom library with Apache Spark cluster on HDInsight
      • Managing resources for Apache Spark cluster on Azure HDInsight

    Module 8: Analyze Data with Spark SQL

    • Implement interactive queries
    • Perform exploratory data analysis
    • Lab : Analyze data with Spark SQL
      • Implement interactive queries
      • Perform exploratory data analysis

    Module 9: Analyze Data with Hive and Phoenix

    • Implement interactive queries for big data with interactive hive.
    • Perform exploratory data analysis by using Hive
    • Perform interactive processing by using Apache Phoenix
    • Lab : Analyze data with Hive and Phoenix
      • Implement interactive queries for big data with interactive Hive
      • Perform exploratory data analysis by using Hive
      • Perform interactive processing by using Apache Phoenix

    Module 10: Stream Analytics

    • Stream analytics
    • Process streaming data from stream analytics
    • Managing stream analytics jobs
    • Lab : Implement Stream Analytics
      • Process streaming data with stream analytics
      • Managing stream analytics jobs

    Module 11: Spark Streaming using the DStream API

    • Dstream
    • Create Spark structured streaming applications
    • Persistence and visualization
    • Lab : Spark streaming applications using DStream API
      • Creating Spark streaming applications using the DStream API
      • Creating Spark structured streaming applications

    Module 12: Develop big data real-time processing solutions with Apache Storm

    • Persist long term data
    • Stream data with Storm
    • Create Storm topologies
    • Configure Apache Storm
    • Lab : Developing big data real-time processing solutions with Apache Storm
      • Stream data with Storm
      • Create Storm Topologies

    Module 13: Analyze Data with Spark SQL

    • Implement interactive queries
    • Perform exploratory data analysis
    • Lab : Analyze data with Spark SQL
      • Implement interactive queries
      • Perform exploratory data analysis


      Спец. предложения|Обучение|Вебинары|Сертификация|Тестирование|О нас|Работа с сайтом|Новости|Поиск

       Тел: +74953633686 email: info@stars-s.ru

       125040, Москва, Ленинградский проспект, д. 5, стр. 2, под. 5, офис "Звезды и С"

      © Учебный центр "Звезды и С", 1991-2017