Об этом курсе
5.0
Оценки: 2
Рецензии: 1
100% online

100% online

Начните сейчас и учитесь по собственному графику.
Гибкие сроки

Гибкие сроки

Назначьте сроки сдачи в соответствии со своим графиком.
Начальный уровень

Начальный уровень

Часов на завершение

Прибл. 17 часа на выполнение

Предполагаемая нагрузка: 8 hours/week...
Доступные языки

Английский

Субтитры: Английский...
100% online

100% online

Начните сейчас и учитесь по собственному графику.
Гибкие сроки

Гибкие сроки

Назначьте сроки сдачи в соответствии со своим графиком.
Начальный уровень

Начальный уровень

Часов на завершение

Прибл. 17 часа на выполнение

Предполагаемая нагрузка: 8 hours/week...
Доступные языки

Английский

Субтитры: Английский...

Программа курса: что вы изучите

Неделя
1
Часов на завершение
1 ч. на завершение

Big Data Rankings & Products

The first module “Big Data Rankings & Products” focuses on the relation and market shares of big data hardware, software, and professional services. This information provides an insight to how future industry, products, services, schools, and government organizations will be influenced by big data technology. To have a deeper view into the world’s top big data products line and service types, the lecture provides an overview on the major big data company, which include IBM, SAP, Oracle, HPE, Splunk, Dell, Teradata, Microsoft, Cisco, and AWS. In order to understand the power of big data technology, the difference of big data analysis compared to traditional data analysis is explained. This is followed by a lecture on the 4 V big challenges of big data technology, which deal with issues in the volume, variety, velocity, and veracity of the massive data. Based on this introduction information, big data technology used in adding global insights on investments, help locate new stores and factories, and run real-time recommendation systems by Wal-Mart, Amazon, and Citibank is introduced....
Reading
6 видео (всего 28 мин.), 2 тестов
Video6 видео
1.1 Big Data Market Analysis1мин
1.2 IBM / 1.3 SAP8мин
1.4 Oracle / 1.5 Splunk / 1.6 Accenture / 1.7 Dell / 1.8 Teradata6мин
1.9 Microsoft / 1.10 Cisco / 1.11 AWS3мин
1.12 Big Data Landscape1мин
Quiz2 практического упражнения
Ungraded Quiz8мин
Graded Quizмин
Неделя
2
Часов на завершение
1 ч. на завершение

Big Data & Hadoop

The second module “Big Data & Hadoop” focuses on the characteristics and operations of Hadoop, which is the original big data system that was used by Google. The lectures explain the functionality of MapReduce, HDFS (Hadoop Distributed FileSystem), and the processing of data blocks. These functions are executed on a cluster of nodes that are assigned the role of NameNode or DataNodes, where the data processing is conducted by the JobTracker and TaskTrackers, which are explained in the lectures. In addition, the characteristics of metadata types and the differences in the data analysis processes of Hadoop and SQL (Structured Query Language) are explained. Then the Hadoop Release Series is introduced which include the descriptions of Hadoop YARN (Yet Another Resource Negotiator), HDFS Federation, and HDFS HA (High Availability) big data technology....
Reading
8 видео (всего 68 мин.), 2 тестов
Video8 видео
2.3 Big Data's 4 Vs / 2.4 How is Big Data being Used?8мин
2.5 HADOOP11мин
2.6 MapReduce vs. RDBMS6мин
2.7 MapReduce9мин
2.8 Hadoop vs. SQL(RDBMS & RDSMS)12мин
2.9 HDFS Enhancements4мин
2.10 Hadoop vs. Hadoop YARN6мин
Quiz2 практического упражнения
Ungraded Quiz12мин
Graded Quizмин
Неделя
3
Часов на завершение
2 ч. на завершение

Spark

The third module “Spark” focuses on the operations and characteristics of Spark, which is currently the most popular big data technology in the world. The lecture first covers the differences in data analysis characteristics of Spark and Hadoop, then goes into the features of Spark big data processing based on the RDD (Resilient Distributed Datasets), Spark Core, Spark SQL, Spark Streaming, MLlib (Machine Learning Library), and GraphX core units. Details of the features of Spark DAG (Directed Acyclic Graph) stages and pipeline processes that are formed based on Spark transformations and actions are explained. Especially, the definition and advantages of lazy transformations and DAG operations are described along with the characteristics of Spark variables and serialization. In addition, the process of Spark cluster operations based on Mesos, Standalone, and YARN are introduced....
Reading
11 видео (всего 101 мин.), 2 тестов
Video11 видео
3.2 Spark Architecture / 3.3 Spark Family9мин
3.4 Spark vs. Hadoop11мин
3.5 Spark RDD6мин
3.6 Spark Transformations / 3.7 Spark Actions / 3.8 Spark DAG12мин
3.9 Spark Programming7мин
3.10 Spark Core / 3.11 Spark Variables & Serialization7мин
3.12 Spark Cluster Operations / 3.13 Spark Standalone / 3.14 Spark Mesos14мин
3.15 Spark YARN9мин
3.16 Spark SQL / 3.17 Spark GraphX5мин
3.18 Relational DB & Graph DB12мин
Quiz2 практического упражнения
Ungraded Quizмин
Graded Quizмин
Неделя
4
Часов на завершение
1 ч. на завершение

Spark ML & Streaming

The fourth module “Spark ML & Streaming” focuses on how Spark ML (Machine Learning) works and how Spark streaming operations are conducted. The Spark ML algorithms include featurization, pipelines, persistence, and utilities which operate on the RDDs (Resilient Distributed Datasets) to extract information form the massive datasets. The lectures explain the characteristics of the DataFrame-based API, which is the primary ML API in the spark.ml package. Spark ML basic statistics algorithms based on correlation and hypothesis testing (P-value) are first introduced followed by the Spark ML classification and regression algorithms based on linear models, naive Bayes, and decision tree techniques. Then the characteristics of Spark streaming, streaming input and output, as well as streaming receiver types (which include basic, custom, and advanced) are explained, followed by how the Spark Streaming process and DStream (Discretized Stream) enable big data streaming operations for real-time and near-real-time applications....
Reading
4 видео (всего 31 мин.), 2 тестов
Video4 видео
4.2 Spark ML Algorithms part 18мин
4.2 Spark ML Algorithms part 29мин
4.3 Spark Streaming10мин
Quiz2 практического упражнения
Ungraded Quizмин
Graded Quizмин

Преподаватель

Avatar

Jong-Moon Chung

Professor, School of Electrical & Electronic Engineering
Director, Communications & Networking Laboratory

О Yonsei University

Yonsei University was established in 1885 and is the oldest private university in Korea. Yonsei’s main campus is situated minutes away from the economic, political, and cultural centers of Seoul’s metropolitan downtown. Yonsei has 3,500 eminent faculty members who are conducting cutting-edge research across all academic disciplines. There are 18 graduate schools, 22 colleges and 133 subsidiary institutions hosting a selective pool of students from around the world. Yonsei is proud of its history and reputation as a leading institution of higher education and research in Asia....

О специализации ''Emerging Technologies: From Smartphones to IoT to Big Data'

This Specialization is intended for researchers and business experts seeking state-of-the-art knowledge in advanced science and technology. The 4 courses cover details on Big Data (Hadoop, Spark, Storm), Smartphones, Smart Watches, Android, iOS, CPU/GPU/SoC, Mobile Communications (1G to 5G), Sensors, IoT, Wi-Fi, Bluetooth, LP-WAN, Cloud Computing, AR (Augmented Reality), Skype, YouTube, H.264/MPEG-4 AVC, MPEG-DASH, CDN, and Video Streaming Services. The Specialization includes projects on Big Data using IBM SPSS Statistics, AR applications, Cloud Computing using AWS (Amazon Web Service) EC2 (Elastic Compute Cloud), and Smartphone applications to analyze mobile communication, Wi-Fi, and Bluetooth networks. The course contents are for expert level research, design, development, industrial strategic planning, business, administration, and management....
Emerging Technologies: From Smartphones to IoT to Big Data

Часто задаваемые вопросы

  • Зарегистрировавшись на сертификацию, вы получите доступ ко всем видео, тестам и заданиям по программированию (если они предусмотрены). Задания по взаимной оценке сокурсниками можно сдавать и проверять только после начала сессии. Если вы проходите курс без оплаты, некоторые задания могут быть недоступны.

  • Записавшись на курс, вы получите доступ ко всем курсам в специализации, а также возможность получить сертификат о его прохождении. После успешного прохождения курса на странице ваших достижений появится электронный сертификат. Оттуда его можно распечатать или прикрепить к профилю LinkedIn. Просто ознакомиться с содержанием курса можно бесплатно.

Остались вопросы? Посетите Центр поддержки учащихся.