Об этом курсе
4.4
Оценки: 178
Рецензии: 44
Welcome to the Capstone Project for Big Data! In this culminating project, you will build a big data ecosystem using tools and methods form the earlier courses in this specialization. You will analyze a data set simulating big data generated from a large number of users who are playing our imaginary game "Catch the Pink Flamingo". During the five week Capstone Project, you will walk through the typical big data science steps for acquiring, exploring, preparing, analyzing, and reporting. In the first two weeks, we will introduce you to the data set and guide you through some exploratory analysis using tools such as Splunk and Open Office. Then we will move into more challenging big data problems requiring the more advanced tools you have learned including KNIME, Spark's MLLib and Gephi. Finally, during the fifth and final week, we will show you how to bring it all together to create engaging and compelling reports and slide presentations. As a result of our collaboration with Splunk, a software company focus on analyzing machine-generated big data, learners with the top projects will be eligible to present to Splunk and meet Splunk recruiters and engineering leadership....
Stacks
Globe

Только онлайн-курсы

Начните сейчас и учитесь по собственному графику.
Calendar

Гибкие сроки

Назначьте сроки сдачи в соответствии со своим графиком.
Clock

Approx. 19 hours to complete

Предполагаемая нагрузка: 4 hours/week...
Comment Dots

English

Субтитры: English...

Приобретаемые навыки

Big DataNeo4jKnimeSplunk
Stacks
Globe

Только онлайн-курсы

Начните сейчас и учитесь по собственному графику.
Calendar

Гибкие сроки

Назначьте сроки сдачи в соответствии со своим графиком.
Clock

Approx. 19 hours to complete

Предполагаемая нагрузка: 4 hours/week...
Comment Dots

English

Субтитры: English...

Программа курса: что вы изучите

Week
1
Clock
1 ч. на завершение

Simulating Big Data for an Online Game

This week we provide an overview of the Eglence, Inc. Pink Flamingo game, including various aspects of the data which the company has access to about the game and users and what we might be interested in finding out....
Reading
4 видео (всего 18 мин.), 4 материалов для самостоятельного изучения
Video4 видео
Welcome from Splunk: Rob Reed World Education Evangelist3мин
A Summary of Catch the Pink Flamingo7мин
A Conceptual Schema for Catch the Pink Flamingo4мин
Reading4 материала для самостоятельного изучения
Planning, Preparation, and Review10мин
A Game by Eglence Inc. : Catch The Pink Flamingo10мин
Overview of the Catch the Pink Flamingo Data Model10мин
Overview of Final Project Design5мин
Clock
4 ч. на завершение

Acquiring, Exploring, and Preparing the Data

Next, we begin working with the simulated game data by exploring and preparing the data for ingestion into big data analytics applications....
Reading
6 материалов для самостоятельного изучения, 2 тестов
Reading6 материала для самостоятельного изучения
Downloading the Game Data and Associated Scripts10мин
Understanding the CSV Files Generated by the Scripts20мин
Optional Review of Splunkмин
“Catch the Pink Flamingo” Data Exploration with Splunk45мин
Aggregate Calculations Using Splunk45мин
Filtering the Data With Splunk20мин
Quiz1 практическое упражнение
Data Exploration With Splunk30мин
Week
2
Clock
5 ч. на завершение

Data Classification with KNIME

This week we do some data classification using KNIME. ...
Reading
4 материалов для самостоятельного изучения, 1 тест
Reading4 материала для самостоятельного изучения
Review: Classification Using Decision Tree in KNIME10мин
Review: Interpreting a Decision Tree in KNIME10мин
Workflow Overview for Building a Decision Tree in KNIME20мин
Description of combined_data.csv5мин
Week
3
Clock
5 ч. на завершение

Clustering with Spark

This week we do some clustering with Spark. ...
Reading
2 материалов для самостоятельного изучения, 1 тест
Reading2 материала для самостоятельного изучения
Informing business strategies based on client base5мин
Practice with PySpark MLlib Clustering30мин
Week
4
Clock
4 ч. на завершение

Graph Analytics of Simulated Chat Data With Neo4j

This week we apply what we learned from the 'Graph Analytics With Big Data' course to simulated chat data from Catch the Pink Flamingos using Neo4j. We analyze player chat behavior to find ways of improving the game. ...
Reading
2 материалов для самостоятельного изучения, 1 тест
Reading2 материала для самостоятельного изучения
Understanding the Simulated Chat Data Generated by the Scripts10мин
Graph Analytics of Catch the Pink Flamingo Chat Data Using Neo4jмин
4.4
Direction Signs

20%

начал новую карьеру, пройдя эти курсы
Briefcase

83%

получил значимые преимущества в карьере благодаря этому курсу
Money

12%

стал больше зарабатывать или получил повышение

Лучшие рецензии

автор: DMApr 14th 2018

What a challenge, I came into this course as a London Black Cab Taxi Driver, I thought the knowledge was hard but this capstone was a challenge more intense than the Knowledge of London!!!

автор: RAMay 16th 2018

This has been excellent Learning experience.Instructor and fellow members shared their valuable information during the course of the Learning and Capstone Project phase.

Преподавателя

Ilkay Altintas

Chief Data Science Officer
San Diego Supercomputer Center

Amarnath Gupta

Director, Advanced Query Processing Lab
San Diego Supercomputer Center (SDSC)

О University of California San Diego

UC San Diego is an academic powerhouse and economic engine, recognized as one of the top 10 public universities by U.S. News and World Report. Innovation is central to who we are and what we do. Here, students learn that knowledge isn't just acquired in the classroom—life is their laboratory....

О специализации ''Big Data'

Drive better business decisions with an overview of how big data is organized, analyzed, and interpreted. Apply your insights to real-world problems and questions. ********* Do you need to understand big data and how it will impact your business? This Specialization is for you. You will gain an understanding of what insights big data can provide through hands-on experience with the tools and systems used by big data scientists and engineers. Previous programming experience is not required! You will be guided through the basics of using Hadoop with MapReduce, Spark, Pig and Hive. By following along with provided code, you will experience how one can perform predictive modeling and leverage graph analytics to model problems. This specialization will prepare you to ask the right questions about data, communicate effectively with data scientists, and do basic exploration of large, complex datasets. In the final Capstone Project, developed in partnership with data software company Splunk, you’ll apply the skills you learned to do basic analyses of big data....
Big Data

Часто задаваемые вопросы

  • Once you enroll for a Certificate, you’ll have access to all videos, quizzes, and programming assignments (if applicable). Peer review assignments can only be submitted and reviewed once your session has begun. If you choose to explore the course without purchasing, you may not be able to access certain assignments.

  • When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.

Остались вопросы? Посетите Центр поддержки учащихся.