This course will introduce the learner to the basics of the python programming environment, including fundamental python programming techniques such as lambdas, reading and manipulating csv files, and the numpy library. The course will introduce data manipulation and cleaning techniques using the popular python pandas data science library and introduce the abstraction of the Series and DataFrame as the central data structures for data analysis, along with tutorials on how to use functions such as groupby, merge, and pivot tables effectively. By the end of this course, students will be able to take tabular data, clean it, manipulate it, and run basic inferential statistical analyses.
Этот курс входит в специализацию ''Специализация Прикладная наука о данных с Python'
от партнера
Об этом курсе
Чему вы научитесь
Understand techniques such as lambdas and manipulating csv files
Describe common Python functionality and features used for data science
Query DataFrame structures for cleaning and processing
Explain distributions, sampling, and t-tests
Приобретаемые навыки
- Python Programming
- Numpy
- Pandas
- Data Cleansing
от партнера

Мичиганский университет
The mission of the University of Michigan is to serve the people of Michigan and the world through preeminence in creating, communicating, preserving and applying knowledge, art, and academic values, and in developing leaders and citizens who will challenge the present and enrich the future.
Программа курса: что вы изучите
Fundamentals of Data Manipulation with Python
In this week you'll get an introduction to the field of data science, review common Python functionality and features which data scientists use, and be introduced to the Coursera Jupyter Notebook for the lectures. All of the course information on grading, prerequisites, and expectations are on the course syllabus, and you can find more information about the Jupyter Notebooks on our Course Resources page.
Basic Data Processing with Pandas
In this week of the course you'll learn the fundamentals of one of the most important toolkits Python has for data cleaning and processing -- pandas. You'll learn how to read in data into DataFrame structures, how to query these structures, and the details about such structures are indexed.
More Data Processing with Pandas
In this week you'll deepen your understanding of the python pandas library by learning how to merge DataFrames, generate summary tables, group data into logical pieces, and manipulate dates. We'll also refresh your understanding of scales of data, and discuss issues with creating metrics for analysis. The week ends with a more significant programming assignment.
Answering Questions with Messy Data
In this week of the course you'll be introduced to a variety of statistical techniques such a distributions, sampling and t-tests. The week ends with two discussions of science and the rise of the fourth paradigm -- data driven discovery.
Рецензии
- 5 stars66,17 %
- 4 stars24,58 %
- 3 stars5,37 %
- 2 stars1,88 %
- 1 star1,97 %
Лучшие отзывы о курсе INTRODUCTION TO DATA SCIENCE IN PYTHON
Assignments are way tougher than what is taught in the class, but they are challenging and the help in discussion forums is speechless. Without that, completion of assignments will take too much time.
Assignments are tough compared to the course lecture material. Therefore, alot of self learning is required other than the lectures. There should be more study material covered in the course videos
Very good course assignments and projects. Really learnt good by exploring stackoverflow and other forums to complete the assignments. Please include some detailed lectures in the upcoming modules.
I found this course appealing because it was more practical based.it helped me alot in getting hands on experience and most of all I have learned how to solve real world problem with python libraries
Специализация Прикладная наука о данных с Python: общие сведения
The 5 courses in this University of Michigan specialization introduce learners to data science through the python programming language. This skills-based specialization is intended for learners who have a basic python or programming background, and want to apply statistical, machine learning, information visualization, text analysis, and social network analysis techniques through popular python toolkits such as pandas, matplotlib, scikit-learn, nltk, and networkx to gain insight into their data.

Часто задаваемые вопросы
Когда я получу доступ к лекциям и заданиям?
Что я получу, оформив подписку на специализацию?
Можно ли получить финансовую помощь?
Остались вопросы? Посетите Центр поддержки учащихся.