This course will introduce the learner to text mining and text manipulation basics. The course begins with an understanding of how text is handled by python, the structure of text both to the machine and to humans, and an overview of the nltk framework for manipulating text. The second week focuses on common manipulation needs, including regular expressions (searching for text), cleaning text, and preparing text for use by machine learning processes. The third week will apply basic natural language processing methods to text, and demonstrate how text classification is accomplished. The final week will explore more advanced methods for detecting the topics in documents and grouping them by similarity (topic modelling).
Этот курс входит в специализацию ''Специализация Прикладная наука о данных с Python'
от партнера
Об этом курсе
Чему вы научитесь
Understand how text is handled in Python
Apply basic natural language processing methods
Write code that groups documents by topic
Describe the nltk framework for manipulating text
Приобретаемые навыки
- Natural Language Toolkit (NLTK)
- Text Mining
- Python Programming
- Natural Language Processing
от партнера

Мичиганский университет
The mission of the University of Michigan is to serve the people of Michigan and the world through preeminence in creating, communicating, preserving and applying knowledge, art, and academic values, and in developing leaders and citizens who will challenge the present and enrich the future.
Программа курса: что вы изучите
Module 1: Working with Text in Python
Module 2: Basic Natural Language Processing
Module 3: Classification of Text
Module 4: Topic Modeling
Рецензии
- 5 stars55,14 %
- 4 stars25,27 %
- 3 stars11,96 %
- 2 stars4,29 %
- 1 star3,31 %
Лучшие отзывы о курсе APPLIED TEXT MINING IN PYTHON
Everything was awesome, assignment 2 was my favorite in a long while in this specialization series. Week 4 was a little weak, and felt rushed. Overall, I enjoyed this course 4 of the 5.
This course give the basic idea in each module existed in text and natural language processing kits. A lot more for self-explore, but this will intrigue to begin sooner and learn wider.
Passionate instructor and a great primer on how software can infer useful data from text. Gives a preliminary understanding on the algorithms used in scikit learn and nltk.
A little bit stretched my python skill, but learned a lot. Forum is a good place, and maybe next I will join some study group online or offline to have more discussions.
Специализация Прикладная наука о данных с Python: общие сведения
The 5 courses in this University of Michigan specialization introduce learners to data science through the python programming language. This skills-based specialization is intended for learners who have a basic python or programming background, and want to apply statistical, machine learning, information visualization, text analysis, and social network analysis techniques through popular python toolkits such as pandas, matplotlib, scikit-learn, nltk, and networkx to gain insight into their data.

Часто задаваемые вопросы
Когда я получу доступ к лекциям и заданиям?
Что я получу, оформив подписку на специализацию?
Можно ли получить финансовую помощь?
Остались вопросы? Посетите Центр поддержки учащихся.