Project based Text Mining in Python

Use of Natural Language Processing, Machine Learning and Sentiment Analysis towards Data Science

In this course the students will learn the basics of text mining and will build on it to perform document categorization, grouping and sentiment analysis.
The practicals are carried out in Python language, Natural Language Processing (NLP) is used for pre-processing before training machine learning models.
Sentiment analysis of user hotel reviews
Deep neural networks for text analysis

In this course, we study the basics of text mining.

The basic operations related to structuring the unstructured data into vector and reading different types of data from the public archives are taught.
Building on it we use Natural Language Processing for pre-processing our dataset.
Machine Learning techniques are used for document classification, clustering and the evaluation of their models.
Information Extraction part is covered with the help of Topic modeling
Sentiment Analysis with a classifier and dictionary based approach
Almost all modules are supported with assignments to practice.
Two projects are given that make use of most of the topics separately covered in these modules.
Finally, a list of possible project suggestions are given for students to choose from and build their own project.

Beginners in python and curious about data science
Knows programming in Python and basic concepts of Data Science but cannot practically relate the two.
Intermediate level Data scientists interested in latest text analysis approaches.