Practical-5
Text
Analytics
Text analytics is the automated process of translating
large volumes of unstructured text into quantitative data to uncover insights,
trends, and patterns.
Sentiment
Analysis
Sentiment Analysis is the process of determining whether
a piece of writing is positive, negative or neutral. A sentiment analysis
system for text analysis combines natural language processing (NLP) and machine
learning techniques to assign weighted sentiment scores to the entities,
topics, themes and categories within a sentence or phrase.
WHY?
Sentiment analysis is extremely useful in social media monitoring as it allows us to gain an overview of the wider public opinion behind certain topics. Sentiment analysis is useful for quickly gaining insights using large volumes of text.
Preprocessing is a key component in Data Science. Orange tool has various ways to achieve the activities.
Discretization: It is the process of transferring continuous functions, models, variables, and equations into discrete counterparts. This process is usually carried out as a first step toward making them suitable for numerical evaluation and implementation data by the models.
Discretization replaces continuous features with the corresponding categorical features:
Sample Code:
Continuization:
Given a data table, return a new table in which the discretize attributes are replaced with continuous or removed.
- binary variables are transformed into 0.0/1.0 or -1.0/1.0 indicator variables, depending upon the argument
zero_based. - multinomial variables are treated according to the argument
multinomial_treatment. - discrete attribute with only one possible value are removed
Comments
Post a Comment