TEXT ANALYTICS OR TEXT MINING
TEST ANALYTICS is the process of analysing and processing large volume of unorganized and unstructured text data thru a software for indentification of any sort of pattern, logic, concept, keywords or other attributes of data.
CHALLENGES ADDRESSED BY TEXT ANALYTICS
Used for Opinion mining or sentiment analysis by reviewing social networks, emails, reviews for positive and negative reviews or feelings of customer. This is used to fix issues in products or service before it impacts the sales, revenue or profits.
Data mining is also used for screening job candidates based on the keyword present in their resumes vis-a-vis requirement for the post.
Used for blocking spam emails as per the actions and words available in past data.
Also used for classifying contents of websites.
Used for identification of fraudulant claims of insurance by analysing data.
Diagnosis can be done by identification of description of medical situation or symptoms.
CHALLENGES BEING WORKED UPON
Data available for processing is often uncertain, unclear, indefinite and contradictory. Very difficult to process.
Ambiguity in the syntex of the data need to be processed along with presence of slang or sarcasm or techinal language to get proper result.
Large amount of training data is required along with processing power which make the execution expensive.
If data is biased result of analytics can be imperfect.