Opportunities and limitations of automatic detection of negative and positive

Ющук Евгений Леонидович · 06.01.2014

Automating the detection of negative and positive in texts is, at the current level of development of monitoring systems, a frequent subject of dispute. Critics of such systems have two main arguments. The first is that a machine cannot recognize sarcasm and humor. The second is that when a text mentions two objects (for example, a comparison like "Mercedes is guano, as every normal person knows, but the Lada Kalina rules"), the machine will not understand which object each judgment refers to.

I will express my opinion on this issue.

Machines really are in many ways inferior to humans when it comes to understanding emotions. But they are far superior to humans in speed. For practical purposes, it is much more important to cover a million pages than to carefully analyze a dozen. And at the current level of systems for detecting positive and negative, this does not lead to problems in practice. I will explain why.

First, even one person does not always understand another. Just look at any Internet forum: every so often people have to ask each other what was meant. In this respect, machine identification of negative and positive is neither better nor worse than human identification.

Second, the Internet resources that are evaluated for negative and positive fall into two categories of unequal size. The first are significant, widely visited, authoritative resources. There are not many of them.
The second are the "extras", the crowd. In the crowd the opinion of each individual does not matter, but the quantity does. While the number of statements is small, it does not affect the situation; when it becomes large, it does.

From this the conclusion is obvious: on significant resources the machine only needs to detect that the object is mentioned, not to evaluate negative and positive. There the assessment is made by a human.
On the "mass" resources the machine can also be used to assess negative and positive.

How to avoid the problem of sarcasm / humor and the problem that the negative word refers to another object, which is also mentioned in the text?

In fact, from a practical point of view there is no problem.
Refined sarcasm is so rare compared with more direct statements that in the case of the "extras" it can be safely ignored. Most likely it will even be balanced by statements of the opposite kind that the machine also fails to catch, so the shares of negative and positive will not change significantly. And even that fluctuation is so small that it does not matter.
Negative / positive related to the object of study is caught (for example, in IQBuzz) by means of the parameter "distance from the object". That is, for example, we look for the negative only within five words of the object. Naturally, some statements about the object will not fall into the selection. So what? When you assess the extent of the flood in the Far East in order to understand whether the water is rising or receding, do you try to measure it to the nearest glass? Does that matter?

It is exactly the same with systems that track the dynamics of negative and positive.
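
As an illustration of the "distance from the object" parameter mentioned above, here is a minimal sketch in Python. It is not how IQBuzz is implemented (that is not described in this thread). The word lists, the function names and the default window of five words are assumptions made only for this example, and the object is assumed to be a single word.

```python
import re

# Hypothetical mini-lexicons; a real monitoring system would use far larger ones.
NEGATIVE_WORDS = {"bad", "awful", "terrible", "broken"}
POSITIVE_WORDS = {"good", "great", "excellent", "rules"}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def sentiment_near_object(text, target, max_distance=5):
    """Count sentiment words within max_distance tokens of any mention of target.

    Returns a (negative_count, positive_count) tuple. Sentiment words that
    are farther from every mention than max_distance are ignored on purpose.
    """
    tokens = tokenize(text)
    target = target.lower()
    positions = [i for i, tok in enumerate(tokens) if tok == target]
    negative = positive = 0
    for i, tok in enumerate(tokens):
        # Skip tokens that are not close to any mention of the object.
        if not any(abs(i - p) <= max_distance for p in positions):
            continue
        if tok in NEGATIVE_WORDS:
            negative += 1
        elif tok in POSITIVE_WORDS:
            positive += 1
    return negative, positive


text = ("Mercedes is awful and everybody knows that for sure, "
        "but honestly my new Kalina rules")
mercedes_counts = sentiment_near_object(text, "Mercedes")
kalina_counts = sentiment_near_object(text, "Kalina")
```

In this sample sentence the negative word is counted only for "Mercedes" and the positive word only for "Kalina", because each lies more than five words away from the other object. Statements that fall outside the window are simply lost, which is the loss of precision accepted above.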

Группа К · 08.01.2014

How to avoid the problem of sarcasm / humor and the problem that the negative word refers to another object, which is also mentioned in the text?

The solution to the problem of choosing between sarcasm / humor or yes / no was described in detail by Academician L. A. Zadeh in his work "The Roles of Soft Computing and Fuzzy Logic in the Conception, Design and Deployment of Information/Intelligent Systems".

In this and his other works, Academician Zadeh suggested using a projection from qualitative to quantitative characteristics. For example, if we talk about a flood, the task is described in words in the form of fuzzy if-then rules:
- If the flood is small, then the quantitative value is small.
- If the flood seems to be average, then the quantitative value is average.
- If the flood seems large to the researcher, then its quantitative value is greater than the previous one.
Here the values "small", "medium" and "large" are defined by their membership functions. These functions, or quantitative values, are determined by the researcher based on experience.
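
To make the idea of membership functions more concrete, here is a minimal sketch in Python. All the numbers in it are assumptions chosen only for illustration: the water rise in metres, the breakpoints 0.5, 2.0 and 3.5, and the output values 1, 3 and 5 are hypothetical and are not taken from Zadeh's work. The final step is a simple weighted average, one of several possible ways to turn the fired rules into a single number.

```python
def falling(x, full, zero):
    """Membership that is 1 up to `full` and falls linearly to 0 at `zero`."""
    if x <= full:
        return 1.0
    if x >= zero:
        return 0.0
    return (zero - x) / (zero - full)


def rising(x, zero, full):
    """Membership that is 0 up to `zero` and rises linearly to 1 at `full`."""
    if x <= zero:
        return 0.0
    if x >= full:
        return 1.0
    return (x - zero) / (full - zero)


def triangular(x, left, peak, right):
    """Membership that is 0 outside (left, right) and 1 at `peak`."""
    if x <= left or x >= right:
        return 0.0
    if x <= peak:
        return (x - left) / (peak - left)
    return (right - x) / (right - peak)


def flood_memberships(rise_m):
    """Degrees to which a water rise (in metres) is small, medium or large.

    The breakpoints are hypothetical; in practice the researcher sets them.
    """
    return {
        "small": falling(rise_m, 0.5, 2.0),
        "medium": triangular(rise_m, 0.5, 2.0, 3.5),
        "large": rising(rise_m, 2.0, 3.5),
    }


# Hypothetical quantitative value attached to each rule's conclusion.
RULE_OUTPUTS = {"small": 1.0, "medium": 3.0, "large": 5.0}


def flood_score(rise_m):
    """Weighted average of the rule outputs, weighted by membership degree."""
    degrees = flood_memberships(rise_m)
    total = sum(degrees.values())
    if total == 0:
        raise ValueError("no rule applies to this input")
    return sum(degrees[name] * RULE_OUTPUTS[name] for name in degrees) / total
```

With these assumed breakpoints, a rise of 1.25 m is half "small" and half "medium", so its score lands halfway between the two rule outputs.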

By the way

Negative / positive related to the object of study is caught (for example, in IQBuzz) by means of the parameter "distance from the object". That is, for example, we look for the negative only within five words of the object.

You have given a very apt example of projecting a qualitative characteristic of an event onto a quantitative value: the number five. Look at your palm. Five fingers have always served Homo sapiens not only as a convenient tool for work, but also as a way to express an attitude to any event. For example, by opening the palm and showing all five fingers to the other person, we are in effect saying "Super". Wagging the index finger from side to side signals rather a negative assessment of what is happening. Thus the number 5 is a universal quantitative measure of events, including floods in the Far East :)

If the flood is small, then 1 point,
If the flood seems less than average, then 2 points,
If the flood is medium, then 3 points,
If the flood is greater than average, then 4 points,
If the flood is large, then 5 points.

It was L. A. Zadeh's theory that was used in the Search Audit program. It has 67 factors, each of which describes its own "flood". For example: while inspecting an office, the detective discovers that the office nameplate is missing. Based on a subjective assessment of the scale of this "flood", the detective rates the event by moving the indicator to one of five positions. The program stores a value for each of the five positions. Based on the 67 factors, the program's internal algorithm calculates the final business reliability index of the company.
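
The internal algorithm of the program is not described here, so the following Python sketch only shows the general shape of such a calculation. Everything in it is an assumption made for illustration: the values assigned to the five positions, the factor names, the optional weights, and the choice of a weighted average scaled to 0..100 where position 1 means "no problem" and position 5 means "serious problem".

```python
# Hypothetical risk value stored for each of the five indicator positions.
POSITION_VALUES = {1: 0.0, 2: 0.25, 3: 0.5, 4: 0.75, 5: 1.0}


def reliability_index(positions, weights=None):
    """Combine five-position factor ratings into one index from 0 to 100.

    positions: dict mapping a factor name to the chosen position (1..5).
    weights:   optional dict mapping a factor name to its relative weight;
               all factors count equally when it is omitted.
    """
    if not positions:
        raise ValueError("at least one factor is required")
    for name, pos in positions.items():
        if pos not in POSITION_VALUES:
            raise ValueError(f"position for {name!r} must be 1..5, got {pos!r}")
    if weights is None:
        weights = {name: 1.0 for name in positions}
    total_weight = sum(weights[name] for name in positions)
    risk = sum(weights[name] * POSITION_VALUES[pos]
               for name, pos in positions.items()) / total_weight
    # Higher risk means lower reliability.
    return 100.0 * (1.0 - risk)


# Hypothetical factors rated by the detective during an office inspection.
ratings = {"office_nameplate_missing": 5, "staff_present": 1}
index = reliability_index(ratings)
```

With two equally weighted factors rated 5 and 1, the sketch gives an index of 50. Giving one factor a larger weight shifts the result toward that factor's rating.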

Like this. By the way, until January 10, the program is offered for free

Ющук Евгений Леонидович · 08.01.2014

Y644232, thanks!

Ющук Евгений Леонидович · 08.01.2014

Y644232 wrote:
By the way, until January 10, the program is offered for free

Thanks again. And where can I get it for review?

Группа К · 08.01.2014

Ющук Евгений Леонидович wrote:
Y644232 wrote:
By the way, until January 10, the program is offered for free

Thanks again. And where can I get it for review?

The Detective Audit program can be downloaded here:
https://itunes.apple.com/en/app/detecti...?l=en&mt=8
The program runs on the iPhone 4 and 5.

The download is free. If there is interest, the freebie can be extended :) An information request to the Federal State Statistics Service (Rosstat) is paid: 300 rubles. For this money, the user receives all registration information about the company. But that's not all. In addition, the program automatically calculates risks for 12 factors.
I would be glad if you left a review, whether positive or not :)

Thank you.
Respectfully,
Krioni Alexander

Ющук Евгений Леонидович · 08.01.2014

Thank you, Alexander! I will take a look and contact you, either at the end or along the way if questions arise.

Группа К · 08.01.2014

Ющук Евгений Леонидович wrote:
Thank you, Alexander! I will take a look and contact you, either at the end or along the way if questions arise.

You're always welcome!

Ющук Евгений Леонидович · 08.01.2014

Did I understand correctly that the application in the App Store is the only version of the program? Does it not exist for Windows or Linux?

Группа К · 08.01.2014

Ющук Евгений Леонидович wrote:
Did I understand correctly that the application in the App Store is the only version of the program? Does it not exist for Windows or Linux?

No, Eugene, so far only for the App Store.

Ющук Евгений Леонидович · 08.01.2014

Yeah, okay. Well, then I'll look at Apple products.
