Release:2019, Vol. 5. №4 (20)
About the authors:Alexander A. Chernyaev, Postgraduate Student, Researcher, University of Tyumen; firstname.lastname@example.org
Abstract:One of the most important tasks of the contemporary society includes fighting the spreading false information. The unprecedented transition from the traditional media to the modern methods of receiving news has created many problems with verifying its authenticity. Contemporary journalists have to compete with a huge data stream of ordinary users, which is why the main quality factor is the time to publish a news article. As a result, an increasing number of traditional news sources report unclarified information due to the rush to be first. This paper considers a method for determining the presence of hearing in the mass media for the Russian language. This method aims to study the possibility of searching for rumors among users’ messages in social networks. Achieving this goal requires various methods of text analysis, including semantic and linguistic analysis, as well as the analysis of the distribution of records relative to time segments. During the research, the authors have analyzed different popular tools for obtaining data from social networks. In addition, they have manually compiled and marked a sample for training the neural network. As a tool for solving the problem, we used a neural network based on a multi-layer perceptron. The inputs receive a set of 15 metrics that evaluate all aspects of hearing, and as an output, the probability of hearing. The test was performed using various metrics that showed high results for the constructed neural network model. Cross-validation has shown that the model is able to withstand various checks at a high level.
Klimenko S. 2012. “Collecting marketing information and competitive intelligence using social networks”. Financial Life, vol. 1, no 1, pp. 27-31. [In Russian]
Korshunov A., Beloborodov I. et al. 2014. “Analysis of social networks: methods and applications”. Proceedings of the Institute of System Programming of the Russian Academy of Sciences, vol. 26, no 1, pp. 439-456. [In Russian]
Martyshkin A. I., Salnikov I. I., Pashchenko D. V. 2018. “Stages of collecting and presenting big data”. Proceedings of the Tula State University. Technical Sciences, no 9, pp. 617-628. [In Russian]
Milchuk Ya. 2018. “Analysis of social network data as a way to collect information”. Scientific Journal, vol. 5, no 28, pp. 30-31. [In Russian]
Mikhaleva K. A., Kubarev A. I., Poddubny V. V. 2015. “Bayesian classifiers based on the main components and assessment of their quality in the absence of control samples”. Proceedings of the 19th All-Russian Scientific-Practical Conference “Scientific Creativity of Youth. Mathematics. Informatics”, pp. 70-75. [In Russian]
Agarwal A., Xie B., Vovsha I., Rambow O., Passonneau A. R. 2011. “Sentiment analysis of Twitter data”. Proceedings of the Workshop on Languages in Social Media. Association for Computational Linguistics, pp. 30-38.
Benoit K., Watanabe K., Wang H., Nulty P., Obeng A., Müller S., Matsuo A. 2018. “quanteda: an R package for the quantitative analysis of textual data”. Journal of Open Source Software, vol. 3, no 30, art. 774. DOI: 10.21105/joss.00774
Chua A. Y. K., Banerjee S. 2016. “Linguistic predictors of rumor veracity on the Internet”. Proceedings of the International MultiConference of Engineers and Computer Scientists, vol. 1, pp. 387-391.
Du J., Zhang Y., Luo J., Jia Y., Wei Q., Tao C., Xu H. 2018. « Extracting psychiatric stressors for suicide from social media using deep learning”. BMC Medical Informatics and Decision Making, vol. 18, art. 43. DOI: 10.1186/s12911-018-0632-8
Friggeri A., Adamic L., Eckles D., Cheng J. 2014. “Rumor cascades”. International AAAI Conference on Web and Social Media, pp. 101-110. https://www.aaai.org/ocs/index.php/ICWSM/ICWSM14/paper/view/8122
Hamidian S., Diab M. 2015. “Rumor detection and classification for Twitter data”. SOTICS 2015: The 5th International Conference on Social Media Technologies, Communication, and Informatics, pp. 71-77.
Ma J., Gao W., Mitra P., Kwon S., Jansen B. J., Wong K.-F., Cha M. 2016. “Detecting rumors from microblogs with recurrent neural networks”. Proceedings of the 25th International Joint Conference on Artificial Intelligence, pp. 3818-3824.
pandas: powerful Python data analysis toolkit. https://pandas.pydata.org/pandas-docs/stable/index.html
Smith K. 2017. “Statistics of the social network Twitter”. Brandwatch. https://www.brandwatch.com/blog/twitter-stats-and-statistics/
Takahashi T., Igata N. 2012. “Rumor detection on Twitter”. Soft Computing and Intelligent Systems (SCIS), no 6, pp. 452-457.
Thomas K., Grier C., Paxson V., Song D. 2011. “Suspended accounts in retrospect: an analysis of Twitter spam”. Proceedings of the ACM SIGCOMM Conference on Internet Measurement Conference, pp. 243-258.
Vosoughi S. 2015. “Automatic detection and verification of rumors on Twitter”. Ph. D. thesis. Cambridge: Massachusetts Institute of Technology.
Zhao Z., Resnick P., Mei Q. 2015. “Enquiring minds: early detection of rumors in social media from enquiry posts”. Proceedings of the 24th International Conference on World Wide Web, pp. 1395-1405. DOI: 10.1145/2736277.2741637