Website of D. Serikbayev EKTU

CREATION OF A DICTIONARY OF KEYWORDS FOR A CLASSIFIER OF TEXTS CONTAINING DANGEROUS CONTENT IN THE CYBERSPACE OF KAZAKHSTAN

Authors

Name: Kuanysh Nursakitov
Affiliation: EKTU

Published:

2023-09-30

Article language:

Russian

Keywords:

natural language processing, sentiment analysis, machine learning, term frequency, text classification

Abstract

This work is part of a study on building an information system that searches for dangerous content in the cyberspace of Kazakhstan. The aim of the study is to create a dictionary of keywords for a classifier of texts containing dangerous content, using as an example the problem of identifying suicidal risk in the texts of suicide notes and of suicide-themed groups. No such database currently exists for the Kazakh language. As a result of this research, an experimental corpus and a list of keywords in the Kazakh language were created. The keywords were added to the database together with their various morphological forms.
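
The abstract does not state how the keywords were selected, so the following is only a minimal sketch of one common term-frequency approach, not the author's method. It ranks tokens by how much more frequent they are in a target corpus than in a reference corpus, then attaches morphological forms to each keyword from a lookup table. The function names, the smoothing constant, the whitespace tokenizer, and the `forms_by_lemma` table are all assumptions made for illustration; the tokens in the usage note are neutral placeholders.

```python
from collections import Counter


def term_frequencies(docs):
    """Relative frequency of each lowercased whitespace token across docs."""
    counts = Counter(tok for doc in docs for tok in doc.lower().split())
    total = sum(counts.values())
    return {tok: n / total for tok, n in counts.items()}


def candidate_keywords(target_docs, reference_docs, top_n=3, smoothing=1e-6):
    """Rank target-corpus tokens by target TF divided by reference TF.

    The smoothing term (an assumed value) avoids division by zero for
    tokens that never occur in the reference corpus.
    """
    target_tf = term_frequencies(target_docs)
    reference_tf = term_frequencies(reference_docs)
    ratio = {
        tok: tf / (reference_tf.get(tok, 0.0) + smoothing)
        for tok, tf in target_tf.items()
    }
    # Sort by descending ratio; break ties alphabetically for stable output.
    return sorted(ratio, key=lambda tok: (-ratio[tok], tok))[:top_n]


def expand_forms(keywords, forms_by_lemma):
    """Attach known morphological forms to each keyword.

    forms_by_lemma is a hypothetical hand-built table; a real system for
    Kazakh would need a morphological analyzer or curated word lists.
    """
    return {kw: [kw] + forms_by_lemma.get(kw, []) for kw in keywords}
```

For example, with `target_docs = ["alpha beta alpha", "alpha gamma"]` and `reference_docs = ["beta gamma delta", "gamma delta"]`, the token `alpha` ranks first because it is frequent in the target corpus and absent from the reference corpus.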

Nursakitov, K. (2023). CREATION OF A DICTIONARY OF KEYWORDS FOR A CLASSIFIER OF TEXTS CONTAINING DANGEROUS CONTENT IN THE CYBERSPACE OF KAZAKHSTAN. Вестник ВКТУ, 1(3). Retrieved from https://vestnik.ektu.kz/index.php/vestnik/article/view/574