QUESTION ANSWERING SYSTEM UPON UNIFIED LANGUAGE MODEL AND EVALUATING PERFORMANCE OF DATASETS

  • Nurbek Ismagulov Suleyman Demirel University

Abstract

Present days require automation and optimization in simple but urgent tasks. It is granted to use opportunities of technologies and science in order to work efficiently and to stay productive. In this paper, I seek to understand opportunities and drawbacks of the publicly available datasets, such as SQuAD, TriviaQA, Natural Questions (NQ), QuAC, NewsQA. It is vital to choose a suitable dataset in order to create a system with better performance. Specifically, the paper proposes an automatic question creating system that uses state-of-the-art Natural Language Processing (NLP) - Unified Language Model (UniLM). The question generating algorithm was verified using best datasets, and it has shown noteworthy results - questions generated were logical and correct. This study is important for teachers, teacher assistants, to save time writing test questions and spend it for more important duties.
Published
2023-03-13
How to Cite
ISMAGULOV, Nurbek. QUESTION ANSWERING SYSTEM UPON UNIFIED LANGUAGE MODEL AND EVALUATING PERFORMANCE OF DATASETS. SDU Bulletin: Natural and Technical Sciences, [S.l.], v. 62, n. 1, p. 103 - 112, mar. 2023. Available at: <https://journals.sdu.edu.kz/index.php/nts/article/view/738>. Date accessed: 18 apr. 2025. doi: https://doi.org/10.47344/sdubnts.v62i1.738.