Automatic Term Extraction on Turkish Scientific Texts

Date

2020

Abstract

Understanding a text or a collection of texts depends on understanding the terms it contains. This study aims to detect terms in a domain-specific (Cyber Security) corpus. A two-layer method is proposed for identifying terms that occur as single words or as phrases. In the first layer, candidate term words are selected by statistical methods. In the second layer, semantic approaches are used to check whether these candidate words are likely to occur together in phrases. Word2Vec was used to measure semantic affinity, and three different datasets were used. The results show that terms occurring as single words or as two-word patterns were successfully identified by the proposed method. © 2020 IEEE.
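
The two-layer idea in the abstract can be illustrated with a minimal Python sketch. The abstract does not say which statistical measure or thresholds were used, so the relative-frequency ratio against a general reference corpus, the function names (`candidate_terms`, `phrase_terms`), the threshold values, and the toy data are all assumptions made for illustration. The hand-written two-dimensional vectors stand in for embeddings that would normally come from a trained Word2Vec model.

```python
import math
from collections import Counter


def candidate_terms(domain_tokens, reference_tokens, min_ratio=2.0, min_count=2):
    """Layer 1 (assumed statistic): keep words that are relatively more
    frequent in the domain corpus than in a general reference corpus."""
    dom = Counter(domain_tokens)
    ref = Counter(reference_tokens)
    n_dom, n_ref = len(domain_tokens), len(reference_tokens)
    vocab_size = len(set(dom) | set(ref))
    scores = {}
    for word, count in dom.items():
        if count < min_count:
            continue
        p_dom = count / n_dom
        # Add-one smoothing so words unseen in the reference corpus
        # do not cause a division by zero.
        p_ref = (ref[word] + 1) / (n_ref + vocab_size)
        ratio = p_dom / p_ref
        if ratio >= min_ratio:
            scores[word] = ratio
    return scores


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def phrase_terms(domain_tokens, candidates, vectors, min_similarity=0.5):
    """Layer 2: accept adjacent candidate pairs whose vectors are similar."""
    phrases = set()
    for left, right in zip(domain_tokens, domain_tokens[1:]):
        if left not in candidates or right not in candidates:
            continue
        if left not in vectors or right not in vectors:
            continue
        if cosine(vectors[left], vectors[right]) >= min_similarity:
            phrases.add((left, right))
    return phrases


# Toy data, invented for this sketch only.
domain = "siber tehdit ve siber tehdit analiz bir siber tehdit".split()
reference = "ve bir ve bir ve bir haber ve bir".split()
toy_vectors = {"siber": [1.0, 0.2], "tehdit": [0.9, 0.3]}  # stand-in for Word2Vec

candidates = candidate_terms(domain, reference)
phrases = phrase_terms(domain, candidates, toy_vectors)
print(sorted(candidates))
print(sorted(phrases))
```

With real data, the toy vector dictionary would be replaced by lookups into a trained embedding model, and the thresholds would need tuning on the corpus at hand.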
