Mining publication papers via text mining Evaluation and Results

Document Type : Original Article

Authors

1 Cairo

2 Department of Computer Sciences, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt.

3 Computer Science Department, Faculty of Computer and Information Science, Ain Shams University, Cairo, Egypt

Abstract

Data is now the language of technology: every process takes data as its input and produces data as its output. Analyzing data is therefore a significant task, especially given the growing volume of data being produced, particularly in textual form. It is difficult to manually analyze such data, extract information, and detect hidden patterns in unstructured text. Data mining is an automated technique for gathering or deriving new, high-quality information and uncovering the relations among the data. Text mining is one of the main branches of data mining, although data mining is the more comprehensive field. This paper presents an overview of mining publication papers via text mining techniques, together with their results and evaluation, as follows: the first approach is keyword extraction using a natural language processing (NLP) approach, the second approach is named entity recognition, and the last approach is document clustering, with machine learning techniques applied to both of the latter approaches.

Keywords