Author Identification Based on NLP

Authors

Publication Date

DOI

Abstract Views

Downloads

Citation

Abstract

The volume of textual content is growing exponentially, especially through the publication of articles, and the problem is further complicated by the increase in anonymous textual data. Researchers are looking for alternative methods to predict the author of an unknown text, a task called author identification. This study uses Bag of Words (BOW) and Latent Semantic Analysis (LSA) features. The “All the news” dataset on Kaggle is used for experimentation and to compare BOW and LSA on the task of author identification. Support vector machine, random forest, and logistic regression (LR) classifiers, together with Bidirectional Encoder Representations from Transformers (BERT), are used for author prediction. In the first scope, with 20 authors and 100 articles per author, the highest accuracy is obtained by logistic regression using bag-of-words, followed by random forest, also using bag-of-words; with every algorithm, bag-of-words scored better than LSA. The BERT model was also applied and achieved 70.33% accuracy. In the second scope, which increases the number of articles to 500 per author and decreases the number of authors to 10, BOW achieves its best result with the logistic regression algorithm, at 93.86% accuracy. Moreover, the best accuracy is obtained with LR at 94.9% when the two feature sets are merged, which proved better than applying BOW and LSA individually, an improvement of almost 0.1% compared with BOW only. Finally, BERT achieved 86.56% accuracy and a log loss of 0.51.
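As an illustration of two ideas named in the abstract, the sketch below builds bag-of-words count vectors and computes the two reported metrics, accuracy and log loss. It is a minimal standard-library example, not the study's implementation: the function names, the whitespace tokenizer, and the toy documents are all assumptions, and LSA and the classifiers themselves are deliberately left out.

```python
import math
from collections import Counter

def bag_of_words(docs):
    """Return (vocabulary, count vectors) for a list of documents.
    Tokenization is a plain lowercase whitespace split (an assumption)."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    index = {tok: i for i, tok in enumerate(vocab)}
    vectors = []
    for toks in tokenized:
        row = [0] * len(vocab)
        for tok, count in Counter(toks).items():
            row[index[tok]] = count
        vectors.append(row)
    return vocab, vectors

def accuracy(y_true, y_pred):
    """Fraction of documents whose predicted author matches the true one."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def log_loss(y_true, probs, eps=1e-15):
    """Mean negative log-probability assigned to the true author.
    probs[i] is a dict mapping author -> predicted probability for document i."""
    total = 0.0
    for true_author, dist in zip(y_true, probs):
        p = min(max(dist.get(true_author, 0.0), eps), 1 - eps)  # avoid log(0)
        total -= math.log(p)
    return total / len(y_true)

# Toy documents (hypothetical, not taken from the dataset).
docs = ["the markets fell today", "the team won the match"]
vocab, X = bag_of_words(docs)
```

A lower log loss means the model assigns more probability to the correct author, so it complements accuracy by penalizing confident wrong predictions.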

Keywords: Analysis, Identification, NLP, author, data analytics

This work by European American Journals is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 Unported License

Email ID: editor.ejcsit@ea-journals.org
Impact Factor: 7.80
Print ISSN: 2054-0957
Online ISSN: 2054-0965
DOI: https://doi.org/10.37745/ejcsit.2013
