Stylometric Author Identification via CNN-BiLSTM Architecture on Syntactic Text Patterns

Main Article Content

Shaik Mulli Shabeer
Koppula Lakshmi Keerthi
Dhulipala Prem Aditya
Karumanchi Priyanka
Lambu Abhinay

Abstract

Authorship attribution is a critical task in natural language processing that involves identifying the author of a given text based on writing style, linguistic patterns, and structural features. This research presents a deep learning-based approach combining Convolutional Neural Networks (CNN) and Bidirectional Long Short-Term Memory (BiLSTM) networks to accurately attribute authorship. Using the Reuters-50-50 dataset, we extract syntactic and structural information such as part-of-speech tags, punctuation frequency, and average sentence length, which help capture the unique stylistic traits of individual authors. The text is cleaned, transformed into numerical vectors, and used to train the proposed model. Experimental results demonstrate that the hybrid CNN-BiLSTM architecture achieves high accuracy of 96% in identifying authors from unseen text samples. The model also performs well across other metrics such as precision, recall, and F1-score, showing its robustness and effectiveness in capturing deep textual patterns. This work contributes to the fields of authorship verification, plagiarism detection, and digital forensics, offering a scalable and reliable solution for text-based author identification.

Article Details

How to Cite
Shabeer, S. M., Keerthi , K. L., Aditya, D. P., Priyanka, K., & Abhinay, L. (2025). Stylometric Author Identification via CNN-BiLSTM Architecture on Syntactic Text Patterns. International Journal on Advanced Computer Engineering and Communication Technology, 14(1), 1–8. Retrieved from https://journals.mriindia.com/index.php/ijacect/article/view/165
Section
Articles

Similar Articles

1 2 3 4 5 6 7 8 9 > >> 

You may also start an advanced similarity search for this article.