Predicting the Future of Text: A Hybrid Approach to Next-Word Prediction
DOI:
https://doi.org/10.13053/cys-29-3-5919
Keywords:
Next-word prediction, CNN-LSTM hybrid architecture, natural language processing, text generation, sequential data modeling, one-hot encoding, Sherlock Holmes corpus, language modeling, neural networks
Abstract
Text input has become an integral part of modern communication, spanning everything from everyday conversations to formal content creation. However, manual typing is often slow and error-prone, which has driven the need for efficient text prediction models that improve user experience and productivity. By anticipating and generating the next likely word in a sequence, next-word prediction systems contribute significantly to faster and more accurate text composition. Early approaches such as N-grams established the foundational concepts but were limited in their ability to capture complex, long-range context. In recent years, the field has been dominated by large-scale Transformer architectures, which have set new benchmarks in language understanding. However, their significant computational demands often create a barrier to deployment in resource-constrained environments such as smartphones or embedded systems. This paper addresses this challenge by introducing a hybrid deep learning model that balances predictive accuracy with computational efficiency. Our proposed architecture merges convolutional neural networks (CNNs) with bidirectional long short-term memory (Bi-LSTM) networks. CNNs are highly effective at extracting local, spatial features from text, while Bi-LSTMs excel at learning long-range sequential dependencies. By training this model on the classic Sherlock Holmes dataset, we demonstrate its ability to achieve nearly 76% contextual accuracy, showing that it is a viable alternative for real-world applications. This work validates the effectiveness of hybrid models in creating intelligent text generation systems for tools such as smart keyboards and assistive writing technologies.
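For readers unfamiliar with the N-gram baseline that the abstract contrasts with the hybrid model, the following is a minimal illustrative sketch of a bigram next-word predictor. It is not the paper's CNN-Bi-LSTM model. The function names `train_bigram` and `predict_next` and the toy sentence are assumptions made up for this example, not taken from the paper or its dataset.

```python
from collections import Counter, defaultdict


def train_bigram(text):
    """Count, for each word, how often each following word occurs."""
    words = text.lower().split()
    counts = defaultdict(Counter)
    for current, following in zip(words, words[1:]):
        counts[current][following] += 1
    return counts


def predict_next(counts, word, k=1):
    """Return up to k most frequent followers of `word`, or [] if unseen."""
    key = word.lower()
    if key not in counts:
        return []
    return [w for w, _ in counts[key].most_common(k)]


# Toy text invented for illustration only (not the paper's corpus).
corpus = "the game is afoot . the game is on . the hound is near"
model = train_bigram(corpus)

print(predict_next(model, "game"))
print(predict_next(model, "the"))
```

Because the prediction depends only on the single preceding word, such a model cannot use earlier context in the sentence, which is the limitation the abstract attributes to N-gram approaches.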
Published
2025-09-28
Issue
Section
Articles of the Thematic Section
License
Hereby I transfer exclusively to the Journal "Computación y Sistemas", published by the Computing Research Center (CIC-IPN), the Copyright of the aforementioned paper. I also accept that these rights will not be transferred to any other publication, in any other format, language or other existing means of developing.
I certify that the paper has not been previously disclosed or simultaneously submitted to any other publication, and that it does not contain material whose publication would violate the Copyright or other proprietary rights of any person, company or institution. I certify that I have the permission from the institution or company where I work or study to publish this work.
The representative author accepts the responsibility for the publication of this paper on behalf of each and every one of the authors.
This transfer is subject to the following conditions:
- The authors retain all ownership rights (such as patent rights) of this work, except for the publishing rights transferred to the CIC, through this document.
- Authors retain the right to publish the work in whole or in part in any book of which they are the authors or publishers. They can also make use of this work in conferences, courses, personal web pages, and so on.
- Authors may include the work as part of their thesis, for non-profit distribution only.