Enhancing Semantic Search Precision through the CBOW Algorithm in the Semantic Web
Received: 30 October 2024 | Revised: 18 November 2024 | Accepted: 29 November 2024 | Online: 5 December 2024
Corresponding author: Ashraf F. A. Mahmoud
Abstract
The Semantic Web enhances data interoperability and enables intelligent information retrieval through structured data representation. However, challenges remain in achieving high precision in semantic search. This paper uses the Continuous Bag of Words (CBOW) model to enhance semantic search precision. By generating rich word embeddings, CBOW enables a better understanding of contextual relationships among terms within semantic queries. Our approach has been evaluated using the websites intended to be used as a sample for testing the efficiency of semantic information retrieval, demonstrating significant improvements in search precision compared to traditional methods. The findings indicate that integrating CBOW into semantic search frameworks can lead to more relevant and accurate search results, paving the way for future advancements in Semantic Web technologies.
Keywords:
semantic search, continuous bag of words, imperfect interface, Word2vecDownloads
References
T. Berners-Lee, J. Hendler, and O. Lassila, "The Semantic Web," Scientific American, vol. 284, no. 5, pp. 29–37, May 01, 2001.
N. Shadbolt, T. Berners-Lee, and W. Hall, "The Semantic Web Revisited," IEEE Intelligent Systems, vol. 21, no. 3, pp. 96–101, Jan. 2006.
P. Hitzler, M. Krotzsch, and S. Rudolph, Foundations of Semantic Web Technologies. New York, NY, USA: Chapman and Hall - CRC Press, 2009.
L. Ding et al., "Swoogle: a search and metadata engine for the semantic web," in Proceedings of the thirteenth ACM international conference on Information and knowledge management, Washington, D.C., USA, 2004, pp. 652–659.
J. D. Ullman and J. Widom, A First Course in Database Systems, 3rd ed. New York, NY, USA: Pearson, 2008.
T. Mikolov, K. Chen, G. Corrado, and J. Dean, "Efficient Estimation of Word Representations in Vector Space," presented at the International Conference on Learning Representations, Scottsdale, AZ, USA, May 2-4, 2013.
R. Guha, R. McCool, and E. Miller, "Semantic search," in Proceedings of the 12th international conference on World Wide Web, New York, NY, USA, 2003, pp. 700–709.
C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval. Cambridge, UK: Cambridge University Press, 2008.
J. Pennington, R. Socher, and C. Manning, "GloVe: Global Vectors for Word Representation," in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, Doha, Qatar, 2014, pp. 1532–1543.
P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, "Enriching Word Vectors with Subword Information," Transactions of the Association for Computational Linguistics, vol. 5, pp. 135–146, Jun. 2017.
Q. Le and T. Mikolov, "Distributed Representations of Sentences and Documents," in Proceedings of the 31st International Conference on Machine Learning, Beijing, China, 2014, pp. 1188–1196.
E. H. Mohamed and E. M. Shokry, "QSST: A Quranic Semantic Search Tool based on word embedding," Journal of King Saud University - Computer and Information Sciences, vol. 34, no. 3, pp. 934–945, Mar. 2022.
A. Singhal and I. Google, "Modern Information Retrieval: A Brief Overview," IEEE Data Engineering Bulletin, vol. 24, no. 4, pp. 35–43, 2001.
J. Devlin, M. W. Chang, K. Lee, and K. Toutanova, "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding," in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Minneapolis, MN, USA, 2019, pp. 4171–4186.
M. M. El-Gayar, N. E. Mekky, A. Atwan, and H. Soliman, "Enhanced Search Engine Using Proposed Framework and Ranking Algorithm Based on Semantic Relations," IEEE Access, vol. 7, pp. 139337–139349, 2019.
Downloads
How to Cite
License
Copyright (c) 2024 Ashraf F. A. Mahmoud, Zakariya M. S. Mohammed, Mohamed Ben Ammar, Ali Satty, Faroug A. H. Abdalla, Gamal Saad Mohamed Khamis, Mohyaldein Salih, Abdelnasser Saber Mohamed
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.