Using Association Rules to Enrich Arabic Ontology

A. Ksiksi, H. Amiri

Abstract


In this article, we propose the use of a minimal generic base of associative rules between term association rules, to automatically enrich an existing domain ontology. Initially, non-redundant association rules between terms are extracted from an Arabic corpus. Then, the matching of the candidate terms is done through the matching between the concepts of the initial ontology and the premises of the association rules, with three distance measures that we define.


Keywords


ontology; automatic enrichment; association rules

Full Text:

PDF

References


E. Agirre, O. Ansa, E. Hovy , D. Martinez, “Enriching very large ontologies using the WWW”, ECAI 2000 Workshop on Ontology Learning, Berlin, Germany, August 2000

A. Faatz, R. Steinmetz , “Ontology enrichment with texts from the WWW”, 2nd Semantic Web Mining Workshop at ECMLI/PKDD, WS’02, Helsinki, Finland, pp. 20-33, 2002

V. Parekh, J. Gwo, T. Finin, “Mining Domain Specific Texts and Glossaries to Evaluate and Enrich Domain Ontologies”, Proceedings of the International Conference of Information and Knowledge Engineering, Las Vegas, USA, June 21, 2004

K. Neshatian, M. R. Hejazi, “Text categorization and classification in terms of multi-attribute concepts for enriching existing ontologies”, Proceedings of the 2nd Workshop on Information Technology and its Disciplines, pp. 43-48, 2004

P. Velardi, M. Missikoff, R. Basili, “Identification of relevant terms to support the construction of Domain Ontologies”, ACL-EACL Workshop on Human Language Technologies, Toulouse, France, July 2001

A. Xu, S.-K. Park, S. D'Mello, E. Kim, Q. Wang, C. Pikielny, “Novel genes expressed in subsets of chemosensory sensilla on the front legs of male Drosophila melanogaster”, Cell and Tissue Research, Vol. 307, No. 3, pp. 381-392, 2002

R. Srikant, R. Agrawal, “Mining generalized association rules”, Future Generation Computer Systems, Vol. 13, No. 23, pp. 161-180, 1997

R. Bendaoud, “Construction et enrichissement d’une ontologie à partir d’un corpus de textes”, Actes des Rencontres des Jeunes Chercheurs en Recherche d’Information (RJCRI’06), Lyon, France, pp. 353-358, March, 2006 (in French)

A. Maedche, S. Staab, “Mining ontologies from text”, Lecture Notes in Computer Science, Vol. 1937, pp. 189-202, Springer-Verlag, 2000

G. Stumme, A. Hotho, B. Berendt, “Semantic web mining : State of the art and future directions”, Web Semantics: Science, Services and Agents on the World Wide Web, Vol. 4, No. 2, pp. 124-143, 2006

L. Jorio, L. Abrouk, C. Fiot, D. Hérin, M. Teisseire, “Enrichissement d’ontologie basé sur les motifs séquentiels”, Actes de la Plateforme AFIA 2007, Atelier Ontologies et gestion de l’hétérogénéité sémantique, 2007(in French)

A. Maedche, V. Pekar, S. Staab, “Ontology Learning Part One - On Discovering Taxonomic Relations from the Web”, in: Web Intelligence, pp. 301-319, Springer Verlag, 2002

N. Hernandez, J. Mothe, C. Chrisment, D. Egret, “Modeling context through domain ontologies”, Information Retrieval, Vol. 10, No. 2, pp. 143-172, 2007

P. Cimiano, A. Hotho, G. Stumme, J. Tane, “Conceptual Knowledge Processing with Formal Concept Analysis and Ontologies”, Lecture Notes in Computer Science, Vol. 2961, pp. 189-207, Springer-Verlag, 2004

E. Han, G. Karypis, “Centroid based document classification : Analysis and experimental results”, Lecture Notes in Computer Science, Vol. 1910, pp. 424-431 Springer-Verlag, 2000

R. Gras, Contribution à l'étude expérimentale et à l'analyse de certaines acquisitions cognitives et de certains objectifs didactiques en mathématiques, Thèse d'Etat, Universit e de Rennes, 1979, (in French)

J. L. Guigues, V. Duquenne, “Familles minimales d'implications informatives résultant d'un tableau de données binaires”, Mathématiques et Sciences Humaines, Vol. 95, pp. 5-18, 1986

R. Agrawal, T. Imielinski, A. Swami, “Mining Association Rules between sets of items in large Databases”, Proceedings of ACMSIGMOD Conference,Washington, USA, pp. 207-216,May 25-28, 1993

S. Guillaume, Traitement des données volumineuses, mesures et algorithmes d'extraction de RA et règles ordinales, PhD Thesis, Nantes, 2000

M. Cadot, “RA et codage flou des données”, 11èmes Rencontres de la Société Francophone de Classification (SFC'04), Bordeaux, France, pp. 130-133, 2004, (in French)

N. Pasquier, “Data Mining : Algorithmes d'Extraction et de Réduction des RA dans les Bases de Données”, PhD Thesis, Université Blaise Pascal-Clermont-Ferrand II, 2000 (in French)

F. Guillet. “Mesure de qualité des connaissances en ECD”, Cours donné lors des journées de la conférence EGC 2004, Clermont-ferrand, January 2004, (in French)

M. Botta, J. F. Boulicaut, C. Masson, R. Meo, “A Comparison between Query Languages for the Extraction of Association Rules”, Lecture Notes in Computer Science, Vol. 2454, pp. 1-10, Springer-Verlag, 2002

M. Jarrar, “Building a Formal Arabic Ontology”, Experts Meeting on Arabic Ontologies and Semantic Networks, Alecso, Arab League: Tunis, pp. 26-28, July 26-28, 2011

F. Z. Belkredim, F. Meziane, “DEAR-ONTO: A Derivational Arabic Ontology Based on Verbs”, International Journal of Computer Processing of Languages, Vol. 21, No. 03, pp. 279-291, 2008

C. C. Latiri, L. B. Ghezaïel, L. B. Ahmed, T. Tunsisie “Fast-MGB: Nouvelle base générique minimale de règles associatives”, EGC’2006, Lille, France, pp. 217-222, January, 2006, (in French)

C. L. Cherif, W. Bellagha, S. Ben Yahia, G. Guesmi, “VIE-MGB : A Visual Interactive Exploration of Minimal Generic Basis of Association Rules”, 3rd International Conference on Concept Lattices and their Applications (CLA’05), Olomouc, Czech Republic, pp. 179-196, September, 2005

C. Fankam, OntoDB2: un système flexible et efficient de Base de Données à Base Ontologique pour le Web sémantique et les données techniques, PhD Thesis, ISAE-ENSMA Ecole Nationale Supérieure de Mécanique et d’Aérotechique-Poitiers, 2009, (in French)

N. F. Noy, R. W. Ferguson, M. A. Musen, “The Knowledge Model of Protégé-2000 : Combining Interoperability and Flexibility”, Lecture Notes in Computer Science, Vol. 1937, pp. 17-32, Springer-Verlag, 2000

Z. Wu, M. Palmer, “Verb semantics and lexical selection”, 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, New Mexico, USA, pp. 133-138, June 27-30, 1994

T. R. Gruber, “The Role of Common Ontology in Achieving Sharable, Reusable Knowledge Bases”, Proceedings of the Second International Conference, Cambridge, pp. 601-602, Morgan Kaufmann, 1991

D. J. Abadi, A. Marcus, S. R. Madden, K. Hollenbach, “Scalable Semantic Web DataManagement Using Vertical Partitioning”, 33rd International Conference on Very Large Data Bases, Vienna, Austria, pp. 411-422, September 23-27, 2007

P. Gamallo, M. Gonzalez, A. Agustini, G. Lopes, V. S. de Lima, “Mapping Syntactic Dependencies onto Semantic Relations”, Proceedings of the ECAI 2002 Workshop on Machine Learning and Natural Language Processing for Ontology Engineering (OLT’2002), Lyon, France, pp. 15-22, 2002

F. Cerbah, ”Learning highly structured semantic repositories from relational databases”, Lecture Notes in Computer Science, Vol. 5021, pp. 777-781, Springer-Verlag, 2008




eISSN: 1792-8036     pISSN: 2241-4487