BBC Russian

The WNSImRep v1 dataset is provided as supplementary material of the paper by Lastra-Díaz, J. J., & García-Serrano, A. (2016). HESML: a scalable ontology-based semantic similarity measures library with a set of reproducible... more

The WNSImRep v1 dataset is provided as supplementary material of the paper by Lastra-Díaz, J. J., & García-Serrano, A. (2016). HESML: a scalable ontology-based semantic similarity measures library with a set of reproducible experiments and a replication dataset. Information Systems. In the aforementioned work, we introduce a scalable Java software library of ontology-based semantic similarity measures and IC models, called HESML, and a set of reproducible experiments on word similarity. The WNSimRep v1 dataset is detailed in the enclosed file called "appendixB_WNSimRep_dataset_LastraGarcia_v1.pdf". This work introduces a framework whose aim is to allow the exact replication of most intrinsic Information Content (IC) models and ontology-based similarity measures reported in the literature by using the publicly available accompanying dataset, called the WNSimRep v1 dataset. This work has been carried-out in the context of a large evaluation campaign of ontology-based semantic similarity measures and IC models on WordNet based on HESML. Our work is encouraged by the identification of several reproducibility problems in a series of recent experimental surveys carried-out by the authors, together with the lack of a framework and gold standard to assist in the replication of ontology-based similarity measures and IC models. To bridge this gap, we introduce herein a replication framework defined by three different types of data file: (a) node-based data files which contain an explicit representation of the WordNet taxonomy together with a specific IC model and a collection of node-based taxonomical features, (b) edge-based data files which contain a family of edge-valued IC models based on the conditional probability between child and parent concepts, and (c) synset-pair-based data files which contain the synset pairs of the Rubenstein-Goodenough word similarity benchmark, together with a collection of taxonomical features based on synset pairs and all the ontology-based similarity measures evaluated on them. The fr [...]

Publication Date: Sep 8, 2016

Research Interests:
Computer Science, Information Retrieval, Ontology, Semantic similarity, Interdisciplinary research (Social Sciences), and Similarity Geometry

This dataset introduces a companion reproducibility Java console program, called HESML_vs_SML_test.jar, of the work introduced by Lastra-Díaz and García-Serrano [1]. This latter work introduces the Half-Edge Semantic Measures Library... more

This dataset introduces a companion reproducibility Java console program, called HESML_vs_SML_test.jar, of the work introduced by Lastra-Díaz and García-Serrano [1]. This latter work introduces the Half-Edge Semantic Measures Library (HESML), and carries-out an experimental survey between HESML V1R2, the Semantic Measures Library (SML) 0.9 [2] and the WNetSS [4] semantic measures libraries. The HESML_vs_SML_test.jar program runs the set of performance and scalability benchmarks detailed in [1] and generates the figures and tables of results reported in the aforementioned work, which are also enclosed as complementary files of this dataset (see files below). Licensing note: The 'HESML_vs_SML_test.jar' program is based on the HESML V1R2 [3], SML 0.9 [2] and WNetSS [4] semantic measures libraries, and it includes these libraries in its distribution, as well as WordNet 3.0 [6] and the SimLex665 [5] dataset. Thus, if you use this dataset, you should also cite the works related to these resources. References: [1] Lastra-Díaz, J. J., and García-Serrano, A. (2016). HESML: a scalable ontology-based semantic similarity measures library with a set of reproducible experiments and a replication dataset. To appear in Information Systems Journal. [2] Harispe, S., Ranwez, S., Janaqi, S., and Montmain, J. (2014). The Semantic Measures Library: Assessing Semantic Similarity from Knowledge Representation Analysis. In E. Métais, M. Roche, & M. Teisseire (Eds.), Proc. of the 19th International Conference on Applications of Natural Language to Information Systems (NLDB 2014) (Vol. 8455, pp. 254–257). Montpelier, France: Springer. http://dx.doi.org/10.1007/978-3-319-07983-7_37 [3] Lastra-Díaz, J. J., & García-Serrano, A. (2016). HESML V1R2 Java software library of ontology-based semantic similarity measures and information content models. Mendeley Data, v2. https://doi.org/10.17632/t87s78dg78.2 [4] Ben Aouicha, M., Taieb, M. A. H., and Ben Hamadou, A. (2016). SISR: System for integrating semantic relatedness and similarity meas [...]

Publication Date: Dec 21, 2016

Research Interests:
Computer Science and Scalability

HESML V1R2 is the second release of the Half-Edge Semantic Measures Library (HESML) [1], which is a new, scalable and efficient Java software library of ontology-based semantic similarity measures and Information Content (IC) models based... more

HESML V1R2 is the second release of the Half-Edge Semantic Measures Library (HESML) [1], which is a new, scalable and efficient Java software library of ontology-based semantic similarity measures and Information Content (IC) models based on WordNet. HESML V1R2 implements most ontology-based semantic similarity measures and Information Content (IC) models based on WordNet reported in the literature. In addition, it provides a XML-based input file format in order to specify the execution of reproducible experiments on WordNet-based similarity, even with no software coding. The V1R2 release significantly improves the performance of HESML V1R1. HESML is introduced and detailed in a companion reproducibility paper [1] of the methods and experiments introduced in [2,3,4]. The main features of HEMSL are as follows: (1) it is based on an efficient and linearly scalable representation for taxonomies called PosetHERep introduced in [1], (2) its performance exhibits a linear scalability as regards the size of the taxonomy, and (3) it does not use any caching strategy of vertex sets. HESML V1R2 is freely distributed for any non-commercial purpose under a CC By-NC-SA-4.0 license, subject to the citing of the main HESML paper [1] as attribution requirement. On other hand, the commercial use of the similarity measures introduced in [2], as well as part of the intrinsic IC models introduced in [3] and [4], is protected by a patent application [5]. In addition, any user of HESML must fulfill other licensing terms described in [1] related to other resources distributed with the library, such as WordNet and a dataset of corpus-based IC models, among others. References: [1] Lastra-Díaz, J. J., & García-Serrano, A. (2016). HESML: a scalable ontology-based semantic similarity measures library with a set of reproducible experiments and a replication dataset. To appear in Information Systems Journal. [2] Lastra-Díaz, J. J., & García-Serrano, A. (2015). A novel family of IC-based similarity measures with a detailed experimental [...]

Publication Date: Dec 21, 2016

Research Interests:
Computer Science, Information Retrieval, Ontology, Semantic similarity, Interdisciplinary research (Social Sciences), and Java

HESML V1R1 is a new Java software library called Half-Edge Semantic Measures Library (HESML), which implements most ontology-based semantic similarity measures and Information Content (IC) models based on WordNet reported in the... more

HESML V1R1 is a new Java software library called Half-Edge Semantic Measures Library (HESML), which implements most ontology-based semantic similarity measures and Information Content (IC) models based on WordNet reported in the literature. HESML is introduced and detailed in the paper by Lastra-Díaz, J. J., & García-Serrano, A. (2016). HESML: a scalable ontology-based semantic similarity measures library with a set of reproducible experiments and a replication dataset. Information Systems. HESML is motivated by several drawbacks in the current state-of-the-art software libraries, as well as the evaluation of the new methods introduced by the authors, together with the replication and evaluation of most previously reported methods. HESML is based on a new and efficient poset representation, called PosetHERep, which is an adaptation of the half-edge data structure commonly used to represent discrete manifolds and planar graphs in computational geometry. HESML proposes a memory-efficient representation for taxonomies which linearly scales with the taxonomy size and provides an efficient implementation of a large set of topological queries and graph-based algorithms. Likewise, HESML provides an open framework to aid research into the area by providing a simpler and more efficient software architecture than the current software libraries.

Publication Date: Jul 9, 2016

Research Interests:
Computer Science, Information Retrieval, Ontology, Java, and Mendeley

HESML V1R4 is the fourth release of the Half-Edge Semantic Measures Library (HESML) detailed in [1], which is a new, linerarly scalable and efficient Java software library of ontology-based semantic similarity measures and Information... more

HESML V1R4 is the fourth release of the Half-Edge Semantic Measures Library (HESML) detailed in [1], which is a new, linerarly scalable and efficient Java software library of ontology-based semantic similarity measures and Information Content (IC) models based on WordNet. HESML V1R4 implements most ontology-based semantic similarity measures and Information Content (IC) models based on WordNet reported in the literature, as well as the evaluation of three pre-trained word embedding models. It also provides a XML-based input file format in order to specify the execution of reproducible experiments on WordNet-based similarity, even with no software coding. HESML V1R4 introduces the following novelties: (1) a software implementation for the evaluation of three pre-trained word embedding file formats which support most of state-of--the-art models reported in the literature; (2) a software implementation of an intrinsic IC model and two new IC-based semantic similarity measures introduced by Cai et al. (2017); (3) a software implementation of a fast approximation of the Wu&Palmer (1994) measure commonly used in the literature; (4) the integration of a very large set of word similarity benchmarks; and finally (5), the correction of an error in our software implementation of the Leacock&Chodorow (1998) measure in previous HESML versions. HESML library is freely distributed for any non-commercial purpose under a CC By-NC-SA-4.0 license, subject to the citing of the main HESML paper [1] as attribution requirement. On other hand, the commercial use of the similarity measures introduced in [2], as well as part of the intrinsic IC models introduced in [3] and [4], is protected by a patent application [5]. In addition, any user of HESML must fulfill other licensing terms described in [1] related to other resources distributed with the library. References: [1] Lastra-Díaz, J. J., García-Serrano, A., Batet, M., Fernández, M., & Chirigati, F. (2017). HESML: a scalable ontology-based semantic similarity measures libra [...]

Publication Date: Sep 21, 2018

Research Interests:
Computer Science, Information Retrieval, Ontology, Semantic similarity, Interdisciplinary research (Social Sciences), and Java

HESML V1R3 is the third release of the Half-Edge Semantic Measures Library (HESML) detailed in [1], which is a new, scalable and efficient Java software library of ontology-based semantic similarity measures and Information Content (IC)... more

HESML V1R3 is the third release of the Half-Edge Semantic Measures Library (HESML) detailed in [1], which is a new, scalable and efficient Java software library of ontology-based semantic similarity measures and Information Content (IC) models based on WordNet. HESML V1R3 implements most ontology-based semantic similarity measures and Information Content (IC) models based on WordNet reported in the literature. It also provides a XML-based input file format in order to specify the execution of reproducible experiments on WordNet-based similarity, even with no software coding. The main features of HESML are as follows: (1) it is based on an efficient and linearly scalable representation for taxonomies called PosetHERep introduced in [1], (2) its performance exhibits a linear scalability as regards the size of the taxonomy, and (3) it does not use any caching strategy of vertex sets. HESML V1R3 introduces two minor novelties as follows: the vertex ID has been updated from Integer to Long type in order to support a larger number of vertexes, and it includes five new similarity measures introduced by Hao et al (2011), Liu et al (2007), Pekar&Staab (2002) and Stojanovic et al (2001). HESML library is freely distributed for any non-commercial purpose under a CC By-NC-SA-4.0 license, subject to the citing of the main HESML paper [1] as attribution requirement. On other hand, the commercial use of the similarity measures introduced in [2], as well as part of the intrinsic IC models introduced in [3] and [4], is protected by a patent application [5]. In addition, any user of HESML must fulfill other licensing terms described in [1] related to other resources distributed with the library, such as WordNet and a dataset of corpus-based IC models, among others. References: [1] Lastra-Díaz, J. J., García-Serrano, A., Batet, M., Fernández, M., & Chirigati, F. (2017). HESML: a scalable ontology-based semantic similarity measures library with a set of reproducible experiments and a replication dataset. Information Systems, [...]

Publication Date: Oct 3, 2017

Research Interests:
Computer Science, Information Retrieval, Ontology, Semantic similarity, Interdisciplinary research (Social Sciences), and 2 moreJava and Mendeley

The design of a computer support environment for cooperation must be based on the set of agreed organization procedures defined in a previous conceptual modelling phase (chapter 4).

Publisher: Springer Nature

Publication Date: 1993

Publication Name: Springer eBooks

Research Interests:
Computer Science and Springer Ebooks

Publisher: Elsevier BV

Ana M Garcia-Serrano

Publication Date: Sep 8, 2016

Research Interests: Computer Science, Information Retrieval, Ontology, Semantic similarity, Interdisciplinary research (Social Sciences), and Similarity Geometry<div>()</div>

Publication Date: Dec 21, 2016

Research Interests: Computer Science and Scalability<div>()</div>

Publication Date: Dec 21, 2016

Research Interests: Computer Science, Information Retrieval, Ontology, Semantic similarity, Interdisciplinary research (Social Sciences), and Java<div>()</div>

Publication Date: Jul 9, 2016

Research Interests: Computer Science, Information Retrieval, Ontology, Java, and Mendeley<div>()</div>

Publication Date: Sep 21, 2018

Research Interests: Computer Science, Information Retrieval, Ontology, Semantic similarity, Interdisciplinary research (Social Sciences), and Java<div>()</div>

Publication Date: Oct 3, 2017

Publisher: Springer Nature

Publication Date: 1993

Publication Name: Springer eBooks

Research Interests: Computer Science and Springer Ebooks<div>()</div>

Publisher: Elsevier BV

Publication Date: Nov 1, 2015

Publication Name: Knowledge Based Systems

Publication Date: 2014

Research Interests: Engineering, Computer Science, and CLEF<div>()</div>

Publication Date: 2020

Publication Name: CLEF (Working Notes)

Research Interests: Computer Science<div>()</div>

Publisher: Cornell University

Publication Date: May 18, 2022

Publication Name: arXiv (Cornell University)

Publication Date: Jul 18, 2011

Publication Name: Adaptive Multimedia Retrieval

Research Interests: Computer Science, Information Retrieval, Multimedia, Multimedia information retrieval, and Video Retrieval<div>()</div>

Publisher: Elsevier BV

Publication Date: 2015

Publication Name: Procedia Computer Science

Publisher: IGI Global

Publication Date: 2020

Publication Name: UXD and UCD Approaches for Accessible Education

Research Interests: Computer Science, Instructional Design, and Distance Education<div>()</div>

Publisher: ACM

Publication Date: 2016

Publication Name: Proceedings of the XVII International Conference on Human Computer Interaction

Research Interests: Computer Science<div>()</div>

Publication Date: Sep 8, 2016

Research Interests: Computer Science, Artificial Intelligence, Natural Language Processing, Interdisciplinary research (Social Sciences), Wordnet, and Similarity Geometry<div>()</div>

Publisher: Technical University of Valencia

Publication Date: Mar 2, 2021

Publication Name: Procesamiento Del Lenguaje Natural

Research Interests: Computer Science, Artificial Intelligence, Natural Language Processing, Named Entity Recognition, Procesamiento del Lenguaje Natural, and Unified Medical Language System<div>()</div>

Publisher: Elsevier BV

Publication Date: Nov 1, 2015

Publication Name: Engineering Applications of Artificial Intelligence

Research Interests: Engineering, Computer Science, Information Retrieval, Wordnet, and Similarity Geometry<div>()</div>

Publisher: Elsevier BV

Publication Date: Sep 1, 2016

Publication Name: Expert Systems With Applications

Research Interests: Computer Science and Mathematical Sciences<div>()</div>

Publisher: Public Library of Science

Publication Date: Nov 21, 2022

Publication Name: PLOS ONE

Publication Date: Nov 8, 2021

Publication Date: Apr 30, 2021

Research Interests: Computer Science, Information Retrieval, Semantic similarity, and Wordnet<div>()</div>

Publisher: ZappyLab, Inc.

Research Interests: Computer Science, Software, and SNOMED CT<div>()</div>

Publisher: e-cienciaDatos

Publication Date: 2020

Research Interests: Computer Science, Information Retrieval, Semantic similarity, Software, SNOMED CT, and Unified Medical Language System<div>()</div>

Publisher: e-cienciaDatos

Publication Date: 2020

Publisher: Springer Science and Business Media LLC

Publication Date: 2022

Publication Name: BMC Bioinformatics

Publisher: Public Library of Science (PLoS)

Publication Date: 2021

Publication Name: PLOS ONE

Publisher: IEEE

Publication Date: 2017

Publication Name: 2017 Intelligent Systems Conference (IntelliSys)

Research Interests: Computer Science, Abstraction, and Representation Politics<div>()</div>

Publication Date: 2020

Research Interests: Philosophy and Humanities<div>()</div>

Research Interests:
Computer Science, Information Retrieval, Ontology, Semantic similarity, Interdisciplinary research (Social Sciences), and Similarity Geometry

Research Interests:
Computer Science and Scalability

Research Interests:
Computer Science, Information Retrieval, Ontology, Semantic similarity, Interdisciplinary research (Social Sciences), and Java

Research Interests:
Computer Science, Information Retrieval, Ontology, Java, and Mendeley

Research Interests:
Computer Science, Information Retrieval, Ontology, Semantic similarity, Interdisciplinary research (Social Sciences), and Java

Research Interests:
Computer Science and Springer Ebooks

Research Interests:
Engineering, Computer Science, and CLEF

Research Interests:
Computer Science

Research Interests:
Computer Science, Information Retrieval, Multimedia, Multimedia information retrieval, and Video Retrieval

Research Interests:
Computer Science, Instructional Design, and Distance Education

Research Interests:
Computer Science

Research Interests:
Computer Science, Artificial Intelligence, Natural Language Processing, Interdisciplinary research (Social Sciences), Wordnet, and Similarity Geometry

Research Interests:
Computer Science, Artificial Intelligence, Natural Language Processing, Named Entity Recognition, Procesamiento del Lenguaje Natural, and Unified Medical Language System

Research Interests:
Engineering, Computer Science, Information Retrieval, Wordnet, and Similarity Geometry

Research Interests:
Computer Science and Mathematical Sciences

Research Interests:
Computer Science, Information Retrieval, Semantic similarity, and Wordnet

Research Interests:
Computer Science, Software, and SNOMED CT

Research Interests:
Computer Science, Information Retrieval, Semantic similarity, Software, SNOMED CT, and Unified Medical Language System

Research Interests:
Computer Science, Abstraction, and Representation Politics

Research Interests:
Philosophy and Humanities

Research Interests:
Computer Science, Artificial Intelligence, Natural Language Processing, Named Entity Recognition, Procesamiento del Lenguaje Natural, and Unified Medical Language System

Research Interests:
Philosophy, Humanities, and Inteligencia artificial

Research Interests:
Geography, Computer Science, Multimedia, and Multimedia information retrieval

Research Interests:
Computer Science, Visualization, Metadata, World Wide Web, and Semantic Metadata Extraction

Research Interests:
Temporal Information Extraction

Research Interests:
History, Computer Science, Humanities, and Procesamiento del Lenguaje Natural

Research Interests:
Humanities and Art

Research Interests:
Inteligencia artificial

Research Interests:
Languages, Computer Science, Traffic Management, Prolog, and FIPA

Research Interests:
Biomedical, Unified Medical Language System, and Named Entity Recognition (NER)

Research Interests:
Traffic Management and Support System

Research Interests:
Inteligencia artificial

Research Interests:
Computer Science, Comparative, Multilingual Information Retrieval, and CLEF

Research Interests:
Humanities Computing (Digital Humanities), Digital Humanities, and Humanidades Digitales