Text2Onto

From NeOn Wiki

Jump to: navigation, search

Text2Onto

by Johanna Völker

Screenshot
Screenshot

Developed by Johanna Völker
Last Update 11.11.2007
Current Version

1.0.0

Homepage [http://ontoware.org/projects/text2onto/ Text2Onto Website]
Activity
License LGPL
Affiliation

Text2Onto supports the automatic or semi-automatic generation of ontologies from natural language text.

Contents

Versions & Compatibility

This section shows available versions of Text2Onto and for each version its compatible application versions and necessary plugins.

Version compatible with necessary plugins
Text2Onto 1.0.0 NeOn Toolkit 1.1


Changelog

Version Changes in this version
Text2Onto 1.0.0 see page for more details


Description

Text2Onto is an ontology learning framework which has been developed to support the acquisition of ontologies from textual documents. Like its predecessor, TextToOnto, it provides an extensible set of methods for learning atomic classes, class subsumption and instantiation as well as object properties and disjointness axioms.

Installation

  • Unzip org.neon.toolkit.text2onto_1.0.0.jar into your Eclipse plugin directory (e.g. <T2O-DIR>=c:\Eclipse\plugins\org.neon.toolkit.text2onto_1.0.0)
  • Edit <T2O-DIR>\lib\jwnl\file_properties.xml and replace <WN-DIR>
   <param name="file_manager" value="net.didion.jwnl.dictionary.file_manager.FileManagerImpl">
       <param name="file_type" value="net.didion.jwnl.princeton.file.PrincetonRandomAccessDictionaryFile"/>
       <param name="dictionary_path" value="<WN-DIR>\dict"/>
   </param>
  • Start NeOn Toolkit (startup.jar) and open Text2Onto perspective
  • Set the preferences as described above
  • Initialize the algorithm controller by selecting the New item from the Text2Onto menu or toolbar

User Documentation

Technical reports, papers, presentations and demo videos for the standard version of Text2Onto are available from http://www.aifb.uni-karlsruhe.de/WBS/jvo/text2onto/. Detailed information with regards to this plugin can be found in NeOn D3.8.1.

The graphical user interface of the plugin is very similar to the original Swing-based GUI of Text2Onto. It is composed of different views for the configuration of the ontology learning process and the presentation of the results.

Workflow view

The upper left corner contains the workflow view, which is used to set up the ontology learning workflow. By right-clicking on the individual ontology learning tasks (e.g. "Concept" for concept extraction), the user can select one or more methods for each type of ontology element she wants to extract from the corpus.

Corpus view

In the bottom left corner, the user will find a corpus view, which allows her to set up a corpus, that is a collection of text documents from which the ontology will be generated. The doc view (see hidden tab on the right) is used to display previews of selected documents. Text2Onto is able to analyse documents in plain text, PDF (Windows only) and HTML format. However, a manual conversion into purely textual format is highly recommended for efficiency reasons.

POM view

The POM view on the right shows the results of the most recently initiated ontology learning process. The view contains several tabs -- one for each type of ontology element that was extracted from the corpus -- showing a tabular listing of individual results. By clicking on the column headers the user can sort the ontology elements according to their associated labels or confidence values.

Preferences

The preference page, which is accessible from the main menu of on the top of the Text2Onto perspective ("Window" -> "Preferences..." -> "Text2Onto Preferences") replaces the original configuration file of Text2Onto's API. It allows for setting the following parameters:

  • Language: The language of the documents to be analysed. Text2Onto provides full support for learning ontologies from English and Spanish corpora as well as partial support for ontology extraction from German texts. For details with respect to the Spanish version of Text2Onto please refer to SEKT D3.3.3.
  • Normalization: If this parameter is selected Text2Onto will normalize all confidence values to an interval of 0.0 to 1.0.
  • Default corpus: The default directory for populating the ontology learning corpus.
Facts about Text2OntoRDF feed
Affiliated to AIFB  +
Current version

1.0.0

  [[Special:SearchByProperty/Current version::

1.0.0

|+]]
Description Text2Onto supports the automatic or semi-automatic generation of ontologies from natural language text.  +
Developed by Johanna Völker  +
Has topic Ontology Learning  +
Homepage http://ontoware.org/projects/text2onto/  +
Last update 11 November 2007  +
License LGPL  +
Name Text2Onto  +
Screenshot text2onto.png  +