Brief Description: AKTive Media is an ontology-based cross-media (images and text)
annotation system. Our goal is to automate the process of annotation by
suggesting knowledge to the user in an interactive way while the user
is annotating, thereby minimizing user effort. The system actively
works in the background, interacting with web services and querying our
central annotation store to look for context-specific knowledge.
License Types: Academic Free License (AFL), Educational Community
License, GNU General Public License (GPL)
Name: Runes
Brief Description: Runes is a plugin-based framework that reverses the way of thinking
about data processing: specify data accesses, and Runes will
automatically choose an efficient representation for the data and
execute the plugins to integrate it from different sources.
Brief Description: Saxon is a rule-based tool for annotating documents. Saxon rules are
regular expressions over document features, such as tokens and
sentences. The output of a rule can be either other annotations or
unrestricted actions specified within the rule.
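The idea of a regular expression over token features can be illustrated with a minimal Python sketch. It is not Saxon's rule syntax: the `<TAG>` encoding, the `annotate` helper and the sample tokens are assumptions made for illustration only.

```python
import re

# Hypothetical token features: (text, part-of-speech tag).
TOKENS = [("The", "DT"), ("quick", "JJ"), ("brown", "JJ"), ("fox", "NN"),
          ("jumps", "VBZ"), ("over", "IN"), ("the", "DT"), ("lazy", "JJ"),
          ("dog", "NN")]


def annotate(tokens, rule, label):
    """Match a regex against the tag sequence; return (label, start, end, text)."""
    encoded = ""
    boundary = {}  # character offset in the encoded string -> token index
    for i, (_, tag) in enumerate(tokens):
        boundary[len(encoded)] = i
        encoded += f"<{tag}>"  # each token becomes one "<TAG>" unit
    boundary[len(encoded)] = len(tokens)

    annotations = []
    for m in re.finditer(rule, encoded):
        # Skip empty matches and matches that do not align with token boundaries.
        if m.start() == m.end() or m.start() not in boundary or m.end() not in boundary:
            continue
        begin, end = boundary[m.start()], boundary[m.end()]
        text = " ".join(word for word, _ in tokens[begin:end])
        annotations.append((label, begin, end, text))
    return annotations


# Rule: optional determiner, any number of adjectives, then a noun.
NP_RULE = r"(?:<DT>)?(?:<JJ>)*<NN>"
noun_phrases = annotate(TOKENS, NP_RULE, "NounPhrase")
```

Here the rule output is a new annotation over a token span; a rule that triggers an arbitrary action could call a function at the point where the annotation is appended.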
Brief Description: jCC is a Java-based open source library that provides a collection of
general-purpose algorithms for clustering. It can be applied to
clustering tasks that work with text, images, and/or raw data.
Brief Description: T-Rex (Trainable Relation Extraction) is a highly configurable,
machine learning-based framework for information extraction from text.
It includes tools for supervised and semi-supervised document
classification, entity extraction and relation extraction.
Brief Description: The multimedia information retrieval SDK is intended to help IR
researchers and application developers. It offers applications new
indexing and search functionalities, algorithms to create multimedia
models, and an extensible application framework for new algorithms.
Brief Description: Based on prefuse, this tool currently has two visualisation options: a
(rooted) network and a tree graph. The semantic network uses icons
and/or text to summarise information of interest in (leaf) nodes, with
the ability to retrieve the source documents/information on provenance.
Colour and size of nodes can also be used to present information on
confidence (e.g., the strength of the relationships between parent and
child nodes or other types of linked nodes, or the
confidence that the information retrieved does represent the specific
assertion being made). Basic AND queries may also be built to highlight
nodes of interest, based on properties defined for each node.
The tree graph is being used to display (sub-parts of) ontologies.
License Types: open source, uses the (also open source) prefuse libraries http://prefuse.org
Open University
Name: Semsearch
Brief Description: Semsearch is a keyword-based semantic search engine for semantic web
portals. It aims to hide the complexity of semantic search and make it
suitable for naive users who are not necessarily familiar with the
problem domain or with the specified query language.
Brief Description: Relation extraction based on Web-search and pattern matching. Can be
configured for various sources (Google, Wikipedia) and various
learning algorithms.
License Type: to be defined in accordance with other X-Media tools.
Name: AccessKnow
Brief Description: This API features store-independent access to knowledge. It offers an
abstraction over RDF and OWL knowledge bases and supports entity- and
axiom-based access, manipulation and storage functionalities. Access to
a particular store is enabled by an adapter implementation.
Currently, adapters for KAON2 and Sesame2 are available.
License Type: OPEN SOURCE; specific to be defined in accordance with other X-Media tools.
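The adapter idea behind AccessKnow can be illustrated with a minimal Python sketch. It does not reproduce the AccessKnow API: `StoreAdapter`, `InMemoryAdapter` and `KnowledgeAccess` are hypothetical names, and the in-memory set stands in for a real store such as KAON2 or Sesame2.

```python
from abc import ABC, abstractmethod


class StoreAdapter(ABC):
    """Hypothetical adapter interface: one implementation per backing store."""

    @abstractmethod
    def add(self, subject, predicate, obj):
        """Store one statement."""

    @abstractmethod
    def match(self, subject=None, predicate=None, obj=None):
        """Return stored statements; None acts as a wildcard."""


class InMemoryAdapter(StoreAdapter):
    """Stand-in for a real store adapter (assumption for illustration)."""

    def __init__(self):
        self._triples = set()

    def add(self, subject, predicate, obj):
        self._triples.add((subject, predicate, obj))

    def match(self, subject=None, predicate=None, obj=None):
        wanted = (subject, predicate, obj)
        return sorted(
            t for t in self._triples
            if all(w is None or w == v for w, v in zip(wanted, t))
        )


class KnowledgeAccess:
    """Entity-level access that only ever talks to the adapter interface."""

    def __init__(self, adapter):
        self._adapter = adapter

    def add_entity(self, entity, cls):
        self._adapter.add(entity, "rdf:type", cls)

    def entities_of(self, cls):
        return [s for s, _, _ in self._adapter.match(predicate="rdf:type", obj=cls)]


access = KnowledgeAccess(InMemoryAdapter())
access.add_entity("ex:Sesame2", "ex:Store")
access.add_entity("ex:KAON2", "ex:Store")
stores = access.entities_of("ex:Store")
```

Because the calling code depends only on the adapter interface, supporting another store means writing one more adapter class and leaving the rest unchanged.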
Name: xXploreKnow
Brief Description: It is an ontology-based application that supports exploration, search
and analysis of semantic data. These semantic data are elements of
RDF and OWL ontologies, which are manually added to the system using an
annotation tool and automatically extracted from documents of various
formats and web pages. For search, a combination of semantic search
and classical keyword-based search on documents is supported. This
means that the user can retrieve both semantic data and documents.
License Type: OPEN SOURCE; specific to be defined in accordance with other X-Media tools.
QUINARY
Name: AccessKnow Sesame2 adaptor (connector)
Brief Description: Implementation of an adaptor (connector) supporting Sesame2 as a
backing store for AccessKnow.
Web page: same as AccessKnow
License Type: same as AccessKnow
LJUBLJANA
Name: Pade
Our tool Pade is already released as open source within the collection
of tools called Orange (www.ailab.si/orange). As mentioned at the web site:
Brief Description: "Orange is a component-based data mining software. It includes a range
of preprocessing, modelling and data exploration techniques. It is based
on C++ components, that are accessed either directly (not very common),
through Python scripts (easier and better), or through GUI objects
called Orange Widgets."
License Type: Orange is free; it can be used and changed under the terms of the GNU GPL licence.
KOBLENZ
Name: LENA - a Fresnel LEns based RDF NAvigator with SPARQL selector support
Brief Description: LENA stands for LEns based NAvigator. A lens
represents a particular view onto RDF data and is described by the
Fresnel Display Vocabulary. LENA enables viewing RDF data in your web
browser, rendered according to the lens descriptions you provide. LENA
supports the use of multiple lenses and indicates if they are
available for a resource, so that a different view onto the same data
is always just one click away!
Name: X-Email: X-COSIM Email for the Semantic Desktop
Brief Description: X-Email is implemented as a plugin for the
Thunderbird email client. X-Email makes email data available for reuse
in further desktop applications based on the X-COSIM framework.
Name: X-COSIM: A Framework for Cross-Context Semantic Information Management
Brief Description: X-COSIM stands for Cross(X)-COntext Semantic
Information Management. X-COSIM consists of an ontology named X-COSIMO
and an application programming interface (API) named X-COSIMA.
Brief Description: Semantic descriptions of non-textual media
available on the web can be used to facilitate retrieval and
presentation of media assets and documents containing them. While
technologies for multimedia semantic descriptions already exist, there
is as yet no formal description of a high quality multimedia ontology
that is compatible with existing (semantic) web technologies. We
propose COMM - A Core Ontology for Multimedia based on both the MPEG-7
standard and the DOLCE foundational ontology.
Name: MetaK - Meta Knowledge Management Framework for Semantic Web Applications
Brief Description: The Semantic Web is based on accessing and reusing RDF data from many
different sources, to which one may assign different levels of authority
and credibility. Existing Semantic Web query languages, like SPARQL,
have targeted the retrieval, combination and reuse of facts, but have
so far ignored all aspects of meta knowledge, such as origins,
authorship, recency or certainty of data, to name but a few.
We propose an original, generic, formalized and implemented approach
for managing many dimensions of meta knowledge, like source,
authorship, certainty and others. The approach re-uses existing RDF
modeling possibilities in order to represent meta knowledge. Then, it
extends SPARQL query processing in such a way that given a SPARQL
query for data, one may request meta knowledge without modifying the
query proper. Thus, our approach achieves highly flexible and
automatically coordinated querying for data and meta knowledge, while
completely separating the two areas of concern.
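The separation of data queries from meta knowledge can be illustrated with a minimal Python sketch. It is not MetaK and not SPARQL: the named-graph representation, the `evaluate` function, the `with_meta` flag and all sample data are assumptions chosen for illustration.

```python
# Each statement lives in a named graph (fourth element); meta knowledge is
# attached to the graph, so the statements themselves stay unchanged.
QUADS = [
    ("ex:Paris", "ex:capitalOf", "ex:France", "g1"),
    ("ex:Lyon", "ex:capitalOf", "ex:France", "g2"),
    ("ex:Berlin", "ex:capitalOf", "ex:Germany", "g1"),
]
META = {
    "g1": {"source": "ex:atlas", "certainty": 0.9},
    "g2": {"source": "ex:forum", "certainty": 0.2},
}


def evaluate(pattern, with_meta=False):
    """Match a (subject, predicate, object) pattern; terms starting with '?' are variables.

    The pattern is identical whether or not meta knowledge is requested.
    """
    results = []
    for s, p, o, graph in QUADS:
        binding = {}
        for term, value in zip(pattern, (s, p, o)):
            if term.startswith("?"):
                # A repeated variable must bind to the same value each time.
                if binding.setdefault(term, value) != value:
                    break
            elif term != value:
                break
        else:
            results.append((binding, META[graph]) if with_meta else binding)
    return results


query = ("?city", "ex:capitalOf", "ex:France")
data_only = evaluate(query)
data_and_meta = evaluate(query, with_meta=True)
```

The same `query` tuple is used for both calls; only the evaluation option changes, which mirrors the idea of requesting meta knowledge without modifying the query proper.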
Brief Description: jLSI (java Latent Semantic Indexing) is an open source Java tool for
Latent Semantic Indexing. jLSI requires only shallow linguistic
processing, such as tokenization, sentence splitting, part-of-speech
tagging (optional) and lemmatization (optional).
Brief Description: jWeb1T is an open source Java tool for efficiently searching the Web
1T 5-gram corpus. It is based on a binary search algorithm that finds
the n-grams and returns their frequency counts in logarithmic time. As
the corpus is stored in many files, a simple index is used to retrieve
the files containing the n-grams. The corpus must be installed and
uncompressed on a hard drive (approx. 60 GB).
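The two-level lookup described above (an index to pick the file, then a binary search inside it) can be illustrated with a minimal Python sketch. It is not the jWeb1T code: the in-memory lists stand in for sorted corpus files, and the sample n-grams, counts and the `lookup` function are invented for illustration.

```python
from bisect import bisect_right

# Stand-ins for corpus files: each "file" is a list of (ngram, count)
# records sorted by n-gram, and the files themselves are in sorted order.
FILES = [
    [("a big dog", 120), ("a big house", 340), ("a small cat", 95)],
    [("the big dog", 410), ("the lazy dog", 77)],
    [("your big dog", 58), ("zebra crossing ahead", 44)],
]

# Simple index: the first n-gram of every file.
INDEX = [records[0][0] for records in FILES]


def lookup(ngram):
    """Return the frequency count of an n-gram, or None if it is absent."""
    # Step 1: the index selects the only file that can contain the n-gram.
    file_no = bisect_right(INDEX, ngram) - 1
    if file_no < 0:
        return None
    records = FILES[file_no]

    # Step 2: binary search inside that file (logarithmic in its size).
    lo, hi = 0, len(records)
    while lo < hi:
        mid = (lo + hi) // 2
        if records[mid][0] < ngram:
            lo = mid + 1
        else:
            hi = mid
    if lo < len(records) and records[lo][0] == ngram:
        return records[lo][1]
    return None


count = lookup("the lazy dog")
```

On disk, the inner search would seek within a sorted file rather than index a list, but the comparison logic would be the same.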
Brief Description: jSRE is a Java tool for Relation Extraction. It is based on a
supervised machine learning approach which is applicable even when
(deep) linguistic processing is not available or reliable. In
particular, jSRE uses a combination of kernel functions to integrate
two different information sources: (i) the whole sentence where the
relation appears, and (ii) the local contexts around the interacting
entities. jSRE requires only shallow linguistic processing, such as
tokenization, sentence splitting, Part-of-Speech (PoS) tagging and
lemmatization.
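Combining a whole-sentence kernel with local-context kernels can be illustrated with a minimal Python sketch. It is a simplification, not jSRE's actual kernels: the cosine-normalised bag-of-words kernel, the fixed window size, the plain sum, and the example dictionaries with keys `tokens`, `e1` and `e2` are all assumptions.

```python
from collections import Counter
from math import sqrt


def bow_kernel(a, b):
    """Cosine-normalised bag-of-words kernel between two token lists."""
    ca, cb = Counter(a), Counter(b)
    dot = sum(ca[w] * cb[w] for w in ca)
    norm = sqrt(sum(v * v for v in ca.values())) * sqrt(sum(v * v for v in cb.values()))
    return dot / norm if norm else 0.0


def local_context(tokens, entity_index, window=1):
    """Tokens within `window` positions of an entity mention (inclusive)."""
    start = max(0, entity_index - window)
    return tokens[start:entity_index + window + 1]


def combined_kernel(x, y, window=1):
    """Sum of a whole-sentence kernel and one local kernel per entity."""
    k_global = bow_kernel(x["tokens"], y["tokens"])
    k_local = sum(
        bow_kernel(local_context(x["tokens"], x[e], window),
                   local_context(y["tokens"], y[e], window))
        for e in ("e1", "e2")
    )
    return k_global + k_local


# Hypothetical examples: token lists plus the positions of the two entities.
ex1 = {"tokens": "ProteinA binds to ProteinB in yeast".split(), "e1": 0, "e2": 3}
ex2 = {"tokens": "ProteinC binds to ProteinD in mice".split(), "e1": 0, "e2": 3}
similarity = combined_kernel(ex1, ex2)
```

Since a sum of valid kernels is itself a valid kernel, the combined function can be passed to any kernel-based learner, such as a support vector machine with a precomputed kernel matrix.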
Brief Description: RelEns is a Java tool for classifying instances by using the relations
to other instances (i.e. the classes of neighbors) and combining
results of different relations with ensemble classification
methods. The instances could be time series or of any other type.
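Classifying from neighbour classes and combining relations can be illustrated with a minimal Python sketch. It is not the RelEns implementation: majority voting stands in for whichever ensemble methods the tool offers, and the relation names, labels and helper functions are invented for illustration.

```python
from collections import Counter

# Known classes of already labelled instances (hypothetical data).
LABELS = {"a": "spam", "b": "spam", "c": "ham", "d": "ham"}

# One neighbour map per relation: instance -> related instances.
RELATIONS = {
    "same_sender": {"x": ["a", "b"]},
    "same_thread": {"x": ["a", "c", "b"]},
    "same_day": {"x": ["c", "d"]},
}


def vote_by_relation(instance, neighbours, labels):
    """Predict the majority class among an instance's labelled neighbours."""
    counts = Counter(labels[n] for n in neighbours.get(instance, []) if n in labels)
    return counts.most_common(1)[0][0] if counts else None


def classify(instance, relations, labels):
    """Combine the per-relation predictions by majority vote."""
    votes = [vote_by_relation(instance, nb, labels) for nb in relations.values()]
    votes = [v for v in votes if v is not None]
    return Counter(votes).most_common(1)[0][0] if votes else None


prediction = classify("x", RELATIONS, LABELS)
```

In this example two relations vote "spam" and one votes "ham", so the combined prediction for `x` is "spam"; an instance with no labelled neighbours under any relation yields `None`.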