| The
SphereSearch Engine
SphereSearch
The SphereSearch Engine provides unified
ranked retrieval on heterogeneous XML and Web data. Its search
capabilities include vague structure and text content conditions,
and relevance ranking based on IR statistics and a graph-based data
model. Web pages in HTML or PDF are automatically converted into an
intermediate XML format, with the option of generating semantictags
by means of linguistic annotation tools. For semi-structured data
the graphbased query engine is leveraged to provide very rich search
options that cannot be expressed in traditional Web or XML search
engines: concept-aware and linkaware querying that takes into
account the implicit structure and context of Web pages.
|