Home | Publications (intern) | Publications (DBLP) | Research|  Intern  
Research
 
 The SphereSearch Engine





SphereSearch

The SphereSearch Engine provides unified ranked retrieval on heterogeneous XML and Web data. Its search capabilities include vague structure and text content conditions, and relevance ranking based on IR statistics and a graph-based data model. Web pages in HTML or PDF are automatically converted into an intermediate XML format, with the option of generating semantictags by means of linguistic annotation tools. For semi-structured data the graphbased query engine is leveraged to provide very rich search options that cannot be expressed in traditional Web or XML search engines: concept-aware and linkaware querying that takes into account the implicit structure and context of Web pages.

 


webcounter |