Useful negative statements

Knowledge bases (KBs) about notable entities and their properties are an important asset in applications such as search, question answering and dialogue. All popular KBs capture virtually only positive statements, and abstain from taking any stance on statements not stored in the KB. In this work, we make the case for explicitly stating salient statements that do not hold. Negative statements are useful to overcome limitations of question answering, and can often contribute to informative summaries of entities. Due to the abundance of such invalid statements, any effort to compile them needs to address ranking by saliency.

Publications

  • Enriching knowledge bases with interesting negative statements. - AKBC 2020
    Hiba Arnaout, Simon Razniewski, Gerhard Weikum - [PDF], [LINK]
  • Negative statements considered useful. - arXiv 2020
    Hiba Arnaout, Simon Razniewski, Gerhard Weikum - [PDF], [LINK]

Datasets

Wikidata

  •    1.4M useful negative statements about the 130K most popular people, organizations, and literary works - [HERE] *
    [Format: S_ID[tab]S_LABEL[tab]P_ID[tab]P_LABEL[tab]O_ID[tab]O_LABEL[tab]SCORE]
    Methodology used: peer-based inference (similarity-based) method.
  •    12.5M useful negative statements about 545K entities of various types (mostly people, literary works, and sports events) - [HERE] **
    [Format: ORDERED_GROUP_ID[tab]ORDERED_GROUP_LABEL[tab]S_ID[tab]S_LABEL[tab]RANK[tab]P_ID;O_ID[tab]P_LABEL;O_LABEL[tab]SCORE[tab]EXPLANATION]
    Methodology used: order-oriented inference method.
  •    40K ordered peer sets - [HERE]
    [Format: ORDERED_GROUP_ID[tab]ORDERED_GROUP_LABEL[tab]ENTITIES_IN_GROUP_i[tab]ENTITIES_IN_GROUP_i_LABEL]
    ENTITIES_IN_GROUP_i = set_1|set_2|etc = e1;e2;e3|e4|e5;e6|etc
    Description: order-oriented peer groups created as inputs for dataset **.
  •    6.2K useful negative statements about the 2.4K most popular people - [HERE]
    [Format: S_ID[tab]S_LABEL[tab]NEG_STATEMENT]
    Methodology used: pattern-based query log extraction method.
  •    1K MTurk-annotated negative statements - [HERE]
    [Format: ROW_ID[tab]S_ID[tab]S_LABEL[tab]S_TYPE[tab]CORRECTNESS_LABEL], retrieved from dataset *.
    Correctness labels: correct, incorrect, not sure.
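
The tab-separated formats listed above can be read with a few lines of Python. The sketch below covers the two order-oriented files, whose fields nest a second level of delimiters. It assumes one record per line with no header row, that the label field of the peer-set file uses the same `|` and `;` delimiters as the ID field, and that labels contain no `;` or `|` themselves. The function names and the sample rows are hypothetical placeholders, not real data from the files.

```python
def parse_ordered_sets(field: str) -> list[list[str]]:
    """Split 'e1;e2;e3|e4|e5;e6' into ordered sets: '|' separates sets, ';' separates entities."""
    return [group.split(";") for group in field.split("|")]


def parse_peer_group_line(line: str) -> dict:
    """Parse one row of the ordered-peer-sets file (four tab-separated fields)."""
    group_id, group_label, entity_ids, entity_labels = line.rstrip("\n").split("\t")
    return {
        "group_id": group_id,
        "group_label": group_label,
        "entity_ids": parse_ordered_sets(entity_ids),
        # Assumption: labels mirror the delimiter structure of the IDs.
        "entity_labels": parse_ordered_sets(entity_labels),
    }


def parse_order_statement_line(line: str) -> dict:
    """Parse one row of the order-oriented negative-statement file (nine fields)."""
    (group_id, group_label, s_id, s_label, rank,
     pred_obj_ids, pred_obj_labels, score, explanation) = line.rstrip("\n").split("\t")
    # 'P_ID;O_ID' and 'P_LABEL;O_LABEL' are split on the first ';' only (an assumption).
    p_id, o_id = pred_obj_ids.split(";", 1)
    p_label, o_label = pred_obj_labels.split(";", 1)
    return {
        "group_id": group_id, "group_label": group_label,
        "s_id": s_id, "s_label": s_label,
        "rank": rank,    # kept as a string; the numeric type is not documented
        "p_id": p_id, "o_id": o_id,
        "p_label": p_label, "o_label": o_label,
        "score": score,  # kept as a string for the same reason
        "explanation": explanation,
    }


# Invented placeholder rows, only to show the shape of the parsed result.
sample_group = "G1\tplaceholder group\tE1;E2;E3|E4|E5;E6\tA;B;C|D|E;F"
sample_stmt = ("G1\tplaceholder group\tE4\tD\t1\tP1;O1\t"
               "some property;some value\t0.5\tplaceholder explanation")

group = parse_peer_group_line(sample_group)
statement = parse_order_statement_line(sample_stmt)
```

Rank and score are left as strings because their exact numeric format is not specified here; convert them once you have inspected the actual files.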

Find useful negations in your datasets

We have made our peer-based inference method publicly available so that users can try it on their own tabular datasets.

 

Visit our GitHub repository!

 

We provide three sample datasets with their useful negations: Turing Award winners, U.S. presidents, and hotels in India.
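
To give a rough feel for the idea behind peer-based inference on a small table, here is a deliberately simplified sketch. It is not the code from the repository: it assumes the peers of an entity are already given, and it scores each candidate negation simply by the fraction of peers that have the statement while the target entity lacks it. The function name `candidate_negations` and the toy hotel data are invented for illustration. Because a table may be incomplete, the output should be read as candidates rather than confirmed negative facts.

```python
from collections import Counter


def candidate_negations(target, peers, statements):
    """Rank statements that hold for peers but are absent for the target.

    statements maps each entity to a set of (property, value) pairs.
    The score is the fraction of peers having the statement, which is a
    simplifying assumption made for this sketch.
    """
    if not peers:
        return []
    known = statements.get(target, set())
    counts = Counter()
    for peer in peers:
        # Count every peer statement that the target does not have.
        for stmt in statements.get(peer, set()) - known:
            counts[stmt] += 1
    scored = [(stmt, count / len(peers)) for stmt, count in counts.items()]
    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(scored, key=lambda item: (-item[1], item[0]))


# Toy data, invented for illustration only.
toy_statements = {
    "hotel_a": {("amenity", "pool")},
    "hotel_b": {("amenity", "pool"), ("amenity", "wifi")},
    "hotel_c": {("amenity", "wifi"), ("amenity", "parking")},
}

ranked = candidate_negations("hotel_a", ["hotel_b", "hotel_c"], toy_statements)
for (prop, value), score in ranked:
    print(f"hotel_a: NOT ({prop}, {value})  score={score:.2f}")
```

In this toy table, both peers have wifi and only one has parking, so "no wifi" ranks above "no parking" for hotel_a.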

Demo - Wikinegata

 

A web interface for discovering interesting negations about Wikidata entities.
COMING SOON!

For the latest news and fun negations, follow us on Twitter!