Callum Booth Undergraduate Dissertation 2017/18
Triple Scoring: Scoring and ranking the truth of factual triples
Supervised by A. Vlachos
Abstract
Many of the statements we consider facts boil down to "x is y". Facts such as "the sky is blue", "space is cold", and "the universe is huge" all follow the same format: subject-relation-object. This is advantageous because it lets us represent facts in a formal, structured way that a computer can parse. This project aims to create a model that assigns a numeric truth score to a type-like relation triple. These triples take the form (subject, relation, object), and each represents a fact. The triple scoring model is designed for the WSDM Cup 2017 Triple Scoring task. It draws on large corpora and applies natural language processing and information retrieval techniques to corroborate and rank facts, and it achieves 8th place out of a pool of 21 solutions.
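As an illustration of the (subject, relation, object) representation only, the following minimal Python sketch stores the example facts above as triples and ranks them by a numeric score. The names `Triple` and `rank_triples` are hypothetical, and the scores are made-up placeholders rather than outputs of the model described in this dissertation.

```python
from typing import List, NamedTuple, Tuple


class Triple(NamedTuple):
    """A fact in (subject, relation, object) form. Hypothetical helper type."""
    subject: str
    relation: str
    obj: str


def rank_triples(scored: List[Tuple[Triple, float]]) -> List[Tuple[Triple, float]]:
    """Return (triple, score) pairs ordered from highest to lowest score."""
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Placeholder scores chosen for illustration; a real model would compute them.
scored = [
    (Triple("space", "is", "cold"), 0.6),
    (Triple("the sky", "is", "blue"), 0.9),
    (Triple("the universe", "is", "huge"), 0.8),
]

ranked = rank_triples(scored)
for triple, score in ranked:
    print(f"{score:.1f}  ({triple.subject}, {triple.relation}, {triple.obj})")
```

Keeping the triple and its score as separate values means the same triple can be rescored by different methods without changing how facts are stored.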