Extending Full Text Search for Legal Document Collections using Word Embeddings
Abstract
Traditional full text search allows fast search for exact matches. However, it is poorly suited to handling synonyms or semantically related terms and phrases. In this paper we explore a novel method that finds not only exact matches but also semantically similar passages for search queries of arbitrary length. We achieve this without relying on ontologies, basing our approach instead on word embeddings, which have recently been applied successfully to many natural language processing tasks. We argue that our method is well suited for legal document collections and examine its applicability in two use cases. First, we conduct a case study on a stand-alone law, the EU Data Protection Directive 95/46/EC (EU-DPD), in order to extract obligations. Second, we retrieve similar provisions from a collection of publicly available templates for German rental contracts.
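The abstract does not spell out the mechanism, so the following is only a minimal sketch of how a search of this kind could work. It assumes that a phrase is represented by the sum of its word vectors and that the query is compared by cosine similarity against every window of the same length in the document. The toy vectors and the names `TOY_EMBEDDINGS`, `phrase_vector`, `cosine`, and `search` are hypothetical and are not taken from the paper.

```python
import math

# Toy 4-dimensional word vectors, invented purely for illustration.
# A real system would load embeddings trained on a corpus instead.
DIM = 4
TOY_EMBEDDINGS = {
    "tenant":   [1.0, 0.0, 0.0, 0.0],
    "renter":   [0.9, 0.1, 0.0, 0.0],
    "pays":     [0.0, 1.0, 0.0, 0.0],
    "owes":     [0.1, 0.9, 0.0, 0.0],
    "rent":     [0.0, 0.0, 1.0, 0.0],
    "landlord": [0.0, 0.0, 0.2, 0.9],
    "repairs":  [0.0, 0.1, 0.0, 0.9],
    "roof":     [0.0, 0.0, 0.0, 1.0],
    "monthly":  [0.0, 0.2, 0.3, 0.5],
}


def phrase_vector(tokens):
    """Sum the vectors of all tokens; unknown tokens contribute nothing."""
    vec = [0.0] * DIM
    for tok in tokens:
        for i, x in enumerate(TOY_EMBEDDINGS.get(tok, [0.0] * DIM)):
            vec[i] += x
    return vec


def cosine(a, b):
    """Cosine similarity; defined as 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query, document, top_k=3):
    """Score every document window of the query's length against the query."""
    q_tokens = query.lower().split()
    d_tokens = document.lower().split()
    q_vec = phrase_vector(q_tokens)
    n = len(q_tokens)
    scored = []
    for start in range(len(d_tokens) - n + 1):
        window = d_tokens[start:start + n]
        scored.append((cosine(q_vec, phrase_vector(window)), " ".join(window)))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


hits = search(
    "tenant pays rent",
    "the landlord repairs the roof and the renter owes rent monthly",
)
for score, passage in hits:
    print(f"{score:.3f}  {passage}")
```

With these toy vectors, the query "tenant pays rent" has no exact match in the document, yet the window "renter owes rent" ranks first because its summed vector points in the same direction as the query's.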
Attribute | Value |
---|---|
Address | Sophia Antipolis, France |
Authors | Dr. Jörg Landthaler, Dr. Bernhard Waltl, Patrick Holl, Prof. Dr. Florian Matthes |
Citation | Landthaler, J.; Waltl, B.; Holl, P.; Matthes, F.: Extending Full Text Search for Legal Document Collections using Word Embeddings, Jurix: International Conference on Legal Knowledge and Information Systems, Sophia Antipolis, France, 2016 |
Key | La16c |
Title | Extending Full Text Search for Legal Document Collections using Word Embeddings |
Type of publication | Conference |
Year | 2016 |
Acronym | Jurix 2016 |