We describe our work on information extraction in medical documents written in German. As a baseline, we employ a version of the NegEx regular expression algorithm with a large set of triggers. We show that a significantly smaller trigger set is sufficient to achieve similar results, which reduces adaptation times to new text types. We elaborate on the question of whether dependency parsing (based on the Stanford CoreNLP model) is a good alternative, and describe the potentials and shortcomings of both approaches.
Author(s) : Hans-Jürgen Profitlich, Daniel Sonntag
Keywords : describe - dependency - expression - parsing - sufficient
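
To illustrate the kind of trigger-based matching the abstract refers to, here is a minimal Python sketch in the spirit of NegEx. It is not the authors' implementation. The trigger lists, the terminator words, the window size `WINDOW`, and the function names `tokenize`, `find_spans`, and `is_negated` are all hypothetical and chosen only for illustration. A concept counts as negated when a pre-trigger appears within a few tokens before it, or a post-trigger within a few tokens after it, with no terminator word in between.

```python
import re

# Hypothetical, deliberately small German trigger set (NOT the authors' list).
PRE_TRIGGERS = [["kein"], ["keine"], ["keinen"], ["ohne"]]
POST_TRIGGERS = [["ausgeschlossen"], ["nicht", "nachweisbar"]]
# Hypothetical words that end the scope of a trigger.
TERMINATORS = {"aber", "jedoch"}
# Assumed scope: number of tokens inspected on each side of the concept.
WINDOW = 5


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def find_spans(tokens, phrase):
    """Return the start indices at which the token list `phrase` occurs."""
    n = len(phrase)
    return [i for i in range(len(tokens) - n + 1) if tokens[i:i + n] == phrase]


def is_negated(sentence, concept):
    """Return True if any mention of `concept` lies in a trigger's scope."""
    tokens = tokenize(sentence)
    concept_tokens = tokenize(concept)
    for start in find_spans(tokens, concept_tokens):
        end = start + len(concept_tokens)

        # Left context, cut off at the nearest terminator before the concept.
        left = tokens[max(0, start - WINDOW):start]
        for i in range(len(left) - 1, -1, -1):
            if left[i] in TERMINATORS:
                left = left[i + 1:]
                break

        # Right context, cut off at the first terminator after the concept.
        right = tokens[end:end + WINDOW]
        for i, tok in enumerate(right):
            if tok in TERMINATORS:
                right = right[:i]
                break

        if any(find_spans(left, t) for t in PRE_TRIGGERS):
            return True
        if any(find_spans(right, t) for t in POST_TRIGGERS):
            return True
    return False
```

For example, `is_negated("Kein Fieber, aber Husten.", "Fieber")` finds the pre-trigger "kein" directly before the concept, while the same sentence with the concept "Husten" is not flagged because "aber" ends the trigger's scope. A fixed token window like this cannot follow syntactic structure, which is one reason a dependency-based alternative is worth examining.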