From d90d99dd9786b4d1d6e92a4331580b208a914692 Mon Sep 17 00:00:00 2001
From: manetta
Date: Fri, 19 Apr 2019 10:02:00 +0200
Subject: [PATCH] added the download instruction for the nltk tokenizer

---
 cross-reader.tfidf/README.md | 6 ++++++
 1 file changed, 6 insertions(+)

diff --git a/cross-reader.tfidf/README.md b/cross-reader.tfidf/README.md
index 6b11e5f..a85965c 100644
--- a/cross-reader.tfidf/README.md
+++ b/cross-reader.tfidf/README.md
@@ -14,6 +14,12 @@
 
     $ pip3 install nltk
 
+You also need to download an NLTK package for the tokenizer, which is used to split sentences into lists of words.
+
+    $ python3
+
+    >>> import nltk
+    >>> nltk.download('punkt')
 
 ## Start
 