a flask exercise and search machine prototype
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
mb@mb aa9351ac13 readme edit to pre wrap the cli commands 6 years ago
static first commit 6 years ago
templates first commit 6 years ago
.gitignore first commit 6 years ago
README.md readme edit to pre wrap the cli commands 6 years ago
index.json first commit 6 years ago
start.py first commit 6 years ago
tfidf.py first commit 6 years ago

README.md

Grrrrrrrrrrls - search machine (prototype)

A small flask exercise, combining the TFIDF algorithm written in python with a web interface.

Grrrrrrrrrrls is a project in progress for the Computer Grrrls exhibition at the HMKV & La Gaîté Lyrique.

Install

$ pip3 install flask 

$ pip3 install nltk

Txt documents

The search machine is using the index.json file to process results. The function 'create_index' can be called to generate this file. It uses a set of plain text files to index each word and its corresponding TFIDF value. The plain text files are not included in this repo, i don't think i can publish them like that.

If you want to work with another set of documents, make a 'txt/' folder, add a few txt files in it, and remove the index.json file (or rename it if you want to keep it with you).

Start

Start the flask/python local server ...

$ python3 start.py

Browse to your localhost on port 5000 ...

> 127.0.0.1:5000

Notes

This Grrrrrrrrrrls search machine cannot handle too much at once: it can only work with one word.

This is a prototype :)