PushingScores/generate_links.py

import sys, os
import json
import re

with open('wordlist.json', 'r') as f:
    wordlist_dict = json.load(f)
#goes through every single file ending in html
#
path = "static/files/"
for path, subdirs, files in os.walk(path):
    for name in files:
        if name.endswith('html'):
            file = os.path.join(path, name)
            with open(file, 'r+', encoding="utf-8") as f:
                textfile = f.read()
                words = re.compile("([\w-]+)").split(textfile)
                words_to_search = wordlist_dict.keys()
                for i, word in enumerate(words):
                    if word in words_to_search:
                        words[i] = "<a href='/diverge?search={}'>{}</a>".format(word, word)

                textfile = "".join(words)
                # for word in wordlist_dict:
                #     word = re.escape(word)
                #     textfile = re.sub(r"(?<!<)(?<!</)(?<!ge\?)\b(%s)\b" %word, r"<a href='/diverge?search=\1'>\1</a>", textfile)
                f.truncate(0)
                f.write(textfile)
                f.truncate()

print(textfile)
so many changes 2019-05-04 16:27:50 +02:00			`import sys, os`
			`import json`
			`import re`

attempts to fix bugs 2019-05-21 11:54:11 +02:00			`with open('wordlist.json', 'r') as f:`
so many changes 2019-05-04 16:27:50 +02:00			`wordlist_dict = json.load(f)`
the score works finely 2019-05-24 16:59:49 +02:00			`#goes through every single file ending in html`
			`#`
so many changes 2019-05-04 16:27:50 +02:00			`path = "static/files/"`
			`for path, subdirs, files in os.walk(path):`
			`for name in files:`
			`if name.endswith('html'):`
			`file = os.path.join(path, name)`
the score works finely 2019-05-24 16:59:49 +02:00			`with open(file, 'r+', encoding="utf-8") as f:`
so many changes 2019-05-04 16:27:50 +02:00			`textfile = f.read()`
changes to fix bugs in generate_links 2019-07-12 17:31:34 +02:00			`words = re.compile("([\w-]+)").split(textfile)`
			`words_to_search = wordlist_dict.keys()`
			`for i, word in enumerate(words):`
			`if word in words_to_search:`
			`words[i] = "<a href='/diverge?search={}'>{}</a>".format(word, word)`

			`textfile = "".join(words)`
			`# for word in wordlist_dict:`
			`# word = re.escape(word)`
			`# textfile = re.sub(r"(?<!<)(?<!</)(?<!ge\?)\b(%s)\b" %word, r"<a href='/diverge?search=\1'>\1</a>", textfile)`
it works! kinda... 2019-05-05 02:26:25 +02:00			`f.truncate(0)`
			`f.write(textfile)`
			`f.truncate()`
generate_links works now 2019-05-05 01:56:05 +02:00
changes to fix bugs in generate_links 2019-07-12 17:31:34 +02:00			`print(textfile)`