manetta
4 years ago
5 changed files with 528 additions and 0 deletions
@ -0,0 +1,87 @@ |
|||
# RECbot |
|||
|
|||
A small XMPP bot written in Python that logs XMPP conversations into an HTML page, allowing collaborative log writing over time.
|||
|
|||
The bot is used in group chats, where it logs all images that are sent to the group and all messages that include one of the `__ACTION WORDS__` described below, such as `__ADD__`.
|||
|
|||
*work-in-progress* |
|||
|
|||
## Situated tails |
|||
|
|||
* Archive bot, Relearn 2017, <https://gitlab.com/relearn/relearn2017/-/tree/master/xmpp-bots/archive-bot> |
|||
* Streambot, Varia website extension 2017-2018, <https://git.vvvvvvaria.org/varia/xmpp.streambot> |
|||
* Logbot, Varia XMPP extension 2017-2020, <https://git.vvvvvvaria.org/varia/bots/src/branch/master/logbot> |
|||
|
|||
## Use RECbot |
|||
|
|||
* check if `RECbot` is one of the participants in the groupchat! |
|||
* send an image to the groupchat **OR** use one of the `__ACTION WORDS__` below |
|||
* the bot replies and thanks you kindly |
|||
* check the output of RECbot (locally or online, for example: <https://vvvvvvaria.org/logs>) |
|||
|
|||
RECbot works with `__ACTION WORDS__` and unique `:HANDLES`. |
|||
|
|||
* `__ADD__` RECbot entries with `__ADD__ <message>`, for example: `__ADD__ Logging as a form of stretching time.` or `__ADD__ https://nicelink.org` |
|||
* `__DELETE__` RECbot entries with `__DELETE__ :HANDLE`, for example: `__DELETE__ :~+*/+-` (\*spark) |
|||
* `__BOOK__` (\*sparks) |
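
To illustrate how an action word and its argument can be pulled out of a chat message, here is a minimal sketch. It is not the code RECbot runs: `parse_action` and `ACTION_WORDS` are hypothetical names, and the sketch assumes the action word can appear anywhere in the message, with everything after it counting as the argument.

```python
import re

# Hypothetical list of action words, taken from the README above.
ACTION_WORDS = ("__ADD__", "__DELETE__", "__BOOK__")

# One pattern that matches any of the action words literally.
ACTION_PATTERN = re.compile("|".join(re.escape(word) for word in ACTION_WORDS))


def parse_action(body):
    """Return (action, argument) for the first action word in body, or None."""
    match = ACTION_PATTERN.search(body)
    if match is None:
        return None
    action = match.group(0).strip("_")    # "__ADD__" -> "ADD"
    argument = body[match.end():].strip()  # text that follows the action word
    return action, argument


print(parse_action("__ADD__ Logging as a form of stretching time."))
# ('ADD', 'Logging as a form of stretching time.')
print(parse_action("__DELETE__ :~+*/+-"))
# ('DELETE', ':~+*/+-')
print(parse_action("just chatting"))
# None
```

Returning `None` for ordinary messages lets the caller ignore them without any special casing.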
|||
|
|||
## Install RECbot |
|||
|
|||
RECbot uses the `slixmpp` library to connect to XMPP and `beautifulsoup` to parse the HTML pages. |
|||
|
|||
`$ sudo pip3 install slixmpp beautifulsoup4` |
|||
|
|||
## Run RECbot! |
|||
|
|||
`$ python3 RECbot.py` |
|||
|
|||
The bot will ask you to provide the following details: |
|||
|
|||
* XMPP address of a (bot)account |
|||
* password |
|||
* groupchat address |
|||
* nickname for the bot |
|||
* output folder path |
|||
|
|||
You can also run it as a one-liner by passing all details as arguments, for example:
|||
|
|||
`$ python3 RECbot.py -u bot@vvvvvvaria.org -p CHANGEME -g roomname@muc.vvvvvvaria.org -n RECbot -o /var/www/logs/` |
|||
|
|||
* `-u` / `--use` = user / use this XMPP address |
|||
* `-p` / `--password` = password |
|||
* `-g` / `--groupchat` = groupchat |
|||
* `-n` / `--nick` = nickname
|||
* `-o` / `--output` = output |
|||
|
|||
## \*sparks |
|||
|
|||
----------- |
|||
|
|||
It would be so nice to have different RECbot *modes*: `--log`, `--stream`, `--distribusi` |
|||
|
|||
* `--log`: RECbot writes a growing HTML page with images and text, that can be marked up and styled in HTML/CSS. |
|||
* `--stream`: RECbot stores all images that are sent to the group, and displays them as an image stream.
|||
* `--distribusi`: RECbot saves files (images, messages as markdown, files, links as HTML pages) and generates a distribusi page of all collected material. |
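
One way these modes could be selected on the command line is sketched below. This is only an assumption about a possible interface: the flags do not exist in RECbot yet, `build_parser` is a hypothetical helper, and falling back to `log` when no flag is given is a choice made for this example.

```python
from argparse import ArgumentParser


def build_parser():
    """Build a parser in which at most one mode flag may be given."""
    parser = ArgumentParser(description="RECbot mode selection (sketch)")
    modes = parser.add_mutually_exclusive_group()
    # All three flags store their name in the same attribute: args.mode
    modes.add_argument("--log", dest="mode", action="store_const", const="log")
    modes.add_argument("--stream", dest="mode", action="store_const", const="stream")
    modes.add_argument("--distribusi", dest="mode", action="store_const", const="distribusi")
    parser.set_defaults(mode="log")  # assumed default when no flag is passed
    return parser


args = build_parser().parse_args(["--stream"])
print(args.mode)  # stream
```

Because the flags share one mutually exclusive group, `argparse` rejects a call that combines two modes.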
|||
|
|||
Under the hood the process can be cut up into two procedures: |
|||
|
|||
* saving text/image/audio/video based messages as files (.txt, .png/.jpg, .ogg, .ogv/.mp4)
|||
* recbot.py |
|||
* generating different outputs, depending on the selected *mode* |
|||
* distribusi.py[\*] |
|||
* log.py[\*] |
|||
* stream.py[\*] |
|||
|
|||
These modes can be changed at any moment. |
|||
|
|||
[\*] These are standalone scripts. They can be used on any set of files in a folder and generate HTML pages with customizable styling. |
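
As a rough idea of what such a standalone script could look like, the sketch below builds an image-stream page from whatever images are found in a folder. None of this exists in the repository yet: `build_stream_page`, the list of extensions and the page layout are assumptions, and the `entry image` class names are borrowed from the markup the bot currently writes.

```python
import html
import os

# Assumed set of image extensions; adjust to what the bot actually saves.
IMAGE_EXTENSIONS = (".png", ".jpg", ".jpeg", ".gif")


def build_stream_page(folder, title="Stream"):
    """Return an HTML page showing every image in folder, sorted by filename."""
    images = sorted(
        name for name in os.listdir(folder)
        if name.lower().endswith(IMAGE_EXTENSIONS)
    )
    items = "\n".join(
        f'<div class="entry image"><img src="{html.escape(name, quote=True)}"></div>'
        for name in images
    )
    return (
        "<!DOCTYPE html>\n<html>\n<head>\n"
        '<meta charset="utf-8">\n'
        f"<title>{html.escape(title)}</title>\n"
        '<link rel="stylesheet" type="text/css" href="stylesheet.css">\n'
        "</head>\n<body>\n"
        f"{items}\n"
        "</body>\n</html>\n"
    )
```

The function only returns a string, so the same folder can be rendered again at any moment, or by a different script, without touching the saved files.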
|||
|
|||
------------ |
|||
|
|||
How can `__ACTION WORDS__` become `__MAGIC WORDS__` ??? |
|||
|
|||
------------ |
|||
|
|||
|
|||
|
|||
|
@ -0,0 +1,310 @@ |
|||
#!/usr/bin/env python3 |
|||
# -*- coding: utf-8 -*- |
|||
|
|||
# To run this bot: |
|||
# $ python3 logbot.py |
|||
# The output folder of this bot currently is: /var/www/logs/digital-autonomy |
|||
|
|||
import logging |
|||
from getpass import getpass |
|||
from argparse import ArgumentParser |
|||
import slixmpp |
|||
import ssl, os, requests, urllib.request
|||
from datetime import datetime |
|||
from bs4 import BeautifulSoup |
|||
import re, random
|||
|
|||
def check_handle(handle, used_handles):
    # True when the handle has already been handed out
    return handle in used_handles


def request_handle(used_handles_path):
    # Read both lists without trailing newlines,
    # so that the membership test compares like with like.
    with open(used_handles_path, 'r') as f:
        used_handles = f.read().splitlines()
    with open('handles.txt', 'r') as f:
        handles = f.read().splitlines()

    # pick a random handle that is not used yet
    # (note: this loops forever once every handle in handles.txt is taken)
    handle = random.choice(handles)
    while check_handle(handle, used_handles):
        handle = random.choice(handles)

    # add handle to .handles.txt, one handle per line
    with open(used_handles_path, 'a+') as h:
        h.write(handle + '\n')

    return handle


def write_to_log(self, entry):
    output = self.output
    log_path = os.path.join(output, 'index.html')
    css_path = os.path.join(output, 'stylesheet.css')
    used_handles_path = os.path.join(output, '.handles.txt')

    # check if the log exists, if not: write it,
    # together with the stylesheet and the list of used handles
    if not os.path.isfile(log_path):
        with open('templates/log.html', 'r') as t:
            html_template = t.read()
        with open('templates/stylesheet.css', 'r') as t:
            css_template = t.read()
        with open(log_path, 'w') as l:
            l.write(html_template)
            l.write(f'<h1>{ self.groupchat }</h1>')
        with open(css_path, 'w') as c:
            c.write(css_template)
        with open(used_handles_path, 'w') as h:
            h.write('-----\n')

    # add entry to log
    # (request_handle also records the handle in .handles.txt)
    handle = request_handle(used_handles_path)
    print(f'Picked a handle: { handle }')
    now = datetime.now().strftime('%A %d %B (%Y)')
    print(f'Now is: { now }')
    post = f'''<div id="{ handle }" class="post">
<small class="postid">{ handle }</small>
{ entry }
<small class="date">Added on { now }</small>
<small class="tags">Tags:<span class="tagcontainer"></span></small>
</div>'''
    print(f'Post: { post }')
    with open(log_path, 'a+') as l:
        l.write(post)
    print('added to the log!')
|||
|
|||
def find_in_soup(self, handle, tag):
    # Append `tag` as a <span> to the tag container of the post with id `handle`.
    print('--------ADD TAG ---------')
    print(f'handle: { handle }')
    log_path = os.path.join(self.output, 'index.html')
    with open(log_path, 'r') as l:
        soup = BeautifulSoup(l.read(), 'html.parser')

    post = soup.find(id=handle)
    print(f'post: { post }')
    if post:
        new_tag = soup.new_tag("span")
        new_tag.append(tag)
        post.find(class_="tagcontainer").append(new_tag)
        print(f'new soup: { str(soup) } ')

        # write soup to file
        with open(log_path, 'w') as l:
            l.write(str(soup))
|||
|
|||
|
|||
class MUCBot(slixmpp.ClientXMPP): |
|||
""" |
|||
A simple Slixmpp bot that will save images |
|||
and messages that are marked with __ADD__ to a folder.
|||
""" |
|||
|
|||
def __init__(self, use, password, groupchat, nickname, output): |
|||
slixmpp.ClientXMPP.__init__(self, use, password) |
|||
|
|||
self.groupchat = groupchat |
|||
self.nick = nickname |
|||
self.output = output |
|||
|
|||
# The session_start event will be triggered when |
|||
# the bot establishes its connection with the server |
|||
# and the XML logs are ready for use. We want to |
|||
# listen for this event so that we can initialize
|||
# our roster. |
|||
self.add_event_handler("session_start", self.start) |
|||
|
|||
# The groupchat_message event is triggered whenever a message |
|||
# stanza is received from any chat room. If you also
|||
# register a handler for the 'message' event, MUC messages |
|||
# will be processed by both handlers. |
|||
self.add_event_handler("groupchat_message", self.muc_message) |
|||
|
|||
def start(self, event): |
|||
self.get_roster() |
|||
self.send_presence() |
|||
|
|||
# https://xmpp.org/extensions/xep-0045.html |
|||
self.plugin['xep_0045'].join_muc(self.groupchat, |
|||
self.nick, |
|||
# If a room password is needed, use: |
|||
# password=the_room_password, |
|||
wait=True) |
|||
|
|||
# NOTE(luke): disabled for now. We'll make it possible to speak to logbot privately later |
|||
# Send a message to the room |
|||
# self.send_message(mto=self.groupchat, mbody='Hello! RECbot here. I\'m new :). You can log text/image/sound/video messages, by including @bot in your message. Happy logging! PS. you can access the logs at https://vvvvvvaria.org/logs/', mtype='groupchat') |
|||
|
|||
def muc_message(self, msg): |
|||
# Some inspection commands |
|||
#print('Message: {}'.format(msg)) |
|||
|
|||
# Always check that a message is not the bot itself, otherwise you will create an infinite loop responding to your own messages. |
|||
if msg['mucnick'] != self.nick: |
|||
|
|||
# Check if output folder exists |
|||
if not os.path.exists(self.output): |
|||
os.mkdir(self.output) |
|||
|
|||
# Check if an OOB URL is included in the stanza (which is how an image is sent) |
|||
# (OOB object - https://xmpp.org/extensions/xep-0066.html#x-oob) |
|||
if len(msg['oob']['url']) > 0: |
|||
|
|||
# Send a reply |
|||
self.send_message(mto=self.groupchat, |
|||
mbody="Super, our log is growing. Your image is added!", |
|||
mtype='groupchat') |
|||
|
|||
# Save the image to the output folder |
|||
url = msg['oob']['url'] # grep the url in the message |
|||
filename = os.path.basename(url) # grep the filename in the url |
|||
output_path = os.path.join(self.output, filename) |
|||
u = urllib.request.urlopen(url) # read the image data |
|||
f = open(output_path, 'wb') # open the output file |
|||
f.write(u.read()) # write image to file |
|||
f.close() # close the output file |
|||
|
|||
# Add the image to the log |
|||
img = f'<div class="entry image"><img src="{ filename }"></div>' |
|||
write_to_log(self, img) |
|||
|
|||
# Include a new post in the log (only when '__ADD__' is used in the message) |
|||
if '__ADD__' in msg['body']: |
|||
|
|||
# reply from the bot |
|||
self.send_message(mto=self.groupchat, |
|||
mbody=f'Noted! And added to the log. Thanks { msg["mucnick"] }!', |
|||
mtype='groupchat') |
|||
|
|||
# Add the message to the log! |
|||
message = msg['body'].replace('__ADD__','') |
|||
message = f'<div class="entry text">{ message }</div>' |
|||
write_to_log(self, message) |
|||
|
|||
# Annotate an existing post in the log (only when '__ANNOTATE__ <handle> <annotation>' is used in the message)
|||
if '__ANNOTATE__' in msg['body']: |
|||
|
|||
handle = msg['body'].split()[1] |
|||
annotation = msg['body'].replace('__ANNOTATE__', '').replace(handle, '') |
|||
post = find_in_soup(self, handle, annotation) |
|||
|
|||
# reply from the bot |
|||
self.send_message(mto=self.groupchat, |
|||
mbody="Thanks!", |
|||
mtype='groupchat') |
|||
|
|||
# Check if this is a book ... |
|||
if '__BOOK__' in msg['body']: |
|||
|
|||
self.send_message(mto=self.groupchat, |
|||
mbody="Oh a book, that's cool! Thanks {}!".format(msg['mucnick']), |
|||
mtype='groupchat') |
|||
|
|||
# Start of book feature |
|||
book = msg['body'].replace('__BOOK__', '').replace('@bot', '') # strip the action word, keep only the search terms
|||
book = re.sub(' +', ' ', book) # remove double spaces |
|||
book = book.lstrip().rstrip() # remove spaces at the beginning and at the end |
|||
book = book.replace(' ', '+').lower() # turn space into + and lowercase |
|||
|
|||
page_link = 'https://www.worldcat.org/search?q={}&qt=results_page'.format(book) |
|||
page_response = requests.get(page_link, timeout=5) |
|||
page_content = BeautifulSoup(page_response.content, "html.parser") |
|||
|
|||
try: |
|||
book_title = page_content.findAll("div", {"class": "name"})[0].text |
|||
book_author = page_content.findAll("div", {"class": "author"})[0].text |
|||
book_publisher = page_content.findAll("div", {"class": "publisher"})[0].text |
|||
|
|||
response = '<b>BOOK</b>: ' + book_title + ' ' + book_author + ' ' + book_publisher |
|||
|
|||
book_found = True |
|||
|
|||
except IndexError: |
|||
|
|||
book_found = False |
|||
|
|||
if book_found: |
|||
|
|||
# Add message to log |
|||
message = '<b>BOOK</b>: ' + book_title + ' ' + book_author + ' ' + book_publisher |
|||
message = f'<div class="entry book">{ message }</div>' |
|||
write_to_log(self, message) |
|||
|
|||
self.send_message(mto=self.groupchat, mbody='Hope this was the book you were looking for: ' + book_title + ' ' + book_author + ' ' + book_publisher, mtype='groupchat') |
|||
|
|||
else: |
|||
|
|||
self.send_message(mto=self.groupchat, mbody='Sorry, no book found!', mtype='groupchat') |
|||
|
|||
|
|||
if __name__ == '__main__': |
|||
# Setup the command line arguments. |
|||
parser = ArgumentParser() |
|||
|
|||
# output verbosity options. |
|||
parser.add_argument("-q", "--quiet", help="set logging to ERROR", |
|||
action="store_const", dest="loglevel", |
|||
const=logging.ERROR, default=logging.INFO) |
|||
parser.add_argument("-d", "--debug", help="set logging to DEBUG", |
|||
action="store_const", dest="loglevel", |
|||
const=logging.DEBUG, default=logging.INFO) |
|||
|
|||
# Different options. |
|||
parser.add_argument("-u", "--use", dest="use", |
|||
help="XMPP address to use") |
|||
parser.add_argument("-p", "--password", dest="password", |
|||
help="password to use") |
|||
parser.add_argument("-g", "--groupchat", dest="groupchat", |
|||
help="groupchat to join") |
|||
parser.add_argument("-n", "--nick", dest="nickname", |
|||
help="nickname for the bot") |
|||
parser.add_argument("-o", "--output", dest="output", |
|||
help="output folder, this is where the files are stored", |
|||
type=str) |
|||
|
|||
args = parser.parse_args() |
|||
|
|||
# Setup logging. |
|||
logging.basicConfig(level=args.loglevel, |
|||
format='%(levelname)-8s %(message)s') |
|||
|
|||
if args.use is None: |
|||
args.use = input("Use this XMPP address for the bot: ") |
|||
if args.password is None: |
|||
args.password = getpass("Password: ") |
|||
if args.groupchat is None: |
|||
args.groupchat = input("Groupchat XMPP address: ") |
|||
if args.nickname is None: |
|||
args.nickname = input("Nickname for the bot: ") |
|||
if args.output is None: |
|||
args.output = input("Output folder path of the log: ") |
|||
|
|||
# Setup the MUCBot and register plugins. Note that while plugins may |
|||
# have interdependencies, the order in which you register them does |
|||
# not matter. |
|||
xmpp = MUCBot(args.use, args.password, args.groupchat, args.nickname, args.output) |
|||
xmpp.register_plugin('xep_0030') # Service Discovery |
|||
xmpp.register_plugin('xep_0045') # Multi-User Chat |
|||
xmpp.register_plugin('xep_0199') # XMPP Ping |
|||
xmpp.register_plugin('xep_0066') # Process URI's (files, images) |
|||
|
|||
# Connect to the XMPP server and start processing XMPP stanzas. |
|||
xmpp.connect() |
|||
xmpp.process() |
@ -0,0 +1,16 @@ |
|||
import random

# '-' is listed three times, so it is picked more often than the other characters
characters = ['*','+','-','/','-','-']
handles = set()

# generate 1000 unique handles of 5 characters each
# (4 distinct characters give 4**5 = 1024 possible handles, so the loop can finish)
while len(handles) < 1000:
    handle = ''
    for h in range(5):
        handle += random.choice(characters)
    handles.add(handle)

# write handles to file, one per line
with open('handles.txt', 'w') as out:
    for handle in handles:
        out.write(handle + '\n')
@ -0,0 +1,47 @@ |
|||
<!DOCTYPE html> |
|||
<html> |
|||
<head> |
|||
<meta charset="utf-8"> |
|||
<title>Log</title> |
|||
<link rel="stylesheet" type="text/css" href="stylesheet.css"> |
|||
</head> |
|||
<body> |
|||
<div id="welcome"> |
|||
<p>Welcome to this Log!</p> |
|||
<p>This Log file is written through <em>logbot</em> and chat messages exchanged in an <em>XMPP groupchat</em>.</p>
|||
<hr> |
|||
<p>For the writers of this log, you can: |
|||
<br> |
|||
<br> |
|||
send an image, |
|||
<br> |
|||
<br> |
|||
<code>__ADD__</code> a message, |
|||
<br> |
|||
<br> |
|||
<code>__DELETE__</code> it by using the <code>~HANDLE</code> on the left (*spark), |
|||
<br> |
|||
<br> |
|||
<code>__ANNOTATE__</code> something using the <code>~HANDLE</code>, |
|||
<br> |
|||
<br> |
|||
<!-- <code>__ECHO__</code> material using the <code>~HANDLE</code> or a <code>#TAG</code> (*spark), --> |
|||
<!-- <br> --> |
|||
<!-- <br> --> |
|||
<code>__BOOK__</code> (*spark, almost there), |
|||
<br> |
|||
<br> |
|||
or, ... (*spark) |
|||
</p> |
|||
<!-- <hr> --> |
|||
</div> |
|||
<!-- <div id="echo"> |
|||
<label for="echo" style="display: none;">__ECHO__</label> |
|||
<select name="echo" id="echo"> |
|||
<option value="~HANDLE">~HANDLE</option> |
|||
<option value="#TAG">#TAG</option> |
|||
</select> |
|||
<input type="text" name="echo"> |
|||
<button><code>__ECHO__</code></button> |
|||
</div> --> |
|||
<!-- Hmm ... We don't close the body anymore ... --> |
@ -0,0 +1,68 @@ |
|||
body{ |
|||
background-color: lightgrey; |
|||
min-width: 1080px; |
|||
margin: 40px; |
|||
font-size: 20px; |
|||
line-height: 24px; |
|||
} |
|||
div#welcome{ |
|||
float: right; |
|||
top:40px; |
|||
right:40px; |
|||
width: 200px; |
|||
font-size: 16px; |
|||
} |
|||
div#welcome p{ |
|||
margin:0 0 1em 0; |
|||
padding:0; |
|||
} |
|||
div#welcome hr{ |
|||
border:1px dotted blue; |
|||
margin:2em 0; |
|||
} |
|||
div#echo{ |
|||
position: fixed; |
|||
bottom: 0; |
|||
left: 0; |
|||
width: 100%; |
|||
padding: 0.5em; |
|||
background-color: pink; |
|||
} |
|||
div.post{ |
|||
margin: 2em 5em 2em 9em; |
|||
width: 800px; |
|||
} |
|||
div.post span.tagcontainer span{ |
|||
padding-left: 0.5em; |
|||
color: blue; |
|||
} |
|||
|
|||
p{ |
|||
margin: 1em 0; |
|||
} |
|||
code{ |
|||
color: blue; |
|||
} |
|||
small{ |
|||
font-size: 12px; |
|||
line-height:1.2; |
|||
} |
|||
small.postid{ |
|||
float: left; |
|||
font-family: monospace; |
|||
margin: 0 0 0 -180px; |
|||
padding: 1em 1.5em; |
|||
/*border-radius: 50px;*/ |
|||
/*border: 1px dotted blue;*/ |
|||
color: blue; |
|||
background-color: white; |
|||
font-size: 20px; |
|||
} |
|||
small.date{ |
|||
display:block; |
|||
color:magenta; |
|||
margin:1em 0; |
|||
} |
|||
img{ |
|||
max-width: 100%; |
|||
} |