Files for the publication & poster for Data Workers, an exhibition by Algolit. http://www.algolit.net/index.php/Data_Workers
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

248 lines
11 KiB

5 years ago
# plain text workflow
Files for the plain text publication for Data Workers, an exhibition by Algolit at the Mundaneum in Mons from 28 March until 28 April 2019.
5 years ago
5 years ago
<http://www.algolit.net/index.php/Data_Workers>
` _ `<br>
` _ __ ___ | |_ ___ ___ `<br>
`| '_ \ / _ \| __/ _ \/ __|`<br>
`| | | | (_) | || __/\__ \`<br>
5 years ago
`|_| |_|\___/ \__\___||___/`
line width: 110 char
5 years ago
5 years ago
lines per page: 70
5 years ago
5 years ago
70, 140, 210, 280, 350, 420, 490, 560, 630, 700
5 years ago
---------
### --- txt to pdf ---
5 years ago
options ...
#### weasyprint
5 years ago
(stretched the page size, font size, etc, in order to place everything)
#### enscript
5 years ago
(using postscript to create pdf)
5 years ago
5 years ago
`$ enscript --word-wrap --margins=40:10:10:20 --fancy-header writers.intro.txt -o - | ps2pdf - test.pdf`
5 years ago
5 years ago
`$ cat writers.intro.txt | iconv -c -f utf-8 -t ISO-8859-1 | enscript --word-wrap --margins=40:10:10:20 --fancy-header -o - | ps2pdf - test.pdf`
#### txt2pdf
5 years ago
(uses reportlab)
5 years ago
5 years ago
<https://github.com/baruchel/txt2pdf>
5 years ago
5 years ago
`$ python3 txt2pdf/txt2pdf.py -T 1 -B 2 -L 2 -R 1 writers.intro.txt -o test.pdf`
5 years ago
5 years ago
`$ python3 txt2pdf/txt2pdf.py -m A4 -f fonts/fantasque/TTF/FantasqueSansMono-Regular.ttf -s 10 -v 0 -T 1 -B 1 -L 1.5 -R 1.5 data-workers.txt -o test.pdf`
currently using:
`$ python3 txt2pdf/txt2pdf.py -m A4 -f fonts/fantasque/TTF/FantasqueSansMono-Regular.ttf -s 9 -v 0.05 -T 1 -B 0.9 -L 1.5 -R 1.5 data-workers.txt -o test.pdf`
#### PDF2txt miner
5 years ago
The inverted tool of this process
5 years ago
<https://www.unixuser.org/~euske/python/pdfminer/>
*"What's It? PDFMiner is a tool for extracting information from PDF documents. Unlike other PDF-related tools, it focuses entirely on getting and analyzing text data. PDFMiner allows one to obtain the exact location of text in a page, as well as other information such as fonts or lines."*
5 years ago
------
### --- hyphenation ---
5 years ago
#### Hyphenator
5 years ago
<https://pypi.org/project/hyphenator/>
#### textwrap
5 years ago
<https://pypi.org/project/textwrap3/>
------
### --- commands ---
5 years ago
Generate the publication to PDF:
5 years ago
5 years ago
`$ python3 create_all.py && python3 txt2pdf/txt2pdf.py -m A4 -f fonts/unifont-11.0.03.ttf -s 9 -v 0.05 -T 1 -B 0.9 -L 1.6 -R 1.4 data-workers.en.txt -o data-workers.en.pdf`
Add logos.pdf on last page with PDFTK
5 years ago
5 years ago
`$ pdftk data-workers.en.pdf A=data-workers.en.pdf cat A52 output data-workers.en.backcover.pdf`
5 years ago
5 years ago
`$ pdftk data-workers.en.backcover.pdf multistamp logos.pdf output data-workers.en.logos.pdf `
5 years ago
5 years ago
`$ pdftk A=data-workers.en.pdf B=data-workers.en.backcover.logos.pdf cat A1-51 B output data-workers.en.logos.pdf`
PDFTK in one command:
5 years ago
5 years ago
`$ pdftk data-workers.en.pdf A=data-workers.en.pdf cat A52 output data-workers.en.backcover.pdf && pdftk data-workers.en.backcover.pdf multistamp logos.pdf output data-workers.en.logos.pdf && pdftk A=data-workers.en.pdf B=data-workers.en.backcover.logos.pdf cat A1-51 B output data-workers.en.publication.pdf`
5 years ago
------
### --- ASCII/UNICODE fonts ---
5 years ago
Unicode art :)
5 years ago
5 years ago
<http://xahlee.info/comp/unicode_ascii_art.html>
5 years ago
5 years ago
<http://qaz.wtf/u/convert.cgi?text=This+is+pretty+fun+too.+Do+something+for+your+group+tag>
5 years ago
5 years ago
<https://coolsymbol.com/cool-fancy-text-generator.html>
5 years ago
5 years ago
<http://www.alanwood.net/unicode/>
------
### --- unifont ---
5 years ago
<http://unifoundry.com/unifont/index.html>
------
### --- DUMP ---
5 years ago
5 years ago
`[\/\]\<\?\'\)\(\[\\\"\w]`
`░`
5 years ago
work
many authors
write
every human being
who has access
to the internet
interacts
we
chat,
write,
click,
like
and share
we
leave our data
we
find ourselves writing in Python
some neural networks
write
human editors
assist
poets,
playwrights
or novelists
assist
---
5 years ago
Writers write
░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░ ▒▒▒▒▒▒▒▒▒▒▒▒▒▒
Data workers ░░░░░░░░░░░░ need data to ▒▒▒▒ with. work
The data that is used in the context
of Algolit, is written language.
Machine learning relies on many types
Many authors of writing. ░░░░░░░░░░░░ ▒▒▒▒▒ in the write
form of publications, like books or
articles. These are part of organised
archives and are sometimes digitized.
But there are other kinds of writing
every human too. We could say that ░░░░░░░░░░░░
being who has access to the internet ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
is a writer each time they ▒▒▒▒▒▒▒▒▒ interact
with algorithms.
We ░░ ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒. chat, write,
click, like
and share
we In return for free services, ░░ ▒▒▒▒▒ leave
▒▒▒▒▒▒▒▒ that is compiled into profiles our data
and sold for advertisement and research.
Machine learning algorithms are not
critics: they take whatever they're
given, no matter the writing style,
no matter the CV of the author, no
matter their spelling mistakes. In
fact, mistakes make it better: the
more variety, the better they learn
to anticipate unexpected text. But
often, human authors are not aware
of what happens to their work.
Most of the writing we use is in
English, some is in French, some in
Dutch. Most often we find ourselves
writing in Python, the programming
language we use.
Algorithms can be writers too. Some
neural networks write their own rules
and generate their own texts. And for
the models that are still wrestling with
the ambiguities of natural language,
there are human editors to assist them.
Poets, playwrights or novelists start
their new careers as assistants of AI.
5 years ago
---
5 years ago
P r o g r a mm e r s are wr iting the datawork
P r o g r am m e rs are writing the dataworker
P r o g ra m mers are writing the dataworke
P r o g r ammers a re writing the datawor
P r o g r ammers are writing the dataw
P r o gram mers a re writing the data
P r ogra m mer s are writing the d
P r o gramm e r s are writing the d
P r ogram m ers a re writing the d
P r o g ram m e rs a re writing the
P r o g r a m m ers a r e writing the
P r o g r a m m e rs are writing the d
P r o g r a m m e r s a r e wr i ting the dat
P r o g r a mme r s ar e writ ing the dataw
P r o g r amm e r s are writing the datawo
P r o g r amm e r s are writing t he datawo
P r o g ra m m er s a r e writ ing the datawork
P r o g r a mm e r s are wr iting the datawork
P r o g r a m m e rs are writing the datawo
P r o gra m m e rs are w riting the datawork
P r og r a m mers a re writing the datawor
P r o g r a mmers a re writing the datawo
P r o g r ammers a r e writing the dataw
P r o g ra mmers a re writing the dat
P r o g ramm ers a re writing the da
P r ogramm e rs a r e writing the da
P r o gramm e r s a r e writing the d
P r o g ram m e rs a r e writing the d
P r o g r a m m e rs are writing the d
P r o g r a m m e r s a r e w r iting the da
P r o g r a mme r s ar e writ ing the dataw
P r o g ramme r s are writing the datawo
P r o g r ammer s a re w riting th e datawor
P r o g r a m mers a r e writ ing the datawork
P r o g r am me r s are wr iting the datawork
5 years ago