update readme
This commit is contained in:
parent
78273b8430
commit
2d7e00c423
206
README.md
206
README.md
@ -1,6 +1,7 @@
|
|||||||
# plain text workflow
|
# plain text workflow
|
||||||
|
|
||||||
Files for the publication & poster for Data Workers, an exhibition by Algolit at the Mundaneum in Mons from 28 March until 28 April 2019.
|
Files for the publication & poster for Data Workers, an exhibition by Algolit at the Mundaneum in Mons from 28 March until 28 April 2019.
|
||||||
|
|
||||||
<http://www.algolit.net/index.php/Data_Workers>
|
<http://www.algolit.net/index.php/Data_Workers>
|
||||||
|
|
||||||
|
|
||||||
@ -15,14 +16,23 @@ line width: 110 char
|
|||||||
lines per page: 70
|
lines per page: 70
|
||||||
|
|
||||||
70
|
70
|
||||||
|
|
||||||
140
|
140
|
||||||
|
|
||||||
210
|
210
|
||||||
|
|
||||||
280
|
280
|
||||||
|
|
||||||
350
|
350
|
||||||
|
|
||||||
420
|
420
|
||||||
|
|
||||||
490
|
490
|
||||||
|
|
||||||
560
|
560
|
||||||
|
|
||||||
630
|
630
|
||||||
|
|
||||||
700
|
700
|
||||||
|
|
||||||
|
|
||||||
@ -35,13 +45,18 @@ options ...
|
|||||||
|
|
||||||
### enscript
|
### enscript
|
||||||
(using postscript to create pdf)
|
(using postscript to create pdf)
|
||||||
|
|
||||||
`$ enscript --word-wrap --margins=40:10:10:20 --fancy-header writers.intro.txt -o - | ps2pdf - test.pdf`
|
`$ enscript --word-wrap --margins=40:10:10:20 --fancy-header writers.intro.txt -o - | ps2pdf - test.pdf`
|
||||||
|
|
||||||
`$ cat writers.intro.txt | iconv -c -f utf-8 -t ISO-8859-1 | enscript --word-wrap --margins=40:10:10:20 --fancy-header -o - | ps2pdf - test.pdf`
|
`$ cat writers.intro.txt | iconv -c -f utf-8 -t ISO-8859-1 | enscript --word-wrap --margins=40:10:10:20 --fancy-header -o - | ps2pdf - test.pdf`
|
||||||
|
|
||||||
### txt2pdf
|
### txt2pdf
|
||||||
(uses reportlab)
|
(uses reportlab)
|
||||||
|
|
||||||
<https://github.com/baruchel/txt2pdf>
|
<https://github.com/baruchel/txt2pdf>
|
||||||
|
|
||||||
`$ python3 txt2pdf/txt2pdf.py -T 1 -B 2 -L 2 -R 1 writers.intro.txt -o test.pdf`
|
`$ python3 txt2pdf/txt2pdf.py -T 1 -B 2 -L 2 -R 1 writers.intro.txt -o test.pdf`
|
||||||
|
|
||||||
`$ python3 txt2pdf/txt2pdf.py -m A4 -f fonts/fantasque/TTF/FantasqueSansMono-Regular.ttf -s 10 -v 0 -T 1 -B 1 -L 1.5 -R 1.5 data-workers.txt -o test.pdf`
|
`$ python3 txt2pdf/txt2pdf.py -m A4 -f fonts/fantasque/TTF/FantasqueSansMono-Regular.ttf -s 10 -v 0 -T 1 -B 1 -L 1.5 -R 1.5 data-workers.txt -o test.pdf`
|
||||||
|
|
||||||
currently using:
|
currently using:
|
||||||
@ -49,8 +64,10 @@ currently using:
|
|||||||
|
|
||||||
### PDF2txt miner
|
### PDF2txt miner
|
||||||
The inverted tool of this process
|
The inverted tool of this process
|
||||||
https://www.unixuser.org/~euske/python/pdfminer/
|
|
||||||
"What's It? PDFMiner is a tool for extracting information from PDF documents. Unlike other PDF-related tools, it focuses entirely on getting and analyzing text data. PDFMiner allows one to obtain the exact location of text in a page, as well as other information such as fonts or lines."
|
<https://www.unixuser.org/~euske/python/pdfminer/>
|
||||||
|
|
||||||
|
*"What's It? PDFMiner is a tool for extracting information from PDF documents. Unlike other PDF-related tools, it focuses entirely on getting and analyzing text data. PDFMiner allows one to obtain the exact location of text in a page, as well as other information such as fonts or lines."*
|
||||||
|
|
||||||
|
|
||||||
## --- hyphenation ---
|
## --- hyphenation ---
|
||||||
@ -65,23 +82,32 @@ https://www.unixuser.org/~euske/python/pdfminer/
|
|||||||
## --- commands ---
|
## --- commands ---
|
||||||
|
|
||||||
Generate the publication to PDF:
|
Generate the publication to PDF:
|
||||||
|
|
||||||
`$ python3 create_all.py && python3 txt2pdf/txt2pdf.py -m A4 -f fonts/unifont-11.0.03.ttf -s 9 -v 0.05 -T 1 -B 0.9 -L 1.6 -R 1.4 data-workers.en.txt -o data-workers.en.pdf`
|
`$ python3 create_all.py && python3 txt2pdf/txt2pdf.py -m A4 -f fonts/unifont-11.0.03.ttf -s 9 -v 0.05 -T 1 -B 0.9 -L 1.6 -R 1.4 data-workers.en.txt -o data-workers.en.pdf`
|
||||||
|
|
||||||
Add logos.pdf on last page with PDFTK
|
Add logos.pdf on last page with PDFTK
|
||||||
|
|
||||||
`$ pdftk data-workers.en.pdf A=data-workers.en.pdf cat A52 output data-workers.en.backcover.pdf`
|
`$ pdftk data-workers.en.pdf A=data-workers.en.pdf cat A52 output data-workers.en.backcover.pdf`
|
||||||
|
|
||||||
`$ pdftk data-workers.en.backcover.pdf multistamp logos.pdf output data-workers.en.logos.pdf `
|
`$ pdftk data-workers.en.backcover.pdf multistamp logos.pdf output data-workers.en.logos.pdf `
|
||||||
|
|
||||||
`$ pdftk A=data-workers.en.pdf B=data-workers.en.backcover.logos.pdf cat A1-51 B output data-workers.en.logos.pdf`
|
`$ pdftk A=data-workers.en.pdf B=data-workers.en.backcover.logos.pdf cat A1-51 B output data-workers.en.logos.pdf`
|
||||||
|
|
||||||
PDFTK in one command:
|
PDFTK in one command:
|
||||||
|
|
||||||
`$ pdftk data-workers.en.pdf A=data-workers.en.pdf cat A52 output data-workers.en.backcover.pdf && pdftk data-workers.en.backcover.pdf multistamp logos.pdf output data-workers.en.logos.pdf && pdftk A=data-workers.en.pdf B=data-workers.en.backcover.logos.pdf cat A1-51 B output data-workers.en.logos.pdf`
|
`$ pdftk data-workers.en.pdf A=data-workers.en.pdf cat A52 output data-workers.en.backcover.pdf && pdftk data-workers.en.backcover.pdf multistamp logos.pdf output data-workers.en.logos.pdf && pdftk A=data-workers.en.pdf B=data-workers.en.backcover.logos.pdf cat A1-51 B output data-workers.en.logos.pdf`
|
||||||
|
|
||||||
|
|
||||||
## --- ASCII/UNICODE fonts ---
|
## --- ASCII/UNICODE fonts ---
|
||||||
|
|
||||||
Unicode art :)
|
Unicode art :)
|
||||||
|
|
||||||
<http://xahlee.info/comp/unicode_ascii_art.html>
|
<http://xahlee.info/comp/unicode_ascii_art.html>
|
||||||
|
|
||||||
<http://qaz.wtf/u/convert.cgi?text=This+is+pretty+fun+too.+Do+something+for+your+group+tag>
|
<http://qaz.wtf/u/convert.cgi?text=This+is+pretty+fun+too.+Do+something+for+your+group+tag>
|
||||||
|
|
||||||
<https://coolsymbol.com/cool-fancy-text-generator.html>
|
<https://coolsymbol.com/cool-fancy-text-generator.html>
|
||||||
|
|
||||||
<http://www.alanwood.net/unicode/>
|
<http://www.alanwood.net/unicode/>
|
||||||
|
|
||||||
|
|
||||||
@ -91,8 +117,10 @@ Unicode art :)
|
|||||||
|
|
||||||
## --- DUMP ---
|
## --- DUMP ---
|
||||||
|
|
||||||
[\/\]\<\?\'\)\(\[\\\"\w]
|
`[\/\]\<\?\'\)\(\[\\\"\w]`
|
||||||
░
|
|
||||||
|
`░`
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
work
|
work
|
||||||
@ -123,98 +151,98 @@ Unicode art :)
|
|||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
Writers write
|
Writers write
|
||||||
░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░ ▒▒▒▒▒▒▒▒▒▒▒▒▒▒
|
░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░ ▒▒▒▒▒▒▒▒▒▒▒▒▒▒
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
Data workers ░░░░░░░░░░░░ need data to ▒▒▒▒ with. work
|
Data workers ░░░░░░░░░░░░ need data to ▒▒▒▒ with. work
|
||||||
The data that is used in the context
|
The data that is used in the context
|
||||||
of Algolit, is written language.
|
of Algolit, is written language.
|
||||||
Machine learning relies on many types
|
Machine learning relies on many types
|
||||||
Many authors of writing. ░░░░░░░░░░░░ ▒▒▒▒▒ in the write
|
Many authors of writing. ░░░░░░░░░░░░ ▒▒▒▒▒ in the write
|
||||||
form of publications, like books or
|
form of publications, like books or
|
||||||
articles. These are part of organised
|
articles. These are part of organised
|
||||||
archives and are sometimes digitized.
|
archives and are sometimes digitized.
|
||||||
But there are other kinds of writing
|
But there are other kinds of writing
|
||||||
every human too. We could say that ░░░░░░░░░░░░
|
every human too. We could say that ░░░░░░░░░░░░
|
||||||
being who has access to the internet ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
|
being who has access to the internet ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
|
||||||
is a writer each time they ▒▒▒▒▒▒▒▒▒ interact
|
is a writer each time they ▒▒▒▒▒▒▒▒▒ interact
|
||||||
with algorithms.
|
with algorithms.
|
||||||
|
|
||||||
We ░░ ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒. chat, write,
|
We ░░ ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒. chat, write,
|
||||||
click, like
|
click, like
|
||||||
and share
|
and share
|
||||||
|
|
||||||
we In return for free services, ░░ ▒▒▒▒▒ leave
|
we In return for free services, ░░ ▒▒▒▒▒ leave
|
||||||
▒▒▒▒▒▒▒▒ that is compiled into profiles our data
|
▒▒▒▒▒▒▒▒ that is compiled into profiles our data
|
||||||
and sold for advertisement and research.
|
and sold for advertisement and research.
|
||||||
|
|
||||||
Machine learning algorithms are not
|
Machine learning algorithms are not
|
||||||
critics: they take whatever they're
|
critics: they take whatever they're
|
||||||
given, no matter the writing style,
|
given, no matter the writing style,
|
||||||
no matter the CV of the author, no
|
no matter the CV of the author, no
|
||||||
matter their spelling mistakes. In
|
matter their spelling mistakes. In
|
||||||
fact, mistakes make it better: the
|
fact, mistakes make it better: the
|
||||||
more variety, the better they learn
|
more variety, the better they learn
|
||||||
to anticipate unexpected text. But
|
to anticipate unexpected text. But
|
||||||
often, human authors are not aware
|
often, human authors are not aware
|
||||||
of what happens to their work.
|
of what happens to their work.
|
||||||
|
|
||||||
Most of the writing we use is in
|
Most of the writing we use is in
|
||||||
English, some is in French, some in
|
English, some is in French, some in
|
||||||
Dutch. Most often we find ourselves
|
Dutch. Most often we find ourselves
|
||||||
writing in Python, the programming
|
writing in Python, the programming
|
||||||
language we use.
|
language we use.
|
||||||
|
|
||||||
Algorithms can be writers too. Some
|
Algorithms can be writers too. Some
|
||||||
neural networks write their own rules
|
neural networks write their own rules
|
||||||
and generate their own texts. And for
|
and generate their own texts. And for
|
||||||
the models that are still wrestling with
|
the models that are still wrestling with
|
||||||
the ambiguities of natural language,
|
the ambiguities of natural language,
|
||||||
there are human editors to assist them.
|
there are human editors to assist them.
|
||||||
Poets, playwrights or novelists start
|
Poets, playwrights or novelists start
|
||||||
their new careers as assistants of AI.
|
their new careers as assistants of AI.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
P r o g r a mm e r s are wr iting the datawork
|
P r o g r a mm e r s are wr iting the datawork
|
||||||
P r o g r am m e rs are writing the dataworker
|
P r o g r am m e rs are writing the dataworker
|
||||||
P r o g ra m mers are writing the dataworke
|
P r o g ra m mers are writing the dataworke
|
||||||
P r o g r ammers a re writing the datawor
|
P r o g r ammers a re writing the datawor
|
||||||
P r o g r ammers are writing the dataw
|
P r o g r ammers are writing the dataw
|
||||||
P r o gram mers a re writing the data
|
P r o gram mers a re writing the data
|
||||||
P r ogra m mer s are writing the d
|
P r ogra m mer s are writing the d
|
||||||
P r o gramm e r s are writing the d
|
P r o gramm e r s are writing the d
|
||||||
P r ogram m ers a re writing the d
|
P r ogram m ers a re writing the d
|
||||||
P r o g ram m e rs a re writing the
|
P r o g ram m e rs a re writing the
|
||||||
P r o g r a m m ers a r e writing the
|
P r o g r a m m ers a r e writing the
|
||||||
P r o g r a m m e rs are writing the d
|
P r o g r a m m e rs are writing the d
|
||||||
P r o g r a m m e r s a r e wr i ting the dat
|
P r o g r a m m e r s a r e wr i ting the dat
|
||||||
P r o g r a mme r s ar e writ ing the dataw
|
P r o g r a mme r s ar e writ ing the dataw
|
||||||
P r o g r amm e r s are writing the datawo
|
P r o g r amm e r s are writing the datawo
|
||||||
P r o g r amm e r s are writing t he datawo
|
P r o g r amm e r s are writing t he datawo
|
||||||
P r o g ra m m er s a r e writ ing the datawork
|
P r o g ra m m er s a r e writ ing the datawork
|
||||||
P r o g r a mm e r s are wr iting the datawork
|
P r o g r a mm e r s are wr iting the datawork
|
||||||
P r o g r a m m e rs are writing the datawo
|
P r o g r a m m e rs are writing the datawo
|
||||||
P r o gra m m e rs are w riting the datawork
|
P r o gra m m e rs are w riting the datawork
|
||||||
P r og r a m mers a re writing the datawor
|
P r og r a m mers a re writing the datawor
|
||||||
P r o g r a mmers a re writing the datawo
|
P r o g r a mmers a re writing the datawo
|
||||||
P r o g r ammers a r e writing the dataw
|
P r o g r ammers a r e writing the dataw
|
||||||
P r o g ra mmers a re writing the dat
|
P r o g ra mmers a re writing the dat
|
||||||
P r o g ramm ers a re writing the da
|
P r o g ramm ers a re writing the da
|
||||||
P r ogramm e rs a r e writing the da
|
P r ogramm e rs a r e writing the da
|
||||||
P r o gramm e r s a r e writing the d
|
P r o gramm e r s a r e writing the d
|
||||||
P r o g ram m e rs a r e writing the d
|
P r o g ram m e rs a r e writing the d
|
||||||
P r o g r a m m e rs are writing the d
|
P r o g r a m m e rs are writing the d
|
||||||
P r o g r a m m e r s a r e w r iting the da
|
P r o g r a m m e r s a r e w r iting the da
|
||||||
P r o g r a mme r s ar e writ ing the dataw
|
P r o g r a mme r s ar e writ ing the dataw
|
||||||
P r o g ramme r s are writing the datawo
|
P r o g ramme r s are writing the datawo
|
||||||
P r o g r ammer s a re w riting th e datawor
|
P r o g r ammer s a re w riting th e datawor
|
||||||
P r o g r a m mers a r e writ ing the datawork
|
P r o g r a m mers a r e writ ing the datawork
|
||||||
P r o g r am me r s are wr iting the datawork
|
P r o g r am me r s are wr iting the datawork
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user