update readme

This commit is contained in:
manetta 2019-03-25 08:38:59 +01:00
parent 78273b8430
commit 2d7e00c423

View File

@ -1,6 +1,7 @@
# plain text workflow # plain text workflow
Files for the publication & poster for Data Workers, an exhibition by Algolit at the Mundaneum in Mons from 28 March until 28 April 2019. Files for the publication & poster for Data Workers, an exhibition by Algolit at the Mundaneum in Mons from 28 March until 28 April 2019.
<http://www.algolit.net/index.php/Data_Workers> <http://www.algolit.net/index.php/Data_Workers>
@ -15,14 +16,23 @@ line width: 110 char
lines per page: 70 lines per page: 70
70 70
140 140
210 210
280 280
350 350
420 420
490 490
560 560
630 630
700 700
@ -35,13 +45,18 @@ options ...
### enscript ### enscript
(using postscript to create pdf) (using postscript to create pdf)
`$ enscript --word-wrap --margins=40:10:10:20 --fancy-header writers.intro.txt -o - | ps2pdf - test.pdf` `$ enscript --word-wrap --margins=40:10:10:20 --fancy-header writers.intro.txt -o - | ps2pdf - test.pdf`
`$ cat writers.intro.txt | iconv -c -f utf-8 -t ISO-8859-1 | enscript --word-wrap --margins=40:10:10:20 --fancy-header -o - | ps2pdf - test.pdf` `$ cat writers.intro.txt | iconv -c -f utf-8 -t ISO-8859-1 | enscript --word-wrap --margins=40:10:10:20 --fancy-header -o - | ps2pdf - test.pdf`
### txt2pdf ### txt2pdf
(uses reportlab) (uses reportlab)
<https://github.com/baruchel/txt2pdf> <https://github.com/baruchel/txt2pdf>
`$ python3 txt2pdf/txt2pdf.py -T 1 -B 2 -L 2 -R 1 writers.intro.txt -o test.pdf` `$ python3 txt2pdf/txt2pdf.py -T 1 -B 2 -L 2 -R 1 writers.intro.txt -o test.pdf`
`$ python3 txt2pdf/txt2pdf.py -m A4 -f fonts/fantasque/TTF/FantasqueSansMono-Regular.ttf -s 10 -v 0 -T 1 -B 1 -L 1.5 -R 1.5 data-workers.txt -o test.pdf` `$ python3 txt2pdf/txt2pdf.py -m A4 -f fonts/fantasque/TTF/FantasqueSansMono-Regular.ttf -s 10 -v 0 -T 1 -B 1 -L 1.5 -R 1.5 data-workers.txt -o test.pdf`
currently using: currently using:
@ -49,8 +64,10 @@ currently using:
### PDF2txt miner ### PDF2txt miner
The inverted tool of this process The inverted tool of this process
https://www.unixuser.org/~euske/python/pdfminer/
"What's It? PDFMiner is a tool for extracting information from PDF documents. Unlike other PDF-related tools, it focuses entirely on getting and analyzing text data. PDFMiner allows one to obtain the exact location of text in a page, as well as other information such as fonts or lines." <https://www.unixuser.org/~euske/python/pdfminer/>
*"What's It? PDFMiner is a tool for extracting information from PDF documents. Unlike other PDF-related tools, it focuses entirely on getting and analyzing text data. PDFMiner allows one to obtain the exact location of text in a page, as well as other information such as fonts or lines."*
## --- hyphenation --- ## --- hyphenation ---
@ -65,23 +82,32 @@ https://www.unixuser.org/~euske/python/pdfminer/
## --- commands --- ## --- commands ---
Generate the publication to PDF: Generate the publication to PDF:
`$ python3 create_all.py && python3 txt2pdf/txt2pdf.py -m A4 -f fonts/unifont-11.0.03.ttf -s 9 -v 0.05 -T 1 -B 0.9 -L 1.6 -R 1.4 data-workers.en.txt -o data-workers.en.pdf` `$ python3 create_all.py && python3 txt2pdf/txt2pdf.py -m A4 -f fonts/unifont-11.0.03.ttf -s 9 -v 0.05 -T 1 -B 0.9 -L 1.6 -R 1.4 data-workers.en.txt -o data-workers.en.pdf`
Add logos.pdf on last page with PDFTK Add logos.pdf on last page with PDFTK
`$ pdftk data-workers.en.pdf A=data-workers.en.pdf cat A52 output data-workers.en.backcover.pdf` `$ pdftk data-workers.en.pdf A=data-workers.en.pdf cat A52 output data-workers.en.backcover.pdf`
`$ pdftk data-workers.en.backcover.pdf multistamp logos.pdf output data-workers.en.logos.pdf ` `$ pdftk data-workers.en.backcover.pdf multistamp logos.pdf output data-workers.en.logos.pdf `
`$ pdftk A=data-workers.en.pdf B=data-workers.en.backcover.logos.pdf cat A1-51 B output data-workers.en.logos.pdf` `$ pdftk A=data-workers.en.pdf B=data-workers.en.backcover.logos.pdf cat A1-51 B output data-workers.en.logos.pdf`
PDFTK in one command: PDFTK in one command:
`$ pdftk data-workers.en.pdf A=data-workers.en.pdf cat A52 output data-workers.en.backcover.pdf && pdftk data-workers.en.backcover.pdf multistamp logos.pdf output data-workers.en.logos.pdf && pdftk A=data-workers.en.pdf B=data-workers.en.backcover.logos.pdf cat A1-51 B output data-workers.en.logos.pdf` `$ pdftk data-workers.en.pdf A=data-workers.en.pdf cat A52 output data-workers.en.backcover.pdf && pdftk data-workers.en.backcover.pdf multistamp logos.pdf output data-workers.en.logos.pdf && pdftk A=data-workers.en.pdf B=data-workers.en.backcover.logos.pdf cat A1-51 B output data-workers.en.logos.pdf`
## --- ASCII/UNICODE fonts --- ## --- ASCII/UNICODE fonts ---
Unicode art :) Unicode art :)
<http://xahlee.info/comp/unicode_ascii_art.html> <http://xahlee.info/comp/unicode_ascii_art.html>
<http://qaz.wtf/u/convert.cgi?text=This+is+pretty+fun+too.+Do+something+for+your+group+tag> <http://qaz.wtf/u/convert.cgi?text=This+is+pretty+fun+too.+Do+something+for+your+group+tag>
<https://coolsymbol.com/cool-fancy-text-generator.html> <https://coolsymbol.com/cool-fancy-text-generator.html>
<http://www.alanwood.net/unicode/> <http://www.alanwood.net/unicode/>
@ -91,8 +117,10 @@ Unicode art :)
## --- DUMP --- ## --- DUMP ---
[\/\]\<\?\'\)\(\[\\\"\w] `[\/\]\<\?\'\)\(\[\\\"\w]`
`░`
work work
@ -123,12 +151,12 @@ Unicode art :)
--- ---
Writers write Writers write
░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░ ▒▒▒▒▒▒▒▒▒▒▒▒▒▒ ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░ ▒▒▒▒▒▒▒▒▒▒▒▒▒▒
Data workers ░░░░░░░░░░░░ need data to ▒▒▒▒ with. work Data workers ░░░░░░░░░░░░ need data to ▒▒▒▒ with. work
The data that is used in the context The data that is used in the context
of Algolit, is written language. of Algolit, is written language.
Machine learning relies on many types Machine learning relies on many types
@ -138,11 +166,11 @@ Data workers ░░░░░░░░░░░░ nee
archives and are sometimes digitized. archives and are sometimes digitized.
But there are other kinds of writing But there are other kinds of writing
every human too. We could say that ░░░░░░░░░░░░ every human too. We could say that ░░░░░░░░░░░░
being who has access to the internet ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░ being who has access to the internet ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
is a writer each time they ▒▒▒▒▒▒▒▒▒ interact is a writer each time they ▒▒▒▒▒▒▒▒▒ interact
with algorithms. with algorithms.
We ░░ ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒. chat, write, We ░░ ▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒. chat, write,
click, like click, like
and share and share
@ -189,9 +217,9 @@ We ░░ ▒▒▒▒▒▒▒▒▒▒▒
P r ogram m ers a re writing the d P r ogram m ers a re writing the d
P r o g ram m e rs a re writing the P r o g ram m e rs a re writing the
P r o g r a m m ers a r e writing the P r o g r a m m ers a r e writing the
P r o g r a m m e rs are writing the d P r o g r a m m e rs are writing the d
P r o g r a m m e r s a r e wr i ting the dat P r o g r a m m e r s a r e wr i ting the dat
P r o g r a mme r s ar e writ ing the dataw P r o g r a mme r s ar e writ ing the dataw
P r o g r amm e r s are writing the datawo P r o g r amm e r s are writing the datawo
P r o g r amm e r s are writing t he datawo P r o g r amm e r s are writing t he datawo
P r o g ra m m er s a r e writ ing the datawork P r o g ra m m er s a r e writ ing the datawork
@ -206,9 +234,9 @@ P r o g r a mme r s ar e writ ing the dataw
P r ogramm e rs a r e writing the da P r ogramm e rs a r e writing the da
P r o gramm e r s a r e writing the d P r o gramm e r s a r e writing the d
P r o g ram m e rs a r e writing the d P r o g ram m e rs a r e writing the d
P r o g r a m m e rs are writing the d P r o g r a m m e rs are writing the d
P r o g r a m m e r s a r e w r iting the da P r o g r a m m e r s a r e w r iting the da
P r o g r a mme r s ar e writ ing the dataw P r o g r a mme r s ar e writ ing the dataw
P r o g ramme r s are writing the datawo P r o g ramme r s are writing the datawo
P r o g r ammer s a re w riting th e datawor P r o g r ammer s a re w riting th e datawor
P r o g r a m mers a r e writ ing the datawork P r o g r a m mers a r e writ ing the datawork