2009-07-31 Swhack IRC Log

00:00:48 *** Arnia_ (n=jgeldart@host86-176-154-128.range86-176.btcentralplus.com) has joined #swhack
00:02:01 *** Arnia has quit (Nick collision from services.)
00:02:03 *** Arnia_ is now known as Arnia
00:05:57 *** cre8radix has quit ("your mama! >:)")
00:25:14 *** chris2 has quit ("Leaving")
00:49:35 *** ephemerian has quit ("Leaving.")
02:00:09 *** MacTed (n=Thud@c-24-61-62-241.hsd1.ma.comcast.net) has joined #swhack
02:16:16 *** MoiraA has quit (Client Quit)
05:32:13 *** lmorchard is now known as lmorchard|away
05:37:21 *** linear_shift (n=linear_s@unaffiliated/linearshift/x-186235) has joined #swhack
05:37:47 <linear_shift> Does anyone here know anything about the Phenny IRC bot?
05:48:06 <Arnia> phenny, help
05:48:06 <phenny> Arnia: Hi, I'm a bot. Say ".commands" to me in private for a list of my commands, or see http://inamidst.com/phenny/ for more general details. My owner is sbp.
05:58:25 <linear_shift> Nono, I'm trying to get one connected.
05:58:38 <linear_shift> I then get this error message: http://pastebin.com/m582909f
05:58:52 <linear_shift> Tried different python versions too.
05:58:57 <linear_shift> 2.4 and 2.5
05:59:06 <linear_shift> When 2.6 didn't work.
06:19:30 <Arnia> .c 45000GBP in USD
06:19:31 <phenny> 45 000 British pounds = 74 200.5 U.S. dollars
06:19:45 <Arnia> .c 55000USD in GBP
06:19:46 <phenny> 55 000 U.S. dollars = 33 355.5704 British pounds
06:25:26 *** linear_shift has parted #swhack ("We part, but not forever, for forever is only a measure of time. lata, ls.")
08:06:20 *** laplink (n=link@193.157.66.240) has joined #swhack
08:10:16 *** cre8radix (n=jamaerah@c-base/crew/cre8radix) has joined #swhack
08:10:41 <cre8radix> moin
08:15:22 *** BigJibby has quit (zelazny.freenode.net irc.freenode.net)
08:22:51 *** BigJibby (n=matt@nikita.mattread.com) has joined #swhack
08:33:43 *** [1]jeanniecool (n=jeanniec@c-67-171-63-240.hsd1.wa.comcast.net) has joined #swhack
08:40:15 *** jeanniecool has quit (Read error: 60 (Operation timed out))
08:40:15 *** [1]jeanniecool is now known as jeanniecool
08:42:13 *** Talliesin (n=Talliesi@86.47.160.33) has joined #swhack
09:10:18 *** JimJibber has quit (zelazny.freenode.net irc.freenode.net)
09:11:21 <sbp> yo
09:11:36 *** cre8radix has quit ()
09:13:50 *** cre8radix (n=jamaerah@c-base/crew/cre8radix) has joined #swhack
09:14:55 <cre8radix> YOYO
09:17:44 <sbp> yo cre8radix
09:25:52 <laplink> .w primite
09:25:52 <phenny> primite — adjective: 1. feminine plural form of primit, 2. neuter plural form of primit
09:26:00 <laplink> .w primit
09:26:00 <phenny> Couldn't get any definitions for primit.
09:26:08 <laplink> .ety primit
09:26:09 <phenny> Can't find the etymology for "primit". Try http://etymonline.com/?search=primit
09:26:20 <laplink> .wik primite
09:26:22 <phenny> "The primitive recursive functions are defined using primitive recursion and composition as central operations and are a strict subset of the recursive functions (recursive functions are also known as computable functions)." - http://en.wikipedia.org/wiki/Primitive_recursive_function
09:26:25 <laplink> .wik primit
09:26:27 <phenny> Can't find anything in Wikipedia for "primit".
09:26:31 <laplink> grr
09:27:25 <sbp> .w premit
09:27:26 <phenny> Couldn't get any definitions for premit.
09:27:42 <sbp> .oed primit
09:27:43 <phenny> primit (no ety.)...)
09:28:36 *** JimJibber (n=jim@81-86-174-182.dsl.pipex.com) has joined #swhack
09:38:32 *** jeffarch has quit (Read error: 104 (Connection reset by peer))
09:44:40 *** jeffarch (n=jeff@70-41-156-138.cust.wildblue.net) has joined #swhack
09:52:06 *** jeffarch has quit (Read error: 54 (Connection reset by peer))
09:59:54 <[bjoern]> nuyo
10:01:46 * [bjoern] returns from another exam, only to leave for the next in 19.7 hours.
10:02:41 <[bjoern]> http://www.vanityfair.com/politics/features/2009/07/drug-czars-slideshow200907?slide=7
10:03:24 *** jeffarch (n=jeff@70-41-156-138.cust.wildblue.net) has joined #swhack
10:05:58 <sbp> all you have to do is pay unfashionable people to take drugs
10:06:04 <sbp> and heavily publicise that fact
10:06:16 <[bjoern]> i was thinking about Free Drugs For Terrorists.
10:06:20 <sbp> drug usage would halve overnight
10:06:23 <sbp> oh man
10:06:44 <sbp> terrorists are too fashionable though?
10:07:12 <sbp> oh this is great:
10:07:14 <sbp> [[[
10:07:14 <sbp> Most Popular
10:07:14 <sbp> Sarah Palin’s Resignation: The Edited Version
10:07:14 <sbp> Half-Naked Celebrities on the Beach!
10:07:15 <sbp> ]]]
10:07:24 <[bjoern]> I only set trends, I don't know anything about them.
10:09:08 <sbp> .title http://news.bbc.co.uk/1/hi/england/lincolnshire/8177107.stm
10:09:10 <phenny> sbp: BBC NEWS | UK | England | Lincolnshire | Flying underwear causes power cut
10:09:24 <sbp> “Engineers traced the fault and found a thong had short circuited a power line. / It is believed the clothing was carried by a helium balloon from a nearby party.”
10:11:05 *** dmiles has quit (Read error: 60 (Operation timed out))
10:13:13 *** jeffarch has quit (Read error: 54 (Connection reset by peer))
10:16:58 <[bjoern]> Russell's teapot has been recovered from orbit http://asset.soup.io/asset/0410/1739_24e0.jpeg
10:17:11 <sbp> http://newsimg.bbc.co.uk/media/images/46132000/gif/_46132603_broadband_speed_466.gif
10:19:00 <[bjoern]> She's working on her witch diploma http://asset.soup.io/asset/0410/1627_dfac_480.jpeg
10:19:53 <[bjoern]> tree-shirts http://asset.soup.io/asset/0410/1464_2e3b.jpeg
10:20:17 <[bjoern]> hmm http://asset.soup.io/asset/0410/1394_2a69_480.jpeg
10:20:35 <[bjoern]> maybe this has shitrude material http://www.xmarkjenkinsx.com/outside.html
10:22:19 <[bjoern]> phenny, tell cre8radix http://asset.soup.io/asset/0410/1304_f804_480.jpeg
10:22:19 <phenny> [bjoern]: I'll pass that on when cre8radix is around.
10:23:14 <cre8radix> hrhr
10:23:15 <phenny> cre8radix: 10:22Z <[bjoern]> tell cre8radix http://asset.soup.io/asset/0410/1304_f804_480.jpeg
10:23:15 <[bjoern]> these are becoming a meme http://asset.soup.io/asset/0410/1156_3a87_480.png
10:24:28 <sbp> http://farm4.static.flickr.com/3127/2924243505_d015527d83_o.jpg is probably most shitrude
10:26:26 <[bjoern]> I suppose this falls into the category, can't argue with what works? http://asset.soup.io/asset/0410/0810_f211_480.png
10:27:08 *** jeffarch (n=jeff@70-41-156-138.cust.wildblue.net) has joined #swhack
10:28:49 <sbp> the only possible good thing about that strip is the phrase "boob terrorist"
10:29:00 <[bjoern]> This is one thing I haven't thought about so far, and neither seen porn on the internet http://asset.soup.io/asset/0410/0363_3516_480.jpeg
10:30:22 <[bjoern]> Well it also doesn't have sharks with lasers.
10:30:32 <sbp> two rexes, or a rex and a regina?
10:30:59 <[bjoern]> Took me two reads to correctly read "regina"
10:31:02 <sbp> .gc "3.0.196.0"
10:31:02 <phenny> "3.0.196.0": 2,670
10:31:04 <sbp> yeah, heh
10:31:16 <sbp> (not a stardate)
10:31:56 <[bjoern]> those don't have double dots
10:33:03 <[bjoern]> This is one thing I had thought about http://asset.soup.io/asset/0409/9682_ca2b_480.png
10:35:26 <sbp> they misspelled ARAF
10:36:03 <sbp> wait, no they didn't
10:36:07 <sbp> that's good
10:36:24 <[bjoern]> Good, I was gonna say these two will rectify their mistake shortly with a huge brush http://asset.soup.io/asset/0409/9111_285e_480.jpeg
10:38:41 <[bjoern]> Not sure if i had seen this before http://asset.soup.io/asset/0409/8187_1b0e_800.jpeg
10:40:12 <sbp> that's hilarious
10:40:14 <sbp> who's the lady?
10:41:06 <[bjoern]> You would know if she had tattooed her name in the right place.
10:41:41 <sbp> her lack of forethought in this department is astonishing
10:42:17 <[bjoern]> it's at the g8 summit
10:42:25 <sbp> gr8
10:42:56 <sbp> when you type 8 in an English sentence, what do you say in your head?
10:43:04 <sbp> "eight", "acht", or "I'm hungry"?
10:43:28 *** cre8radix has quit ()
10:44:33 <sbp> wenn ich spreche Deutsch hätte ich gesagt: "acht", "eight", or "ich bin hungrig"?
10:44:34 <[bjoern]> I am not sure about typing, but with reading sometimes option one, usually options 2 and 3.
10:45:04 <sbp> *oder
10:45:35 <sbp> oh, and an even better question
10:45:46 <sbp> when you make shit up in German, how do you decide what gender a word should be?
10:48:58 <[bjoern]> http://www.reuters.com/news/pictures/searchpopup?picId=10806432 does not say
10:49:21 <[bjoern]> shit as in nouns?
10:49:25 <sbp> yeah
10:49:45 <[bjoern]> What makes you think I would know?
10:49:56 <sbp> you are well trained in the art of making shit up
10:50:25 <sbp> I expect you to make shit up, therefore, about making shit up
10:50:49 <sbp> perhaps I should expect that not expecting would lead to more andisappointment
10:51:15 <[bjoern]> No, how would I know how I decide the gender?
10:51:22 <sbp> I love how the Reuters caption completely avoids commenting on the actual picture
10:51:32 <sbp> I don't know, just make something up
10:51:48 <sbp> "gender is determined based on the number of funny sounding vowels in the new noun"
10:51:52 *** jeffarch has quit (Read error: 54 (Connection reset by peer))
10:52:01 <sbp> "gender is determined by application to the supreme court, for a nominal fee"
10:52:17 <sbp> "gender is determined solely on the basis of how much alcohol you imbibed prior to the coinage"
10:52:35 <sbp> "gender must match the gender of the coinage"
10:52:47 <sbp> "gender is determined on whatever because nobody gives a fuck"
10:53:21 <sbp> actually I'm assuming the latter is the proper answer
10:54:41 <[bjoern]> I did not know that west frisian distinguishes between Utrum and Neutrum, well except for the Schiermonnikoog dialect of course.
10:55:05 * sbp works on the basis that those are new elements
10:55:07 <[bjoern]> http://de.wikipedia.org/wiki/Utrum has no enwik version
10:55:20 *** jeffarch (n=jeff@70-41-156-138.cust.wildblue.net) has joined #swhack
10:55:37 <[bjoern]> they are grammatical genders.
10:56:02 <sbp> “The Genus utrum (Ger. which both [always]) or commune (dt. together) is a grammatical gender, which is a common form for masculine and feminine means. The only counterpart to this is the neuter ( "utrum ne").”
10:56:28 <sbp> set union of masc. and fem.?
10:56:51 <sbp> Swedish has a better article
10:56:54 <sbp> but it's in Swedish
10:57:09 <sbp> http://translate.google.com/translate?hl=en&sl=sv&tl=en&u=http%3A%2F%2Fsv.wikipedia.org%2Fwiki%2FUtrum
10:57:09 <Monty> "you"
10:57:12 <sbp> nu
10:59:18 <sbp> ah, in English it's called the common gender
10:59:19 <sbp> "(grammar) In some languages, particularly Germanic languages, of the gender originating from the coalescence of the masculine and feminine categories of nouns."
10:59:45 <sbp> http://en.wikipedia.org/wiki/Grammatical_gender#Common_and_neuter
10:59:59 <sbp> “(note that the common/neuter distinction is close to animate/inanimate)”
11:00:40 <sbp> so yes, set union of masc. and fem.
11:00:46 <sbp> as distinct from neuter
11:00:47 <[bjoern]> My post-googlian conclusion is that I JUST FUCKING KNOW.
11:00:53 <sbp> heh, heh
11:02:16 <sbp> I think I read a paper on this once
11:02:25 <sbp> and actually, I remember asking my French teacher something similar
11:02:28 <[bjoern]> There needs to be a proper complement phrase for "on the internet".
11:02:31 <sbp> I said, how do you know the gender of new words?
11:02:35 <sbp> she said you just have to ask
11:02:47 <sbp> her example was "CDs". she didn't know if CDs were masc. or fem., so she just asked someone
11:02:57 <sbp> non the internet
11:03:08 <[bjoern]> non the internet disclaimers http://asset.soup.io/asset/0409/8425_d619_480.jpeg
11:03:35 <sbp> that font should be unallowed
11:03:40 <[bjoern]> This made me almost chuckle http://asset.soup.io/asset/0409/8296_5bb6_480.jpeg
11:03:58 <sbp> chox from me
11:05:20 <[bjoern]> http://asset.soup.io/asset/0409/7331_079e.jpeg
11:09:39 <[bjoern]> http://asset.soup.io/asset/0410/1911_923b.jpeg
11:11:09 *** jeffarch has quit (Success)
11:11:09 <sbp> the scary thing about 079e is I'm not sure how many of those are parodies
11:11:12 <[bjoern]> http://asset.soup.io/asset/0410/1792_a911_480.jpeg
11:12:47 <[bjoern]> http://asset.soup.io/asset/0410/1715_0a74.jpeg
11:12:51 <sbp> ugh, "math"
11:13:16 <sbp> reminds me of the Japanese product they called "homo sausage"
11:13:34 <sbp> (I'd say this is probably apocryphal, but it's probably not)
11:15:35 *** jeffarch (n=jeff@70-41-156-138.cust.wildblue.net) has joined #swhack
11:16:38 *** jeffarch has quit (Read error: 54 (Connection reset by peer))
11:19:11 <[bjoern]> http://www.nytimes.com/2009/07/27/arts/design/27border.html?_r=1
11:19:29 <[bjoern]> i like under what category they put that
11:19:45 *** jeffarch (n=jeff@70-41-156-138.cust.wildblue.net) has joined #swhack
11:20:42 <sbp> ‘“There were security concerns,” said Kelly Ivahnenko, a spokeswoman for the customs agency. “The sign could be a huge target and attract undue attention. Anything that would place our officers at risk we need to avoid.”’
11:20:59 *** jeffarch has quit (Read error: 104 (Connection reset by peer))
11:21:56 <[bjoern]> [[[
11:22:08 <[bjoern]> A Microsoft-Yahoo deal
11:22:08 <[bjoern]> Also in Tech »
11:22:08 <[bjoern]> Video: distracted drivers
11:22:08 <[bjoern]> Instant love, followed by letdown
11:22:14 <[bjoern]> ]]]
11:22:24 <[bjoern]> hmm less confusing on irc than on the site
11:22:52 <sbp> yeah
11:23:06 <sbp> .title http://www.neatorama.com/2007/09/03/clowns-kicked-kkk-asses/
11:23:08 <phenny> sbp: Clowns Kicked KKK Asses - Neatorama
11:23:08 *** jeffarch (n=jeff@70-41-156-138.cust.wildblue.net) has joined #swhack
11:24:19 *** jeffarch has quit (Read error: 104 (Connection reset by peer))
11:26:55 <[bjoern]> I like how the BBC is now doing DIY guides http://news.bbc.co.uk/2/hi/europe/8154029.stm
11:27:29 <[bjoern]> We need a google suggest phenny command
11:28:08 <[bjoern]> you go .cmd do it yourself -> network, wedding invitations, divorce, pest control, hairstyles, wedding, invitations, magazine, plumbing, wedding programs.
11:30:51 <[bjoern]> don't click if this is a bad time for a seizure http://asset.soup.io/asset/0410/2179_1e9a.gif
11:56:29 <[bjoern]> countdown
11:56:29 <Monty> One large one and five small numbers: 25, 5, 9, 9, 4, 2. Your target is 723. You have thirty seconds, tick, tock, tick, tock...
11:56:59 <Monty> Duh dum, duh dum, duh-da-da-dum! Your 30 seconds are up!
11:57:01 <Monty> I found a solution in approximately 15 milliseconds. I found all 8 solutions in about 46 milliseconds.
11:57:02 <Monty> ibot (5*4 + 9)*25 - 2
11:57:03 <phenny> (((5 * 4) + 9) * 25) - 2 = 723
11:57:10 <[bjoern]> countdown
11:57:10 <Monty> One large one and five small numbers: 25, 9, 4, 5, 1, 5. Your target is 693. You have thirty seconds, tick, tock, tick, tock...
11:57:35 <[bjoern]> .c (5*5)*(25+4)
11:57:36 <phenny> (5 * 5) * (25 + 4) = 725
11:57:40 <Monty> Duh dum, duh dum, duh-da-da-dum! Your 30 seconds are up!
11:57:42 <Monty> Oh bugger, I don't think I can solve that one!
11:57:46 <[bjoern]> countdown
11:57:46 <Monty> One large one and five small numbers: 50, 2, 4, 9, 2, 10. Your target is 147. You have thirty seconds, tick, tock, tick, tock...
11:58:16 <Monty> Duh dum, duh dum, duh-da-da-dum! Your 30 seconds are up!
11:58:17 *** jeffarch (n=jeff@70-41-156-138.cust.wildblue.net) has joined #swhack
11:58:18 <Monty> I found a solution in approximately 16 milliseconds. I found all 34 solutions in about 31 milliseconds.
11:58:18 *** cre8radix (n=jamaerah@c-base/crew/cre8radix) has joined #swhack
11:58:19 <Monty> ibot ((50 + 9)*10 - 2)/4
11:58:20 <phenny> (((50 + 9) * 10) - 2) / 4 = 147
11:58:23 <[bjoern]> stone
11:58:23 <Monty> I chose paper - I win! :o)
11:58:35 <[bjoern]> Monty has a Glückssträhne.
11:58:40 <Monty> My secret: exasperating tinkering figure magnifies M-Saunders's deceased personality.
11:59:00 <[bjoern]> And I am far too tired, and hungry, for it...
12:08:43 <tav> stone
12:08:43 <Monty> I chose stone - Draw!
12:08:54 <tav> paper
12:08:54 <Monty> I chose stone - You win :(
12:08:55 <dahut> I had paper, nobody wins.
12:09:07 <tav> hehe
12:21:00 <[bjoern]> phenny, tell sbp "Move DES to Historic Status for Kerberos" - draft-lha-des-die-die-die-00.txt
12:21:01 <phenny> [bjoern]: I'll pass that on when sbp is around.
12:35:31 <sbp> hehe
12:44:18 *** MacTed has quit ()
12:44:25 *** sivy (n=sivy@68-186-153-156.dhcp.sffl.va.charter.com) has joined #swhack
12:44:25 <Monty> howdy, sivy
13:01:10 *** sivy_afk has quit (Read error: 110 (Connection timed out))
13:16:16 <sbp> phenny: tell Talliesin I only just noticed the amazingly close similarity of Caitlin the name to cailín the word. coincidence, or more than just coincidence?
13:16:16 <phenny> sbp: I'll pass that on when Talliesin is around.
13:18:23 <sbp> [bjoern]: http://en.wikipedia.org/wiki/Grammatical_gender#Gender_assignment
13:18:26 <sbp> has some stuffs
13:29:13 *** MacTed (n=Thud@63.119.36.36) has joined #swhack
13:54:56 *** lmorchard|away is now known as lmorchard
13:55:51 *** kpreid has quit ()
13:56:24 *** kpreid (n=kpreid@cpe-67-249-56-101.twcny.res.rr.com) has joined #swhack
15:26:25 <sbp> data is crazy these days
15:26:25 <sbp> Length: 5378452444 (5.0G) [application/octet-stream]
15:26:26 <sbp> Saving to: `enwiki-20090724-pages-articles.xml.bz2'
15:26:26 <sbp> 2% [                    ] 117,871,224 11.2M/s eta 7m 44s 
15:26:32 <sbp> on the one hand, what a huge fucking file!
15:26:39 <sbp> on the other, not taking long to download :-)
15:33:59 <sbp> okay, now, how to process it...
15:35:32 <_ulises> cat file | process.py --size=really_big_fuck_off_piece_of_data
15:36:20 <sbp> I was thinking about just doing it in shell
15:36:40 <sbp> not sure if there's a command that I want though
15:38:58 <_ulises> that takes the --size parameter?
15:39:05 <sbp> nope, see this is kinda stupid
15:39:13 <sbp> I want to uniq -c on unsorted input
15:39:21 <sbp> obviously I don't want to sort the input here
15:39:27 <sbp> but uniq -c won't do that
15:39:34 <_ulises> 'course not
15:39:37 <sbp> there's no option to say "don't take the input as sorted"
15:40:00 <sbp> which is pretty silly
15:40:00 <_ulises> stupid uniq
15:42:44 <sbp> anybody got any ideas?
15:42:53 <sbp> just need to take a running tally of input lines
15:42:55 <sbp> unsorted
15:44:15 <jsled> awk '{ stuff[$0]++ } END { for (key in stuff) { print 'key:' stuff[key]; } }'
15:44:42 <jsled> Er. with non-retaded quoting in the END block.
15:45:18 <jsled> awk '{ stuff[$0]++ } END { for (key in stuff) { print key ": " stuff[key]; } }'
15:45:41 <sbp> IT WORKS YOUR FACE IS BEAUTIFUL TODAY
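For comparison, a rough Python equivalent of jsled's awk one-liner: a running tally of input lines that does not need sorted input. This is only a minimal sketch; the function name `tally` and the sample data are illustrative and do not come from the log.

```python
from collections import Counter


def tally(lines):
    """Count how often each line occurs; the input need not be sorted."""
    counts = Counter()
    for line in lines:
        # Strip the trailing newline so "a\n" and "a" count as the same key.
        counts[line.rstrip("\n")] += 1
    return counts


if __name__ == "__main__":
    sample = ["b\n", "a\n", "b\n", "c\n", "a\n", "b\n"]
    for key, count in tally(sample).items():
        # Same "key: count" layout as the awk version.
        print(f"{key}: {count}")
```

Like the awk array, the counter keeps every distinct line in memory, so memory use grows with the number of unique lines rather than with the size of the input.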
16:24:05 *** sivy has quit ()
16:26:01 <sbp> settled on this:
16:26:02 <sbp> nohup bzcat enwiki-20090724-pages-articles.xml.bz2 | \
16:26:02 <sbp>  sed 's!<text xml:space="preserve">!!; s!</text>!!' | grep -v '<' | \
16:26:02 <sbp>  sed -r 's/\{\{[^}]+\}\}//g; s/&[^;]+;//g; s/http:[^ ]+//g' | \
16:26:02 <sbp>  egrep -o '[A-Za-z]+' | tr 'A-Z' 'a-z' | \
16:26:02 <sbp>  awk '{ words[$0]++ } END { for (word in words) { print words[word] " " word } }' \
16:26:04 <sbp>  > words.txt &
16:27:10 * sbp watches awk's memory usage climb in top
16:27:21 <clsn> sbp: what the hell do you think you're doing with that?
16:27:41 <sbp> very naïve English word frequency table
16:27:58 <clsn> Oh.
16:27:58 <sbp> on like 5 GB of *compressed* input
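The filtering stages of the pipeline above can be sketched in Python as follows. This is an approximation of the sed/grep/egrep/tr steps under the assumption that input arrives one dump line at a time; the function name `words_in_line` is hypothetical and this is not the script sbp actually ran.

```python
import re


def words_in_line(line):
    """Return the lowercased alphabetic words the shell pipeline would keep."""
    # sed: strip the first opening and closing <text> tag on the line.
    line = line.replace('<text xml:space="preserve">', "", 1)
    line = line.replace("</text>", "", 1)
    # grep -v '<': drop any line that still contains markup.
    if "<" in line:
        return []
    # sed -r: remove {{templates}}, &entities; and http: URLs.
    line = re.sub(r"\{\{[^}]+\}\}", "", line)
    line = re.sub(r"&[^;]+;", "", line)
    line = re.sub(r"http:[^ ]+", "", line)
    # egrep -o '[A-Za-z]+' followed by tr to lowercase.
    return [word.lower() for word in re.findall(r"[A-Za-z]+", line)]
```

Each returned word corresponds to one line fed to the awk tally, so the two sketches can be chained in the same way as the shell stages.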
16:29:07 <sbp> awk is using a slightly uncomfortable amount of memory, but oh well
16:29:22 <sbp> what's the maximum acceptable amount of memory to be taking up on a multiuser system?
16:29:45 <clsn> sbp: I put Gentium on my home page with @font-face, what do you think? Many of the samples on your page looked okay (Gentium actually not the most so).
16:30:04 <jsled> sbp: naively, 1/Nth, where N is the number of active users … ?
16:30:30 <sbp> clsn: looks very good
16:30:44 <jsled> OR maybe it's 1/Mth, where M is the Interpersonal Value of the set of users.
16:30:48 <sbp> I don't really count Gentium because it's basically Palatino in my mind
16:30:57 <sbp> jsled: how do I find out how many active users?
16:31:05 <clsn> Thanks. I'm not even sure which was the best. I'm not even sure any were superior to using the net-standard Palatino/Book Antiqua.
16:31:05 <sbp> oh wait, top of top
16:31:07 <sbp> 4 users
16:31:10 <sbp> so, 25%. hmm
16:31:16 <clsn> Oof, Palatino and Gentium look very different I think.
16:31:27 <clsn> Though I guess they both have sort of a painterly look to 'em.
16:31:43 <sbp> they don't look similar, but to me they have a similar feel
16:31:52 <clsn> OK.
16:32:07 <sbp> perhaps because Arnia likes 'em both
16:32:12 <jsled> I actually like the second metric, because you're clearly more interesting and probably better looking than the other 3 users, so you can use more memory.
16:32:16 <clsn> On your test page I thought Liberation Serif came out particularly clear.
16:32:20 <sbp> 13120 sbp    20  0 189m 180m 724 S  17 9.0  1:08.93 awk
16:32:21 <jsled> I mean, what are they all doing on the machine? Porn, probably.
16:32:27 <clsn> I would like Palatino more if it weren't so overused.
16:32:28 <sbp> yeah, Liberation Serif *looks* good
16:32:33 <sbp> but then try using it on a page. not so good
16:32:42 <sbp> similar for a few others
16:32:46 <sbp> MgOpen Canonica especially
16:32:49 <clsn> And the Linux Libertine wasn't bad either.
16:32:52 <sbp> that font looks awesome in a short sample
16:32:55 <sbp> but in a large one... eek
16:33:09 <clsn> No, MgOpen Canonica doesn't work for me.
16:33:13 <sbp> jsled: I agree with your metric
16:33:25 <sbp> to the point or extent where crschmidt won't moan at me
16:33:48 * clsn substitutes Liberation. Try it now.
16:34:19 <sbp> yeah, don't like it
16:34:19 <clsn> So sharp and clean!
16:34:28 <sbp> it has a slightly Times New Roman thing going on
16:34:39 <clsn> You don't? Too narrow/straight?
16:34:49 <sbp> yeah, too narrow I think is the main problem
16:35:00 <clsn> Ah....
16:35:12 <clsn> What about Linux Libertine?
16:35:32 <sbp> lemme know when to refresh. now?
16:35:38 <clsn> Which is also Times-ish.  Oops, sec.
16:36:04 <clsn> Refresh now.
16:36:15 <clsn> But they have more humanistic proportions than Times.
16:36:31 <clsn> I think that closed eye of Times bugs me more than most things; these are more open.
16:36:32 <Monty> Can't find
16:37:03 <clsn> Tallys looked okay, but is too small. I'd have to change the font size, which is too much like work.
16:38:19 <sbp> took an age to load
16:38:24 *** dmiles_afk (n=dmiles@c-76-104-220-73.hsd1.wa.comcast.net) has joined #swhack
16:38:26 <sbp> doesn't actually look too bad, but it's far too small
16:38:57 <clsn> Mebbe I'll go back to Liberation. That was a worry with Gentium; the font is *big*.
16:39:10 <clsn> OTOH, I don't want to be missing my beloved þ and ƿ and whatever when I need them.
16:39:40 <sbp> heh, heh
16:39:52 <sbp> what about Junicode?
16:39:55 <clsn> If I have a lot of @font-face definitions, but only use one or two, are the others still loaded in?
16:40:05 <sbp> er, I think they might be yeah
16:40:09 <sbp> not entirely sure
16:41:09 <clsn> Junicode is GPLd, I guess that would be okay.
16:41:17 <clsn> Well, then, I'll comment out the unused ones.
16:41:19 <sbp> GPL?! ew
16:41:32 <sbp> 13120 sbp    20  0 310m 301m 724 S  15 15.0  2:38.75 awk
16:41:43 <sbp> 15%. it's still good, it's still good!
16:41:47 <clsn> More a matter of whether or not you CAN use it than whether or not it's going to infect you.
16:42:01 <clsn> i.e. something proprietary would NOT be the thing to use on your web page.
16:42:08 <sbp> well, also the side point of whether you want to "support" it
16:42:23 <clsn> I don't have the same distaste as you do.
16:42:53 <sbp> a side point even for me
16:42:56 <sbp> I use plenty of GNU code
16:42:56 <clsn> Basically, anything approximately "open" will do for me. Actually, anything closed will do as well, so long as it's just me.
16:43:05 <sbp> heh
16:43:29 <clsn> (I admit to not being particularly a stickler for such things)
16:43:51 <sbp> me neither, I use plenty of proprietary stuff
16:44:29 <clsn> Once I have my grubby little hands on it, it's fair game.
16:44:49 <clsn> As the saying goes, if it's not nailed down, it's mine. And if I can pry it loose, it's not nailed down. :)
16:44:50 <sbp> is that your policy in all things?
16:45:11 <sbp> if it moves, nab it
16:46:48 <sbp> I wonder how efficient awk's memory management is?
16:46:48 <sbp> seems to be using a heck of a lot
16:46:53 <sbp> I suppose it's all the weird unique words
16:46:59 <sbp> I'd hoped filtering out URIs would help with that
16:47:02 <sbp> well, presumably it has
16:47:43 <sbp> also I expect the amount of new unique words coming in to drop off over time
16:47:55 <sbp> and it seems like it is, somewhat, but... still a bit alarmingly steep
16:48:05 <clsn> Put up Junicode, but now everything is in italic.
16:48:15 <sbp> 13120 sbp    20  0 391m 382m 724 S  17 19.1  3:48.76 awk
16:48:18 <sbp> clsn: hehe
16:49:32 <clsn> Back to Liberation, until I find something better.
16:52:53 <clsn> *blink* Trying Fontin, but it doesn't look like it's working. Does Opera not do opentype?
16:53:45 <clsn> No, I think Prociono and Tallys worked...
16:54:48 <sbp> shame I can't get a progress report from bzcat
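One possible way to get a progress figure while decompressing is to do the reading in Python and watch how far the underlying compressed file has been consumed. The sketch below assumes the standard `bz2` module; the helper name `lines_with_progress` is made up for illustration, and the fraction reflects buffered reads of compressed bytes, so it is only a coarse estimate.

```python
import bz2
import os


def lines_with_progress(path):
    """Yield (fraction of compressed bytes read so far, line) for a .bz2 file."""
    total = os.path.getsize(path)
    # Open the raw file ourselves so its position can be inspected while
    # BZ2File pulls compressed data from it.
    with open(path, "rb") as raw, bz2.BZ2File(raw) as stream:
        for line in stream:
            yield raw.tell() / total, line
```

Printing the fraction every few million lines would give a rough percentage, at the cost of doing the decompression in Python rather than in bzcat.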
16:55:01 <sbp> 13120 sbp    20  0 464m 455m 724 S  19 22.7  4:59.87 awk
16:55:05 <clsn> Weird. Wonder what's with Fontin.
16:55:17 <sbp> did you chmod it and stuffs?
16:56:25 <clsn> Certainly looks readable. You're not seeing it either?
16:56:53 <clsn> OK, there's an error in the boldface in my css...
16:57:00 <sbp> nope, I see it
16:57:08 <sbp> I don't like Fontin
16:57:17 <clsn> Wow. The boldface had to be fixed before the rest worked. And now it's all in smallcaps.
16:57:28 <sbp> looks like somebody painted your webpage using one of the calligraphy tools in the GIMP
16:57:44 <sbp> works fine for me
16:57:44 <clsn> which is particularly interesting since i never reference the smallcaps flavor of fontin in the css.
16:59:00 <clsn> I think I might like it if it didn't shout caps at me. EVEN after I *deleted* the -Smallcaps variant .otf!
16:59:37 <clsn> That
16:59:41 <clsn> That's really weird.
16:59:59 <sbp> over 25%
17:00:08 <clsn> Back to Liberation. Sigh.
17:00:26 <sbp> see how far I can push it now before I get afraid of a visit from crschmidt
17:00:44 <sbp> jessica: ping
17:00:46 *** TheOracle23 (n=sroracle@c-98-215-178-14.hsd1.in.comcast.net) has joined #swhack
17:00:58 <sbp> hi TheOracle23
17:01:06 <sbp> what is thy bidding?
17:01:10 <TheOracle23> :O
17:01:21 <TheOracle23> Well, I use phenny a lot, it's very nice
17:01:47 <TheOracle23> Just that it disconnects randomly when using asynchat.py because it often references non-existent data
17:02:39 <TheOracle23> I've made a patch for it using a try~except on line...ugh I can't remember
17:02:56 <TheOracle23> But yesterday she disconnected again but with a different line number O_O
17:07:06 <sbp> okay, just asked crschmidt
17:07:11 <sbp> he says I can use 1GB virt
17:07:27 <TheOracle23> ?
17:07:55 <sbp> TheOracle23: often references non-existent data?
17:07:57 <sbp> what kind of data?
17:08:06 <TheOracle23> well let me pull it up here
17:08:58 <sbp> thinking about it, 5 GB is what bz2 can manage
17:09:06 <TheOracle23> Line 244: del self.producer_fifo[0]
17:09:13 <sbp> so I guess hoping it's going to fit into 1 GB even when parsed in this way is a bit naïve
17:09:21 <sbp> I mean, bz2 ought to be making a similar kind of table
17:09:27 <TheOracle23> When phenny disconnects she returns "Index error: deque index out of range"
17:09:46 <TheOracle23> So I wrapped that in a try~except
17:10:05 <TheOracle23> Seems to be working, but yesterday it had the same error but with a different line #
17:10:22 <TheOracle23> I think it was 213
17:10:36 <TheOracle23> "del self.producer_fifo[0]"
17:10:48 <TheOracle23> the same del statement
17:11:23 <sbp> weird
17:11:25 <_ulises> head of an empty list? tsk tsk
17:12:37 <TheOracle23> It would disconnect on Line 244 about every other time I sent a command, so it was pretty ridiculous
17:12:56 <sbp> TheOracle23: what OS and version of Python, out of interest?
17:13:02 <TheOracle23> Like I would send a part command and it would part, but immediately disconnect
17:13:26 <TheOracle23> sbp: Python 2.6 running on MS-Windows XP Service Pack 3
17:14:58 <sbp> I've not had the problem on OS X or Linux
17:15:00 <sbp> so I wonder if it might be a Windows problem
17:15:13 <TheOracle23> hmm
17:15:29 <sbp> I mean the fact that it's a problem *in asynchat* is kinda weird
17:15:50 <TheOracle23> I would give you the entire error but I can't find it
17:16:05 <TheOracle23> I could disable the try~except and then wait until it does?
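The failure described above, deleting the head of an empty queue, can be reproduced and guarded in isolation. The sketch below uses a plain `collections.deque` rather than asynchat itself, and the helper name `drop_head` is hypothetical; it illustrates the kind of try/except workaround being discussed, not TheOracle23's actual patch.

```python
from collections import deque


def drop_head(fifo):
    """Delete the first queued item; return False if the queue was empty."""
    try:
        del fifo[0]
    except IndexError:
        # An empty deque raises "IndexError: deque index out of range".
        return False
    return True


if __name__ == "__main__":
    queue = deque(["PART #swhack"])
    print(drop_head(queue))  # the single item is removed
    print(drop_head(queue))  # nothing left, so the guard reports it
```

Swallowing the exception keeps the connection alive, but it does not explain why the queue was already empty, which is the question sbp raises about the platform.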
17:22:01 *** cre8radix has quit ()
17:24:58 <deltab> colourful castle on BBC1 now
17:27:01 <deltab> Dover Castle was a “very posh B&B”
17:31:49 <clsn> There. Got Fontin working for me. It apparently matters what *order* the @font-faces are listed, to Opera. I'm kinda liking it.
17:33:15 *** JimJibber has quit (zelazny.freenode.net irc.freenode.net)
17:34:31 <clsn> OK, off to hunt food for the weekend.
17:36:54 <sbp> 13120 sbp    20  0 804m 795m 724 S  14 39.6 12:08.75 awk
17:38:06 <TheOracle23> brb
17:38:08 *** TheOracle23 has parted #swhack ()
17:39:58 <sbp> I should've done this in python
17:40:01 <sbp> could've used a shelf
17:40:02 <sbp> oh well
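A minimal sketch of what a shelf-backed tally might look like, assuming Python's standard `shelve` module. The function names and the path argument are illustrative, and this is not the `tally.py` that appears later in the log.

```python
import shelve


def tally_to_shelf(words, path):
    """Add one to the on-disk count for each word in `words`."""
    with shelve.open(path) as shelf:
        for word in words:
            # Counts live in the shelf file rather than in process memory.
            shelf[word] = shelf.get(word, 0) + 1


def read_counts(path):
    """Load every stored count back into an ordinary dict."""
    with shelve.open(path) as shelf:
        return dict(shelf)
```

The trade-off is speed: every increment goes through the dbm backend, so this would likely be much slower than an in-memory table, in exchange for bounded memory use.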
17:41:17 <[bjoern]> Just when I was contemplating what to ask of the oracle.
17:41:18 <phenny> [bjoern]: 13:29Z <sbp> tell [bjoern] http://u.nu/39qp
17:41:29 <[bjoern]> I doubt i nu.
17:41:46 <sbp> there is much evidence that u nu
17:41:53 <[bjoern]> Hadn't I mentioned it some time ago?
17:42:06 <sbp> .swhack bjoern.*u.nu
17:42:07 <phenny> sbp: http://swhack.com/logs/2009-02-27#T19-18-34
17:42:17 <sbp> .swhack bjoern.*u\.nu
17:42:18 <Monty> "you"
17:42:23 <[bjoern]> [20 Jun 09 18:31] * [bjoern] * .title u.nu
17:42:35 <sbp> that doesn't count
17:42:53 <[bjoern]> How about phenny's [20 Jun 09 18:31] * phenny * [bjoern]: u.nu :: the shortest urls. period. ?
17:42:55 <sbp> you have to be all "sbp! sbp! look! u.nu! / phenny: tell sbp u.nu. U.NU!!!"
17:43:02 <sbp> nope. has to be exactly as above
17:43:26 <[bjoern]> Well consider the events:
17:43:29 <[bjoern]> [20 Jun 09 18:31] * [bjoern] * .title u.nu
17:43:29 <[bjoern]> [20 Jun 09 18:31] * phenny * [bjoern]: u.nu :: the shortest urls. period.
17:43:29 <[bjoern]> [20 Jun 09 18:36] * sbp * --
17:43:53 <sbp> see, that's the start of a quote
17:43:57 <sbp> so I clearly wasn't paying attention
17:44:04 <sbp> I was reading awesome things to quote from
17:44:18 <[bjoern]> You went through the surrounding pixstream immediately after.
17:44:29 <[bjoern]> In the end we'll blame Morbus for distracting you.
17:44:45 <sbp> phew
17:45:20 <[bjoern]> u.nu does not support data: urls.
17:45:39 <sbp> what?!
17:45:41 <sbp> ugh
17:45:52 <sbp> what about tweak.tk?
17:46:06 *** Talliesin has quit ("Leaving.")
17:46:43 <[bjoern]> I never heard about it and already find it annoying.
17:48:38 *** TheOracle23 (n=sroracle@unaffiliated/theoracle23) has joined #swhack
17:48:54 <TheOracle23> sbp, I want to show you something :o
17:49:05 <sbp> TheOracle23: pix pls
17:49:20 <[bjoern]> TheOracle23: is it http://asset.soup.io/asset/0410/4956_5ad8.jpeg ?
17:49:37 <TheOracle23> Nope :O
17:50:18 <[bjoern]> TheOracle23: then http://asset.soup.io/asset/0410/4808_3ccd.jpeg perhaps? sbp might like to see more of that.
17:50:31 <TheOracle23> heh
17:50:34 <TheOracle23> sadly, no
17:50:50 <TheOracle23> ugh this isn't working atm >> hold on
17:50:55 <[bjoern]> This is a good idea for anti graffiti ads http://asset.soup.io/asset/0410/4776_56e6_480.jpeg
17:51:21 <sbp> wonder if she did that herself
17:51:49 <[bjoern]> one of them sucks less than the others: http://asset.soup.io/asset/0410/4762_fa32.jpeg
17:52:09 *** JimJibber (n=jim@81-86-174-182.dsl.pipex.com) has joined #swhack
17:52:33 <TheOracle23> hmm
17:52:34 <TheOracle23> odd
17:52:44 <TheOracle23> well it just doesn't want to work today
17:53:02 <sbp> [bjoern]: which one?
17:53:36 <TheOracle23> well here I pasted it @sbp http://pastesite.com/9342
17:54:16 <TheOracle23> I've added the search result text and the link to the results page to the google command, and it works nicely
17:54:28 <sbp> well done
17:54:31 <sbp> did you find it easy?
17:54:39 <TheOracle23> quite
17:54:42 <sbp> good, good
17:54:53 <TheOracle23> I simply inspected some of the output JSON for a query
17:54:58 <[bjoern]> It does not say.
17:55:14 <TheOracle23> and found all you had to do is reference some of those objects
17:55:57 * sbp starts writing the python version of the awk script
17:56:08 <[bjoern]> http://asset.soup.io/asset/0410/4263_10af_480.jpeg
17:57:02 <TheOracle23> well, I must be going
17:57:06 <TheOracle23> bye #swhack
17:57:10 *** TheOracle23 has quit ("(Killed (NickServ (GHOST command used by TheOracle)))")
17:57:29 <[bjoern]> hmm this could do with some modifications http://asset.soup.io/asset/0410/3966_c5dd.png
17:58:48 <[bjoern]> hmm hmm http://asset.soup.io/asset/0410/3849_6032_480.jpeg
17:59:23 <[bjoern]> hmm hmm hmm http://asset.soup.io/asset/0410/4438_7220_480.jpeg
18:03:10 <sbp> 13114 is the process I'll need to kill
18:09:15 <[bjoern]> I could not find a pix to which I could say "She won't help you do it"
18:09:23 <[bjoern]> http://asset.soup.io/asset/0410/2419_ba8a_480.jpeg
18:09:24 <[bjoern]> http://asset.soup.io/asset/0410/2939_2935_480.jpeg
18:10:10 <sbp> the latter one would have been good to say that to
18:10:38 <[bjoern]> no.
18:12:02 <sbp> okay, running:
18:12:02 <sbp> nohup bzcat enwiki-20090724-pages-articles.xml.bz2 | \
18:12:02 <sbp>  sed 's!<text xml:space="preserve">!!; s!</text>!!' | grep -v '<' | \
18:12:02 <sbp>  sed -r 's/\{\{[^}]+\}\}//g; s/&[^;]+;//g; s/http:[^ ]+//g' | \
18:12:02 <sbp>  egrep -o '[A-Za-z]+' | tr '{A-Z}' '{a-z}' | nice python tally.py &
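For readers following the pipeline above, here is a rough Python 3 sketch of what the sed/grep/sed/egrep/tr stages do to each line before the words reach tally.py. It is not sbp's code: the function name `words_from_lines` and the constant names are made up for illustration. One assumed difference is that Python's `[A-Za-z]` is ASCII-only, while egrep's range can depend on the locale.

```python
import re

# Patterns copied from the shell pipeline; the names are illustrative only.
TEXT_OPEN = '<text xml:space="preserve">'
TEXT_CLOSE = '</text>'
TEMPLATE = re.compile(r'\{\{[^}]+\}\}')   # {{...}} wiki templates
ENTITY = re.compile(r'&[^;]+;')           # &amp; and friends
URL = re.compile(r'http:[^ ]+')           # bare http: links
WORD = re.compile(r'[A-Za-z]+')           # ASCII letters only in Python


def words_from_lines(lines):
    """Yield lowercased words, mimicking the shell stages line by line."""
    for line in lines:
        # First sed: strip the first opening and closing <text> tag.
        line = line.replace(TEXT_OPEN, '', 1).replace(TEXT_CLOSE, '', 1)
        # grep -v '<': drop any line that still contains markup.
        if '<' in line:
            continue
        # Second sed: remove templates, then entities, then URLs.
        line = TEMPLATE.sub('', line)
        line = ENTITY.sub('', line)
        line = URL.sub('', line)
        # egrep -o plus tr: one lowercased word at a time.
        for word in WORD.findall(line):
            yield word.lower()
```

Feeding it an open file object (or any iterable of strings) gives a generator of words that a tally step could consume.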
18:12:40 <[bjoern]> What's your problem with xml:space?
18:13:09 <sbp> it's evil
18:13:11 <sbp> eeeevil
18:14:35 <sbp> okay, now I've just got to check it doesn't take up too much disc space
18:14:42 <[bjoern]> Reminds me of the former HTML WG concluding that Tim Bray does not understand the xml:space attribute.
18:15:37 <sbp> suppose he also misunderstands HTML conformance criteria?
18:16:23 <[bjoern]> If by "he" you mean the HTML WG, yes.
18:17:53 *** JimJibber has quit (zelazny.freenode.net irc.freenode.net)
18:18:23 *** JimJibber (n=jim@81-86-174-182.dsl.pipex.com) has joined #swhack
18:18:23 <sbp> yeah, I was referring to the email you showed xover
18:18:39 <sbp> .c 5238784 bytes
18:18:40 <phenny> sbp: Sorry, no result.
18:18:43 <sbp> .c 5238784 bytes in MB
18:18:44 <phenny> 5 238 784 bytes = 4.99609375 megabytes
18:19:07 <[bjoern]> Don't pretend I did things I don't remember doing.
18:19:22 <laplink> huh?
18:19:34 <sbp> hmm, laplink doesn't remember it either
18:19:45 * sbp goes and for^Wfinds documentary proof
18:19:50 <laplink> There's a 66.7% chance you're wrong.
18:20:29 <sbp> Jul 17 10:46:32 <[bjoern]>   [18:54] phenny, tell xover http://lists.w3.org/Archives/Public/public-rdf-in-xhtml-tf/2009Jul/0099.html
18:21:04 <laplink> ah
18:21:06 <sbp> xmlns:foo attributes, totally allowed in HTML 4. and XHTML 1.*
18:21:22 <sbp> as you guys ought to have known!
18:21:49 <laplink> There's a 33.3% chance we did and failed to admit it.
18:23:00 <[bjoern]> going from the former html wg and tim bray and two weeks back is a stretch.
18:26:00 <sbp> I'm concerned that the shelf size is not growing
18:26:13 *** jeffarch has quit (Success)
18:27:31 <sbp> hmm, but it is growing
18:27:40 <sbp> must be predicting space needed or doing something databasey
18:29:42 *** jeffarch (n=jeff@70-41-156-138.cust.wildblue.net) has joined #swhack
18:30:09 <sbp> .gc site:en.wikipedia.org swhack
18:30:11 <phenny> site:en.wikipedia.org swhack: 1
18:30:12 <sbp> .g site:en.wikipedia.org swhack
18:30:13 <phenny> sbp: http://en.wikipedia.org/wiki/User:Unforgettableid
18:31:18 <_ulises> sbp, whatcha up to anyway?
18:31:34 <sbp> making a frequency table of English words from Wikipedia
18:31:34 <_ulises> some trendy experiment involving del.icio.us and wikipedia? is this for your blawg?
18:31:37 *** jeffarch has quit (Read error: 104 (Connection reset by peer))
18:31:40 <sbp> a very naïve table
18:31:46 <sbp> nope, just because I like words
18:31:54 <_ulises> you like Word? zomg!
18:31:59 <sbp> words
18:32:12 <sbp> observe:
18:32:14 <_ulises> you used to have a sense of humour you know?
18:32:19 <sbp> .py 'words' == 'Word'
18:32:21 <phenny> False
18:32:24 <sbp> yeah, but then I started programming python
18:32:55 <_ulises> .py 'sense of humour' in ['s', 'b', 'p']
18:32:55 <phenny> False
18:32:59 <_ulises> there we go
18:33:21 <_ulises> is it programming Python or programming _in_ Python? (genuine question)
18:33:41 <sbp> programming in Python is formal
18:33:49 <sbp> programming python is colloquial
18:34:07 <_ulises> ah, cool, thx
18:37:04 <laplink> You're wrong.
18:37:10 <laplink> .title http://oreilly.com/catalog/9780596000851/
18:37:11 <phenny> laplink: Programming Python | O'Reilly Media
18:37:42 <sbp> that's American English
18:37:49 <sbp> _ulises and I are in the UK
18:38:14 <_ulises> give me an U give a K ...
18:38:27 <_ulises> though we're in very different coun'ries
18:38:29 <laplink> To whom?
18:39:10 <sbp> to one another
18:39:43 <sbp> same sovereign state, different countries
18:40:01 *** jeffarch (n=jeff@70-41-156-138.cust.wildblue.net) has joined #swhack
18:41:53 <Arnia> completely different legal systems for a start
18:41:55 <laplink> But to whom are you encouraging me to present a “K”?
18:42:48 <_ulises> I could do with an U and perhaps Arnia would like a K?
18:43:02 <sbp> gimme a Q
18:43:04 <Arnia> I'd like a P actually
18:43:07 <_ulises> Q!
18:43:08 * Arnia goes to the loo
18:43:16 <[bjoern]> You could also use a pie in the face.
18:43:17 * _ulises hands a P to Arnia ... happy looing!
18:43:22 <_ulises> here we go
18:43:25 <_ulises> same old abuse
18:43:30 <[bjoern]> Thank you.
18:43:36 <_ulises> don't know why I tolerate it any more
18:43:43 <sbp> occasionally there is different new abuse
18:44:00 <Arnia> That's exciting, but unlikely
18:44:01 <Monty> given the branching and posted before
18:44:06 <_ulises> paper
18:44:06 <Monty> I chose scissors - I win! :o)
18:44:07 <dahut> I had rock, _ulises wins.
18:44:12 <_ulises> yeah!
18:44:17 <sbp> Monty: play dahut
18:44:20 <Arnia> _ulises: middle of the road
18:44:22 <Monty> As we all know, PCI porno fireman scratches Deng Xiaoping's LuRcH's Gary Rhodes :P
18:44:30 <sbp> dahut: play Monty
18:44:34 <_ulises> PCI porno!
18:44:35 <Monty> definitely consider a “clean install,” or not ;) i 'upgrade' to prune the scholars?
18:44:42 <laplink> .gc "PCI porno fireman"
18:44:43 <phenny> "PCI porno fireman": 0
18:44:48 <laplink> .gc "porno fireman"
18:44:49 <phenny> "porno fireman": 64
18:45:13 <Arnia> .gc "PCI fireman"
18:45:14 <phenny> "PCI fireman": 0
18:45:22 <Arnia> .gc "PC fireman"
18:45:22 <sbp> .gc "porno firewoman"
18:45:23 <phenny> "PC fireman": 108
18:45:24 <phenny> "porno firewoman": 0
18:45:38 <sbp> .gc "firewoman porn"
18:45:39 <phenny> "firewoman porn": 29
18:45:48 *** laplink changed the topic to: "«'upgrade' to prune the scholars?» that is the question."
18:45:53 <_ulises> .gc "pci porno"
18:45:53 <phenny> "pci porno": 965
18:45:58 <_ulises> woop!
18:46:05 <_ulises> .g "pci porno"
18:46:06 <phenny> _ulises: http://www.bahaicampusassociations.com/index.php?option=content&task=view&id=13&Itemid=
18:46:14 <_ulises> Bahai? heh heh
18:46:53 <sbp> okay, I started running the same process on the cywiki output
18:46:58 <sbp> which is just 18 MB compressed
18:47:01 <sbp> and it hasn't exited yet
18:47:06 <sbp> so that's probably not a good sign
18:48:20 <laplink> depends what your goals are
18:49:32 <sbp> my goals are to get the same output as this:
18:49:33 <sbp> http://swhack.com/logs/2009-07-31#T16-26-02
18:49:44 <sbp> only without using any more than a gig of server memory
18:49:54 <sbp> and, preferably, to complete within a day or so
18:50:04 <sbp> not using up all the disc space would be handy too
18:50:57 <[bjoern]> You would not worry had you twittered your commands into the cloud.
18:51:43 <sbp> yeah I would, because then I'd have to pay for it
18:52:43 <[bjoern]> You are clouding the issue.
18:54:08 * laplink hopes to the gods that sbp has figured out to do a non-naive implementation of that…
18:55:46 <sbp> nope
18:57:56 * laplink despairs for whoever the dope you've tricked into hosting this is…
19:00:29 <jsled> sbp: know of 'pv' ?
19:00:46 <sbp> pv: no such command
19:00:55 <jsled> http://www.ivarch.com/programs/pv.shtml
19:01:27 <sbp> ooh, interesting
19:01:34 <sbp> don't suppose you can get it to work retroactively on commands?
19:01:58 <jsled> heh
19:02:06 <jsled> YOU"RE FUNNY GUY
19:02:18 <jsled> brilliant: http://thisiswhyyourefat.com/post/153052030/oreo-cupcake-with-built-in-milk-cup-submitted-by
19:03:48 <thelsdj> do you think you could use a modem over a voip line?
19:04:02 <sbp> yes. but why?
19:04:12 <thelsdj> to play doom 2 of course!
19:04:34 <[bjoern]> I propose we stone whoever mentioned cake without delivering cake.
19:04:44 <sbp> he sounds already stoned
19:04:50 <thelsdj> just to be clear, i don't mean voip over a modem, i mean a modem over voip
19:05:00 <sbp> yeah
19:05:00 <[bjoern]> Hinkelstein-style
19:05:25 <thelsdj> sbp: that 18M ever finish?
19:05:34 <sbp> nope, still going!
19:05:44 <thelsdj> awk is sloooow
19:05:52 <sbp> I'm not using awk now, I had to use python
19:05:55 <[bjoern]> 18MB is the bz2 size I would think.
19:05:59 <sbp> because awk started to use over 1 GB of memory
19:06:03 <thelsdj> ha
19:06:37 <sbp> I think there's a problem with the shelf, quite frankly
19:06:44 <sbp> it gets to like 250,000 keys and then it just grinds
19:08:18 <sbp> massively increasing the cache size
19:08:26 <sbp> see if that helps. maybe the constant syncing is buggering it
19:08:33 <sbp> well, it was sync-per-10000 lines
19:08:36 <sbp> raised it to a million
19:09:00 <thelsdj> sync to what? disk?
19:09:39 <jsled> .title http://www.loper-os.org/?p=44
19:09:44 <phenny> jsled: Loper OS » Blog Archive » There is no “Cloud”: There are only Other People’s Hard Drives.
19:10:03 <sbp> thelsdj: yeah
19:10:10 <sbp> using the python shelve module
19:10:15 <sbp> .g docs.python.org shelve
19:10:17 <phenny> sbp: http://docs.python.org/library/shelve.html
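The approach sbp describes (an in-memory cache that is written to a shelf every N lines) might look roughly like the Python 3 sketch below. This is a hypothetical reconstruction, not the real tally.py, which is not reproduced in this log; `tally`, `flush`, and `flush_every` are invented names, and the real script may batch by input line rather than by word.

```python
import shelve


def flush(cache, shelf):
    """Add the cached counts to the on-disk shelf, then empty the cache."""
    for word, count in cache.items():
        shelf[word] = shelf.get(word, 0) + count
    cache.clear()
    shelf.sync()  # push pending writes to disk


def tally(words, shelf_path, flush_every=10000):
    """Count words, keeping at most flush_every items' worth in memory."""
    cache = {}
    shelf = shelve.open(shelf_path)
    try:
        for n, word in enumerate(words, 1):
            cache[word] = cache.get(word, 0) + 1
            if n % flush_every == 0:
                flush(cache, shelf)
        flush(cache, shelf)  # write whatever is left at the end
    finally:
        shelf.close()
```

Raising `flush_every` trades memory for fewer disk round trips, which is the knob being turned from 10,000 to a million in the conversation.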
19:10:27 * sbp makes pv meanwhile
19:11:11 <thelsdj> how often is sync being called?
19:11:26 <thelsdj> (in wallclock time, about)
19:12:29 <thelsdj> jsled: i read that as 'I find it interesting that PCP never caught on...' and I was like, what do drugs have to do with this?
19:14:35 <sbp> sbp@helios:~/cywiki$ nohup bzcat cywiki-20090715-pages-articles.xml.bz2 |  ~/local/bin/pv -cN bzcat |  sed 's!<text xml:space="preserve">!!; s!</text>!!' | grep -v '<' |  sed -r 's/\{\{[^}]+\}\}//g; s/&[^;]+;//g; s/http:[^ ]+//g' |  egrep -o '[A-Za-z]+' | tr '{A-Z}' '{a-z}' |  ~/local/bin/pv -cN python | nice python tally.py
19:14:35 <sbp>   bzcat: 6.95MB 0:00:47 [ 133kB/s] [             <=>      ]
19:14:35 <sbp>  python: 3.2MB 0:00:46 [64.4kB/s] [              <=>     ]
19:14:43 <sbp> thelsdj: dunno in wallclock...
19:15:04 <sbp> mem use is only 0.6% though
19:15:09 <sbp> could probably crank it up another 10x
19:15:09 <Monty> also, yeah, too OLD for anti graffiti ads http://asset.soup.io/asset/0410/4776_56e6_480.jpeg
19:16:39 <_ulises> lovely though, eh Monty?
19:16:42 <Monty> fortunately, in on any of Python, out particularly important one million
19:19:05 <sbp> .c 90816956 bytes in MB
19:19:06 <phenny> 90 816 956 bytes = 86.6097984 megabytes
19:19:14 <sbp> that's the size of cywiki inflated
19:19:25 <sbp> bzcat: 41.4MB
19:19:33 <sbp> that's what we're up to using the 1/1000000 sync rate
19:20:13 <[bjoern]> you should have used the low german wikipedia.
19:20:23 <sbp> how big is that?
19:21:08 <[bjoern]> .g wikipedia ndswiki pages articles
19:21:09 <phenny> [bjoern]: http://marc.info/?l=wikipedia-l&m=112059109123515&w=2
19:21:47 <sbp> http://download.wikipedia.org/ndswiki/20090728/
19:21:55 <sbp> 11.3 MB
19:22:37 <[bjoern]> that's bigger than I thought
19:23:13 <[bjoern]> fywik also 9mb
19:23:23 <sbp> fy?
19:24:34 <[bjoern]> west frisian
19:26:00 <[bjoern]> I don't suppose there is a frr wik
19:26:12 <sbp> east frisian?
19:26:41 <[bjoern]> north, like mooring.
19:27:26 <sbp> .wik North Frisian
19:27:27 <phenny> "North Frisian is a minority language of Germany, spoken by about 10,000 people in North Frisia." - http://en.wikipedia.org/wiki/North_Frisian
19:27:30 <sbp> .wik East Frisian
19:27:30 <phenny> "East Friesian (sheep), a breed of sheep originating there" - http://en.wikipedia.org/wiki/East_Frisian
19:27:38 <sbp> .wik East Frisian (language)
19:27:39 <phenny> "East Friesian (sheep), a breed of sheep originating there" - http://en.wikipedia.org/wiki/East_Frisian
19:27:44 <sbp> .wik South Frisian
19:27:46 <phenny> "The Frisian–Frankish wars were a series of conflicts between the Frankish Empire and the Frisian kingdom in the 7th and 8th century." - http://en.wikipedia.org/wiki/Frisian%E2%80%93Frankish_wars
19:27:51 <sbp> .wik West Frisian
19:27:52 <phenny> "West Frisian language, the language spoken in the Friesland province of the Netherlands" - http://en.wikipedia.org/wiki/West_Frisian
19:28:02 <sbp> there are only sheep in East Frisia
19:28:15 <[bjoern]> .gc Ostfriesenwitz
19:28:16 <phenny> Ostfriesenwitz: 1,280
19:28:21 <thelsdj> sbp: will you post your python?
19:28:27 <sbp> sure
19:28:37 <[bjoern]> wp has West, Saterland, North, Middle, and Old.
19:28:51 <sbp>   bzcat: 86.6MB 0:14:48 [99.8kB/s] [  <=>                 ]
19:28:52 <sbp>  python: 41.9MB 0:15:07 [47.3kB/s] [      <=>             ]
19:28:56 <sbp> bzcat's done the whole file
19:29:01 <sbp> python still tagging along behind
19:29:29 <lisppaste2> sbp pasted "tally.py" at http://paste.lisp.org/display/84524
19:29:33 <[bjoern]> Frisian horses have 15-17 hands.
19:29:44 <sbp> ah, it finished!
19:29:51 <sbp>   bzcat: 86.6MB 0:14:48 [99.8kB/s] [  <=>                 ]
19:29:51 <sbp>  python: 41.9MB 0:15:07 [47.3kB/s] [      <=>             ]
19:30:24 <sbp> okay, now doing it with 10m sync
19:31:03 <sbp> oh, not much point
19:31:08 <sbp> there are only 2.5m lines
19:31:09 <sbp> heh
19:31:11 <sbp> well, see how it goes
19:31:23 <[bjoern]> http://www.ethnologue.com/show_language.asp?code=frs
19:31:57 <[bjoern]> reads like it'll be dead soon.
19:32:31 <sbp> do you know any?
19:32:40 <laplink> What happens if you delete the whole sync thing?
19:32:53 <sbp> that's what I've done, in effect
19:33:36 <sbp> I can't do that for the main script though
19:33:42 <sbp> because then it'll just use as much memory as it needs
19:33:48 <sbp> and as we saw for awk, that can reach 1GB
19:33:58 <sbp> at the moment it's using 0.9%
19:34:59 <sbp> looks like this isn't going to be much faster, here
19:35:01 <_ulises> are you doing this for a Google/MS interview?
19:35:04 <sbp> it only synced twice before anyway
19:35:06 <sbp> _ulises: heh
19:35:40 <sbp> I really have a feeling that large shelves are just inefficient
19:35:51 <sbp> rings a bell that I had this problem before
19:40:16 <sbp> mem at 1.5%
19:40:24 <sbp> so I guess 5,000,000 would be okay for the enwiki script
19:40:30 <sbp> that'd make it 3-5%
19:40:54 <sbp> probably more like 3% — it's still at 1.5% and nearly done
19:43:42 <sbp>   bzcat: 86.6MB 0:13:02 [ 113kB/s] [         <=>          ]
19:43:42 <sbp>  python: 41.9MB 0:13:09 [54.4kB/s] [          <=>         ]
19:43:46 <sbp> bit of an improvement
19:45:33 <sbp> okay, put enwiki up to 10m
19:45:36 <sbp> and running again
19:47:14 <sbp> interesting
19:47:24 <sbp> even though I used [a-z], it did the "right" thing:
19:47:25 <sbp> 2961 môr
19:47:32 <laplink> What was the wordcount out of that, and how large is the output file?
19:47:45 <sbp> sbp@helios:~/cywiki$ wc words.txt
19:47:45 <sbp> 303933 607866 3424363 words.txt
19:48:10 <sbp> gimme the awk for tallying a column and I'll run it
19:48:38 <sbp> it's actually filled with a disturbing amount of junk
19:48:53 <jsled> sbp: what're the lines like? "num word"?
19:49:00 <sbp> yeah
19:49:03 <sbp> sample from near the top:
19:49:04 <sbp> --
19:49:04 <jsled> { sum += $1 } END { print sum }
19:49:04 <sbp> 52803 yr
19:49:04 <sbp> 45727 mae
19:49:04 <sbp> 45046 category
19:49:04 <sbp> 43867 en
19:49:06 <sbp> 42797 kategori
19:49:08 <sbp> 39623 au
19:49:12 <sbp> 39329 ei
19:49:14 <sbp> 37803 n
19:49:16 <sbp> 32897 gan
19:49:18 <sbp> 31309 ac
19:49:20 <sbp> 29935 br
19:49:22 <sbp> 29609 the
19:49:24 <sbp> --
19:49:34 <sbp> sbp@helios:~/cywiki$ awk '{ sum += $1 } END { print sum }' words.txt
19:49:34 <sbp> 7547907
19:50:01 <sbp> lot of language codes, must be interwiki links
19:50:11 <laplink> Uhm. 7.5M unique words?
19:50:17 <sbp> and a surprising amount of English still crept in
19:50:20 <sbp> yup
19:50:22 <sbp> er, nope
19:50:24 <sbp> total words
19:50:31 <laplink> right
19:50:33 <sbp> unique words: 303933
19:50:46 <[bjoern]> note that his word extor is megafail.
19:51:02 <sbp> hey, it picks up môr
19:51:19 <thelsdj> heh i went to see what other options there are besides shelve but looks like there aren't really any if you want to store a number, which is probably part of the performance issue
19:51:30 <[bjoern]> what's the count for "http"?
19:51:30 <sbp> aye
19:51:39 <sbp> [bjoern]: 149
19:51:51 <sbp> --
19:51:52 <sbp> 149 mauritius
19:51:52 <sbp> 149 indiaidd
19:51:52 <sbp> 149 hystyried
19:51:52 <sbp> 149 http
19:51:52 <sbp> 149 hiraethog
19:51:54 <sbp> 149 gydol
19:51:56 <sbp> 149 flags
19:51:58 <sbp> 149 ffurfiol
19:52:00 <sbp> --
19:52:02 <sbp> lower down, it's not so bad
19:52:04 <sbp> just a lot of language tags and other wikicrap at the top
19:52:16 <sbp> you see "flags" is still there, which is English too
19:52:24 <laplink> .c 303933 / ((15 * 60)) + 7
19:52:24 <phenny> (303 933 / (15 * 60)) + 7 = 344.703333
19:52:33 <[bjoern]> you have a hiraethog in your ffurfiol, and it's flagsing your hystyried.
19:52:48 <sbp> .cy hiraethog
19:52:49 <phenny> No result. (BBC)
19:52:50 <phenny> No matches. (Geiriadur)
19:52:55 <sbp> .cy ffurfiol
19:52:56 <phenny> ffurfiol, adj formal, prim (BBC)
19:52:57 <phenny> ffurfiol formal adj. (Geiriadur)
19:53:06 <sbp> .cy hystyried
19:53:08 <phenny> ystyried, vb CONJUGATE consider, reflect, deem, deliberate, heed, mark, ponder, regard, view (BBC)
19:53:09 <phenny> No matches. (Geiriadur)
19:55:51 <sbp> one major problem with this approach on Welsh words is that it doesn't take into account mutations
19:55:53 <laplink> .c 303933 / ((15 * 60) + 7)
19:55:53 <phenny> 303 933 / ((15 * 60) + 7) = 335.097023
19:56:17 <thelsdj> so its about 10 times faster if you just use a dict, maybe manually pickle it from time to time?
19:56:42 <sbp> thelsdj: if you pickle it, how are you going to update it?
19:56:49 <sbp> remember, you can't store the whole dict in memory
19:57:31 <thelsdj> you sure you can't store it all in memory?
19:57:35 <sbp> yeah
19:58:00 <sbp> awk got to 1 GB quite quickly
19:58:10 <sbp> and I doubt python is more optimised than awk in that
19:58:25 <sbp> and anyway, I can see the kind of stats
19:58:35 <sbp> 1.5% memory usage in just 2,500,000 lines
19:58:42 <sbp> which is what you get from an 18 MB file
19:58:46 <sbp> now image a 5 GB file
19:58:53 <sbp> *imagine
19:59:03 <sbp> (1.5% in python, this is)
19:59:24 <sbp> the 5 GB is compressed, don't forget
19:59:33 <sbp> goodness knows what it is uncompressed. let's see!
20:00:27 <laplink> What's your test file?
20:00:43 <sbp> test file is the cywiki dump
20:00:44 <thelsdj> i think it might be useful to do even a minimal manual map-reduce type process where process parts of the file at a time then combine
20:01:07 <sbp> technically isn't that what shelve ought to be doing?
20:01:20 <laplink> memmap
20:03:03 <thelsdj> yea i guess if you want to do it all yourself than the disk is still the bottleneck and you probably don't have a bunch of disks to work off of, though there might be some cpu stuff that could be split between processors for a small amount of speedup
20:05:13 <thelsdj> i think would be interesting to see how big the 5GB file is after processed through grep/sed/etc
20:05:33 <thelsdj> and how many words it is
20:06:52 <thelsdj> hehe sort goes to 5.3% on 4GB ram just on the ndswiki
20:06:54 <sbp> we can roughly guess that from the ratio we got for the cywiki file
20:07:08 <thelsdj> ----
20:07:12 <thelsdj>  38617 county
20:07:12 <thelsdj>  53182 inwahners
20:07:12 <thelsdj>  58916 n
20:07:12 <thelsdj>  59445 is
20:07:12 <thelsdj>  66204 en
20:07:14 <thelsdj>  75824 un
20:07:17 <thelsdj>  82669 dat
20:07:19 <thelsdj>  95108 in
20:07:22 <thelsdj> 131280 vun
20:07:24 <thelsdj> 273099 de
20:07:27 <thelsdj> ------
20:07:50 <sbp> bork bork bork
20:08:37 <thelsdj> ndswiki gives a bit less than 3x the 12M bz2 file to 32M, 5.3M words
20:09:18 <laplink> hmm.
20:09:25 <laplink> a bz2 IO filter
20:09:31 <thelsdj> so 5G to 15G and oh 2.4 billion words?
20:09:34 <laplink> tie the file to an array
20:10:14 <laplink> loop over it, constrained by disk I/O + seek time, say an order of magnitude faster than the above
20:10:38 <_ulises> sbp, alternatively you could see http://stats.wikimedia.org/EN/TablesDatabaseWords.htm
20:10:41 <_ulises> no?
20:11:09 <laplink> hmm, at this point your memory consumption is what's required for the hash of unique words and their counts
20:11:26 <sbp> _ulises: only if it has dumps of the actual words and frequencies
20:11:30 <sbp> which it doesn't seem to
20:11:36 <_ulises> sbp, only up to 2006 :(
20:11:43 <_ulises> ah, not the total
20:11:59 <sbp> laplink: and that hash gets huge
20:12:00 <thelsdj> also, we could do a first pass of the en just to find out how many unique words there are
20:12:08 <thelsdj> that would be a lot cheaper than also counting them i think?
20:12:26 <thelsdj> then we could guess at the size of count hash
20:12:28 <sbp> hmm. I'd have thought only about a third cheaper
20:12:29 <laplink> say 10 characters average, and an int or so for the count
20:12:47 <sbp> because when you're storing {word: freq} that's only one extra value
20:12:47 <sbp> plus the hashtable structure cost
20:13:19 <[bjoern]> I tend to just write the words into a file, and do the counting when that is done.
20:13:25 <sbp> bzcat is still piping into wc
20:13:32 <sbp> so it takes quite a while just to cat the file
20:13:42 <sbp> (with cywiki it was pretty zippy, matter of seconds)
20:14:40 <[bjoern]> "Subject: Good evening dear from me Jessica"
20:15:18 <thelsdj> i think it would be more efficient to work on smaller hashes that can fit in memory, write them out to disk when done, and then use something else to combine them
20:15:36 <thelsdj> don't even write out the hash structure, just a sorted 'word count'
20:15:49 <sbp> hmm, q-mebs
20:15:50 <thelsdj> that way you can easily iterate over 2 'word count' files to output a 3rd
20:16:14 <[bjoern]> that's merge sorting tapes.
20:16:57 <thelsdj> its going to be WAAAY faster than the overhead of pickling and unpickling ints to a disk based hash for every word
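thelsdj's suggestion (tally chunks that fit in memory, write each out as a sorted 'word count' file, then merge the files) can be sketched in Python 3 as follows. The function names are illustrative, and the sketch assumes each input has one "word count" pair per line, sorted by word in Python's string order, with no word repeated within a single file.

```python
import heapq
from itertools import groupby


def parse(lines):
    """Turn 'word count' lines into (word, count) tuples."""
    for line in lines:
        word, count = line.split()
        yield word, int(count)


def merge_counts(*sorted_inputs):
    """Merge any number of word-sorted inputs, summing counts per word."""
    merged = heapq.merge(*(parse(lines) for lines in sorted_inputs))
    for word, group in groupby(merged, key=lambda pair: pair[0]):
        yield word, sum(count for _, count in group)
```

Because `heapq.merge` is lazy, only one line per input needs to be held in memory at a time, so the merged result can be streamed straight to a third file.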
20:17:39 <sbp> yeah, but that's only happening every 10m lines now
20:17:46 <sbp> so it can't be *that* bad, surely?
20:18:07 <[bjoern]> http://www.youtube.com/watch?v=yFPQbnraeVg should be covered by the geneva convention
20:18:10 <thelsdj> no
20:18:13 <thelsdj> it is that bad
20:18:34 <thelsdj> try a regular dictionary with a single pickle at the end on your cywiki
20:18:44 <thelsdj> its 10-20 times faster than shelve
20:19:26 <sbp> huh
20:19:27 <sbp> hmm
20:19:33 <thelsdj> i think shelve is pickling and unpickling the ints every time you pull in and out of dict
20:19:40 <sbp> ah!
20:19:48 <sbp> well that's bloody stupid
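For comparison, the "regular dictionary with a single pickle at the end" that thelsdj recommends could be as small as the Python 3 sketch below. The name `tally_in_memory` is made up, and the sketch only works when every unique word fits in memory, which is exactly the open question for the enwiki dump.

```python
import pickle
from collections import Counter


def tally_in_memory(words, out_path):
    """Count everything in RAM, then serialise the result exactly once."""
    counts = Counter(words)
    with open(out_path, 'wb') as f:
        # One pickle for the whole table, instead of one per lookup.
        pickle.dump(dict(counts), f, pickle.HIGHEST_PROTOCOL)
    return counts
```

A shelf, by contrast, pickles a value on every store and unpickles it on every fetch (unless opened with `writeback=True`), which fits the slowdown described above.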
20:20:05 <[bjoern]> hmm http://www.crazystuff.ch/naked-bikini-p-42.html bikini where the important parts dissolve after a few minutes in water.
20:20:07 <sbp> what's the point of using memory if you're also going to pickle it?
20:20:26 <thelsdj> i wrote my own that used int()/str() on a Berkeley DB file and was about the same speed as shelve
20:21:23 <sbp> .wik Eurocats
20:21:25 <phenny> "EUROCAT is the 'European network of population based registries for congenital anomalies'." - http://en.wikipedia.org/wiki/EUROCAT
20:21:33 <sbp> they definitely need to bring the other Eurocats back
20:22:17 <thelsdj> .title http://trollcats.com/
20:22:20 <phenny> thelsdj: Trollcats
20:22:39 <thelsdj> trollcats is the new lolbots
20:22:56 <[bjoern]> You use these words as if they had meaning.
20:24:03 <[bjoern]> http://torrentfreak.com/pirated-youtube-clip-boosts-bands-album-sales-090727/
20:24:07 <thelsdj> of course, both are no match for narwhals http://www.weebls-stuff.com/toons/Narwhals/
20:26:44 <[bjoern]> [[[
20:26:45 <[bjoern]> Unfortunately for most artists, anti-piracy outfits such as the RIAA, BPI and IFPI are increasingly policing YouTube to get all copyrighted music taken off the site. One such artist to suffer recently is the unfortunate Calvin Harris, who clashed with the music industry lobby group BPI.
20:26:46 <[bjoern]> “IT’S MY FUCKING SONG YOU ABSOLUTE BASTARDS,” Harris wrote on Twitter when he found out that YouTube had removed a clip he uploaded himself, following a copyright complaint from the BPI.
20:26:46 <[bjoern]> ]]]
20:26:53 <sbp> alright, bugger it
20:27:03 <sbp> quit the process, will have to think about it
20:27:03 <Monty> :O
20:27:08 <sbp> YES
20:27:08 <Monty> customs agency. “The IRS sent with some modifications http://asset.soup.io/asset/0410/3966_c5dd.png
20:27:19 <sbp> maybe it needs some different language even
20:27:25 <sbp> offs
20:27:26 <sbp> 'night!
20:27:32 <_ulises> night!
20:28:36 <[bjoern]> "Does anywhere in London deliver food after 11pm??? HELP URGENTLY NEEDED!!!!!"
20:33:35 <[bjoern]> http://www.tonightshowwithconanobrien.com/video/clips/shatner-does-palin-072709/1139665/ TITLE NOT WANTS
20:36:12 <[bjoern]> http://www.randomyoutubeinsult.com/
20:36:26 <[bjoern]> "COPY AND PASTE IF YOU HATE FRED"
21:13:27 *** MacTed has quit ()
21:47:44 <thelsdj> the nice thing about sbp's sed/grep/sed/egrep/tr chain is that it easily uses multiple cpus, i did the bzip first on the huge 5G file to see how big it was, 23G
22:14:22 *** MacTed (n=Thud@c-24-61-62-241.hsd1.ma.comcast.net) has joined #swhack
22:27:15 *** xjrn has quit (Read error: 110 (Connection timed out))
22:29:29 <jsled> "At high speed, nothing stays in the same place for long. Aim your car at the spot where you see an accident start. Chances are the accident will have moved by the time you get there." – http://rulesofthumb.org/perma.php?ruleid=1881
22:30:07 <jsled> which reminds me of the maxim I was taught: aim your car for the deer's hindquarters.
22:30:50 <[bjoern]> Optimize for keeping the edible parts intact.
22:31:14 <jsled> heh. no, optimize for not dying.
22:31:46 <[bjoern]> Well if you can survive without food, go ahead.
22:34:10 <jsled> everyone I've known/met involved in a car crash with a deer had their car totaled, and was as lucky to survive as the age of the incident.
22:34:46 <jsled> Though when my dad was getting a ride back, he did notice "some mexicans" loading the deer onto a pickup.
22:35:08 <jsled> Though I'm sure if he had access to a pickup, we'd've been eating deer the rest of the year.
22:38:40 <[bjoern]> You have big deer over there.
22:38:51 <[bjoern]> or it was christmas perhaps.
23:18:18 <laplink> .c 5GB over (17.5 MB over ((7 * 60) + 34) seconds)
23:18:19 <phenny> (5 GB) over ((17.5 MB) over (((7 * 60) + 34) * seconds)) = 1.5373545 days
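laplink's estimate scales a timed sample run up to the full dump. The same arithmetic in Python 3, assuming (as the calculator appears to) that 1 GB is 1024 MB; the function and variable names are illustrative:

```python
def estimate_seconds(total_mb, sample_mb, sample_seconds):
    """Scale a timed sample up to the full input at the same throughput."""
    throughput = sample_mb / sample_seconds  # MB per second
    return total_mb / throughput


total_mb = 5 * 1024                 # 5 GB, assuming 1 GB = 1024 MB
sample_seconds = (7 * 60) + 34      # 7 minutes 34 seconds for 17.5 MB
seconds = estimate_seconds(total_mb, 17.5, sample_seconds)
days = seconds / 86400
print(round(days, 7))
```

This assumes throughput stays constant as the table of unique words grows, which the earlier slowdown at around 250,000 keys suggests may be optimistic.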
23:30:45 *** xover has quit ("Leaving")
23:37:54 *** xover (n=xover@octet.neutri.no) has joined #swhack
23:43:32 *** clarity_ (n=clarity@69.181.77.150) has joined #swhack
23:43:33 <clarity_> yo
23:51:46 <nslater> yo clarity_, wasup?
23:52:07 <nslater> clarity_: i know a girl who has a flickr account called clarity. not you, is it?
23:52:32 <nslater> xover: yo
23:52:50 <laplink> `sup
23:53:07 <clarity_> nslater, I ain't a bitch
23:53:12 <nslater> shame
23:53:16 <nslater> got pics?
23:53:40 <clarity_> yeah, but i'm too lazy to find them :-)
23:53:47 <nslater> no, i meant of chix
23:54:17 <clarity_> naw man. I'm into animals
23:54:21 <clarity_> i don't like girls
23:54:40 <jessica> That's good, as we're all completely and totally insane.
23:54:53 <clarity_> :-)
23:54:56 <[bjoern]> I don't see the problem.
23:54:58 <clarity_> i'm missing defcon :-(
23:55:06 <clarity_> first time in 3 years
23:55:34 <jessica> Of course you don't see the problem, Björn!
23:55:39 <clarity_> instead i'm eating jelly beans smoking weed and watching ted
23:55:40 <jessica> You're a German!
23:55:55 * jessica has gained a German devotee from another network.
23:56:23 <[bjoern]> Aren't germans notorious problem-seeers?
23:56:30 <clarity_> jessica: how'd you know i'm german?
23:56:42 <jessica> Tell me about it. crschmidt married me.
23:56:59 <[bjoern]> So, no "of course" there then.
23:57:04 *** crschmidt (i=crschmid@59.176.232.72.static.reverse.ltdomains.com) has joined #swhack
23:57:10 * crschmidt defends his honor
23:57:10 <phenny> crschmidt: 25 Jun 09:34Z <nsh> tell crschmidt i noticed athena got h4xed by someone, see fourth result of http://www.google.fi/search?q=w4ck1ng-shell+intitle%3A%E2%80%9Dshell%40%E2%80%9D%E2%80%9C&ie=utf-8&oe=utf-8&aq=t&rls=com.ubuntu:en-GB:unofficial&client=firefox-a
23:57:16 <clarity_> interesting. I still haven't married. I haven't found the right horse
23:57:24 <jessica> This guy must lack a girlfriend, because apparently he hangs around krautchan.
23:57:37 <jessica> he thought I might, too, given my sense of humor.
23:57:56 <clarity_> lol
23:57:58 <jessica> you DO realise that some problems can take on full-time status and eventually kill you, right?
23:58:05 <jessica> you know this, bjoern, clarity?
23:58:10 <laplink> .c 5GB over (17.5 MB over ((2 * 60) + 49) seconds)
23:58:10 <phenny> (5 GB) over ((17.5 MB) over (((2 * 60) + 49) * seconds)) = 13.7346032 hours
23:58:26 <[bjoern]> Like children.
23:58:26 <clarity_> jessica: oh i've factored... oh i've factored
23:58:43 <jessica> ZEN TOMORROW WE VIL TAKE POLAND!
23:58:53 <jessica> I mean, shit, it's the week-end, you guys got anything else planned?
23:58:53 <Monty> potty mouth!
23:58:57 <[bjoern]> I am intrigued by the krautchan countdown
23:59:13 <[bjoern]> Yes, exam in 6 hours.
23:59:22 <jessica> Examination of what?
23:59:29 <clarity_> [bjoern], what skool?