Options:
- # Session Start: Mon Apr 07 00:00:00 2008
- # Session Ident: #whatwg
- # [00:03] <jgraham> "So HTTP 1.0 was a failure too? And DOM Level 1? And CSS1? And the first season of Buffy? And the first Feynman lecture? And the first act of Macbeth? Maybe this paragraph is a failure too, since it's followed by more paragraphs. :-)"
- # [00:03] <jgraham> Is a wonderful sentence, with one error; the *whole* of Buffy was a failure
- # [00:04] * jgraham hides
- # [00:05] <annevk> now you'll get a ban :p
- # [00:07] <jgraham> What for failing to notice that it's actually 5 sentences?
- # [00:07] * jgraham tries to look innocent
- # [00:07] <jgraham> s/5/7/
- # [00:09] <Hixie> jgraham: yeah i wasn't a big fan of buffy (especially seasons 1-4)
- # [00:09] <Hixie> 6.6 was a good episode
- # [00:10] <Hixie> and spike and dawn made 5-7 watchable
- # [00:10] <Hixie> but i'm just not a fan of whedon's writing style -- it's good, just not my kettle of fish :-)
- # [00:11] <jgraham> I have friends who were really into Buffy but I'm never really got it.
- # [00:19] * Joins: qwert666_ (n=qwert666@dak171.neoplus.adsl.tpnet.pl)
- # [00:21] <annevk> hmm, if we do <script> and <style> using CDATA stripping <![CDATA[ would help authors...
- # [00:21] <annevk> but it might be problematic for E4X and all...
- # [00:21] * annevk shrugs
- # [00:23] * Joins: roc (n=roc@CPE-58-165-31-191.vic.bigpond.net.au)
- # [00:25] <Hixie> e4x?
- # [00:25] <Hixie> how so?
- # [00:25] <gsnedders> Hixie: But HTTP 1.0 was a failure :P
- # [00:25] <annevk> not sure, but I remember <!-- being a problem for it
- # [00:26] <Hixie> annevk: eh?
- # [00:26] <Hixie> gsnedders: i hope html5 is that much of a failure
- # [00:27] <gsnedders> Hixie: So almost everything will move away as soon as HTML6 is out?
- # [00:27] <annevk> Hixie, https://bugzilla.mozilla.org/show_bug.cgi?id=311071
- # [00:28] <annevk> Hixie, no idea what the exact problem is, there's no testcases, examples, etc.
- # [00:31] * weinig is now known as weinig|away
- # [00:32] <annevk> Hixie, there's more here https://bugzilla.mozilla.org/show_bug.cgi?id=309242
- # [00:32] <annevk> Hixie, and here https://bugzilla.mozilla.org/show_bug.cgi?id=311157
- # [00:33] <annevk> Hixie, though I suppose that if we only strip <![cdata[ at the start it might be ok
- # [00:37] <jwalden> frankly, I don't think you should worry about E4X
- # [00:38] <jwalden> it's rarely used, the spec's not very good, rarely implemented, and if it complicates parsing (which it would), you should fall back on external scripts or entity encoding
- # [00:38] * Quits: qwert666 (n=qwert666@dak171.neoplus.adsl.tpnet.pl) (Connection timed out)
- # [00:40] <annevk> the problem is a combination of existing SVG content, current parsing of <script> in XML, parsing of <script> in HTML, and the desire to make existing SVG content work in HTML
- # [00:41] <annevk> (well, and existing HTML content that contains <svg><script> presumably...)
- # [00:43] <jgraham> annevk: html5lib produces a parse error when it encounters EOF in innerHTML mode and there is > 1 tag on the stack of open elements, which is commented as so: "# XXX This is not what the specification says. Not sure what to do
- # [00:43] <jgraham> # here."
- # [00:44] <jgraham> Do you mind if we switch that error off at the moment to agree with the testcases and presumably the spec
- # [00:44] <annevk> no, go ahead
- # [00:44] <annevk> though please check the spec if you plan on removing the comment :)
- # [00:44] <jgraham> I won't remove the comment
- # [00:50] <jgraham> Hmm. It might just be the testcase that's wrong. The code looks pretty unlike the spec. Maybe I should try sleeping first and thinking later :)
- # [00:51] * Quits: tndH (i=Rob@adsl-87-102-83-116.karoo.KCOM.COM) ("ChatZilla 0.9.81-rdmsoft [XULRunner 1.8.0.9/2006120508]")
- # [00:51] * Joins: heycam` (n=cam@clm-laptop.infotech.monash.edu.au)
- # [01:00] * Quits: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com) (Read error: 110 (Connection timed out))
- # [01:07] * Quits: shepazu (n=schepers@cpe-069-134-123-228.nc.res.rr.com) (Read error: 104 (Connection reset by peer))
- # [01:07] * Joins: shepazu (n=schepers@cpe-069-134-123-228.nc.res.rr.com)
- # [01:14] * Quits: qwert666_ (n=qwert666@dak171.neoplus.adsl.tpnet.pl) ("Leaving")
- # [01:16] * Quits: roc (n=roc@CPE-58-165-31-191.vic.bigpond.net.au) (Read error: 110 (Connection timed out))
- # [01:28] * Quits: svl (n=me@ip565744a7.direct-adsl.nl) ("And back he spurred like a madman, shrieking a curse to the sky.")
- # [01:29] * Quits: jwalden (n=waldo@RANDOM-SEVENTY-TWO.MIT.EDU) (Remote closed the connection)
- # [02:05] <takkaria> annevk: on your latest post, first para, s/totaled/totalled/ and s/amount/number/
- # [02:07] <annevk> ty, fixed
- # [02:08] <annevk> night all
- # [02:08] * annevk -> bed
- # [02:09] * Quits: KevinMarks (n=KevinMar@c-98-207-134-151.hsd1.ca.comcast.net) (Read error: 104 (Connection reset by peer))
- # [02:18] * Joins: KevinMarks (n=KevinMar@c-98-207-134-151.hsd1.ca.comcast.net)
- # [02:36] * Joins: jruderman_ (n=jruderma@c-67-180-174-213.hsd1.ca.comcast.net)
- # [02:47] * Joins: BenMillard (n=cerbera@cpc1-flee1-0-0-cust285.glfd.cable.ntl.com)
- # [02:50] * Quits: jruderman (n=jruderma@c-67-180-174-213.hsd1.ca.comcast.net) (Read error: 110 (Connection timed out))
- # [02:52] * Joins: kfish (n=conrad@61.194.21.25)
- # [02:53] * Quits: hober (n=ted@unaffiliated/hober) ("ERC Version 5.3 (IRC client for Emacs)")
- # [02:55] * Joins: jwalden (n=waldo@STRATTON-THREE-FIFTY-EIGHT.MIT.EDU)
- # [03:06] <annevk> http://lists.w3.org/Archives/Public/public-xhtml2/2008Apr/0020.html
- # [03:06] * annevk can't sleep (for those who read the earlier message)
- # [03:20] <BenMillard> it was snowing here yesterday morning...very unusual for this part of the UK
- # [03:21] <BenMillard> especially in April!
- # [03:24] <Philip`> Unfortunately it had all melted by the time I woke up (around 1pm) :-(
- # [03:24] <BenMillard> I took some photos...I suck at doing that, though they will appear on my blog eventually
- # [03:29] <tomg> it was nice
- # [03:29] <tomg> was shocked at how white it was
- # [03:31] <BenMillard> indeed, it was proper powder snow
- # [03:31] <tomg> only light snow was forecast
- # [03:32] * Quits: dbaron (n=dbaron@c-67-160-251-228.hsd1.ca.comcast.net) ("8403864 bytes have been tenured, next gc will be global.")
- # [03:34] * Hixie e-mails the xhtml2 group
- # [03:38] * Joins: MacDome (n=eric@c-24-130-11-246.hsd1.ca.comcast.net)
- # [03:49] <MikeSmith> BenMillard - can you give a brief summary of the goals of your ongoing table work?
- # [03:49] <MikeSmith> is there are particular hypothesis you're working under?
- # [03:50] <BenMillard> it's mostly to understand how tables get authored in reality, so we can design HTML5 features to be more realistic and robust
- # [03:50] <BenMillard> particularly the table header association mechanism
- # [03:51] <BenMillard> where "we" means HTMLWG in general
- # [03:52] <BenMillard> I should probably summarise the goals in the document :P
- # [03:53] <MikeSmith> BenMillard - yeah, that would be useful, I think
- # [03:54] <MikeSmith> summary of what you've done so far, and what else you want to do going forward
- # [03:54] <MikeSmith> what specific product/deliverable you have in mind
- # [03:54] <BenMillard> the document starts with a "Numbers" section which summarises what's been done
- # [03:55] <MikeSmith> for example, we could end up producing/publishing a W3C Note from it
- # [03:55] <MikeSmith> BenMillard - what's the URL?
- # [03:55] <BenMillard> oh, sorry I thought you were looking at it! it's here: http://sitesurgeon.co.uk/tables/
- # [03:56] <BenMillard> deliverables include helping develop the HTML5 header association algorithm with other HTMLWG participants (James Graham and Simon Pieters, mostly)
- # [03:57] <MikeSmith> BenMillard - hmm, seems like that wouldn't be a specific deliverable but instead something the deliverable/document could be used for
- # [03:58] <MikeSmith> have you gotten much feedback from WAI folks about it yet?
- # [03:58] <BenMillard> I don't recall talking to any WAI people about it yet
- # [03:58] <Hixie> http://www.w3.org/2003/entities/2007xml/unicode.xml has a last-modified date of 1970
- # [03:58] <Hixie> go w3c!
- # [03:58] <MikeSmith> heh
- # [03:59] <BenMillard> MikeSmith, it was done in my spare time so I've not been tracking feedback around it
- # [03:59] * Hixie removes the -N from his command line so that hsi script will always download the file instead of assuming that it has an accurate date
- # [04:00] <Hixie> oh actually
- # [04:00] <Hixie> that could be a minefield bug
- # [04:00] <Hixie> with xslt
- # [04:00] <Hixie> nevermind
- # [04:00] <MikeSmith> Hixie - you mean the headers?
- # [04:00] <MikeSmith> Last-Modified: Sun, 06 Apr 2008 10:46:09
- # [04:00] <Hixie> yeah
- # [04:00] <Hixie> yeah, minefield bug with xslt
- # [04:01] <Hixie> lovely
- # [04:01] <MikeSmith> ah, OK
- # [04:01] <MikeSmith> go minefield!
- # [04:02] * Joins: bzed_ (n=bzed@devel.recluse.de)
- # [04:02] <BenMillard> MikeSmith, yes the document itself does not contain a product
- # [04:02] <MikeSmith> BenMillard - fwiw, I think it might be worthwhile to post the URL and short summary to one of the WAI lists
- # [04:02] <MikeSmith> if you've not done that already
- # [04:03] <BenMillard> it's not under active development at the moment; I can't afford the time
- # [04:03] * Joins: xShad0w_ (n=xShad0w@pool-72-66-13-80.washdc.fios.verizon.net)
- # [04:03] * Quits: jruderman_ (n=jruderma@c-67-180-174-213.hsd1.ca.comcast.net) (Read error: 104 (Connection reset by peer))
- # [04:04] <BenMillard> I'd like it to be used by anyone who finds it useful, though
- # [04:04] * Joins: jruderman (n=jruderma@c-67-180-174-213.hsd1.ca.comcast.net)
- # [04:04] * MikeSmith is getting lots of "Connection reset by peer http://intertwingly.net/blog/index.atom" when trying to get Sam's feed
- # [04:04] <MikeSmith> BenMillard - I see
- # [04:05] <BenMillard> I've been seeking sponsorship from various places to make the work more sustainable
- # [04:05] <BenMillard> I took the whole of last month off work to try and speed that up, but haven't quite got there
- # [04:10] <MikeSmith> BenMillard - I would think investing some time in doing a little awareness-raising about might help with getting others to consider sponsoring further work
- # [04:10] <MikeSmith> e.g., by posting about it to WAI lists or elsewhere
- # [04:11] * Quits: csarven- (n=csarven@70.50.11.114) (Read error: 104 (Connection reset by peer))
- # [04:11] * Quits: bzed (n=bzed@debian/developer/bzed) (Read error: 111 (Connection refused))
- # [04:11] * bzed_ is now known as bzed
- # [04:11] <BenMillard> MikeSmith, that's a good idea. can you suggest a specific list for me to post it to? I've only sent it to public-html before now.
- # [04:13] <MikeSmith> BenMillard - no, I can't suggest any specific list. I don't follow them. Karl Dubost might know
- # [04:14] <MikeSmith> might be worthwhile for you to get in touch with Michael Cooper and/or Shawn Henry
- # [04:14] <MikeSmith> both W3C staff for WAI
- # [04:15] <BenMillard> I've spoken to Shawn Henry in person in November 2007; I could ask her
- # [04:18] <MikeSmith> BenMillard - yeah, she would certainly be able to tell you
- # [04:19] * MikeSmith drops off to head into office
- # [04:19] * Quits: MikeSmith (n=MikeSmit@58.157.21.205) ("Less talk, more pimp walk.")
- # [04:19] <BenMillard> MikeSmith, thanks for your advice about this. I'm a noob when it comes to W3C!
- # [04:19] <BenMillard> oh bugger, he left just before I sent that
- # [04:19] * Quits: xShad0w__ (n=xShad0w@pool-71-163-158-40.washdc.fios.verizon.net) (Read error: 110 (Connection timed out))
- # [04:28] * Quits: enn (i=eli@dsl253-036-017.chi1.dsl.speakeasy.net) (Read error: 104 (Connection reset by peer))
- # [04:43] * Joins: csarven (n=csarven@70.50.11.114)
- # [04:43] * Joins: MikeSmith (n=MikeSmit@eM60-254-229-138.pool.emnet.ne.jp)
- # [04:43] <jwalden> BenMillard: ^
- # [04:46] <MikeSmith> BenMillard - I'm a noob too.. I've worked for the W3C only for a year now. most people here actually have a lot more experience with the W3C than I do
- # [04:48] <MikeSmith> I'm sort of like a foreign element that's been inserted, and not clear yet if it's going to make things worse or better
- # [04:50] <jwalden> <foreignElement id="MikeSmith"/>
- # [04:57] <BenMillard> MikeSmith, that's cool
- # [04:57] * MikeSmith reads dbaron's latest
- # [04:58] <MikeSmith> http://dbaron.org/log/20080406-acid3
- # [04:58] <MikeSmith> "Teaching to the Test" (on HTML5 and Acid3 and Firefox)
- # [05:02] * Joins: wakaba (n=w@201.162.210.220.dy.bbexcite.jp)
- # [05:09] * Joins: dbaron (n=dbaron@c-67-160-251-228.hsd1.ca.comcast.net)
- # [05:11] * Quits: jruderman (n=jruderma@c-67-180-174-213.hsd1.ca.comcast.net) (brown.freenode.net irc.freenode.net)
- # [05:11] * Quits: annevk (n=annevk@77.163.243.203) (brown.freenode.net irc.freenode.net)
- # [05:11] * Quits: Kuruma (n=Kuruman@h123-176-107-050.catv01.catv-yokohama.ne.jp) (brown.freenode.net irc.freenode.net)
- # [05:14] * Quits: MikeSmith (n=MikeSmit@eM60-254-229-138.pool.emnet.ne.jp) ("Less talk, more pimp walk.")
- # [05:14] * Joins: jruderman (n=jruderma@c-67-180-174-213.hsd1.ca.comcast.net)
- # [05:14] * Joins: annevk (n=annevk@77.163.243.203)
- # [05:14] * Joins: Kuruma (n=Kuruman@h123-176-107-050.catv01.catv-yokohama.ne.jp)
- # [05:22] * Quits: csarven (n=csarven@70.50.11.114) (Read error: 110 (Connection timed out))
- # [05:22] * Quits: wakaba_ (n=w@201.162.210.220.dy.bbexcite.jp) (Read error: 110 (Connection timed out))
- # [05:23] * Joins: deane (n=dean@202-74-210-41.ue.woosh.co.nz)
- # [05:25] <BenMillard> MikeSmith, I've updated the tables document: http://sitesurgeon.co.uk/tables/
- # [05:26] <BenMillard> away for lunch now (at 4am local time!)
- # [06:02] * Joins: MikeSmith (n=MikeSmit@dhcp-246-231.mag.keio.ac.jp)
- # [06:07] * Joins: billyjack (n=MikeSmit@dhcp-246-231.mag.keio.ac.jp)
- # [06:08] * Quits: billyjack (n=MikeSmit@dhcp-246-231.mag.keio.ac.jp) (Read error: 104 (Connection reset by peer))
- # [06:09] * Joins: Thezilch (i=fuz007@cpe-76-170-22-70.socal.res.rr.com)
- # [06:28] * Quits: MacDome (n=eric@c-24-130-11-246.hsd1.ca.comcast.net)
- # [06:30] <BenMillard> MikeSmith, I've updated the tables document http://sitesurgeon.co.uk/tables/ is this better?
- # [06:46] * Joins: MacDome (n=eric@70-3-102-248.area5.spcsdns.net)
- # [06:50] * Joins: MacDome_ (n=eric@99-207-45-235.area5.spcsdns.net)
- # [06:53] * Joins: MacDome__ (n=eric@68-240-90-28.area5.spcsdns.net)
- # [06:54] <MikeSmith> BenMillard - which parts did you change/add?
- # [06:54] <MikeSmith> The Goals and Deliverables part?
- # [06:54] <BenMillard> that's right
- # [06:55] <BenMillard> I also moved the Feedback section to the top to emphasise the openness of it's development
- # [06:57] <BenMillard> MikeSmith, using the UserName, Message format is more compatible with IRC clients and the #whatwg log than using UserName - Message
- # [06:57] <BenMillard> e.g.:
- # [06:57] <BenMillard> BenMillard, which parts did you change/add?
- # [06:57] <MikeSmith> um, thanks for the tip
- # [06:58] <Hixie> personally i'm more of a foo: bar fan than a foo, bar fan
- # [06:59] <BenMillard> either of those are compatible with IRC clients and the log, afaict, but foo - bar is not
- # [07:00] <MikeSmith> BenMillard - I don't know what you mean by compatible with IRC clients and the log
- # [07:00] <MikeSmith> not that I really care to get into a discussion about it
- # [07:01] * Quits: MacDome (n=eric@70-3-102-248.area5.spcsdns.net) (Read error: 104 (Connection reset by peer))
- # [07:01] <BenMillard> the highlight lines directed at you when either of the conventional methods are followed
- # [07:01] <BenMillard> *they
- # [07:02] <BenMillard> so if you address me as BenMillard, message it gets highlighted and I'm more likely to see it
- # [07:02] <BenMillard> (or BenMillard: message)
- # [07:03] <Hixie> either works with a decent irc client like irssi :-)
- # [07:03] <MikeSmith> BenMillard - yeah, I'd say that's problem between you and your IRC client
- # [07:03] * jwalden snickers
- # [07:04] <jwalden> don't most clients get set off when your nick appears anywhere in the line, surrounded by \b ?
- # [07:06] <BenMillard> the , and : forms are the only ones I've found interoperable...is there a standard for this?
- # [07:07] * Quits: MacDome_ (n=eric@99-207-45-235.area5.spcsdns.net) (Read error: 104 (Connection reset by peer))
- # [07:07] <jwalden> standard? IRC? hah!
- # [07:08] <jwalden> or so I've been led to believe
- # [07:10] <MikeSmith> I think the standard for IRC is "please leave your %s at the door"
- # [07:11] <MikeSmith> jwalden - fwiw, XChat highlights the nick of the user who uses your nick in a message
- # [07:11] <MikeSmith> not your own nick
- # [07:11] <MikeSmith> which sorta makes sense to me
- # [07:12] <MikeSmith> I know my own name
- # [07:12] <MikeSmith> (most of the time)
- # [07:12] <jwalden> I used the phrase "set off" because it probably differs across clients; Chatzilla highlights the individual message, I assume others do other thigns
- # [07:12] <MikeSmith> the useful piece of information for this case seems to be, who's calling?
- # [07:12] <jwalden> s/gns/ngs/
- # [07:14] <BenMillard> the IRC logs for this channel are athttp://krijnhoetmer.nl/irc-logs/ and it support both the , and : forms. so if you use one of those forms, while in this channel, the highlighting feature will work in the logs
- # [07:14] <BenMillard> although I agree that any message which mentions a user's name should light up in that user's client
- # [07:21] * Quits: Thezilch (i=fuz007@cpe-76-170-22-70.socal.res.rr.com) (Read error: 110 (Connection timed out))
- # [07:27] * Joins: hober (n=ted@unaffiliated/hober)
- # [07:34] * MacDome__ is now known as MacDome
- # [07:38] * Quits: kfish (n=conrad@61.194.21.25) ("Pike!")
- # [07:41] * weinig|away is now known as weinig
- # [07:42] * Joins: maikmerten (n=merten@ls5laptop14.cs.uni-dortmund.de)
- # [07:46] * Quits: MacDome (n=eric@68-240-90-28.area5.spcsdns.net)
- # [07:55] * Quits: heycam` (n=cam@clm-laptop.infotech.monash.edu.au) (Read error: 110 (Connection timed out))
- # [08:08] * Joins: MacDome (n=eric@c-24-130-11-246.hsd1.ca.comcast.net)
- # [08:14] * Quits: MacDome (n=eric@c-24-130-11-246.hsd1.ca.comcast.net)
- # [08:18] * Quits: jgraham_ (n=james@81-86-218-215.dsl.pipex.com)
- # [08:18] * Joins: othermaciej (n=mjs@dsl081-048-145.sfo1.dsl.speakeasy.net)
- # [08:20] <hsivonen> http://bitworking.org/news/317/Revisionist-XHistory
- # [08:20] <othermaciej> hi all
- # [08:21] <hsivonen> hi
- # [08:35] <hsivonen> surprisingly, one of the hotspots of Validator.nu is TreeSet doing a silly number of compares when the inserted items are already ordered or reverse-ordered.
- # [08:36] <hsivonen> too bad the JDK and Commons Collections don't seem to have head/tail-biased linked list-backed SortedSets.
- # [08:37] <othermaciej> that's not a good way to make a sorted data structure
- # [08:38] <othermaciej> (it's a good way to make an ordered associative data structure)
- # [08:38] <hsivonen> othermaciej: what's not good? TreeSet?
- # [08:38] <othermaciej> no, a linked-list backed set
- # [08:38] <othermaciej> a TreeSet (assuming it's a balanced tree) is a good way to make a balanced associative structure
- # [08:39] <othermaciej> but inserting N items into it should be N log N so it's surprising it would be a hot spot
- # [08:39] <hsivonen> othermaciej: why not if you know the insertion will always be either to head of the list or the next from head?
- # [08:39] <othermaciej> if you have items that are already ordered, you would want to use a ListHashSet
- # [08:39] <othermaciej> (I think that's what Java calls it)
- # [08:39] * weinig is now known as weinig|zZz
- # [08:40] <hsivonen> othermaciej: the insertions are *almost* ordered
- # [08:40] <hsivonen> that is, the new insertion is most often to the head of the list, but sometimes a slot or two further
- # [08:41] <hsivonen> (LinkedHashSet is not what I need here)
- # [08:41] <othermaciej> that doesn't have an insertBefore?
- # [08:41] <othermaciej> (too bad, it should be easy to do)
- # [08:43] <hsivonen> it appears it does not
- # [08:47] <hsivonen> the show source feature ends up comparing locations 29 times the number of location objects
- # [08:47] <hsivonen> that's not good
- # [09:03] * Joins: tndH (i=Rob@adsl-87-102-83-116.karoo.KCOM.COM)
- # [09:07] * Joins: MacDome (n=eric@c-24-130-11-246.hsd1.ca.comcast.net)
- # [09:12] * Joins: met_ (n=Hassman@r5bx220.net.upc.cz)
- # [09:16] <BenMillard> annevk, you wrote "heh, RDF fanatics use " and <em> for quotes" and I see things like that quite frequently on the blogs of markup/accessibility enthusiasts/experts
- # [09:20] <BenMillard> indeed, it's hard to find anyone using <q>...present company excepted :)
- # [09:21] <BenMillard> getting the right punctuation seems more important to authors than using the right element
- # [09:25] <jwalden> problem being q's quotation behavior (CSS's, that is) is underspecified, as dbaron tells me
- # [09:26] <BenMillard> Hixie, I made editorial changes to http://sitesurgeon.co.uk/tables/ which include clarifying the markup used by each group in "How Authors Indicate Headers in Data Tables". They were a bit vague before. Let me know if this changes anything.
- # [09:26] <Hixie> k
- # [09:27] <Hixie> i probably won't look at table stuff for some time
- # [09:34] * Joins: roc (n=roc@CPE-60-230-33-80.vic.bigpond.net.au)
- # [09:40] * Joins: svl (n=me@ip565744a7.direct-adsl.nl)
- # [09:46] * Quits: MacDome (n=eric@c-24-130-11-246.hsd1.ca.comcast.net)
- # [09:47] * Quits: roc (n=roc@CPE-60-230-33-80.vic.bigpond.net.au)
- # [09:47] * Joins: roc (n=roc@CPE-60-230-33-80.vic.bigpond.net.au)
- # [09:48] * Quits: webben (n=benh@dip5-fw.corp.ukl.yahoo.com)
- # [09:52] * Joins: Thezilch (i=fuz007@cpe-76-170-22-70.socal.res.rr.com)
- # [09:53] * Quits: dbaron (n=dbaron@c-67-160-251-228.hsd1.ca.comcast.net) ("8403864 bytes have been tenured, next gc will be global.")
- # [10:07] * Parts: hober (n=ted@unaffiliated/hober) ("ERC Version 5.2 (IRC client for Emacs)")
- # [10:16] * Joins: zcorpan_ (n=zcorpan@pat-tdc.opera.com)
- # [10:17] * zcorpan_ did not know about Document.strictErrorChecking
- # [10:18] * zcorpan_ will define in dom5core what happens when it is false
- # [10:18] * Quits: MikeSmith (n=MikeSmit@dhcp-246-231.mag.keio.ac.jp) ("Less talk, more pimp walk.")
- # [10:28] * Joins: ROBOd (n=robod@89.122.216.38)
- # [10:32] <BenMillard> Philip`, my (badly taken) photos of UK snow are now blogged: http://projectcerbera.com/blog/2008/04#day06
- # [10:34] * Joins: Lachy_ (n=Lachlan@ti200710a340-2662.bb.online.no)
- # [10:34] * Joins: qwert666 (n=qwert666@acdi143.neoplus.adsl.tpnet.pl)
- # [10:41] * Parts: BenMillard (n=cerbera@cpc1-flee1-0-0-cust285.glfd.cable.ntl.com)
- # [10:48] * Joins: webben (n=benh@nat/yahoo/x-4832041cfaeca695)
- # [10:51] * Quits: Lachy_ (n=Lachlan@ti200710a340-2662.bb.online.no) (Read error: 110 (Connection timed out))
- # [10:51] * Quits: Lachy (n=Lachlan@ti200710a340-2662.bb.online.no) (Read error: 110 (Connection timed out))
- # [10:52] * Joins: zcorpan (n=zcorpan@pat-tdc.opera.com)
- # [10:54] <jwalden> oh man
- # [10:54] <jwalden> that looks AWESOME
- # [10:57] * Quits: Thezilch (i=fuz007@cpe-76-170-22-70.socal.res.rr.com) (Read error: 104 (Connection reset by peer))
- # [11:06] * Quits: zcorpan_ (n=zcorpan@pat-tdc.opera.com) (Read error: 110 (Connection timed out))
- # [11:14] * Joins: Camaban (n=alee@85-211-226-141.dyn.gotadsl.co.uk)
- # [11:16] * Quits: roc (n=roc@CPE-60-230-33-80.vic.bigpond.net.au)
- # [11:41] * Joins: virtuelv (n=virtuelv@pat-tdc.opera.com)
- # [11:44] * Quits: jwalden (n=waldo@STRATTON-THREE-FIFTY-EIGHT.MIT.EDU) (Remote closed the connection)
- # [11:50] * Joins: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [12:03] <hsivonen> othermaciej: swiching to HeadBiasedSortedSet and TailBiasetSortedSet changed the comparison patterns to the better by approximately a factor of 29
- # [12:04] <hsivonen> I don't know what kind of balanced tree TreeSet has, but it sure compares a *lot*
- # [12:05] <othermaciej> how big is your data set?
- # [12:05] <hsivonen> othermaciej: the HTML 5 spec
- # [12:05] * Joins: webben_ (n=benh@nat/yahoo/x-1c1582fe4b9a68c2)
- # [12:05] <hsivonen> about 16000 items in the set
- # [12:06] <othermaciej> log base 2 of that is 14
- # [12:06] <othermaciej> must not be that well balanced
- # [12:06] <othermaciej> (well, close to 14)
- # [12:06] <hsivonen> or it compares everything twice
- # [12:06] <hsivonen> or something
- # [12:07] <hsivonen> the next big hotspots are IO and XPath
- # [12:09] <hsivonen> making IO go away is hard, but making XPath go away is quite doable and something I want to do anyway
- # [12:10] <othermaciej> I am still somewhat surprised that inserting into a tree-based data structure could be the top hot spot for a program
- # [12:10] <hsivonen> doh brainfart re compares everything twice
- # [12:11] <hsivonen> anyway, the profiler data is what it is.
- # [12:16] <othermaciej> sorting an already-sorted 16000 element array in JavaScript takes 10 milliseconds
- # [12:16] <othermaciej> (on Safari on a decent machine)
- # [12:16] <othermaciej> I would suspect something is broken about TreeSet
- # [12:19] <hsivonen> the factor 29 wasn't a time factor but number of invocations of compareTo factor
- # [12:19] <hsivonen> so it can't be CPU timing weirdness with the profiler
- # [12:19] * Quits: webben (n=benh@nat/yahoo/x-4832041cfaeca695) (Read error: 110 (Connection timed out))
- # [12:22] * Joins: Lachy (n=Lachlan@pat-tdc.opera.com)
- # [12:34] * Joins: hasather_ (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [12:44] * Quits: zcorpan (n=zcorpan@pat-tdc.opera.com) (Read error: 104 (Connection reset by peer))
- # [12:45] * Joins: zcorpan (n=zcorpan@pat-tdc.opera.com)
- # [12:48] <zcorpan> 2137 entities
- # [12:49] <zcorpan> it's like learning simplified chinese
- # [12:50] <hsivonen> I think this entity business is a bad idea, but I haven't gotten around to sending mail yet
- # [12:52] <hsivonen> annevk: re blog: http://www.photobasement.com/wp-content/uploads/2008/04/quotationmarks.jpg
- # [12:54] <zcorpan> wiki syndrome?
- # [13:04] * Quits: ROBOd (n=robod@89.122.216.38) ("http://www.robodesign.ro")
- # [13:13] <annevk> zcorpan, fwiw, I think you should drop the strict error reporting thing from dom5
- # [13:17] <jruderman> hsivonen: nice photo
- # [13:20] <gsnedders> Philip`: http://pastebin.ca/974108
- # [13:23] * Joins: roc (n=roc@CPE-124-181-181-187.vic.bigpond.net.au)
- # [13:24] <zcorpan> annevk: isn't that too late given acid3?
- # [13:25] <zcorpan> annevk: or do you mean just drop the attribute and leave the strict behavior intact?
- # [13:26] <annevk> what does acid3 have to do with anything?
- # [13:26] <zcorpan> it checks that createElementNS('...', 'foo::') raises an exception, e.g.
- # [13:27] * Quits: webben_ (n=benh@nat/yahoo/x-1c1582fe4b9a68c2)
- # [13:29] <annevk> I meant the method on document, fwiw
- # [13:32] <zcorpan> ok
- # [13:32] <zcorpan> although it would be nice to be able to create an html5 parser in js
- # [13:32] <hsivonen> so here I was parsing XML
- # [13:32] <hsivonen> and it went really slowly
- # [13:32] <annevk> zcorpan, document.innerHTML
- # [13:33] <hsivonen> until I realized I should prevent it from fetching DTDs from w3.org...
- # [13:33] <zcorpan> annevk: true, the dom3core attribute doesn't help legacy browsers anyway
- # [13:35] * zcorpan uses del.icio.us as his dom5core issue tracker
- # [13:48] <annevk> hmm, entity changes are impossible to track using web-apps-tracker
- # [13:49] * Joins: tndH_ (i=Rob@adsl-87-102-77-74.karoo.KCOM.COM)
- # [13:54] * Quits: hasather_ (n=hasather@90-231-107-133-no62.tbcn.telia.com) (Remote closed the connection)
- # [13:54] * Joins: hasather_ (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [13:57] * Quits: deane (n=dean@202-74-210-41.ue.woosh.co.nz) ("ChatZilla 0.9.81 [Firefox 3.0b5/2008032620]")
- # [14:07] * Joins: myakura (n=myakura@p1215-ipbf3008marunouchi.tokyo.ocn.ne.jp)
- # [14:08] * Quits: tndH (i=Rob@adsl-87-102-83-116.karoo.KCOM.COM) (Read error: 110 (Connection timed out))
- # [14:10] * Joins: jmb (n=jmb@login.ecs.soton.ac.uk)
- # [14:13] <Philip`> gsnedders: How come so many people spell "connection" wrong, but get every other header correct?
- # [14:14] <gsnedders> Philip`: there are plenty of rare mistakes, though
- # [14:14] <gsnedders> Philip`: like spaces and not hyphens comes up a fair bit
- # [14:15] <gsnedders> Philip`: but cneonction is caused by a proxy, I can't remember which
- # [14:15] <gsnedders> Philip`: it was to avoid keeping the connection open, IIRC
- # [14:15] <gsnedders> Philip`: there was a bizarre reason for it
- # [14:16] <Philip`> Ah, that's what http://www.nextthing.org/archives/2005/08/07/fun-with-http-headers says
- # [14:16] <gsnedders> the web is weird.
- # [14:18] <toruvinn> gsnedders, my guess would be the 'neon proxy', there was something like that.
- # [14:19] <Philip`> ("... I had a database with 2,686,155 page responses and 23,699,737 response headers. The actual downloading of all of this took about a week." - that sounds really quite slow)
- # [14:19] <toruvinn> haha, awesome page, Philip`.
- # [14:19] <toruvinn> thanks.
- # [14:21] <gsnedders> how do you plot something using gnuplot from a data file taking the log of one column of data?
- # [14:21] <Philip`> gsnedders: "set logscale x 2" might be what you want
- # [14:23] <gsnedders> Philip`: no, y
- # [14:23] <Philip`> or "plot 'foo.dat' using 1:log($2) ..." might be
- # [14:23] <gsnedders> But that's good enough :)
- # [14:24] <gsnedders> it still shows a huge long tail
- # [14:26] * Joins: webben (n=benh@nat/yahoo/x-ef5334904e12e7ad)
- # [14:28] <Philip`> Everything has a long tail :-)
- # [14:30] <annevk> from the blog: "Can we get OOXML in HTML5 too? They seem to be very similar in their approaches to standardisation."
- # [14:30] <annevk> wtf, really...
- # [14:40] <Philip`> annevk: You might need to be more specific than "the blog", since there are several
- # [14:41] <annevk> oops, s/the/my/
- # [14:41] <Philip`> Ah, that narrows it down sufficiently
- # [14:41] <zcorpan> annevk's blog is *the* blog, didn't you know?
- # [14:41] <gsnedders> I mean, nobody reads my blog
- # [14:41] <gsnedders> Except, maybe, James Holderness (who I strongly suspect does)
- # [14:43] <hsivonen> hmm. V.nu parser perf sucks compared to Xerces
- # [14:43] <hsivonen> so badly that I suspect it is IO buffering and nothing in the algorithm
- # [14:46] <Philip`> Need more abstraction, so you can use the same IO buffering in both implementations
- # [14:50] * gsnedders hopes that email is pointless sending
- # [14:51] <gsnedders> http://lists.w3.org/Archives/Public/ietf-http-wg/2008AprJun/0124.html
- # [14:52] <gsnedders> (or rather, I hope that sending that email isn't pointless)
- # [14:52] <Philip`> Count the number of emails that have been sent in the past day; calculate how much better the world is today than it was yesterday; divide; conclude that all emails are almost entirely useless, and so you should stop writing them
- # [14:54] <gsnedders> That would be a time-saver.
- # [14:55] * gsnedders concludes he MUST reply to what a girl sent him ages ago
- # [14:55] <gsnedders> and apologise for being so damned slow.
- # [14:58] * gsnedders can't believe he actually just used an RFC2119 term there
- # [14:58] <hsivonen> Philip`: neither parser pegs the CPU, btw, which also points to IO
- # [14:59] <Philip`> It could also point to Thread.sleep calls, but I assume you've avoided doing that
- # [15:04] <annevk> zcorpan, there's more than one blog?
- # [15:05] <hsivonen> heh. the hotspot in V.nu is isNcname
- # [15:05] <hsivonen> which wouldn't be needed if the DOM impl. accepted any element name
- # [15:06] <annevk> isNcname is becoming easier in a few months, I think
- # [15:07] * hsivonen changes the test setup from DOM to SaxTree
- # [15:08] * Joins: ROBOd (n=robod@89.122.216.38)
- # [15:13] <zcorpan> SaxTree doesn't do such checks?
- # [15:15] <hsivonen> zcorpan: it doesn't
- # [15:16] * Quits: roc (n=roc@CPE-124-181-181-187.vic.bigpond.net.au) (Remote closed the connection)
- # [15:18] <hsivonen> hmm. I'll just try SAX with defaulthandler
- # [15:18] * Quits: myakura (n=myakura@p1215-ipbf3008marunouchi.tokyo.ocn.ne.jp) ("Leaving...")
- # [15:20] * Joins: roc (n=roc@CPE-124-181-181-187.vic.bigpond.net.au)
- # [15:23] <hsivonen> a java.util.regex-based isNcname is incredibly bad
- # [15:35] <hsivonen> looks like it's all about how often they go and read from the underlying FileInputStream
- # [15:39] * Quits: roc (n=roc@CPE-124-181-181-187.vic.bigpond.net.au)
- # [15:39] <hsivonen> Xerces has special UTF-8 decoding...
- # [15:43] <hsivonen> OK. I have created a bug in my bytes to UTF-8 buffering
- # [15:44] * Joins: csarven (i=csarven@on-irc.csarven.ca)
- # [15:45] * Parts: zcorpan (n=zcorpan@pat-tdc.opera.com)
- # [15:45] * Joins: zcorpan (n=zcorpan@pat-tdc.opera.com)
- # [15:45] <hsivonen> bytes to UTF-16 that is
- # [16:02] * Parts: frumioj (n=jkemp@cpe-69-205-56-47.nycap.res.rr.com) ("Ex-Chat")
- # [16:08] * Quits: virtuelv (n=virtuelv@pat-tdc.opera.com) ("Ex-Chat")
- # [16:16] * Joins: MacDome (n=eric@c-24-130-11-246.hsd1.ca.comcast.net)
- # [16:17] * Joins: qwert666_ (n=qwert666@acab97.neoplus.adsl.tpnet.pl)
- # [16:23] * Quits: MacDome (n=eric@c-24-130-11-246.hsd1.ca.comcast.net)
- # [16:26] * Joins: billmason (n=billmaso@ip150.unival.com)
- # [16:26] * Joins: phsiao (n=shawn@nat/ibm/x-97172210566073a0)
- # [16:33] * Quits: qwert666 (n=qwert666@acdi143.neoplus.adsl.tpnet.pl) (Connection timed out)
- # [16:39] * Quits: Lachy (n=Lachlan@pat-tdc.opera.com) ("This computer has gone to sleep")
- # [16:55] <hsivonen> Hixie: I can now confirm that not calling JDK intern() really makes a difference
- # [16:57] * Joins: aroben (n=aroben@unaffiliated/aroben)
- # [16:57] * Joins: Lachy (n=Lachlan@ti200710a340-2662.bb.online.no)
- # [16:58] * Joins: eseidel (n=eseidel@nat/google/x-92d553fd39d884af)
- # [16:59] * Quits: maikmerten (n=merten@ls5laptop14.cs.uni-dortmund.de) ("Verlassend")
- # [17:10] <hsivonen> Hixie: is this on your radar: https://bugzilla.mozilla.org/show_bug.cgi?id=427329#c7
- # [17:13] * Quits: KevinMarks (n=KevinMar@c-98-207-134-151.hsd1.ca.comcast.net) ("The computer fell asleep")
- # [17:31] * Quits: zcorpan (n=zcorpan@pat-tdc.opera.com) (Read error: 110 (Connection timed out))
- # [17:39] * Joins: aroben_ (n=aroben@unaffiliated/aroben)
- # [17:40] * Quits: aroben (n=aroben@unaffiliated/aroben) (Nick collision from services.)
- # [17:40] * aroben_ is now known as aroben
- # [18:12] * Quits: weinig|zZz (n=weinig@c-71-198-176-23.hsd1.ca.comcast.net)
- # [18:16] * Joins: jgraham_ (n=james@81-86-218-215.dsl.pipex.com)
- # [18:17] <annevk> hsivonen, I don't think we should start special casing the parser for that
- # [18:17] <annevk> fwiw
- # [18:51] * Quits: csarven (i=csarven@on-irc.csarven.ca) (Read error: 110 (Connection timed out))
- # [18:52] * Joins: heycam` (n=cam@210-84-23-247.dyn.iinet.net.au)
- # [18:56] * Joins: maikmerten (n=maikmert@Lb08c.l.pppool.de)
- # [18:57] * Quits: eseidel (n=eseidel@nat/google/x-92d553fd39d884af)
- # [19:02] * Quits: heycam (n=cam@124-168-34-12.dyn.iinet.net.au) (Read error: 101 (Network is unreachable))
- # [19:04] * Joins: eseidel (n=eseidel@nat/google/x-955877cc9a21c59b)
- # [19:11] * Quits: tantek_ (n=tantek@72.158.245.34) (Read error: 110 (Connection timed out))
- # [19:15] * Joins: csarven (i=csarven@on-irc.csarven.ca)
- # [19:50] * Quits: svl (n=me@ip565744a7.direct-adsl.nl) ("And back he spurred like a madman, shrieking a curse to the sky.")
- # [20:02] * Quits: othermaciej (n=mjs@dsl081-048-145.sfo1.dsl.speakeasy.net)
- # [20:22] * Joins: weinig (n=weinig@17.203.15.172)
- # [20:23] * Joins: hober (n=ted@adsl-75-13-203-9.dsl.sndg02.sbcglobal.net)
- # [20:31] * Quits: eseidel (n=eseidel@nat/google/x-955877cc9a21c59b)
- # [20:43] <hsivonen> annevk: btw, the NCName thing isn't getting any better per spec--only worse
- # [20:43] <hsivonen> annevk: the point of checking for NCNames is to avoid exceptions in existing software--not as much to comply with XML infosets
- # [20:45] <annevk> ah
- # [20:57] * Quits: hober (n=ted@adsl-75-13-203-9.dsl.sndg02.sbcglobal.net) ("ERC Version 5.2 (IRC client for Emacs)")
- # [21:01] * Joins: maikmerten_ (n=maikmert@T685f.t.pppool.de)
- # [21:01] * Joins: svl (n=me@ip565744a7.direct-adsl.nl)
- # [21:02] * Joins: andersca (n=andersca@nat/apple/x-1ceea0bd483dfdfa)
- # [21:08] * Quits: jruderman (n=jruderma@c-67-180-174-213.hsd1.ca.comcast.net)
- # [21:11] * Joins: Camaban_ (n=alee@85-211-226-141.dyn.gotadsl.co.uk)
- # [21:17] * Quits: maikmerten (n=maikmert@Lb08c.l.pppool.de) (Read error: 113 (No route to host))
- # [21:21] <Hixie> hsivonen: it's not clear to me that the parser is the problem
- # [21:21] * Joins: tommorris (n=tommorri@89.192.214.69)
- # [21:21] <hsivonen> Hixie: it's claimed that backing out the parser fix helps
- # [21:22] * Parts: tommorris (n=tommorri@89.192.214.69)
- # [21:22] <andersca> hey Hixie
- # [21:25] * Quits: weinig (n=weinig@17.203.15.172)
- # [21:26] * Joins: othermaciej (n=mjs@nat/apple/x-a2e98109d9caf86b)
- # [21:28] * Quits: Camaban (n=alee@85-211-226-141.dyn.gotadsl.co.uk) (Read error: 110 (Connection timed out))
- # [21:32] * Joins: weinig (n=weinig@17.255.111.152)
- # [21:36] <Hixie> hsivonen: i thought it was claimed that it didn't
- # [21:36] <Hixie> oh, my bad
- # [21:36] <Hixie> misread it
- # [21:40] * Joins: qwert666__ (n=qwert666@acab97.neoplus.adsl.tpnet.pl)
- # [21:42] * Joins: eseidel (n=eseidel@nat/google/x-e1a476627d6cf684)
- # [21:51] * Joins: jruderman (n=jruderma@guest-226.mountainview.mozilla.com)
- # [21:56] * Quits: Camaban_ (n=alee@85-211-226-141.dyn.gotadsl.co.uk) ("Ex-Chat")
- # [21:59] * Quits: qwert666_ (n=qwert666@acab97.neoplus.adsl.tpnet.pl) (Read error: 110 (Connection timed out))
- # [22:02] * Quits: weinig (n=weinig@17.255.111.152)
- # [22:08] * Quits: maikmerten_ (n=maikmert@T685f.t.pppool.de) ("Leaving")
- # [22:12] * eseidel is now known as eseidelHack
- # [22:14] * Joins: hober (n=ted@unaffiliated/hober)
- # [22:25] * Quits: webben (n=benh@nat/yahoo/x-ef5334904e12e7ad) (Connection timed out)
- # [22:40] * Joins: weinig (n=weinig@17.203.15.172)
- # [22:42] * Quits: ROBOd (n=robod@89.122.216.38) ("http://www.robodesign.ro")
- # [22:43] <hsivonen> http://typophile.com/node/43971
- # [22:44] <annevk> I wonder if it's really embedding
- # [22:44] * annevk was just reading that
- # [22:47] <annevk> http://lists.w3.org/Archives/Public/www-style/2007Dec/thread.html#msg84
- # [22:47] * Quits: jruderman (n=jruderma@guest-226.mountainview.mozilla.com)
- # [22:56] <Hixie> ok wtf
- # [22:57] <Hixie> "REPORT /webapps/!svn/bc/1409/source HTTP/1.1" is taking up insane amounts of CPU on my box
- # [22:57] <annevk> maybe html5.org?
- # [22:58] <jgraham> Er, that would be me
- # [22:58] <Hixie> aha!
- # [22:58] <Hixie> the magic of irc
- # [22:58] <jgraham> I didn't realise it would take up CPU on your box
- # [22:58] <Hixie> jgraham: go ahead, it's ok
- # [22:58] * jgraham is ignorant
- # [22:58] <Hixie> jgraham: i'm sure you have legitimate reasons for it :-)
- # [22:58] <Hixie> jgraham: just making sure it wasn't some runaway script or something
- # [22:59] <Hixie> what is REPORT, anyway?
- # [22:59] <Hixie> svn blame?
- # [22:59] <jgraham> I was just wondering why html5lib's EOF handling appears to be different to the spec
- # [22:59] <jgraham> Hixie: yep
- # [22:59] <Hixie> cool
- # [23:01] <hsivonen> I wonder why multiple ns doesn't go to XHTML5 validation: http://www.w3.org/2008/03/validators-chart
- # [23:01] * Quits: eseidelHack (n=eseidel@nat/google/x-e1a476627d6cf684)
- # [23:03] <Hixie> i wonder why text/html with DTD doesn't go to (x)html5 validator
- # [23:04] <Hixie> in fact that whole thing is WAY more complex than necessary or desirable
- # [23:04] <Hixie> where does it come from?
- # [23:05] <annevk> W3C :)
- # [23:06] <hsivonen> Hixie: http://lists.w3.org/Archives/Public/www-tag/2008Apr/0017.html
- # [23:06] <annevk> http://lists.w3.org/Archives/Public/www-validator/2008Apr/0014.html ?
- # [23:06] <annevk> Web page study: http://nikitathespider.com/articles/ByTheNumbers/
- # [23:06] * Quits: jgraham_ (n=james@81-86-218-215.dsl.pipex.com)
- # [23:06] * Joins: eseidel (n=eseidel@nat/google/x-c377ac04bf3ca60d)
- # [23:07] <Hixie> ah
- # [23:07] <hsivonen> enabling NVDL in Valdator.nu seems to be only a tiny bit of hacking away
- # [23:08] <hsivonen> once again there's some bad entity resolving that I need to fix
- # [23:14] * Quits: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com) (Read error: 110 (Connection timed out))
- # [23:15] <hober> this is awesome: http://nikitathespider.com/articles/ByTheNumbers/0803/MediaTypes.png
- # [23:15] <Hixie> looks basically like the numbers i got
- # [23:15] <Hixie> iirc i got 0.0044% to 0.2% depending on what kinds of pages i included
- # [23:16] <Hixie> (lower if i focused on the actively maintained web, higher if i included everything i could)
- # [23:17] * Joins: dbaron (n=dbaron@guest-226.mountainview.mozilla.com)
- # [23:17] <Philip`> I got 0.03% application/xhtml+xml from dmoz.org
- # [23:17] <Philip`> (and 99.8% text/html)
- # [23:18] * gsnedders needs to get more HTTP headers :P
- # [23:18] <Philip`> gsnedders: Why? :-)
- # [23:18] <gsnedders> I mean, 1.1 million is nothing
- # [23:18] <Philip`> Depends on what you want to do with it
- # [23:18] <gsnedders> write a spec! :P
- # [23:19] <Hixie> i can never think of good examples for data-*
- # [23:19] <Philip`> People do real statistics with a sample size of hundreds - you don't always need billions :-)
- # [23:19] <gsnedders> Philip`: I know :)
- # [23:19] <gsnedders> Philip`: But it is all the edge cases that are helpful to have a large sample size for
- # [23:20] <Philip`> The web must be fractal, since you always find more edge cases when you look in more detail
- # [23:20] <Hixie> for writing the parser, i found that testing implementations was more useful than the data from the web
- # [23:20] <Hixie> but for defining new features, the data is invaluable
- # [23:21] <Hixie> i don't understand how people wrote specs without
- # [23:21] <gsnedders> Speaking of implementations, I need to email a guy at Opera
- # [23:21] <gsnedders> But I heart my left hand, and typig is slower than normal
- # [23:22] <gsnedders> typing, eve
- # [23:22] <gsnedders> *even
- # [23:22] <Philip`> Do you mean s/heart/hurt/ ?
- # [23:22] <gsnedders> yes
- # [23:23] <Hixie> Philip`, you're a braver man than i. i wasn't going to touch that one with a barge pole.
- # [23:23] * gsnedders wonders what that one is
- # [23:23] * gsnedders looks on the lists
- # [23:23] <hober> I imagine it was the s/// above
- # [23:23] <gsnedders> ah
- # [23:23] <gsnedders> oh dear…
- # [23:24] <gsnedders> now I realise…
- # [23:24] * Parts: hasather_ (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [23:24] <gsnedders> I would say I ought to go hide in a corner because I didn't realise, but in this case, that's the wrong thing to say.
- # [23:27] <gsnedders> Hixie: Can I call you sick for just thinking of that?
- # [23:27] <hober> indeed.
- # [23:31] <gsnedders> Now, let me leave before I make an even more regrettable fuck up.
- # [23:32] * Quits: met_ (n=Hassman@r5bx220.net.upc.cz) ("Chemists never die, they just stop reacting.")
- # [23:34] * Joins: jruderman (n=jruderma@guest-226.mountainview.mozilla.com)
- # [23:35] * Joins: webben (n=benh@91.84.239.113)
- # [23:39] * Quits: qwert666__ (n=qwert666@acab97.neoplus.adsl.tpnet.pl) ("Leaving")
- # [23:41] * Joins: webben_ (n=benh@dip5-fw.corp.ukl.yahoo.com)
- # [23:41] <Hixie> annevk: what specs are you editor of these days?
- # [23:43] * Joins: blooberry (n=brian@c-76-126-194-196.hsd1.ca.comcast.net)
- # [23:44] <Hixie> hey bloo
- # [23:44] * Quits: phsiao (n=shawn@nat/ibm/x-97172210566073a0)
- # [23:44] <blooberry> hey hixie. 8-}
- # [23:44] <Hixie> wassup dude
- # [23:44] <blooberry> statistics.
- # [23:44] <blooberry> (trying to figure out how to present data and things)
- # [23:45] <Hixie> good times
- # [23:46] <blooberry> if you say so. ;-} *visions of standard deviations dancing through my head*
- # [23:49] * Quits: dbaron (n=dbaron@guest-226.mountainview.mozilla.com) (Read error: 110 (Connection timed out))
- # [23:49] <Philip`> Just say "the error bars are too small to show on this graph"
- # [23:49] <blooberry> I like that. 8-}
- # [23:49] <Hixie> hah
- # [23:50] <andersca> hey Hixie
- # [23:50] <Hixie> hey
- # [23:53] * Joins: roc (n=roc@CPE-58-168-130-187.vic.bigpond.net.au)
- # [23:54] <jgraham> It's always good if you can claim the error bars aren't meaningful
- # [23:55] <Hixie> it's not at all clear to me what my error bars should actually be on some of my stats
- # [23:55] <Hixie> i mean, i can tell you exactly what the count was for the n billion pages
- # [23:55] <Hixie> it's not an estimate
- # [23:55] <Hixie> but since it's just a biased sample of an infinite number of pages...
- # [23:55] <Hixie> i don't know what to conclude
- # [23:56] <jgraham> Hixie: The error bars are supposed to represent the error on the population average based on the properties of your sample
- # [23:57] <jgraham> But since, as you note, you have a biased sample of the population it's not clear what that actually means
- # [23:57] <Hixie> so if n out of N pages had property X, what's the error on the population average for the property X?
- # [23:57] * Quits: webben (n=benh@91.84.239.113) (Read error: 110 (Connection timed out))
- # [23:57] <andersca> Hixie: I have another application cache question for you
- # [23:57] <Hixie> go for it
- # [23:58] <andersca> Hixie: about the networking model
- # [23:58] <andersca> Hixie: so when a browsing context is associated with an application cache, all loads should go through the cache
- # [23:59] <Hixie> with the caveats defined in 4.6.5.1. Changes to the networking model, yes
- # [23:59] <andersca> yeah
- # Session Close: Tue Apr 08 00:00:00 2008
The end :)