- # Session Start: Sat Apr 25 00:00:00 2009
- # Session Ident: #whatwg
- # [00:03] * Parts: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:03] * Joins: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:04] * Parts: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:05] * Quits: zdobersek (n=zan@cpe-92-37-64-139.dynamic.amis.net) ("Leaving.")
- # [00:05] * Joins: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:06] * Parts: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:06] * Joins: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:16] * Joins: gmiernicki (n=gmiernic@unaffiliated/gmiernicki)
- # [00:21] * Parts: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:22] * Joins: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:26] * Quits: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com) (Read error: 104 (Connection reset by peer))
- # [00:26] * Joins: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:27] * Quits: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com) (Remote closed the connection)
- # [00:27] * Joins: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:33] * Quits: virtuelv (n=virtuelv@95.34.170.26.customer.cdi.no) (Read error: 110 (Connection timed out))
- # [00:33] * Quits: Maurice (i=copyman@5ED548D4.cable.ziggo.nl) ("Disconnected...")
- # [00:34] * Parts: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:35] * Joins: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:36] * Parts: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:36] * Joins: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:42] * Joins: weinig (n=weinig@nat/apple/x-cac6edf73a1dfe8d)
- # [00:43] * Quits: weinig (n=weinig@nat/apple/x-cac6edf73a1dfe8d) (Client Quit)
- # [00:54] * Quits: aroben (n=aroben@unaffiliated/aroben) (Read error: 104 (Connection reset by peer))
- # [00:54] * Parts: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [00:54] * Joins: davidb (n=davidb@bas4-toronto06-1242458409.dsl.bell.ca)
- # [00:54] * Joins: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [01:07] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [01:13] * Quits: dglazkov (n=dglazkov@nat/google/x-ca5244171d05b519)
- # [01:14] * Quits: roc_ (n=roc@121-72-195-129.dsl.telstraclear.net)
- # [01:31] * Joins: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [01:31] * Quits: bgalbraith (n=bgalbrai@corp-241.mountainview.mozilla.com)
- # [01:31] * Quits: mpt (n=mpt@canonical/launchpad/mpt) (Read error: 113 (No route to host))
- # [01:33] * Quits: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net) (Client Quit)
- # [01:36] * Quits: cgriego (n=cgriego@out-02.hotels.com) (Read error: 110 (Connection timed out))
- # [02:10] * Quits: slightlyoff (n=slightly@nat/google/x-4e976876b6432a46) (Read error: 60 (Operation timed out))
- # [02:21] * Quits: ZombieLoffe (n=e@unaffiliated/zombieloffe)
- # [02:36] * Quits: starjive (i=beos@213-66-216-93-no30.tbcn.telia.com)
- # [02:37] * Quits: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com) (Read error: 110 (Connection timed out))
- # [02:40] * Parts: dave_levin (n=dave_lev@72.14.227.1)
- # [02:42] * Quits: dbaron (n=dbaron@corp-241.mountainview.mozilla.com) ("8403864 bytes have been tenured, next gc will be global.")
- # [02:48] * Quits: Amorphous (i=jan@unaffiliated/amorphous) (Read error: 110 (Connection timed out))
- # [02:51] * Joins: Amorphous (i=jan@unaffiliated/amorphous)
- # [02:51] * Joins: xydyx (n=hdh@58.187.17.196)
- # [02:56] * Joins: sid0_ (n=sid0@202.3.77.136)
- # [02:57] * Quits: dimich (n=dimich@72.14.227.1)
- # [02:59] * Joins: slightlyoff_ (n=slightly@72.14.224.1)
- # [03:00] * Quits: davidb (n=davidb@bas4-toronto06-1242458409.dsl.bell.ca)
- # [03:00] * Quits: hdh (n=hdh@58.187.17.196) (Read error: 60 (Operation timed out))
- # [03:01] * Joins: dimich (n=dimich@72.14.227.1)
- # [03:01] * Joins: slightlyoff__ (n=slightly@67.218.104.46)
- # [03:02] * Joins: sid0__ (n=sid0@202.3.77.136)
- # [03:07] * Quits: sid0 (n=sid0@unaffiliated/sid0) (Remote closed the connection)
- # [03:07] * jcranmer is now known as jcranmer|SUPPER
- # [03:11] * Quits: slightlyoff__ (n=slightly@67.218.104.46)
- # [03:15] * Quits: sid0_ (n=sid0@unaffiliated/sid0) (Remote closed the connection)
- # [03:22] * Quits: slightlyoff_ (n=slightly@72.14.224.1) (Read error: 110 (Connection timed out))
- # [03:30] * jcranmer|SUPPER is now known as jcranmer
- # [03:39] * Joins: Marianoe (n=Mariano@adsl-99-24-230-202.dsl.emhril.sbcglobal.net)
- # [03:43] * Quits: dimich (n=dimich@72.14.227.1)
- # [03:43] * Joins: dbaron (n=dbaron@c-98-234-51-190.hsd1.ca.comcast.net)
- # [03:56] * Quits: Marianoe (n=Mariano@adsl-99-24-230-202.dsl.emhril.sbcglobal.net)
- # [04:11] * Joins: olliej_ (n=oliver@17.246.18.56)
- # [04:13] * Joins: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net)
- # [04:19] * Quits: olliej (n=oliver@17.203.15.141) (Read error: 110 (Connection timed out))
- # [04:21] * Joins: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net)
- # [04:24] * Quits: taf2 (n=taf2@static-71-127-149-10.bltmmd.fios.verizon.net)
- # [04:25] * Quits: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net) (Client Quit)
- # [04:28] * Quits: olliej_ (n=oliver@17.246.18.56)
- # [04:39] * Joins: myakura (n=myakura@p1063-ipbf3305marunouchi.tokyo.ocn.ne.jp)
- # [04:42] * Quits: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net)
- # [04:47] * Joins: danbri (n=danbri@89.130.83.193)
- # [04:49] * Quits: karlcow (n=karl@nerval.la-grange.net) (Remote closed the connection)
- # [04:51] * Joins: danbri_ (n=danbri@89.130.83.193)
- # [04:56] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [04:59] * Joins: karlcow (n=karl@nerval.la-grange.net)
- # [05:05] * Quits: danbri (n=danbri@unaffiliated/danbri) (Read error: 110 (Connection timed out))
- # [05:06] * Joins: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net)
- # [05:09] * Quits: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net) (Client Quit)
- # [05:11] * Joins: slightlyoff (n=slightly@204.14.154.244)
- # [05:13] * Joins: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [05:21] * Quits: jwalden (n=waldo@corp-241.mountainview.mozilla.com) ("->home")
- # [05:29] * Joins: doublec (n=doublec@118-93-163-62.dsl.dyn.ihug.co.nz)
- # [05:37] * Joins: davidb (n=davidb@bas4-toronto06-1242458409.dsl.bell.ca)
- # [05:37] * Quits: davidb (n=davidb@bas4-toronto06-1242458409.dsl.bell.ca) (Client Quit)
- # [06:39] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [06:41] * Joins: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net)
- # [06:59] * Joins: zalan (n=kvirc@catv-80-99-193-98.catv.broadband.hu)
- # [07:10] * Quits: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [07:12] * Quits: doublec (n=doublec@118-93-163-62.dsl.dyn.ihug.co.nz) ("Leaving")
- # [07:24] * Quits: xydyx (n=hdh@58.187.17.196) (Read error: 104 (Connection reset by peer))
- # [07:26] * Joins: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net)
- # [07:35] * Joins: mlpug (n=mlpug@a91-156-60-13.elisa-laajakaista.fi)
- # [07:40] * Quits: Hixie (i=ianh@trivini.no) ("brb")
- # [07:41] * Joins: Hixie (i=ianh@trivini.no)
- # [07:48] * Joins: doublec (n=doublec@118-93-163-62.dsl.dyn.ihug.co.nz)
- # [07:56] * Quits: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net)
- # [07:57] * Joins: virtuelv (n=virtuelv@95.34.170.26.customer.cdi.no)
- # [07:58] * Quits: virtuelv (n=virtuelv@95.34.170.26.customer.cdi.no) (Client Quit)
- # [07:58] * Joins: virtuelv (n=virtuelv@95.34.170.26.customer.cdi.no)
- # [08:02] * Quits: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net)
- # [08:11] * Quits: mlpug (n=mlpug@a91-156-60-13.elisa-laajakaista.fi) (Remote closed the connection)
- # [08:13] * Quits: virtuelv (n=virtuelv@95.34.170.26.customer.cdi.no) ("Ex-Chat")
- # [08:29] * Joins: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net)
- # [08:59] * Joins: annevk5 (n=annevk@85.196.122.246)
- # [09:03] * Joins: ap (n=ap@194.154.88.45)
- # [09:10] * Joins: zdobersek (n=zan@cpe-92-37-76-143.dynamic.amis.net)
- # [09:14] * Quits: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net)
- # [09:21] * Joins: doublec_ (n=doublec@118-93-168-138.dsl.dyn.ihug.co.nz)
- # [09:32] * Joins: zdobersek1 (n=zan@cpe-92-37-65-85.dynamic.amis.net)
- # [09:34] * Joins: yorick (n=Yorick@85.146.77.160)
- # [09:35] <yorick> the html5 boolean attributes seem to be inconsistent with the html4 ones
- # [09:36] <yorick> html4: <option selected="selected">contents</option>
- # [09:36] <yorick> html5: <div draggable="true">contents</div>
- # [09:39] * Quits: doublec (n=doublec@118-93-163-62.dsl.dyn.ihug.co.nz) (Read error: 110 (Connection timed out))
- # [09:39] * Joins: slightlyoff_ (n=slightly@216.239.44.65)
- # [09:41] * slightlyoff_ is now known as slightlyoff_afk
- # [09:42] <yorick> also, is there a possibility to set DataTransfer on dragstart to insensitive, so it can be accessed when dragging over something?
- # [09:45] * Quits: dbaron (n=dbaron@c-98-234-51-190.hsd1.ca.comcast.net) ("8403864 bytes have been tenured, next gc will be global.")
- # [09:49] * Quits: zdobersek (n=zan@cpe-92-37-76-143.dynamic.amis.net) (Read error: 113 (No route to host))
- # [09:49] <jgraham> Philip`: Can you get data on things that start <!-- but with a > and no --> before the end of the document
- # [09:49] * jgraham isn't quite sure how to express that as a regexp
- # [09:56] * Joins: Maurice (i=copyman@5ED548D4.cable.ziggo.nl)
- # [09:56] * Quits: slightlyoff (n=slightly@204.14.154.244) (Read error: 110 (Connection timed out))
- # [10:00] * Quits: zdobersek1 (n=zan@cpe-92-37-65-85.dynamic.amis.net) (Read error: 104 (Connection reset by peer))
- # [10:06] * Joins: zdobersek (n=zan@cpe-92-37-65-85.dynamic.amis.net)
- # [10:11] * Joins: roc (n=roc@121-72-162-81.dsl.telstraclear.net)
- # [10:19] <annevk5> yorick, draggable is not a boolean attribute per HTML5
- # [10:21] * Joins: gsnedders (n=gsnedder@host86-136-52-180.range86-136.btcentralplus.com)
- # [10:21] * Joins: ROBOd (n=robod@89.122.216.38)
- # [10:21] * Quits: riven (n=colin@pdpc/supporter/professional/riven) (Read error: 110 (Connection timed out))
- # [10:22] * Joins: roc_ (n=roc@121-72-162-81.dsl.telstraclear.net)
- # [10:22] * Quits: roc (n=roc@121-72-162-81.dsl.telstraclear.net) (Read error: 104 (Connection reset by peer))
- # [10:31] * sid0__ is now known as sid0
- # [10:32] * Joins: ZombieLoffe (n=e@unaffiliated/zombieloffe)
- # [10:32] <yorick> annevk5: then what is it?
- # [10:43] * Joins: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se)
- # [10:44] <zcorpan> yorick: an enumerated attribute
- # [11:00] * zcorpan adds another entry to http://wiki.whatwg.org/wiki/HTML5_Presentations
- # [11:02] * Parts: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se)
- # [11:03] * Joins: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se)
- # [11:14] * Quits: annevk5 (n=annevk@85.196.122.246)
- # [11:20] * Quits: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se) (Read error: 60 (Operation timed out))
- # [11:24] * Quits: yorick (n=Yorick@85.146.77.160) ("Poef!")
- # [11:40] * Quits: zalan (n=kvirc@catv-80-99-193-98.catv.broadband.hu) ("KVIrc 3.4.0 Virgo http://www.kvirc.net/")
- # [11:45] * Joins: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se)
- # [11:46] * Joins: sid0_ (n=sid0@202.3.77.136)
- # [11:49] * Joins: grimboy (n=grimboy@78-86-152-156.zone2.bethere.co.uk)
- # [11:58] * Quits: danbri_ (n=danbri@unaffiliated/danbri)
- # [11:58] * Quits: sid0 (n=sid0@unaffiliated/sid0) (Remote closed the connection)
- # [11:58] * Quits: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se) (Remote closed the connection)
- # [11:59] * Joins: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se)
- # [12:04] * Quits: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se) (Remote closed the connection)
- # [12:15] * Quits: ap (n=ap@194.154.88.45)
- # [12:30] * Quits: Rik` (n=Rik@pha75-2-81-57-187-57.fbx.proxad.net) (Remote closed the connection)
- # [12:32] * Joins: Rik` (n=Rik@pha75-2-81-57-187-57.fbx.proxad.net)
- # [12:45] <gsnedders> Ergh…
- # [12:45] * gsnedders doesn't want to sign up to another bug tracker to report a bug in Validator.nu
- # [13:02] <Philip`> jgraham: You mean something like, um, /<!--[^>]*(?<!-->)>([^>]|(?<!-->)>)*$/ perhaps?
- # [13:03] <Philip`> Whoops, not that one
- # [13:03] <Philip`> /<!--[^>]*(?<!--)>([^>]|(?<!--)>)*$/
- # [13:06] <jgraham> Philip`: Perhaps
- # [13:07] <jgraham> If that matches "<!-- foo > bar" and "<!-- foo>" but not "<!-- foo > bar -->"
- # [13:08] <Philip`> It does
- # [13:08] * Philip` should take this opportunity to hook his grep tool up to his new set of pages...
- # [13:08] <jgraham> Ah, well that sounds like what I want then
- # [13:09] * Philip` notes that "(?<!--)" confusingly has nothing to do with the string "<!--", it's just a negative lookbehind assertion on the string "--"
- # [13:10] <jgraham> Ah, that makes a little more sense
- # [13:10] <gsnedders> And this is why you shouldn't use regex to parse HTML P
- # [13:10] <gsnedders> * :P
- # [13:11] <jgraham> gsnedders: No this is why should shouldn't have ultra-weird comment parsing
- # [13:11] <gsnedders> jgraham: It's saner than SGML.
- # [13:11] <jgraham> which requires lookahead
- # [13:12] <jgraham> s/should/you/
- # [13:13] <gsnedders> I don't. I have perfectly sane comment parsing, thank you very much.
- # [13:14] <jgraham> gsnedders: You use a sophisticated biological neural network to parse comments and it's not even that reliable. How can you describe that as sane?
- # [13:14] <gsnedders> jgraham: Through my own insaity.
- # [13:14] <gsnedders> *insanity.
- # [13:18] <Philip`> gsnedders: Parsing it's easy, it's just like /<!(-?>|--.*?-->)/
- # [13:18] <Philip`> s/it's/is/
- # [13:19] <Philip`> Uh
- # [13:19] * Joins: MikeSmith (n=MikeSmit@p4bfc04.tokynt01.ap.so-net.ne.jp)
- # [13:19] <Philip`> gsnedders: Parsing it's easy, it's just like /<!(-?>|([^-]|--).*?-->)/
- # [13:19] <Philip`> s/it's/is/
- # [13:19] <Philip`> or something like that
- # [13:20] <Philip`> but anyway it's easy
- # [13:20] <MikeSmith> gsnedders: I'm looking at the class="" bug now
- # [13:20] <Philip`> The difficulty is trying to match things that are *not* matched by the normal state machine
- # [13:21] * Joins: maikmerten (n=maikmert@U27b4.u.pppool.de)
- # [13:32] <gsnedders> How the hell do you get script to validate in HTML 4.01?
- # [13:32] * gsnedders stabs SGML
- # [13:35] <Philip`> Use <script src>
- # [13:35] <Philip`> If you want inline scripts, use <script src="data:text/javascript,...">
- # [13:38] <Dashiva> gsnedders: Just rephrase all your less-than tests :)
- # [13:47] <MikeSmith> from the HTML4 spec, I can't tell whether id and class are allowed to be empty or not
- # [13:51] <Dashiva> It says "must begin with a letter" for id
- # [13:52] <Dashiva> But it's taken from the SGML spec, so would probably have to look there
- # [13:54] <MikeSmith> hmm, the XHTML DTD defines the value of class as NMTOKENS
- # [13:54] <gsnedders> Yay! More undocumented differences between XHTML 1.0 and HTML 4.01!
- # [13:56] <MikeSmith> I think as far as the HTML4 and XHTML1 specs are concerned, the value of class can't be empty
- # [13:56] <Dashiva> Can't class be a list of zero class names?
- # [13:57] <MikeSmith> Dashiva: not as far as I can see, as far as XML goes
- # [13:58] <MikeSmith> http://www.w3.org/TR/REC-xml/#NT-Nmtoken
- # [13:58] <MikeSmith> Nmtokens ::= Nmtoken (#x20 Nmtoken)*
- # [13:58] <MikeSmith> Nmtoken ::= (NameChar)+
- # [13:58] * Joins: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [13:58] <MikeSmith> and XHTML1 DTD says:
- # [13:59] <Dashiva> Fair enough. Where does it say it's NMTOKENS? xhtml1 says CDATA where I'm looking
- # [13:59] <MikeSmith> "class NMTOKENS #IMPLIED
- # [13:59] <MikeSmith> I'm looking at a DTD on my local system
- # [14:00] <MikeSmith> XHTML Modularization
- # [14:00] <gsnedders> Well, it claims to be an XHTML 1.0 schema, not an XHTML Mod. one
- # [14:01] <Dashiva> What I found: http://www.w3.org/TR/xhtml1/dtds.html#dtdentry_xhtml1-strict.dtd_coreattrs
- # [14:01] <Dashiva> I guess it's obsolete
- # [14:01] <MikeSmith> Dashiva: yeah, that's what I'm looking at now
- # [14:01] <MikeSmith> well, it's right, even if it's obsolete
- # [14:02] <MikeSmith> and XHTML modularization is wrong
- # [14:02] <MikeSmith> I mean, in practice at least
- # [14:02] <Dashiva> YSOD on empty class attribute? :)
- # [14:02] <MikeSmith> YSOD?
- # [14:03] <Dashiva> Yellow screen of death
- # [14:03] <MikeSmith> heh
- # [14:06] <MikeSmith> the HTML 4.01 DTD used by the W3C validator says "class CDATA #IMPLIED"
- # [14:07] <MikeSmith> and all the XHTML1 DTDs it uses say the same
- # [14:08] <MikeSmith> but the XHTML 1.1 DTD it uses says "class NMTOKENS #IMPLIED"
- # [14:08] <MikeSmith> so those mooncalfs apparently redefined it in XHTML 1.1
- # [14:09] <MikeSmith> I think in validator.nu we should preserve that brokenness
- # [14:10] <MikeSmith> to learn people not to bother validating against XHTML 1.1
- # [14:11] <MikeSmith> but I see hsivonen had the foresight to not even include an XHTML1.1-checking option in validator.nu
- # [14:12] <MikeSmith> I wonder if they bothered to document this in the XHTML 1.1 spec
- # [14:13] <MikeSmith> nope
- # [14:13] <MikeSmith> http://www.w3.org/TR/xhtml11/changes.html#a_changes
- # [14:22] * Quits: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com) (Remote closed the connection)
- # [14:22] * Joins: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [14:22] <Philip`> jgraham: With that search thing, how many results do you want?
- # [14:26] <Philip`> jgraham: Actually, you can have them all
- # [14:27] <Philip`> jgraham: http://philip.html5.org/data/comments-not-closed-but-with-a-gt-after-them.txt
- # [14:27] * Quits: roc_ (n=roc@121-72-162-81.dsl.telstraclear.net)
- # [14:30] <jgraham> Philip`: Great, thanks
- # [14:38] * Philip` hopes his regexp isn't wrong
- # [14:44] * Joins: taf2 (n=taf2@static-71-127-149-10.bltmmd.fios.verizon.net)
- # [14:45] * Quits: karlcow (n=karl@nerval.la-grange.net) ("This computer has gone to sleep")
- # [14:46] * jgraham hasn't checked yet
- # [14:47] <Philip`> <!doctype html> seems to be one of those most popular HTML5 features
- # [14:47] * Philip` sees it on 64 distinct domains
- # [14:48] <Philip`> like last.fm and pear.php.net and maps.google.com and edward.oconnor.cx and help.godaddy.com
- # [14:49] <Philip`> s/those most/the most/
- # [15:04] * Quits: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com) (Read error: 110 (Connection timed out))
- # [15:09] * Quits: taf2 (n=taf2@static-71-127-149-10.bltmmd.fios.verizon.net)
- # [15:11] * Quits: zdobersek (n=zan@cpe-92-37-65-85.dynamic.amis.net) (Read error: 110 (Connection timed out))
- # [15:12] * Quits: doublec_ (n=doublec@118-93-168-138.dsl.dyn.ihug.co.nz) ("Leaving")
- # [15:13] * Joins: zdobersek (n=zan@cpe-92-37-77-246.dynamic.amis.net)
- # [15:18] * Joins: zcorpan (n=zcorpan@83.252.196.43)
- # [15:25] * Quits: maikmerten (n=maikmert@U27b4.u.pppool.de) (Read error: 110 (Connection timed out))
- # [15:30] <MikeSmith> gsnedders: http://bugzilla.validator.nu/attachment.cgi?id=85
- # [15:34] <MikeSmith> hsivonen: ↑
- # [15:35] * Quits: zcorpan (n=zcorpan@83.252.196.43) (Read error: 110 (Connection timed out))
- # [15:38] <MikeSmith> http://bugzilla.validator.nu/attachment.cgi?id=86
- # [15:47] * Joins: maikmerten (n=maikmert@U27b4.u.pppool.de)
- # [15:52] * Quits: grimboy (n=grimboy@78-86-152-156.zone2.bethere.co.uk) (Client Quit)
- # [15:53] * Joins: grimboy (n=grimboy@78-86-152-156.zone2.bethere.co.uk)
- # [15:55] * Quits: Maurice (i=copyman@5ED548D4.cable.ziggo.nl) ("Disconnected...")
- # [15:59] * Joins: taf2 (n=taf2@static-71-127-149-10.bltmmd.fios.verizon.net)
- # [16:02] * Quits: ZombieLoffe (n=e@unaffiliated/zombieloffe)
- # [16:07] * Joins: wakaba (n=wakaba@EM114-51-32-233.pool.e-mobile.ne.jp)
- # [16:20] * Joins: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [16:20] * Quits: MikeSmith (n=MikeSmit@p4bfc04.tokynt01.ap.so-net.ne.jp) ("Tomorrow to fresh woods, and pastures new.")
- # [16:23] * Joins: riven (n=colin@5ED0BC66.cable.ziggo.nl)
- # [16:23] * Joins: annevk5 (n=annevk@85.196.122.246)
- # [16:29] * Quits: taf2 (n=taf2@static-71-127-149-10.bltmmd.fios.verizon.net)
- # [16:46] * Joins: hdh (n=hdh@58.187.16.54)
- # [16:51] * Quits: hdh (n=hdh@58.187.16.54) (Remote closed the connection)
- # [16:51] * Joins: karlcow (n=karl@nerval.la-grange.net)
- # [17:05] * Joins: zdobersek1 (n=zan@cpe-92-37-73-138.dynamic.amis.net)
- # [17:07] * Quits: zdobersek (n=zan@cpe-92-37-77-246.dynamic.amis.net) (Read error: 110 (Connection timed out))
- # [17:08] * Joins: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se)
- # [17:11] * Joins: MikeSmith (n=MikeSmit@EM114-48-175-178.pool.e-mobile.ne.jp)
- # [17:11] * Quits: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net) (Read error: 110 (Connection timed out))
- # [17:19] <zcorpan> jgraham: have you documented <!-- and --> in web ecmascript yet?
- # [17:21] <Philip`> Hmm, I've seeded that dotnetdotcom index of web pages for almost 24 hours, and there's been zero connections
- # [17:21] <Philip`> Torrents aren't so useful for files that nobody wants to download anyway
- # [17:44] <jgraham> zcorpan: No, good point
- # [17:45] <jgraham> Philip`: I would quite like to download it but I'm not sure that it's a good idea over my crappy/bandwidth limited home connection
- # [17:46] * Quits: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se)
- # [17:46] <Philip`> jgraham: They provide plain HTTP download too, which might be more compatible with your connection
- # [17:47] <Philip`> and it's only 2.5GB, which is smaller than they claim
- # [17:49] <Philip`> And if you don't want all of it, you could just download the first N megabytes and discard the last entry
- # [17:49] <Philip`> where N is the largest value that still is considered a good idea to download
- # [17:50] * Joins: danbri (n=danbri@s5590d015.adsl.wanadoo.nl)
- # [18:14] * Joins: taf2 (n=taf2@static-71-127-149-10.bltmmd.fios.verizon.net)
- # [18:16] * Joins: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net)
- # [18:30] * Joins: wakaba_ (n=wakaba@EM114-51-19-23.pool.e-mobile.ne.jp)
- # [18:47] * Quits: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net)
- # [18:50] * Quits: wakaba (n=wakaba@EM114-51-32-233.pool.e-mobile.ne.jp) (Read error: 110 (Connection timed out))
- # [18:51] * Joins: zdobersek (n=zan@cpe-92-37-74-217.dynamic.amis.net)
- # [18:54] * Joins: dbaron (n=dbaron@c-98-234-51-190.hsd1.ca.comcast.net)
- # [18:58] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [19:06] * Quits: zdobersek1 (n=zan@cpe-92-37-73-138.dynamic.amis.net) (No route to host)
- # [19:30] * Joins: hasather (n=hasather@90-231-107-133-no62.tbcn.telia.com)
- # [19:50] * Joins: hdh (n=hdh@58.187.16.54)
- # [19:52] * Joins: Maurice (i=copyman@5ED548D4.cable.ziggo.nl)
- # [19:55] * Quits: dolske (n=dolske@firefox/developer/dolske)
- # [19:57] * Joins: dolske (n=dolske@c-76-103-40-203.hsd1.ca.comcast.net)
- # [20:10] * Quits: arun__ (n=arun@adsl-75-37-31-202.dsl.pltn13.sbcglobal.net)
- # [20:11] * Joins: arun_ (n=arun@adsl-75-37-31-202.dsl.pltn13.sbcglobal.net)
- # [20:14] * Quits: gsnedders (n=gsnedder@host86-136-52-180.range86-136.btcentralplus.com)
- # [20:18] * Quits: arun_ (n=arun@adsl-75-37-31-202.dsl.pltn13.sbcglobal.net) (Read error: 60 (Operation timed out))
- # [20:24] * Joins: gsnedders (n=gsnedder@host86-136-52-180.range86-136.btcentralplus.com)
- # [20:28] * Joins: mlpug (n=mlpug@a91-156-60-13.elisa-laajakaista.fi)
- # [20:39] * Joins: zalan (n=kvirc@catv-80-99-193-98.catv.broadband.hu)
- # [20:49] * Quits: myakura (n=myakura@p1063-ipbf3305marunouchi.tokyo.ocn.ne.jp) ("Leaving...")
- # [20:50] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [21:15] * Quits: maikmerten (n=maikmert@U27b4.u.pppool.de) (Client Quit)
- # [21:30] * Joins: ap (n=ap@194.154.88.45)
- # [21:54] <jgraham> Hmm, unless I am missing something, it looks like (almost?) all the cases of <!-- not followed by --> are in <script> blocks
- # [21:54] <Philip`> I looked at three, and saw one which wasn't
- # [21:54] <Philip`> but I don't know which one that was
- # [21:54] * Joins: olliej (n=oliver@17.203.15.141)
- # [21:55] * Joins: slightlyoff (n=slightly@204.14.154.244)
- # [21:56] <jgraham> Yeah, my script is a bit buggy
- # [21:57] <Philip`> I hope mine wasn't
- # [21:57] * Philip` wishes people would independently verify his data :-)
- # [21:57] * jgraham would like to do that
- # [21:58] <jgraham> I will get some or more of that dotnetdotcom data at some point
- # [21:58] <Philip`> Also I hope their data isn't buggy
- # [21:59] <jgraham> That is, of course, quite possible
- # [21:59] <Philip`> I haven't seen any problems though
- # [22:00] <Philip`> except a few pages which seemingly return bogus HTTP responses that the HttpClient parser dies on
- # [22:00] <Philip`> but those could be legitimately broken servers
- # [22:00] * Quits: mlpug (n=mlpug@a91-156-60-13.elisa-laajakaista.fi) (Remote closed the connection)
- # [22:01] <Philip`> I guess the main question is what direction their sample is most biased in
- # [22:02] <Philip`> (The sample of the web that they crawl, not the sample of their crawled data that they published (which they claim is uniform, and seemingly is restricted to 200s and text/html))
- # [22:02] <jgraham> Philip`: How is your sample biased? html5lib is telling me that all the sites are in no-quirks mode which seems unreasonable
- # [22:03] <jgraham> (the small selection I looked at were compatible with that hypothesis but that was like 2 sites)
- # [22:05] <Philip`> jgraham: I didn't do any sampling myself
- # [22:05] <gsnedders> Is html5lib reliable?
- # [22:05] * Joins: slightlyoff_ (n=slightly@72.14.224.1)
- # [22:05] <Philip`> Looking at a few random pages from my list, www.articlear.com/profile/Jason-Uvios/994 looks quirky
- # [22:06] <jgraham> gsnedders: No
- # [22:07] <jgraham> This is a good way of finding bugs in it though :)
- # [22:08] * Joins: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net)
- # [22:09] * Quits: slightlyoff_afk (n=slightly@216.239.44.65) (Read error: 110 (Connection timed out))
- # [22:10] * Joins: weinig (n=weinig@nat/apple/x-9719d6f90d2f4fe7)
- # [22:11] * Quits: ap (n=ap@194.154.88.45)
- # [22:11] * Joins: ZombieLoffe (n=e@unaffiliated/zombieloffe)
- # [22:14] * Joins: danbri_ (n=danbri@s5590d015.adsl.wanadoo.nl)
- # [22:15] * Quits: zalan (n=kvirc@catv-80-99-193-98.catv.broadband.hu) ("KVIrc 3.4.0 Virgo http://www.kvirc.net/")
- # [22:20] * Quits: gsnedders (n=gsnedder@host86-136-52-180.range86-136.btcentralplus.com)
- # [22:21] * Quits: slightlyoff (n=slightly@204.14.154.244) (Read error: 110 (Connection timed out))
- # [22:22] * Joins: gsnedders (n=gsnedder@host86-136-52-180.range86-136.btcentralplus.com)
- # [22:22] * Quits: weinig (n=weinig@nat/apple/x-9719d6f90d2f4fe7)
- # [22:28] * Quits: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net)
- # [22:29] * Joins: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se)
- # [22:29] * Quits: ROBOd (n=robod@89.122.216.38) ("http://www.robodesign.ro")
- # [22:30] * Quits: danbri (n=danbri@unaffiliated/danbri) (Read error: 110 (Connection timed out))
- # [22:46] * Joins: weinig (n=weinig@nat/apple/x-3f281801d43e82df)
- # [22:47] * Quits: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se) (Read error: 110 (Connection timed out))
- # [22:47] * Joins: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net)
- # [22:52] * Joins: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [22:58] * Joins: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se)
- # [23:05] * Quits: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se) (Remote closed the connection)
- # [23:06] * Joins: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se)
- # [23:07] * slightlyoff_ is now known as slightlyoff_afk
- # [23:12] * Quits: zdobersek (n=zan@cpe-92-37-74-217.dynamic.amis.net) (Read error: 104 (Connection reset by peer))
- # [23:21] * Quits: zcorpan (n=zcorpan@c83-252-196-43.bredband.comhem.se) (Read error: 60 (Operation timed out))
- # [23:25] * Quits: MikeSmith (n=MikeSmit@EM114-48-175-178.pool.e-mobile.ne.jp) (Read error: 110 (Connection timed out))
- # [23:57] * Quits: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [23:57] * Quits: onar_ (n=onar@c-98-234-65-251.hsd1.ca.comcast.net)
- # Session Close: Sun Apr 26 00:00:00 2009
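
For readers who want to try the 13:03 pattern from Philip`, here is a minimal Python sketch. The regular expression is copied from the log; the names `UNCLOSED_COMMENT` and `has_unclosed_comment` are made up for this illustration, and nothing here is claimed to be the script that actually produced the published data. The three sample strings are the ones jgraham listed at 13:07.

```python
import re

# Pattern copied from the 13:03 message, only wrapped in Python quoting.
# "(?<!--)" is a negative lookbehind on "--"; it is unrelated to "<!--".
UNCLOSED_COMMENT = re.compile(r"<!--[^>]*(?<!--)>([^>]|(?<!--)>)*$")


def has_unclosed_comment(markup: str) -> bool:
    """True if the pattern finds a "<!--" with a later ">" and no "-->"
    anywhere between it and the end of the input."""
    return UNCLOSED_COMMENT.search(markup) is not None


if __name__ == "__main__":
    for sample in ("<!-- foo > bar", "<!-- foo>", "<!-- foo > bar -->"):
        print(repr(sample), has_unclosed_comment(sample))
```

The two alternatives inside the repeated group cannot match the same character, so the scan should stay roughly linear per starting position, but this has not been measured on a large crawl.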
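
The NMTOKENS point MikeSmith makes at 13:58 can be checked with a small Python sketch of the two productions he quotes. It assumes an ASCII-only approximation of `NameChar` (the real XML production also admits many non-ASCII characters) and does not model attribute-value normalization; `NAME_CHAR`, `NMTOKENS` and `is_nmtokens` are hypothetical names for this example only.

```python
import re

# Assumption: ASCII-only stand-in for XML's NameChar production.
NAME_CHAR = r"[A-Za-z0-9._:\-]"

# Nmtoken  ::= (NameChar)+
NMTOKEN = NAME_CHAR + "+"

# Nmtokens ::= Nmtoken (#x20 Nmtoken)*
NMTOKENS = re.compile(NMTOKEN + "(?: " + NMTOKEN + ")*")


def is_nmtokens(value: str) -> bool:
    """True if the whole value matches the Nmtokens production as written."""
    return NMTOKENS.fullmatch(value) is not None


if __name__ == "__main__":
    for value in ("nav", "nav main", ""):
        print(repr(value), is_nmtokens(value))
```

Because `Nmtoken` needs at least one character, the empty string cannot match, which is why `class=""` fails against a DTD that declares `class NMTOKENS` but passes against one that declares `class CDATA`.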