Options:
- # Session Start: Sat Mar 28 00:00:00 2009
- # Session Ident: #whatwg
- # [00:01] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [00:03] * Parts: ojan (n=ojan@72.14.229.81)
- # [00:15] <sicking_> Hixie, ping
- # [00:16] * sicking_ is now known as sicking
- # [00:16] <Hixie> sicking: here
- # [00:16] <sicking> Hixie, re CDATA
- # [00:16] <Hixie> yes?
- # [00:16] <sicking> Hixie, was there anything that was unclear in my latest email on the subject to the HTML list?
- # [00:17] <Hixie> uri?
- # [00:17] <Hixie> http://lists.w3.org/Archives/Public/public-html/2009Mar/0511.html ?
- # [00:18] * Quits: dglazkov (n=dglazkov@nat/google/x-07cc00c20f701a00)
- # [00:18] <Hixie> insofar as i don't understand what the proposal exactly is, yes
- # [00:19] <annevk3> sicking, could you please post your redirect proposal? Talked with ap from WebKit and he thinks the current spec module 303 for preflight request makes sense.
- # [00:19] <Hixie> s/module/modulo/?
- # [00:19] <annevk3> yes
- # [00:20] <annevk3> Hixie, I think the idea is to parse the data as CDATA, then if it starts with <![CDATA[ and ends with ]]> you strip that (ignoring whitespace) before handing it of to the style or script engine
- # [00:21] * Quits: jwalden (n=waldo@c-76-102-160-171.hsd1.ca.comcast.net) ("ChatZilla 0.9.82.1-rdmsoft [XULRunner 1.8.0.9/2006120508]")
- # [00:21] <annevk3> Hixie, for both HTML and SVG <script> and <style> elements
- # [00:21] * Quits: mpilgrim (n=mark@67.217.134.48) (Read error: 113 (No route to host))
- # [00:21] <Hixie> in xml or in html? how do you deal with <!-- magic? what about <![CDATA[ ]]> elsewhere? are the svgwg really ok with us not achieving their sole goal?
- # [00:22] <annevk3> HTML; you do not deal with <!-- magic. I.e. when that is there you would not do the stripping; they say they think this solution is fine
- # [00:24] <annevk3> oh, and <![CDATA[ ]]> elsewhere would also not be stripped or handled (so if you have a double block you'd run into issues)
- # [00:25] <Hixie> so uh
- # [00:25] * slightlyoff_afk is now known as slightlyoff
- # [00:25] <Hixie> why did they list one goal is they don't actually care about that goal
- # [00:25] <Hixie> i've been evaluating things based on the requirements they listed
- # [00:26] <annevk3> apparently it's not a black/white thing
- # [00:26] <annevk3> and this would still work for most copy/paste scenarios; just like the other solutions work for most copy/paste scenarios, but not all
- # [00:26] <annevk3> Simon had a different solution by the way: http://lists.w3.org/Archives/Public/public-html/2009Mar/0624.html
- # [00:29] <sicking> Hixie, http://lists.w3.org/Archives/Public/public-html/2009Mar/0634.html
- # [00:30] <Hixie> oh i haven't replied to that. but no, you are incorrect; making d.w() async doesn't make it not work
- # [00:31] <Hixie> it just means the parser isn't invoked reentrantly while the script is running
- # [00:32] <Hixie> it's the same thing as what happens in normal <script> in html when you do document.write("<script src=a></script>..."): the ... isn't parsed until after the original outer script is finished and the "a" script is downloaded and run.
- # [00:32] <sicking> Hixie, and from i haven't heard anyone from the svg WG has objected to having entities inside <script> not work. The theory being that the far vast majority uses <![CDATA[]]>
- # [00:33] <sicking> Hixie, i can find pointers to agreement that that is ok if that helps
- # [00:33] <sicking> Hixie, ok, wasn't sure which interpreation of how d.w() was gonna work was the correct one. Spec isn't very clear.
- # [00:34] <Hixie> these are two separate issues
- # [00:34] <Hixie> so let's start with the cdata one
- # [00:34] <sicking> indeed
- # [00:34] <sicking> fwiw i do treat the issues separately in the mail
- # [00:35] <Hixie> the proposal is to remove 8.2.4.36 CDATA section state, along with all the branch that leads to that state, and then to make <script> amd <style> in SVG use the "generic CDATA element parsing algorithm"
- # [00:36] <Hixie> and then to strip leading and trailing <![CDATA[ ]]> text somehow
- # [00:36] <Hixie> right?
- # [00:36] <Hixie> or not?
- # [00:37] * Joins: jwalden (n=waldo@c-76-102-160-171.hsd1.ca.comcast.net)
- # [00:40] * slightlyoff is now known as slightlyoff_afk
- # [00:46] <sicking> Hixie, would that remove the ability to have <![CDATA[]]> outside of <script> and <style>?
- # [00:47] <Hixie> yes
- # [00:47] <sicking> that does not seem acceptable
- # [00:47] <sicking> why would you do that?
- # [00:47] <Hixie> that's what anne said you had proposed
- # [00:47] <sicking> that was wrong :)
- # [00:47] <Hixie> i recommend sending an e-mail with a clear statement of the proposal
- # [00:47] <sicking> Hixie, what is not clear in http://lists.w3.org/Archives/Public/public-html/2009Mar/0634.html ?
- # [00:48] <Hixie> oh i didn't realise that also talked about cdata parsing
- # [00:48] <sicking> Hixie, see the part starting with "Instead, I propose the following behavior"
- # [00:48] <Hixie> i haven't gotten to that e-mail yet
- # [00:48] * Quits: kangax (n=kangax@157.130.31.226)
- # [00:48] <sicking> Hixie, it talks about both issues. Separately
- # [00:49] <sicking> Hixie, except I just spotted one error
- # [00:49] <Hixie> so wait, they want to make <script> and <style> work like html but everything else not work like html?
- # [00:49] <Hixie> i'm really confused now
- # [00:49] <Hixie> what's the goal here?
- # [00:49] <sicking> i can only speak for my goals:
- # [00:50] <sicking> 1. Make everything that exists both in HTML and SVG work the same in HTML and SVG
- # [00:50] <sicking> 2. Make the rest of SVG work as similar as it makes sense to HTML
- # [00:50] <sicking> 3. Eat cookies
- # [00:51] <Hixie> ok well that's basically completely incompatible with the goal the svgwg have conveyed
- # [00:51] <Hixie> it would mean doing things like making tags that are known to never have children be void elements, and not support the /> syntax, and not support cdata at all, and so on
- # [00:51] <sicking> oh, also: 2.5 Make it possible for as much as possible of XML-SVG content to be copyable into HTML
- # [00:52] <Hixie> your goals are incompatible with themselves
- # [00:53] <sicking> there's some "as much *as possible*" and "as similar as it *makes sense*" in there that makes it compatible
- # [00:53] <sicking> there's no absolutes
- # [00:53] <sicking> it's all about tradeoffs
- # [00:54] <sicking> i'd imagine there's no one goal that is absolute. Ever. It's all relative
- # [00:54] <Hixie> as similar as possible and as copyable as possible are not compatible goals
- # [00:54] <Hixie> i don't know how to evaluate when to do one and when to do the other
- # [00:54] <Hixie> what is the criteria by which we know how much to not address each goal?
- # [00:54] <sicking> that's why we have conversations
- # [00:54] <Hixie> how do conversations help us evaluate how far to go?
- # [00:55] <sicking> when it comes to tradeoffs between making it similar to HTML and making it possible to copy, i'd judge it by how much existing content would we have to make not copyable, compared to how much extra compatibility would we get with HTML
- # [00:55] <sicking> that's why i say it's all tradeoffs
- # [00:56] <sicking> isn't that how it is with everything else in HTML5?
- # [00:56] <Hixie> aha, basing it on data
- # [00:56] <Hixie> ok
- # [00:56] <Hixie> do we have that data?
- # [00:56] <Hixie> i don't have an svg corpus
- # [00:56] <sicking> very little. Just have the judgements of the SVG people
- # [00:56] <Hixie> if the judgements of the svg people are anything like the judgements of the html people, they're wildly off :-)
- # [00:56] <sicking> so basically the SVG they have seen and authored
- # [00:56] <Hixie> ok there has to be better data than that, surely
- # [00:57] <Hixie> i wonder where we can get a good svg corpus
- # [00:57] <sicking> yeah, that would be great indeed
- # [00:57] <Hixie> Philip` had some svgs from wikipedia, i wonder if that is representative
- # [00:57] * gsnedders waves a crazed lunatic
- # [00:57] <Philip`> They're not representative
- # [00:58] <Philip`> particularly because they (almost?) never have scripting
- # [00:58] <Hixie> bummer
- # [00:58] <Philip`> (seeing as they're designed to be flattened into PNGs by Wikipedia's thumbnailer)
- # [00:58] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [00:59] <sicking> Philip`, is there a reason to believe that other SVG has more scripting?
- # [00:59] <Hixie> are there sites like for flash games where people upload their animated or scripted svgs or something?
- # [00:59] <sicking> Philip`, though i guess the wikipedia ones have absolutely no scripts
- # [00:59] <Philip`> To be more precise, I should say the sample of SVGs from Wikipedia is representative of all the SVGs on Wikipedia
- # [01:00] <sicking> Philip`, but in general i would expect SVG in general to have far less scripts
- # [01:00] * gsnedders hopes Hixie saw his latest tweet (from six hours ago), seeming he made it just for him :P
- # [01:00] <Philip`> but probably not representative of all other sane collections of SVGs that one might imagine
- # [01:00] <Philip`> (I tried looking for SVGs on the web via 130K pages from dmoz.org, but found a grand total of about 1, so that's not a good way of getting a different sample)
- # [01:00] <Hixie> gsnedders: i don't watch twitter closely
- # [01:01] <gsnedders> Hixie: "At a station with semaphores."
- # [01:01] <Hixie> Philip`: yeah my own research for svg using the google corpus didn't find a useful set of files
- # [01:01] <Hixie> gsnedders: heh, britain. how quaint. :-P
- # [01:02] <gsnedders> (The station I got off had electric signals!11!!!!eleventy!!!)
- # [01:03] <Philip`> sicking: My sample of 300 from Wikipedia has zero with scripting; http://codedread.com/ has one SVG with scripting; therefore SVGs outside Wikipedia definitely have more scripting than those on Wikipedia :-)
- # [01:03] * Quits: Kuruma (n=Kuruman@www1664.sakura.ne.jp) (Remote closed the connection)
- # [01:03] <sicking> Philip`, indeed
- # [01:03] <sicking> Philip`, though technically speaking not statistically significant difference. I think
- # [01:03] <sicking> :)
- # [01:04] <Philip`> sicking: The difference between 0 and 1 is infinity percent, so it must be significant!
- # [01:07] * Quits: Maurice (n=copyman@5ED548D4.cable.ziggo.nl) ("Disconnected...")
- # [01:08] * Quits: svl (n=me@ip565744a7.direct-adsl.nl) ("And back he spurred like a madman, shrieking a curse to the sky.")
- # [01:10] <sicking> Hixie, btw, I also suggest that we support <!CDATA[ ]]> everwhere where we parse PCDATA and RCDATA. Due to goal 2 above. Sounded like Operas experiment in this area is that it doesn't break the web. But it's an orthogonal discussion to the one we're having now so I wasn't going to raise it until after
- # [01:12] * Joins: bgalbraith (n=bgalbrai@corp-241.mountainview.mozilla.com)
- # [01:14] <annevk3> sicking, actually, we encountered issues with supporting <![CDATA[
- # [01:14] <sicking> annevk3, big enough that you removed the support?
- # [01:14] <annevk3> sicking, I don't think we have yet, but I think we should
- # [01:15] * Joins: Kuruma (n=Kuruman@www1664.sakura.ne.jp)
- # [01:15] <sicking> annevk3, why not, and why?
- # [01:15] * Joins: ginger (n=nessy@124-171-38-148.dyn.iinet.net.au)
- # [01:15] <annevk3> because it causes issues and supporting CDATA has no real benefit
- # [01:15] * Philip` hopes that if Opera doesn't remove CDATA support, they at least make it less insane
- # [01:16] <sicking> annevk3, i doubt you'll ever find a feature that you can deploy without any issues. The most recent example we had was a site breaking because we implemented document.readyState
- # [01:16] <annevk3> sicking, but CDATA is not a feature, it's near useless
- # [01:16] <sicking> annevk3, since we'll have to support <![CDATA[]]> inside SVG, I think the consistency would be nice
- # [01:17] <annevk3> maybe, I'd rather remove it there too
- # [01:17] <annevk3> (also Opera's current CDATA support doesn't match the HTML5 spec)
- # [01:18] <sicking> annevk3, i'd be supportive of that if we think there isn't much existing SVG content that uses it. In other words, if we don't think it'd affect the ability to copy SVG into HTML
- # [01:20] <annevk3> there's very little SVG so that's pretty hard to tell
- # [01:21] <annevk3> I don't recall ever needing it in any SVG content I've created, but then I mostly do simple things
- # [01:22] <Philip`> http://philip.html5.org/misc/spec-links-anim.svgz
- # [01:22] <Philip`> That's got a script, but doesn't use CDATA
- # [01:23] * Quits: nessy (n=nessy@124-168-165-203.dyn.iinet.net.au) (Read error: 101 (Network is unreachable))
- # [01:24] <Philip`> Hmm, why does View Source in Firefox 3 go unbelievably slowly on that SVG file?
- # [01:24] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [01:24] <Philip`> If the script had to use < or & then I probably would have put it in <![CDATA[]]>
- # [01:24] <Philip`> but it didn't so I didn't
- # [01:25] <sicking> have a good weekend people
- # [01:25] <Philip`> Yikes, FF3 uses about 400MB of RAM to view-source on that page :-/
- # [01:26] * olliej opens in S4
- # [01:26] * olliej wonders how badly it will fair
- # [01:26] <sicking> Philip`, view-source will always use more memory than the actual page
- # [01:27] * Quits: ginger (n=nessy@124-171-38-148.dyn.iinet.net.au) (Read error: 60 (Operation timed out))
- # [01:27] <olliej> memory seems to have peaked at ~600mb :-O
- # [01:27] * olliej wonders wtf is happening
- # [01:27] <sicking> Philip`, it doesn't use any entities either though, so should work find
- # [01:27] <sicking> fine even
- # [01:28] * Quits: sicking (n=chatzill@corp-241.mountainview.mozilla.com) ("ChatZilla 0.9.84 [Firefox 3.5b4pre/20090317114959]")
- # [01:28] <annevk3> what's the advantage of toTempURL over toDataURL? working around IE bugs?
- # [01:29] <Dashiva> That's how I understood it
- # [01:29] <annevk3> feature design 101: don't propose a new feature to work around UA bugs in an existing feature
- # [01:29] <Dashiva> Not sure why supporting toTempURL would be easier than fixing data urls
- # [01:30] <Philip`> I think the idea is that you should be able to say img.src = canvas.toSomeKindOfThingThatWorksInImgSrc() and have it work in browsers that don't support data URIs
- # [01:30] <annevk3> Philip`, right, see above
- # [01:31] <annevk3> nn
- # [01:31] <Philip`> Dashiva: Because you can't fix data URIs in IE6
- # [01:31] <Philip`> (I assume)
- # [01:32] <Dashiva> But can you fix toTempURL in IE6?
- # [01:32] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [01:32] <Philip`> If you're writing a plugin or add-on or whatever it is, then you presumably can
- # [01:33] <Philip`> since you can save the canvas data to disk and then use file:/// to refer to it
- # [01:33] <Philip`> or you can register a protocol handler for customHandler://
- # [01:34] <Dashiva> Register a protocol handler for data:, feed the contents?
- # [01:34] <Philip`> I've got no idea whether you can do that
- # [01:34] <Philip`> (If you could, it'd be bad in terms of IE-compatibility with other sites that uses data: URIs)
- # [01:47] * Quits: bgalbraith (n=bgalbrai@corp-241.mountainview.mozilla.com)
- # [01:50] <hdh> (defun last-heading () (search-backward-regexp "C-x C-f") (match-string 0)) (setq mode-line-format '(:eval (last-heading)))
- # [01:50] <hdh> idk how to plug the eval part into the existing modeline
- # [01:50] <hdh> the text seems to keep its formatting in the matched buffer
- # [01:51] * Quits: erlehmann (n=erlehman@86.59.25.121) (Remote closed the connection)
- # [01:54] * Quits: davidb (n=davidb@bas4-toronto06-1279310263.dsl.bell.ca)
- # [02:00] * Quits: jwalden (n=waldo@c-76-102-160-171.hsd1.ca.comcast.net) ("ChatZilla 0.9.82.1-rdmsoft [XULRunner 1.8.0.9/2006120508]")
- # [02:01] * Quits: dave_levin (n=dave_lev@72.14.227.1)
- # [02:04] * Joins: nessy (n=nessy@124-171-28-204.dyn.iinet.net.au)
- # [02:06] * Quits: dimich (n=dimich@72.14.227.1)
- # [02:10] <Hixie> hdh: interesting
- # [02:16] * Quits: tndH (n=Rob@adsl-77-86-107-105.karoo.KCOM.COM) ("ChatZilla 0.9.84-rdmsoft [XULRunner 1.9.0.1/2008072406]")
- # [02:18] * Quits: atw (n=atwilson@74.125.59.1) (Read error: 110 (Connection timed out))
- # [02:28] * Joins: craigbass1976 (n=craig@64.223.240.104)
- # [02:37] * Joins: azazul (n=azazul@zelli-kojas.lanet.lv)
- # [02:37] * Parts: craigbass1976 (n=craig@64.223.240.104) ("Leaving")
- # [02:38] * slightlyoff_afk is now known as slightlyoff
- # [03:14] * slightlyoff is now known as slightlyoff_afk
- # [03:15] * Joins: erlehmann (n=erlehman@86.59.25.121)
- # [03:26] * slightlyoff_afk is now known as slightlyoff
- # [03:31] * Quits: weinig (n=weinig@17.246.18.97)
- # [03:45] * Joins: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [03:49] * Quits: Andrii (n=Miranda@91.200.180.19) ("Miranda IM! Smaller, Faster, Easier. http://miranda-im.org")
- # [03:53] * Joins: doublec (n=doublec@118.93.172.205)
- # [04:01] * Joins: slightlyoff_ (n=slightly@67.218.107.254)
- # [04:12] * Quits: slightlyoff (n=slightly@nat/google/x-d83a107a121c5731) (Read error: 110 (Connection timed out))
- # [04:19] * Joins: kangax (n=kangax@ool-182f8118.dyn.optonline.net)
- # [04:28] * Joins: roc (n=roc@121-72-187-98.dsl.telstraclear.net)
- # [04:35] * Joins: sid0_ (n=sid0@unaffiliated/sid0)
- # [04:38] * Joins: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net)
- # [04:40] * Joins: roc_ (n=roc@121-72-187-98.dsl.telstraclear.net)
- # [04:46] * Quits: sid0 (n=sid0@unaffiliated/sid0) (Remote closed the connection)
- # [04:47] * Quits: slightlyoff_ (n=slightly@67.218.107.254)
- # [04:48] * Quits: roc (n=roc@121-72-187-98.dsl.telstraclear.net) (Read error: 110 (Connection timed out))
- # [04:56] * Quits: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [05:01] * Parts: erlehmann (n=erlehman@86.59.25.121)
- # [05:22] * Quits: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net)
- # [05:49] * Joins: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net)
- # [05:51] * Joins: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [05:53] * Quits: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net) (Client Quit)
- # [05:57] * Joins: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net)
- # [06:19] * Quits: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [06:48] * Joins: dimich (n=dimich@72.14.224.1)
- # [06:58] * Joins: zalan (n=kvirc@catv-80-99-193-98.catv.broadband.hu)
- # [07:05] * Quits: roc_ (n=roc@121-72-187-98.dsl.telstraclear.net)
- # [07:19] * Quits: annodomini (n=lambda@wikipedia/lambda)
- # [07:45] * Joins: mlpug (n=mlpug@a91-156-60-13.elisa-laajakaista.fi)
- # [07:47] * Quits: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net)
- # [07:51] * Joins: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net)
- # [08:11] * Quits: doublec (n=doublec@118.93.172.205) ("Leaving")
- # [08:17] * Joins: olliej_ (n=oliver@c-67-164-125-23.hsd1.ca.comcast.net)
- # [08:35] * Quits: zalan (n=kvirc@catv-80-99-193-98.catv.broadband.hu) (Read error: 110 (Connection timed out))
- # [08:35] * Joins: taf2_ (n=taf2@static-71-127-149-10.bltmmd.fios.verizon.net)
- # [08:36] * Quits: taf2 (n=taf2@static-71-127-149-10.bltmmd.fios.verizon.net) (Read error: 104 (Connection reset by peer))
- # [08:37] * Joins: zdobersek (n=zan@cpe-92-37-76-170.dynamic.amis.net)
- # [08:42] * Quits: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net)
- # [08:51] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [08:55] * Quits: sid0_ (n=sid0@unaffiliated/sid0) (Remote closed the connection)
- # [08:55] * Joins: sid0 (n=sid0@unaffiliated/sid0)
- # [08:59] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [09:09] * Quits: hdh (n=hdh@58.187.20.174) ("Leaving.")
- # [09:47] * Joins: Maurice (n=copyman@5ED548D4.cable.ziggo.nl)
- # [09:51] * Quits: gavin_ (n=gavin@firefox/developer/gavin) (Remote closed the connection)
- # [09:52] * Joins: gavin_ (n=gavin@firefox/developer/gavin)
- # [09:55] * Quits: dimich (n=dimich@72.14.224.1)
- # [09:58] * Joins: jwalden (n=waldo@c-24-6-169-169.hsd1.ca.comcast.net)
- # [10:01] * Joins: zalan (n=kvirc@catv-80-99-193-98.catv.broadband.hu)
- # [10:04] * Quits: nessy (n=nessy@124-171-28-204.dyn.iinet.net.au) ("This computer has gone to sleep")
- # [10:08] * Joins: dimich (n=dimich@c-98-203-230-54.hsd1.wa.comcast.net)
- # [10:34] * Joins: danbri (n=danbri@s55927ef8.adsl.wanadoo.nl)
- # [10:34] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [10:48] * Joins: mat_t (n=mattomas@ppp-1-128.edin-a-1.access.uk.tiscali.com)
- # [10:59] * Quits: olliej_ (n=oliver@c-67-164-125-23.hsd1.ca.comcast.net) (Read error: 60 (Operation timed out))
- # [10:59] * Quits: mat_t (n=mattomas@ppp-1-128.edin-a-1.access.uk.tiscali.com) ("This computer has gone to sleep")
- # [10:59] * Quits: dimich (n=dimich@c-98-203-230-54.hsd1.wa.comcast.net)
- # [11:00] * Joins: nessy (n=nessy@124-171-28-204.dyn.iinet.net.au)
- # [11:10] * Joins: ROBOd (n=robod@89.122.216.38)
- # [11:35] * Joins: mstange (n=markus@pD9579181.dip0.t-ipconnect.de)
- # [11:53] * Joins: mat_t (n=mattomas@ppp-1-128.edin-a-1.access.uk.tiscali.com)
- # [11:54] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [11:55] * Quits: Amorphous (i=jan@unaffiliated/amorphous) ("shutdown")
- # [11:56] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [12:00] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com) (Client Quit)
- # [12:04] * Quits: mlpug (n=mlpug@a91-156-60-13.elisa-laajakaista.fi) (Read error: 110 (Connection timed out))
- # [12:11] * Joins: Amorphous (i=jan@unaffiliated/amorphous)
- # [12:18] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [12:19] * Joins: sid0_ (n=sid0@unaffiliated/sid0)
- # [12:23] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com) (Client Quit)
- # [12:24] * Joins: svl (n=me@ip565744a7.direct-adsl.nl)
- # [12:25] * Joins: myakura (n=myakura@p1063-ipbf3305marunouchi.tokyo.ocn.ne.jp)
- # [12:27] * Joins: benh_ (n=benh@91.85.204.41)
- # [12:29] <benh_> Historical question: does anyone know why HTML4 deprecated u, s, strike, but not b/i/big/small, even though it discouraged the later? What was special about u/s/strike?
- # [12:30] * Quits: mat_t (n=mattomas@ppp-1-128.edin-a-1.access.uk.tiscali.com) (Remote closed the connection)
- # [12:31] * benh_ is now known as webben
- # [12:31] * Quits: sid0 (n=sid0@unaffiliated/sid0) (Nick collision from services.)
- # [12:31] * sid0_ is now known as sid0
- # [12:32] * Joins: zdobersek1 (n=zan@cpe-92-37-65-36.dynamic.amis.net)
- # [12:34] * Quits: sid0 (n=sid0@unaffiliated/sid0) ("Leaving")
- # [12:36] * Joins: zdobersek2 (n=zan@cpe-92-37-67-227.dynamic.amis.net)
- # [12:39] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [12:39] * Joins: tndH (n=Rob@adsl-77-86-107-105.karoo.KCOM.COM)
- # [12:43] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com) (Client Quit)
- # [12:46] * Quits: zdobersek (n=zan@cpe-92-37-76-170.dynamic.amis.net) (Read error: 110 (Connection timed out))
- # [12:51] * Quits: zdobersek1 (n=zan@cpe-92-37-65-36.dynamic.amis.net) (Read error: 110 (Connection timed out))
- # [13:02] * Joins: webben_ (n=benh@dip5-fw.corp.ukl.yahoo.com)
- # [13:07] * Quits: webben (n=benh@91.85.204.41) (Read error: 60 (Operation timed out))
- # [13:13] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [13:32] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [13:32] * Joins: mlpug (n=mlpug@a91-156-60-13.elisa-laajakaista.fi)
- # [13:32] * Joins: pauld (n=pauld@86.133.17.49)
- # [13:33] * Quits: pauld (n=pauld@86.133.17.49) (Client Quit)
- # [13:53] * Joins: Andrii (n=Miranda@91.200.180.78)
- # [13:59] * Quits: Andrii (n=Miranda@91.200.180.78) ("Miranda IM! Smaller, Faster, Easier. http://miranda-im.org")
- # [14:03] <annevk3> Hixie, s/buu can/but can/
- # [14:03] <annevk3> Hixie, s/is can be/can be/
- # [14:05] <annevk3> Hixie, s/sections that cause/sections cause/ (I think, the sentence does not seem correct otherwise)
- # [14:07] <annevk3> Hixie, "For each <span>cache host</span> associated with an <span>application cache</span>" Isn't a cache host always associated with one?
- # [14:07] * Joins: Andrii (n=Miranda@91.200.180.78)
- # [14:07] * Joins: sid0 (n=sid0@unaffiliated/sid0)
- # [14:17] * Quits: jcranmer (n=jcranmer@remote.csl.tjhsst.edu) (Read error: 104 (Connection reset by peer))
- # [14:24] * Joins: mpilgrim (n=mark@67.217.134.48)
- # [14:32] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [14:37] * Joins: annodomini (n=lambda@wikipedia/lambda)
- # [14:42] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com) (Read error: 104 (Connection reset by peer))
- # [14:42] * Joins: pauld_ (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [15:05] * Quits: annodomini (n=lambda@wikipedia/lambda)
- # [15:13] * Quits: nessy (n=nessy@124-171-28-204.dyn.iinet.net.au) ("This computer has gone to sleep")
- # [15:21] * Quits: pauld_ (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [15:25] * Joins: jcranmer (n=jcranmer@remote.csl.tjhsst.edu)
- # [15:59] * Quits: tndH (n=Rob@adsl-77-86-107-105.karoo.KCOM.COM) ("ChatZilla 0.9.84-rdmsoft [XULRunner 1.9.0.1/2008072406]")
- # [16:10] <annevk3> Gmail on a message I sent: "4:06 PM (-1 minutes ago)"
- # [16:14] * Joins: Morphous (i=jan@unaffiliated/amorphous)
- # [16:15] * Joins: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [16:21] * Quits: Amorphous (i=jan@unaffiliated/amorphous) (Nick collision from services.)
- # [16:21] * Morphous is now known as Amorphous
- # [16:23] * Quits: mpilgrim (n=mark@67.217.134.48) (Read error: 113 (No route to host))
- # [16:42] * Joins: sid0_ (n=sid0@unaffiliated/sid0)
- # [16:57] * Quits: sid0 (n=sid0@unaffiliated/sid0) (Read error: 110 (Connection timed out))
- # [16:57] * Quits: mstange (n=markus@pD9579181.dip0.t-ipconnect.de) ("ChatZilla 0.9.84-2009030221 [Firefox 3.6a1pre/20090327031146]")
- # [16:58] <Andrii> annevk3: google invented the time machine, cutting edge technology
- # [17:05] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [17:12] * Joins: mpilgrim (n=mark@67.217.134.48)
- # [17:13] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [17:18] * Joins: pauld (n=pauld@86.133.17.49)
- # [17:21] * Quits: pauld (n=pauld@86.133.17.49) (Client Quit)
- # [17:29] * Quits: webben_ (n=benh@dip5-fw.corp.ukl.yahoo.com) (Read error: 60 (Operation timed out))
- # [17:40] * Quits: mpilgrim (n=mark@67.217.134.48) (Read error: 60 (Operation timed out))
- # [17:45] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [17:47] * Joins: dglazkov_ (n=dglazkov@72.14.224.1)
- # [17:49] * Joins: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net)
- # [17:49] * Quits: weinig (n=weinig@c-67-180-35-124.hsd1.ca.comcast.net) (Remote closed the connection)
- # [17:58] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [17:59] <Hixie> annevk3: please send mail for feedback, pleeeeease. :-)
- # [18:01] * Joins: maikmerten (n=maikmert@Lb37b.l.pppool.de)
- # [18:03] * Quits: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net) (Read error: 110 (Connection timed out))
- # [18:03] * Joins: shepazutoo (n=schepers@adsl-221-7-163.rmo.bellsouth.net)
- # [18:04] * Quits: shepazu (n=schepers@adsl-221-119-243.rmo.bellsouth.net) (Read error: 60 (Operation timed out))
- # [18:08] * Joins: dimich (n=dimich@72.14.224.1)
- # [18:14] * Joins: webben (n=benh@dip5-fw.corp.ukl.yahoo.com)
- # [18:19] * Quits: shepazutoo (n=schepers@adsl-221-7-163.rmo.bellsouth.net) ("Core Breach")
- # [18:20] * Joins: shepazu (n=schepers@adsl-221-7-163.rmo.bellsouth.net)
- # [18:27] * Joins: shepazutoo (n=schepers@adsl-221-38-51.rmo.bellsouth.net)
- # [18:30] * Joins: listoflights (n=kilgore@78.26.4.53)
- # [18:33] <annevk3> Hixie, next time or also for those four lines?
- # [18:33] <annevk3> IRC is convenient, there's less UI involved
- # [18:34] <Dashiva> I'm betting he wants the paper trail
- # [18:36] <Philip`> He prints out emails?
- # [18:37] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [18:37] <Philip`> Someone needs to make an IRC bot which lets you say "!feedback s/buu can/but can/" and it will automatically send an email to Hixie
- # [18:41] * Quits: kangax (n=kangax@ool-182f8118.dyn.optonline.net)
- # [18:42] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [18:42] * Quits: shepazu (n=schepers@adsl-221-7-163.rmo.bellsouth.net) (Read error: 113 (No route to host))
- # [18:43] <krijnh> Doesn't need to be an IRC bot, since all the logs are available as HTML as well :)
- # [18:44] <Philip`> krijnh: Good point :-)
- # [18:45] <Philip`> krijnh: If you could change your log processor to let us embed RDF in our IRC messages that will get translated into RDFa in the logs, then we could write a simple RDFa-based tool to automatically extract all the feedback and email it
- # [18:45] <annevk3> yeah, and with enough annotation everything gets done automatically
- # [18:46] <Dashiva> That reminds me of joel's post about spec writing
- # [18:47] <Dashiva> "And then some people go into a dark place where they imagine automatically generating implementations from specs, and think they have invented a way to program without programming" (by memory)
- # [18:47] * gsnedders starts implementing parse errors in html5lib php
- # [18:48] <takkaria> parse errors are overrated
- # [18:49] * Quits: myakura (n=myakura@p1063-ipbf3305marunouchi.tokyo.ocn.ne.jp) ("Leaving...")
- # [19:00] <annevk3> gsnedders, why do you need parse errors? isn't a treebuilder more useful?
- # [19:00] <gsnedders> annevk3: Trying to finish Tokenizer first before moving on
- # [19:02] * Joins: kangax (n=kangax@ool-182f8118.dyn.optonline.net)
- # [19:07] * gsnedders returns to the point where more than 50% of tests pass
- # [19:16] * Quits: dglazkov_ (n=dglazkov@72.14.224.1) (Read error: 110 (Connection timed out))
- # [19:29] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [19:30] * Quits: Andrii (n=Miranda@91.200.180.78) ("Miranda IM! Smaller, Faster, Easier. http://miranda-im.org")
- # [19:35] * Quits: kangax (n=kangax@ool-182f8118.dyn.optonline.net)
- # [19:42] * Joins: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [19:42] * Joins: dglazkov_ (n=dglazkov@72.14.224.1)
- # [19:44] * Quits: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net) (Read error: 104 (Connection reset by peer))
- # [19:44] * Joins: dglazkov__ (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [19:48] <gsnedders> Weeee… 23 tests failing now
- # [19:49] <Philip`> Delete those tests, then you'll pass 100%
- # [19:51] * Joins: kangax (n=kangax@ool-182f8118.dyn.optonline.net)
- # [19:53] * Quits: dglazkov_ (n=dglazkov@72.14.224.1) (Read error: 145 (Connection timed out))
- # [19:54] <gsnedders> 14% perf. regression from throwing parse errors
- # [19:55] <Philip`> In a document that has no parse errors?
- # [19:55] <gsnedders> Yeah
- # [19:55] <gsnedders> (i.e., the spec)
- # [19:56] <takkaria> ouch
- # [19:57] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [19:57] <takkaria> I'm not sure hubbub is ever going to have parse error reporting
- # [19:58] <gsnedders> 12.0s is still a massive improvement over the 48s it was a week ago
- # [19:58] <takkaria> sure :)
- # [20:00] * gsnedders runs with profiler
- # [20:00] <gsnedders> (This takes it back to around 50s :P)
- # [20:02] <Philip`> You need to multithread your tokeniser
- # [20:03] <gsnedders> Multi-threading in PHP? :P
- # [20:03] <Philip`> Shouldn't be too hard to just split the input document into n pieces, and speculatively parse the last n-1, and discards any results that are invalidated by the tokeniser state at the end of the previous section
- # [20:08] <jgraham> Philip`: presumably you would end up being wrong a lot of the time
- # [20:08] <jgraham> Which seems bad
- # [20:09] <Philip`> jgraham: You could scan forwards to the next '>' and assume you're now going to be in the data state, which is likely to be right quite often
- # [20:10] <takkaria> is it?
- # [20:10] <Philip`> and if you were in a <script> or something then you make sure you've kept enough state so you can sync up once you've reached the </script>
- # [20:10] <jgraham> Hmm. You're making this sound surprisingly reasonable
- # [20:10] <Philip`> Really?
- # [20:10] <Philip`> That wasn't my intent
- # [20:10] <jgraham> Which suggests that you're misleading me somehow
- # [20:11] * Joins: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [20:12] * Quits: kangax (n=kangax@ool-182f8118.dyn.optonline.net)
- # [20:13] <Philip`> takkaria: I suppose it should be fairly easy to instrument a tokeniser to report how often it sees '>' when it's in the data state (and PCDATA, and no escape flag)
- # [20:17] * Quits: pauld (n=pauld@host86-133-17-49.range86-133.btcentralplus.com)
- # [20:17] <Philip`> Does Python have a multiprocessing thing nowadays that isn't unbearably hard to use efficiently?
- # [20:17] <gsnedders> Philip`: Yes, multiprocessing
- # [20:17] <Philip`> Ah, sounds good
- # [20:17] <Philip`> Maybe html5lib should do this! :-)
- # [20:18] <gsnedders> Will all zero users of php-html5lib kill me if I make 23 test cases fail?
- # [20:19] <annevk3> we should make error reporting optional in the tests
- # [20:19] <annevk3> or flag tests that rely on error reporting
- # [20:19] <Philip`> That's easy
- # [20:19] <Philip`> if 'ParseError' in expected_tokens: it relies on error reporting
- # [20:19] <annevk3> it seems to me that the PHP parser is not intended for building a validator so it should just not do it and be fast :)
- # [20:20] <Philip`> and you could just strip out all the ParseErrors when comparing your tokeniser against the test result
- # [20:20] <annevk3> yeah, I guess that's the best way
- # [20:20] <annevk3> you still want to test error handling
- # [20:21] <gsnedders> Yeah, that's what it currently does, for a few more minutes at least
- # [20:21] * Quits: zalan (n=kvirc@catv-80-99-193-98.catv.broadband.hu) ("KVIrc 3.4.0 Virgo http://www.kvirc.net/")
- # [20:22] <gsnedders> annevk3: I'd disagree that it is irrelevant. You might want to only allow valid comments on my blog. Oh, wait, you already do.
- # [20:23] <annevk3> i wouldn't mind syntax errors actually
- # [20:23] <takkaria> gsnedders: testcases failing is bad, mmkay
- # [20:23] <annevk3> i'd just validate the tree
- # [20:23] * Joins: maikmerten_ (n=maikmert@L977a.l.pppool.de)
- # [20:23] <annevk3> validate the tree based on some whitelists
- # [20:23] <Philip`> Someone should make a blog comment CAPTCHA system which presents you with a random word (just in plain text) and requires you to use it in a grammatically-correct sentence in your comment
- # [20:24] <gsnedders> takkaria: All failures are due to the parse errors, and one of them is somewhat questionable (I'd argue that the test case relies on impl. specific behaviour)
- # [20:35] * Joins: erlehmann (n=erlehman@86.59.25.121)
- # [20:39] * Quits: maikmerten (n=maikmert@Lb37b.l.pppool.de) (Read error: 110 (Connection timed out))
- # [20:39] * Joins: mstange (n=markus@pD9579181.dip0.t-ipconnect.de)
- # [20:42] * Quits: mlpug (n=mlpug@a91-156-60-13.elisa-laajakaista.fi) (Remote closed the connection)
- # [20:46] <olliej> Philip`: heheh
- # [20:47] <gsnedders> Philip`: How do you determine whether a sentence is grammatically correct?
- # [20:52] <Philip`> gsnedders: Mechanical Turk
- # [20:54] * Joins: zalan (n=kvirc@catv-80-99-193-98.catv.broadband.hu)
- # [21:01] <Niictar> But... couldm
- # [21:02] <Niictar> But... couldn't a bot be programmed to generate a non-sensicial but grammatically correct sentence after doing a dictionary lookup of the word in question?
- # [21:02] * Joins: hdh (n=hdh@118.71.77.44)
- # [21:03] <Niictar> Or, just quote a sentence example straight out of said dictionary? :P
- # [21:08] <olliej> Niictar: sssh
- # [21:08] <olliej> Niictar: although that captcha might be fairly good for filtering out most reddit/digg/youtube commenters :D
- # [21:09] <Niictar> =D
- # [21:23] * Joins: mpilgrim (n=mark@67.217.134.48)
- # [21:26] * dglazkov__ is now known as dglazkov
- # [21:41] * Joins: annodomini (n=lambda@wikipedia/lambda)
- # [21:41] * Joins: nessy (n=nessy@124-171-28-204.dyn.iinet.net.au)
- # [21:43] <gsnedders> "It quite clearly shows that Humbert Hubmert does \emph{not} love Lolita." — Discuss.
- # [21:43] <gsnedders> :P
- # [21:44] <Philip`> s/Hubmert/Humbert/
- # [21:44] * gsnedders is typing quickly damnit!
- # [21:46] <Dashiva> If you reversed the grammar captcha, it could work well for sites where real users are inable to write properly, whereas a bot would be too successful
- # [21:50] * Quits: annodomini (n=lambda@wikipedia/lambda)
- # [21:51] * Parts: erlehmann (n=erlehman@86.59.25.121)
- # [21:55] * Quits: zalan (n=kvirc@catv-80-99-193-98.catv.broadband.hu) ("KVIrc 3.4.0 Virgo http://www.kvirc.net/")
- # [21:57] * Joins: sid0 (n=sid0@unaffiliated/sid0)
- # [21:57] * Joins: webben_ (n=benh@91.85.204.41)
- # [22:03] * Quits: taf2_ (n=taf2@static-71-127-149-10.bltmmd.fios.verizon.net)
- # [22:05] * gsnedders makes another tweet exactly 140 characters long
- # [22:05] <gsnedders> (And 142 bytes)
- # [22:06] * Quits: webben (n=benh@dip5-fw.corp.ukl.yahoo.com) (Read error: 110 (Connection timed out))
- # [22:09] * Quits: sid0_ (n=sid0@unaffiliated/sid0) (Remote closed the connection)
- # [22:13] * Joins: mpilgrim_ (n=mark@67.217.134.48)
- # [22:18] <jwalden> gsnedders: is the limit 140 characters or 140 code points?
- # [22:20] <gsnedders> jwalden: It appears to apply NFC
- # [22:20] * mpilgrim is now known as Guest96328
- # [22:21] * mpilgrim_ is now known as mpilgrim
- # [22:21] <gsnedders> (So my first test failed)
- # [22:21] * Quits: Guest96328 (n=mark@67.217.134.48) (Read error: 113 (No route to host))
- # Session Close: Sun Mar 29 00:00:00 2009
The end :)