Options:
- # Session Start: Tue Dec 18 00:00:00 2007
- # Session Ident: #whatwg
- # [00:04] * Quits: billmason (n=billmaso@ip156.unival.com) (Read error: 104 (Connection reset by peer))
- # [00:06] * Joins: billmason (n=billmaso@ip156.unival.com)
- # [00:06] * Quits: jgraham_ (n=james@81-86-217-3.dsl.pipex.com) (Read error: 104 (Connection reset by peer))
- # [00:06] * Joins: jgraham_ (n=james@81-86-217-3.dsl.pipex.com)
- # [00:08] * Joins: kingryan (n=kingryan@dsl092-219-050.sfo1.dsl.speakeasy.net)
- # [00:11] * Quits: jruderman (n=jruderma@corp-241.mountainview.mozilla.com)
- # [00:18] * Joins: jruderman (n=jruderma@corp-241.mountainview.mozilla.com)
- # [00:19] * om_sleep is now known as othermaciej
- # [00:29] * Quits: tndH_ (n=Rob@adsl-87-102-85-140.karoo.KCOM.COM) ("ChatZilla 0.9.79-rdmsoft [XULRunner 1.8.0.9/2006120508]")
- # [00:38] * Quits: doublec (n=doublec@203-211-82-218.ue.woosh.co.nz)
- # [00:48] * Quits: phsiao (n=shawn@nat/ibm/x-8ba68e3d4ab72a0a)
- # [01:05] * Philip` sees that at least one of the iPlayer shows has a sign language interpreter overlayed (seemingly with no way to disable it), which is a bit distracting
- # [01:17] <Hixie> many shows in the uk get broadcast twice, once with and once without the sign interpreter
- # [01:17] <Hixie> including, e.g., cartoons
- # [01:19] * Joins: akaroa (n=opera@121-72-6-99.dsl.telstraclear.net)
- # [01:21] * Quits: billmason (n=billmaso@ip156.unival.com) (".")
- # [01:30] <Hixie> hsivonen: how do i distinguish RDF from MathML and SVG in a generic sense?
- # [01:31] <Hixie> as in, "elements that are from non-XHTML namespaces and whose semantics define metadata" might describe RDF
- # [01:31] <Hixie> but i can't see how to describe MathML/SVG
- # [01:36] <othermaciej> what is the purpose of the distinction?
- # [01:42] <Hixie> i want to allow rdf anywhere metadata is allowed
- # [01:42] <Hixie> and want to allow svg anywhere prose content is allowed
- # [01:43] <Hixie> (without calling them out by name, since i don't want to close teh set of namespaces allowed)
- # [01:45] <Hixie> 25 points to anyone who finds a reason why we can't allow <ol>, <div>, <ul>, <p>, etc, in <address>
- # [01:47] <othermaciej> "elements ... whose semantics define metadata" and "elements ... whose semantics define content" maybe?
- # [01:47] <othermaciej> that's pretty vague
- # [01:47] <Hixie> that's what i have but i argued with myself that metadata is a kind of content
- # [01:48] <othermaciej> what isn't a kind of content?
- # [01:49] <othermaciej> (you could also ask, what isn't a kind of metadata?)
- # [01:49] <Hixie> yeah
- # [01:49] <Hixie> that's my problem
- # [01:50] <webben> Philip`: Do you have a link for that? The impression I got was that you'd get a choice of non-interpreted vs interpreted.
- # [01:51] <Hixie> while i'm boiling the oceans over here getting rid of hte block vs inline distinction, maybe i should make <body> optional too... http://www.glazman.org/weblog/dotclear/index.php?post/2005/05/27/1055-adam-2
- # [01:51] <Hixie> (see the paragraph starting "Steven Pemberton")
- # [01:52] <Philip`> webben: http://www.bbc.co.uk/iplayer/page/item/b008d36h.shtml is one (linked from the front page, with no link to an alternative version that I can see)
- # [01:52] <webben> ta
- # [01:52] <othermaciej> Hixie: isn't <body> already optional?
- # [01:52] <Philip`> (It's quite probable that the Windows downloadable version has more options than the Flash streaming one)
- # [01:53] <Hixie> othermaciej: not in the dom, only in the html serialisation
- # [01:53] <othermaciej> ah
- # [01:59] * Joins: doublec (n=doublec@203-211-86-194.ue.woosh.co.nz)
- # [02:00] * Quits: doublec (n=doublec@203-211-86-194.ue.woosh.co.nz) (Client Quit)
- # [02:11] <Hixie> as part of the liberation of html elements, i'm allowing <style scoped> virtually anywhere
- # [02:11] <Hixie> let me know if you think <style scoped> should be a little more oppressed
- # [02:15] <othermaciej> where was it previously not allowed?
- # [02:17] <Hixie> only in section, nav, div, and article
- # [02:17] <Hixie> (i'm still requiring that it be at the top of its parent)
- # [02:22] * Joins: hdh (n=hdh@118.71.58.207)
- # [02:44] * Quits: csarven (n=nevrasc@modemcable130.251-202-24.mc.videotron.ca) ("http://www.csarven.ca/")
- # [02:54] * Quits: kingryan (n=kingryan@dsl092-219-050.sfo1.dsl.speakeasy.net) (Read error: 110 (Connection timed out))
- # [03:01] * Joins: dglazkov (n=dglazkov@adsl-074-229-248-021.sip.bhm.bellsouth.net)
- # [03:02] * Quits: webben (n=benh@82.152.16.177)
- # [03:07] * othermaciej is now known as om_afk
- # [03:10] * Joins: kingryan (n=kingryan@dsl092-002-056.sfo1.dsl.speakeasy.net)
- # [03:37] <deltab> om_afk: in SGML terms at least, the body element is mandatory but its tags are optional
- # [03:38] <deltab> likewise html, head, and tbody
- # [03:38] * Quits: weinig (n=weinig@17.203.15.140) (Read error: 104 (Connection reset by peer))
- # [03:38] * Joins: weinig (n=weinig@17.203.15.140)
- # [03:42] <Hixie> hsivonen is going to have a field day when this is done
- # [03:42] <Hixie> there are so many subtle changes
- # [03:43] <Philip`> Only one field day? That doesn't sound too bad
- # [03:43] <Hixie> hsivonen is good
- # [03:43] <Hixie> :-P
- # [04:15] * Quits: kingryan (n=kingryan@dsl092-002-056.sfo1.dsl.speakeasy.net)
- # [04:17] * om_afk is now known as othermaciej
- # [04:46] * Quits: hdh (n=hdh@118.71.58.207) (Read error: 110 (Connection timed out))
- # [04:56] * Quits: dglazkov (n=dglazkov@adsl-074-229-248-021.sip.bhm.bellsouth.net)
- # [05:28] * Quits: dbaron_ (n=dbaron@corp-241.mountainview.mozilla.com) ("8403864 bytes have been tenured, next gc will be global.")
- # [05:51] * Joins: chessguy (n=chessguy@ip68-100-163-1.dc.dc.cox.net)
- # [05:54] * Quits: roc (n=roc@202.0.36.64)
- # [05:55] * Joins: Facedown (n=HELLO@c-68-48-62-208.hsd1.md.comcast.net)
- # [06:17] * Joins: billyjack (n=MikeSmit@58.157.21.205)
- # [06:17] * Quits: billyjack (n=MikeSmit@58.157.21.205) (Client Quit)
- # [06:25] * Joins: dbaron (n=dbaron@c-71-204-145-103.hsd1.ca.comcast.net)
- # [06:38] * Joins: hdh (n=hdh@118.71.209.234)
- # [06:49] * Quits: chessguy (n=chessguy@ip68-100-163-1.dc.dc.cox.net) ("Leaving")
- # [07:40] * Quits: MikeSmith (n=MikeSmit@58.157.21.205) ("Less talk, more pimp walk.")
- # [07:43] * Joins: maikmerten (n=merten@ls5laptop14.cs.uni-dortmund.de)
- # [07:54] * Joins: MikeSmith (n=MikeSmit@58.157.21.205)
- # [07:59] * Quits: Hemebond (n=Hemebond@ip-118-90-3-92.xdsl.xnet.co.nz) ("Leaving.")
- # [08:00] * Disconnected
- # [08:01] * Attempting to rejoin channel #whatwg
- # [08:01] * Rejoined channel #whatwg
- # [08:01] * Topic is 'WHATWG (HTML5) -- http://www.whatwg.org/ -- Logs: http://krijnhoetmer.nl/irc-logs/ -- Please leave your sense of logic at the door, thanks!'
- # [08:01] * Set by Hixie on Tue Apr 03 03:10:22
- # [08:17] * Quits: krijn (n=krijnhoe@ktk.xs4all.nl) (Read error: 110 (Connection timed out))
- # [08:28] * Joins: weinig_ (n=weinig@17.203.15.140)
- # [08:28] * Quits: weinig (n=weinig@17.203.15.140) (Read error: 104 (Connection reset by peer))
- # [08:30] * Quits: hober (n=ted@unaffiliated/hober) ("ERC Version 5.3 (devel) (IRC client for Emacs)")
- # [08:31] * Quits: jruderman (n=jruderma@corp-241.mountainview.mozilla.com)
- # [08:41] * Joins: Hemebond (n=Hemebond@ip-118-90-125-176.xdsl.xnet.co.nz)
- # [08:41] * Quits: weinig_ (n=weinig@17.203.15.140) (Read error: 104 (Connection reset by peer))
- # [08:41] * Joins: weinig (n=weinig@17.203.15.140)
- # [08:43] * Joins: roc (n=roc@121-72-28-191.dsl.telstraclear.net)
- # [08:46] * Quits: weinig (n=weinig@17.203.15.140)
- # [08:56] * Quits: MikeSmith (n=MikeSmit@58.157.21.205) ("Less talk, more pimp walk.")
- # [08:56] * Joins: MikeSmith (n=MikeSmit@58.157.21.205)
- # [08:57] * Joins: jruderman (n=jruderma@c-67-180-15-227.hsd1.ca.comcast.net)
- # [08:57] * Quits: dbaron (n=dbaron@c-71-204-145-103.hsd1.ca.comcast.net) (Read error: 110 (Connection timed out))
- # [08:59] * Quits: MikeSmith (n=MikeSmit@58.157.21.205) (Client Quit)
- # [09:28] * Joins: tndH_ (i=Rob@adsl-87-102-85-140.karoo.KCOM.COM)
- # [09:28] * tndH_ is now known as tndH
- # [09:34] * Joins: weinig (n=weinig@c-71-198-176-23.hsd1.ca.comcast.net)
- # [09:56] * Joins: peepo (n=Jay@host86-129-168-72.range86-129.btcentralplus.com)
- # [10:01] * Quits: peepo (n=Jay@host86-129-168-72.range86-129.btcentralplus.com) ("later")
- # [10:03] * Joins: zcorpan (n=zcorpan@c-cb21e353.1451-1-64736c12.cust.bredbandsbolaget.se)
- # [10:13] * Quits: Lachy (n=Lachlan@cm-84.215.41.149.getinternet.no) (Read error: 104 (Connection reset by peer))
- # [10:25] * Quits: maikmerten (n=merten@ls5laptop14.cs.uni-dortmund.de) (Read error: 110 (Connection timed out))
- # [10:29] * Joins: ROBOd (n=robod@89.122.216.38)
- # [10:39] * Joins: Camaban (n=adrianle@host217-41-27-233.in-addr.btopenworld.com)
- # [10:42] <Hixie> my edits are making the heading association algorithm even harder to understand
- # [10:43] <Hixie> so i'll definitely have to rewrite it at some point
- # [10:55] * Quits: roc (n=roc@121-72-28-191.dsl.telstraclear.net)
- # [11:16] * Joins: maikmerten (n=merten@ls5laptop14.cs.uni-dortmund.de)
- # [11:16] * Quits: zcorpan (n=zcorpan@c-cb21e353.1451-1-64736c12.cust.bredbandsbolaget.se) (Read error: 110 (Connection timed out))
- # [11:31] * Joins: Lachy (n=Lachlan@pat-tdc.opera.com)
- # [11:40] * Quits: deltab (n=deltab@82-36-30-34.cable.ubr02.smal.blueyonder.co.uk) (calvino.freenode.net irc.freenode.net)
- # [11:40] * Quits: arnath01 (n=arnath@d54C1C929.access.telenet.be) (calvino.freenode.net irc.freenode.net)
- # [11:40] * Quits: Lfe (n=lfe@bergstroem.nu) (calvino.freenode.net irc.freenode.net)
- # [11:40] * Quits: Philip` (n=philip@zaynar.demon.co.uk) (calvino.freenode.net irc.freenode.net)
- # [11:40] * Quits: Kuruma (n=Kuruman@h123-176-107-050.catv01.catv-yokohama.ne.jp) (calvino.freenode.net irc.freenode.net)
- # [11:40] * Quits: syp| (n=syp@lasigpc9.epfl.ch) (calvino.freenode.net irc.freenode.net)
- # [11:40] * Quits: gavins (n=gavin@firefox/developer/gavin) (calvino.freenode.net irc.freenode.net)
- # [11:40] * Quits: mitsuhiko (n=mitsuhik@ubuntu/member/mitsuhiko) (calvino.freenode.net irc.freenode.net)
- # [11:41] * Joins: deltab (n=deltab@82-36-30-34.cable.ubr02.smal.blueyonder.co.uk)
- # [11:41] * Joins: syp| (n=syp@lasigpc9.epfl.ch)
- # [11:41] * Joins: Kuruma (n=Kuruman@h123-176-107-050.catv01.catv-yokohama.ne.jp)
- # [11:41] * Joins: Philip` (n=philip@zaynar.demon.co.uk)
- # [11:41] * Joins: Lfe (n=lfe@bergstroem.nu)
- # [11:41] * Joins: mitsuhiko (n=mitsuhik@ubuntu/member/mitsuhiko)
- # [11:41] * Joins: arnath01 (n=arnath@d54C1C929.access.telenet.be)
- # [11:41] * Joins: gavins (n=gavin@firefox/developer/gavin)
- # [11:42] * Quits: maikmerten (n=merten@ls5laptop14.cs.uni-dortmund.de) (Read error: 110 (Connection timed out))
- # [12:06] <Hixie> good lord <ins> is going to make my head explode
- # [12:07] <Hixie> it just totally screwed up my entire carefully designed content model
- # [12:07] <Hemebond> Why?
- # [12:07] <Hemebond> You working on HTML5?
- # [12:07] <Hixie> yeah
- # [12:07] <Philip`> Replace it by an attribute, so you can have <p ins> and <b ins> and <blockquote ins> and whatever, which won't affect the content model
- # [12:08] <othermaciej> the XHTML2 solution!
- # [12:08] <Hixie> <article> <ins> <em> hello </em> </ins> <ins> world </ins> of pain <ins> <p> how are you </p> </ins> </article>
- # [12:08] <Hixie> that has to become 2 paragraphs
- # [12:08] <Hixie> but it is legal
- # [12:09] <Hixie> and <article> <em> hello <ins> world </ins> </em> </article> is legal too
- # [12:09] <Hemebond> ins is block?
- # [12:09] <Hixie> it's... something
- # [12:09] <Philip`> It's not like people do much processing of <ins> today, so it wouldn't hurt much to change it
- # [12:09] <Hemebond> Old INS was only inline, right?
- # [12:10] <Hixie> <ins> was neither inline nor block in html4
- # [12:10] <Hixie> it was an sgml inclusion
- # [12:11] <Hemebond> *boggle*
- # [12:11] <Lachy> Hixie, I thought this issue was solved using transparent content models, and that your first example wouldn't be legal
- # [12:24] <Hixie> what's wrong with the first example?
- # [12:25] <Hixie> the transparent content models don't really help determine where <ins> is allowed in the first place, only what's allowed inside it
- # [12:25] <Lachy> Hixie, because <article> <em></em> <p></p> </article> isn't allowed
- # [12:26] <Hixie> it is in the new model
- # [12:27] * Quits: akaroa (n=opera@121-72-6-99.dsl.telstraclear.net) (Read error: 110 (Connection timed out))
- # [12:27] <Lachy> why?
- # [12:27] <Hixie> we're removing the requirement that <p> be required around paragraphs
- # [12:27] <Hixie> apparently it was causing too much confusion
- # [12:27] <Lachy> who was confused by it?
- # [12:27] <Hixie> sam ruby, e.g.
- # [12:28] <Lachy> oh, he's confused by a lot of things, and I generally disagree with him
- # [12:28] <Hixie> heh
- # [12:28] <Hixie> he wasn't the only one
- # [12:29] <Hixie> basically paragraphs are implied much like sections get implied
- # [12:29] <Hixie> at least, that's the model i'm writing up
- # [12:29] <Hixie> who knows if we'll keep it
- # [12:30] <Lachy> but that makes it possible to do things like this: <div><p>foo</p> bar <p>baz</p></div> and it represents 3 paragraphs.
- # [12:31] <Lachy> I really don't like that model
- # [12:31] <Hemebond> Ugh
- # [12:31] <Hixie> me either, but it's hard to find a good argument for why that's not ok
- # [12:31] <Hixie> (and note that html4 strict allows it)
- # [12:31] <Lachy> where are the arguments that it is ok?
- # [12:31] <Hixie> (so a lot of people will hit that when transitioning to html5)
- # [12:32] <Hemebond> Allows what? Dropping of tags?
- # [12:32] <Hixie> Lachy: it doesn't cause anyone any harm, and it's unambiguous
- # [12:32] <Hixie> Hemebond: allows mixing inline and block content in <div> elements
- # [12:33] <Hemebond> Ah, right.
- # [12:34] <Hemebond> Hixie: Are you the BDFL for WHATWG?
- # [12:35] <Hixie> pretty much
- # [12:35] <Hemebond> The intawebs just told me so.
- # [12:35] <Hixie> (except maybe for the "FL" part)
- # [12:35] * Joins: OmegaJunior (n=ZJr@a82-95-48-162.adsl.xs4all.nl)
- # [12:35] <Hixie> Lachy: see, kinda hard to come up with a good counter argument
- # [12:36] <Lachy> Hixie, not hard, just busy with other stuff right now.
- # [12:37] <Hixie> well, if you can find an argument, do let me know, because i couldn't, and that backed me into the corner of allowing it.
- # [12:38] <Hixie> and i don't like it
- # [12:41] <Lachy> can you point me to Sam's arguments?
- # [12:45] <Hixie> he didn't really argue for anything as i recall
- # [12:45] <Hixie> but see the dogfood thread in public-html, e.g.
- # [12:46] <Hixie> wow i'm really confused about how to do <ins> and <del> with the implied paragraph stuff
- # [12:46] <Hixie> i'd better sleep and work on this tomorro
- # [12:46] <Hixie> nn
- # [12:48] * Joins: zcorpan (n=zcorpan@pat.se.opera.com)
- # [12:50] <Camaban> what the... so hang on, if <p> isn't required, does that mean whitespace in HTML is going to become something to think about? (aside from a couple of browser bugs)
- # [12:52] <Lachy> Camaban, no
- # [12:52] <Camaban> presume I'm missing something, I'll go look it up
- # [12:53] <Hemebond> What makes it an implied <P>?
- # [12:53] <Lachy> Camaban, white space can't become significant because it would seriously break compatibility
- # [12:53] <OmegaJunior> It already does... browser bug in MSIE
- # [12:54] <Hemebond> And isn't whitespace kept in Mozilla too?
- # [12:54] <Camaban> Lachy: that's what I'm thinking, but if <div><p>foo</p> bar <p>baz</p></div> represents 3 paragraphs, then it looks like whitespace would be significant
- # [12:54] <Hemebond> Between parent and child elements?
- # [12:54] <OmegaJunior> It shouldn't...
- # [12:55] <Lachy> Hemebond, <p> will never be implied directly. But rather, the concept of an implied paragraph is a little more abstract
- # [12:55] <Hemebond> I believe (maybe they fixed it) if you iterated through the children of an <OL>, you'd pick up the whitespace/line-returns too.
- # [12:55] <Lachy> Camaban, even if you took out the whitepace in that example, it would still represent 3 paragraphs in the new model
- # [12:55] <OmegaJunior> Yes, Hemebond, that is normal
- # [12:55] <Hemebond> Lachy: So it isn't marked up with anything except the DIV?
- # [12:55] <OmegaJunior> But it shouldn't be displayed by the browser
- # [12:56] <Hemebond> OmegaJunior: But.... why is there whitespace between elements where there shouldn't be any other child elements?
- # [12:56] <Hemebond> <OL> can only have <LI> as a child element, no?
- # [12:56] <Camaban> ok, so how would soemthing like http://pastebin.com/d367d5a2c be interpreted? as 1 paragraph, or 2?
- # [12:56] <OmegaJunior> Hemebond: because the source contains new-line or carriage-return?
- # [12:56] <OmegaJunior> That counts as whitespace, too.
- # [12:57] <Hemebond> OmegaJunior: I'm talking about the DOM though.
- # [12:57] <OmegaJunior> Yes
- # [12:57] <Lachy> the whitespace will still be present in the DOM, but as far as semantics are concerned, it's very much irrelevant
- # [12:57] <Lachy> I gotta go
- # [12:57] * Quits: Lachy (n=Lachlan@pat-tdc.opera.com) ("Leaving")
- # [12:57] <OmegaJunior> The DOM sees spaces, carriage returns, new-lines, tabs, etc. as white-space text nodes
- # [12:57] <OmegaJunior> unless they're already inside a text
- # [12:58] <Hemebond> OmegaJunior: I know.
- # [12:58] <Philip`> "Inter-element whitespace, comment nodes, and processing instruction nodes must be ignored when establishing whether an element matches its content model or not, and must be ignored when following algorithms that define document and element semantics."
- # [12:58] <Philip`> (says HTML5)
- # [12:58] <OmegaJunior> Excellent
- # [12:58] <OmegaJunior> Let's hope the browser vendors will implement it
- # [12:59] <othermaciej> that's not a requirement for browsers
- # [12:59] <OmegaJunior> How so?
- # [12:59] <Philip`> so they're in the DOM, and included in DOM childNodes and stuff, but are not relevant for conformance or for HTML5-defined algorithms (e.g. to find the list items in a list)
- # [12:59] <othermaciej> that rule applies to documents and validators
- # [13:00] <Philip`> othermaciej: The "must be ignored when following algorithms ..." sounds like browser conformance
- # [13:00] <othermaciej> "when following algorithms that define document and element semantics"
- # [13:00] <OmegaJunior> othermaciej: it applies to documents and validators but not to browsers? That seems contradictory.
- # [13:00] <Philip`> at least to the extent that browsers run those algorithms
- # [13:00] <othermaciej> in particular it wouldn't apply to the parsing algorithm, as written
- # [13:00] <Philip`> (which should be a non-zero extent, I guess)
- # [13:00] <othermaciej> since that does not define document or element semantics
- # [13:01] <OmegaJunior> No... but it interprets it...
- # [13:01] <Hemebond> The DOM isn't the document?
- # [13:01] <OmegaJunior> No, just a model of it
- # [13:01] <Hemebond> Which the browser uses to build the document.
- # [13:02] <othermaciej> I don't know what algorithms are considered to define document and element semantics and which browsers must follow
- # [13:02] <othermaciej> off hand
- # [13:02] <othermaciej> but I don't think the quote from Philip` is primarily aimed at browser algorithms that would be relevant here, thus, "let's hope browser vendors will implement it" is not really a proper response
- # [13:03] <OmegaJunior> I disagree
- # [13:03] * Philip` agrees with othermaciej :-)
- # [13:03] <othermaciej> it's true that <div><p>foo</p> <p>bar</p></div> should generate a text node in the DOM for the space, but not render it
- # [13:04] <OmegaJunior> What good does it do, to describe how whitespace should be handled, if that description doesn't apply to how a browser should handle whitespace while parsing and displaying a document?
- # [13:04] <othermaciej> but the reasons are the HTML5 parsing algorithm and CSS whitespace collapsing rules
- # [13:04] <othermaciej> there are descriptions of how whitespace should be handled while parsing and displaying a document
- # [13:04] <othermaciej> for parsing, that is the parsing algorithm
- # [13:05] <othermaciej> for display, it is CSS (mainly whitespace collapsing, possibly some other stuff)
- # [13:05] <Hemebond> the parsing alg is different to the alg for generating a DOM?
- # [13:05] <Philip`> The parsing algorithm is what generates the DOM
- # [13:05] <OmegaJunior> othermaciej, And as everyone will understand, the parsing algorythem is developed without any contact with the rest of the HTML5 spec.
- # [13:05] <OmegaJunior> Difficult word, algorythm
- # [13:05] <othermaciej> the HTML5 parsing algorithm describes how to take a text/html document and generate a DOM (with various possible side effects along the way)
- # [13:06] * Parts: zcorpan (n=zcorpan@pat.se.opera.com)
- # [13:06] <Hemebond> This is confusing. And messy.
- # [13:06] <OmegaJunior> So how whitespace should be handled in a document has no bearing on how a parser should handle whitespace in a document?
- # [13:06] <OmegaJunior> Hemebond: I agree
- # [13:07] <Hemebond> I think I'll go learn XHTML2.
- # [13:07] <Philip`> If you have <div><p>foo</p> <p>bar</p></div> in XHTML, you get a text node between the two 'p' nodes, exactly the same as in HTML
- # [13:08] <Philip`> (in terms of the DOM, which the document gets parsed into)
- # [13:08] <OmegaJunior> Yes, I'd expect as much.
- # [13:08] <Philip`> (and if you write scripts to manipulate the DOM, you'll hit those text nodes)
- # [13:08] <othermaciej> the web is messy
- # [13:08] <Hemebond> Very.
- # [13:08] <Philip`> so XHTML2 won't make this any less confusing :-)
- # [13:08] <Hemebond> So why is HTML5 trying to validate that mess instead of cleaning it up?
- # [13:08] <OmegaJunior> The HTML5-project should try to avoid adding to the mess.
- # [13:09] <Philip`> What should HTML5 do instead?
- # [13:09] <Philip`> (in this particular case)
- # [13:09] <Hemebond> Well, is inline content allowed in a DIV?
- # [13:10] <Philip`> Today HTML5 says "Content model: Zero or more style elements, followed by either zero or more block-level elements, or inline-level content (but not both)."
- # [13:10] <Philip`> (but that is being rewritten by Hixie)
- # [13:10] <Philip`> so it depends on what "is" is :-)
- # [13:11] <OmegaJunior> Yes, inline-content is allowed in a DIV, but not together with block-content.
- # [13:11] <OmegaJunior> (current spec)
- # [13:11] <Hemebond> Then document is invalid and should fail.
- # [13:12] <Philip`> But the current spec says inter-element whitespace is not significant for whether a document is valid; otherwise it'd be horrible because you couldn't e.g. write a list with one <li>item...</li> per line
- # [13:13] <Hemebond> I was thinking of the original example that had text between the nodes.
- # [13:13] <Hemebond> Whitespace between elements should be ignored.
- # [13:14] <OmegaJunior> Agreed
- # [13:14] <OmegaJunior> That whitespace exists for the benefit of the source programmers and readers.
- # [13:14] <Philip`> Okay - that one is currently invalid, but apparently there has been no convincing reason why it should be invalid, and it causes problems for people migrating from HTML and provides no benefits
- # [13:15] <Hemebond> Boohoo. Their code is shit.
- # [13:15] <Philip`> Their code is valid HTML4
- # [13:15] <Hemebond> Yeah.
- # [13:15] <Hemebond> So they can stay with HTML4.
- # [13:15] <Hemebond> Or clean it up.
- # [13:15] <Hemebond> HTML4 allows inline and block in the same div?
- # [13:15] <OmegaJunior> Yeah
- # [13:16] <Hemebond> Awesome.
- # [13:16] <Hemebond> Why not allow it here in HTML5 then?
- # [13:16] <OmegaJunior> Hemebond: because the rendering is undefined
- # [13:16] <OmegaJunior> Which leads to differences between browser
- # [13:16] <Hemebond> I see.
- # [13:17] <Hemebond> Hmmm
- # [13:17] <Hemebond> Render it inline, at the top of the DIV.
- # [13:17] <Hemebond> With block elements rendering below it.
- # [13:17] <Hemebond> Haha
- # [13:17] <Philip`> http://validator.w3.org/check?fragment=%3C!DOCTYPE%20HTML%20PUBLIC%20%22-//W3C//DTD%20HTML%204.0//EN%22%3E%20%3Ctitle%3E%3C/title%3E%20%3Cdiv%3E%3Cp%3Efoo%3C/p%3E%20bar%20%3Cp%3Ebaz%3C/p%3E%3C/div%3E
- # [13:17] <Hemebond> That'd mess with people.
- # [13:17] <Philip`> This Page Is Valid HTML 4.0 Strict!
- # [13:17] <Hemebond> w3.org validator means nothing.
- # [13:18] <Hemebond> I mentally slap anyone who claims spec conformance by using that validator.
- # [13:18] <OmegaJunior> o.O.
- # [13:18] <Hemebond> In this case of course...
- # [13:18] <Hemebond> it does match.
- # [13:18] <Camaban> and the one you use is.....
- # [13:19] <Hemebond> Got a link somewhere....
- # [13:19] <Philip`> http://htmlhelp.com/cgi-bin/validate.cgi?url=http%3A%2F%2Fphilip.html5.org%2Fmisc%2Fdivblockinline.html&warnings=yes
- # [13:19] <OmegaJunior> To my knowledge the w3 validator is authoritive?
- # [13:19] <Hemebond> It's no longer developed as far as I know.
- # [13:19] <Hemebond> And not complete.
- # [13:20] <Hemebond> According to ex-w3 members.
- # [13:20] <Hemebond> Philip`: Yes, I think that's the one.
- # [13:20] <Philip`> "Congratulations, no errors!"
- # [13:20] <Hemebond> Yeah... I know.
- # [13:20] <Hemebond> Because it is valid HTML4.
- # [13:20] <Hemebond> My comment was just slightly OT.
- # [13:21] <Hemebond> Just an aside.
- # [13:22] <Philip`> Hmm, Validome doesn't work at all - it says the doctype is missing
- # [13:23] <Philip`> Oh, right, it's just broken in Opera
- # [13:24] <Philip`> Hemebond: The W3C validator is still actively developed
- # [13:24] <Camaban> http://validator.w3.org/whatsnew.html#t2007-10-11 no longer developed?
- # [13:24] <Hemebond> *shrug*
- # [13:24] <Hemebond> Just repeating what I read.
- # [13:24] <Camaban> and that actually the validator at w3.org and htmlhelp are identical
- # [13:26] * Quits: jgraham_ (n=james@81-86-217-3.dsl.pipex.com) ("This computer has gone to sleep")
- # [13:26] <Hemebond> 1:30am
- # [13:26] <Hemebond> Night all.
- # [13:28] <Philip`> Good night
- # [13:28] <OmegaJunior> Night!
- # [13:45] * Joins: Lachy (n=Lachlan@pat-tdc.opera.com)
- # [14:12] * Joins: akaroa (n=opera@121-72-18-70.dsl.telstraclear.net)
- # [14:19] * Joins: maikmerten (n=merten@ls5laptop14.cs.uni-dortmund.de)
- # [14:30] * hdh thought of tagging white-space node with irrelevant
- # [14:35] * Quits: ROBOd (n=robod@89.122.216.38) ("http://www.robodesign.ro")
- # [14:47] * Quits: OmegaJunior (n=ZJr@a82-95-48-162.adsl.xs4all.nl) ("Trillian (http://www.ceruleanstudios.com")
- # [14:48] <Philip`> hdh: Adding the 'irrelevant' attribute to white-space text nodes in the DOM? I thought only Element nodes could have attributes
- # [14:50] <hdh> hm, okay
- # [14:59] * Joins: webben (n=benh@dip5-fw.corp.ukl.yahoo.com)
- # [15:30] * Joins: csarven (n=nevrasc@81-5-133-33.static.nfwebsolutions.com)
- # [15:32] * Quits: csarven (n=nevrasc@81-5-133-33.static.nfwebsolutions.com) (Remote closed the connection)
- # [15:37] * Joins: csarven (n=nevrasc@81-5-133-33.static.nfwebsolutions.com)
- # [15:41] * Joins: phsiao (n=shawn@c-24-61-15-24.hsd1.ma.comcast.net)
- # [15:52] * Quits: phsiao (n=shawn@c-24-61-15-24.hsd1.ma.comcast.net)
- # [15:57] * Quits: maikmerten (n=merten@ls5laptop14.cs.uni-dortmund.de) ("Verlassend")
- # [16:06] * Joins: MikeSmith (n=MikeSmit@58.157.21.205)
- # [16:19] * dolphinling_ is now known as dolphinling
- # [16:32] * Joins: ROBOd (n=robod@89.122.216.38)
- # [16:39] * Joins: webben_ (n=benh@82.152.16.177)
- # [16:40] * Joins: phsiao (n=shawn@nat/ibm/x-3fbca537f5cd714d)
- # [16:45] * Joins: billmason (n=billmaso@ip156.unival.com)
- # [16:55] * Quits: webben (n=benh@dip5-fw.corp.ukl.yahoo.com) (Read error: 110 (Connection timed out))
- # [16:57] * Joins: gsnedders (n=gsnedder@host86-135-224-200.range86-135.btcentralplus.com)
- # [16:59] * Joins: Lachy_ (n=Lachlan@cm-84.215.9.100.getinternet.no)
- # [17:01] * Quits: Lachy_ (n=Lachlan@cm-84.215.9.100.getinternet.no) (Client Quit)
- # [17:01] * Joins: Lachy_ (n=Lachlan@cm-84.215.9.100.getinternet.no)
- # [17:06] * Quits: Lachy (n=Lachlan@pat-tdc.opera.com) (Read error: 110 (Connection timed out))
- # [17:21] * Joins: grimboy (n=grimboy@85.211.236.206)
- # [17:29] * Joins: Lachy__ (n=Lachlan@cm-84.215.9.100.getinternet.no)
- # [17:38] * Quits: Lachy_ (n=Lachlan@cm-84.215.9.100.getinternet.no) (Read error: 110 (Connection timed out))
- # [17:57] * Joins: jgraham_ (n=james@81-86-217-3.dsl.pipex.com)
- # [17:57] * Joins: Lachy_ (n=Lachlan@cm-84.215.9.100.getinternet.no)
- # [18:00] * Joins: hober (n=ted@unaffiliated/hober)
- # [18:00] * Quits: Lachy__ (n=Lachlan@cm-84.215.9.100.getinternet.no) (Read error: 104 (Connection reset by peer))
- # [18:06] * Joins: Lachy__ (n=Lachlan@cm-84.215.9.100.getinternet.no)
- # [18:09] * Joins: Lachy (n=Lachlan@cm-84.215.9.100.getinternet.no)
- # [18:10] * Quits: Lachy_ (n=Lachlan@cm-84.215.9.100.getinternet.no) (Read error: 104 (Connection reset by peer))
- # [18:14] * Joins: grimeboy (n=grimboy@85-211-255-209.dsl.pipex.com)
- # [18:18] * Joins: grimboy_uk (n=grimboy@85-211-246-205.dsl.pipex.com)
- # [18:19] * Joins: Lachy_ (n=Lachlan@cm-84.215.9.100.getinternet.no)
- # [18:26] * Quits: Lachy__ (n=Lachlan@cm-84.215.9.100.getinternet.no) (Read error: 110 (Connection timed out))
- # [18:27] * Quits: Lachy_ (n=Lachlan@cm-84.215.9.100.getinternet.no) ("Leaving")
- # [18:30] * Joins: webben (n=benh@dip5-fw.corp.ukl.yahoo.com)
- # [18:30] * Quits: Lachy (n=Lachlan@cm-84.215.9.100.getinternet.no) (Read error: 110 (Connection timed out))
- # [18:32] * Quits: grimeboy (n=grimboy@85-211-255-209.dsl.pipex.com) (Read error: 110 (Connection timed out))
- # [18:35] * Quits: grimboy (n=grimboy@85.211.236.206) (Read error: 110 (Connection timed out))
- # [18:46] * Quits: webben_ (n=benh@82.152.16.177) (Connection timed out)
- # [18:50] * Quits: weinig (n=weinig@c-71-198-176-23.hsd1.ca.comcast.net)
- # [18:51] * Joins: maikmerten (n=maikmert@L9809.l.pppool.de)
- # [18:58] * Parts: Camaban (n=adrianle@host217-41-27-233.in-addr.btopenworld.com)
- # [19:01] * Quits: jgraham_ (n=james@81-86-217-3.dsl.pipex.com) ("This computer has gone to sleep")
- # [19:06] * Joins: dbaron (n=dbaron@c-71-204-145-103.hsd1.ca.comcast.net)
- # [19:11] * Joins: virtuelv (n=virtuelv@pat-tdc.opera.com)
- # [19:16] * Quits: psa (n=yomode@71.93.19.66) (Read error: 104 (Connection reset by peer))
- # [19:17] * Joins: psa (n=yomode@71.93.19.66)
- # [19:23] * Joins: Lachy (n=Lachlan@cm-84.215.9.100.getinternet.no)
- # [19:31] * Quits: virtuelv (n=virtuelv@pat-tdc.opera.com) ("Leaving")
- # [19:40] * Quits: jruderman (n=jruderma@c-67-180-15-227.hsd1.ca.comcast.net)
- # [19:47] * Quits: dbaron (n=dbaron@c-71-204-145-103.hsd1.ca.comcast.net) ("8403864 bytes have been tenured, next gc will be global.")
- # [20:00] * Joins: weinig (n=weinig@17.203.15.140)
- # [20:04] * Joins: roc (n=roc@121-72-28-191.dsl.telstraclear.net)
- # [20:06] * Joins: dbaron (n=dbaron@corp-241.mountainview.mozilla.com)
- # [20:08] * Joins: jruderman (n=jruderma@corp-241.mountainview.mozilla.com)
- # [20:40] * Joins: zcorpan (n=zcorpan@c-cb21e353.1451-1-64736c12.cust.bredbandsbolaget.se)
- # [20:42] * Quits: roc (n=roc@121-72-28-191.dsl.telstraclear.net)
- # [20:47] * Joins: jgraham_ (n=james@81-86-217-3.dsl.pipex.com)
- # [20:52] * Joins: tantek (n=tantek@70-13-206-227.area2.spcsdns.net)
- # [20:59] * Quits: maikmerten (n=maikmert@L9809.l.pppool.de) ("Leaving")
- # [21:11] * Joins: kingryan (n=kingryan@dsl092-219-050.sfo1.dsl.speakeasy.net)
- # [21:12] * Joins: nikola_tesla (n=nikola_t@gw.subscribermail.com)
- # [21:12] * nikola_tesla changes topic to ''
- # [21:16] * Quits: zcorpan (n=zcorpan@c-cb21e353.1451-1-64736c12.cust.bredbandsbolaget.se) (Read error: 110 (Connection timed out))
- # [21:24] * gsnedders wonders what the point of \paragraph and \subparagraph is in LaTeX
- # [21:28] * Joins: grimeboy (n=grimboy@85.211.238.60)
- # [21:29] <hsivonen> isn't \paragraph a bit like \theorem or somesuch--not a generic paragraph?
- # [21:32] * Joins: roc (n=roc@202.0.36.64)
- # [21:35] * Joins: zcorpan (n=zcorpan@c-cb21e353.1451-1-64736c12.cust.bredbandsbolaget.se)
- # [21:35] <zcorpan> is it possible to throw proper DOMExceptions with javascript? e.g. INVALID_STATE_ERR ?
- # [21:36] <zcorpan> throw new Error('INVALID_STATE_ERR') isn't it
- # [21:36] <gsnedders> hsivonen: \theorem? :P
- # [21:40] * Quits: nikola_tesla (n=nikola_t@gw.subscribermail.com) ("~ Computertron ~")
- # [21:40] <Philip`> He stole our topic :-(
- # [21:41] * gsnedders blames him
- # [21:41] * Philip` wonders if anyone has a backup copy
- # [21:42] <hdh> WHATWG (HTML5) -- http://www.whatwg.org/ -- Logs: http://krijnhoetmer.nl/irc-logs/ -- Please leave your sense of logic at the door, thanks!
- # [21:42] * gsnedders changes topic to 'WHATWG (HTML5) -- http://www.whatwg.org/ -- Logs: http://krijnhoetmer.nl/irc-logs/ -- Please leave your sense of logic at the door, thanks!'
- # [21:42] <Philip`> Ah, thanks :-)
- # [21:42] <hdh> :) pidgin powered
- # [21:49] <hsivonen> gsnedders: isn't there a \theorem in LaTeX?
- # [21:49] <gsnedders> hsivonen: no idea. I've only started properly learning LaTeX tonight :P
- # [21:50] * Quits: grimboy_uk (n=grimboy@85-211-246-205.dsl.pipex.com) (Read error: 110 (Connection timed out))
- # [21:51] * Quits: csarven (n=nevrasc@81-5-133-33.static.nfwebsolutions.com) ("http://www.csarven.ca")
- # [22:16] * Quits: zcorpan (n=zcorpan@c-cb21e353.1451-1-64736c12.cust.bredbandsbolaget.se) (Read error: 110 (Connection timed out))
- # [22:20] * Quits: tantek (n=tantek@70-13-206-227.area2.spcsdns.net)
- # [22:28] <jgraham_> Er, well LaTeX can have \anything-you-like with the right style file :)
- # [22:29] <jgraham_> But I think \theorem is pretty standard
- # [22:31] <jgraham_> see http://www.math.uiuc.edu/~hildebr/tex/theorems.html - it seems to be included in amslatex
- # [22:33] <jgraham_> actually I think I'm wrong about that \begin{theorem} seems to be in ordinary LaTeX
- # [22:33] * Quits: grimeboy (n=grimboy@85.211.238.60) (Read error: 110 (Connection timed out))
- # [22:39] * Joins: grimeboy (n=grimboy@85-211-248-197.dsl.pipex.com)
- # [23:01] * Quits: ROBOd (n=robod@89.122.216.38) (Read error: 110 (Connection timed out))
- # [23:06] * Joins: doublec (n=doublec@202.0.36.64)
- # [23:23] * gsnedders wonders whether the following will confuse his computing teacher:
- # [23:23] <gsnedders> "Reliability is the consistency that the program creates the correct output
- # [23:23] <gsnedders> for any given input (to the extent that output can be found given
- # [23:23] <gsnedders> the halting problem)."
- # [23:25] <Philip`> I don't see why the halting problem is at all relevant
- # [23:25] <gsnedders> actually, it shouldn't create any output, should it, therefore meaning it is irrelevant
- # [23:25] * gsnedders headdesks
- # [23:25] <Philip`> Uh, I'm not sure what you mean
- # [23:26] <gsnedders> Philip`: don't worry, just how I ended up with the halting problem being relevant in my head (which is stupid)
- # [23:27] <Philip`> It might be relevant that something which doesn't give an incorrect answer because it doesn't give any answer (i.e. it doesn't terminate) is incorrect, but that's not the halting problem
- # [23:27] <gsnedders> Philip`: mainly down to whether it should be creating any output for a certain input that makes the algorithm fall into the halting problem
- # [23:28] * gsnedders has already forgotten half his reasoning for it anyway
- # [23:28] <gsnedders> and trying to re-create it is failing
- # [23:29] <Philip`> It's only a problem if you want to prove whether the program does or does not produce output, and I'm not sure why you'd want to prove that :-)
- # [23:29] * Joins: Lachy_ (n=Lachlan@cm-84.215.9.100.getinternet.no)
- # [23:30] <gsnedders> Philip`: I did, indirectly, without explaining myself anywhere near enough :)
- # [23:30] <gsnedders> And it's so vaguely relevant and the halting problem isn't even taught so you couldn't really rely on my teacher having a clue what it is :\
- # [23:30] <Philip`> The halting problem is largely irrelevant in reality anyway, because a tool which decides "this program does terminate" or "I'm not sure this program terminates" is possible and useful (and e.g. implemented by Microsoft)
- # [23:32] <Philip`> Also, computers are finite
- # [23:32] * jgraham_ can suddenly access whatwg.org again :)
- # [23:32] <Philip`> So I don't understand why people use the halting problem as an excuse so often :-)
- # [23:34] <gsnedders> Philip`: "… we're only immortal for a limited time."
- # [23:37] <Philip`> (I also don't understand http://www.ietf.org/internet-drafts/draft-gregorio-uritemplate-02.txt talking about Turing completeness, because XML DTDs are not Turing complete and you can do things like the billion laughs attack in non-Turing-complete languages easily since a billion is finite)
- # [23:40] * gsnedders doesn't really under turning completeness
- # [23:40] * Quits: Lachy (n=Lachlan@cm-84.215.9.100.getinternet.no) (Read error: 110 (Connection timed out))
- # [23:41] <jgraham_> Philip`: Do you have any data on how many pages use <form method=foo> with foo not in the set (get, post, put, delete)?
- # [23:42] <Philip`> If a language is Turing complete, it kind of means you can emulate a Turing machine in the language and you can emulate that language on a Turing machine, and hence the language is equivalent in expressiveness to every other Turing complete language, except the details are a bit more complex and that's probably inaccurate :-)
- # [23:43] <jgraham_> (Turing machines are kinda hard to come by. Specifically the infinite rolls of tape aren't stocked by Tesco)
- # [23:44] <Philip`> (That's not a problem - you can predict how fast the tape will be used up, and just buy new rolls and sellotape them onto the end of the previous one when it's about to be used up)
- # [23:44] <gsnedders> (What's a turning machine?)
- # [23:44] <hsivonen> jgraham_: the tape only needs to be long enough for whatever you are computing if the program terminates :-)
- # [23:44] <hsivonen> of course, figuring out if it terminates so you can go buy tape is hard
- # [23:45] <Philip`> http://c2.com/cgi/wiki?BloopFloopAndGloop - you can do vaguely useful things without being Turing complete
- # [23:45] <Philip`> gsnedders: (Do you mean "Turing" rather than "turning"?)
- # [23:45] <gsnedders> Philip`: (yes)
- # [23:45] <gsnedders> (I always do that)
- # [23:45] <Philip`> (http://en.wikipedia.org/wiki/Turing_machine is the obvious place to look :-) )
- # [23:45] <Philip`> (but, er, it looks quite a long page)
- # [23:47] <gsnedders> yeah
- # [23:47] <gsnedders> it is, and confusing.
- # [23:47] * gsnedders wonders what \relax is in LaTeX
- # [23:48] <Philip`> http://www.abelard.org/turpap2/tp2-ie.asp is more confusing
- # [23:48] <jgraham_> gsnedders: if you ever have quite a lot of time to spare you might enjoy something like The Emperor's New Mind by Roger Penrose. It touches on Turing machines, Godel's theorem and lots of other interesting things in a mostly-comprehensible way
- # [23:49] * gsnedders wishes he had more spare time
- # [23:49] * jgraham_ wishes he made better use of his time
- # [23:49] * gsnedders finishes computing homework for today
- # [23:50] <gsnedders> I better do the homework for the Friday before last tomorrow, then.
- # [23:51] <hdh> last tomorrow? end of the world ?-)
- # [23:51] <Philip`> jgraham_: http://www.cl.cam.ac.uk/~pjt47/misc/method.txt is total number of occurrences
- # [23:51] <Philip`> (out of ~15K pages)
- # [23:52] <Philip`> (with XML escaping that shouldn't be there)
- # [23:52] * Philip` fixes the escaping
- # [23:53] <jgraham_> Philip`: Thanks
- # [23:53] <Philip`> http://www.showroomnavi.com/ - <form name="area01" method=post" action="cgi-bin/awsearch.cgi" accept-charset="x-euc-jp"> - that is all the post" ones
- # [23:54] * Joins: zcorpan (n=zcorpan@c-cb21e353.1451-1-64736c12.cust.bredbandsbolaget.se)
- # [23:56] * Quits: othermaciej (n=mjs@dsl081-048-145.sfo1.dsl.speakeasy.net)
- # [23:57] * Quits: gsnedders (n=gsnedder@host86-135-224-200.range86-135.btcentralplus.com) ("404: Not Found")
- # [23:58] <hdh> 410 fits better, no?
- # [23:59] * Quits: jruderman (n=jruderma@corp-241.mountainview.mozilla.com)
- # [23:59] * zcorpan made a new http://simon.html5.org/tools/js/innerhtml-viewer/
- # Session Close: Wed Dec 19 00:00:01 2007
The end :)