/irc-logs / freenode / #whatwg / 2006-12-02 / end

Options:

  1. # [0:00] <webben> gsnedders, yep
  2. # [0:00] <webben> http://dublincore.org/ ... look at the backing from museums and libraries
  3. # [0:00] <gsnedders> it's used in RSS feeds quite a lot, and most consumers make use of it. As in HTML, I don't know any thing that uses it there.
  4. # [0:01] <webben> http://www.zotero.org/ might
  5. # [0:01] <hsivonen> having worked in the Finnish National Archives thinking about metadata and having been assigned to maintain a metadata spec in the military, I am pretty confident that many people who talk about metadata don't understand processing it in software
  6. # [0:01] * Lachy *shrugs*
  7. # [0:01] <Lachy> DC is flawed because it's hidden metadata
  8. # [0:02] <hsivonen> Lachy: DC is flawed, because it is too fluffy for software processing in ways that solves important use cases
  9. # [0:02] <webben> yeah ... Zotero supports it http://www.zotero.org/documentation/compatible_standards_and_software
  10. # [0:02] <Lachy> which reminds me, why are we keeping the meta element for anything but setting the charset?
  11. # [0:02] <webben> it's not hidden --- just go to page info in FF
  12. # [0:02] <webben> and anyway that's because user agents seem to be designed without thinking about academic/educational uses much at all
  13. # [0:03] <Lachy> webben, that counts as hidden!
  14. # [0:04] * Lachy thinks the meta element should drop the name attribute and only allow a single meta element for setting the charset
  15. # [0:05] * mpt (n=mpt@121-72-128-96.dsl.telstraclear.net) Quit ("This computer has gone to sleep")
  16. # [0:05] <webben> And then people will add such citation information where?
  17. # [0:05] <webben> in the text?
  18. # [0:05] <Lachy> in the body of the page, where it's visible
  19. # [0:06] <webben> i suppose they can hide it with CSS
  20. # [0:06] <Lachy> why do you want it hidden?
  21. # [0:06] <webben> Lachy, because it's not important unless your searching or citing
  22. # [0:06] <webben> and thinking about it hiding with CSS isn't good enough because of text UAs
  23. # [0:07] <webben> i suppose they could push it down into the "footer"
  24. # [0:07] <webben> i think moving it out of meta will break existing DC tools
  25. # [0:07] <Lachy> but users need to be able to find the information easily. How many average users do you know that would actuallly look at the page info dialog to read the meta values?
  26. # [0:07] <Lachy> tough.
  27. # [0:07] <webben> Lachy, that's a UA problem
  28. # [0:07] <webben> it's not a problem with the specs
  29. # [0:07] <webben> (and it's solved with extensions/addons)
  30. # [0:07] <Lachy> no, it's an authoring problem. visible data is always better than invisible data
  31. # [0:08] <raspberry-lemon> O,o
  32. # [0:08] <webben> Title is visible.
  33. # [0:08] <webben> The feed links have been made visible
  34. # [0:08] <Lachy> yes, that's right. what's your point?
  35. # [0:08] <webben> those are UA decisions
  36. # [0:09] <Lachy> meta has been around for many years and UAs still don't do anything useful with it
  37. # [0:09] <Lachy> there's no incentive for authors to use it properly
  38. # [0:09] <webben> I don't see that page info is that bad a solution ... except you should be able to extract a citation.
  39. # [0:09] <webben> But then that's what Zotero is for.
  40. # [0:09] <Lachy> those that do think <meta name="description" ...> is actually useful for search engines!
  41. # [0:10] <Lachy> never heard of Zotero
  42. # [0:10] <Lachy> citation information should go on the page, probably in the footer or something
  43. # [0:10] <webben> I just linked to it above. I've never used. But that's because I tend not to have to write paper citations atm.
  44. # [0:11] <webben> hmm ... that will make for much bigger footers
  45. # [0:11] <webben> (the citation i tend to use is stuff to bring the print world into the digital rather than the other way round... e.g. citeulike
  46. # [0:15] <webben> how about a <link> for a page of citation xml or something?
  47. # [0:15] <Hixie> right
  48. # [0:15] * Hixie moves on
  49. # [0:16] <Hixie> scripting and threading.
  50. # [0:16] <Hixie> my prediction: nobody will have the slightest comment on this section
  51. # [0:16] <Lachy> why not?
  52. # [0:16] <Hixie> (except a few people who will say "scripting should be multithreaded" and have clue what they're asking for)
  53. # [0:16] <Hixie> because this section is topic to understand :-)
  54. # [0:16] <Hixie> er
  55. # [0:17] <Hixie> this topic is hard to understand
  56. # [0:17] <Hixie> even
  57. # [0:17] <Lachy> right.
  58. # [0:17] <hsivonen> (the xml-dev thread turned out not to be an endless rathole)
  59. # [0:17] <Hixie> uri?
  60. # [0:17] <raspberry-lemon> i'm going to say "scripting should *NOT* be multithreaded" because then i won't have to deal with threading -,-;;;
  61. # [0:17] <webben> what section number is this?
  62. # [0:17] <hsivonen> http://lists.xml.org/archives/xml-dev/200611/msg00253.html
  63. # [0:18] <Hixie> webben: 4.2 right now
  64. # [0:18] <Hixie> of course that can change at a moment's notice
  65. # [0:18] <Hixie> hsivonen: o_O
  66. # [0:18] <Hixie> hsivonen: "by 'processing model suitable for the Web' you mean something useful? if so, we can stop now, because i'm not interested in useful things" ???
  67. # [0:18] <Hixie> wtf
  68. # [0:19] <Hixie> what world do these people live in
  69. # [0:19] <webben> Does 4.2 imply that a select changes as you move an arrow-key down in the select box?
  70. # [0:20] <Hixie> ?
  71. # [0:20] <hsivonen> Hixie: in the ISO/IEC 19757 aka. DSDL world
  72. # [0:20] <webben> "for controls implemented with a non-editable stateful UI (e.g. select elements, checkboxes, or radio buttons as deployed in typical desktop Web browsers), the change event shall be fired when the selection is completed"
  73. # [0:20] <webben> even if the control does not lose focus.
  74. # [0:20] <hsivonen> ISO/IEC 19757 is good, btw
  75. # [0:21] <webben> what completes a selection in a select box?
  76. # [0:21] <webben> also, is there an event that /is/ triggered by scripted changes?
  77. # [0:22] <hsivonen> (of course, the original non-ISO specs are more readable...)
  78. # [0:22] <Hixie> hsivonen: ah.
  79. # [0:22] <Hixie> webben: 4.2 in web apps 1.0?
  80. # [0:22] <webben> yep
  81. # [0:22] * Hixie confused
  82. # [0:22] <webben> oh wait
  83. # [0:22] <Hixie> that's just a red box for me at the moment
  84. # [0:22] <webben> no
  85. # [0:22] <webben> sorry
  86. # [0:22] * webben the idiot looks at the wrong doc
  87. # [0:23] <Hixie> heh
  88. # [0:25] <webben> for your list of ways of running scripts, the spec presumably doesn't need to discuss user js/greasemonkey does it?
  89. # [0:25] <Hixie> dunno
  90. # [0:26] <webben> e.g. defining how such scripts might interact with chains of events or something
  91. # [0:26] * webben doesn't do much scripting
  92. # [0:26] <Hixie> probably not, since that's just a UA-specific concern
  93. # [0:26] <Hixie> not an interoperability concern
  94. # [0:26] <webben> are there plans for multithreading in JS 2.0 ?
  95. # [0:27] <Hixie> dunno
  96. # [0:28] <hsivonen> webben: doubt it
  97. # [0:28] <hsivonen> webben: considering the threading story of Gecko (lack thereof)
  98. # [0:31] <webben> what about freaky stuff like jscript and vbscript... do those have threads?
  99. # [0:32] <hsivonen> webben: my wild guess is that Trident is multithreaded but jscript achieves Web compatibility by locking
  100. # [0:33] <hsivonen> I wonder how Opera is threaded considering that it runs on esoteric and limited platforms
  101. # [0:33] <Hixie> scripting in opera is run per-instruction, so threading is irrelevant for opera
  102. # [0:34] <hsivonen> Hixie: wow. and still it performs better than Gecko on some things
  103. # [0:34] <webben> in 4.2.1 can we at least include a "UAs should not silently correct errors without user configuration"
  104. # [0:35] <webben> (indeed, given IE already doesn't silently correct errors, could that be a must)
  105. # [0:35] <hsivonen> I had thought that Gecko's suckiness in this area was to cater for the old Mac OS, but timeless explained that it is to deal with the suckiness of X11
  106. # [0:35] <webben> Doesn't Opera have to deal with the suckiness of X11 too?
  107. # [0:35] <hsivonen> webben: not on HP-UX
  108. # [0:36] <Hixie> webben: correct errors?
  109. # [0:36] <hsivonen> webben: only with XFree86, X.org and the Sun stuff
  110. # [0:36] <webben> Hixie, you know the TAG stuff.
  111. # [0:36] <Hixie> webben: ?
  112. # [0:37] <webben> http://www.w3.org/TR/webarch/#no-silent-recovery
  113. # [0:37] <Hixie> i don't understand what you want the spec to say
  114. # [0:38] <webben> oh i guess it does say that really
  115. # [0:38] <webben> what does "The error should be reported to the user." mean
  116. # [0:38] <webben> e.g. is that something the user can turn off?
  117. # [0:38] <webben> send to the status bar etc..?
  118. # [0:38] <webben> or does that mean give the user a big dialog warning every time?
  119. # [0:39] <Hixie> it means exactly what it says
  120. # [0:39] <Hixie> no more no less
  121. # [0:41] <webben> aren't there two r's in occurred?
  122. # [0:41] <Hixie> probably
  123. # [0:41] <Hixie> the spec is full of typos
  124. # [0:41] <webben> or is occured an Americanism?
  125. # [0:41] <Hixie> it's too early to worry about them
  126. # [0:43] <webben> "the first must give the message that the UA is considering reporting" ... in what form?
  127. # [0:43] <webben> an error number? a string?
  128. # [0:44] <Hixie> whatever it is the UA is considering reporting
  129. # [0:44] <Lachy> http://blog.whatwg.org/faq/#whattf
  130. # [0:45] <webben> why is Applications capitalized?
  131. # [0:46] <webben> "intersted"
  132. # [0:46] * webben is sorry for being pedantic
  133. # [0:46] <Lachy> cause I copied it from the whatwg.org home page and didn't change it
  134. # [0:47] <Lachy> fixed
  135. # [0:49] <webben> It would be good if "Headings and sections" included an example that showed where people should put fragment identifiers.
  136. # [0:49] <Hixie> i encourage you to send feedback to the list
  137. # [0:49] <Hixie> on irc it will get lost
  138. # [0:51] <webben> (I've have a vested interest in frag id's because i've been writing a firefox extension to try and make it easier to link to them.)
  139. # [0:52] <webben> the chaos of section id's in old-style html doesn't help
  140. # [0:53] <Lachy> webben, what chaos of section ids?
  141. # [0:54] <webben> there are at least three ways of doing it
  142. # [0:54] <webben> (and that's with people who do it sanely)
  143. # [0:54] <Hixie> i don't suppose anything defines how javascript: URIs work
  144. # [0:55] <Lachy> I don't know of anywhere that defines them
  145. # [0:55] <webben> e.g. <div id="foo"><h2> ... <div><h2 id="foo">... and <div><h2><a name="foo" />...
  146. # [0:56] <Hixie> all of those are reasonable IMHO
  147. # [0:56] <Hixie> the first two are probably preferred
  148. # [0:56] <Lachy> <a name> is no longer in HTML5
  149. # [0:56] <webben> oh and <div><h2><a name="foo">Foobar</a></h2>
  150. # [0:57] <Hixie> wow, i removed name=""? how radical of me.
  151. # [0:57] <Lachy> and you forgot <h2 xml:id="foo"> ;-)
  152. # [0:57] <Hixie> pff
  153. # [0:58] <webben> so i had some sort of xpath or something trying to pick between those things
  154. # [0:58] <webben> and then when it fails to find those running back up the document looking for them in each preceding "section"
  155. # [0:58] <webben> it makes for a pretty inefficient algorithm
  156. # [0:59] <webben> there's probably a good form for when you want the h2 to be a link
  157. # [0:59] <webben> and a good form for when you just want a "hidden" id
  158. # [1:00] <webben> i can't really see a need for more than two ways of doing it though
  159. # [1:00] <webben> but the fact that <section> can be implied makes things interesting
  160. # [1:02] <webben> hmm it would be good to be clear about whether <h1> should be unique
  161. # [1:02] <Hixie> it does not have to be unique
  162. # [1:02] <Hixie> i thought the spec was clear about that
  163. # [1:03] <jgraham> Do I need an actual gmail account to sign up for Google project hosting? My ordinary Google signin doesn't seem to work :(
  164. # [1:04] <webben> ah it is clear from the examples anyway
  165. # [1:04] <Hixie> webben: examples are not normative, so if the prose doesn't say it, the example could be wrong
  166. # [1:04] <Hixie> jgraham: yes, you need a gmail account (though you don't need to use gmail itself)
  167. # [1:05] * webben doesn't grok this headers thing yet
  168. # [1:06] <jgraham> Ah. OK. It would be really useful if there was some useful error message to tell you that
  169. # [1:06] <Hixie> it's in the faq, but i will convey your message to the team
  170. # [1:07] <webben> "header elements must have at least one h1, h2, h3, h4, h5, or h6 element as a descendant." ... is that supposed to mean you can jump from header to h3
  171. # [1:07] <webben> even though you can't have a header as a descendant of a header
  172. # [1:08] <jgraham> (to be fair it says "sign in with your gmail account" but it doesn't say that you failed to do so)
  173. # [1:08] <Hixie> jgraham: feedback conveyed
  174. # [1:08] <Hixie> and they agreed, so hopefully it'll be fixed :-)
  175. # [1:09] <jgraham> Great :)
  176. # [1:09] <Hixie> webben: <header> elements are ways of wrapping multiple <hx> elements into one header, so you can have tag lines, e.g.
  177. # [1:11] * webben can't understand why a tagline would want to be inside a <hx> element
  178. # [1:12] <webben> but that's not quite what I was asking ... is <header><h3>foo</h3></header> okay?
  179. # [1:12] <jgraham> webben It turns out that lots of people do that in the real world. It breaks any tool that tries to generate a document outline but they think it's "more semantic"
  180. # [1:13] <webben> jgraham, yeah but they're crazy
  181. # [1:13] <Hixie> <header><h3>foo</h3></header> is equivalent to <h3>foo</h3> iirc
  182. # [1:13] <Hixie> or maybe equivalent to <h1>foo</h1>
  183. # [1:13] <Hixie> i forget
  184. # [1:13] <Hixie> see the spec :-)
  185. # [1:13] <jgraham> Well, maybe. For subheadings what they are trying to do makes a lot of sense
  186. # [1:14] <webben> jgraham, ah you're talking about <header><h1>foo</h1><h2>bar</h2></header> not <header><h3>foo</h3></header> when you say "that"?
  187. # [1:14] <webben> why not just have a subhead
  188. # [1:14] <webben> element
  189. # [1:15] <Hixie> because you might have:
  190. # [1:15] <Hixie> <header><p>Welcome to...</p><h1>My home!</h1><h2>or what some people might call "my cube"</h2></header>
  191. # [1:15] <jgraham> But this is why the spec should be clear about what the use cases behind semantic constructs are so there is some hope people won't break well intentioned UAs by over broadening their element use
  192. # [1:15] <Hixie> yeah
  193. # [1:15] <Hixie> i thought the examples for <header> were clear
  194. # [1:16] <webben> Hixie, yeah ... I don't understand the use of <h2> there
  195. # [1:16] <jgraham> Although HTML4 did that with <hx> and it didn't help much
  196. # [1:16] <webben> http://www.w3.org/TR/html4/struct/global.html#h-7.5.5 was lousy
  197. # [1:17] <webben> doesn't even say MUST be used in order
  198. # [1:17] <webben> instead "Some people consider skipping heading levels to be bad practice."
  199. # [1:17] <webben> what a cop out
  200. # [1:17] <jgraham> So, could I trouble someone to invite me to join gmail?
  201. # [1:17] <webben> jgraham, sure
  202. # [1:17] * jgraham is the last person in the universe with no gmail account
  203. # [1:18] <Hixie> heh
  204. # [1:18] <jgraham> My email address is jg307@cam.ac.uk
  205. # [1:19] <webben> jgraham, there you go (hopefully)
  206. # [1:19] <jgraham> webben: thanks :)
  207. # [1:20] <webben> why can't one have a <heading> as a descendant of a <heading> ?
  208. # [1:21] <webben> e.g. if you have a long document with a bit-per-page view and an all-in-one view
  209. # [1:21] <webben> you might have subsections with headings with taglines
  210. # [1:23] <jgraham> Can we go with "because my head would explode trying to work out how to generate an outline for it"? ;)
  211. # [1:24] <webben> jgraham, not nearly as much as the author trying to revise programmatically said document for all-in-one viewing
  212. # [1:25] <webben> Of course if <heading> was simply one <hX> rather than as many as you want
  213. # [1:25] <webben> and if hX are in order
  214. # [1:25] <webben> then outlining would be unproblematic
  215. # [1:26] <Hixie> there's like an exact spec for how to create an outline
  216. # [1:26] <Hixie> just implement that
  217. # [1:26] <Hixie> and your life will be good
  218. # [1:28] <webben> why are headings inside blockquotes part of the TOC?
  219. # [1:28] <jgraham> Hixie: I know. But as I recall, it's pretty complicated
  220. # [1:29] <webben> ah i see
  221. # [1:29] <webben> they aren't
  222. # [1:29] * webben provides a demonstration.
  223. # [1:29] <Hixie> jgraham: yeah, but that shouldn't affect implementing it. he just has to follow the spec. :-)
  224. # [1:30] <hsivonen> FYI: http://www.intertwingly.net/blog/2006/12/01/The-White-Pebble#c1165015647
  225. # [1:30] <webben> Will authors understand it?
  226. # [1:30] * Kanashii (n=Kanashii@ppp108-141.lns2.bne4.internode.on.net) Quit ()
  227. # [1:36] <Hixie> "Breaking XML is too politically incorrect even for the WHATWG."
  228. # [1:36] <Hixie> haha
  229. # [1:36] <Hixie> nice
  230. # [1:36] <Hixie> we could try!
  231. # [1:36] <Hixie> XML5!
  232. # [1:37] <Hixie> maybe sometime after SVG5!
  233. # [1:40] <jgraham> Can we replace all the angle brackets with something more aesthetically pleasing? ;)
  234. # [1:40] * tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) Quit (Read error: 54 (Connection reset by peer))
  235. # [1:40] <Hixie> i'd love to
  236. # [1:40] <Hixie> but backwards compatibility forces us to keep them
  237. # [1:40] <Hixie> :-)
  238. # [1:40] * tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) has joined #whatwg
  239. # [1:42] <jgraham> I've put all the html5 python code I have written up on google code: http://code.google.com/p/html5lib/
  240. # [1:42] <hsivonen> Hixie: only the W3C gets to break XML
  241. # [1:42] <hsivonen> (I mean XML 1.1)
  242. # [1:42] <Hixie> heh
  243. # [1:42] <hsivonen> speaking of which
  244. # [1:43] <jgraham> (Note: nothing there works)
  245. # [1:43] <hsivonen> the spec should require XML 1.0--not "some version"
  246. # [1:43] <webben> why?
  247. # [1:43] <Hixie> i have no idea what thomas broyer is asking for
  248. # [1:43] <Hixie> i hate it when i can't work out what someone wants
  249. # [1:43] <jgraham> (but I don't want to end up with 3 different efforts to do the same thing)
  250. # [1:43] <hsivonen> webben: XML 1.1 is a huge compatibility problem and PITA
  251. # [1:44] <hsivonen> webben: and XHTML5 does not have Cambodian tags
  252. # [1:44] <hsivonen> Khmer tags, I should say
  253. # [1:44] <Hixie> i'm not requiring XML 1.1 for the same reason that I _am_ defining XHTML at all
  254. # [1:44] <Hixie> er 1.0
  255. # [1:44] <Hixie> namely, if i require xml 1.0, someone will have to define their own serialisation using 1.1.
  256. # [1:45] <hsivonen> good point
  257. # [1:45] <Hixie> (but i agree with you in principle)
  258. # [1:45] <hsivonen> I will enforce 1.0
  259. # [1:45] <Hixie> you can do that, just by being an XML 1.0 Conformant Processor :-)
  260. # [1:48] * webben is confused ... how can you both not require 1.0 and enforce 1.0 ?
  261. # [1:51] <hsivonen> webben: Hixie doesn't require but I do
  262. # [1:52] <webben> you mean with your validator?
  263. # [1:52] <hsivonen> webben: yes
  264. # [1:52] <webben> can XHTML 1.1 be in XML 1.1?
  265. # [1:53] <hsivonen> webben: it wouldn't be conforming, AFAIK
  266. # [1:53] <hsivonen> http://hsivonen.iki.fi/validator/html5/?doc=http%3A%2F%2Fhsivonen.iki.fi%2Ftest%2Fxml11.xhtml
  267. # [1:55] <Hixie> "IO Error: HTTP resource not retrievable." should probably be "The file you specified could not be downloaded. Are you sure you specified the right address? (You may also [validate the 404 document].)"
  268. # [1:55] <Hixie> or something
  269. # [1:55] <hsivonen> Hixie: do you see that on the URL I just pasted?
  270. # [1:55] <Hixie> no
  271. # [1:56] <Hixie> i see it on http://hsivonen.iki.fi/validator/html5/?doc=http%3A%2F%2Fhsivonen.iki.fi%2Ftest%2Fxml10.xhtml
  272. # [1:56] <Hixie> which is what i immediately tried :-)
  273. # [1:57] <hsivonen> Hixie: suggestion logged
  274. # [1:57] <hsivonen> the message comes from the bowels of Apache Commons HTTP Client
  275. # [1:57] <Hixie> ah
  276. # [1:59] <hsivonen> I should see if it has an IOException subclass with the http status code
  277. # [2:02] <hsivonen> oops. it comes from my code after all
  278. # [2:02] <hsivonen> if (m.getStatusCode() != 200) {
  279. # [2:02] <Hixie> heh
  280. # [2:02] <hsivonen> looks like I've been lazy
  281. # [2:03] <hsivonen> redirects are transparent to me
  282. # [2:03] <hsivonen> err opaque
  283. # [2:03] <hsivonen> I don't notice
  284. # [2:04] * hsivonen gets confused with transparent and opaque if the library hides it
  285. # [2:11] * Hixie tries to get the hang of the results of http://www.hixie.ch/tests/adhoc/dom/level0/window/open/
  286. # [2:12] <Hixie> (turn off tabs first)
  287. # [2:15] <Hixie> i don't understand what mozilla does
  288. # [2:15] <Hixie> on 002
  289. # [2:16] <Hixie> wow
  290. # [2:17] <Hixie> a window.alert() on safari blocks the entire browser
  291. # [2:19] <Hixie> and on IE it blocks UI interaction and JS for that tab
  292. # [2:19] <Hixie> and all the tabs that are involved in the test
  293. # [2:19] <Hixie> and the chrome for windows involved in the test, even though other tabs on that test are fine!
  294. # [2:19] <Hixie> wow, there's proof that the menu bar is per-tab if nothing else
  295. # [2:26] <Hixie> man, all the browsers act differently
  296. # [2:26] <Hixie> gah
  297. # [2:26] <Hixie> bbl
  298. # [3:32] * webben (n=benjamin@91.84.22.233) Quit ("Leaving")
  299. # [3:38] * whateley (n=whateley@S01060013463ece73.ed.shawcable.net) Quit (Read error: 110 (Connection timed out))
  300. # [3:56] * tantek_ (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) has joined #whatwg
  301. # [3:56] * tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) Quit (Read error: 131 (Connection reset by peer))
  302. # [4:36] * mpt (n=mpt@121-72-128-96.dsl.telstraclear.net) has joined #whatwg
  303. # [4:53] * mpt (n=mpt@121-72-128-96.dsl.telstraclear.net) Quit ("This computer has gone to sleep")
  304. # [6:12] <Lachy> Hixie, typo in 4.2.2: If the value is null - The error should not [be] reported to the user.
  305. # [6:24] * mpt (n=mpt@121-72-128-96.dsl.telstraclear.net) has joined #whatwg
  306. # [7:02] <Lachy> I've added several new questions to the FAQ
  307. # [7:02] <Lachy> http://blog.whatwg.org/faq/#mime-type
  308. # [7:02] <Lachy> http://blog.whatwg.org/faq/#tracking-changes
  309. # [7:02] <Lachy> http://blog.whatwg.org/faq/#namespaces
  310. # [7:09] * mpt (n=mpt@121-72-128-96.dsl.telstraclear.net) Quit ("Leaving")
  311. # [7:50] * csarven (i=nevrasc@modemcable081.169-202-24.mc.videotron.ca) Quit (Read error: 104 (Connection reset by peer))
  312. # [7:55] <Lachy> what the???? "I don't want to use namespaces. I want to use an xmlns attribute. " -- Robert Sayre.
  313. # [7:55] <Lachy> I think that's the quote of the day ;-)
  314. # [8:31] * Kanashii (n=Kanashii@ppp108-141.lns2.bne4.internode.on.net) has joined #whatwg
  315. # [11:06] * jgraham (n=jgraham@81.178.250.219) Quit (sterling.freenode.net irc.freenode.net)
  316. # [11:06] * gavin_s (n=gavin@63.245.208.169) Quit (sterling.freenode.net irc.freenode.net)
  317. # [11:07] * gavin_s (n=gavin@63.245.208.169) has joined #whatwg
  318. # [11:07] * jgraham (n=jgraham@81.178.250.219) has joined #whatwg
  319. # [11:13] * rhymes (n=rhymes@host221-72-dynamic.54-82-r.retail.telecomitalia.it) has joined #whatwg
  320. # [11:17] <hsivonen> Lachy: I think Robert has a good point
  321. # [12:32] <Lachy> hsivonen, I don't think so
  322. # [12:50] * rhymes (n=rhymes@host221-72-dynamic.54-82-r.retail.telecomitalia.it) Quit ()
  323. # [12:50] * Kanashii (n=Kanashii@ppp108-141.lns2.bne4.internode.on.net) Quit ()
  324. # [13:10] * ROBOd (n=robod@86.34.246.154) has joined #whatwg
  325. # [13:12] <Lachy> jgraham's idea of using a different attribute name from xmlns is better. It's similar to what I said here yesterday, but I'd rather avoid requiring authors to remember the full URI
  326. # [13:16] <Lachy> I'd just use <svg ns="svg">, where the attribute takes a set of predefined values, such as "svg", "mathml", "xhtml". But in most cases, it would be unnecessary to use it anyway.
  327. # [13:19] <Lachy> although I still think it's better to such things for use in XHTML. Browsers, especially IE, are much more likely to add support for XHTML, SVG and MathML, before a special html-based math/svg syntax.
  328. # [14:22] * rhymes (n=rhymes@host221-72-dynamic.54-82-r.retail.telecomitalia.it) has joined #whatwg
  329. # [14:26] <ROBOd> good eday to all
  330. # [14:27] <Lachy> hey ROBOd
  331. # [14:27] <ROBOd> Lachy: it seems attractive to use ns instead of xmlns, because it would cause less confusion, because people wouldn't mistake it with XHTML, etc. however... i am suspicious if in the grand scheme of things creating a "fork" of xmlns is that good
  332. # [14:27] <ROBOd> it would only give web developers more work in the future
  333. # [14:28] <ROBOd> my suggestion would be that no new ns attribute is added
  334. # [14:28] <Lachy> I agree and I don't think it is needed
  335. # [14:28] <ROBOd> if, and only if, something is to be done in regards to this, add xmlns.
  336. # [14:29] <ROBOd> personally i am not yet decided if xmlns should not be in HTML5
  337. # [14:29] <Lachy> but, if a namespace syntax is ever added to HTML, I think it should be at least that simple and must definately not use xmlns
  338. # [14:29] <ROBOd> at the moment, i don't see the big gripe, the big need for xmlns in HTML(4|5)
  339. # [14:30] <ROBOd> Lachy: yes, it should be *that* simple, but not another attribute
  340. # [14:30] <Lachy> are you saying you would rather reuse xmlns for that purpose?
  341. # [14:30] <ROBOd> yes
  342. # [14:30] <Lachy> which would also mean using the full URIs as well
  343. # [14:31] <ROBOd> yep
  344. # [14:31] <ROBOd> there's no need to reinvent the wheel, IMHO
  345. # [14:31] <Lachy> no, that would only serve to further encouage those with teh misconception that HTML can be treated as XML
  346. # [14:31] <ROBOd> as i said above, it's true, that happens
  347. # [14:31] <Lachy> and it would give the impression that any arbitrary namespace can be used in HTML
  348. # [14:32] <ROBOd> but another attribute would just add other troubles
  349. # [14:32] <Lachy> but, as Hixie's study showed, many people get the namespace wrong anyway
  350. # [14:32] <ROBOd> exactly
  351. # [14:32] <Lachy> which is why I don't think any namespaces should be added to HTML either.
  352. # [14:32] <ROBOd> and there's no UA with complete xmlns implementation
  353. # [14:33] <ROBOd> e.g. Opera had serious problems with xmlns last time i checked
  354. # [14:33] <Lachy> but my point is that xmlns is too difficult for the average HTML coder plus the other problems just mentioned
  355. # [14:33] <Lachy> doesn't Mozilla fully support xmlns in XML?
  356. # [14:34] <Lachy> what's Opera's bug with it?
  357. # [14:34] <ROBOd> iirc they have some problems as well
  358. # [14:34] <ROBOd> don't know the Mozilla bugs precisely, since I mostly work with Opera
  359. # [14:34] <ROBOd> well... for example, Opera with VoiceXML doesn't really care much about the XML namespace
  360. # [14:35] <ROBOd> it just detects the tag name, and that's pretty much all
  361. # [14:36] <ROBOd> e.g. if one wants to use something else than the default xmlns prefix (vxml)
  362. # [14:38] <ROBOd> at the end of that day ... i was pretty much sure XML namespace support was glued (read: not good) :)
  363. # [14:39] <Lachy> but those are bugs in the XML implementation, specifically relating to prefixes. There would be no prefixes in HTML, so any use of xmlns couldn't use prefixes and that difference would only cause problems
  364. # [14:40] <Lachy> besides, as Hixie has mentioned, Opera has tried to implement namespaces in HTML, but apparently had to back out of it because so many pages relied on MS Office namespaces being completely ignored by non-IE browsers.
  365. # [14:40] <ROBOd> the more i think of it, the more i'd recommend Hixie *not* to accept xmlns (or any derivate, for that matter) in html5
  366. # [14:41] <Lachy> that's another reason we couldn't reuse xmlns in HTML because MS office has broken it
  367. # [14:41] <Lachy> I fully agree!
  368. # [14:41] <ROBOd> thing is: use xhtml for svg and for other "advanced" stuff
  369. # [14:41] <Lachy> yep
  370. # [14:41] <raspberry-lemon> the newbie agrees too, just for the record
  371. # [14:42] <Lachy> raspberry-lemon, what's your real name? Have I seen on on the mailing list before?
  372. # [14:42] * rhymes (n=rhymes@host221-72-dynamic.54-82-r.retail.telecomitalia.it) Quit ()
  373. # [14:43] <raspberry-lemon> real name is chris svindseth, but if you've seen me on the mailing list it would be quite the miracle as i only read it sporadically :)
  374. # [14:43] <ROBOd> Lachy: i've read Sam's blog post (link posted yesterday here). i now believe he exaggerates with his wish to merge XHTML with HTML.
  375. # [14:44] <Lachy> ah, so you've never posted to the list.
  376. # [14:44] <raspberry-lemon> no
  377. # [14:45] <Lachy> yep, I agree. I think Sam's just taking it too far
  378. # [14:47] <ROBOd> gotta go now, bbl
  379. # [14:47] <Lachy> ok, cya
  380. # [15:21] * rhymes (n=rhymes@host221-72-dynamic.54-82-r.retail.telecomitalia.it) has joined #whatwg
  381. # [16:35] <annevk> hah
  382. # [16:36] <citoyen> oh look, it's awake
  383. # [16:36] <annevk> next time I go away for more than 24 hours I'll turn IRC off
  384. # [16:36] <Lachy> hi annevk
  385. # [16:36] <annevk> hi there
  386. # [16:36] * annevk just read through the entire backlog...
  387. # [16:36] * annevk hasn't yet read Sam Ruby's post
  388. # [16:36] <annevk> morning citoyen :)
  389. # [16:36] <Lachy> annevk, was it worth reading it all?
  390. # [16:36] <citoyen> mornin' :) how's the head? :)
  391. # [16:38] <annevk> better
  392. # [16:38] <annevk> Lachy, no, I skipped major parts
  393. # [16:39] <annevk> "HTML is tantalizingly close to well-formed XML." ...
  394. # [16:40] <Lachy> hah! :-D
  395. # [16:40] <citoyen> *blink*
  396. # [16:40] <Lachy> there's been several funny quotes on the list today
  397. # [16:46] <annevk> class AtheistParseError(ParseError): ...
  398. # [17:00] <annevk> "Breaking XML is too politically incorrect even for the WHATWG." We could try...
  399. # [17:00] <annevk> Introduce graceful error handling for XML
  400. # [17:01] * rhymes (n=rhymes@host221-72-dynamic.54-82-r.retail.telecomitalia.it) Quit ()
  401. # [17:08] <Lachy> it's too late for that
  402. # [17:10] <annevk> it's already happening
  403. # [17:11] <annevk> see feed parsers for instance
  404. # [17:11] <Lachy> ?
  405. # [17:11] <annevk> we better define how it should work...
  406. # [17:11] <Lachy> Oh, that's just crap. They should use draconian error handling
  407. # [17:11] <annevk> that doesn't make much sense to me
  408. # [17:12] <Lachy> and CMSs should use proper XML tools and ensure they output well-formed feeds
  409. # [17:12] <annevk> it seems better for their users to do the non draconian thing
  410. # [17:12] <annevk> right...
  411. # [17:12] <annevk> those CMSs have been promised for over the past ten years or so
  412. # [17:12] <Lachy> IE7 does draconian error handling for feeds, doesn't it?
  413. # [17:12] <hsivonen> Lachy: have fun trying to convince Mark P. not to do what he does. :-)
  414. # [17:12] <annevk> there's not really such a thing as bugfree software, I think we should try to learn from that
  415. # [17:13] <annevk> Lachy, only partially
  416. # [17:13] <hsivonen> annevk: TeX. The conclusion is that we should use .dvi for interchange. :-)
  417. # [17:14] <citoyen> Let's face it, people fail and tools fail, no matter how much we try. Given that, and that tools are meant to make our lives easier, not more annoying, I think error handling is the way to go.
  418. # [17:14] <annevk> hsivonen, I don't get that
  419. # [17:14] <annevk> as in, I'm not sure what you're saying :)
  420. # [17:15] <hsivonen> annevk: TeX is famous for being the non-trivial piece of software that is free of bugs
  421. # [17:15] <hsivonen> TeX outputs .dvi
  422. # [17:15] <annevk> oh
  423. # [17:17] <hsivonen> grr. I have to update my <t> test cases
  424. # [17:20] <annevk> s/t/time/
  425. # [17:21] <hsivonen> annevk: won't work
  426. # [17:21] <hsivonen> consider <title>
  427. # [17:22] <annevk> ok, do it a bit smarter :)
  428. # [17:22] <annevk> s/<t /
  429. # [17:22] <annevk> s/<t>/
  430. # [17:22] <annevk> etc.
  431. # [17:22] <hsivonen> yeah
  432. # [17:26] <annevk> Hixie, if you have nothing else to work, consider updating the parsing section a bit more to remove the last couple of red blocks and do the rewrite of the tree construction section...
  433. # [17:38] <annevk> http://therealcrisp.xs4all.nl/blog/ "Hell is where browsers come from"
  434. # [17:42] <hsivonen> Lachy: wp-comments-post.php is broken
  435. # [17:42] <hsivonen> "Error: This file cannot be used on its own."
  436. # [17:43] <Lachy> ok, let me see...
  437. # [17:44] <Lachy> Does that happen when you try to post a comment?
  438. # [17:44] <hsivonen> yos
  439. # [17:44] <hsivonen> yes
  440. # [17:44] <Lachy> when you're logged in or not?
  441. # [17:44] <hsivonen> logged in
  442. # [17:44] <Lachy> ok, it worked for me when not logged in
  443. # [17:45] * ROBOd (n=robod@86.34.246.154) Quit (Read error: 104 (Connection reset by peer))
  444. # [17:45] * ROBOd2 (n=robod@86.34.246.154) has joined #whatwg
  445. # [17:45] <Lachy> worked for me when logged in too
  446. # [17:45] <hsivonen> hmm. interesting
  447. # [17:45] <hsivonen> gotta run for dinner
  448. # [17:46] * csarven (i=nevrasc@modemcable081.169-202-24.mc.videotron.ca) has joined #whatwg
  449. # [17:46] <ROBOd2> bon appétit hsivonen
  450. # [17:46] <hsivonen> thanks
  451. # [17:46] <Lachy> I get that error when I visit http://blog.whatwg.org/wp-comments-post.php directly, rather than posting to it
  452. # [17:46] <annevk> isn't it a little early...
  453. # [17:47] <annevk> oh, wait, Finland
  454. # [17:47] <hsivonen> annevk: board game scheduled after dinner
  455. # [17:47] <hsivonen> hence, early dinner
  456. # [17:47] <hsivonen> really going now
  457. # [17:47] <annevk> bye
  458. # [17:48] <annevk> Lachy, you want http://c2.com/cgi/wiki?GeneratorsInPython
  459. # [17:51] <Lachy> I see. so we would implement a getChar() function that uses yield and returns the next character in the stream
  460. # [17:51] <annevk> I think that's the idea
  461. # [17:51] <Lachy> what about when we have to back up a few chars for error handling?
  462. # [17:52] <annevk> you store the characters somewhere I suppose
  463. # [17:52] <annevk> hmm
  464. # [17:52] <Lachy> ok, need to think about it.
  465. # [17:59] * gsnedders (n=gsnedder@host86-139-123-225.range86-139.btcentralplus.com) Quit ("Don't touch /dev/null…")
  466. # [18:00] <annevk> hmm yeah
  467. # [18:00] <annevk> for states like the entity state
  468. # [18:01] <Lachy> it might be easier to implement in it a stream object that handles walking forward and backward through the stream, even if it uses yield internally for some stuff
  469. # [18:01] <Lachy> and even supports inserting markup into the stream, which would be needed for document.write() support
  470. # [18:02] <annevk> yeah, didn't jgraham have something like that?
  471. # [18:02] * Lachy will check
  472. # [18:06] <Lachy> I think that's what his Tokeniser object does, but not sure. It seems to be structured in a very strange way.
  473. # [18:13] <annevk> when I source on google for "live dom viewer" i get your site Lachy ... some copy
  474. # [18:13] <jgraham> Lachy: what is strange
  475. # [18:13] <jgraham> ?
  476. # [18:14] <jgraham> Did you see that I started a google project for a python based html5 parser: http://code.google.com/p/html5lib/
  477. # [18:15] <annevk> cool
  478. # [18:15] <annevk> I'm willing to help out
  479. # [18:15] <jgraham> I'm really up for working with other people on this, soI'm quite happy to change the design if it's no good. And I seem to have a bit more python experience, which might help
  480. # [18:17] <Lachy> jgraham, write an article about it on the blog
  481. # [18:17] <Lachy> let a few more people know about it and ask for more contributors
  482. # [18:19] <jgraham> Yeah, that's a good idea. I might set up a wiki page for discussing the design as well
  483. # [18:19] <Lachy> Cool, I'm happy with the BSD licence for it
  484. # [18:19] <annevk> what does BSD imply?
  485. # [18:20] <annevk> what are the restrictions, basically
  486. # [18:20] <Lachy> it means that you retain copyright, but anyone is free to do whatever they like with it
  487. # [18:20] <jgraham> http://www.opensource.org/licenses/bsd-license.php
  488. # [18:21] <jgraham> I think it's about the most liberal license available
  489. # [18:21] <Lachy> http://en.wikipedia.org/wiki/BSD
  490. # [18:21] <jgraham> But if anyone has any good reasons to change it, I'm listening
  491. # [18:22] <annevk> I'd be happy with a license that doesn't require attribution
  492. # [18:23] <Lachy> http://en.wikipedia.org/wiki/Public_domain_equivalent_license
  493. # [18:23] <Lachy> BSD is near enough to public domain
  494. # [18:25] <jgraham> The options in google hosting are BSD, Apache 2.0, Artistic/GPLv2.0, GPL2.0, LGPL, MIT, MPL1.1
  495. # [18:25] <Lachy> This is what I usually do for copyright http://lachy.id.au/about/copyright
  496. # [18:26] <Lachy> of those, either MIT or BSD are the most permissive
  497. # [18:27] <jgraham> Do you think MIT would work better?
  498. # [18:29] <annevk> yes
  499. # [18:29] <jgraham> OK
  500. # [18:29] <annevk> per http://en.wikipedia.org/wiki/MIT_License that doesn't require attribution which may be a problem for some commercial entities
  501. # [18:29] <jgraham> OK, it's changed
  502. # [18:30] <annevk> if you want you can add annevankesteren@gmail.com though I wonder how to deal with such a project
  503. # [18:32] <jgraham> I added you as a project owner
  504. # [18:32] <annevk> hah
  505. # [18:32] * Lachy will register a new gmail account and join
  506. # [18:32] <jgraham> What do you mean "deal with such a project"? You mean how to actually design the code collaboratively?
  507. # [18:33] <Lachy> if only someone hadn't stolen my name! lachlan.hunt at gmail.com is taken :-(
  508. # [18:33] <jgraham> Heh. I ended up with jgraham.cantab since almost everything I could think of was gone...
  509. # [18:34] <annevk> jgraham, yes
  510. # [18:35] <annevk> I took the liberty to add more text to the frontpage
  511. # [18:35] <jgraham> Well I think a design document on a wiki would help. I don't know if the whatwg wiki is the right place though
  512. # [18:35] <Lachy> oh, no I forgot, I already have lachyhunt at gmail.com :-)
  513. # [18:36] <jgraham> Lachy: OK, I added you
  514. # [18:36] <Lachy> thanks
  515. # [18:39] <annevk> checkout is still going on...
  516. # [18:39] <annevk> hmm
  517. # [18:39] * Lachy is finishing off the blog entry for feed autodiscovery...
  518. # [18:40] <Lachy> are there any other issues with "alternate", besides a feed not necessarily being an alternate represntaion and the MIME type not always being a good indicator of a feed?
  519. # [18:40] <annevk> you should prolly post on monday
  520. # [18:40] <Lachy> why wait? It'll still be there on Monday
  521. # [18:41] <annevk> posts tend to get more attention throughout the week
  522. # [18:41] <annevk> at least, in my experience
  523. # [18:42] <Lachy> yeah, but what difference does it make if it's posted today or tomorrow? It'll still show up in peoples feed readers on monday morning
  524. # [18:42] <annevk> i've wondered about that myself
  525. # [18:43] <Lachy> but I can hold it off for a day if you like, it doesn't matter that much
  526. # [18:44] * whateley (n=whateley@S01060013463ece73.ed.shawcable.net) has joined #whatwg
  527. # [18:53] <Lachy> hehe... :-) The latest from elliot...
  528. # [18:53] <Lachy> "Secondly, anyone who actually tried to use an SGML parser to handle HTML rapidly hit a wall since most HTML documents were not even close to actually conformant to the SGML spec or the HTML DTD. "
  529. # [18:54] <Lachy> now if only he could figure the concept when s/SGML/XML
  530. # [18:55] <annevk> hmm, I can't seem to commit
  531. # [18:56] <annevk> jgraham, should we use a googlegroups for discussion?
  532. # [18:58] <jgraham> annevk: I guess googlegroups might be good. I'd still like a wiki page somewhere to hack out a design. Any ideas where? I could set something up on my desktop but it's unlikely to be very reliable...
  533. # [18:59] <annevk> lets use wiki.html5.org
  534. # [18:59] <Lachy> jgraham, wiki.whatwg.org
  535. # [18:59] <annevk> what Lachy said
  536. # [18:59] * gsnedders (n=gsnedder@host86-139-123-225.range86-139.btcentralplus.com) has joined #whatwg
  537. # [18:59] <annevk> PythonHTML5Lib ?
  538. # [19:00] <jgraham> OK, I just didn't want it to seem like an "official" implementation
  539. # [19:00] <annevk> lets make that clear in the first paragraph :)
  540. # [19:00] <jgraham> OK
  541. # [19:30] <jgraham> I've created http://wiki.whatwg.org/wiki/HTML5Lib I'll fill in some more of the details shortly
  542. # [19:38] <Lachy> You should use [Category:Implementations] instead so that the list is automatic
  543. # [19:42] <Lachy> done http://lachy.id.au/log/2005/12/xhtml-beginners
  544. # [19:42] <Lachy> oops, wrong like
  545. # [19:42] <Lachy> *link
  546. # [19:42] <Lachy> http://wiki.whatwg.org/wiki/Category:Implementations
  547. # [19:49] * Lachy has had enough of Elliot, the arguments are just going round and round in circles.
  548. # [19:52] <Lachy> I'm going to try to not respond to him again, no matter how tempting it gets.
  549. # [20:30] <jgraham> http://wiki.whatwg.org/wiki/HTML5Lib now has some description of the tokeniser Please go ahead and rip it to shreds :)
  550. # [20:47] * whateley (n=whateley@S01060013463ece73.ed.shawcable.net) has left #whatwg
  551. # [20:47] * whateley (n=whateley@S01060013463ece73.ed.shawcable.net) has joined #whatwg
  552. # [20:49] <annevk> hmm, seems to come down to yet aonther mime type debate
  553. # [20:49] <annevk> I love those! [pause] Not.
  554. # [20:49] * annevk reads the wiki
  555. # [20:49] * annevk just had some food
  556. # [20:52] * jgraham notices a mistake in the wiki page
  557. # [20:54] <annevk> We should use the word Tokenizer
  558. # [20:54] <annevk> or HTMLTokenizer
  559. # [20:54] <annevk> note the z
  560. # [20:56] <annevk> see Google if you don't believe me :)
  561. # [20:56] <annevk> jgraham, so how does the tokenizer integrate with the parser?
  562. # [20:56] <annevk> parser -> tree construction phase
  563. # [20:56] <annevk> the three construction phase directly affects the tokenizer
  564. # [20:56] <annevk> s/three/tree ...
  565. # [20:57] <jgraham> Tokeniser == english spelling, tokenizer == American spelling, no?
  566. # [20:57] <annevk> yes
  567. # [20:57] <jgraham> But we can go with "z", I'll just make more typos that way ;)
  568. # [20:57] <annevk> "Results 1 - 10 of about 40,100 for tokeniser."
  569. # [20:57] <annevk> "Results 1 - 10 of about 1,240,000 for tokenizer. "
  570. # [20:58] <annevk> Google also suggested that I search for tokenizer when I tried tokeniser :)
  571. # [20:58] <jgraham> annevk: The parser calls getToken every time it wants a token. But it also holds a reference to the tokeniser so it can change the tokeniser state when it needs to. Does it ever do more than change the content model flag?
  572. # [21:00] <annevk> I don't think so
  573. # [21:00] <annevk> but can't we work with functions then in the tokenizer that the parser implements?
  574. # [21:02] <jgraham> Could do, I guess. I'm not sure what the benefit is though?
  575. # [21:02] <annevk> I think it's cleaner than having temporary token objects...
  576. # [21:04] * annevk reads through the spec once again
  577. # [21:06] <jgraham> Well this way the seperation between tokeniser and parser is pretty clean. It also has the nice property of being a very literal implementation of the spec - when it says "create a token" you really do. But I see your point; maybe it adds lots of overhead
  578. # [21:09] <annevk> I might have mentioned this already, but it would be nice if the parser was fairly low-level so it can be ported to other languages as well.
  579. # [21:09] <annevk> In an easy way
  580. # [21:11] <annevk> I think having functions might also make it easier to add markup injection, if ever...
  581. # [21:12] <jgraham> document.write in python?!
  582. # [21:14] <annevk> well, the architecture should sort of take it into account
  583. # [21:17] <annevk> jgraham, why do the base classes inherit from object?
  584. # [21:18] <jgraham> Because that makes them "new style" python classes
  585. # [21:18] <jgraham> Which have several generally desirable properties compared to old style classes
  586. # [21:19] <jgraham> see e.g. http://www.geocities.com/foetsch/python/new_style_classes.htm
  587. # [21:20] <jgraham> It's a backwards compat. issue
  588. # [21:23] <jgraham> annevk: So in your proposal, what would the interface between the parser and the tokeniser look like? Would you start with the tokeniser and have it call parser.startTagToken(name, attrs) when it made a start tag token? Or something else?
  589. # [21:23] <annevk> And what does frozenset gives us? What it seems to imply?
  590. # [21:24] <annevk> jgraham, I suppose self.startTagToken() if the parser inherits from it...
  591. # [21:24] <annevk> but yeah
  592. # [21:24] <annevk> I'm updating the wiki as we chat
  593. # [21:25] <jgraham> Also I think document.write would work in my model, you'd have to append the extra markup to the characterQueue (mistakenly called characterStack in the svn code). The treebuilder side of that would be the hard part
  594. # [21:27] <annevk> perhaps we should call it "characters"
  595. # [21:27] <annevk> hmm
  596. # [21:27] <annevk> jgraham, yeah, I guess it would
  597. # [21:28] <jgraham> frozenset is just an immutable set. Sets are nice because it's easy to compute unions, etc - useful since there are definitions like "All other elements found while parsing an HTML document" which we need to test against. Also membership tests should be fast (I think).
  598. # [21:29] <annevk> is it ok that they are global variables though?
  599. # [21:31] <annevk> hmm, I suppose you don't want to pass them around all the time
  600. # [21:31] <jgraham> They're only global in the current file
  601. # [21:31] <annevk> okay
  602. # [21:32] <annevk> that's what I expected
  603. # [21:33] <annevk> hmm, I've got referrers from example.com ...
  604. # [21:33] <jgraham> I don't understand why the parser would inherit from the tokeniser? I can see that the parser and tokeniser would call each other somehow but I don't see why they'd inherit?
  605. # [21:33] <jgraham> heh
  606. # [21:33] <jgraham> spammers?
  607. # [21:34] <annevk> think so
  608. # [21:35] <annevk> hmm, you're right
  609. # [21:36] <annevk> so you'd have x = HTMLParser("docRef"); HTMLParser invokes HTMLTokenizer(self, "docRef") and there you go
  610. # [21:36] <annevk> would that work?
  611. # [21:37] * gsnedders (n=gsnedder@host86-139-123-225.range86-139.btcentralplus.com) Quit ("Don't touch /dev/null…")
  612. # [21:38] <jgraham> Yeah. That's basically what I have at the moment. Only I have a "parse" function in the parser which creates the tokeniser.
  613. # [21:38] <jgraham> As well as starting parsing obviously
  614. # [21:43] <annevk> this is what I just added to the wiki: "There's an HTMLParser class you can invoke with an object. What this object is can be decided later. File object, string, URI, etc. The newly created HTMLParser object then instantiates an HTMLTokenizer with itself as argument and the object. The HTMLTokenizer then invokes does things like parser.emitStartTagToken(name, ...) etc."
  615. # [21:51] * gsnedders (n=gsnedder@host86-139-123-225.range86-139.btcentralplus.com) has joined #whatwg
  616. # [22:03] <gsnedders> what HTML5 parsers are there in existence already?
  617. # [22:04] <gsnedders> (and are bug-free enough to use as a reference implementation)
  618. # [22:04] <annevk> there are none
  619. # [22:04] <annevk> there's a project
  620. # [22:06] <jgraham> annevk: I've created a "callback" branch in svn to try your approach.
  621. # [22:06] <gsnedders> annevk: right. I knew there were several, but I didn't know how far they were in terms of development
  622. # [22:07] * jgraham wishes he knew enough computer science to make an informed argument one way or the other
  623. # [22:07] <annevk> several, even?
  624. # [22:09] * ROBOd2 (n=robod@86.34.246.154) Quit (Read error: 104 (Connection reset by peer))
  625. # [22:10] * ROBOd2 (n=robod@86.34.246.154) has joined #whatwg
  626. # [22:53] * annevk (n=annevk@pat-tdc.opera.com) Quit (Read error: 110 (Connection timed out))
  627. # [23:07] * Kanashii (n=Kanashii@ppp108-141.lns2.bne4.internode.on.net) has joined #whatwg
  628. # [23:09] * annevk (n=annevk@89.10.19.124) has joined #whatwg
  629. # [23:11] * ROBOd2 (n=robod@86.34.246.154) Quit ("http://www.robodesign.ro")
  630. # [23:22] * annevk (n=annevk@89.10.19.124) Quit (Read error: 148 (No route to host))

The end :)