- # Session Start: Tue Jan 13 00:00:00 2009
- # Session Ident: #html-wg
- # [00:18] * Joins: animungo (Karl@93.128.175.164)
- # [00:19] * Parts: animungo (Karl@93.128.175.164) (Leaving.)
- # [00:19] * Joins: matt (matt@128.30.52.30)
- # [00:36] * Quits: aroben (aroben@71.58.73.153) (Quit: aroben)
- # [00:36] * Quits: MichaelC (Michael@128.30.52.30) (Quit: ChatZilla 0.9.84 [Firefox 3.0.5/2008120122])
- # [00:54] * Quits: aaronlev (chatzilla@85.179.60.186) (Ping timeout)
- # [01:01] * Joins: jwatt_ (roslea@83.87.4.17)
- # [01:03] * Quits: jwatt (roslea@83.87.4.17) (Ping timeout)
- # [01:03] * jwatt_ is now known as jwatt
- # [01:04] * Quits: maddiin (mc@87.185.238.37) (Quit: maddiin)
- # [01:23] * Joins: tlr (tlr@128.30.52.30)
- # [01:42] * Quits: smedero (smedero@192.223.6.251) (Quit: smedero)
- # [02:00] * Joins: MikeSmith (MikeSmith@mcclure.w3.org)
- # [02:03] * Quits: gavin_ (gavin@99.226.207.11) (Ping timeout)
- # [02:03] * Quits: Dashiva (noone@84.48.51.1) (Ping timeout)
- # [02:06] * Quits: tlr (tlr@128.30.52.30) (Quit: tlr)
- # [02:08] * Joins: gavin_ (gavin@99.226.207.11)
- # [02:08] * Joins: Dashiva (noone@84.48.51.1)
- # [02:09] * Quits: gavin_ (gavin@99.226.207.11) (Quit: gavin_)
- # [02:11] * Joins: deane (opera@121.98.190.61)
- # [02:16] * Quits: Dashiva (noone@84.48.51.1) (Ping timeout)
- # [02:17] * Quits: tH (Rob@129.11.83.58) (Quit: ChatZilla 0.9.84-rdmsoft [XULRunner 1.9.0.1/2008072406])
- # [02:22] * Joins: Dashiva (noone@84.48.51.1)
- # [02:31] * Quits: adele (adele@17.203.14.201) (Quit: adele)
- # [02:37] * Quits: hober (ted@206.212.254.2) (Ping timeout)
- # [02:37] * Joins: hober (ted@206.212.254.2)
- # [02:44] * Quits: deane (opera@121.98.190.61) (Ping timeout)
- # [02:59] * Joins: deane (opera@121.98.190.61)
- # [03:00] * Quits: Sander (svl@86.87.68.167) (Quit: And back he spurred like a madman, shrieking a curse to the sky.)
- # [03:32] * Quits: ChrisWilson (cwilso@131.107.0.70) (Ping timeout)
- # [03:48] * Joins: tH (Rob@129.11.83.58)
- # [04:41] * Quits: rking3 (rking3@24.5.77.167) (Quit: rking3)
- # [04:42] * Joins: rking3 (rking3@67.164.15.57)
- # [04:49] * Quits: rking3 (rking3@67.164.15.57) (Ping timeout)
- # [04:51] * Joins: rking3 (rking3@24.5.77.167)
- # [05:10] * Quits: dbaron (dbaron@63.245.220.241) (Quit: 8403864 bytes have been tenured, next gc will be global.)
- # [05:25] * Joins: Zeros (Zeros-Elip@67.154.87.254)
- # [06:52] * Quits: heycam (cam@130.194.72.84) (Quit: bye)
- # [07:21] * Joins: heycam (cam@124.168.97.132)
- # [07:24] * Quits: MikeSmith (MikeSmith@mcclure.w3.org) (Quit: sex break)
- # [07:25] * Joins: MikeSmith (MikeSmith@mcclure.w3.org)
- # [07:33] * Quits: Zeros (Zeros-Elip@67.154.87.254) (Quit: Leaving)
- # [08:45] * Quits: laplink (link@193.157.66.69) (Quit: This computer has gone to sleep)
- # [08:55] * Quits: rking3 (rking3@24.5.77.167) (Quit: rking3)
- # [08:56] * Joins: rking3 (rking3@24.5.77.167)
- # [08:58] * Quits: rking3 (rking3@24.5.77.167) (Quit: rking3)
- # [09:00] * Joins: aaronlev (chatzilla@85.179.60.186)
- # [09:11] * Joins: laplink (link@193.157.66.69)
- # [10:02] * Joins: aaronlev_ (chatzilla@85.179.60.186)
- # [10:04] * Quits: aaronlev (chatzilla@85.179.60.186) (Ping timeout)
- # [10:04] * aaronlev_ is now known as aaronlev
- # [10:49] * Joins: ROBOd (robod@89.122.216.38)
- # [10:56] * Quits: Lachy (Lachlan@85.196.122.246) (Quit: This computer has gone to sleep)
- # [11:08] * Joins: Lachy (Lachlan@213.236.208.22)
- # [11:17] * Joins: Sander (svl@86.87.68.167)
- # [11:40] * Quits: MikeSmith (MikeSmith@mcclure.w3.org) (Quit: sex break)
- # [12:17] * Joins: myakura (myakura@122.16.160.96)
- # [12:30] * Quits: laplink (link@193.157.66.69) (Ping timeout)
- # [12:49] * Parts: deane (opera@121.98.190.61)
- # [13:10] * Quits: rubys (rubys@75.182.92.38) (Client exited)
- # [13:33] * Joins: darobin (robinb@82.233.247.234)
- # [13:34] * Joins: jwatt_ (roslea@83.87.4.17)
- # [13:36] * Quits: jwatt (roslea@83.87.4.17) (Ping timeout)
- # [13:36] * jwatt_ is now known as jwatt
- # [14:00] * Joins: maddiin (mc@87.185.204.56)
- # [14:11] * Quits: Shunsuke (Shunsuke@116.0.163.146) (Ping timeout)
- # [14:23] * Joins: MikeSmith (MikeSmith@mcclure.w3.org)
- # [14:34] * Quits: Sander (svl@86.87.68.167) (Quit: And back he spurred like a madman, shrieking a curse to the sky.)
- # [14:36] * Joins: MichaelC (Michael@128.30.52.30)
- # [14:42] * Quits: aaronlev (chatzilla@85.179.60.186) (Quit: ChatZilla 0.9.84 [Firefox 3.1b3pre/20090109073009])
- # [14:43] * Joins: aaronlev (chatzilla@85.179.60.186)
- # [14:46] * Quits: maddiin (mc@87.185.204.56) (Quit: maddiin)
- # [14:59] * Joins: Shunsuke (Shunsuke@116.0.163.146)
- # [15:45] * Quits: Shunsuke (Shunsuke@116.0.163.146) (Client exited)
- # [15:48] * Joins: Shunsuke (Shunsuke@116.0.163.146)
- # [16:07] * Quits: darobin (robinb@82.233.247.234) (Ping timeout)
- # [16:08] * Joins: darobin (robinb@82.233.247.234)
- # [16:12] * Quits: MichaelC (Michael@128.30.52.30) (Client exited)
- # [16:14] * Joins: MichaelC (Michael@128.30.52.30)
- # [16:17] * Joins: billyjackass (MikeSmith@mcclure.w3.org)
- # [16:19] * Quits: MikeSmith (MikeSmith@mcclure.w3.org) (Ping timeout)
- # [16:19] * billyjackass is now known as MikeSmith
- # [16:39] * Joins: laplink (link@193.157.66.207)
- # [16:40] * Joins: aroben (aroben@71.58.73.153)
- # [16:47] * Joins: primal1 (ccadwall@72.87.132.196)
- # [16:51] * Quits: myakura (myakura@122.16.160.96) (Quit: Leaving...)
- # [17:04] * Joins: Sander (svl@86.87.68.167)
- # [17:06] * Quits: Julian (chatzilla@217.91.35.233) (Connection reset by peer)
- # [17:07] * Joins: Julian (chatzilla@217.91.35.233)
- # [17:08] * Joins: rubys (rubys@75.182.92.38)
- # [17:10] * Quits: Lachy (Lachlan@213.236.208.22) (Quit: This computer has gone to sleep)
- # [17:12] * Joins: billyjackass (MikeSmith@mcclure.w3.org)
- # [17:13] * Quits: primal1 (ccadwall@72.87.132.196) (Quit: primal1)
- # [17:13] * Joins: primal1 (ccadwall@72.87.132.196)
- # [17:16] * Quits: MikeSmith (MikeSmith@mcclure.w3.org) (Ping timeout)
- # [17:33] * Joins: Julian_ (chatzilla@217.91.35.233)
- # [17:33] * Quits: Julian (chatzilla@217.91.35.233) (Ping timeout)
- # [17:33] * Julian_ is now known as Julian
- # [17:40] <pimpbot> planet: IE8 Beta 7000 Bug <http://intertwingly.net/blog/2009/01/12/IE8-Beta-7000-Bug>
- # [17:56] * billyjackass is now known as MikeSmith
- # [17:56] <DanC> hm... html5lib/python/parse.py falls over at 'http://www.myspace.com/parishilton'
- # [17:58] * Joins: gavin_ (gavin@99.226.207.11)
- # [18:01] * Quits: gavin_ (gavin@99.226.207.11) (Ping timeout)
- # [18:01] <Philip> DanC: Works for me
- # [18:01] <Philip> What error do you get?
- # [18:01] <DanC> File "/home/connolly/projects/html5lib/python/src/html5lib/html5parser.py", line 628, in endTagHead
- # [18:01] <DanC> assert node.name == "head"
- # [18:01] <DanC> AssertionError
- # [18:01] <Philip> (Also, is it the latest version of html5lib?)
- # [18:01] <DanC> I just did an svn up
- # [18:01] * Joins: gavin_ (gavin@99.226.207.11)
- # [18:02] <DanC> Updated to revision 1257.
- # [18:03] <Philip> $ svn up
- # [18:03] <Philip> At revision 1257.
- # [18:03] <Philip> $ wget http://www.myspace.com/parishilton -O parishilton.html
- # [18:03] <pimpbot> Title: MySpace.com - Paris Hilton - 27 - Female - California - www.myspace.com/parishilton (at www.myspace.com)
- # [18:03] <Philip> $ python parse.py parishilton.html --no-html
- # [18:03] <Philip> $
- # [18:03] <Philip> is what works for me
- # [18:03] <DanC> what fails for me is:
- # [18:03] <DanC> b$ python python/parse.py -x 'http://www.myspace.com/parishilton' >,ph.html
- # [18:04] <Philip> Oh
- # [18:04] <Philip> File "/home/philip/html/html5lib/python/src/html5lib/liberalxmlparser.py", line 64, in _parse
- # [18:04] <DanC> with --no-html it works, but it's considerably less useful. ;-)
- # [18:04] <Philip> I think liberalxmlparser is known to not work
- # [18:04] <Philip> but I don't know why -x activates that
- # [18:06] <Philip> (since --help says
- # [18:06] <Philip> -x, --xml Output as xml
- # [18:06] <Philip> )
- # [18:06] * Joins: ChrisWilson (cwilso@131.107.0.104)
- # [18:06] <pimpbot> bugmail: "[Bug 6389] New: Avoid double parse error on EOF in DOCTYPE state" ( message in thread) <http://lists.w3.org/Archives/Public/public-html-bugzilla/2009Jan/0012.html>
- # [18:06] <Philip> jgraham: or someone else: Would it seem sensible to make -x just output XML like it says, and add a new option like --liberal-xml-parser to use the liberal XML parser?
- # [18:10] * Joins: Lachy (Lachlan@85.196.122.246)
- # [18:14] * Joins: tlr (tlr@128.30.52.30)
- # [18:16] * Quits: tlr (tlr@128.30.52.30) (Quit: tlr)
- # [18:16] * Joins: tlr (tlr@128.30.52.30)
- # [18:16] * Philip hears no objections
- # [18:17] * Joins: tlr_ (tlr@128.30.52.30)
- # [18:18] * Quits: tlr_ (tlr@128.30.52.30) (Client exited)
- # [18:18] <Philip> DanC: If you svn up and run that command again, it'll use the proper HTML5 parser and then serialise as XML
- # [18:18] <Philip> DanC: (I'm assuming that you do actually want the proper HTML5 parser, not its liberal XML parser)
- # [18:18] <DanC> indeed. thanks!
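
The change discussed above separates two concerns: which parser reads the input, and which format is written out. A minimal sketch of that split, using `argparse`. The function names, the `liberal_xml` destination and the returned labels are hypothetical and are not taken from html5lib's actual parse.py; only the `-x, --xml` help text and the `--liberal-xml-parser` name come from the log.

```python
import argparse


def build_parser():
    """Build a command line where output format and parser choice are independent."""
    parser = argparse.ArgumentParser(prog="parse.py")
    parser.add_argument("filename", help="file or URL to parse")
    # -x only selects the output format, matching its help text.
    parser.add_argument("-x", "--xml", action="store_true",
                        help="Output as xml")
    # Hypothetical separate switch for opting in to the liberal XML parser.
    parser.add_argument("--liberal-xml-parser", action="store_true",
                        dest="liberal_xml",
                        help="Parse the input with the liberal XML parser")
    return parser


def choose_pipeline(args):
    """Return (parser label, output label) for the parsed arguments."""
    parser_name = "liberal-xml" if args.liberal_xml else "html5"
    output = "xml" if args.xml else "tree"
    return parser_name, output


args = build_parser().parse_args(["-x", "parishilton.html"])
print(choose_pipeline(args))  # ('html5', 'xml')
```

With this layout, `-x` alone keeps the HTML5 parser and only changes the serialisation, which is the behaviour Philip describes after his fix.
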
- # [18:19] <Philip> DanC: But you won't get an xmlns="http://blahblah/xhtml" on the output, if the input didn't have that, which I hope you won't mind too much
- # [18:19] <Philip> and if you do mind, you should ask someone to fix html5lib's XML serialiser :-)
- # [18:20] <DanC> I might be able to fix it as well as anybody else... but in this case, I think the guy who I'm helping just needs a SAX interface to myspace data
- # [18:20] <DanC> so the xmlns is prolly not a deal-breaker
- # [18:21] <Philip> (In particular, it's the serialiser in simpletree.py's toxml)
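
If the missing `xmlns` does matter to a consumer, one workaround is to add it after serialisation rather than inside the serialiser. A rough sketch with the standard library's `xml.etree.ElementTree`, not html5lib's simpletree code. It assumes the serialised output is already well-formed XML, and the helper name `add_default_namespace` is made up for illustration. The namespace URI is the standard XHTML one.

```python
import xml.etree.ElementTree as ET

XHTML_NS = "http://www.w3.org/1999/xhtml"


def add_default_namespace(xml_text, ns=XHTML_NS):
    """Return xml_text with xmlns set on the root, if the root has no namespace."""
    root = ET.fromstring(xml_text)
    # ElementTree writes namespaced tags as "{uri}name"; a bare name means
    # the input declared no namespace for the root element.
    if not root.tag.startswith("{"):
        root.set("xmlns", ns)
    return ET.tostring(root, encoding="unicode")


out = add_default_namespace("<html><body><p>hi</p></body></html>")
# Re-parsing shows that the elements are now in the XHTML namespace.
print(ET.fromstring(out).tag)
```

Setting `xmlns` as a plain attribute is a shortcut that works for output only; the in-memory tree is not namespaced until the text is parsed again.
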
- # [18:21] <DanC> wild... xmlns:myspace="http://x.myspacecdn.com/modules/sitesearch/static/rdf/profileschema.rdf#"
- # [18:22] <DanC> oh right... RDFa ... that's what brought this up in the 1st place.
- # [18:23] <hsivonen> "http://www.w3.org/TR/1999/PR-rdf-schema-19990303#" that's an unfortunate URI to get hard-coded as URI-as-identifier...
- # [18:23] <pimpbot> Title: Resource Description Framework (RDF) Schema Specification (at www.w3.org)
- # [18:23] <MikeSmith> http://www.w3.org/2007/03/HTML-WG-charter.html
- # [18:23] <pimpbot> Title: HTML Working Group (at www.w3.org)
- # [18:23] <Philip> Oh, if the input has got anything that parses into a DOM that cannot be dumbly serialised into well-formed XML, then parse.py is going to work pretty badly
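
To illustrate the point about dumb serialisation: an HTML parser can accept names that are not legal XML names (for example an attribute name containing a quote character), and a serialiser that writes names out unchecked then produces text that an XML parser rejects. The sketch below uses a deliberately naive, hypothetical `dumb_serialise` helper, not html5lib's own serialiser, and checks the result with `xml.etree.ElementTree`.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape, quoteattr


def dumb_serialise(name, attrs, text=""):
    """Serialise one element, escaping values but not validating names."""
    attr_str = "".join(f" {key}={quoteattr(value)}" for key, value in attrs.items())
    return f"<{name}{attr_str}>{escape(text)}</{name}>"


def is_well_formed(xml_text):
    """Report whether an XML parser accepts the text."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


ok = dumb_serialise("p", {"class": "intro"}, "a < b")
bad = dumb_serialise("p", {'on"click': "x"}, "hi")

print(is_well_formed(ok))   # True
print(is_well_formed(bad))  # False
```

Escaping text and attribute values is not enough on its own; names coming from real-world HTML would also need to be checked or rewritten before the output can be relied on as XML.
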
- # [18:24] <DanC> yeah, the RDF schema URI was one of the 1st ones minted; we didn't have a good feel for the trade-offs
- # [18:25] <DanC> that one doesn't look right, to me
- # [18:35] * Joins: dbaron (dbaron@98.234.51.190)
- # [18:49] * Joins: adele (adele@17.203.14.201)
- # [19:02] * Quits: hober (ted@206.212.254.2) (Quit: ERC Version 5.3 (IRC client for Emacs))
- # [19:02] * Joins: hober (ted@206.212.254.2)
- # [19:07] <pimpbot> bugmail: "[Bug 6390] New: Add a note about reaching after head state and the fragment mode" (1 message in thread) <http://lists.w3.org/Archives/Public/public-html-bugzilla/2009Jan/0013.html>
- # [19:13] * Joins: maddiin (mc@87.185.205.11)
- # [19:17] <rubys> re: "Would it seem sensible to make -x just output XML like it says, and add a new option like --liberal-xml-parser to use the liberal XML parser?" +1. That was my fault. Thanks for fixing it.
- # [19:20] <Philip> rubys: Ah, okay
- # [19:21] * Philip supposes someone should probably actually fix the liberal XML parser, rather than just hiding it, but he doesn't care about it enough (or indeed at all) to look at it himself :-)
- # [19:21] * Quits: darobin (robinb@82.233.247.234) (Ping timeout)
- # [19:22] <rubys> that was something I always meant to do, but I seem to be the only one interested in it
- # [19:29] <DanC> timbl is interested in it... or something nearby, at least: a common parser for XML and HTML
- # [19:30] <DanC> cf http://www.w3.org/2008/10/22-cleaning-tbl.html
- # [19:30] <pimpbot> Title: Cleaning up the Web - W3C Technical Plenary 2008 (at www.w3.org)
- # [19:32] <rubys> lots of people may be interested in having somebody *else* do the work <grin>
- # [19:33] <DanC> tim has been known to do a little coding here and there
- # [19:33] <DanC> but I doubt he'd dig into this one
- # [19:33] * Quits: maddiin (mc@87.185.205.11) (Quit: maddiin)
- # [19:33] <gsnedders> I was talking to tim after the reception about liberal XML, FWIW
- # [19:33] <DanC> I might, but it competes with a lot of stuff on my plate for a while
- # [19:34] <DanC> in fact, finishing publication of that "Cleaning up the Web" talk is in the "At Risk (maybe next week...)" part of my to-do list for this week
- # [19:35] <DanC> or was that last week
- # [19:35] * DanC skipped breakfast and should get lunch
- # [19:37] <pimpbot> bugmail: "[Bug 6390] Add a note about reaching after head state and the fragment mode" (1 message in thread) <http://lists.w3.org/Archives/Public/public-html-bugzilla/2009Jan/0014.html>
- # [19:49] * Quits: dbaron (dbaron@98.234.51.190) (Quit: 8403864 bytes have been tenured, next gc will be global.)
- # [20:01] * Joins: rking3 (rking3@24.5.77.167)
- # [20:05] * Joins: dbaron (dbaron@63.245.220.225)
- # [20:17] * Quits: ChrisWilson (cwilso@131.107.0.104) (Ping timeout)
- # [20:33] * Joins: maddiin (mc@87.185.213.121)
- # [20:37] * Joins: alexf (alejandro@85.152.42.1)
- # [20:44] * Quits: dbaron (dbaron@63.245.220.225) (Quit: 8403864 bytes have been tenured, next gc will be global.)
- # [20:44] * Joins: dbaron (dbaron@63.245.220.241)
- # [20:44] * Quits: dbaron (dbaron@63.245.220.241) (Connection reset by peer)
- # [20:47] * Parts: alexf (alejandro@85.152.42.1)
- # [20:49] * Quits: MikeSmith (MikeSmith@mcclure.w3.org) (Ping timeout)
- # [21:09] <Philip> rubys: You might want to change the subject line of the telcon agenda emails, since the latest one says "Re: {agenda} HTML WG telcon 2008-11-20"
- # [21:15] <rubys> oopsie
- # [21:15] * rubys hangs head in mock shame
- # [21:24] * Quits: primal1 (ccadwall@72.87.132.196) (Quit: primal1)
- # [21:30] <anne> no need, contrary to other WGs I'm involved in this one actually came nicely ahead of time :)
- # [21:52] * Quits: rking3 (rking3@24.5.77.167) (Quit: rking3)
- # [21:55] * Joins: ChrisWilson (cwilso@131.107.0.73)
- # [22:05] * Quits: Julian (chatzilla@217.91.35.233) (Ping timeout)
- # [22:06] * Joins: Julian (chatzilla@217.91.35.233)
- # [22:07] * Joins: rking3 (rking3@99.189.162.6)
- # [22:08] * Quits: rking3 (rking3@99.189.162.6) (Quit: rking3)
- # [22:28] * Quits: tH (Rob@129.11.83.58) (Quit: ChatZilla 0.9.84-rdmsoft [XULRunner 1.9.0.1/2008072406])
- # [22:37] * Quits: heycam (cam@124.168.97.132) (Quit: bye)
- # [22:45] * Quits: ROBOd (robod@89.122.216.38) (Quit: http://www.robodesign.ro )
- # [23:20] * Quits: MichaelC (Michael@128.30.52.30) (Quit: ChatZilla 0.9.84 [Firefox 3.0.5/2008120122])
- # [23:21] * Joins: tH (Rob@129.11.83.58)
- # [23:24] * Disconnected
- # [23:24] * Attempting to rejoin channel #html-wg
- # [23:24] * Rejoined channel #html-wg
- # [23:24] * Topic is 'HTML WG http://www.w3.org/html/wg/ ; This channel is logged: http://krijnhoetmer.nl/irc-logs/'
- # [23:24] * Set by DanC on Thu Jan 08 18:01:34
- # [23:34] * Joins: dbaron (dbaron@63.245.220.241)
- # [23:35] * Quits: dbaron (dbaron@63.245.220.241) (Connection reset by peer)
- # [23:35] * Joins: dbaron (dbaron@63.245.220.241)
- # [23:44] * Disconnected
- # [23:44] * Attempting to rejoin channel #html-wg
- # [23:44] * Rejoined channel #html-wg
- # [23:44] * Topic is 'HTML WG http://www.w3.org/html/wg/ ; This channel is logged: http://krijnhoetmer.nl/irc-logs/'
- # [23:44] * Set by DanC on Thu Jan 08 18:01:34
- # [23:50] * Quits: rubys (rubys@75.182.92.38) (Client exited)
- # Session Close: Wed Jan 14 00:00:00 2009