Options:
- # Session Start: Wed Aug 29 00:00:00 2007
- # Session Ident: #whatwg
- # [00:01] * Joins: Ducki__ (n=Ducki@c-180-209-29.cvx-l.dial.de.ignite.net)
- # [00:02] * Quits: KevinMarks (i=KevinMar@nat/google/x-42fc0687ce7b717c) ("The computer fell asleep")
- # [00:05] * Joins: KevinMarks (i=KevinMar@nat/google/x-f94ae5c00ba44eb1)
- # [00:08] * Quits: othermaciej (n=mjs@17.255.110.220) (Connection timed out)
- # [00:20] * Quits: G0k (n=hmason@cpe-24-58-3-19.twcny.res.rr.com)
- # [00:28] * Quits: Ducki_ (n=Ducki@1Cust198.tnt7.ber2.deu.da.uu.net) (Read error: 110 (Connection timed out))
- # [00:31] * Quits: dev0_ (i=Tobias@unaffiliated/icefox0) ("dev0_ has no reason")
- # [00:34] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [00:40] * Joins: karlUshi (n=karl@124-144-94-188.rev.home.ne.jp)
- # [00:43] * Quits: Ducki__ (n=Ducki@c-180-209-29.cvx-l.dial.de.ignite.net) (Read error: 110 (Connection timed out))
- # [00:44] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) (Read error: 104 (Connection reset by peer))
- # [00:45] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [00:55] * Quits: billmason (n=billmaso@ip156.unival.com) (Read error: 104 (Connection reset by peer))
- # [00:55] * Joins: billmason (n=billmaso@ip156.unival.com)
- # [00:59] * Quits: KevinMarks (i=KevinMar@nat/google/x-f94ae5c00ba44eb1) ("brb")
- # [00:59] * Joins: KevinMarks (i=KevinMar@nat/google/x-61295f2f09c3f848)
- # [01:07] * Joins: othermaciej (n=mjs@17.255.110.220)
- # [01:12] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) (Read error: 104 (Connection reset by peer))
- # [01:12] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [01:20] * Quits: billmason (n=billmaso@ip156.unival.com) (".")
- # [01:22] * Quits: othermaciej_ (n=mjs@17.203.15.177) (Read error: 110 (Connection timed out))
- # [01:36] * Quits: hasather (n=david@90-227-221-48-no62.tbcn.telia.com) (Remote closed the connection)
- # [01:41] * Quits: karlUshi (n=karl@124-144-94-188.rev.home.ne.jp) ("Where dwelt Ymir, or wherein did he find sustenance?")
- # [01:43] * Joins: othermaciej_ (n=mjs@17.203.15.177)
- # [01:57] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) (Read error: 104 (Connection reset by peer))
- # [01:58] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [01:58] * Joins: webben_ (n=benh@91.84.196.77)
- # [02:05] * Quits: othermaciej (n=mjs@17.255.110.220) (Read error: 110 (Connection timed out))
- # [02:18] * Quits: webben (n=benh@91.84.196.77) (Read error: 110 (Connection timed out))
- # [02:21] * Quits: webben_ (n=benh@91.84.196.77)
- # [02:24] * Quits: dbaron (n=dbaron@corp-241.mountainview.mozilla.com) (Remote closed the connection)
- # [02:26] * Joins: cplot (n=cplot@c-71-57-41-70.hsd1.il.comcast.net)
- # [02:27] * Joins: karlUshi (n=karl@fnttkyo029008.tkyo.fnt.ftth2.ppp.infoweb.ne.jp)
- # [02:31] * Joins: tantek_ (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [02:32] * Quits: karlUshi (n=karl@fnttkyo029008.tkyo.fnt.ftth2.ppp.infoweb.ne.jp) (Client Quit)
- # [02:32] * Joins: karlUshi (n=karl@fnttkyo029008.tkyo.fnt.ftth2.ppp.infoweb.ne.jp)
- # [02:35] * Quits: tantek_ (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) (Read error: 104 (Connection reset by peer))
- # [02:36] * Joins: tantek_ (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [02:36] * Quits: cplot (n=cplot@c-71-57-41-70.hsd1.il.comcast.net)
- # [02:41] * Quits: tndH (i=Rob@adsl-87-102-38-101.karoo.KCOM.COM) ("ChatZilla 0.9.78.1-rdmsoft [XULRunner 1.8.0.9/2006120508]")
- # [02:44] * Joins: yod (n=ot@softbank221018155222.bbtec.net)
- # [02:44] * Quits: tantek_ (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) (Read error: 104 (Connection reset by peer))
- # [02:44] * Joins: tantek_ (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [02:48] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) (Read error: 110 (Connection timed out))
- # [02:50] * Quits: tantek_ (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) (Read error: 104 (Connection reset by peer))
- # [02:51] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [02:52] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) (Connection reset by peer)
- # [02:53] * Joins: dbaron (n=dbaron@corp-241.mountainview.mozilla.com)
- # [02:53] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [02:55] * Quits: othermaciej_ (n=mjs@17.203.15.177) (Read error: 104 (Connection reset by peer))
- # [02:55] * Joins: othermaciej (n=mjs@17.203.15.177)
- # [02:55] * Joins: cplot (n=cplot@c-71-57-41-70.hsd1.il.comcast.net)
- # [03:07] * Lachy_ is now known as Lachy
- # [03:11] * Quits: h3h (n=w3rd@66-162-32-234.static.twtelecom.net) ("|")
- # [03:11] * Quits: kingryan (n=kingryan@corp.technorati.com)
- # [03:13] * Joins: MikeSmith (n=MikeSmit@eM60-254-206-143.pool.emnet.ne.jp)
- # [03:15] * Quits: psa (n=yomode@posom.com) (heinlein.freenode.net irc.freenode.net)
- # [03:16] * Joins: psa (n=yomode@posom.com)
- # [03:36] * Quits: cplot (n=cplot@c-71-57-41-70.hsd1.il.comcast.net)
- # [03:37] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) (Read error: 104 (Connection reset by peer))
- # [03:37] * Joins: cplot (n=cplot@c-71-57-41-70.hsd1.il.comcast.net)
- # [03:37] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [03:43] * Quits: karlUshi (n=karl@fnttkyo029008.tkyo.fnt.ftth2.ppp.infoweb.ne.jp) (heinlein.freenode.net irc.freenode.net)
- # [03:43] * Joins: karlUshi (n=karl@fnttkyo029008.tkyo.fnt.ftth2.ppp.infoweb.ne.jp)
- # [03:45] * Quits: othermaciej (n=mjs@17.203.15.177) (Read error: 104 (Connection reset by peer))
- # [03:45] * Joins: othermaciej (n=mjs@17.203.15.177)
- # [03:46] * Quits: cplot (n=cplot@c-71-57-41-70.hsd1.il.comcast.net)
- # [04:01] * Joins: h3h (n=w3rd@cpe-76-88-44-219.san.res.rr.com)
- # [04:20] * Quits: aroben (n=adamrobe@unaffiliated/aroben)
- # [04:46] * Joins: G0k (n=hmason@cpe-24-58-3-19.twcny.res.rr.com)
- # [05:02] * Quits: weinig (i=weinig@nat/apple/x-ee5aa2dd58744fdd)
- # [05:02] * moeffju is now known as moeffju[ZzZz]
- # [05:04] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [05:04] * Quits: KevinMarks (i=KevinMar@nat/google/x-61295f2f09c3f848) ("The computer fell asleep")
- # [05:05] <markp> jgraham: you around?
- # [05:05] <markp> or any other html5lib hackers?
- # [05:05] * Quits: othermaciej (n=mjs@17.203.15.177) (Read error: 104 (Connection reset by peer))
- # [05:06] * Joins: othermaciej (n=mjs@17.203.15.177)
- # [05:09] * Joins: aroben (n=adamrobe@c-67-160-250-192.hsd1.ca.comcast.net)
- # [05:09] * Quits: aroben (n=adamrobe@unaffiliated/aroben) (Remote closed the connection)
- # [05:10] * Joins: aroben (n=adamrobe@c-67-160-250-192.hsd1.ca.comcast.net)
- # [05:32] * Joins: [1]markp (i=markp@nat/google/x-c19c3ef2d0706754)
- # [05:38] * Joins: weinig (n=weinig@c-24-6-210-182.hsd1.ca.comcast.net)
- # [05:39] * Quits: dbaron (n=dbaron@corp-241.mountainview.mozilla.com) ("8403864 bytes have been tenured, next gc will be global.")
- # [05:45] * Quits: karlUshi (n=karl@fnttkyo029008.tkyo.fnt.ftth2.ppp.infoweb.ne.jp) ("Where dwelt Ymir, or wherein did he find sustenance?")
- # [05:47] * Quits: markp (i=markp@nat/google/x-5ce81e517e8c8aab) (Read error: 110 (Connection timed out))
- # [05:47] * [1]markp is now known as markp
- # [05:53] * Quits: h3h (n=w3rd@cpe-76-88-44-219.san.res.rr.com)
- # [05:53] * Quits: csarven (n=nevrasc@modemcable081.152-201-24.mc.videotron.ca) ("http:/www.csarven.ca")
- # [06:03] * Quits: G0k (n=hmason@cpe-24-58-3-19.twcny.res.rr.com)
- # [06:03] * Quits: weinig (n=weinig@c-24-6-210-182.hsd1.ca.comcast.net) (Read error: 104 (Connection reset by peer))
- # [06:03] * Joins: weinig (n=weinig@c-24-6-210-182.hsd1.ca.comcast.net)
- # [06:08] * Joins: [1]markp (i=markp@nat/google/x-fc47d1f132e6d17d)
- # [06:10] * Quits: doublec (n=doublec@202.180.114.137)
- # [06:25] * Quits: markp (i=markp@nat/google/x-c19c3ef2d0706754) (Read error: 110 (Connection timed out))
- # [06:25] * [1]markp is now known as markp
- # [07:00] * Quits: jwalden (n=waldo@corp-241.mountainview.mozilla.com) ("ChatZilla 0.9.78.1-rdmsoft [XULRunner 1.8.0.9/2006120508]")
- # [07:10] * Quits: othermaciej (n=mjs@17.203.15.177) (Read error: 110 (Connection timed out))
- # [07:17] * Quits: markp (i=markp@nat/google/x-fc47d1f132e6d17d) (Read error: 110 (Connection timed out))
- # [07:23] * Joins: markp (n=markp@38.99.84.33)
- # [07:31] * Joins: jwalden (n=waldo@207.47.1.108.static.nextweb.net)
- # [08:00] * Joins: othermaciej (n=mjs@dsl081-048-145.sfo1.dsl.speakeasy.net)
- # [08:43] * Quits: weinig (n=weinig@c-24-6-210-182.hsd1.ca.comcast.net)
- # [08:53] * Quits: MikeSmith (n=MikeSmit@eM60-254-206-143.pool.emnet.ne.jp) ("Less talk, more pimp walk.")
- # [09:04] * Quits: markp (n=markp@38.99.84.33) (Read error: 110 (Connection timed out))
- # [09:08] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [09:13] * Joins: MikeSmith (n=MikeSmit@eM60-254-212-214.pool.emnet.ne.jp)
- # [09:29] * Joins: karlUshi (n=karl@124-144-94-188.rev.home.ne.jp)
- # [09:49] * Quits: virtuelv (n=virtuelv@pat-tdc.opera.com) ("Leaving")
- # [09:52] * Quits: yod (n=ot@softbank221018155222.bbtec.net) ("Leaving")
- # [09:53] * Joins: virtuelv (n=virtuelv@pat-tdc.opera.com)
- # [10:09] * Quits: aroben (n=adamrobe@unaffiliated/aroben)
- # [10:10] * Quits: zcorpan (n=zcorpan@c-d391e355.022-154-6c6b7013.cust.bredbandsbolaget.se) (Read error: 110 (Connection timed out))
- # [10:17] * Joins: kfish (n=conrad@61.194.21.25)
- # [10:22] * Quits: MikeSmith (n=MikeSmit@eM60-254-212-214.pool.emnet.ne.jp) ("Less talk, more pimp walk.")
- # [10:34] * Joins: zcorpan (n=zcorpan@pat.se.opera.com)
- # [10:34] * Quits: psa (n=yomode@posom.com) (Remote closed the connection)
- # [10:35] * Quits: virtuelv (n=virtuelv@pat-tdc.opera.com) ("Leaving")
- # [10:38] * Parts: zcorpan (n=zcorpan@pat.se.opera.com)
- # [10:40] * Joins: virtuelv (n=virtuelv@pat-tdc.opera.com)
- # [11:03] * Joins: BenWard (i=BenWard@nat/yahoo/x-efb994a36d3b4de1)
- # [11:10] * Quits: jwalden (n=waldo@207.47.1.108.static.nextweb.net) (Remote closed the connection)
- # [11:10] * Joins: jwalden (n=waldo@207.47.1.108.static.nextweb.net)
- # [11:15] * Joins: tndH (i=Rob@adsl-87-102-38-101.karoo.KCOM.COM)
- # [11:22] * Joins: jwalden_ (n=waldo@207.47.1.108.static.nextweb.net)
- # [11:22] * Quits: jwalden (n=waldo@207.47.1.108.static.nextweb.net) (Dead socket)
- # [11:22] * jwalden_ is now known as jwalden
- # [11:27] * Quits: othermaciej (n=mjs@dsl081-048-145.sfo1.dsl.speakeasy.net)
- # [11:40] * Quits: karlUshi (n=karl@124-144-94-188.rev.home.ne.jp) ("Where dwelt Ymir, or wherein did he find sustenance?")
- # [11:43] * Joins: ROBOd (n=robod@86.34.246.154)
- # [11:47] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [11:59] * Quits: Lachy (n=Lachy@124-170-30-206.dyn.iinet.net.au) ("ChatZilla 0.9.78.1 [Firefox 2.0.0.6/2007072518]")
- # [12:06] * Joins: Ducki (n=Ducki@nrdh-d9b98070.pool.mediaWays.net)
- # [12:30] * Joins: zcorpan (n=zcorpan@pat.se.opera.com)
- # [12:32] * Joins: dev0 (i=Tobias@unaffiliated/icefox0)
- # [12:44] * Joins: Lachy (n=Lachy@124-170-30-206.dyn.iinet.net.au)
- # [13:13] * Quits: Lachy (n=Lachy@124-170-30-206.dyn.iinet.net.au) (Remote closed the connection)
- # [13:33] * Joins: hasather (n=david@90-227-221-48-no62.tbcn.telia.com)
- # [13:38] * Quits: kfish (n=conrad@61.194.21.25) ("Pike!")
- # [14:00] * Joins: Ducki_ (n=Ducki@nrdh-d9b9804a.pool.mediaWays.net)
- # [14:17] * Joins: weinig (n=weinig@c-24-6-210-182.hsd1.ca.comcast.net)
- # [14:26] * Quits: Ducki (n=Ducki@nrdh-d9b98070.pool.mediaWays.net) (Read error: 113 (No route to host))
- # [15:02] * moeffju[ZzZz] is now known as moeffju[Away]
- # [15:05] * Quits: weinig (n=weinig@c-24-6-210-182.hsd1.ca.comcast.net)
- # [15:22] <virtuelv> is anyone going to be offended if I refer to the selectors API naming debate as a bikeshed problem?
- # [15:27] <zcorpan> not me
- # [15:27] <zcorpan> and i participated in it :(
- # [15:28] * zcorpan will stay away from naming debates in the future
- # [15:28] <Dashiva> It was a bikeshed problem where all the colors were ugly shades of purple and beige
- # [15:30] <virtuelv> fwiw, http://programming.reddit.com/info/2jrrg/comments
- # [15:35] <zcorpan> accessing radiobuttons with the [[Get]] method is funny -- if there's only one control that matches, it returns it directly, but if there are more it returns a nodelist
- # [15:35] <zcorpan> document.forms[0]["foo"]
- # [15:36] <zcorpan> and undefined if 0 match
- # [15:44] <Dashiva> Same with checkboxes
- # [15:45] <zcorpan> yeah
- # [15:50] <Dashiva> And we should do our best to prevent the same behavior from happening to responsebodies in multipart xhr responses :)
- # [15:57] * Quits: Ducki_ (n=Ducki@nrdh-d9b9804a.pool.mediaWays.net) (Read error: 104 (Connection reset by peer))
- # [16:09] * Quits: hasather (n=david@90-227-221-48-no62.tbcn.telia.com) ("leaving")
- # [16:16] * Joins: hasather (n=david@90-227-221-48-no62.tbcn.telia.com)
- # [16:32] * Joins: billmason (n=billmaso@ip156.unival.com)
- # [17:07] <gsnedders> is there anyway to check WF2 support through JS?
- # [17:11] * Joins: doublec (n=doublec@203-97-173-6.cable.telstraclear.net)
- # [17:11] <gsnedders> document.implementation.hasFeature('WebForms', '2.0')?
- # [17:11] * Joins: Codler (n=Codler@84-218-4-13.eurobelladsl.telenor.se)
- # [17:24] <krijnh> gsnedders: Yeah, that returns true in Opera
- # [17:26] <zcorpan> gsnedders: it's probably safer to check specific methods etc before using them
- # [17:26] <gsnedders> zcorpan: need some way to check if |input|@type=date has a native controller before falling back to a JavaScript one
- # [17:27] <gsnedders> (which will just be a text field on something with neither WF2 or JS)
- # [17:27] <zcorpan> .type == "date" ?
- # [17:27] <gsnedders> does that work?
- # [17:27] <zcorpan> think so
- # [17:27] <gsnedders> does that set itself to "text" in browsers that don't support it?
- # [17:28] <zcorpan> exactly
- # [17:28] <zcorpan> so you know if it's not supported
- # [17:28] <zcorpan> if (input.type != "date") { // not supported
- # [17:28] <gsnedders> falls back in Saf at least
- # [17:30] <Dashiva> falls back in ie and ff too
- # [17:30] <Dashiva> ff2, ie7
- # [17:30] <zcorpan> there you go
- # [17:31] <gsnedders> zcorpan: thanks
- # [17:31] <zcorpan> welcome :)
- # [17:34] * Quits: doublec (n=doublec@203-97-173-6.cable.telstraclear.net)
- # [17:49] * Joins: aroben (n=adamrobe@17.203.15.195)
- # [17:57] * Joins: markp (n=markp@38.99.84.33)
- # [18:06] * Joins: Ducki (n=Ducki@nrdh-d9b98041.pool.mediaWays.net)
- # [18:12] * Quits: dolphinling (n=chatzill@132.198.252.57) ("pong timeout")
- # [18:22] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [18:30] * Quits: markp (n=markp@38.99.84.33) (Read error: 110 (Connection timed out))
- # [18:35] * Joins: markp (i=markp@nat/google/x-230ef89d21fd75e0)
- # [18:35] * Joins: KevinMarks (i=KevinMar@nat/google/x-5775183821a6b4aa)
- # [18:36] * Quits: jwalden (n=waldo@207.47.1.108.static.nextweb.net) ("ChatZilla 0.9.78.1-rdmsoft [XULRunner 1.8.0.9/2006120508]")
- # [18:36] * Joins: Lachy (n=Lachy@124-170-30-206.dyn.iinet.net.au)
- # [18:37] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net) (Read error: 104 (Connection reset by peer))
- # [18:37] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [18:48] * Quits: Ducki (n=Ducki@nrdh-d9b98041.pool.mediaWays.net) (Read error: 104 (Connection reset by peer))
- # [18:55] * Quits: markp (i=markp@nat/google/x-230ef89d21fd75e0) (Read error: 110 (Connection timed out))
- # [18:55] * Joins: h3h (n=w3rd@66-162-32-234.static.twtelecom.net)
- # [18:59] * Joins: jwalden_ (n=waldo@corp-241.mountainview.mozilla.com)
- # [18:59] * jwalden_ is now known as jwalden
- # [19:05] * Joins: weinig (i=weinig@nat/apple/x-36a12a42d9c32b7c)
- # [19:12] * Joins: markp (i=markp@nat/google/x-dfe5f365b2427940)
- # [19:13] * Quits: zcorpan (n=zcorpan@pat.se.opera.com) (Read error: 110 (Connection timed out))
- # [19:22] <Lachy> http://ajaxian.com/archives/selectors-api-method-names-selectelement-and-selectallelements
- # [19:22] <Lachy> http://dean.edwards.name/weblog/2007/08/names/
- # [19:22] <Lachy> http://lachy.id.au/log/2007/06/naming-debate
- # [19:23] * Joins: dev0_ (i=Tobias@dslb-088-077-241-235.pools.arcor-ip.net)
- # [19:25] <Lachy> oops, wrong link. http://lachy.id.au/log/2007/08/naming-debate-revisited
- # [19:36] * Joins: tndH_ (i=Rob@83.100.251.102)
- # [19:37] * Joins: kingryan (n=kingryan@corp.technorati.com)
- # [19:39] * Quits: dev0 (i=Tobias@unaffiliated/icefox0) (Read error: 110 (Connection timed out))
- # [19:40] * Joins: [1]markp (i=markp@nat/google/x-4a69c0b28c223d49)
- # [19:42] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [19:44] * Joins: webben (i=benh@nat/yahoo/x-74567a839898ed18)
- # [19:50] * Quits: markp (i=markp@nat/google/x-dfe5f365b2427940) (Read error: 110 (Connection timed out))
- # [19:50] * [1]markp is now known as markp
- # [19:55] * Quits: tndH (i=Rob@adsl-87-102-38-101.karoo.KCOM.COM) (Read error: 110 (Connection timed out))
- # [19:55] * Quits: BenWard (i=BenWard@nat/yahoo/x-efb994a36d3b4de1) ("Fades out again…")
- # [19:55] * Joins: yod (n=ot@softbank221018155222.bbtec.net)
- # [20:01] * Joins: psa (n=yomode@posom.com)
- # [20:13] <gsnedders> kingryan: what did you do regarding text/plain sniffing?
- # [20:13] <kingryan> gsnedders: what do you mean?
- # [20:14] <gsnedders> kingryan: in your implementation of feed/html, did you just count every text/plain document as such, and not do what browsers do, and sniff it?
- # [20:14] <kingryan> the sniffing impl I've added to html5lib/ruby isn't actually integrated w/ the rest of the parser
- # [20:15] <kingryan> in other words, I haven't implemented the part of the decision tree that switches on content-type
- # [20:16] <kingryan> ... if that answers your question, gsnedders
- # [20:16] <gsnedders> ya
- # [20:16] <kingryan> I've also implemented it at technorati, where we don't really care about text/plain
- # [20:16] <kingryan> we only really care wether something is a feed or html
- # [20:16] <kingryan> or other
- # [20:17] <gsnedders> kingryan: but what if text/plain _is_ a feed or HTML?
- # [20:17] <kingryan> I wasn't clear... we don't care if people *say* its text/plain
- # [20:17] <gsnedders> you just sniff it regardless of the claim?
- # [20:18] <kingryan> yup
- # [20:19] <kingryan> but we're only indexing URLs that either 1) people pinged us with or 2) were discovered through link[@rel=~alternate]
- # [20:21] <kingryan> so, it works well enough for us
- # [20:21] <gsnedders> the feed/html algorithm currently concludes "otherwise return text/html". How do you tell apart HTML from the rest?
- # [20:22] <kingryan> because of where it sits in the decision tree, it doesn't matter
- # [20:22] * markp goes off to start a text/plain blog
- # [20:22] <kingryan> if someone says "this is a feed", but its not, we give up on processing it
- # [20:22] <markp> does that happen a lot?
- # [20:22] <kingryan> the common case is an http 500 which has html
- # [20:23] <markp> heh
- # [20:23] <kingryan> or 503
- # [20:23] <gsnedders> 503?
- # [20:23] <kingryan> 503 = "temporarily unavailable"
- # [20:23] <markp> http error 503: my web hosting sucks
- # [20:23] <gsnedders> markp: I was wondering whether I'd get such a response from you :)
- # [20:23] <kingryan> the more common case is that someone says their feed is rss , but its really atom
- # [20:24] <gsnedders> I don't even differentiate between the two. Just check if we have a feed or not.
- # [20:24] <markp> kingryan: do you use feedparser to determine feed version?
- # [20:25] <markp> just curious
- # [20:27] <kingryan> markp: our old spider uses UFP, but is slowly being replaced by one written in ruby
- # [20:28] <kingryan> for which the parsing is mostly based on html5lib
- # [20:29] * Joins: zcorpan (n=zcorpan@85.227.145.211)
- # [20:41] * Philip` tries running the html5lib validator on his documentation examples
- # [20:42] <Philip`> http://james.html5.org/cgi-bin/parsetree/parsetree.py?source=%3C%21DOCTYPE+HTML%3E%0D%0A%3Ctitle%3E%3Cb%3E+%26amp%3B+%3Ci%3E%3C%2Ftitle%3E - how come that's sensible there, but gives different results in SVN html5lib?
- # [20:43] * Quits: dev0_ (i=Tobias@unaffiliated/icefox0) (Read error: 110 (Connection timed out))
- # [20:48] <zcorpan> Philip`: what does the latter give?
- # [20:49] <zcorpan> shouldn't the innerHTML view escape < and > in title, btw?
- # [20:49] <zcorpan> and &
- # [20:50] <Philip`> >>> print p.parse('<!doctype html><title><b> & <i></title>').printTree()
- # [20:50] <Philip`> #document
- # [20:50] <Philip`> | <!DOCTYPE html>
- # [20:51] <Philip`> | <html>
- # [20:51] <Philip`> | <head>
- # [20:51] <Philip`> | <title>
- # [20:51] <Philip`> | <body>
- # [20:51] <Philip`> | <b>
- # [20:51] <Philip`> | " & "
- # [20:51] <Philip`> | <i>
- # [20:51] <zcorpan> aha.
- # [20:52] <zcorpan> regression then
- # [20:53] <Philip`> Looks like <title> is meant to be parsed as RCDATA
- # [20:54] <zcorpan> yes
- # [22:58] * Disconnected
- # [22:58] * Attempting to rejoin channel #whatwg
- # [22:58] * Rejoined channel #whatwg
- # [22:58] * Topic is 'WHATWG (HTML5) -- http://www.whatwg.org/ -- Logs: http://krijnhoetmer.nl/irc-logs/ -- Please leave your sense of logic at the door, thanks!'
- # [22:58] * Set by Hixie on Tue Apr 03 04:10:22
- # [22:58] <jgraham> Oh attribute values
- # [22:58] <jgraham> I didn't do those
- # [22:58] <zcorpan> right
- # [22:59] * Joins: aroben (n=adamrobe@unaffiliated/aroben)
- # [23:00] * Joins: weinig_ (n=weinig@17.255.99.100)
- # [23:00] <jgraham> That looks a bit more promising
- # [23:00] * Joins: wildcfo (n=wild_c_f@ip72-200-107-73.tc.ph.cox.net)
- # [23:00] * Parts: wildcfo (n=wild_c_f@ip72-200-107-73.tc.ph.cox.net) ("Leaving")
- # [23:02] <zcorpan> jgraham: <plaintext> is also a cdata element
- # [23:02] <zcorpan> for this purpose anyway
- # [23:03] <zcorpan> otherwise looks right
- # [23:04] <zcorpan> hmm, http://james.html5.org/cgi-bin/parsetree/parsetree.py?source=%3Cnoscript%3E doesn't look like it's parsed correctly (or there's a bug in the spec)
- # [23:04] <Philip`> markp: Oh, I'm happy to slack - I just wanted to see if it worked already, and only found that one problem, so I'm content to wait until it works even better in the future ;-)
- # [23:08] <zcorpan> "<noscript>" is supposed to be parsed into <html><head><noscript></noscript></head><body></body></html> if i read the spec right
- # [23:09] <jgraham> zcorpan: That looks more plausible
- # [23:10] * jgraham is wondering why html5lib has constants named rcdataElements and cdataElements that seem to be almost exactly back to front
- # [23:12] * Quits: weinig (i=weinig@nat/apple/x-36a12a42d9c32b7c) (Read error: 110 (Connection timed out))
- # [23:13] * Quits: aroben_ (n=adamrobe@unaffiliated/aroben) (Read error: 110 (Connection timed out))
- # [23:22] * Quits: weinig_ (n=weinig@17.255.99.100)
- # [23:23] * Joins: aroben_ (n=adamrobe@unaffiliated/aroben)
- # [23:25] * Joins: othermaciej (n=mjs@dsl081-048-145.sfo1.dsl.speakeasy.net)
- # [23:32] * Joins: weinig (i=weinig@nat/apple/x-a5e8e96ea8914a4e)
- # [23:39] <jgraham> zcorpan: Where should the </noscript> be generated in the spec?
- # [23:41] * Quits: weinig (i=weinig@nat/apple/x-a5e8e96ea8914a4e)
- # [23:41] <zcorpan> jgraham: i don't understand the question
- # [23:41] * Joins: weinig (i=weinig@nat/apple/x-2f071f5d370aae0d)
- # [23:42] * Quits: weinig (i=weinig@nat/apple/x-2f071f5d370aae0d) (Client Quit)
- # [23:42] * Joins: weinig (i=weinig@nat/apple/x-55902fa5c18e7762)
- # [23:42] <zcorpan> jgraham: you mean as an implied token in the tree construction?
- # [23:42] <jgraham> Yeah
- # [23:42] <zcorpan> they aren't
- # [23:43] <zcorpan> the cdata parsing algorithm just looks until it finds a token that's not a character token
- # [23:43] <zcorpan> if that's the end tag it's ignored
- # [23:43] <zcorpan> otherwise you carry on
- # [23:43] <zcorpan> so say the tokens are: start tag "noscript", character "X", end-of-file
- # [23:44] * Quits: aroben (n=adamrobe@unaffiliated/aroben) (Connection timed out)
- # [23:44] <jgraham> Oh, I see
- # [23:44] <jgraham> We're mistakenly inserting the token for <noscript> into the stack of open elements
- # [23:44] <zcorpan> when you get to the end-of-file it's a parse error and you get back to "in head"
- # [23:45] <zcorpan> seems so
- # [23:46] * moeffju[Away] is now known as moeffju
- # [23:47] <zcorpan> it should get the same treatment as <style> i think
- # [23:47] <zcorpan> basically
- # [23:48] <zcorpan> or exactly :)
- # [23:49] <zcorpan> if scripting is enabled
- # [23:49] <jgraham> zcorpan: I think we get <style> wrong too
- # [23:49] <zcorpan> http://james.html5.org/cgi-bin/parsetree/parsetree.py?source=%3Cstyle%3E
- # [23:49] <jgraham> Eh, that doesn't look like it should work
- # [23:58] * moeffju is now known as moeffju[afk]
- # [23:59] <jgraham> zcorpan: OK I have a fix but I think this code needs a little more love (tomorrow :) )
- # Session Close: Thu Aug 30 00:00:00 2007
The end :)