/irc-logs / freenode / #whatwg / 2010-07-31 / end

Options:

  1. # Session Start: Sat Jul 31 00:00:00 2010
  2. # Session Ident: #whatwg
  3. # [00:17] * Quits: ttepasse (~ttepasse@ip-109-90-160-217.unitymediagroup.de) (Read error: Connection reset by peer)
  4. # [00:20] * Joins: f1lt3r (~f1lt3r@64.119.159.231)
  5. # [00:26] * Quits: oal (~oal@5.79-160-122.customer.lyse.net) (Remote host closed the connection)
  6. # [00:33] * Quits: Maurice (copyman@5ED573FA.cable.ziggo.nl)
  7. # [00:37] * Joins: miketaylr (~miketaylr@24.42.95.108)
  8. # [00:42] * Quits: meandi2 (~meandi@dynadsl-080-228-79-198.ewetel.net) (Quit: Nettalk6 - www.ntalk.de)
  9. # [00:43] * Joins: hamcore (hamcore@unaffiliated/msmosso)
  10. # [00:44] * Quits: miketaylr (~miketaylr@24.42.95.108) (Ping timeout: 264 seconds)
  11. # [00:45] * Joins: miketaylr (~miketaylr@24.42.95.108)
  12. # [00:48] * Quits: workmad3 (~workmad3@cpc3-bagu10-0-0-cust651.1-3.cable.virginmedia.com) (Remote host closed the connection)
  13. # [00:55] * Joins: Alystair (Alystair@bas1-toronto10-1279558760.dsl.bell.ca)
  14. # [00:56] * Quits: miketaylr (~miketaylr@24.42.95.108) (Ping timeout: 246 seconds)
  15. # [01:06] * Quits: JonathanNeal (~Jonathan_@99-59-124-67.lightspeed.irvnca.sbcglobal.net) (Read error: Connection reset by peer)
  16. # [01:06] * Joins: JonathanNeal (~Jonathan_@99-59-124-67.lightspeed.irvnca.sbcglobal.net)
  17. # [01:13] * Quits: eighty4 (~eighty4@c-76c8e455.012-403-6c6b701.cust.bredbandsbolaget.se) (Remote host closed the connection)
  18. # [01:15] * Joins: deepthawtz (~deepthawt@173-11-69-17-SFBA.hfc.comcastbusiness.net)
  19. # [01:21] * Joins: seventh (~seventh@64-9-175-104.fwd.datafoundry.com)
  20. # [01:21] * Quits: mdelaney (~mdelaney@2620:0:1b00:1191:d69a:20ff:febf:89a0) (Quit: mdelaney)
  21. # [01:21] * Quits: seventh (~seventh@64-9-175-104.fwd.datafoundry.com) (Remote host closed the connection)
  22. # [01:21] * Quits: richardschwerdtf (~RichS@99-39-114-91.lightspeed.austtx.sbcglobal.net) (Quit: richardschwerdtf)
  23. # [01:25] * Quits: JonathanNeal (~Jonathan_@99-59-124-67.lightspeed.irvnca.sbcglobal.net) (Read error: Connection reset by peer)
  24. # [01:25] * Joins: JonathanNeal (~Jonathan_@99-59-124-67.lightspeed.irvnca.sbcglobal.net)
  25. # [01:33] * Quits: deepthawtz (~deepthawt@173-11-69-17-SFBA.hfc.comcastbusiness.net) (Remote host closed the connection)
  26. # [01:34] * Quits: JonathanNeal (~Jonathan_@99-59-124-67.lightspeed.irvnca.sbcglobal.net) (Read error: Connection reset by peer)
  27. # [01:34] * Joins: Jon_Neal (~Jonathan_@99-59-124-67.lightspeed.irvnca.sbcglobal.net)
  28. # [01:34] * Quits: Jon_Neal (~Jonathan_@99-59-124-67.lightspeed.irvnca.sbcglobal.net) (Read error: Connection reset by peer)
  29. # [01:35] * Quits: paul_irish (~paul_iris@nat/google/x-djjuistvkrlgsuaq) (Remote host closed the connection)
  30. # [01:35] * Quits: Lachy (~Lachlan@cm-84.215.59.50.getinternet.no) (Quit: Leaving)
  31. # [01:35] * Joins: Lachy (~Lachlan@cm-84.215.59.50.getinternet.no)
  32. # [01:46] * Quits: ZombieLoffe (~e@unaffiliated/zombieloffe)
  33. # [01:47] * Quits: dglazkov (~dglazkov@nat/google/x-qsijtshgtwasgira) (Quit: dglazkov)
  34. # [01:49] * Quits: othermaciej (~mjs@17.246.18.24) (Quit: othermaciej)
  35. # [01:53] * Joins: paul_irish (~paul_iris@nat/google/x-yhmuwvckcjqgoson)
  36. # [01:55] * Quits: stalled (~stalled@unaffiliated/stalled) (Ping timeout: 265 seconds)
  37. # [02:02] * Joins: seventh (seventh@64-9-175-104.fwd.datafoundry.com)
  38. # [02:09] * Joins: stalled (~stalled@unaffiliated/stalled)
  39. # [02:21] * Quits: paul_irish (~paul_iris@nat/google/x-yhmuwvckcjqgoson) (Remote host closed the connection)
  40. # [02:25] * Joins: paul_irish (~paul_iris@nat/google/x-trrfohavsfnpckpm)
  41. # [02:29] * Quits: dave_levin (~dave_levi@74.125.59.73) (Quit: dave_levin)
  42. # [02:30] * Joins: titacgs (~titacgs@201.250.188.146)
  43. # [02:34] * Joins: miketaylr (~miketaylr@24.42.95.108)
  44. # [02:35] * Joins: dglazkov (~dglazkov@75-37-194-175.lightspeed.lsatca.sbcglobal.net)
  45. # [02:37] * Quits: jwalden (~waldo@nat/mozilla/x-wdggqfmbaegqzipu) (Quit: ChatZilla 0.9.86-rdmsoft [XULRunner 1.9.2.7/20100723203751])
  46. # [02:38] * Quits: cardona507 (~cardona50@c-67-180-160-250.hsd1.ca.comcast.net) (Quit: zzzzz)
  47. # [02:40] * Quits: Necrathex (~bleptop@212-123-163-12.ip.telfort.nl) (Quit: Necrathex)
  48. # [02:45] * Joins: jwalden (~waldo@nat/mozilla/x-tvmlslwqgykcklui)
  49. # [02:46] * Quits: jlebar (~jlebar@63.245.220.220) (Quit: Leaving)
  50. # [02:57] * Quits: roc (~roc@121.98.230.221) (Ping timeout: 276 seconds)
  51. # [03:08] * Quits: dglazkov (~dglazkov@75-37-194-175.lightspeed.lsatca.sbcglobal.net) (Quit: dglazkov)
  52. # [03:09] * Quits: ojan (~ojan@nat/google/x-toubowkbkhiajayo) (Quit: ojan)
  53. # [03:13] * Quits: miketaylr (~miketaylr@24.42.95.108) (Quit: miketaylr)
  54. # [03:15] * Joins: othermaciej (~mjs@17.246.18.24)
  55. # [03:40] * Joins: erlehmann (~erlehmann@dslb-094-223-090-112.pools.arcor-ip.net)
  56. # [03:40] * Quits: paul_irish (~paul_iris@nat/google/x-trrfohavsfnpckpm) (Remote host closed the connection)
  57. # [03:45] * Joins: dglazkov (~dglazkov@75-37-194-175.lightspeed.lsatca.sbcglobal.net)
  58. # [03:47] * Joins: paul_irish (~paul_iris@66.109.106.132)
  59. # [03:49] * Joins: erlehmann_ (~erlehmann@dslb-088-075-059-120.pools.arcor-ip.net)
  60. # [03:52] * Quits: erlehmann (~erlehmann@dslb-094-223-090-112.pools.arcor-ip.net) (Ping timeout: 245 seconds)
  61. # [03:58] * Joins: roc (~roc@121.98.230.221)
  62. # [03:59] * Joins: 30BAAVWXX (~homata@h219-110-13-055.catv02.itscom.jp)
  63. # [04:05] * Quits: othermaciej (~mjs@17.246.18.24) (Quit: othermaciej)
  64. # [04:15] * Quits: jamesr (~jamesr@nat/google/x-erbdjrcipujvabzo) (Quit: jamesr)
  65. # [04:26] * Quits: paul_irish (~paul_iris@66.109.106.132) (Remote host closed the connection)
  66. # [04:27] * Joins: nimbupani (~nimbupani@c-24-22-131-46.hsd1.wa.comcast.net)
  67. # [04:31] * Quits: aho (~nya@fuld-4d00d239.pool.mediaWays.net) (Quit: EXEC_over.METHOD_SUBLIMATION)
  68. # [04:35] * Quits: jwalden (~waldo@nat/mozilla/x-tvmlslwqgykcklui) (Quit: ChatZilla 0.9.86-rdmsoft [XULRunner 1.9.2.7/20100723203751])
  69. # [04:39] * Joins: jamesr (~jamesr@c-24-6-103-140.hsd1.ca.comcast.net)
  70. # [04:44] * Quits: jamesr (~jamesr@c-24-6-103-140.hsd1.ca.comcast.net) (Client Quit)
  71. # [04:45] * Quits: romeo_ (~romeo__@x1-6-00-07-95-57-08-bb.k38.webspeed.dk) (Quit: Leaving)
  72. # [04:50] * Quits: dglazkov (~dglazkov@75-37-194-175.lightspeed.lsatca.sbcglobal.net) (Quit: dglazkov)
  73. # [04:55] * Joins: dbaron (~dbaron@c-98-234-51-190.hsd1.ca.comcast.net)
  74. # [05:03] * Quits: weinig (~weinig@2620:0:1b00:1191:223:32ff:feaf:7f36) (Ping timeout: 248 seconds)
  75. # [05:03] * Joins: paul_irish (~paul_iris@c-76-21-40-62.hsd1.ca.comcast.net)
  76. # [05:10] * Joins: jwalden (~waldo@c-71-202-165-226.hsd1.ca.comcast.net)
  77. # [05:11] * Joins: MikeSmithX (~MikeSmith@EM111-188-29-155.pool.e-mobile.ne.jp)
  78. # [05:11] * Quits: MikeSmithX (~MikeSmith@EM111-188-29-155.pool.e-mobile.ne.jp) (Remote host closed the connection)
  79. # [05:14] * Quits: MikeSmith (~MikeSmith@EM114-48-43-237.pool.e-mobile.ne.jp) (Ping timeout: 240 seconds)
  80. # [05:39] * Joins: cgcardona_androi (~cgcardona@72-62-165-117.pools.spcsdns.net)
  81. # [05:47] * Quits: 30BAAVWXX (~homata@h219-110-13-055.catv02.itscom.jp) (Quit: Leaving...)
  82. # [05:51] * Joins: cardona507 (~cardona50@c-67-180-160-250.hsd1.ca.comcast.net)
  83. # [05:58] * cgcardona_androi is now known as cgcardonaAndroid
  84. # [06:02] * Joins: dglazkov (~dglazkov@75-37-194-175.lightspeed.lsatca.sbcglobal.net)
  85. # [06:03] * Quits: dglazkov (~dglazkov@75-37-194-175.lightspeed.lsatca.sbcglobal.net) (Client Quit)
  86. # [06:26] * Quits: cgcardonaAndroid (~cgcardona@72-62-165-117.pools.spcsdns.net) (Quit: Bye)
  87. # [06:27] * Joins: cgcardona_androi (~cgcardona@72-62-165-117.pools.spcsdns.net)
  88. # [06:29] * cgcardona_androi is now known as cgcardonaAndroid
  89. # [06:43] * Joins: mamund_ (mamund@frost.nullshells.net)
  90. # [06:43] * Quits: nimbupani (~nimbupani@c-24-22-131-46.hsd1.wa.comcast.net) (Quit: nimbupani)
  91. # [06:44] * Quits: mamund (mamund@2001:49f0:1017::2) (*.net *.split)
  92. # [06:44] * Quits: TabAtkins (~tabatkins@nat/google/x-wqjlvkrenxgjbpmj) (*.net *.split)
  93. # [06:44] * Quits: mcarter (~mcarter@c-69-181-57-153.hsd1.ca.comcast.net) (*.net *.split)
  94. # [06:44] * Quits: Philip` (~philip@zaynar.co.uk) (*.net *.split)
  95. # [06:44] * Quits: aboodman (~aa@nat/google/x-pbgoxblqsrbyqbxz) (*.net *.split)
  96. # [06:44] * Quits: ksemeks (~ksemeks@alpha.linux.hr) (*.net *.split)
  97. # [06:44] * Quits: s0enke (~soenke@naturalborngrillers.org) (*.net *.split)
  98. # [06:46] * Joins: ksemeks (~ksemeks@alpha.linux.hr)
  99. # [06:47] * Joins: TabAtkins (~tabatkins@nat/google/x-hfpxyfagznfrttki)
  100. # [06:47] * Joins: Philip` (~philip@zaynar.co.uk)
  101. # [06:47] * Joins: aboodman (~aa@nat/google/x-xkawaiozmfdnqzyj)
  102. # [06:49] * Joins: s0enke (~soenke@naturalborngrillers.org)
  103. # [06:50] * Joins: mcarter (~mcarter@c-69-181-57-153.hsd1.ca.comcast.net)
  104. # [07:03] * Joins: weinig (~weinig@c-69-181-125-223.hsd1.ca.comcast.net)
  105. # [07:13] * Quits: weinig (~weinig@c-69-181-125-223.hsd1.ca.comcast.net) (Quit: weinig)
  106. # [07:28] * Joins: paradisaeidae (~chatzilla@ppp167-253-233.static.internode.on.net)
  107. # [07:31] * Quits: titacgs (~titacgs@201.250.188.146) (Ping timeout: 258 seconds)
  108. # [07:34] * Quits: cgcardonaAndroid (~cgcardona@72-62-165-117.pools.spcsdns.net) (Quit: Bye)
  109. # [07:36] * Joins: boblet (~boblet@p1201-ipbf709osakakita.osaka.ocn.ne.jp)
  110. # [08:02] * Quits: cardona507 (~cardona50@c-67-180-160-250.hsd1.ca.comcast.net) (Quit: zzzzz)
  111. # [08:17] * Joins: MikeSmith (~MikeSmith@EM114-48-93-226.pool.e-mobile.ne.jp)
  112. # [08:25] * Disconnected
  113. # [08:26] * Attempting to rejoin channel #whatwg
  114. # [08:27] * Rejoined channel #whatwg
  115. # [08:27] * Topic is 'WHATWG: http://www.whatwg.org/ -- logs: http://krijnhoetmer.nl/irc-logs/ -- stats: http://gavinsharp.com/irc/whatwg.html -- Please leave your sense of logic at the door, thanks!'
  116. # [08:27] * Set by annevk42 on Mon Oct 19 23:03:06
  117. # [08:37] * Joins: MikeSmithX (~MikeSmith@EM114-48-52-11.pool.e-mobile.ne.jp)
  118. # [08:38] * Quits: MikeSmithX (~MikeSmith@EM114-48-52-11.pool.e-mobile.ne.jp) (Client Quit)
  119. # [08:40] * Quits: MikeSmith (~MikeSmith@EM114-48-93-226.pool.e-mobile.ne.jp) (Ping timeout: 276 seconds)
  120. # [08:45] * Quits: Amorphous (jan@unaffiliated/amorphous) (Ping timeout: 260 seconds)
  121. # [08:59] * Joins: Amorphous (jan@unaffiliated/amorphous)
  122. # [09:00] * Joins: Heimidal (~heimidal@unaffiliated/heimidal)
  123. # [09:04] * Quits: TabAtkins_ (~tabatkins@nat/google/x-kwljvmquaxgjkwte) (Read error: Operation timed out)
  124. # [09:12] * Quits: roc (~roc@121.98.230.221) (Quit: roc)
  125. # [09:15] * Joins: MikeSmith (~MikeSmith@EM114-48-52-11.pool.e-mobile.ne.jp)
  126. # [09:16] * Quits: jwalden (~waldo@c-71-202-165-226.hsd1.ca.comcast.net) (Quit: ChatZilla 0.9.86-rdmsoft [XULRunner 1.9.2.7/20100723203751])
  127. # [09:16] * Quits: paradisaeidae (~chatzilla@ppp167-253-233.static.internode.on.net) (Quit: ChatZilla 0.9.86 [Firefox 3.6.8/20100722155716])
  128. # [09:24] * Quits: dbaron (~dbaron@c-98-234-51-190.hsd1.ca.comcast.net) (Quit: g'night)
  129. # [09:41] * Joins: oal (~oal@5.79-160-122.customer.lyse.net)
  130. # [10:25] * Joins: Maurice (copyman@5ED573FA.cable.ziggo.nl)
  131. # [10:26] * Joins: svl (~me@ip565744a7.direct-adsl.nl)
  132. # [10:28] * Joins: lucasalvini (~luca@host12-56-dynamic.7-87-r.retail.telecomitalia.it)
  133. # [10:29] * Quits: lucasalvini (~luca@host12-56-dynamic.7-87-r.retail.telecomitalia.it) (Client Quit)
  134. # [10:30] * Quits: MikeSmith (~MikeSmith@EM114-48-52-11.pool.e-mobile.ne.jp) (Quit: This computer has gone to sleep)
  135. # [10:36] * Joins: ROBOd (~robod@89.123.152.174)
  136. # [10:37] * Quits: kinetik (~kinetik@121.98.132.55) (Read error: Operation timed out)
  137. # [10:39] * Joins: kinetik (~kinetik@121.98.132.55)
  138. # [10:41] * Quits: micheil (~micheil@124-170-235-35.dyn.iinet.net.au) (Quit: micheil)
  139. # [10:45] * Joins: payman_s (~payman@81-229-81-24-no65.tbcn.telia.com)
  140. # [10:47] * Joins: micheil (~micheil@124-170-235-35.dyn.iinet.net.au)
  141. # [10:53] * Joins: MikeSmith (~MikeSmith@EM114-48-52-11.pool.e-mobile.ne.jp)
  142. # [10:54] * Quits: MikeSmith (~MikeSmith@EM114-48-52-11.pool.e-mobile.ne.jp) (Client Quit)
  143. # [11:15] * Joins: eighty4 (~eighty4@c-76c8e455.012-403-6c6b701.cust.bredbandsbolaget.se)
  144. # [11:22] * Quits: Maurice (copyman@5ED573FA.cable.ziggo.nl)
  145. # [11:22] * Joins: Maurice (copyman@5ED573FA.cable.ziggo.nl)
  146. # [11:26] * Quits: Maurice (copyman@5ED573FA.cable.ziggo.nl) (Client Quit)
  147. # [11:28] * Quits: kbrosnan (~kbrosnan@ip24-250-54-36.ri.ri.cox.net) (Ping timeout: 240 seconds)
  148. # [11:30] * Joins: Maurice (copyman@5ED573FA.cable.ziggo.nl)
  149. # [11:32] * Joins: kbrosnan (~kbrosnan@ip24-250-54-36.ri.ri.cox.net)
  150. # [11:33] * Quits: ksemeks (~ksemeks@alpha.linux.hr) (Ping timeout: 264 seconds)
  151. # [11:33] * Joins: ksemeks (~ksemeks@alpha.linux.hr)
  152. # [11:37] * Joins: workmad3 (~workmad3@cpc3-bagu10-0-0-cust651.1-3.cable.virginmedia.com)
  153. # [11:39] * Quits: hamcore (hamcore@unaffiliated/msmosso)
  154. # [11:53] * Joins: 30BAAVX6J (~maikmerte@port-92-201-5-188.dynamic.qsc.de)
  155. # [11:53] * Quits: 30BAAVX6J (~maikmerte@port-92-201-5-188.dynamic.qsc.de) (Remote host closed the connection)
  156. # [11:53] * Joins: maikmerten (~maikmerte@port-92-201-5-188.dynamic.qsc.de)
  157. # [12:01] * Joins: MikeSmith (~MikeSmith@EM114-48-52-11.pool.e-mobile.ne.jp)
  158. # [12:09] * Joins: ZombieLoffe (~e@unaffiliated/zombieloffe)
  159. # [12:50] <annevk> so only Mozilla has implemented ele.spellcheck and they do it as a boolean rather than enumerable attribute as the spec requires?
  160. # [12:50] <jgraham> annevk: Yes
  161. # [12:51] <annevk> I guess the spec wanted it to be in sync with .contentEditable
  162. # [12:51] * Quits: Alystair (Alystair@bas1-toronto10-1279558760.dsl.bell.ca) (Ping timeout: 260 seconds)
  163. # [12:51] <jgraham> hybi list fail - Thomas was blue not green. Henry was green. (and James was red)
  164. # [12:51] <annevk> maybe even on my request
  165. # [12:52] <annevk> jgraham, euh?!
  166. # [12:52] <jgraham> Yeah, I think the spec isn't going to happen
  167. # [12:53] <jgraham> annevk: Greg's extension of Hixie's transport analogy had a vehicle that is green, runs on rails, and answers to the name Thomas
  168. # [12:53] <jgraham> But Thomas was blue
  169. # [12:54] <jgraham> (possibly Thomas the Tank Engine is an Anglo-American curio)
  170. # [12:55] <annevk> Filed a bug on spellcheck.
  171. # [12:55] * Quits: workmad3 (~workmad3@cpc3-bagu10-0-0-cust651.1-3.cable.virginmedia.com) (Remote host closed the connection)
  172. # [12:55] <jgraham> annevk: I already filed a bug indicating that throwing SYNTAX_ERR wasn't going to work
  173. # [12:59] * Quits: seventh (seventh@64-9-175-104.fwd.datafoundry.com) (Ping timeout: 258 seconds)
  174. # [13:07] * Joins: Rik` (~Rik`@ill67-3-88-164-129-230.fbx.proxad.net)
  175. # [13:10] * Quits: Smylers (~smylers@host86-162-120-82.range86-162.btcentralplus.com) (Ping timeout: 245 seconds)
  176. # [13:11] * Joins: Smylers (~smylers@host86-162-120-82.range86-162.btcentralplus.com)
  177. # [13:33] <MikeSmith> annevk: I added an "HTML elements organized by function" section to the HtmlR doc -
  178. # [13:33] <MikeSmith> http://dev.w3.org/html5/markup/elements-by-function.html
  179. # [13:33] <MikeSmith> (I think you had suggested it should have one)
  180. # [13:51] * Quits: svl (~me@ip565744a7.direct-adsl.nl) (Quit: And back he spurred like a madman, shrieking a curse to the sky.)
  181. # [13:52] * Quits: Rik` (~Rik`@ill67-3-88-164-129-230.fbx.proxad.net) (Remote host closed the connection)
  182. # [14:12] <annevk> ah yeah
  183. # [14:12] <erlehmann_> annevk, I want to do some very light DOM manipulation in PHP and intend to use html5lib to get the DOM. Two question: First, is that a good choice? Second, what solution would you recommend to manipulate said DOM in PHP?
  184. # [14:12] <annevk> well, I suggested the main draft would be done in that way, but I suppose this works too
  185. # [14:12] * erlehmann_ is now known as erlehmann
  186. # [14:12] <annevk> erlehmann_, I don't have experience with the PHP html5lib unfortunately
  187. # [14:13] <annevk> erlehmann, nor with PHP DOM manipulation :(
  188. # [14:13] <annevk> erlehmann, though overall that sounds like the best way if you plan on using PHP
  189. # [14:13] <erlehmann> just saw you as project owner on google code. that's the python part then, right?
  190. # [14:14] <jgraham> erlehmann: Well I think PHP isn't a good choice :)
  191. # [14:14] <jgraham> But if hat is a constraint then html5lib is a good choice for parsing the HTML
  192. # [14:14] <jgraham> as long as speed is not your main concern
  193. # [14:15] <jgraham> gsnedders and ezyang were mainly responsible for the PHP verson
  194. # [14:15] <erlehmann> jgraham, i agree wholeheartedly. when i applied for an internship in early 2008 and they asked me if i knew PHP, i told them why i hate it.
  195. # [14:15] <annevk> erlehmann, oh, yeah, I did the original Python version in part way back though jgraham knows and did more :)
  196. # [14:15] <erlehmann> but right now i have a gsoc project to finish.
  197. # [14:16] <erlehmann> and since i am writing a wordpress plugin … well ;)
  198. # [14:16] <jgraham> Ah
  199. # [14:16] <jgraham> You chose the wrong problem
  200. # [14:16] <jgraham> :)
  201. # [14:17] <jgraham> It is at least worth trying using PHP html5lib
  202. # [14:17] * Quits: ZombieLoffe (~e@unaffiliated/zombieloffe)
  203. # [14:17] <erlehmann> probably. but all my friends are using wordpress.
  204. # [14:17] <erlehmann> and me too. though i will look into habari Really Soon Now [TM]
  205. # [14:18] <jgraham> That's PHP too, right?
  206. # [14:18] <erlehmann> yeah. i should probably bully my hoster into getting me some WSGI goodness so i can install a python-based imageboard instead of my boring old blog.
  207. # [14:20] * Joins: smaug_ (~chatzilla@cs181150024.pp.htv.fi)
  208. # [14:21] <erlehmann> i'll look if PHP Simple HTML DOM Parser does it for me. i do not have that many edge cases and it looks nice and usable.
  209. # [14:22] * Quits: smaug (~chatzilla@80-186-88-120.elisa-mobile.fi) (Ping timeout: 260 seconds)
  210. # [14:23] * smaug_ is now known as smaug
  211. # [14:34] * Quits: yutak_home (~kee@U017209.ppp.dion.ne.jp) (Quit: Ex-Chat)
  212. # [14:35] * Joins: cedricv (~cedric@202.152.243.247)
  213. # [14:40] * Quits: MikeSmith (~MikeSmith@EM114-48-52-11.pool.e-mobile.ne.jp) (Ping timeout: 240 seconds)
  214. # [14:45] * Joins: MikeSmith (~MikeSmith@EM114-48-222-150.pool.e-mobile.ne.jp)
  215. # [14:46] * Quits: MikeSmith (~MikeSmith@EM114-48-222-150.pool.e-mobile.ne.jp) (Client Quit)
  216. # [14:47] * Joins: MikeSmith (~MikeSmith@EM114-48-222-150.pool.e-mobile.ne.jp)
  217. # [15:00] * Quits: erlehmann (~erlehmann@dslb-088-075-059-120.pools.arcor-ip.net) (Ping timeout: 246 seconds)
  218. # [15:06] * Joins: henrikbjorn (~hb@c83-249-65-174.bredband.comhem.se)
  219. # [15:09] * Quits: henrikbjorn (~hb@c83-249-65-174.bredband.comhem.se) (Remote host closed the connection)
  220. # [15:18] * Joins: henrikbjorn (~hb@c83-249-65-174.bredband.comhem.se)
  221. # [15:25] <gsnedders> The PHP html5lib is really quite out of date
  222. # [15:25] <gsnedders> There's access to the libxml HTML parser from the DOM extension
  223. # [15:35] * Quits: Martijnc (~Martijnc@91.176.44.232) (Ping timeout: 245 seconds)
  224. # [15:35] * Quits: henrikbjorn (~hb@c83-249-65-174.bredband.comhem.se) (Remote host closed the connection)
  225. # [15:40] * Joins: Martijnc (~Martijnc@91.176.77.23)
  226. # [15:47] * Quits: MikeSmith (~MikeSmith@EM114-48-222-150.pool.e-mobile.ne.jp) (Quit: This computer has gone to sleep)
  227. # [15:51] * Parts: mikekelly (mikek@s3x0r.biz)
  228. # [15:52] * Joins: MikeSmith (~MikeSmith@EM114-48-222-150.pool.e-mobile.ne.jp)
  229. # [15:52] * Quits: MikeSmith (~MikeSmith@EM114-48-222-150.pool.e-mobile.ne.jp) (Client Quit)
  230. # [15:53] * Joins: MikeSmith (~MikeSmith@EM114-48-222-150.pool.e-mobile.ne.jp)
  231. # [15:53] * Joins: kennyluck (~kennyluck@EM114-48-214-150.pool.e-mobile.ne.jp)
  232. # [15:56] * Quits: boblet (~boblet@p1201-ipbf709osakakita.osaka.ocn.ne.jp) (Quit: boblet)
  233. # [16:07] * Joins: bobchao (~cctw@118-171-85-191.dynamic.hinet.net)
  234. # [16:23] * Joins: workmad3 (~workmad3@cpc3-bagu10-0-0-cust651.1-3.cable.virginmedia.com)
  235. # [16:37] * Quits: workmad3 (~workmad3@cpc3-bagu10-0-0-cust651.1-3.cable.virginmedia.com) (Remote host closed the connection)
  236. # [16:46] * Joins: titacgs (~titacgs@201.250.188.146)
  237. # [17:05] * Joins: workmad3 (~workmad3@cpc3-bagu10-0-0-cust651.1-3.cable.virginmedia.com)
  238. # [17:24] * Joins: bobchao1 (~cctw@140.109.16.221)
  239. # [17:24] * Quits: bobchao (~cctw@118-171-85-191.dynamic.hinet.net) (Ping timeout: 240 seconds)
  240. # [17:38] * Quits: maikmerten (~maikmerte@port-92-201-5-188.dynamic.qsc.de) (Remote host closed the connection)
  241. # [17:47] * Joins: ZombieLoffe (~e@unaffiliated/zombieloffe)
  242. # [17:50] * Quits: workmad3 (~workmad3@cpc3-bagu10-0-0-cust651.1-3.cable.virginmedia.com) (Remote host closed the connection)
  243. # [18:06] * Quits: bobchao1 (~cctw@140.109.16.221) (Ping timeout: 240 seconds)
  244. # [18:14] * Joins: bobchao (~cctw@118-171-85-191.dynamic.hinet.net)
  245. # [18:16] * Joins: cgcardona_androi (~cgcardona@184-195-217-34.pools.spcsdns.net)
  246. # [18:18] * Quits: cgcardona_androi (~cgcardona@184-195-217-34.pools.spcsdns.net) (Client Quit)
  247. # [18:18] * Joins: cgcardona_androi (~cgcardona@184-195-217-34.pools.spcsdns.net)
  248. # [18:19] * Quits: cgcardona_androi (~cgcardona@184-195-217-34.pools.spcsdns.net) (Remote host closed the connection)
  249. # [18:20] * Joins: cgcardona_androi (~cgcardona@184-195-217-34.pools.spcsdns.net)
  250. # [18:21] * Quits: cgcardona_androi (~cgcardona@184-195-217-34.pools.spcsdns.net) (Client Quit)
  251. # [18:25] * Joins: cardona507 (~cardona50@c-67-180-160-250.hsd1.ca.comcast.net)
  252. # [18:31] * Joins: annevk5 (~annevk@5355737B.cable.casema.nl)
  253. # [18:57] * Quits: daedb (~daed@78-72-108-100-no178.tbcn.telia.com) (Remote host closed the connection)
  254. # [19:00] * Joins: Rik` (~Rik`@ill67-3-88-164-129-230.fbx.proxad.net)
  255. # [19:06] * Quits: ZombieLoffe (~e@unaffiliated/zombieloffe)
  256. # [19:13] * Joins: seventh (seventh@64-9-158-247.fwd.datafoundry.com)
  257. # [19:19] * Quits: cedricv (~cedric@202.152.243.247)
  258. # [19:22] * Quits: cardona507 (~cardona50@c-67-180-160-250.hsd1.ca.comcast.net) (Quit: zzzzz)
  259. # [19:32] * Quits: drunknbass (~drunknbas@76.91.255.83) (Remote host closed the connection)
  260. # [19:38] * Joins: drunknbass (~drunknbas@76.91.255.83)
  261. # [19:57] * Quits: Rik` (~Rik`@ill67-3-88-164-129-230.fbx.proxad.net) (Remote host closed the connection)
  262. # [20:09] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  263. # [20:12] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Remote host closed the connection)
  264. # [20:14] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  265. # [20:19] <Hixie> jgraham: yeah i considered saying gordon was green and thomas wasn't an eletric locomotive, but i figured that was maybe being pedantic about hte wrong thing :-)
  266. # [20:20] <Hixie> wait, gordon was blue
  267. # [20:20] <Hixie> man it's been too long
  268. # [20:20] <Hixie> (or possibly not long enough)
  269. # [20:21] <Workshiva> The former
  270. # [20:21] <annevk5> now you mention trains, apparently there's a Marklin shop here in Utrecht
  271. # [20:21] <Workshiva> Thomas is awesome
  272. # [20:21] <annevk5> thought of your train set when I saw that :)
  273. # [20:22] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Remote host closed the connection)
  274. # [20:22] <Hixie> :-)
  275. # [20:27] <jgraham> gordon was indeed blue
  276. # [20:27] <Workshiva> Gordon was the fat Thomas
  277. # [20:27] <Workshiva> That's how I always thought of him
  278. # [20:28] <Hixie> thomas was a switcher, gordon was for long haul... though i don't think the people who wrote the stories understood the difference
  279. # [20:28] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  280. # [20:29] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Remote host closed the connection)
  281. # [20:29] <Lachy> Percy was the green one.
  282. # [20:30] * gsnedders finally realizes what you're on about
  283. # [20:30] <gsnedders> Oh man… Bunch of kids.
  284. # [20:32] <Workshiva> Yeah, that hurts bad coming from you
  285. # [20:35] <jgraham> Heh, I see that I didn't make abarth's list of people from browser vendors who are worth listening to
  286. # [20:35] <jgraham> I guess I should try harder or something
  287. # [20:36] <Workshiva> Maybe he doesn't know you're from a browser vendor
  288. # [20:36] * Joins: ttepasse (~ttepasse@ip-109-90-160-217.unitymediagroup.de)
  289. # [20:36] <jgraham> I suppose that is possible
  290. # [20:36] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  291. # [20:37] <jgraham> But it seems unlikely
  292. # [20:37] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Max SendQ exceeded)
  293. # [20:37] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  294. # [20:37] * Joins: MikeSmithX (~MikeSmith@EM114-48-33-90.pool.e-mobile.ne.jp)
  295. # [20:38] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Max SendQ exceeded)
  296. # [20:38] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  297. # [20:39] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Max SendQ exceeded)
  298. # [20:39] * Quits: titacgs (~titacgs@201.250.188.146) (Ping timeout: 260 seconds)
  299. # [20:39] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  300. # [20:39] <Philip`> jgraham: The list was only a "for example", and looks like it's intentionally listing one person per browser vendor
  301. # [20:39] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Max SendQ exceeded)
  302. # [20:40] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  303. # [20:40] * Quits: MikeSmithX (~MikeSmith@EM114-48-33-90.pool.e-mobile.ne.jp) (Client Quit)
  304. # [20:40] <gsnedders> Hixie: How does you writing a separate Web Sockets spec to the IETF one help? Would you keep writing your spec if browser buy-in stuck with the IETF branch?
  305. # [20:40] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Max SendQ exceeded)
  306. # [20:41] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  307. # [20:41] * Quits: MikeSmith (~MikeSmith@EM114-48-222-150.pool.e-mobile.ne.jp) (Ping timeout: 240 seconds)
  308. # [20:41] <Hixie> gsnedders: no, if browsers aren't on board it's like with websql, i'd stop editing
  309. # [20:41] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Max SendQ exceeded)
  310. # [20:41] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  311. # [20:41] <gsnedders> Hixie: That wasn't quite clear on the list.
  312. # [20:42] <Hixie> well if i didn't i'd just be writing pointless fiction that didn't affect anyone anyway
  313. # [20:42] <Hixie> so it's rather moot
  314. # [20:42] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Max SendQ exceeded)
  315. # [20:43] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  316. # [20:43] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Max SendQ exceeded)
  317. # [20:43] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  318. # [20:44] * Joins: erlehmann (~erlehmann@dslb-088-075-059-120.pools.arcor-ip.net)
  319. # [20:44] * Quits: ttepasse (~ttepasse@ip-109-90-160-217.unitymediagroup.de) (Read error: Operation timed out)
  320. # [20:44] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Max SendQ exceeded)
  321. # [20:44] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  322. # [20:45] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Max SendQ exceeded)
  323. # [20:45] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  324. # [20:45] <jgraham> Philip`: But I want to be important :p
  325. # [20:46] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Max SendQ exceeded)
  326. # [20:46] * Joins: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br)
  327. # [20:46] <Workshiva> jgraham: Clearly you need to eliminate the Opera employees before you in the ranking
  328. # [20:46] <Workshiva> That way you end up on the next list
  329. # [20:47] * Quits: cristianl (~cristianl@201-40-169-13.paemt704.dsl.brasiltelecom.net.br) (Max SendQ exceeded)
  330. # [20:47] <Philip`> jgraham: You could be just a smidgen behind annevk5 on the perceived importance scale
  331. # [20:47] <Philip`> Or you could be right at the bottom
  332. # [20:48] <Philip`> so you should eliminate every single Opera employee, just to be sure
  333. # [20:48] * Joins: MikeSmith (~MikeSmith@EM114-48-33-90.pool.e-mobile.ne.jp)
  334. # [20:48] <jgraham> Hmm
  335. # [20:49] <jgraham> How to kill the dutch?
  336. # [20:49] <Hixie> you'd be pretty important if you went on a rampaging homocide stream, but i'd urge you to consider if that's the right kind of importance for you
  337. # [20:49] <jgraham> Maybe make a series of tiny holes in their levees
  338. # [20:49] <Hixie> streak, even
  339. # [20:49] <Workshiva> You could crash the tulip market
  340. # [20:53] <jgraham> All of this sounds like too much effort really
  341. # [20:53] <annevk5> clearly abarth should be arrested for inciting violence
  342. # [20:54] <jgraham> I think I will just have to develop a zen-like perspective on my own insignificance
  343. # [20:56] <annevk5> is this my cue for saying you're not? or something? ;p
  344. # [20:56] <jgraham> No
  345. # [20:56] <jgraham> I am developing an indifference to it, remember
  346. # [20:56] <jgraham> If you say I'm not it will only confuse and upset me
  347. # [20:57] <jgraham> So I might go back to plotting to kill the Dutch
  348. # [20:57] <annevk5> I think I stand by my original statement
  349. # [21:01] * Joins: bentruyman (~bentruyma@c-71-194-42-115.hsd1.il.comcast.net)
  350. # [21:06] <jgraham> I had forgotten about Percy
  351. # [21:07] * Joins: cristianl (~cristianl@201-34-114-225.paemt704.dsl.brasiltelecom.net.br)
  352. # [21:07] <jgraham> Is it me or does Sordor sound uncomfortably like Mordor
  353. # [21:07] * Quits: cristianl (~cristianl@201-34-114-225.paemt704.dsl.brasiltelecom.net.br) (Max SendQ exceeded)
  354. # [21:08] * Joins: cristianl (~cristianl@201-34-114-225.paemt704.dsl.brasiltelecom.net.br)
  355. # [21:08] * Quits: cristianl (~cristianl@201-34-114-225.paemt704.dsl.brasiltelecom.net.br) (Excess Flood)
  356. # [21:08] * Joins: cristianl (~cristianl@201-34-114-225.paemt704.dsl.brasiltelecom.net.br)
  357. # [21:08] <jgraham> I would never have liked Thomas The Tank Engine so much if I had thought he was mainly carrying Orcs
  358. # [21:08] <jgraham> s/Sordor/Sodor/
  359. # [21:08] <jgraham> Which I guess makes a difference
  360. # [21:09] <jgraham> But still
  361. # [21:14] * Joins: dbaron (~dbaron@c-98-234-51-190.hsd1.ca.comcast.net)
  362. # [21:29] <gsnedders> Time to make me hate zcorpan again, and bring PHP html5lib up to date
  363. # [21:31] * Quits: erlehmann (~erlehmann@dslb-088-075-059-120.pools.arcor-ip.net) (Quit: Ex-Chat)
  364. # [21:32] <gsnedders> Uh, the Python tests don't run for me
  365. # [21:33] <gsnedders> jgraham: You broke running tests with UTF-16 Python
  366. # [21:35] * Quits: f1lt3r (~f1lt3r@64.119.159.231) (Read error: Connection reset by peer)
  367. # [21:46] <jgraham> gsnedders: Ah, I think I expected that
  368. # [21:46] <jgraham> I may even have mentioned it in the commit log
  369. # [21:46] <jgraham> But I had no easy way to test
  370. # [21:47] <jgraham> gsnedders: (that is a lame excuse, yes, but I didn't really have time to fix it then)
  371. # [21:48] <jgraham> gsnedders: http://code.google.com/p/html5lib/source/detail?r=964568c175092c45156fe5a32a211e0d5d3781d8
  372. # [21:48] <jgraham> Probably
  373. # [21:48] <gsnedders> jgraham: There's lots of breakage, not just that
  374. # [21:49] <jgraham> Oh
  375. # [21:49] <gsnedders> Like, creating http://code.google.com/p/html5lib/source/detail?r=964568c175092c45156fe5a32a211e0d5d3781d8
  376. # [21:49] <gsnedders> Um, wrong clipboard
  377. # [21:49] <gsnedders> encode_entity_map
  378. # [21:49] <gsnedders> That throws an exception. :)
  379. # [21:50] <gsnedders> Which means import html5lib fails :)
  380. # [21:50] <jgraham> gsnedders: That has nothing to do with me
  381. # [21:50] <jgraham> Possibly
  382. # [21:50] <jgraham> Unless it was adding more entities that broke it
  383. # [21:50] <jgraham> Which is just silly
  384. # [21:51] * Joins: f1lt3r (~f1lt3r@64.119.159.231)
  385. # [21:52] <gsnedders> Adding non-BMP entities for the first time would
  386. # [21:52] <gsnedders> Now, to actually get tests passing instead of merely running
  387. # [21:55] * Quits: kennyluck (~kennyluck@EM114-48-214-150.pool.e-mobile.ne.jp) (Ping timeout: 240 seconds)
  388. # [22:00] * Joins: kennyluck (~kennyluck@EM114-48-29-14.pool.e-mobile.ne.jp)
  389. # [22:08] <gsnedders> Huh, now I really don't get what's going on.
  390. # [22:08] <gsnedders> I appear to be hitting a data corruption bug in Python
  391. # [22:10] <gsnedders> Hah. This is awesome.
  392. # [22:11] <gsnedders> Negative lookbehind assertion in regexp causing data corruption.
  393. # [22:12] <Philip`> Got a test case?
  394. # [22:13] <gsnedders> Oh, no
  395. # [22:13] <gsnedders> I see what's going on
  396. # [22:14] <gsnedders> Hah, that is evil.
  397. # [22:14] <gsnedders> I can't write code.
  398. # [22:15] <gsnedders> Also: I just introduced a bug without breaking any tests.
  399. # [22:15] <gsnedders> We need more tests.
  400. # [22:16] <jgraham> What bug?
  401. # [22:18] <gsnedders> Stripping lone surrogate bytes would also strip the byte where the other half of the surrogate should be
  402. # [22:18] <jgraham> You would always remove two bytes rather than one?
  403. # [22:20] <gsnedders> Four bytes, two characters.
  404. # [22:20] <jgraham> So a test with {lone surrogate}{other} -> {replacemnt}{other} would be sufficient to test it
  405. # [22:21] <gsnedders> Yeah, I've added that
  406. # [22:21] <gsnedders> https://code.google.com/p/html5lib/source/detail?r=46df29539c714df260a18f280dfeaf96e7af62c5
  407. # [22:21] <jgraham> Hah, byte counting fail :)
  408. # [22:23] <gsnedders> I haven't tested on UCS4, but I've made no change to the code it uses effectively
  409. # [22:23] * gsnedders wonders where his UCS4 build is
  410. # [22:23] <jgraham> gsnedders: I'm glad it was clear how I intended it to be patched for the UCS2 case :)
  411. # [22:24] * Joins: titacgs (~titacgs@201.250.188.146)
  412. # [22:24] <jgraham> gsnedders: Why remove the null from the regexp?
  413. # [22:24] <jgraham> Surely it is faster with it in?
  414. # [22:25] <gsnedders> More common code.
  415. # [22:25] <jgraham> Also, your patch is wrong
  416. # [22:25] <gsnedders> How?
  417. # [22:25] <jgraham> It doesn't take account of lone surrogates at the end of chunks
  418. # [22:26] <gsnedders> That's not a new issue
  419. # [22:27] <jgraham> Isn't it?
  420. # [22:27] <gsnedders> Well, we at least throw parse errors in that case
  421. # [22:27] <gsnedders> So it would fail tokenizer tests for that
  422. # [22:28] <jgraham> Can you actually end up with a non-lone surrogate at the end of the chunk in the UCS4 case?
  423. # [22:29] <jgraham> i.e. can you actually split the surrogate pair?
  424. # [22:30] <jgraham> It depends if we are reading bytes or characters
  425. # [22:30] <gsnedders> In the UCS4 case? No, you can never have a valid surrogate.
  426. # [22:31] <gsnedders> In the UCS2 case? Sure.
  427. # [22:31] <jgraham> Right, so we don't have the bug in the UCS4 case
  428. # [22:32] <jgraham> So the UCS2 patch is wrong in the sense that it misses a case that the UCS4 code covers
  429. # [22:33] <gsnedders> Indeed
  430. # [22:33] <gsnedders> But the UCS2 behaviour is already wrong in that case
  431. # [22:33] <gsnedders> And I've not made it any worse than it was before
  432. # [22:33] <jgraham> Agreed
  433. # [22:33] <jgraham> But the patch is still wrong :)
  434. # [22:33] <gsnedders> The patch is right, just incomplete. ;P
  435. # [22:33] <jgraham> However you want to think of it
  436. # [22:34] <jgraham> (it seems like since you are fixing it now, this would be a good time to make it right because otherwise we will have a subtle bug that will almost never happen but be reasonably surprising when it does)
  437. # [22:35] * Joins: svl (~me@ip565744a7.direct-adsl.nl)
  438. # [22:36] <jgraham> (you need to do roughly the same thing as the CR thing
  439. # [22:37] <jgraham> but bonus points for making it not that ugly)
  440. # [22:38] <gsnedders> It's harder than the CR thing
  441. # [22:38] <gsnedders> the CR thing is easy because you can just convert it to \n and ignore a LF in the next chunk
  442. # [22:39] <gsnedders> In this case you can't known what the right behaviour is until you get the next chunk… if there is a next chunk.
  443. # [22:43] <Philip`> Can you just stick the character onto the front of the next chunk?
  444. # [22:43] <jgraham> That might work
  445. # [22:43] <Philip`> or, uh, something like that
  446. # [22:43] * Philip` has no idea how the code works really
  447. # [22:43] <gsnedders> Philip`: What if there's no next chunk?
  448. # [22:44] <gsnedders> (That's the problem with that solution)
  449. # [22:44] <jgraham> gsnedders: You just make sure that having a character in the unget buffer menas there is a next chunk
  450. # [22:45] <jgraham> Which I think is straightforward with the current code
  451. # [22:48] <jgraham> (just do data = self._danglingCharacter + self.dataStream.read(chunkSize)
  452. # [22:49] <jgraham> and then if some_regexp.match(data[-1]): self._danglingCharacter = data[-1]; data = data[:-1]; else: self._danglingCharacter = ""
  453. # [22:49] <jgraham> )
  454. # [22:49] <jgraham> or something
  455. # [22:50] <jgraham> There's not even any need for a regexp
  456. # [22:51] <jgraham> (just use ord)
  457. # [22:51] * Quits: kennyluck (~kennyluck@EM114-48-29-14.pool.e-mobile.ne.jp) (Ping timeout: 260 seconds)
  458. # [22:52] <gsnedders> jgraham: I don't get how to use the unget buffer for that
  459. # [22:52] <gsnedders> With how unget works, that is
  460. # [22:54] <gsnedders> Like, there is no buffer for unget
  461. # [22:54] <jgraham> gsnedders: I mean you have to create one
  462. # [22:54] <jgraham> that's self._danglingCharacter above
  463. # [22:55] <jgraham> Sorry, I don't think I was very clear
  464. # [22:56] <gsnedders> Really you're still not :)
  465. # [22:57] * Joins: bobchao1 (~cctw@118-171-89-205.dynamic.hinet.net)
  466. # [22:57] <jgraham> gsnedders: All I'm saying is
  467. # [22:58] * Quits: bobchao (~cctw@118-171-85-191.dynamic.hinet.net) (Ping timeout: 252 seconds)
  468. # [22:58] <jgraham> If the last character is a \r or an unpaired surrogate, make a property that points to that character and slice it off the end of the chunk
  469. # [22:58] <jgraham> The next time we go to get a chunk, add it on the start
  470. # [22:59] <jgraham> this is sure to work because the signal for "we don't need no more chunks" is that readChunk returns nothing
  471. # [22:59] <jgraham> So there is a cost of one more cycle through readChunk if this is the last chunk
  472. # [23:00] <jgraham> but that isn't very common so it can be slow
  473. # [23:00] <jgraham> (we need to make sure we still do the right thing in that case of course)
  474. # [23:01] <jgraham> Then the only special magic we need is to make sure we detect when the last character is special and save it for next time
  475. # [23:01] <jgraham> Is that clearer, or am I talking nonsense?
  476. # [23:06] <gsnedders> That's clear
  477. # [23:07] * Joins: m_W (~mwilcox56@c-76-98-55-26.hsd1.nj.comcast.net)
  478. # [23:07] * Quits: dbaron (~dbaron@c-98-234-51-190.hsd1.ca.comcast.net) (Quit: 8403864 bytes have been tenured, next gc will be global.)
  479. # [23:08] <jgraham> I guess one special case is if the document is _only_ a \r character
  480. # [23:08] <jgraham> then you save the character but get an empty chunk back
  481. # [23:09] <jgraham> But you can probably deal with that where readChunk is called, or something
  482. # [23:10] * Quits: annevk5 (~annevk@5355737B.cable.casema.nl) (Quit: annevk5)
  483. # [23:12] <jgraham> Or in readchunk I guess
  484. # [23:12] <jgraham> Just by checking if length > 1 before you slice anything of
  485. # [23:12] <jgraham> f
  486. # [23:12] <jgraham> Which seems much simpler
  487. # [23:17] * Quits: ROBOd (~robod@89.123.152.174) (Quit: .)
  488. # [23:36] * Joins: baba (~sallabanc@69.50.70.12)
  489. # [23:59] * Quits: gavin_ (~gavin@firefox/developer/gavin) (Ping timeout: 240 seconds)
  490. # Session Close: Sun Aug 01 00:00:00 2010

The end :)