Options:
- # Session Start: Sun Feb 01 00:00:00 2009
- # Session Ident: #whatwg
- # [00:01] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [00:12] * Quits: nessy (n=nessy@124-168-147-235.dyn.iinet.net.au) ("This computer has gone to sleep")
- # [00:25] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [00:43] * Joins: tantek (n=tantek@c-67-161-5-143.hsd1.ca.comcast.net)
- # [00:52] <Lachy> I updated the element summary template. http://lachy.id.au/temp/template.html
- # [01:09] * Quits: Maurice (i=copyman@5ED548D4.cable.ziggo.nl) ("Disconnected...")
- # [01:10] * Joins: tantek_ (n=tantek@c-67-161-5-143.hsd1.ca.comcast.net)
- # [01:11] * Joins: dbaron (n=dbaron@c-98-234-51-190.hsd1.ca.comcast.net)
- # [01:17] * Quits: supergreg (n=Miranda@unaffiliated/supergreg) (Read error: 110 (Connection timed out))
- # [01:18] * Quits: svl (n=me@ip565744a7.direct-adsl.nl) ("And back he spurred like a madman, shrieking a curse to the sky.")
- # [01:26] * Quits: tantek (n=tantek@c-67-161-5-143.hsd1.ca.comcast.net) (Read error: 110 (Connection timed out))
- # [01:43] * Quits: tantek_ (n=tantek@c-67-161-5-143.hsd1.ca.comcast.net)
- # [01:59] * Joins: rubys (n=rubys@cpe-075-182-092-038.nc.res.rr.com)
- # [02:09] * Joins: tantek (n=tantek@adsl-71-135-124-193.dsl.pltn13.pacbell.net)
- # [02:20] <Lachy> I managed to write a simple python script using html5lib to read the spec source and find all the element summaries. Tomorrow, I should be able to extend the script to generate element summaries for the authoring guide
- # [02:21] <Hixie> neat
- # [02:21] <Hixie> let me know if you want me to do a POST to a particular url every time i do a commit
- # [02:21] <Hixie> that way you could have your spec autoregenerate when i update the spec
- # [02:21] <Lachy> yeah, it's a lot better than I had been doing, which involved a lot more manual work. This should at least spead up progress a lot
- # [02:23] <Lachy> you check in so frequently compared with me, and most of your changes won't affect the element summaries, so that wouldn't be worth the effort
- # [02:24] <Lachy> I would like some way to know that element summaries have changed, though, if possible, so I know I should regenerate that section.
- # [02:25] <Lachy> what do you think about the template design I linked above?
- # [02:25] <Lachy> any suggestions for improvement?
- # [02:40] * Joins: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [02:47] <Hixie> the DOM interface part is going to be huge for some elements
- # [02:48] <Hixie> i dunno
- # [02:48] <Hixie> are the categories and suchlike going to be links?
- # [02:49] <Hixie> maybe the attributes and IDL should both just be <dl>s with name/definition pairs
- # [02:50] <karlcow> Lachy: feel free to share it somewhere ;) I mean the script
- # [02:50] <Lachy> yeah, they will all be linked once anolis generates them
- # [02:50] <Lachy> karlcow, it will be checked into CVS once it's finished
- # [02:50] <karlcow> cool
- # [02:51] <Lachy> Hixie, do you mean <dl><dt>Attributes <dd>attr1<dd>attr2 ... <dt>DOM Interfaces <dd>prop1<dd>prop2 ...</dl>?
- # [02:51] <Hixie> no
- # [02:51] * Quits: kinetik (n=kinetik@121.98.132.55) (Read error: 110 (Connection timed out))
- # [02:51] <Lachy> or do you mean each attribute <dt>Attr <dd>description of attr
- # [02:51] <Hixie> the latter
- # [02:52] <Hixie> i was about to type an example but you got there first :-)
- # [02:52] * Joins: doublec (n=Chris_Do@118-92-147-136.dsl.dyn.ihug.co.nz)
- # [02:53] <Lachy> if each attribute was to have a description, I was considering using a table instead.
- # [02:53] <Hixie> that's fine too
- # [02:53] <Lachy> But then I'd need to write short summary descriptions for each attribute, though I wanted to avoid cluttering the summary boxes with too much information
- # [02:54] * Quits: xydyx (n=hdh@58.187.22.19) (Remote closed the connection)
- # [02:54] <Lachy> hmm, maybe something like the comments for each attr in the HTML4 DTD would be ok. e.g. http://www.w3.org/TR/html401/struct/links.html#edef-A
- # [02:56] <Hixie> just have the full descriptions there
- # [02:57] <Hixie> you'll probably have to split the attribute descriptions from the more meaty "how to use the element" text anyway
- # [02:58] * Quits: rubys (n=rubys@cpe-075-182-092-038.nc.res.rr.com) ("Leaving.")
- # [02:59] <Lachy> maybe. it depends how much I need to write for each attribute description
- # [02:59] <Lachy> if it's a lot, then it's not really going to fit in the space too well
- # [02:59] <Hixie> on the web the space is infinite :-)
- # [03:00] <Hixie> i wouldn't worry too much about having the right template
- # [03:00] <Hixie> once you've got descriptions for a dozen elements or two, you'll have a much better idea of what you have to put in the template
- # [03:02] <Lachy> reload the template and take a look now
- # [03:07] * Quits: tantek (n=tantek@adsl-71-135-124-193.dsl.pltn13.pacbell.net)
- # [03:08] * Joins: kinetik (n=kinetik@121.98.132.55)
- # [03:13] * jwalden wonders if the HTTP spec will ever sanction use of Set-Cookie, WWW-Authenticate, and Proxy-Authenticate as they are actually used
- # [03:13] * Quits: doublec (n=Chris_Do@118-92-147-136.dsl.dyn.ihug.co.nz) ("ChatZilla 0.9.79-rdmsoft [XULRunner 1.8.0.9/2006120508]")
- # [03:14] <Lachy> wtf? Sam incorrectly defined the meaning of a strawman http://lists.w3.org/Archives/Public/public-html/2009Jan/0688.html and then Leif incorrectly pointed out something isn't a strawman, either by it's real definition, nor Sam's http://lists.w3.org/Archives/Public/public-html/2009Feb/0000.html
- # [03:14] * jwalden suspects not, based on http://trac.tools.ietf.org/wg/httpbis/trac/ticket/129
- # [03:52] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [04:18] * Joins: myakura (n=myakura@p3020-ipbf505marunouchi.tokyo.ocn.ne.jp)
- # [04:22] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [05:16] * Joins: erlehmann (n=erlehman@86.59.25.121)
- # [05:16] * Quits: heycam (n=cam@124-168-42-25.dyn.iinet.net.au) (Read error: 101 (Network is unreachable))
- # [05:42] * Joins: roc (n=roc@121-72-198-197.dsl.telstraclear.net)
- # [06:02] * Joins: heycam (n=cam@124-168-1-1.dyn.iinet.net.au)
- # [06:03] * Joins: dave_levin (n=dave_lev@c-98-203-247-78.hsd1.wa.comcast.net)
- # [06:19] <Hixie> Lachy: i was surprised to see sam's suggestion of actually having people call each other out; it'll be interesting to see how such an approach fares
- # [06:19] * Joins: eric_carlson (n=ericc@adsl-67-112-12-110.dsl.anhm01.pacbell.net)
- # [06:19] <Hixie> Lachy: the attribute side looks good; i still think the idl side should use a similar technique instead of webidl
- # [06:34] <heycam> hey Hixie i'm here for a bit
- # [06:34] <Hixie> hey
- # [06:34] <Hixie> two things
- # [06:35] <Hixie> 1 - when you have a moment, i would appreciate your feedback on the WindowProxy section in the spec
- # [06:35] <heycam> k i'll have a look during the week
- # [06:35] <Hixie> (it's very short)
- # [06:36] <Hixie> 2 - we have to say that the "this" keyword in JS at the global scope returns the WindowProxy object instead of the actual global object
- # [06:36] <heycam> uh oh
- # [06:36] <Hixie> i'm unsure whether it's best to say this in WebIDL, with the JS binding stuff, or if we should say it in HTML5, in a JS-specific section
- # [06:36] <Hixie> (or in ES 3.x, but they probably don't want a forward dependency on html5)
- # [06:37] <heycam> so it's a violation of es3?
- # [06:37] <heycam> i haven't looked much into the whole split window thing: is it that "this" at the global scope returns one object, while a different object is in the scope chain?
- # [06:41] <heycam> as to whether it's more appropriate in web idl or html5, dunno
- # [06:41] <heycam> guess i'll need to look at what exactly it entails
- # [06:42] <Hixie> yeah it's a violation of es 3.1
- # [06:42] <Hixie> yes, the top object in the scope chain is the global object (the Window object for that script's Document)
- # [06:42] <Hixie> and the .window, this, etc, attributes all return a WindowProxy object
- # [06:44] * Joins: MikeSmith (n=MikeSmit@210.229.158.64)
- # [06:44] * Joins: dimich (n=dimich@207-118-25-126.dyn.centurytel.net)
- # [06:45] <heycam> so perhaps it needn't be a violation, it's just that evaluating the contents of a <script> is done with some extra futzing of this/scope chain, rather than being whatever a global code execution says in ES
- # [06:46] <heycam> in web idl, i don't say anything about what object is the global object, or what happens when a top level script is executed
- # [06:46] <heycam> i just say what properties exist on the global, and what their functionality is, etc.
- # [06:46] <Hixie> html5 mentions what the global object is, without mentioning its importance to js
- # [06:47] <heycam> would you want an explicit hook for html5 to define what's the global object?
- # [06:47] <heycam> (an explicit hook in web idl that is?)
- # [06:47] <Hixie> i'm ok with the current text, but i'm also very happy to make it explicit, sure
- # [06:47] <Hixie> if we do add such text, we can add text to make |this| different at the same time
- # [06:48] <Hixie> see a recent e-mail on whatwg about this for some background, btw
- # [06:48] <Hixie> from bz i think
- # [06:53] <heycam> so this behaviour is the same when executing a <script> as when, say, firing an event listener?
- # [06:57] * Quits: roc (n=roc@121-72-198-197.dsl.telstraclear.net)
- # [06:57] <heycam> maybe web idl should have a definition for executing a script with a particular object in the scope chain?
- # [06:58] <heycam> i see the definition for "The script settings determined from the node" seems pretty generic
- # [06:58] <heycam> would you imagine this split window thing would be applicable to other languages?
- # [06:58] * heycam wonders why some <dfn>s are italicised now
- # [07:00] <heycam> it'd be nice if whatwg@whatwg.org mails had an Archived-At header, like w3.org lists
- # [07:00] <Hixie> sorry, had a real life distraction
- # [07:00] * Joins: roc (n=roc@121-72-198-197.dsl.telstraclear.net)
- # [07:01] <Hixie> yes, the behavior is the same for all scripts, javascript:, <script>, event handlers, whatever. Though event handlers already have an explicit definition of the scope chain somewhere in html5.
- # [07:01] <heycam> so the only diff is the replacement of the scope chain object with some other object (the "inner" window)
- # [07:01] <Hixie> do other languages have a global object, even?
- # [07:01] * heycam shrugs
- # [07:02] <heycam> gotta go, bbl
- # [07:02] <Hixie> k
- # [07:02] <Hixie> later
- # [07:02] <Hixie> thanks
- # [07:04] * Quits: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [07:06] * Quits: roc (n=roc@121-72-198-197.dsl.telstraclear.net)
- # [07:38] * Quits: myakura (n=myakura@p3020-ipbf505marunouchi.tokyo.ocn.ne.jp) ("Leaving...")
- # [08:38] * Quits: dbaron (n=dbaron@c-98-234-51-190.hsd1.ca.comcast.net) ("8403864 bytes have been tenured, next gc will be global.")
- # [08:42] * Joins: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [08:42] * Quits: MikeSmith (n=MikeSmit@210.229.158.64) ("Tomorrow to fresh woods, and pastures new.")
- # [08:53] * Joins: shepazu (n=schepers@c-24-8-217-41.hsd1.co.comcast.net)
- # [09:30] * Parts: erlehmann (n=erlehman@86.59.25.121)
- # [10:00] * Joins: mlpug (n=user@a88-115-168-225.elisa-laajakaista.fi)
- # [10:01] * Joins: zdobersek (n=zan@cpe-92-37-76-190.dynamic.amis.net)
- # [10:07] * Joins: maikmerten (n=maikmert@L929b.l.pppool.de)
- # [10:10] * Joins: Maurice (i=copyman@5ED548D4.cable.ziggo.nl)
- # [10:16] * Quits: dimich (n=dimich@207-118-25-126.dyn.centurytel.net)
- # [11:03] * Joins: ROBOd (n=robod@89.122.216.38)
- # [11:55] * Quits: mlpug (n=user@a88-115-168-225.elisa-laajakaista.fi) (Remote closed the connection)
- # [12:08] * Quits: olliej_ (n=oliver@c-67-164-125-23.hsd1.ca.comcast.net)
- # [12:22] * Joins: xydyx (n=hdh@58.187.21.63)
- # [12:35] * Joins: nessy (n=nessy@124-168-147-235.dyn.iinet.net.au)
- # [12:36] * Quits: nessy (n=nessy@124-168-147-235.dyn.iinet.net.au) (Remote closed the connection)
- # [12:40] * Joins: svl (n=me@ip565744a7.direct-adsl.nl)
- # [12:59] * Joins: myakura (n=myakura@p3020-ipbf505marunouchi.tokyo.ocn.ne.jp)
- # [13:33] * jgraham has parts of html5lib limping along in python 3
- # [13:34] <jgraham> Like enough to do html5lib.parse(b"<html></html>) but not enough to deal with the possibility of passing in a file object with the encoding set but the wrong error handling mode
- # [13:34] <jgraham> Which I don't really know how to deal with apart from saying "don't do that"
- # [13:36] <jgraham> This is, of course a distraction and I should really be concentrating on the MathML + SVG stuff but it is quite a fun distraction
- # [13:39] <takkaria> it's interesting. when it came to data interchange, people first jumped on XML for it. now they're jumping on RDF-in-XML
- # [13:39] <takkaria> I wonder if there's going to be an abstraction over that at some point, too
- # [14:22] * Joins: supergreg (n=Miranda@8.80-203-29.nextgentel.com)
- # [14:42] <gsnedders> Is it bad that Safari's lack of regex in it's find box is annoying
- # [14:42] <gsnedders> *its
- # [14:42] <gsnedders> ?
- # [14:52] <Lachy> gsnedders, I don't know of any browser that supports regex searching
- # [15:37] <gsnedders> Does anyone have any data about how common different character sets are?
- # [15:40] <karlcow> Lachy: https://addons.mozilla.org/en-US/firefox/addon/6534 something like this maybe?
- # [15:43] <Lachy> karlcow, thanks
- # [15:47] * Quits: maikmerten (n=maikmert@L929b.l.pppool.de) (Remote closed the connection)
- # [16:13] * Quits: eric_carlson (n=ericc@adsl-67-112-12-110.dsl.anhm01.pacbell.net)
- # [16:14] <hsivonen> http://video.dld-conference.com/watch/dTMYg3z?t=dld09 at 20 minutes
- # [16:19] <Philip`> gsnedders: Do you mean something like http://philip.html5.org/data/charsets.html ?
- # [16:19] <gsnedders> Philip`: yes
- # [16:19] <Philip`> gsnedders: I don't think anyone has any data like that
- # [16:20] <gsnedders> Philip`: That data there should be enough to work from
- # [16:20] <Philip`> Oh, right, I have data like that - that was convenient
- # [16:21] <gsnedders> Philip`: :P
- # [16:21] <gsnedders> I'd be interested in a bigger less bias sample, though
- # [16:21] <gsnedders> (Seeming dmoz is bias towards English)
- # [16:22] <Philip`> I don't think the concept of "less bias" exists
- # [16:22] <Philip`> You can just choose the direction in which you want the bias to be
- # [16:22] <Philip`> Ooh, a blizzard
- # [16:23] <Philip`> (dmoz is definitely biased towards English, and I think it's also biased towards western European)
- # [16:24] <Philip`> (so it's really bad for e.g. Chinese sites)
- # [16:25] <Philip`> *.uk: 161279 pages
- # [16:25] <Philip`> *.de: 295359
- # [16:25] <Philip`> *.jp: 119322
- # [16:25] <Philip`> *.cn: 7582
- # [16:25] <jcranmer> .zh?
- # [16:26] <Philip`> *.zh: 36
- # [16:27] <Philip`> Whoops
- # [16:27] <Philip`> That's because '.' was a regexp
- # [16:27] <Philip`> If I make it \. then there's 0
- # [16:27] <Philip`> (The other numbers don't change much)
- # [16:28] <gsnedders> Philip`: Yeah, that's my problem.
- # [16:28] <gsnedders> What's ZH?
- # [16:28] <annevk> hsivonen, interesting
- # [16:29] * myakura thinks those *.jp sites are years old
- # [16:29] <annevk> hsivonen, didn't know he was CEO of CC
- # [16:29] <annevk> (that Flash player they are using is also really cool btw)
- # [16:30] <Philip`> gsnedders: Alexa's list might be better, since it's got ~20K for *.cn (and similar for *.jp and *.uk)
- # [16:31] <gsnedders> Philip`: But do you have data for that? :P
- # [16:31] <Philip`> gsnedders: (though it only lists sites rather than pages, so it's entirely biased towards the initial entry pages)
- # [16:32] <Philip`> gsnedders: No, but you could easily download the list and then download a few thousand .cn pages and search them for patterns :-)
- # [16:39] <karlcow> Philip`: there would be benefit of having separate sets for different countries. I had this discussion with blooberry about the mama too.
- # [16:40] <karlcow> The issue is how to create these sets
- # [16:41] <Philip`> karlcow: The problem is that it would encourage comparison between datasets, and comparisons are very dangerous because you're usually comparing the bias of the samples rather than real differences
- # [16:41] <karlcow> There is also all the invisible Web. The one behind passwd or intranets
- # [16:42] <karlcow> Philip`: yes indeed, you have to always give the context of your studies, plus a bit of error calculations.
- # [16:42] <karlcow> There is nothing *true* in sample surveys, just giving you hints but not certainty
- # [16:43] * karlcow has suddenly a big wave of astrophysical studies and research coming back to his head
- # [16:44] <blooberry> With publicly available sets, the burden of determining the set is pretty much off the entity that is doing the analysis. Any other set requires a lot of extra legwork, both in getting and describing the additional URLs, examining its bias. Not to say that publicly available sets don't have bias, it is just that they are more easily documented.
- # [16:45] <annevk> can someone quickly tell me whether the character that e.g. ë represents can have multiple Unicode representations?
- # [16:45] <annevk> e.g. its own character and one using two characters that combine into one?
- # [16:45] <Philip`> karlcow: Error calculations don't seem especially helpful here - I can say "(4.56+/-0.07)% of pages in dmoz.org have property P" based on the sample size, but that says nothing meaningful about other data sets (e.g. dmoz.org yesterday or tomorrow, or some other selection of pages)
- # [16:47] <karlcow> Philip`: again context… and calculation of confidence depending on your data set.
- # [16:48] <gsnedders> annevk: What is euml?
- # [16:48] <annevk> data:text/html,ë
- # [16:48] <gsnedders> annevk: Yes, it has at least two representations
- # [16:48] <annevk> thx
- # [16:49] <karlcow> U+000EB
- # [16:49] <gsnedders> karlcow: Yeah, I could find that. I don't however know Unicode codepoints off the top of my head. :)
- # [16:50] * karlcow neither
- # [16:56] <blooberry> So, is there a way to use available URL sets that is both reproduceable and has the ability to improve the set in ways that are statistically meaningful?
- # [16:56] <karlcow> ah the combining diaresis is 0308
- # [16:56] <karlcow> and the diaresis is 00A8
- # [16:57] <karlcow> blooberry: as a downloaded set or as an accessible set ?
- # [16:57] * karlcow will have to explain that
- # [16:59] <karlcow> an URL set is "biological". It changes in time, because of the nature of the Web. So for example to compare the results of two different tools, we need a downloaded dated set of URIs and their content (and metadata going with it)
- # [17:00] <blooberry> karlcow: *nodding agreement*
- # [17:01] <karlcow> Then there is the quality of the set… and here same kind of issues than in astrophysics when I was doing star fields research. At a point you have to select the stars, because if you pick up randomly, you get many interesting stats, but not really usable in a scientific way
- # [17:02] <blooberry> What if you balance it between selected and random?
- # [17:02] <karlcow> I guess the balanced set helps you to test the quality of the random
- # [17:03] <karlcow> s/balanced/selected/
- # [17:16] * Joins: maikmerten (n=maikmert@L929b.l.pppool.de)
- # [17:20] <karlcow> the selected set helps you to test hypothesis in a controlled environment. For example, new Web sites released by Web agencies (recent contracts) to see what are the new professional practices.
- # [17:22] <karlcow> on a Japanese only set and on a few years, we could also see for example if people are switching to utf-8 or not.
- # [17:22] <karlcow> without necessary knowing if it's a local choice or a constraints of the tools to produce Web sites.
- # [17:24] * Joins: eric_carlson (n=ericc@nat/apple/x-cf8b7c2f4b3eee75)
- # [17:24] <karlcow> blooberry: did you play with different user agent strings for the MAMA, faking the string to see the variability?
- # [17:28] <blooberry> That's been on my list of things to do. I played around with it once with the Alexa Top 1000 and had interesting results, but they were hard to interpret.
- # [17:28] <blooberry> I've never come up with the perfect way to compare the results between UA strings.
- # [17:28] <blooberry> I kept finding different things that were variable between versions.
- # [17:32] <blooberry> I pulled content using the latest UAs from Opera, FF, and MSIE, as well as the forthcoming Opera UA, as well as a UA called "foo" for comparison. 8-}
- # [17:34] * Joins: dbaron (n=dbaron@c-98-234-51-190.hsd1.ca.comcast.net)
- # [17:36] <karlcow> blooberry: hehe. The Power of Foo! known by old monks in China for generations. They tackle issues on bad Web pages.
- # [17:37] <blooberry> yes, they are the experts in Kung Foo
- # [17:38] <blooberry> I found that I had to ignore both IMG references and hyperlink HREFs, because many sites randomize the arguments for them to aid tracking...it just makes the analysis job harder. ;-}
- # [17:40] <blooberry> At a certain point I was really wondering if what I was deleting was too crucial to throw away. I'm not sure of the answer to that.
- # [18:19] * Quits: myakura (n=myakura@p3020-ipbf505marunouchi.tokyo.ocn.ne.jp) ("Leaving...")
- # [19:06] * Joins: zdobersek1 (n=zan@cpe-92-37-76-190.dynamic.amis.net)
- # [19:06] * Quits: zdobersek (n=zan@cpe-92-37-76-190.dynamic.amis.net) (Read error: 104 (Connection reset by peer))
- # [19:21] * Quits: Amorphous (i=jan@unaffiliated/amorphous) (Read error: 110 (Connection timed out))
- # [19:25] * Joins: Amorphous (i=jan@unaffiliated/amorphous)
- # [19:27] * Joins: erlehmann (n=erlehman@86.59.25.121)
- # [19:56] * Quits: kinetik (n=kinetik@121.98.132.55) ("leaving")
- # [19:56] * Joins: kinetik (n=kinetik@121.98.132.55)
- # [20:16] * Joins: MikeSmith (n=MikeSmit@m290f36d0.tmodns.net)
- # [20:19] * Quits: maikmerten (n=maikmert@L929b.l.pppool.de) (Remote closed the connection)
- # [20:20] * Joins: mstange (n=markus@buntes215.wohnheim.uni-kl.de)
- # [20:46] * Quits: MikeSmith (n=MikeSmit@m290f36d0.tmodns.net) (Read error: 110 (Connection timed out))
- # [20:59] * Joins: MikeSmith (n=MikeSmit@m290f36d0.tmodns.net)
- # [21:07] * Quits: MikeSmith (n=MikeSmit@m290f36d0.tmodns.net) ("Tomorrow to fresh woods, and pastures new.")
- # [21:12] * Joins: olliej (n=oliver@c-67-164-125-23.hsd1.ca.comcast.net)
- # [21:26] * Joins: nessy (n=nessy@124-168-147-235.dyn.iinet.net.au)
- # [21:40] * Quits: heycam (n=cam@124-168-1-1.dyn.iinet.net.au) (Read error: 60 (Operation timed out))
- # [21:43] * Joins: heycam (n=cam@203-217-88-114.dyn.iinet.net.au)
- # [22:07] <Lachy> Looks like some people involved with HTML4All have started creating their own HTML spec http://html4all.org/mailman/archives/list_html4all.org/2009-February/001038.html
- # [22:07] * Quits: ROBOd (n=robod@89.122.216.38) ("http://www.robodesign.ro")
- # [22:12] <gsnedders> I'm just confused trying to read the wiki draft
- # [22:16] * Hixie looks to see if they have any good ideas worth taking into html5
- # [22:23] <Lachy> my script to generate the element summaries for the authoring guide is working well, though it's not quite finished
- # [22:24] <karlcow> they called it "4.1"
- # [22:24] <gsnedders> 5 > 4.1. EOF;
- # [22:24] <gsnedders> :P
- # [22:24] <karlcow> gsnedders: and you want candies with that?
- # [22:25] <gsnedders> karlcow: Nah, not really
- # [22:25] <gsnedders> karlcow: A minion that could write an English essay for me would be nice though
- # [22:25] <karlcow> or maybe an enhancer, because it seems there is an issue with the length of your thing
- # [22:26] <Lachy> gsnedders, do you know how I can use the html5lib serialiser to pretty print the output from my script?
- # [22:27] <Lachy> or at least print it in a way that doesn't insert new lines everywhere
- # [22:27] <gsnedders> Lachy: Not off the top of my head
- # [22:27] <Lachy> I'm using the example provided here http://code.google.com/p/html5lib/wiki/UserDocumentation#Serialization_of_Streams
- # [22:28] <Lachy> but it seems the print statement adds a LF after every line. Is there a way to stop that?
- # [22:29] * karlcow wonders if they built it from scratch or they if took the source of html 5 and changed some of the things
- # [22:29] <gsnedders> karlcow: Parts are from html5 (run through tidy and converted to XHTML 1.0 Strict), others are from scratch, as far as I can tell
- # [22:29] <Lachy> ah, it works if I use: "print item," The comma must suppress the LF
- # [22:30] <karlcow> that gives now html5, html 4.1, html6, hmm what else
- # [22:30] <karlcow> gsnedders: ah. thanks
- # [22:30] <gsnedders> Lachy: You probably want to just write to file directly
- # [22:31] <karlcow> http://html4all.org/wiki/index.php/Inaugural_Members
- # [22:31] * Parts: zdobersek1 (n=zan@cpe-92-37-76-190.dynamic.amis.net)
- # [22:31] <Lachy> gsnedders, I figured if I write to standard output, then I can eventually just pipe it to other scripts that incorporate those summaries into the main document
- # [22:31] <gsnedders> Lachy: import sys
- # [22:32] <gsnedders> Lachy: sys.stdout.write(item)
- # [22:32] * gsnedders shrugs
- # [22:33] <karlcow> hmmm mainly rob http://html4all.org/wiki/index.php/Special:Recentchanges
- # [22:34] <karlcow> at least Rob does all the wiki edits
- # [22:34] * Joins: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [22:35] <Lachy> gsnedders, thanks
- # [22:35] <Lachy> Hixie, this is the output from my script so far http://lachy.id.au/temp/template.html
- # [22:36] <gsnedders> Because authors love IDL!
- # [22:36] <Lachy> gsnedders, I need to do something better with the IDL, eventually
- # [22:36] <Hixie> Lachy: looks familiar :-P
- # [22:36] <Lachy> I just have to figure out what and how. For now, it's easier to just stick it in there as-is as a placeholder
- # [22:36] * Quits: mstange (n=markus@buntes215.wohnheim.uni-kl.de) ("ChatZilla 0.9.84-2009010213 [Firefox 3.2a1pre/20090201020604]")
- # [22:37] <Lachy> Hixie, it should. My script just steals all the content from the script and reformats it
- # [22:37] * Joins: mstange (n=markus@buntes215.wohnheim.uni-kl.de)
- # [22:37] <Hixie> :-)
- # [22:38] <karlcow> http://code.google.com/intl/fr/search/#q=%22import%20html5lib%22
- # [22:39] <karlcow> http://search.koders.com/default.aspx?s=%22import+html5lib%22&btn=&la=%2A&li=%2A
- # [22:40] <karlcow> hmm beautifulsoup not a lot of results either http://search.koders.com/default.aspx?s=%22import+BeautifulSoup%22&btn=&la=%2A&li=%2A
- # [22:41] <gsnedders> It doesn't even include Anolis! :'(
- # [22:56] <Lachy> http://dev.w3.org/html5/html-author/#the-html-vocabulary-and-apis
- # [22:59] * Quits: dglazkov (n=dglazkov@c-98-207-88-44.hsd1.ca.comcast.net)
- # [23:00] <Lachy> karlcow, as you requested yesterday, the script to generate the element summaries is now in CVS http://dev.w3.org/html5/html-author/utils/
- # [23:00] <Lachy> see elements.py and elements.template.html
- # [23:02] <karlcow> excellent
- # [23:02] <karlcow> lachy++
- # [23:03] <Philip`> Lachy: "print item," prints a space at the end, which probably isn't what you want
- # [23:04] <Lachy> Philip`, ok. I changed it to sys.stdout.write() as gsnedders suggested anyway
- # [23:05] <Philip`> (Ooh, the ground is covered in snow, for the first time since ages and ages ago)
- # [23:06] <Philip`> karlcow: Your searches will miss people writing "from BeautifulSoup import ..."
- # [23:08] * Joins: webben (n=webben@91.84.14.159)
- # [23:12] <Lachy> http://dev.w3.org/html5/html-author/#the-html-vocabulary-and-apis now has category descriptions, and appropriate links from each element summary
- # [23:14] * Quits: xydyx (n=hdh@58.187.21.63) (Read error: 54 (Connection reset by peer))
- # [23:14] * Joins: xydyx (n=hdh@58.187.21.63)
- # [23:14] <karlcow> Philip`: indeed.
- # [23:16] * Quits: olliej (n=oliver@c-67-164-125-23.hsd1.ca.comcast.net)
- # [23:19] * Joins: heycam` (n=cam@124-168-33-158.dyn.iinet.net.au)
- # [23:20] * Quits: dbaron (n=dbaron@c-98-234-51-190.hsd1.ca.comcast.net) ("8403864 bytes have been tenured, next gc will be global.")
- # [23:21] * Quits: heycam (n=cam@203-217-88-114.dyn.iinet.net.au) (Read error: 101 (Network is unreachable))
- # [23:28] <gsnedders> gsnedders top tip: remember the syrup when making flapjack.
- # [23:29] <gsnedders> It doesn't work when you forget it
- # [23:30] * Quits: Maurice (i=copyman@5ED548D4.cable.ziggo.nl) ("Disconnected...")
- # [23:31] * Quits: heycam` (n=cam@124-168-33-158.dyn.iinet.net.au) ("bye")
- # [23:38] * Joins: tndH (n=Rob@james-baillie-pc083-014.student-halls.leeds.ac.uk)
- # [23:43] * Quits: annevk (n=annevk@213.52.39.8) (Read error: 110 (Connection timed out))
- # [23:53] * Joins: olliej (n=oliver@nat/apple/x-eafc60171c09b306)
- # [23:54] * Quits: tantek (n=tantek@adsl-63-195-114-133.dsl.snfc21.pacbell.net)
- # [23:55] <olliej> yoyo all
- # Session Close: Mon Feb 02 00:00:00 2009
The end :)