Options:
- # Session Start: Sun Aug 12 00:00:00 2007
- # Session Ident: #whatwg
- # [00:05] * Quits: met_ (n=Hassman@r5bx220.net.upc.cz) ("Chemists never die, they just stop reacting.")
- # [00:16] * Parts: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [00:38] * Joins: aroben (n=adamrobe@c-67-160-250-192.hsd1.ca.comcast.net)
- # [00:42] * Quits: tantek (n=tantek@w-mob300-128-62-207-11.public.utexas.edu)
- # [00:47] <othermaciej> Molly's post is confusing
- # [00:47] <othermaciej> I'm not sure which things she thinks are a problem or why
- # [00:48] <takkaria> it seemed a bit of an undirected rant to me
- # [00:51] <othermaciej> actually it sounds like a Microsoft enemies ist
- # [00:51] <othermaciej> *list
- # [00:51] <othermaciej> this might be an accident
- # [00:52] <othermaciej> but I know that Microsoft is against many advanced features in HTML5, against the new ECMAScript standards process, and against AIR
- # [00:52] <takkaria> AIR?
- # [00:52] * takkaria is new around these parts
- # [00:53] <othermaciej> it's the Adobe Flash+HTML for local apps thing
- # [00:53] <mgdm> http://en.wikipedia.org/wiki/Adobe_AIR
- # [00:55] <takkaria> ah, right. last I heard, it was still Apollo
- # [00:56] <takkaria> is that MS as a company that's against them, or the IE team against them?
- # [00:56] * mgdm is very new around these parts, hello everyone
- # [01:00] <webben> I think folks would benefit from waiting for Molly to clarify her post (or asking her too) rather than trying to guess what she means.
- # [01:01] <webben> othermaciej: Is there some document somewhere that explains how the standards process has changed? I wasn't aware it had.
- # [01:01] <othermaciej> takkaria: MS is against them because it encroaches on (a) native windows app development with win32 or .NET and (b) Silverlight
- # [01:02] * takkaria nods
- # [01:02] <takkaria> that's pretty much why they're being added, so I'm not suprised, really
- # [01:08] * Joins: tantek (n=tantek@rrcs-67-78-64-62.sw.biz.rr.com)
- # [01:10] <othermaciej> well, I commented on Molly's post
- # [01:10] <othermaciej> although mostly responding to her, not her commenters
- # [01:10] <othermaciej> who seem to be off on another tangent altogether
- # [01:20] * Quits: webben (n=webben@dip5-fw.corp.ukl.yahoo.com)
- # [01:22] * Quits: aroben (n=adamrobe@unaffiliated/aroben)
- # [01:23] * Joins: aroben (n=adamrobe@c-67-160-250-192.hsd1.ca.comcast.net)
- # [02:03] * Joins: annevk (n=annevk@84.77.4.8)
- # [02:06] * Quits: aroben (n=adamrobe@unaffiliated/aroben)
- # [02:14] * Quits: tantek (n=tantek@rrcs-67-78-64-62.sw.biz.rr.com)
- # [02:24] * Quits: takkaria (n=takkaria@spc1-with2-0-0-cust148.bagu.broadband.ntl.com) ("leaving")
- # [02:25] <annevk> the comments on molly.com latest post are just sad
- # [02:26] * Joins: zcorpan (n=zcorpan@84-216-41-76.sprayadsl.telenor.se)
- # [02:26] * Joins: takkaria_ (n=takkaria@isparp.co.uk)
- # [02:27] * takkaria_ is now known as takkaria
- # [02:33] <annevk> xml parser error: http://tommorris.org/blog/
- # [02:33] * annevk hits "Reparse document as HTML"
- # [02:37] * Joins: aroben (n=adamrobe@c-67-160-250-192.hsd1.ca.comcast.net)
- # [02:45] <Philip`> XHTML isn't great at marketing itself, since the only time you ever notice that a site uses XHTML is when it gives you an XML parse error
- # [02:50] * Quits: tndH (i=Rob@adsl-87-102-89-143.karoo.KCOM.COM) (Read error: 110 (Connection timed out))
- # [03:13] * Quits: takkaria (n=takkaria@isparp.co.uk) ("leaving")
- # [03:21] * Joins: takkaria (n=takkaria@isparp.co.uk)
- # [03:23] * takkaria is now known as tak|sleep
- # [03:31] * weinig is now known as weinig|food
- # [03:54] * Quits: aroben (n=adamrobe@unaffiliated/aroben)
- # [03:58] * Joins: aroben (n=adamrobe@c-67-160-250-192.hsd1.ca.comcast.net)
- # [04:00] * Quits: zcorpan (n=zcorpan@84-216-41-76.sprayadsl.telenor.se) (Read error: 110 (Connection timed out))
- # [04:07] * Quits: aroben (n=adamrobe@unaffiliated/aroben)
- # [04:41] * weinig|food is now known as weinig
- # [05:50] * Joins: jruderman (n=jruderma@c-67-169-24-116.hsd1.ca.comcast.net)
- # [06:22] * Joins: tantek (n=tantek@rrcs-67-78-64-62.sw.biz.rr.com)
- # [06:30] * Quits: tantek (n=tantek@rrcs-67-78-64-62.sw.biz.rr.com)
- # [07:01] * Joins: aroben (n=adamrobe@c-67-160-250-192.hsd1.ca.comcast.net)
- # [07:01] * Quits: aroben (n=adamrobe@unaffiliated/aroben) (Remote closed the connection)
- # [07:02] * Joins: aroben (n=adamrobe@c-67-160-250-192.hsd1.ca.comcast.net)
- # [07:23] * Joins: tantek (n=tantek@rrcs-67-78-64-62.sw.biz.rr.com)
- # [07:37] <Hixie> i don't really understand molly
- # [07:37] <Hixie> didn't she used to be pro progress?
- # [07:42] <othermaciej> I'm not sure if she's as against progress as she sounds
- # [07:43] <othermaciej> I thought it was funny that she picked on AIR but not Silverlight
- # [07:49] <Lachy> it looks like JF and others are still complaining about pave the cowpaths (see his comment on molly's blog). I think he just misunderstands it, misrepresents it and then objects to it
- # [07:49] <Lachy> but I think we need to reword it to make it a little less vague
- # [07:50] <Lachy> this is an alternative wording I've come up with:
- # [07:50] <Hixie> i still don't really understand why this was never a problem before the w3c htmlwg
- # [07:50] <Lachy> Where possible, evaluate the success and failure of existing practices or solutions and consider adopting and/or improving upon them in preference to forbidding them and/or inventing new features.
- # [07:51] <tantek> o hai hixie, othermaciej & lachy
- # [07:51] <Hixie> lachy: what's the current wording?
- # [07:51] <Hixie> hey tantek
- # [07:51] <Lachy> hi tantek
- # [07:51] * tantek hopes he didn't speak too out of line on molly's post.
- # [07:51] <Lachy> Hixie: "When a practice is already widespread among authors, consider adopting it rather than forbidding it or inventing something new."
- # [07:52] <Hixie> Lachy: i prefer your new wording, yeah
- # [07:52] <othermaciej> hello tantek
- # [07:52] <othermaciej> Lachy: I agree it should be reworded, I like your wording
- # [07:52] <tantek> Hixie, this *was* a problem before (the new) w3c htmlwg
- # [07:52] <othermaciej> Lachy: I do think a lot of people are misunderstanding it
- # [07:53] <Lachy> IIRC, a few people have compared pave the cowpaths with adopting tables for layout, since it's so common, but such comparisons miss the point about evaluation the solutions
- # [07:53] <othermaciej> Lachy: but some misunderstandings seem to be wildly out of line with even the current wording
- # [07:53] <Hixie> lachy: we might want to look at the usemap="" thing as another example for that btw, since it's not accessiblity-related and doesn't involve xml syntax...
- # [07:53] <othermaciej> for example, claiming that "pave the cowpaths" would calll for removing particular html features
- # [07:53] <tantek> pave the cowpaths is not about adopting markup usage literally - it's about analyzing people's *data* *publication* behaviors
- # [07:53] <Hixie> Lachy: it highlights the focus on research
- # [07:54] <tantek> at least in the context of microformats
- # [07:54] <tantek> people are starting to use "pave the cowpaths" out of context
- # [07:54] <Hixie> though as othermaciej says, that's probably best given as a separate principle
- # [07:54] <Hixie> to avoid confusing people
- # [07:54] <othermaciej> I think the principle might need a new name as well as new text along lines proposed by Lachy
- # [07:55] <Lachy> one issue with that new wording is that it sounds a lot like Don't Reinvent the Wheel
- # [07:55] <tantek> i'm ok with confused people as long as they keep their dialog civil
- # [07:55] <Hixie> tantek: what i meant about the problem is that before the w3c htmlwg started, we had very few people argue from a position of authority rather than using rational discussion and basing their opinions on research
- # [07:56] <tantek> interesting
- # [07:56] <Hixie> tantek: still now, indeed, the discussions in whatwg's list are more rational than many on public-html (though there has been much progress on that front in public-html, i must admit)
- # [07:56] <tantek> well that's hopeful
- # [07:56] <Hixie> it might just be that the w3c community has fostered the development of "experts" whose entire careers are positioned around being on working groups and having their way
- # [07:58] <Lachy> yeah, Ben Millard on public-html has done some good work on finding table examples for doing research on headers. I wish that had happened when the headers debate started and we were asking for it
- # [07:58] <Lachy> but at least it is improving now
- # [07:58] <Hixie> yeah
- # [07:58] <Hixie> i'm gonna have to look at that issue soon, i have a ton of data i collected in early july about it
- # [07:59] <Hixie> i guess i can do that after <img alt>
- # [07:59] <Lachy> what needs to be done for alt?
- # [08:00] * Joins: mpt (n=mpt@121-72-130-73.dsl.telstraclear.net)
- # [08:04] <Hixie> see the spec for my wip revamp of <img alt>
- # [08:08] <Lachy> Hixie, another example for a piece of text with an alternate graphical representation would be some company logos
- # [08:08] <Hixie> yeah
- # [08:09] <tantek> Hixie, yeah I need to write up the table headers usage examples for hCalendar more properly
- # [08:09] <Lachy> You're also missing "An iconic representation of the surrounding content" - it's similar to "graphical representation of some of the surrounding text", but not quite the same
- # [08:10] <tantek> wow: http://twitter.com/rogerjohansson/statuses/200225882
- # [08:11] <Lachy> see, for example, http://www.webnauts.net/ - the wheelchair icon represents the concept of accessibility, the magnifying glass represents SEO, etc.
- # [08:11] <Lachy> (although, the alt text used on that site is ironically inappropriate)
- # [08:15] <Lachy> woah, this is surprising too http://twitter.com/rogerjohansson/statuses/132542642
- # [08:17] <tantek> clearly Roger felt bullied whether or not he actually was
- # [08:18] <othermaciej> I'm not sure why he felt bullied
- # [08:18] * Quits: aroben (n=adamrobe@unaffiliated/aroben)
- # [08:18] <tantek> i think there is a bit of a culture transition clash going on
- # [08:19] <othermaciej> I don't see that many messages from him, and none that are in my public-html folder seem to have engendered particularly heated responses
- # [08:19] <Lachy> some people, particularly the accessibility advocates, just have difficulty seeing our point of view and object whenever someone questions the conventional wisdom
- # [08:20] <Lachy> I'll speak to Roger later and find out what he really thinks
- # [08:20] <tantek> standards organizations, and w3c is no exception, often designed things "a priori", depending on the opinion of "experts" to just "pick the right thing" a priori, based on their expertise.
- # [08:21] <tantek> the approach that microformats (and apparently WHATWG) is taking with scientifically challenging features and demanding justification through research/evidence is quite new
- # [08:21] <tantek> and quite threatening to those that are used to simply getting their way because they say so
- # [08:21] <tantek> as in, how dare anyone question their expertise?
- # [08:21] <tantek> shouldn't their expertise be sufficient to justify things?
- # [08:23] <Lachy> one thing I've also noticed is that some people refuse to look at the issues from the authoring point of view, and only consider the benefits for those with accessibility-related needs
- # [08:23] <tantek> yes, i've noticed that tooo
- # [08:24] <tantek> it's like, just because you ask, people aren't necessarily going to do it. econ 101, psych 101 etc.
- # [08:24] <Lachy> i.e. they advocate features regardless of how easy they are for authors, and refuse to consider authoring incentives beyond legal and moral arguments
- # [08:24] <othermaciej> it really depends on what the standards body in question is, as far as past experience goes
- # [08:24] <tantek> Lachy, to be fair, *some* advocate features that way
- # [08:25] <tantek> painting all accessibility advocates with the same brush is inaccurate
- # [08:25] <othermaciej> my understanding is that, for instance, things like the C99 or H.264 had quite a bit of empirical work go into changes
- # [08:25] <Lachy> yeah, I realise it's not all, but it is at least a few of the more vocal ones
- # [08:25] <othermaciej> go into their design rather
- # [08:25] <tantek> yes on both counts
- # [08:26] <othermaciej> I have seen a number of people appear to take the stance that disagreeing with their preferred technical approach means you are against human rights
- # [08:26] * Joins: aroben (n=adamrobe@c-67-160-250-192.hsd1.ca.comcast.net)
- # [08:26] <Lachy> indeed, though I think some recent discussions clarified that a bit
- # [08:27] <othermaciej> voicing opinions like that tends to lead to communication breakdown
- # [08:27] <othermaciej> John Foliot still seems to stand by this stance
- # [08:27] <othermaciej> I say this based on both his comment post on Molly's blog and some private communication after I raised that issue
- # [08:27] <tantek> indeed, labeling someone "against human rights" is quite close to Godwin's law.
- # [08:28] <tantek> and of course, one should simply call out such ad hominem attacks for what they are
- # [08:29] * Lachy google's Godwin's law
- # [08:29] <Lachy> ah http://en.wikipedia.org/wiki/Godwin's_Law
- # [08:29] <tantek> ah yes John Foliot. he's the one that provided the most ad hominem attack fodder
- # [08:30] <Lachy> tantek: also the one to most vocally complain about the attitude problems
- # [08:30] <tantek> keep pointing out peoples' ad hominem attacks, for those are the true attitude problems
- # [08:30] <tantek> if you can't be civil, then you have an attitude problem and don't deserve any sympathy
- # [08:31] <tantek> or put more economically, IMHO, uncivil comments are lower priority than civil comments
- # [08:59] * Quits: weinig (i=weinig@nat/apple/x-1562dfdb61c10418)
- # [09:01] * Joins: wakaba_ (n=w@114.165.210.220.dy.bbexcite.jp)
- # [09:19] * Quits: wakaba (n=w@114.165.210.220.dy.bbexcite.jp) (Read error: 110 (Connection timed out))
- # [09:22] * Quits: dolphinling (n=chatzill@rbpool2-75.shoreham.net) ("ChatZilla 0.9.78.1-rdmsoft [XULRunner 1.8.0.9/2006120508]")
- # [10:14] * Joins: ROBOd (n=robod@86.34.246.154)
- # [10:34] * Joins: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [11:00] * tak|sleep is now known as takkaria
- # [11:11] * Quits: jruderman (n=jruderma@c-67-169-24-116.hsd1.ca.comcast.net)
- # [11:14] <takkaria> http://twitter.com/rogerjohansson/statuses/200225882
- # [11:17] <takkaria> oh. wrong window there
- # [11:17] * Quits: annevk (n=annevk@84.77.4.8) (Read error: 110 (Connection timed out))
- # [11:35] * Quits: tantek (n=tantek@rrcs-67-78-64-62.sw.biz.rr.com)
- # [12:06] * Joins: Codler (n=Codler@84-218-5-107.eurobelladsl.telenor.se)
- # [12:10] * Joins: zcorpan (n=zcorpan@84-216-42-194.sprayadsl.telenor.se)
- # [12:15] * Joins: Ducki (n=Ducki@d83-176-151-170.cust.tele2.de)
- # [12:16] <hsivonen> zcorpan: Re: attribute parsing, I don't have anything to add to what Philip` said
- # [12:16] <zcorpan> hsivonen: ok
- # [12:17] * hsivonen suffers from a bad case of sudden summer flu and should probably go offline instead of reading irc logs
- # [12:18] <hsivonen> hmm. looks like Roger Johansson removed the twitters mentioned in the IRC log today
- # [12:33] <zcorpan> btw, ben millard and me have discussed possible ways to improve the table headers algorithm
- # [12:34] <zcorpan> it will probably be forwarded to the list at some point
- # [12:35] <zcorpan> the current algorithm is inadequate for many tables
- # [12:59] * Joins: Darkluna (n=Codler@84-218-5-127.eurobelladsl.telenor.se)
- # [13:09] <Lachy> I had a long, rational and very productive discussion with Roger on ICQ a few hours ago. I think we were able to clarify a lot of issues and understand each others perspectives a lot better.
- # [13:09] <Lachy> I suspect that's why he removed the twitters
- # [13:12] * Quits: Codler (n=Codler@84-218-5-107.eurobelladsl.telenor.se) (Read error: 110 (Connection timed out))
- # [13:33] * Joins: maikmerten (n=maikmert@Lafbe.l.pppool.de)
- # [13:39] * Quits: Lachy (n=Lachy@124-168-4-56.dyn.iinet.net.au) ("ChatZilla 0.9.78.1 [Firefox 2.0.0.4/2007051502]")
- # [13:40] * Quits: aroben (n=adamrobe@unaffiliated/aroben)
- # [13:44] * Joins: Lachy (n=Lachy@124-168-4-56.dyn.iinet.net.au)
- # [13:52] * Parts: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [13:53] * Joins: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [13:53] * Quits: hasather_ (n=hasather@22.80-203-71.nextgentel.com) (Client Quit)
- # [13:53] * Joins: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [14:01] * Quits: Lachy (n=Lachy@124-168-4-56.dyn.iinet.net.au) ("ChatZilla 0.9.78.1 [Firefox 2.0.0.4/2007051502]")
- # [14:05] * Joins: Ducki_ (i=Ducki@nrdh-d9b980c9.pool.mediaWays.net)
- # [14:15] * Quits: Ducki (n=Ducki@d83-176-151-170.cust.tele2.de) (Read error: 110 (Connection timed out))
- # [14:17] * Quits: Ducki_ (i=Ducki@nrdh-d9b980c9.pool.mediaWays.net) (Read error: 113 (No route to host))
- # [14:32] * Parts: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [14:32] * Joins: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [14:57] * Joins: tndH (i=Rob@adsl-87-102-89-143.karoo.KCOM.COM)
- # [14:59] * Joins: Whiskey_M (n=Richard@host-84-9-127-20.bulldogdsl.com)
- # [14:59] <Whiskey_M> 'lo
- # [15:00] <zcorpan> yo
- # [15:00] * Parts: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [15:00] * Joins: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [15:00] <Whiskey_M> hi Zcorpan, how goes?
- # [15:01] <zcorpan> goes well thank you, you?
- # [15:01] <Whiskey_M> none too shabby :-)
- # [15:25] * Joins: met_ (n=Hassman@r5bx220.net.upc.cz)
- # [15:32] * Joins: Lachy (n=Lachlan@124-168-4-56.dyn.iinet.net.au)
- # [15:33] * Joins: tantek (n=tantek@rrcs-67-78-64-62.sw.biz.rr.com)
- # [15:50] * Darkluna is now known as Codler
- # [15:58] * Quits: tantek (n=tantek@rrcs-67-78-64-62.sw.biz.rr.com)
- # [16:02] * Quits: Codler (n=Codler@84-218-5-127.eurobelladsl.telenor.se) ("- nbs-irc 2.21 - www.nbs-irc.net -")
- # [16:52] * Joins: tantek (n=tantek@w-mob300-128-62-207-11.public.utexas.edu)
- # [16:55] * Joins: virtuelv (n=virtuelv@96.80-202-82.nextgentel.com)
- # [17:05] <met_> http://manwithnoblog.com/2007/08/12/are-we-becoming-complacent/
- # [17:08] <Whiskey_M> read it earlier today
- # [17:08] <Lachy> wow, see Joe's comment on molly's article. http://www.molly.com/2007/08/11/dear-w3c-dear-wasp/#comment-611988
- # [17:10] <Whiskey_M> isn't that normal Joe ;-)
- # [17:18] <Lachy> possibly, but it's ironic that he said that after my comment explaining that we are not against accessibility features. I just don't understand why he'd come back with a strawman argument about us dropping the alt attribute
- # [17:25] * virtuelv notes that he respectfully disagrees with fantasai on browser vendor influence over CSS
- # [17:26] <virtuelv> from http://fantasai.inkedblade.net/weblog/2007/css-wg-opening-up/
- # [17:41] * Joins: hendry (n=hendry@91.84.53.136)
- # [17:47] * Joins: Ducki (n=Ducki@213-102-92-251.cust.tele2.de)
- # [17:48] * Joins: sirPalook (n=chatzill@athedsl-300001.home.otenet.gr)
- # [17:49] <sirPalook> Noob (just read the specs). So in <p><tt><b>foo</p>, 'tt' and 'b' are *not* closed, but cloned?
- # [17:53] <sirPalook> In the DOM that is
- # [17:54] <Philip`> If you have something after that (other than a </b> or </tt> or EOF), then yes
- # [17:54] <Philip`> http://james.html5.org/cgi-bin/parsetree/parsetree.py?source=%3Cp%3E%3Ctt%3E%3Cb%3Efoo%3C%2Fp%3Ebar
- # [17:54] <Lachy> sirPalook, yes, I believe they are reopened after the P element, or inside any following block level element (the adoption agency algorithm is confusing)
- # [17:55] <sirPalook> I'm trying to figure out the differences with SGML, which theoretically sais that every closing tag closes all open tags...
- # [17:55] <sirPalook> Yep, thanks.
- # [17:56] <Lachy> sirPalook, yes, that's the theory and that's how OpenSP implemented it. Not sure if that's actually defined in the SGML spec, though
- # [17:56] <sirPalook> If i understand correctly, cloning happens for "Formatting tags"
- # [17:56] <Lachy> yes
- # [17:56] <sirPalook> Lachy. Yep I never invested any precious brain cells on SGML either :)
- # [17:59] * Philip` wonders if it's possible to get worse than quadratic behaviour
- # [17:59] <Philip`> (i.e. worse than http://james.html5.org/cgi-bin/parsetree/parsetree.py?source=%3Cp%3E%3Cb%3E%3Cb%3E%3Cb%3E%3Cb%3E%3Cb%3Ex%3Cp%3Ex%3Cp%3Ex%3Cp%3Ex%3Cp%3Ex )
- # [18:01] <sirPalook> nice one. Although I think that DOM sais the "clones" point to the same thing.
- # [18:02] <sirPalook> It's interesting what happens with <A NAME=> though
- # [18:02] * Quits: Whiskey_M (n=Richard@host-84-9-127-20.bulldogdsl.com)
- # [18:08] <Philip`> Not quite sure what you mean about <A NAME=>
- # [18:09] <hendry> hsivonen: do you have an idea with the element isn't used by UAs anymore really for aiding navigation?
- # [18:09] <hendry> hsivonen: do you have an idea with the LINK element isn't used by UAs anymore really for aiding navigation?
- # [18:11] <hasather_> hendry: at least Opera uses it
- # [18:11] * Lachy responded to Joe
- # [18:12] <Lachy> ... on molly's blog
- # [18:12] <Lachy> I hope this clears up the issue http://www.molly.com/2007/08/11/dear-w3c-dear-wasp/#comment-612166
- # [18:12] <Philip`> Does Opera use anything other than <link rel=next>?
- # [18:13] <hendry> i have loads <links on my blog natalian.org. I think I should remove them.
- # [18:13] <hasather_> Philip`: yes
- # [18:14] <hasather_> Philip`: activate the navigation bar (Appearance > Toolbar >Navigation bar)
- # [18:15] <Philip`> hasather_: Aha, thanks - I never noticed that before
- # [18:19] <hendry> hasather_: it's useless on natalian.org :) /me wonders if i can get that view on Opera mobile
- # [18:19] * Joins: Ducki_ (n=Ducki@nrdh-d9b980d5.pool.mediaWays.net)
- # [18:20] <virtuelv> hendry: afaik, no
- # [18:21] <virtuelv> besides, it only uses known values for rel
- # [18:22] * Philip` wonders if it's possible to use some kind of heuristics to decide whether a longdesc string is an attempt at writing a URL or is (as seems far more common) a descriptive phrase
- # [18:23] <Lachy> I think Hixie was going to attempt something like that, there are ways you could test it
- # [18:23] <Philip`> (By "far more common", I mean I looked at about six pages which use longdesc, and most were text instead of links)
- # [18:24] <Philip`> If people are going to the effort of writing long descriptions, it seems unhelpful to just ignore them because they never knew it should have been a link instead
- # [18:24] <Lachy> check if it begins with http://, search the string for spaces (which would indicate a phrase), attempt to resolve and retrieve the URL, etc
- # [18:26] <Lachy> could you just produce a page that lists all the values found in longdesc attributes, and check it manually?
- # [18:26] <Lachy> it shouldn't be too hard for someone to visually inspect the list and recognise phrases vs. URLs
- # [18:27] * takkaria volunteers to do such inspection
- # [18:27] * Quits: sirPalook (n=chatzill@athedsl-300001.home.otenet.gr) (Read error: 110 (Connection timed out))
- # [18:30] <Lachy> Philip`, you could also compare the value with the alt attribute and list the values for alt and longdesc side by side on the result page
- # [18:30] <Philip`> I don't have any data about attribute values at the moment - all I've got is the list at http://canvex.lazyilluminati.com/survey/2007-07-17/analyse.cgi/pages/attr/longdesc
- # [18:31] <Lachy> well, we can rule out wikipedia, since they use it incorrectly and also redundantly
- # [18:31] <Philip`> ...but it shouldn't require significant effort to get some data about alt/longdesc/etc in particular, since they're not that rare
- # [18:32] <Philip`> (though I don't have code that handles non-ASCII character encodings at all correctly, which is a pain if you're trying to actually read the text on a page)
- # [18:34] <Lachy> schirmer.com has empty longdesc=""
- # [18:35] <Lachy> I think Hixie was also going to compare the longdesc attribute with the value of the surrounding <a href="">, if any
- # [18:35] <Lachy> there should be some discussion of this research in the IRC logs somewhere
- # [18:36] <Lachy> the 4th result in your list, for instance, links to the same page, as does wikipedia
- # [18:37] <Lachy> tv.com has <iframe longdesc="">!
- # [18:38] <takkaria> shakedown.fi seems to replicate the title text as longdesc
- # [18:39] * Quits: Ducki (n=Ducki@213-102-92-251.cust.tele2.de) (Read error: 110 (Connection timed out))
- # [18:39] <Philip`> <IMG align="left" alt="Aurajoen soudut" longdesc="Aurajoen soudut" src="images/aurajoki.jpg" title="Aurajoen soudut">
- # [18:39] <Lachy> oh, HTML4 allows longdesc on iframe. I didn't realise that
- # [18:40] <takkaria> Philip`: but a fair few of them don't have alt text at all
- # [18:40] <Philip`> (See http://canvex.lazyilluminati.com/survey/2007-07-17/analyse.cgi/pages/tagattr/iframe/longdesc vs http://canvex.lazyilluminati.com/survey/2007-07-17/analyse.cgi/pages/tagattr/frame/longdesc vs http://canvex.lazyilluminati.com/survey/2007-07-17/analyse.cgi/pages/tagattr/img/longdesc if you're interested in a specific tag)
- # [18:41] <takkaria> heh, a longdesc with the same value as src
- # [18:42] <takkaria> stopitnow.com: <img src="images/logo-homer.gif" alt="stopitnowlogo" ... longdesc="images/logo-homer.gif">
- # [18:46] <takkaria> puntoluce.tv uses longdesc on a product image to link to that product's page; most of the others I've tried have either been attempts at keyword spam, or have been links to the image or to the page they're on
- # [18:50] <takkaria> ACLU in Iowa seem to use it on a "get adobe reader!" image to point to the download page; I wonder if that's copy-and-pasted from an adobe webpage?
- # [18:53] * Joins: Codler (n=Codler@84-218-5-159.eurobelladsl.telenor.se)
- # [18:58] <met_> Lachy, isn't the story about dropping alt based on http://www.whatwg.org/specs/web-apps/current-work/multipage/section-embedded.html#the-img where "alt" isn't marked as required, although bellow is written "The text is given by the alt attribute, which must be present and must have a non-empty value"?
- # [18:59] <takkaria> based on those links (on img), longdesc is generally misused and even when it, there's little useful information there
- # [19:00] <takkaria> if some more links could be dredged up from somewhere, I'd go through and put together a wiki page on longdesc usage
- # [19:01] <Lachy> takkaria, you can use either the whatwg wiki or the HTML WG section of the W3C wiki
- # [19:01] <takkaria> I think I prefer the whatwg one, mediawiki's friendlier
- # [19:02] <takkaria> I'll go through the links there after a bite to eat
- # [19:02] * takkaria bbl
- # [19:02] <Lachy> met_, look at last alternative titled "A key part of the content that has no textual alternative", that says "no alt" below
- # [19:02] <Lachy> that appears to be why it's no longer required
- # [19:04] <Lachy> I wasn't aware of that change until tonight, but it's going to make the validation issue interesting. It's useful to know when an alt has been omitted by mistake, so maybe validators would have to starrt issuing warnings instead.
- # [19:05] <met_> ah I see
- # [19:06] <Lachy> I'm not sure why there is a distinction between no alt and alt="" in the spec
- # [19:07] <met_> but I think that examples mentioned there are not good example, I would prefer even alt="screenshot" before blank alt (personal opinion) - maybe find better examples where alt realle has not sense?
- # [19:09] <Lachy> hmm. Maybe, I'm not sure what the best approach is with screenshots and alt text.
- # [19:10] <Lachy> it might be good to look up some guidelines about alt usage (like Jukka's pages, htmlhelp.com, etc.)
- # [19:10] <met_> just looking in czech accessibility book and there is good example: when has image only decorative meaning e.g. decorative illistration there is preferer blank alt, maybe this
- # [19:13] <met_> like this http://www.cbsnews.com/sections/eveningnews/main3420.shtml - every article has some small picture, which often has no information and is mainly for decorative purposes, to draw readers attention
- # [19:15] <Lachy> yeah, I call those iconic representations
- # [19:16] <met_> or here the man and 2 women http://en.navrcholu.cz/
- # [19:16] <Lachy> though, on News sites like that, the pictures often have associated captions
- # [19:16] <met_> ok its not in 100%
- # [19:17] <Lachy> yeah, but you're right. I already mentioned this exact issue to Hixie earlier today
- # [19:17] <Lachy> or yesterday, actually
- # [19:18] * Quits: Ducki_ (n=Ducki@nrdh-d9b980d5.pool.mediaWays.net) (Read error: 113 (No route to host))
- # [19:49] * Quits: tantek (n=tantek@w-mob300-128-62-207-11.public.utexas.edu)
- # [19:57] <zcorpan> hmm. why is the distributed spec out of sync with svn?
- # [20:00] <zcorpan> some issue with revisions after r999?
- # [20:01] <zcorpan> r999 was 2007-08-10, the spec header says 11 August 2007
- # [20:01] <zcorpan> Hixie: ^
- # [20:02] * takkaria starts doing some research on longdesc properly
- # [20:07] * Parts: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [20:08] * Joins: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [20:15] * Quits: Codler (n=Codler@84-218-5-159.eurobelladsl.telenor.se) (Connection timed out)
- # [20:16] * Parts: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [20:16] * Joins: zcorpan_ (n=zcorpan@84-216-40-215.sprayadsl.telenor.se)
- # [20:16] <zcorpan_> http://methisto.blogspot.com/2007/08/html5-trailer-find-your-hero.html
- # [20:18] * Joins: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [20:18] <zcorpan_> krijnh: in the logs, perhaps you could have a default right border or padding so that the text doesn't jump around when you hover it?
- # [20:37] <Lachy> cool trailer :-)
- # [20:38] * Quits: zcorpan (n=zcorpan@84-216-42-194.sprayadsl.telenor.se) (Read error: 110 (Connection timed out))
- # [20:40] * Quits: maikmerten (n=maikmert@Lafbe.l.pppool.de) (Remote closed the connection)
- # [20:44] <Hixie> zcorpan_: the draft on whatwg.org is my working copy -- i updated it to let people see my wip on alt="", but it's not done yet
- # [21:33] * Joins: tantek (n=tantek@rrcs-67-78-64-62.sw.biz.rr.com)
- # [21:35] * Joins: aroben (n=adamrobe@c-67-160-250-192.hsd1.ca.comcast.net)
- # [21:35] * Quits: aroben (n=adamrobe@unaffiliated/aroben) (Remote closed the connection)
- # [21:36] * Joins: aroben (n=adamrobe@c-67-160-250-192.hsd1.ca.comcast.net)
- # [21:43] <takkaria> http://wiki.whatwg.org/wiki/Longdesc_usage
- # [21:43] <takkaria> if anyone's interested
- # [21:44] * Quits: mpt (n=mpt@121-72-130-73.dsl.telstraclear.net) ("This computer has gone to sleep")
- # [21:44] <takkaria> of the 62 pages that philip gave with <img longdesc="">, I reckon two of them are valid uses according to the spec
- # [21:44] * Joins: mpt (n=mpt@121-72-130-73.dsl.telstraclear.net)
- # [21:51] <Philip`> Two? That's probably better than td@headers :-)
- # [21:51] <takkaria> one of them is explaining what an acessibility icon means
- # [21:52] <takkaria> the other one at one point links to a text/plain document telling the user they can download speech software to hear the website
- # [21:53] <takkaria> overall, I'd have to say that longdesc is horribly broken
- # [21:53] <Philip`> The data does appear to suggest that invisible metadata is bad - authors get no feedback when they do it wrong, and so they almost never get it right
- # [21:54] <Philip`> "3 of which look like SEO attempts (13, 18, 48, 61)" - that's four, not three
- # [21:55] <takkaria> true
- # [21:55] * takkaria fixes
- # [21:55] <takkaria> if I ever do this again, I'm going to use something other than pen and paper
- # [21:56] <Philip`> Perhaps it'd be possible to use html5lib to extract the relevant data from the pages
- # [21:57] <takkaria> well, I used a system of curl'ing all the pages to numerically numbered files, then grepping them for "longdesc", then noting down in a column what category they fit into
- # [21:57] <takkaria> hooking up html5lib to a "is this right? yes/no" UI would simplify it a lot in future, I imagine
- # [21:57] <takkaria> I don't know python or ruby, though, so I'm no help there
- # [21:58] <Philip`> An html5grep might be nice
- # [21:58] * Quits: KevinMarks (n=KevinMar@c-76-102-254-252.hsd1.ca.comcast.net) ("The computer fell asleep")
- # [21:58] <Philip`> (so you can find elements/attributes/etc without worrying about case-sensitivity or extraneous whitespace or whatever)
- # [22:03] <Philip`> http://search.cpan.org/dist/XML-Twig/tools/xml_grep/xml_grep - ah, there's already at least one XML one - all it needs is an HTML5 parser stuck on the front
- # [22:04] * Joins: weinig (i=weinig@nat/apple/x-73fd7878da5a9772)
- # [22:15] <takkaria> what should I do with that page? should I post it to public-html or whatwg?
- # [22:25] * Philip` doesn't know whether the chance of starting a new longdesc discussion on public-html should be considered an opportunity or a danger
- # [22:26] <takkaria> I reckon it's probably a danger at the moment
- # [22:26] <takkaria> I just wonder what to do with the page now I've made it. :)
- # [22:27] <Hixie> i recommend waiting until i post my results and start looking at the topic, and then forwarding it to the relevant list :-)
- # [22:27] <Philip`> Maybe you could mention it on the WHATWG list so people know it exists, and when some longdesc discussion starts again on public-html then point to the page since it's useful data
- # [22:28] * Quits: tantek (n=tantek@rrcs-67-78-64-62.sw.biz.rr.com) (Read error: 110 (Connection timed out))
- # [22:28] <takkaria> Hixie: I can do that, assuming I'm still subscribed in a couple years' time ^_^
- # [22:29] <Hixie> that's a pretty cool study
- # [22:29] <Hixie> were _any_ of them correctly used?
- # [22:30] <takkaria> two, sort=of. but while they might have been correct, they weren't useful
- # [22:30] <takkaria> I was greatly amused by the number of sites trying to do SEO though
- # [22:31] <Philip`> How many were non-correct but would still be useful if a UA acted in a certain way?
- # [22:31] * Hixie logs in to his work computer to see what his own longdesc study's results were like and finds that his ruby parser study, which he was running over the weekend, completed
- # [22:31] <Hixie> sweet
- # [22:34] <Hixie> let's see
- # [22:34] <Hixie> longdesc
- # [22:34] <takkaria> Philip`: er, I'm not sure any
- # [22:34] <hsivonen> w many were non-correct but would still be useful if a UA acted in a certain way?
- # [22:34] <hsivonen> oops. sorry
- # [22:35] <Hixie> so my script found some hundred billion or so longdescs
- # [22:35] <Hixie> er
- # [22:35] <Hixie> hundred billion or so <img>s rather
- # [22:35] <Hixie> of which about a hundred billion had no longdesc=""
- # [22:36] <Hixie> of the hundred million or so that _did_ have longdesc and src and in which neither was blank:
- # [22:37] <Hixie> about 800 thousand had longdesc = src
- # [22:37] <Hixie> about 85 million had longdesc = an ancestor <a href>
- # [22:37] <Hixie> about 7 million pointed to the root of another domain (!)
- # [22:38] * takkaria nods, I got a fair few pointing to the root of another domain
- # [22:38] <Hixie> about 12 million had a space in the longdesc
- # [22:38] * Philip` wonders if the 85 million includes the two million pages of Wikipedia * average number of images per page
- # [22:38] <Hixie> basically i got it down to about 8 million potentially useful longdesc=""s that need further study
- # [22:39] <Hixie> Philip`: might well
- # [22:39] <Hixie> Philip`: wikimedia in general
- # [22:39] <takkaria> nice stats
- # [22:40] <Hixie> looking at the sample of those 8 million, most seem to be URIs to images
- # [22:40] <Philip`> http://en.wikipedia.org/wiki/Image:STS-118_launch_M-cropped.jpg - that one page has five longdescs which match ancestor hrefs
- # [22:41] <hsivonen> Hixie: are you going to post a summary of the results to public-html?
- # [22:41] <Hixie> the result, i think, though, is that longdesc="" data is so widely abused and corrupt that an accessibility tool would get better usability without supporting it than with supporting it
- # [22:41] <Hixie> hsivonen: yeah, when i look at longdesc=""
- # [22:42] * takkaria nods
- # [22:42] <takkaria> ach, I best be off, bbl
- # [22:43] <Philip`> Can an accessibility tool get better usability by automatically determining (/guessing) whether a longdesc is valid or not?
- # [22:43] <Hixie> hard to say
- # [22:44] <Hixie> i don't know how the heuristic would work
- # [22:44] * takkaria is now known as tak|away
- # [22:44] <hsivonen> Hixie: comparing with ancestor href='' and doing a HEAD request to see if the resource is another image
- # [22:45] <Hixie> look at, e.g.: http://photofile.ru/users/marusyan/?page=2
- # [22:46] <Hixie> several of the longdesc=""s on that page look useful to that heuristic
- # [22:48] <zcorpan_> Hixie: ah, ok
- # [22:52] <Hixie> http://wiki.whatwg.org/wiki/Talk:Longdesc_usage
- # [22:52] <Philip`> The exact heuristic could be left as an area for UA developers to innovate in
- # [22:56] <Hixie> i think before we give ATs things to innovate on, we should wait for their products to become usable in the first place.
- # [23:06] * Parts: hasather_ (n=hasather@22.80-203-71.nextgentel.com)
- # [23:09] * Quits: ROBOd (n=robod@86.34.246.154) (Remote closed the connection)
- # [23:22] * Quits: virtuelv (n=virtuelv@96.80-202-82.nextgentel.com) ("Leaving")
- # [23:29] * Quits: hendry (n=hendry@91.84.53.136) ("bedtime")
- # [23:43] * Quits: gsnedders (n=gsnedder@host86-138-194-36.range86-138.btcentralplus.com)
- # [23:46] * Quits: met_ (n=Hassman@r5bx220.net.upc.cz) ("Chemists never die, they just stop reacting.")
- # [23:48] * Joins: gsnedders (n=gsnedder@host86-138-194-36.range86-138.btcentralplus.com)
- # [23:55] * Quits: aroben (n=adamrobe@unaffiliated/aroben)
- # Session Close: Mon Aug 13 00:00:00 2007
The end :)