application/xhtml+xml?
This document is served as text/html so that Google can at least index this one and follow the links. Let's find out by sitting and waiting, and by keeping track of this search.
Most XHTML files in /stuff/xhtml/ have an XHTML 1.1 DTD and aren't indexed by Google (File Format: Unrecognized).
I had noticed before that Google didn't index some XHTML files, but never knew why. This comment made me think about it again. Does it indeed have to do with the DTD? Some test documents:
Or does it have to do with the XML declaration? I made two other documents:
type attribute?
Perhaps Google sees this attribute and then decides it can't handle the document. Tests:
type="application/xhtml+xml" → not indexed
type="application/xhtml+xml" → not indexed
type="application/xhtml+xml" → not indexed
type="application/xhtml+xml" → not indexed
All these documents have an XML declaration.
(Made on June 20, 2005)
Hmm, so Google doesn't index XHTML files with an XML declaration. Interesting. Without that declaration they are indexed, even though it's an unrecognized file format.
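To sort a pile of test files by the two suspects (XML declaration and DTD), something like the following could help. It is only a rough sketch: the names `describe`, `XML_DECL` and `DOCTYPE_PUBLIC` are made up for illustration, and the regular expressions are a simplification rather than a real XML parser.

```python
import re

# Matches an XML declaration such as <?xml version="1.0" encoding="utf-8"?>
# at the very start of the document (an optional BOM character is tolerated).
XML_DECL = re.compile(r'^\ufeff?<\?xml\s+version\s*=\s*["\'][^"\']*["\'][^?]*\?>')

# Captures the public identifier of a DOCTYPE,
# e.g. "-//W3C//DTD XHTML 1.1//EN".
DOCTYPE_PUBLIC = re.compile(r'<!DOCTYPE\s+html\s+PUBLIC\s+"([^"]+)"', re.IGNORECASE)


def describe(markup):
    """Return (has_xml_declaration, doctype_public_id_or_None) for a document string."""
    has_decl = XML_DECL.match(markup) is not None
    doctype = DOCTYPE_PUBLIC.search(markup)
    return has_decl, (doctype.group(1) if doctype else None)
```

Running `describe` over each test document gives a small table of declaration and DTD per file, which can then be compared by hand with what Google reports for that URL.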
text/html?
Makes sense.
That's probably the application/xhtml+xml MIME type. I guess :)