Site NavigationDisclaimerThe views or opinions expressed on this blog are my own and do not necessarily reflect the views or opinions of Oracle Corporation. The views or opinions expressed by visitors on this blog are theirs solely and may not reflect mine. Categories |
Tuesday, August 5. 2008Why is MSNBot ignoring robots.txt?Trackbacks
Trackback specific URI for this entry
No Trackbacks
Comments
Display comments as
(Linear | Threaded)
Lenz,
I think the right address to complain is not <strong>your</strong> blog but the Livesearch blog http://blogs.msdn.com/livesearch
MSNBot could parse the content of your blog, but it is still missing some AI to automatically create a bug report and assign reponsible people at MS to it <img src="/templates/default/img/emoticons/wink.png" alt=";-)" style="display: inline; vertical-align: bottom;" class="emoticon" />
MS and .TXT: does it work with CRLF line terminators? At least that's what you have to use in Microsoft compatible TXT-files elsewhere... (NB: and in .BAT files if you don't like commands randomly skipped). Which MIME type does your web server use?
Did it actually exclude /fisheye/ ?
PS: Is it possible to not have all of my comment in one paragraph?
Whilst I totally agree, Microsoft should follow the robots.txt standard. But you know what - it's not really a standard as much as we would like to think it is.
If you recall a year or so ago, Google lost a major court case because of robots.txt. Ive blogged about it if you have a chance to read it. http://blog.sherifmansour.com/?p=16
The MSN service was again rebranded, this time as a more traditional Internet access service. With the MSN 2.5 release in late 1997, some exclusive content was still offered through the MSN Program Viewer, but the service mainly directed members to "normal" web sites. With the MSN Internet Access 2.6 release in 1998, the MSN Program Viewer was abandoned entirely in favor of the more familiar Internet Explorer interface.
Hope this helps,
Andy Colleman
|
QuicksearchCalendar
ArchivesShow tagged entriesCreative Commons |
|||||||||||||||||||||||||||||||||||||||||||||||||