Skip to content

Google search for 'rust tutorial' returns old content #9955

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
ericsampson opened this issue Oct 19, 2013 · 11 comments
Closed

Google search for 'rust tutorial' returns old content #9955

ericsampson opened this issue Oct 19, 2013 · 11 comments

Comments

@ericsampson
Copy link

I noticed that if you search Google for 'rust tutorial', the current top two links are for non-current versions (0.3 and 0.7 I think?). This could be pretty confusing for new users, who may not realize that they are looking at older content which may not reflect the current implementation. I know this isn't necessarily under Rust's control, but I thought I would log it anyway.

@catamorphism
Copy link
Contributor

Good point. The 0.7 tutorial, at least, has the version number in the lower right-hand corner of the screen. The 0.3 tutorial was created before we started putting the version marker there, but I suppose we could always go back and edit it.

@ericsampson
Copy link
Author

I forgot about that lower-right corner version stamp, it's pretty subtle but at least it's something.

What about for each a) adding the version number in the title header so they would read 'The Rust 0.x Language Tutorial' and b) putting some sort of link to the always-current version near the top? Are these something that could be changed in a template and then regenerate the old files? Maybe this could be useful for a lot of doc pages which will have similar issues with search engines and static links in blog posts etc.

Do you think there's anything that can be done on the search engine side?

@thestinger
Copy link
Contributor

I think we should be purging any documentation from unsupported versions. It's still there in the repository history if someone wants it. Until we have a supported release with bug fixes (LLVM still doesn't!), this would mean only having the latest stable release and the master documentation on the site.

cc @brson

@bstrie
Copy link
Contributor

bstrie commented Oct 19, 2013

In addition to purging the docs of unsupported versions, I think we should forbid Googlebots from crawling anything specific to any point release, including the most recent point release. Which is to say, Google searches for "rust tutorial" should never return anything that's not /doc/master. If people really want the docs for the most recent point releases, that's what the links on rust-lang.org are for. People getting routed to old docs is going to be a living hell for us if we don't take action while we can.

Though I guess first we should figure out if Github Pages even lets you provide a ROBOTS.txt file.

@brson
Copy link
Contributor

brson commented Oct 21, 2013

I'm reluctant to just delete our old release artifacts, since these make up the history of Rust. It would be like deleting my old box of baseball cards sitting in my parents' basement - I need to know they're there for sentimental reasons.

What do I need to do to stop Google from crawling the old docs?

@brson
Copy link
Contributor

brson commented Oct 21, 2013

I could possibly make all the old docs non-public, just keep them archived.

@thestinger
Copy link
Contributor

@brson: using robots.txt would work, and wouldn't require modifying the html to mark them as noindex

@adrientetar
Copy link
Contributor

@brson @thestinger robots.txt is a good idea:

User-agent: *
Disallow: /archive-folder/

OR, in the html head:
<meta name="robots" content="noindex,nofollow">

@adrientetar
Copy link
Contributor

@brson Could you add a robots.txt to the website?
Whenever I google for various things about Rust the first result is tutorial but like 0.3, 0.5 or 0.6; think it could be well confusing for newcomers.

I think we can just stealthly let old doc on the website.

@thestinger
Copy link
Contributor

The documentation is in the git repository history so I don't think we need to keep it around on rust-lang.org. It's quite easy to build the old tutorial/manual since it doesn't require building Rust for that revision.

@brson
Copy link
Contributor

brson commented Jan 8, 2014

I added this file as static.rust-lang.org/robots.txt

User-agent: *
Disallow: /doc/0.3/
Disallow: /doc/0.4/
Disallow: /doc/0.5/
Disallow: /doc/0.6/
Disallow: /doc/0.7/
Disallow: /doc/0.8/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants