Volker Simonis | 15 Aug 22:07

Search facility for the list archives?

Are there any plans for a search facility that can search all the
mailing lists from java.net. I meanwhile counted 33 different lists
and from the names it's not always clear where to look/post for a
specific topic.

I think providing a good search tool would greatly increase the
usefullness of the lists alltogether and would reduce the burdon of
double/cross posting.

I also remember that I've already posted to discuss <at> openjdk on the 1st
of March, however the current archives start from May only. Where have
the old postings gone too?

Regards,
Volker

David Herron | 15 Aug 23:04
Favicon

Re: Search facility for the list archives?

Volker Simonis wrote:
Are there any plans for a search facility that can search all the mailing lists from java.net. I meanwhile counted 33 different lists and from the names it's not always clear where to look/post for a specific topic. I think providing a good search tool would greatly increase the usefullness of the lists alltogether and would reduce the burdon of double/cross posting. I also remember that I've already posted to discuss <at> openjdk on the 1st of March, however the current archives start from May only. Where have the old postings gone too? Regards, Volker
Dunno about the old postings.. they were on differently implemented infrastructure.

It seems to me that going to google and entering:

    site:mail.openjdk.java.net search string

would be a very good way to handle this.  But there's a robot.txt saying otherwise...
 wget http://mail.openjdk.java.net/robots.txt
Yields:
User-agent: *
Disallow: /
There must be a reason for this to which I'm not privy.  To me it makes absolute perfect sense to index this stuff because it's the search engines that draw the most traffic to a web site.

- David Herron

 
Volker Simonis | 16 Aug 00:41

Re: Search facility for the list archives?

On 8/15/07, David Herron <David.Herron@...> wrote:
>
>  Volker Simonis wrote:
>  Are there any plans for a search facility that can search all the
> mailing lists from java.net. I meanwhile counted 33 different lists
> and from the names it's not always clear where to look/post for a
> specific topic.
>
> I think providing a good search tool would greatly increase the
> usefullness of the lists alltogether and would reduce the burdon of
> double/cross posting.
>
> I also remember that I've already posted to discuss <at> openjdk on the 1st
> of March, however the current archives start from May only. Where have
> the old postings gone too?
>
> Regards,
> Volker
>
>  Dunno about the old postings.. they were on differently implemented
> infrastructure.
>
>  It seems to me that going to google and entering:
>
>      site:mail.openjdk.java.net search string
>
>  would be a very good way to handle this.  But there's a robot.txt saying
> otherwise...
>
>  wget http://mail.openjdk.java.net/robots.txt
>  Yields:
>
> User-agent: *
>  Disallow: /
>  There must be a reason for this to which I'm not privy.  To me it makes
> absolute perfect sense to index this stuff because it's the search engines
> that draw the most traffic to a web site.
>
>  - David Herron
>

Hopefully, it is not because the responsible admins are still not sure
if the current infrastructure is the final one :)

Mark Reinhold | 24 Aug 20:15
Favicon

Re: Search facility for the list archives?

> Date: Wed, 15 Aug 2007 14:04:58 -0700
> From: david.herron@...

> ...
> 
> It seems to me that going to google and entering:
> 
>     site:mail.openjdk.java.net search string
> 
> would be a very good way to handle this.  But there's a robot.txt saying
> otherwise...
> 
>      wget http://mail.openjdk.java.net/robots.txt
> 
> Yields:
> 
>     User-agent: *
>     Disallow: /
> 
> There must be a reason for this to which I'm not privy.  To me it makes
> absolute perfect sense to index this stuff because it's the search
> engines that draw the most traffic to a web site.

You're completely right.

There used to be a reason for that robots.txt.  The new openjdk.java.net
domains were actually online a few days before the launch at JavaOne, and
we (obviously) didn't want search engines to index the site prior to the
launch.  We removed the robots.txt files from openjdk.java.net but forgot
to removing the one on mail.o.j.n.

I've now removed it.  Give Google a couple of days, and you should be all
set.

- Mark

Igor Nekrestyanov | 24 Aug 20:19
Favicon

Re: Search facility for the list archives?

Can we also remove https://openjdk.dev.java.net/robots.txt?
So sources will be searchable too.

-igor

Mark Reinhold wrote:
>> Date: Wed, 15 Aug 2007 14:04:58 -0700
>> From: david.herron@...
>>     
>
>   
>> ...
>>
>> It seems to me that going to google and entering:
>>
>>     site:mail.openjdk.java.net search string
>>
>> would be a very good way to handle this.  But there's a robot.txt saying
>> otherwise...
>>
>>      wget http://mail.openjdk.java.net/robots.txt
>>
>> Yields:
>>
>>     User-agent: *
>>     Disallow: /
>>
>> There must be a reason for this to which I'm not privy.  To me it makes
>> absolute perfect sense to index this stuff because it's the search
>> engines that draw the most traffic to a web site.
>>     
>
> You're completely right.
>
> There used to be a reason for that robots.txt.  The new openjdk.java.net
> domains were actually online a few days before the launch at JavaOne, and
> we (obviously) didn't want search engines to index the site prior to the
> launch.  We removed the robots.txt files from openjdk.java.net but forgot
> to removing the one on mail.o.j.n.
>
> I've now removed it.  Give Google a couple of days, and you should be all
> set.
>
> - Mark
>   

Mark Reinhold | 25 Aug 05:45
Favicon

Re: Search facility for the list archives?

> Date: Fri, 24 Aug 2007 22:19:28 +0400
> From: igor.nekrestyanov@...

> Can we also remove https://openjdk.dev.java.net/robots.txt?
> So sources will be searchable too.

So we're in this weird state where most of our content is on
openjdk.java.net, which is hosted on machines that we control
completely, except for the read-only Subversion repository,
which is on openjdk.DEV.java.net, which runs on top of the
regular old java.net infrastructure.

There are, unfortunately, aspects of the java.net infrastructure
over which we have essentially no control, and I'm afraid that
this robots.txt file is one of them.  Sorry.

In the long run, once we've made our Mercurial repositories
public -- and that will be on machines that we control -- then
we can do whatever we want.  We'll also, eventually, have an
OpenGrok instance set up for the OpenJDK sources.  That should
be much more efficient than Google for searching through the
source code.

- Mark

Peter B. Kessler | 16 Aug 01:44
Favicon

Re: Search facility for the list archives?

Volker Simonis wrote:

> Are there any plans for a search facility that can search all the
> mailing lists from java.net. I meanwhile counted 33 different lists
> and from the names it's not always clear where to look/post for a
> specific topic.
> 
> I think providing a good search tool would greatly increase the
> usefullness of the lists alltogether and would reduce the burdon of
> double/cross posting.
> 
> I also remember that I've already posted to discuss <at> openjdk on the 1st
> of March, however the current archives start from May only. Where have
> the old postings gone too?
> 
> Regards,
> Volker

I thought we were depending on

     http://gmane.org/

for searching.  Though, I don't see a way to search a group of
lists, e.g., at

     http://gmane.org/find.php?list=openjdk

but maybe I just don't know the syntax.  I also find that while
I can't find your message by searching for "search facility" on

     http://search.gmane.org/?query=%22search+facility%22&group=gmane.comp.java.openjdk.general

I can find your message by asking for all the topics from that
list and then choosing

     http://comments.gmane.org/gmane.comp.java.openjdk.general/383

Searching for "volker" turns up messages from 2007-03-01 through
2007-08-09, so maybe they are just behind in their indexing.

			... peter

Volker Simonis | 16 Aug 02:12

Re: Search facility for the list archives?

It would be probably a good idea to put a link to
http://gmane.org/find.php?list=openjdk on the main mailing list page
at http://mail.openjdk.java.net/mailman/listinfo and probably also a
corresponding link on the page of each list (e.g.
http://mail.openjdk.java.net/mailman/listinfo/discuss) for people
(like me) that didn't knew gmane.

Maybe a small input form with a "Search at gmane" button below the "To
see the collection of prior postings to the list, visit the discuss
Archives"-line would be even more usefull?

Volker

On 8/16/07, Peter B. Kessler <Peter.Kessler@...> wrote:
> Volker Simonis wrote:
>
> > Are there any plans for a search facility that can search all the
> > mailing lists from java.net. I meanwhile counted 33 different lists
> > and from the names it's not always clear where to look/post for a
> > specific topic.
> >
> > I think providing a good search tool would greatly increase the
> > usefullness of the lists alltogether and would reduce the burdon of
> > double/cross posting.
> >
> > I also remember that I've already posted to discuss <at> openjdk on the 1st
> > of March, however the current archives start from May only. Where have
> > the old postings gone too?
> >
> > Regards,
> > Volker
>
>
> I thought we were depending on
>
>      http://gmane.org/
>
> for searching.  Though, I don't see a way to search a group of
> lists, e.g., at
>
>      http://gmane.org/find.php?list=openjdk
>
> but maybe I just don't know the syntax.  I also find that while
> I can't find your message by searching for "search facility" on
>
>      http://search.gmane.org/?query=%22search+facility%22&group=gmane.comp.java.openjdk.general
>
> I can find your message by asking for all the topics from that
> list and then choosing
>
>      http://comments.gmane.org/gmane.comp.java.openjdk.general/383
>
> Searching for "volker" turns up messages from 2007-03-01 through
> 2007-08-09, so maybe they are just behind in their indexing.
>
>                         ... peter
>


Gmane