J.A.I.L. | 5 Jul 2004 21:12

Frequently used datasets on machine learning

Reading the WEKA book (Data Mining Practical Machine Learning Tools ...), 
on page 81 I found "... the performance of the 1R procedure was reported on 
sixteen datasets frequently used by machine learning researchers...".
Are those 16 datasets really commonly used? If so, are they freely 
downloadable somewhere?

T.I.A.
Jose Antonio I. López
Grigoris Tsoumakas | 6 Jul 2004 14:02

Re: Frequently used datasets on machine learning

> Reading the WEKA book (Data Mining Practical Machine Learning Tools ...),
> on page 81 I found "... the performance of the 1R procedure was reported on
> sixteen datasets frequently used by machine learning researchers...".
> Are those 16 datasets really commonly used? If so, are they freely
> downloadable somewhere?

You can find several collections of datasets in ARFF format on the
WEKA home page: http://www.cs.waikato.ac.nz/~ml/weka/
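
For readers who have not seen the format: an ARFF file is plain text, with
"@relation" and "@attribute" header lines followed by an "@data" section of
comma-separated rows, and "%" starting a comment. The sketch below is only a
rough illustration of reading such a file in Python. The sample data is made
up (it is not one of the sixteen datasets), and the helper name
read_simple_arff is hypothetical. It assumes a dense file with unquoted
attribute names and does not handle sparse rows or type conversion.

```python
import csv

# Tiny made-up sample, only for illustration (not a real benchmark dataset).
SAMPLE_ARFF = """\
% comment lines start with a percent sign
@relation toy_weather
@attribute outlook {sunny, overcast, rainy}
@attribute temperature numeric
@attribute play {yes, no}
@data
sunny,85,no
overcast,83,yes
rainy,70,yes
"""


def read_simple_arff(text):
    """Parse a simple dense ARFF string into (attribute names, rows).

    Hypothetical helper: assumes unquoted attribute names without spaces
    and leaves every value as a string.
    """
    names = []
    data_lines = []
    in_data = False
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("%"):
            continue  # skip blank lines and comments
        if in_data:
            data_lines.append(line)
            continue
        lowered = line.lower()  # ARFF keywords are case-insensitive
        if lowered.startswith("@attribute"):
            # The second whitespace-separated token is the attribute name.
            names.append(line.split(None, 2)[1])
        elif lowered.startswith("@data"):
            in_data = True
    # csv.reader accepts any iterable of lines.
    rows = [[value.strip() for value in row] for row in csv.reader(data_lines)]
    return names, rows


names, rows = read_simple_arff(SAMPLE_ARFF)
```

For real work an existing ARFF reader is likely a better choice than a
hand-written parser; this only shows how little structure the format has.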
Tom Fawcett | 6 Jul 2004 12:41

Re: Frequently used datasets on machine learning

On Monday 05 July 2004 12:12 pm, J.A.I.L. wrote:
> Reading the WEKA book (Data Mining Practical Machine Learning Tools ...), 
> on page 81 I found "... the performance of the 1R procedure was reported on 
> sixteen datasets frequently used by machine learning researchers...".
> Are those 16 datasets really commonly used? If so, are they freely 
> downloadable somewhere?

See:

ftp://ftp.ics.uci.edu/pub/ml-repos/machine-learning-databases/

Regards,
-Tom
