Available Data-Sets
1. UCI ML - http://www.ics.uci.edu/~mlearn/MLSummary.html
More information on UCI ML DATA SETS - http://haydut.cmpe.boun.edu.tr/datasets.htm
2. UCI KDD - http://kdd.ics.uci.edu/
3. StatLog - http://www.liacc.up.pt/ML/statlog/
4. Financial datasets at OSU - http://fisher.osu.edu/fin/osudata.htm
5. KDNuggets - http://www.kdnuggets.com/datasets/
6. S+a+oo - http://www.statoo.com/en/resources/anthill/Datamining/Data/
(i) David Dowe's data links - http://www.csse.monash.edu.au/~dld/datalinks.html
7. Bayesian Network Repository - http://www.cs.huji.ac.il/labs/compbio/Repository/networks.html
(i) ALARM: Domain: A network by medical experts for monitoring patients in intensive care. http://www.cs.huji.ac.il/labs/compbio/Repository/Datasets/alarm/alarm.htm
8. Synthetic dataset generator
(i) http://www.burningart.com/meico/inventions/datagen/index.html
(ii) http://www.almaden.ibm.com/software/quest/Resources/index.shtml
(iii) http://www.datasetgenerator.com/ - available either as a web application or source code - NOT Available. Found at http://www.dcc.ufmg.br/~meira/ch/tp2/datgen/
(iv) http://www.elsevier.com/gej-ng/10/35/61/15/20/11/40/index.htt - DOS executable and source code - Which is this one ?
(v) weka.datagenerators.* classes in WEKA
9. Heart data: Information about heart: http://www.tmc.edu/thi/anatomy.html
(i) UCI ML - ftp://ftp.ics.uci.edu/pub/machine-learning-databases/heart-disease/. Processed Cleveland data will be useful ftp://ftp.ics.uci.edu/pub/machine-learning-databases/arrhythmia/arrhythmia.names. More useful but has a lot of integer valued features.ftp://ftp.ics.uci.edu/pub/machine-learning-databases/echocardiogram/4-5 features are numeric and continuous
(ii) Harvard - http://cardiogenomics.med.harvard.edu/groups/proj1/pages/download_home1.html
(iii) Not available yet - http://www.nhsia.nhs.uk/datasets/pages/default.asp
(iv) Canine heart - requires registration - http://www.ccbm.jhu.edu/. Rabbit data available - http://www.cmbl.jhu.edu/Data/Heart R-S Web Page/3d_data.htm#rabbit heart #1
(v) 1948 - Framingham Heart study -http://www.maths.utas.edu.au/DHStat/Data/Flow.html
10. Elements of statistical learning - http://www-stat-class.stanford.edu/~tibs/ElemStatLearn/
11. Datasets at the Department of Statistics, University of Munich - http://www.stat.uni-muenchen.de/service/datenarchiv/welcome_e.html
12. Delve Datasets - http://www.cs.toronto.edu/~delve/
13. WEKA - http://www.cs.waikato.ac.nz/~ml/weka/
14. MLnet Online Information Service - http://www.mlnet.org/cgi-bin/mlnetois.pl/?File=datasets.html. Homepage - http://www.mlnet.org/welcome.html
15. USPS (ZIP-Code) dataset - http://cervisia.org/machine_learning_data.php
16. Publicly Available Data Sets - http://www.di.unito.it/~mluser/datasets.html
17. StatLib Datasets Archive - http://lib.stat.cmu.edu/datasets/
18. Network intrusion: http://ivpr.cs.uml.edu/shootout/network.html
19. Network intrusion: MIT data: http://www.ll.mit.edu/IST/ideval/data/2000/LLS_DDOS_1.0.html
|