Some pointers to where real data can be delved from the web.
Time-series:
- Economic: http://www.economicswebinstitute.org/ecdata.htm
- Industrial: http://homes.esat.kuleuven.be/~smc/daisy/daisydata.html
- TSDL: http://robjhyndman.com/TSDL/
- UK data: http://data.gov.uk/about
- EEG: http://sccn.ucsd.edu/~arno/fam2data/publicly_available_EEG_data.html
- Mike West: http://www.stat.duke.edu/~mw/ts_data_sets.html
- UWO: http://www.stats.uwo.ca/faculty/aim/epubs/datasets/default.htm
Data mining:
-
MLdata: http://mldata.org/
-
UCI data: http://archive.ics.uci.edu/ml/index.html
-
MLDATA: http://mldata.org/
-
Clopinet: http://clopinet.com/challenges/
-
KD nuggets: http://www.kdnuggets.com/datasets/competitions.html
-
Delicious: http://www.delicious.com/pskomoroch/dataset,
-
http://www.datawrangling.com/some-datasets-available-on-the-web
-
Datamob: http://datamob.org
-
Ranking: http://learningtorankchallenge.yahoo.com/,
-
ed.ac.uk: http://www.inf.ed.ac.uk/teaching/courses/dme/html/datasets0405.html
-
Million Song: http://labrosa.ee.columbia.edu/millionsong/
-
Yandex: http://imat-relpred.yandex.ru/en
-
kaggle: http://www.kaggle.com/
-
Mindboggle: http://mindboggle.info/index.html
-
Statistical Machine Translation: http://www.statmt.org/
BioMed:
- Statlib: http://lib.stat.cmu.edu/datasets/
- StatSci: http://www.statsci.org/datasets.html
- Klein book: http://www.mcw.edu/biostatistics/Faculty/Faculty/JohnPKleinPhD/SurvivalAnalysisBook/DataSetsBothEditions.htm
- PhysioMed: http://physionet.caregroup.harvard.edu/physiobank/database/
- PhysioNet: http://www.physionet.org/challenge/
- GLIMs: http://www.sci.usq.edu.au/staff/dunn/Datasets/tech-glms.html
Software:
-
CVX: http://cvxr.com/cvx/
-
Tfocs: http://tfocs.stanford.edu/
-
Mosek: http://www.mosek.com/
-
Shogun: http://www.shogun-toolbox.org/
-
Mahout: http://mahout.apache.org
-
Skikit-learn: https://scikit-learn.org
ML Networks:
-
NERF: http://www.nerf.be/
-
Kurzweil: http://www.kurzweilai.net/
-
Sciencemag: http://www.sciencemag.org/site/feature/data/compsci/machine_learning.xhtml
-
PASCAL: http://www.pascal-network.org/
ML Conferences:
-
NIPS
-
ICML
-
ECML/KDD
-
COLT
-
ALT
-
ICANN
-
ESANN
Blogs:
-
Hunch: http://hunch.net/
-
Nuit Blanche: http://nuit-blanche.blogspot.se/
-
My Biased Coin: http://mybiasedcoin.blogspot.se/
-
Mark Reid’s: http://mark.reid.name/
-
InherentUncertainty: http://www.inherentuncertainty.org/
Some Books:
-
The elements of statistical learning
-
Learning, Prediction, Games
-
Machine Learning
-
Pattern Recognition