Selected DSL datasets
1. Datasets used for the paper: L.E. Brown, I. Tsamardinos, C.F. Aliferis. "A Novel Algorithm for Scalable and Accurate Bayesian Network Learning", MEDINFO, 2004 [pdf 8 offline]
ALARM [zip 8 offline]
ALARM x3 [zip 8 offline]
ALARM x5 [zip 8 offline]
Child [zip 8 offline]
Munin1 [zip 8 offline]
Gene [zip 8 offline]
Each archive contains data files in Matlab format. See excerpt from the reference paper for more information [pdf 8 offine].
2. Golub's Leukemia dataset:
[data 8 offline],
[reference paper],
[source].
Dataset is provided in tab separated text format (columns correspond to variables (genes); rows correspond to patients; diagnosis (0=ALL, 1=AML) is the first variable).
3. Datasets used for the paper: A. Statnikov, C.F. Aliferis, I. Tsamardinos. "Methods for Multi-Category Cancer Diagnosis from Gene Expression Data: A Comprehensive Evaluation to Inform Decision Support System Development", MEDINFO, 2004 [pdf 8 offline]
Available online at http://www.gems-system.org