Selected DSL datasets


1. Datasets used for the paper: L.E. Brown, I. Tsamardinos, C.F. Aliferis. "A Novel Algorithm for Scalable and Accurate Bayesian Network Learning", MEDINFO, 2004 [pdf 8 offline]

  • ALARM [zip 8 offline]

  • ALARM x3 [zip 8 offline]

  • ALARM x5 [zip 8 offline]

  • Child [zip 8 offline]

  • Munin1 [zip 8 offline]

  • Gene [zip 8 offline]

    Each archive contains data files in Matlab format. See the excerpt from the reference paper for more information [pdf 8 offline].

2. Golub's Leukemia dataset: [data 8 offline], [reference paper], [source].

    The dataset is provided in tab-separated text format: columns correspond to variables (genes), rows correspond to patients, and the diagnosis (0=ALL, 1=AML) is the first variable.
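
    The layout described above can be read with a few lines of Python. This is a minimal sketch, not a loader shipped with the dataset: the function name load_leukemia is hypothetical, the sample values are made up for illustration, and the code assumes the file has no header row and that every field is numeric.

```python
import csv
import io

# Diagnosis coding as stated in the dataset description.
LABELS = {0: "ALL", 1: "AML"}


def load_leukemia(handle):
    """Parse tab-separated rows into (diagnoses, expression).

    Assumption: no header row; the first field of each row is the
    diagnosis code and the remaining fields are gene values.
    """
    diagnoses, expression = [], []
    for row in csv.reader(handle, delimiter="\t"):
        if not row:
            continue  # skip blank lines
        diagnoses.append(int(float(row[0])))
        expression.append([float(value) for value in row[1:]])
    return diagnoses, expression


# Made-up two-patient sample, only to show the expected shape.
sample = "0\t1.5\t-2.0\t3.25\n1\t0.5\t4.0\t-1.0\n"
diagnoses, expression = load_leukemia(io.StringIO(sample))
print([LABELS[code] for code in diagnoses])
```

    For the real file, pass an open file object instead of the in-memory sample, for example open(path, newline=""), where path is wherever the downloaded data was saved.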

3. Datasets used for the paper: A. Statnikov, C.F. Aliferis, I. Tsamardinos. "Methods for Multi-Category Cancer Diagnosis from Gene Expression Data: A Comprehensive Evaluation to Inform Decision Support System Development", MEDINFO, 2004 [pdf 8 offline]

    Available online at http://www.gems-system.org