Validation of large data sets, an essential prerequisite for data analysis: an analytical survey of the Bone Marrow Donors Worldwide

1996 
Large data sets like the Bone Marrow Donors Worldwide (BMDW) data set can be used for population genetic analyses. The qualities of such data sets are unique. To be able to use the BMDW data for analyses, several problems, like limited size and selective DR typing, of the data have to be solved and the quality of the registry data subsets has to be examined. We describe these problems and methods to overcome them. Also, we give an overview of the qualities of the different registry subsets. Sixteen of the twenty-nine examined subsets contain data that can be used for population genetic analysis. We will deal with these analyses in the future. Additionally, we present a method to calculate the minimum number of individuals required for reliable haplotype frequency estimation.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    8
    References
    34
    Citations
    NaN
    KQI
    []