SOLiD™ System Human (Yoruban) Data Set
Description
- We have sequenced an anonymous individual of African origin using the SOLiD ligation based sequencing assay that enables a unique form of error correction that improves the raw accuracy of the aligned reads to >99.9% allowing us to accurately call SNPs with 2 reads per allele. We collected several billion mate-pair reads yielding ~18X haploid coverage of aligned sequence and close to 300X clone coverage. Over 98% of the reference genome is covered with at least one uniquely placed read and 99.65% spanned by at least one uniquely placed mate-paired clone. We identified over 3.8 million SNPs, 19% of which are novel. Mate-pair data was used to physically resolve haplotype phases of nearly 2/3 of the genotypes obtained, producing phased segments of up to 210 Kb. We detected 229,375 intra-read indels of which half are novel, 5,590 indels between mate-paired reads, 91 inversions and 565 copy number variations. These variants are available here for public use.
Licensing
- This data is being licensed to you under the Applied Biosystems SOLiD™ System Data Use Agreement (Unsupported). Please read the license in its entirety and ensure that you understand the licensing conditions for use. Your use of this data indicates your acceptance of this licensing agreement. Click on this link to read the licensing agreement.
Support
- This data is being released as "unsupported data" by Applied Biosystems
Download
|
|
| Join Project |
Request to join project
|
| Description |
Applied Biosystems SOLiD™ System Human
(Yoruban) variation data
|
| Developer Info |
Aaron Kitzmiller
|
| Trove Categorization |
- Development Status: 5 - Production/Stable
- Intended Audience: Other Audience
- License: Applied Biosystems SOLiD™ System Data Use Agreement (Unsupported)
- Operating System: Not Applicable
- Programming Language: Not Applicable
- Spoken Language: English
- Topic: Bio-Informatics
|
|