Assemblathon 1: a competitive assessment of de novo short read assembly methods



Download 1.22 Mb.
Page3/7
Date31.03.2018
Size1.22 Mb.
#44740
1   2   3   4   5   6   7

Bibliography





Pop08: , (Pop & Salzberg, 2008),

Tra09: , (Trapnell & Salzberg, 2009),

Cha2: , (Chaisson, Brinza, & Pevzner, Genome Res, 2009),

Pan08: , (Pandey, Nutter, & Prediger, 2008),

Eid1: , (Eid, et al., 2009),

Pou06: , (Pourmand, et al., 2006),

Pou06: , (Pourmand, et al., 2006),

Mil1: , (2010),

Mye3: , (Myers E. , 1995),

Mye1: , (Myers E. , 2005),

Ken1: , (Kent & Haussler, 2001),

Mye2: , (Myers, et al., 2000),

Ven01: , (Venter, et al., 2001),

Bat1: , (Batzoglou, et al., 2002),

Mul1: , (Mullikin & Ning, 2003),

Lan1: , (Lander, et al., 2001),

Mou1: , (Mouse Genome Sequencing Consortium, et al., 2002),

Her1: , (Hernandez, François, Farinelli, Osterås, & Schrenzel, 2008),

War1: , (Warren, Sutton, Jones, & Holt, 2007),

Doh1: , (Dohm, Lottaz, Borodina, & Himmelbauer, 2007),

Jec1: , (Jeck, et al., 2007),

Pev1: , (Pevzner, Tang, & Waterman, 2001),

Cha1: , (Chaisson & Pevzner, Genome Res, 2008),

But1: , (Butler, et al., 2008),

Zer1: , (Zerbino & Birney, 2008),

Sim1: , (Simpson, Wong, Jackman, Schein, Jones, & Birol, 2009),

Li1: , (Li, et al., 2010),

Sim2: , (Simpson & Durbin, Bioinformatics, 2010),

Nar11: , (Narzisi & Mishra, 2011),

Lin01: , (Lin, LI, Shen, Zhang, Papasian, & Deng, 2011),

Hus01: , (Huson, Halpern, Lai, Myers, Reinert, & Sutton, 2001),

Lin1: , (Lindblad-Toh, et al., 2005),

Gne1: , (2011),

Mac1: , (Maccallum, et al., 2009),

Gne1: , (Gnerre, et al., 2011),

Mye2: , (Myers, et al., 2000),

Lan1: , (Lander, et al., 2001),

Bat1: , (Batzoglou, et al., 2002),

War1: , (Warren, Sutton, Jones, & Holt, 2007),

Jec1: , (Jeck, et al., 2007),

Doh1: , (Dohm, Lottaz, Borodina, & Himmelbauer, 2007),

Cha1: , (Chaisson & Pevzner, Genome Res, 2008),

Zer1: , (Zerbino & Birney, 2008),

But1: , (Butler, et al., 2008),

Fuj1: , (Fujita, et al., 2011),

Ill: , (Illumina, Illumina HiSeq 2000),

Pat2: , (Paten, Earl, Nguyen, Diekhans, Zerbino, & Haussler, 2011),

Alt1: , (Altschul, Gish, Miller, Myers, & Lipman, 1990),

Pat1: , (Paten, et al., 2011),

Ber2: , (Bergeron, Mixtacki, & Stoye, 2006),

Med1: , (Medvedev & Brudno, 2009),

Ale1: , (Alekseyev & Pevzner, 2009),

Hol1: , (Holmes & Durbin, 1998),

Dar1: , (Darling, Mau, & Perna, 2010),

Wor1: , (Worley & Gibbs, 2010),

Alk1: , (Alkan, Sajjadian, & Eichler, 2011),

Fuj1: , (Fujita, et al., 2011),

Hed1: , (Hedges, Dudley, & Kumar, 2006),

Li1: , (Li, et al., 2010),

Ric1: , (Richter, Ott, Auch, Schmid, & Huson, 2008),

Kor09: , (Korbel, et al., 2009),

DuJ09: , (Du, Bjornson, Zhang, Kong, Snyder, & Gerstein, 2009),

Dör08: , (Döring, Weese, Rausch, & Reinert, 2008),

Gre1: , (Green, et al., 2010),

Ewi1: , (Ewing, Hillier, Wendl, & Green, 1998),

Pat1: , (Paten, et al., 2011),

Bla1: , (Blanchette, et al., 2004),

Dun1: , (Dunham, et al., 2004),

Kur08: , (Kurtz, Narechania, Stein, & Ware, 2008),



Figure 1 — The phylogeny of the simulated haploid genomes. The root genome derives from human chromosome 13. The α1 and α2 haplotypes form the diploid genome from which we generated reads. The β1 and β2 haplotypes form a diploid out-group genome that was made available to the assemblers.



Figure 2 — N50 statistics. Assemblies are sorted left to right in descending order by scaffold path NG50. Data points for each assembly are slightly offset along the x-axis in order to show overlaps.


Figure 3 — An adjacency graph example demonstrating threads, contig paths and scaffold paths. Each stack of boxes represents a block edge. The nodes of the graph are represented by the left and right ends of the stacked boxes. The adjacency edges are groups of lines that connect the ends of the stacked boxes. Threads are represented inset within the graph as alternating connected boxes and coloured lines. There are three threads shown: black, gray and red. The black and gray threads represent two haplotypes, there are many alternative haplotype threads that result from a mixture of these haplotype segments which are equally plausible given no additional information to de-convolve them. The red thread represents an assembly sequence. For the assembly thread consistent adjacencies are shown in solid red. The dashed red line between the right end of block g and the left end of block i represents a structural error (deletion). The dashed red line between the right end of block k and the left end of block m represents a scaffold gap, because the segment of the assembly in block n contains wild card characters. The example therefore contains three contig paths: (from left to right) blocks ag ACTGAAATCGGGACCCC; blocks i, j, k GGAAC; and block m CC. However the example contains only two scaffold paths because the latter two contig paths are concatenated to form one scaffold path.


Download 1.22 Mb.

Share with your friends:
1   2   3   4   5   6   7




The database is protected by copyright ©ininet.org 2024
send message

    Main page