MPIPZJiao2020

7 Arabidopsis thaliana assemblies

We present chromosome-level, reference-quality assemblies of seven Arabidopsis thaliana accessions (An-1, C24, Cvi-0, Eri-1, Kyo, Ler, Sha) selected from across the global range of this predominantly ruderal plant. The sequences were assembled from PacBio long reads (45-71x coverage) using Falcon, Canu and MECAT. Chromosome-level scaffolding was based on similarity to the reference sequence and validated with genetic maps. Protein-coding genes were annotated independently in each assembly. All types of structural variants and local sequence variants were identified using the whole-genome comparison tool SyRI.

The genome assemblies, gene annotations, orthologous relationships, and sequence differences of any kind are available in our Data Center. Seeds of the accessions are available from the ABRC stock center.

A manuscript describing assembly and analysis of these genomes can be found on bioRxiv.

Qualities



Metric              Col-0*   An-1     C24      Cvi-0    Eri-1    Kyo      Ler      Sha
Contigs             -        151      167      140      200      230      149      143
Pseudo-molecules    5        5        5        5        5        5        5        5
Contig N50 (Mbp)    -        8.2      4.8      7.4      4.8      9.1      11.2     7.0
Contig CL50**       -        2        2        2        2        2        1        1
Chr. length (Mbp)   119.1    118.4    117.7    118.3    117.7    118.8    118.5    118.4
Genes               27,445   27,342   27,214   27,098   27,285   27,574   27,376   27,293

*Reference sequence

**Chromosome-number-normalized L50 (Jiao et al., 2017, Genome Res.)
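For readers unfamiliar with the contiguity metrics in the table, the sketch below shows how contig N50 and L50 are conventionally computed from a list of contig lengths. It is a minimal illustration and not the code used to produce the values above: the function name `n50_l50` and the example lengths are hypothetical. The chromosome-number normalization that turns L50 into CL50 is defined in the cited paper and is not reproduced here.

```python
from typing import Iterable, Tuple


def n50_l50(contig_lengths: Iterable[int]) -> Tuple[int, int]:
    """Return (N50, L50) for a collection of contig lengths.

    N50 is the length of the shortest contig in the smallest set of
    longest contigs that together cover at least half of the assembly.
    L50 is the number of contigs in that set.
    """
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths:
        raise ValueError("at least one contig length is required")

    total = sum(lengths)
    running = 0
    for count, length in enumerate(lengths, start=1):
        running += length
        # Integer comparison avoids floating-point issues with total / 2.
        if 2 * running >= total:
            return length, count

    # Unreachable for non-empty input, since the full sum covers the total.
    raise RuntimeError("N50 could not be determined")


if __name__ == "__main__":
    # Hypothetical contig lengths (arbitrary units), for illustration only.
    example = [2, 4, 6, 8, 10]
    n50, l50 = n50_l50(example)
    print(f"N50 = {n50}, L50 = {l50}")
```

A higher N50 and a lower L50 both indicate a more contiguous assembly, which is why the table reports them alongside the raw contig counts.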

Contact


For questions concerning the data, contact Wen-Biao Jiao; for problems downloading or accessing the data, contact Joffrey Fitz.