EST and contig naming

    Contigs

    The assembly was done by first clustering the ESTs into groups of similar sequences, using Paracel PCP. Then each cluster was separately assembled into contigs and singletons using CAP4. The contigs are named by their cluster number and their contig number within that cluster.

    For example "contigs.11413-1" and "contigs.11413-2" are the two CAP4 contigs derived from PCP cluster 11413.

    In the AssembliesDB and CONTIG databases the same numbering is used but the prefix "contigs." is replaced with
    DSW01C4 (Daryl Somers Wheat assembly #01, using CAP4) for the June 2002 build, or
    DSW02C4 (Daryl Somers Wheat assembly #02, using CAP4) for the October 2002 build.

    ESTs

    EST names have a three-letter prefix indicating which wheat variety the EST was obtained from, e.g. CSP for Chinese Spring. The codes are given in the table below.


    Cultivar used Code
    93FHB37 93F
    Atlas ATL
    BH1146 BHI
    Brevor BRE
    Butte 86 BUT
    Catoctin CAT
    Cheyenne CHY
    Chinese Spring CSP
    FHB148 FHB
    Florida FLO
    Frontana FRO
    Glenlea GLN
    Harus HAR
    Jingdong No. 1 JIN
    Mercia MER
    Norin 26 NOR
    Norstar NRS
    Novosibirskaya 67 NOV
    Odeon ODE
    PI 294994 PI2
    Powdery Mildew Resistant line PMR
    Soleil SOL
    Sumai 3 SUM
    TAM W101 TAM
    Thatcher Lr1 THR
    Wyuna WYU
    Yanda 1817 YAN