Data Handling of EST Sequences

The above is a simplified illustration of how data is handled from sequencing, through processing, to database display. High-throughput sequencers (PC environment) produce the sequence data as either *.SCF or *.ab1 trace files. The core processing (Unix environment, green highlight) cleans the sequence data by removing poor-quality sequences and vector sequence. The cleaned sequences are then searched against public and laboratory sequence repositories with the BLASTN and BLASTX programs (orange highlight), and the result files are formatted for database display. The ACEDB environment (yellow highlight) serves as an interface for viewing the data on multiple platforms. Various tools are available to view and query the database results (Network environment).
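
As a rough illustration of the "removal of poor quality sequences" step, the sketch below trims low-quality bases from both ends of a read and discards reads that end up too short. It is not the pipeline's actual code: the function names, the quality cutoff of 20, and the minimum length of 100 are all assumptions chosen for the example, and vector screening is not shown.

```python
from typing import List, Optional

# Hypothetical thresholds for illustration only; the real pipeline's
# cutoffs are not stated in this document.
MIN_QUALITY = 20   # assumed per-base quality cutoff
MIN_LENGTH = 100   # assumed minimum usable read length


def trim_low_quality_ends(seq: str, quals: List[int],
                          min_quality: int = MIN_QUALITY) -> str:
    """Strip bases below min_quality from both ends of a read."""
    if len(seq) != len(quals):
        raise ValueError("sequence and quality list must be the same length")
    start, end = 0, len(seq)
    # Advance past low-quality bases at the 5' end.
    while start < end and quals[start] < min_quality:
        start += 1
    # Back up past low-quality bases at the 3' end.
    while end > start and quals[end - 1] < min_quality:
        end -= 1
    return seq[start:end]


def clean_read(seq: str, quals: List[int],
               min_quality: int = MIN_QUALITY,
               min_length: int = MIN_LENGTH) -> Optional[str]:
    """Return the trimmed read, or None if it is too short to keep."""
    trimmed = trim_low_quality_ends(seq, quals, min_quality)
    return trimmed if len(trimmed) >= min_length else None
```

For example, `clean_read("ACGTACGT", [5, 10, 30, 30, 30, 30, 8, 3], min_quality=20, min_length=4)` keeps only the four high-quality bases in the middle of the read, while raising `min_length` to 5 causes the read to be dropped.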

