In atlas-demo, we demonstrate the steps one has to take in running atlas-type WGS assemblies.
Copy the original files supplied in the downloads to the desired directory (sequence, quality, and vector-masked sequence).
Create trimmed reads file demo.fa.pass.gz (scan in from each end of read looking for 50-base windows of high quality and no vector).
Create indices for demo.fa, demo.fa.qual and demo.fa.screen making it easier to extract reads from the files. The files and indices constitute a simple, file-oriented database that should be replaced with more efficient database tables for a large genome.
Run kmer count (here k=32).
Analyze distribution: find out cutoff repeat frequency, choose cutoff = (2 or 3)*coverage (coverage ~peak in distribution curve).
Create a kill-file for atlas-overlapper.
Separate bins from the overall file of bins and readnames (.fon).
Extract reads from the WGS reads pool (using indices on quality files and vector-masked sequences).
Extend vector masking near ends of read to ends, and undo false masking interior to each read.
Phrap assembly.
For details, please refer to the paper by Havlak, et al.
Paul Havlak, Rui Chen, K. James Durbin, Amy Egan, Yanru Ren, Xing-Zhi Song, George M. Weinstock and Richard A. Gibbs, The Atlas Genome Assembly System Genome Res. 14: 721-732