| 1 | DAWG VERSION 1.0.0 - DNA ASSEMBLY WITH GAPS |
|---|
| 2 | |
|---|
| 3 | Copyright (C) (2005) Reed A. Cartwright - All rights reserved. |
|---|
| 4 | |
|---|
| 5 | DESCRIPTION |
|---|
| 6 | |
|---|
| 7 | Dawg is an application that will simulate nucleotide evolution with gaps. |
|---|
| 8 | |
|---|
| 9 | CONTACT |
|---|
| 10 | |
|---|
| 11 | rac@uga.edu or reed@scit.us |
|---|
| 12 | |
|---|
| 13 | LICENSE |
|---|
| 14 | |
|---|
| 15 | See COPYING for license information. |
|---|
| 16 | |
|---|
| 17 | INSTALLATION |
|---|
| 18 | |
|---|
| 19 | See INSTALL for installation instructions. |
|---|
| 20 | |
|---|
| 21 | DOWNLOAD |
|---|
| 22 | |
|---|
| 23 | Dawg 1.0.0 can be downloaded from <http://www.genetics.uga.edu/sw/>, all |
|---|
| 24 | later versions can be downloaded from the development page for Dawg, |
|---|
| 25 | <http://scit.us/dawg/>. |
|---|
| 26 | |
|---|
| 27 | MAILING LIST |
|---|
| 28 | |
|---|
| 29 | To participate in the development of Dawg you can subscribe to the mailing |
|---|
| 30 | list at <http://scit.us/mailman/listinfo/dawg>. |
|---|
| 31 | |
|---|
| 32 | EXAMPLES |
|---|
| 33 | example0.dawg - minimal |
|---|
| 34 | example1.dawg - typical usage |
|---|
| 35 | example2.dawg - simple indel formation |
|---|
| 36 | example3.dawg - robust indel formation |
|---|
| 37 | example4.dawg - recombination |
|---|
| 38 | |
|---|
| 39 | COMMAND LINE USAGE |
|---|
| 40 | dawg -[scubvh?] file1 [file2...] |
|---|
| 41 | -s: process files serially [default] |
|---|
| 42 | -c: process files combined together |
|---|
| 43 | -u: unbuffered output |
|---|
| 44 | -b: buffered output [default] |
|---|
| 45 | -v: display version information |
|---|
| 46 | -h: display help information |
|---|
| 47 | -?: same as -h |
|---|
| 48 | |
|---|
| 49 | Dawg will read stdin if filename is "-". |
|---|
| 50 | |
|---|
| 51 | FILE FORMAT |
|---|
| 52 | The file format takes a series of statements in the form of "name = value," |
|---|
| 53 | where "name" is alphanumeric and value can be a string, number, boolean, |
|---|
| 54 | tree, or vector of values. A single variable is equivalent to a vector of |
|---|
| 55 | a single entry. |
|---|
| 56 | |
|---|
| 57 | string: "[char-sequence]" |
|---|
| 58 | <<EOF [several lines] EOF |
|---|
| 59 | number: [sign]digits[.digits][(e|E)[sign]digits] |
|---|
| 60 | boolean: true|false |
|---|
| 61 | tree: Newick Format |
|---|
| 62 | vector: { value, value, ...} |
|---|
| 63 | |
|---|
| 64 | OPTIONS |
|---|
| 65 | Name Type Description |
|---|
| 66 | -------------------------------------------------------------------------- |
|---|
| 67 | Tree VT phylogeny |
|---|
| 68 | TreeScale N coefficient to scale branch lengths by |
|---|
| 69 | Sequence VS root sequences |
|---|
| 70 | Length VN length of generated root sequences |
|---|
| 71 | Rates VVN rate of evolution of each root nucleotide |
|---|
| 72 | Model S model of evolution: GTR|JC|K2P|K3P|HKY|F81|F84|TN |
|---|
| 73 | Freqs VN nucleotide (ACGT) frequencies |
|---|
| 74 | Params VN parameters for the model of evolution |
|---|
| 75 | Width N block width for indels and recombination |
|---|
| 76 | Scale VN block position scales |
|---|
| 77 | Gamma VN coefficients of variance for rate heterogenity |
|---|
| 78 | Alpha VN shape parameters |
|---|
| 79 | Iota VN proportions of invariant sites |
|---|
| 80 | GapModel VS models of indel formation: NB|PL|US |
|---|
| 81 | Lambda VN rates of indel formation |
|---|
| 82 | GapParams VVN parameter for the indel model |
|---|
| 83 | Reps N number of data sets to output |
|---|
| 84 | File S output file |
|---|
| 85 | Format S output format: Fasta|Nexus|Phylip|Clustal |
|---|
| 86 | GapSingleChar B output gaps as a single character |
|---|
| 87 | GapPlus B distinguish insertions from deletions in alignment |
|---|
| 88 | LowerCase B output sequences in lowercase |
|---|
| 89 | Translate B translate outputed sequences to amino acids |
|---|
| 90 | NexusCode S text or file to include between datasets in Nexus format |
|---|
| 91 | Seed VN PRNG seed (integers) |
|---|
| 92 | |
|---|
| 93 | DEFAULTS |
|---|
| 94 | |
|---|
| 95 | TreeScale = 1.0 |
|---|
| 96 | Length = 100 |
|---|
| 97 | Model = "JC" |
|---|
| 98 | Freqs = {0.25,0.25,0.25,0.25} |
|---|
| 99 | Params = {1.0,1.0,1.0,1.0,1.0,1.0} |
|---|
| 100 | Width = 1 |
|---|
| 101 | Scale = 1.0 |
|---|
| 102 | Gamma = 0.0 |
|---|
| 103 | Iota = 0.0 |
|---|
| 104 | GapModel = "US" |
|---|
| 105 | GapParams = 1.0 |
|---|
| 106 | Reps = 1 |
|---|
| 107 | Format = "Fasta" |
|---|
| 108 | GapSingleChar = false |
|---|
| 109 | GapPlus = false |
|---|
| 110 | LowerCase = false |
|---|
| 111 | Translate = false |
|---|
| 112 | |
|---|
| 113 | NOTES |
|---|
| 114 | |
|---|
| 115 | The meaning of the "Params" vector is different for each substitution model. |
|---|
| 116 | GTR: Substitution rates A-C, A-G, A-T, C-G, C-T, G-T |
|---|
| 117 | JC: Ignored |
|---|
| 118 | K2P: Transition rate, Transversion rate |
|---|
| 119 | K3P: Alpha (Transitions), Beta (A-T & G-C), Gamma (A-C & G-T) |
|---|
| 120 | HKY: Transition rate, Transversion rate |
|---|
| 121 | F81: Ignored |
|---|
| 122 | F84: Kappa |
|---|
| 123 | TN: Alpha1 (A-G), Alpha2 (C-T), Beta (Transversions) |
|---|
| 124 | |
|---|
| 125 | Parameter "Freqs" is ignored by the models "JC", "K2P", and "K3P". |
|---|
| 126 | |
|---|
| 127 | If "Lambda" is a single value, then it specifies the rate of indel formation, e.g. |
|---|
| 128 | "Lambda = 0.1" is the same as "Lambda = {0.05, 0.05}". The first parameter is the |
|---|
| 129 | insertion rate and the second parameter is the deletion rate. |
|---|
| 130 | |
|---|
| 131 | The first parameter of "GapModel" specifies the distribution model of insertion sizes. |
|---|
| 132 | The second parameter specifies the distribution model of deletion sizes. If only one |
|---|
| 133 | parameter is given it is the model for both insertions and deletions. |
|---|
| 134 | |
|---|
| 135 | The first parameter of "GapParams" is a vector specifying the parameters for the |
|---|
| 136 | gap model of insertions. Likewise the second parameter is a vector specifying the |
|---|
| 137 | parameters for the gap model of deletions. If "GapParams" is not a vector of vectors, |
|---|
| 138 | then it specifies the vector of parameters for both insertions and deletions. |
|---|
| 139 | |
|---|
| 140 | The meaning of the GapParams vector is different for each gap model. |
|---|
| 141 | US: The distribution of gap sizes. |
|---|
| 142 | NB: The number of failures (r), the probability of success (q). |
|---|
| 143 | PL: The rate parameter (a), the maximum gap size. |
|---|
| 144 | |
|---|
| 145 | To create a recombinant tree, you may need to specifically describe and label the |
|---|
| 146 | inner nodes at which the recombination events occur. See example4.dawg. |
|---|
| 147 | |
|---|
| 148 | Gamma takes precidence over Alpha. |
|---|
| 149 | |
|---|
| 150 | Sequence takes precidence over Length. |
|---|
| 151 | |
|---|
| 152 | If NexusCode is the name of a file, the code is read from that file. |
|---|
| 153 | |
|---|
| 154 | The following vector parameters have a size of "Width": "Scale", "Alpha", "Gamma", |
|---|
| 155 | and "Iota". If their size is less than width then the first value in the vector |
|---|
| 156 | will be used to fill in the rest of the values, e.g. "Scale = 1.0" is the same as |
|---|
| 157 | "Scale = {1.0,1.0,1.0}" when "Width = 3". |
|---|