source: release/1.0.0/README @ 730

Revision 212, 5.4 KB checked in by admin, 8 years ago (diff)

Manual Changes

  • Property svn:eol-style set to native
Line 
1DAWG VERSION 1.0.0 - DNA ASSEMBLY WITH GAPS
2
3Copyright (C) (2005) Reed A. Cartwright - All rights reserved.
4
5DESCRIPTION
6
7Dawg is an application that will simulate nucleotide evolution with gaps.
8
9CONTACT
10
11rac@uga.edu or reed@scit.us
12
13LICENSE
14
15See COPYING for license information.
16
17INSTALLATION
18
19See INSTALL for installation instructions.
20
21DOWNLOAD
22
23Dawg 1.0.0 can be downloaded from <http://www.genetics.uga.edu/sw/>, all
24later versions can be downloaded from the development page for Dawg,
25<http://scit.us/dawg/>.
26
27MAILING LIST
28
29To participate in the development of Dawg you can subscribe to the mailing
30list at <http://scit.us/mailman/listinfo/dawg>. 
31
32EXAMPLES
33  example0.dawg - minimal
34  example1.dawg - typical usage
35  example2.dawg - simple indel formation
36  example3.dawg - robust indel formation
37  example4.dawg - recombination
38
39COMMAND LINE USAGE
40  dawg -[scubvh?] file1 [file2...]
41  -s: process files serially [default]
42  -c: process files combined together
43  -u: unbuffered output
44  -b: buffered output [default]
45  -v: display version information
46  -h: display help information
47  -?: same as -h
48
49  Dawg will read stdin if filename is "-".
50
51FILE FORMAT
52  The file format takes a series of statements in the form of "name = value,"
53  where "name" is alphanumeric and value can be a string, number, boolean,
54  tree, or vector of values.  A single variable is equivalent to a vector of
55  a single entry.
56
57  string:  "[char-sequence]"
58           <<EOF [several lines] EOF
59  number:  [sign]digits[.digits][(e|E)[sign]digits]
60  boolean: true|false
61  tree:    Newick Format
62  vector:  { value, value, ...}
63
64OPTIONS
65  Name          Type            Description
66--------------------------------------------------------------------------
67  Tree           VT  phylogeny
68  TreeScale      N   coefficient to scale branch lengths by
69  Sequence       VS  root sequences
70  Length         VN  length of generated root sequences
71  Rates          VVN rate of evolution of each root nucleotide
72  Model          S   model of evolution: GTR|JC|K2P|K3P|HKY|F81|F84|TN
73  Freqs          VN  nucleotide (ACGT) frequencies
74  Params         VN  parameters for the model of evolution
75  Width          N   block width for indels and recombination
76  Scale          VN  block position scales
77  Gamma          VN  coefficients of variance for rate heterogenity
78  Alpha          VN  shape parameters
79  Iota           VN  proportions of invariant sites
80  GapModel       VS  models of indel formation: NB|PL|US
81  Lambda         VN  rates of indel formation
82  GapParams      VVN parameter for the indel model
83  Reps           N   number of data sets to output
84  File           S   output file
85  Format         S   output format: Fasta|Nexus|Phylip|Clustal
86  GapSingleChar  B   output gaps as a single character
87  GapPlus        B   distinguish insertions from deletions in alignment
88  LowerCase      B   output sequences in lowercase
89  Translate      B   translate outputed sequences to amino acids
90  NexusCode      S   text or file to include between datasets in Nexus format
91  Seed           VN  PRNG seed (integers)
92
93DEFAULTS
94
95  TreeScale = 1.0
96  Length = 100
97  Model = "JC"
98  Freqs = {0.25,0.25,0.25,0.25}
99  Params = {1.0,1.0,1.0,1.0,1.0,1.0}
100  Width = 1
101  Scale = 1.0
102  Gamma = 0.0
103  Iota =  0.0
104  GapModel = "US"
105  GapParams = 1.0
106  Reps = 1
107  Format = "Fasta"
108  GapSingleChar = false
109  GapPlus = false
110  LowerCase = false
111  Translate = false
112
113NOTES
114
115The meaning of the "Params" vector is different for each substitution model.
116  GTR: Substitution rates A-C, A-G, A-T, C-G, C-T, G-T
117  JC:  Ignored
118  K2P: Transition rate, Transversion rate
119  K3P: Alpha (Transitions), Beta (A-T & G-C), Gamma (A-C & G-T)
120  HKY: Transition rate, Transversion rate
121  F81: Ignored
122  F84: Kappa
123  TN:  Alpha1 (A-G), Alpha2 (C-T), Beta (Transversions)
124
125Parameter "Freqs" is ignored by the models "JC", "K2P", and "K3P".
126
127If "Lambda" is a single value, then it specifies the rate of indel formation, e.g.
128"Lambda = 0.1" is the same as "Lambda = {0.05, 0.05}".  The first parameter is the
129insertion rate and the second parameter is the deletion rate.
130
131The first parameter of "GapModel" specifies the distribution model of insertion sizes.
132The second parameter specifies the distribution model of deletion sizes.  If only one
133parameter is given it is the model for both insertions and deletions.
134
135The first parameter of "GapParams" is a vector specifying the parameters for the
136gap model of insertions.  Likewise the second parameter is a vector specifying the
137parameters for the gap model of deletions.  If "GapParams" is not a vector of vectors,
138then it specifies the vector of parameters for both insertions and deletions.
139
140The meaning of the GapParams vector is different for each gap model.
141  US: The distribution of gap sizes.
142  NB: The number of failures (r), the probability of success (q).
143  PL: The rate parameter (a), the maximum gap size.
144
145To create a recombinant tree, you may need to specifically describe and label the
146inner nodes at which the recombination events occur.  See example4.dawg.
147
148Gamma takes precidence over Alpha.
149
150Sequence takes precidence over Length.
151
152If NexusCode is the name of a file, the code is read from that file.
153
154The following vector parameters have a size of "Width": "Scale", "Alpha", "Gamma",
155and "Iota".  If their size is less than width then the first value in the vector
156will be used to fill in the rest of the values, e.g. "Scale = 1.0" is the same as
157"Scale = {1.0,1.0,1.0}" when "Width = 3".
Note: See TracBrowser for help on using the repository browser.