Skip to content

Commit 9efbde9

Browse files
committed
v5.7.5
Self-synteny improvement docs
1 parent 0a475a1 commit 9efbde9

30 files changed

Lines changed: 234 additions & 234 deletions

README.md

Lines changed: 2 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,12 +1,10 @@
11
### SyMAP (Synteny Mapping and Analysis Program) ###
22

3-
**Description**: SyMAP is a software package for detecting, displaying, and querying syntenic relationships between a fully sequenced genome and
4-
(i) another sequenced genome, (ii) draft sequence, and (iii) self-synteny. It is designed for divergent eukaryotic genomes (not bacteria).
3+
**Description**: SyMAP is a software package for detecting, displaying, and querying syntenic relationships between a fully sequenced genome and (i) another sequenced genome, (ii) draft sequence, and (iii) self-synteny. It is designed for divergent eukaryotic genomes (not bacteria).
54

65
**Download SyMAP tarball**: https://github.com/csoderlund/SyMAP/releases. The SyMAP tarball contains the executable jar file, all necessary external software, and demo files.
76

8-
**Requirements**: Java v17 or later and MySQL (or MariaDB). SyMAP uses [MUMmer](https://mummer4.github.io) for the sequence to sequence alignment, which requires Perl.
9-
SyMAP has been tested on Linux and MacOS.
7+
**Requirements**: Java v17 or later and MySQL (or MariaDB). SyMAP uses [MUMmer](https://mummer4.github.io) for the sequence to sequence alignment, which requires Perl. SyMAP has been tested on Linux and MacOS.
108

119
**To use**:
1210
1. Download the latest SyMAP tarball.

docs/3D.html

Lines changed: 0 additions & 120 deletions
This file was deleted.

docs/Demo.html

Lines changed: 7 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -154,9 +154,10 @@ <h3>Draft: Demo-Draft to Demo-Seq2</h3>
154154
<h3>Self-synteny: Demo-Seq</h3>
155155

156156
<table class="tx">
157-
<tr><td colspan="2"><tt>Demo-Seq</tt> has been run against itself, resulting in the <ttb>Dot Plot</ttb> shown on the lower right.
158-
There are a few off-diagonal blocks, which may indicate repetitiveness (but these blocks are probably
159-
too small to be meaningful).
157+
<tr><td colspan="2"><tt>Demo-Seq</tt> has been run against itself.
158+
There are a few blocks, but these blocks are probably too small to be meaningful.
159+
The blocks below the diagonal are mirrored above
160+
the diagonal, e.g. 3.5.1 and 5.3.1 are the same.
160161
<tr> <td style="vertical-align: text-top; width: 670px">
161162
<a href="img/demoSumSelf.png">
162163
<img src="img/demoSumSelf.png" alt="demo Sum Self" style="border: 1px solid black; width: 340px"></a>
@@ -217,7 +218,9 @@ <h2>symap.log</h2>
217218
<b>Algo1 verbose</b>
218219

219220
<pre><small>
220-
Loading annotations
221+
Start calculating cluster hits for demo_seq and demo_seq2
222+
Alignment files in data/seq_results/demo_seq_to_demo_seq2/align/
223+
Loading annotations
221224
10,108 demo_seq genes
222225
10,137 demo_seq2 genes
223226
Scan files to create candidate genes from hits

docs/Details.html

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -77,8 +77,12 @@ <h2>Collinear sets</h2>
7777
In the image on the right, two different sets are shown where the pink highlighted hits is the 1st set,
7878
green is the 2nd; these are <a href="UserGuide.html#wheel" class="ext" target="_blank">Color Wheel</a>
7979
<ttx>Highlight1</ttx> and <ttx>Highlight2</ttx> colors. The sets were broken by an extra gene on Human Chr01.
80-
<p>A collinear set either has all genes to the same strand (+/+, -/-) or different (+/-, -/+).
80+
<p>A collinear set either has all hits/genes to the same strand (+/+, -/-) or different (+/-, -/+).
8181
In the image on the right, all genes in both sets are on different strands.
82+
<p><b>Note</b>: if <ttl>Algo2</ttl>
83+
was used, this guarantees that the strands for the two homologous genes are the same as their hit; this is not
84+
the case for <ttl>Algo1</ttl> (see <a href="UserGuide.html#term" target="_blank" class="ext">Terminology</a>).
85+
8286

8387
<p>&nbsp;
8488
<p>The collinear set algorithm does not consider the amount of overlap of the hit to a gene, or the similarity

docs/Docs.html

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -80,7 +80,7 @@
8080
<ul>
8181
<li><a href=SystemHelp.html class="ext" target="_blank">Parameters and Interface</a>
8282
<li><a href=MUMmer.html class="ext" target="_blank">MUMmer with SyMAP</a>
83-
<li><a href=Demo.html class="ext" target="_blank">Demo Results; Synteny Results</a>
83+
<li><a href=Demo.html class="ext" target="_blank">Results: Demo and Synteny</a>
8484
<li><a href=peach.html class="ext" target="_blank">Ordering wild Prunus yedoensis against Prunus persica</a>
8585
<li><a href=TroubleShoot.html class="ext" target="_blank">Trouble Shooting</a>
8686

@@ -122,9 +122,9 @@ <h4>History of SyMAP</h4>
122122
was written to build physical maps from fingerprinted clones, but is obsolete now.
123123
SyMAP was originally written for FPC maps-to-sequence synteny
124124
(<a href="./fpc/UserGuideFPC.html" class="ext" target="_blank">FPC User Guide</a>),
125-
then was upgraded for sequence-to-sequence synteny. FPC was removed from SyMAP in v5.0.8,
126-
but is still available from Github downloads and
127-
<a href="http://www.agcol.arizona.edu/software/symap" class="ext" target="_blank">www.agcol.arizona.edu</a>.
125+
then was upgraded for sequence-to-sequence synteny. FPC was removed from SyMAP in v5.0.9,
126+
but v5.0.8 is still available from Github downloads; if it does not work with latest Java and MySQL,
127+
email me and I will update it for you.
128128
</ttt>
129129
<!---- END BODY -->
130130
</td></tr>

docs/Query.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -634,7 +634,7 @@ <h4>2.2. Single genes columns</h4>
634634
<ttl>Gene Info</ttl> and <ttl>Annotation</ttl> columns, with one additional column, as follows.
635635
<table class="tx">
636636
<tr><td style="vertical-align: text-top;"><ttx>NumHits</ttx>
637-
<td>This is the number of hits to the gene in the ENTIRE databases, except for SELF synteny.
637+
<td>This is the number of hits to the gene in the ENTIRE databases, including SELF synteny.
638638
</table>
639639
For example:
640640
<table><tr>
@@ -652,7 +652,7 @@ <h4>2.2. Single genes columns</h4>
652652
<td>Gene #1.2 is an orphan for Arab-Cabb but has a count of 1 because it is not an orphan in Arab-Brap.
653653
<br>It is not listed in the Arab-Brap-Cabb table because Arab-Brap has a hit to it.
654654
</table>
655-
655+
Note: For self-synteny, the second row of columns will be empty.
656656
<a id="save"></a>
657657
<h4>2.3 Columns and order shown</h4>
658658

docs/Release.html

Lines changed: 70 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -55,19 +55,82 @@
5555

5656
<tr><td style='vertical-align: top; text-align: left; padding:15px;'>
5757
<!---- START BODY -->
58-
For existing databases, there are no database or build changes since <a href="#v560">v5.6.0</a>.
59-
<br>You must have Java v17.0.11 or later, starting with SyMAP v5.5.0.
58+
For existing databases, there are no database or build changes since <a href="#v575">v5.7.5</a> for self-synteny and <a href="#v560">v5.6.0</a> for all other inputs.
59+
You must have Java v17.0.11 or later, starting with SyMAP v5.5.0.
6060

6161
<h3>Summary of recent major releases</h3>
6262
<table>
63-
<tr><td><a href="#v572">v5.7.2</a><td>(2-Sept-25)<td>Improved synteny algorithm using <ttx>Strict</ttx> option.
64-
<tr><td><a href="#v560">v5.6.0</a><td>(23-Jan-25)<td>Improve <ttx>Cluster Hit</ttx> algorithm <ttx>Algo2</ttx>,
63+
<tr><td><a href="#v572">v5.7.5</a><td>(15-Oct-25)<td>Self-synteny has almost complete support.
64+
<tr><td><a href="#v572">v5.7.2</a><td>(02-Sep-25)<td>Improve synteny algorithm using new <ttx>Strict</ttx> option.
65+
<tr><td><a href="#v560">v5.6.0</a><td>(23-Jan-25)<td>New <ttx>Cluster Hit</ttx> algorithm <ttx>Algo2</ttx>,
6566
which is aware of the exon-intron structure.
66-
<tr><td><a href="#v557">v5.5.7</a><td>(8-Aug-24)<td>Add <ttl>xToSymap</ttl> and improve ease of input.
67+
<tr><td><a href="#v557">v5.5.7</a><td>(08-Aug-24)<td>Add <ttl>xToSymap</ttl> and improve ease of input.
6768
<tr><td><a href="#v556">v5.5.6</a><td>(14-July-24)<td>Improved the collinear set algorithm.
68-
<tr><td><a href="#v540">v5.4.0</a><td>(6-Apr-23)<td>Improved <ttx>Cluster Hit</ttx> algorithm <ttx>Algo1</ttx> for the detection of gene homology.
69+
<tr><td><a href="#v540">v5.4.0</a><td>(06-Apr-23)<td>Improved <ttx>Cluster Hit</ttx> algorithm <ttx>Algo1</ttx> for the detection of gene homology.
6970
</table>
7071

72+
<a id="v575"></a>
73+
<h3>Release: v5.7.5 (25-Oct-25)</h3>
74+
<i>Self-synteny</i>: This release has updated algorithms and <ttp>Queries</ttp> works for self-synteny (see <a href="UserGuide.html#self" target="_blank" class="ext">Self-synteny</a>).
75+
<p>For existing self-synteny databases: Re-run the full <ttp>A&S</ttp> algorithm.
76+
<p>
77+
<ul>
78+
<li>Self-synteny algorithm and display. In summary, the updates are:
79+
<ttp>Queries</ttp> works for self-synteny, the summaries are all correct now, the <ttx>Block#</ttx> and <ttx>Hit#</ttx>
80+
are consistent now, and a few tiny display bug fixes. For more detail:
81+
<ul>
82+
<li>Algorithm:
83+
<ul>
84+
<li>Due to internal changes, the results may not be exactly the same but only on small 'iffy' blocks, e.g.
85+
the demo self-synteny is different, but all the blocks were iffy to start with.
86+
<li>The numbering of blocks and hits will be different.
87+
<li>The self-chromsomes no longer have hits representing the diagonal which reduces the number of hits. However,
88+
the diagonal is still shown on the <ttp>Dotplot</ttp>.
89+
<li><ttx>NumHits</ttx> (<ttp>Query</ttp> <ttl>Single</ttl> column) now includes self-synteny hits.
90+
<li><ttx>Collinear sets</ttx> results were not very useable because it did not mirror sets or work for
91+
self-chromosomes; both of these issues have been fixed.
92+
</ul>
93+
<li><ttx>Block#</ttx> of mirrored blocks (same above and below the diagonal) now have the same number;
94+
this is most obvious in <ttp>Dotplot</ttp>.
95+
<li><ttx>Hit#</ttx> two mirrored hits now have the same number; this is most obvious in <ttp>2D</ttp>.
96+
<li><ttp>2D</ttp>: The following are fixed: For the <ttl>Sequence Filter</ttl>, the <ttx>Gene# Hit</ttx> did not work.
97+
For the hit popup, when the two chromsomes were the same, a 2g hit would show the same
98+
gene for both ends of hit.
99+
<li><ttp>Reports...</ttp>: both <ttl>Summary</ttl> and <ttl>Block Report</ttl> only report
100+
blocks and hits for one side of the diagonal (they no longer double the blocks and hits).
101+
<li><ttp>Query</ttp>: Now works for self-synteny with a few exceptions (see <a href="UserGuide.html#self" target="_blank" class="ext">Self-synteny</a>).
102+
</ul>
103+
104+
<li><ttp>Reports:</ttp>
105+
<ul><li>The <tt>Species</tt> columns have been removed and the name is now
106+
over species specific columns. See <a href="UserGuide.html#breport" target="_blank" class="ext">Report</a>.
107+
</ul>
108+
<li>Small things:
109+
<ul>
110+
<li><ttp>Project Manager</ttp>:
111+
<ul>
112+
<li>All computations that read files sorts them before processing; this is not necessary, but more logical for the output.
113+
For <ttx>Cluster Algo1</ttx>, this can slightly change the hits found.
114+
<li>If only one project is selected, the <ttx>Selected Pair</ttx> cell will be shown as selected.
115+
<li>The lower <ttl>All Pairs</ttl> is now <ttl>All &#10003; Pairs</ttl>.
116+
<li><ttl>Verbose</ttl> did not work for the synteny algorithm if not set the first time
117+
it was run.
118+
<li><ttl>Pseudo Genes</ttl> uses <ttx>Directory</ttx> name in comment instead of <ttx>Abbreviation</ttx> since it does not change.
119+
<li><ttx>Align PROmer</ttx> or <ttx>Align NUCmer</ttx> have been added to the <ttp>Summary</ttp>.
120+
</ul>
121+
<li><ttp>2D</ttp>: <ttl>Sequence Filter</ttl> has a few labels changed and re-ordered.
122+
<li><ttp>Queries</ttp>: The number of blocks and collinear sets was wrong, plus unclear what the numbers were.
123+
Also, it did not work right if there was two abbreviations the same; now it warns the user.
124+
</ul>
125+
<li><small>Developers:</small>
126+
<ul>
127+
<li><small>AnchorMain1 self-synteny save to database methods have been re-written to not be
128+
dependent on the exact Schema. The diagonal hits are no longer loaded in database as they are only used
129+
in the Dotplot, which creates the diagonal on the fly. The <tt>stddev</tt> database column was removed and some columns moved around.
130+
Remove <tt>AnchorMain1.BinStats.java</tt></small>.
131+
</ul>
132+
</ul>
133+
71134
<a id="v574"></a>
72135
<h3>Release: v5.7.4 (30-Sept-25)</h3>
73136
Small <ttp>Synteny</ttp> <ttx>Strict</ttx> and interface improvements.
@@ -310,7 +373,6 @@ <h3>Release: v5.6.7 (22-May-25)</h3>
310373
Any java file I looked at, I took out old CASddd comments.</small>
311374
</ul>
312375

313-
314376
<a id="v566"></a>
315377
<h3>Release: v5.6.6 (6-May-25)</h3>
316378
The MacOS M4 silicon executables have been added to the <tt>/ext</tt> directory.
@@ -2132,8 +2194,7 @@ <h3>Release: v5.1.1 (31July22)</h3>
21322194
<li>Scripts:
21332195
<ul>
21342196
<li>The <tt>symap</tt> and <tt>viewSymap</tt> scripts have been changed from Perl to shell.
2135-
<li>The <tt>viewSymap3D</tt> script has been removed, but the <a href="./3D.html" class="ext" target="_blank">3D</a> documentation explains
2136-
how you could recreate the 3D view.
2197+
<li>The <tt>viewSymap3D</tt> script has been removed; 3D is no longer available.
21372198
</ul>
21382199
<li>Data directory
21392200
<ul>

0 commit comments

Comments
 (0)