/tools/new_operations/coverage.xml

https://bitbucket.org/cistrome/cistrome-harvard/
<tool id="gops_coverage_1" name="Coverage">
  <description>of a set of intervals on second set of intervals</description>
  <command interpreter="python">gops_coverage.py $input1 $input2 $output -1 ${input1.metadata.chromCol},${input1.metadata.startCol},${input1.metadata.endCol},${input1.metadata.strandCol} -2 ${input2.metadata.chromCol},${input2.metadata.startCol},${input2.metadata.endCol},${input2.metadata.strandCol}</command>
  <inputs>
    <param format="interval" name="input1" type="data" help="First dataset">
      <label>What portion of</label>
    </param>
    <param format="interval" name="input2" type="data" help="Second dataset">
      <label>is covered by</label>
    </param>
  </inputs>
  <outputs>
    <data format="interval" name="output" metadata_source="input1" />
  </outputs>
  <code file="operation_filter.py"/>
  <tests>
    <test>
      <param name="input1" value="1.bed" />
      <param name="input2" value="2.bed" />
      <output name="output" file="gops_coverage_out.interval" />
    </test>
    <test>
      <param name="input1" value="1.bed" />
      <param name="input2" value="2_mod.bed" ftype="interval"/>
      <output name="output" file="gops_coverage_out_diffCols.interval" />
    </test>
    <test>
      <param name="input1" value="gops_bigint.interval" />
      <param name="input2" value="gops_bigint2.interval" />
      <output name="output" file="gops_coverage_out2.interval" />
    </test>
  </tests>
  <help>

.. class:: infomark

**TIP:** If your dataset does not appear in the pulldown menu, it is not in interval format. Use "edit attributes" to set the chromosome, start, end, and strand columns.

Find how much of each interval in the first dataset is covered by intervals in the second dataset. The coverage is appended to each interval as two columns: the number of bases covered, and the fraction of the interval's bases that are covered.

-----

**Screencasts!**

See Galaxy Interval Operation Screencasts_ (right click to open this link in another window).

.. _Screencasts: http://wiki.g2.bx.psu.edu/Learn/Interval%20Operations

-----

**Example**


    If the **First dataset** contains genes::

      chr11 5203271 5204877 NM_000518 0 -
      chr11 5210634 5212434 NM_000519 0 -
      chr11 5226077 5227663 NM_000559 0 -
      chr11 5226079 5232587 BC020719  0 -
      chr11 5230996 5232587 NM_000184 0 -

    and the **Second dataset** contains repeats::

       chr11      5203895 5203991 L1MA6     500 +
       chr11      5204163 5204239 A-rich    219 +
       chr11      5211034 5211167 (CATATA)n 245 +
       chr11      5211642 5211673 AT_rich    24 +
       chr11      5226551 5226606 (CA)n     303 +
       chr11      5228782 5228825 (TTTTTG)n 208 +
       chr11      5229045 5229121 L1PA11    440 +
       chr11      5229133 5229319 MER41A   1106 +
       chr11      5229374 5229485 L2        244 -
       chr11      5229751 5230083 MLT1A     913 -
       chr11      5231469 5231526 (CA)n     330 +

    then the result is the coverage density of repeats in the genes::

       chr11 5203271 5204877 NM_000518 0 - 172   0.107098
       chr11 5210634 5212434 NM_000519 0 - 164   0.091111
       chr11 5226077 5227663 NM_000559 0 -  55   0.034678
       chr11 5226079 5232587 BC020719  0 - 860   0.132145
       chr11 5230996 5232587 NM_000184 0 -  57   0.035827

    For example, the following line of output::

      chr11 5203271 5204877 NM_000518 0 - 172   0.107098

    implies that 172 nucleotides, accounting for 10.7% of this interval (chr11:5203271-5204877), overlap with repetitive elements.
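
The arithmetic behind the two added columns can be reproduced with a short Python sketch. This is only an illustration and not the code in gops_coverage.py: the function names ``covered_bases`` and ``coverage`` are hypothetical, coordinates are assumed to be half-open (start included, end excluded), all intervals are assumed to lie on the same chromosome, strand is ignored, and overlapping intervals in the second dataset are assumed to be counted once.

```python
def covered_bases(start, end, others):
    """Count bases of the half-open interval [start, end) covered by `others`.

    `others` is a list of (start, end) pairs assumed to be on the same
    chromosome. Overlapping pairs are counted only once (an assumption).
    """
    # Clip every interval to [start, end) and keep only those that overlap it.
    clipped = sorted(
        (max(start, s), min(end, e))
        for s, e in others
        if min(end, e) > max(start, s)
    )
    total = 0
    cursor = start  # every base left of `cursor` has already been counted
    for s, e in clipped:
        s = max(s, cursor)  # skip the part that was already counted
        if e > s:
            total += e - s
            cursor = e
    return total


def coverage(start, end, others):
    """Return (bases covered, fraction of the interval covered)."""
    bases = covered_bases(start, end, others)
    return bases, bases / float(end - start)


# The two repeats that fall inside NM_000518 in the example above.
repeats = [(5203895, 5203991), (5204163, 5204239)]
bases, fraction = coverage(5203271, 5204877, repeats)
print(bases, round(fraction, 6))  # 172 0.107098
```

Here the two repeats contribute 96 and 76 bases, and the gene spans 1606 bases, so the fraction is 172 / 1606.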

</help>
</tool>