/tools/metag_tools/blat_coverage_report.xml

https://bitbucket.org/cistrome/cistrome-harvard/ · XML · 75 lines · 50 code · 25 blank · 0 comment · 0 complexity · 1b30da1c17a6510523929dabd8007c65 MD5 · raw file

  1. <tool id="generate_coverage_report" name="Polymorphism of the Reads">
  2. <description>the percentage of reads supporting each nucleotide at each location</description>
  3. <command interpreter="python">blat_coverage_report.py $input1 $output1</command>
  4. <inputs>
  5. <param name="input1" type="data" format="tabular" label="Alignment result"/>
  6. </inputs>
  7. <outputs>
  8. <data name="output1" format="tabular"/>
  9. </outputs>
  10. <tests>
  11. <test>
  12. <param name="input1" value="blat_coverage_report_test1.txt" ftype="tabular" />
  13. <output name="output1" file="blat_coverage_report_test1.out" />
  14. </test>
  15. </tests>
  16. <help>
  17. .. class:: warningmark
  18. **IMPORTANT**. Only works for BLAT **standard** or **pslx** output formats (hint: to output pslx format, add **-out=pslx** in the command).
  19. -----
  20. **What it does**
  21. The tool will generate a table of 6 columns as following:
  22. - 1st column: chromosome id.
  23. - 2nd column: chromosome location.
  24. - 3rd column: the nucleotide from reference genome at the chromosome location (2nd column).
  25. - 4th column: total coverage of the reads (number of reads that were mapped to the chromosome location).
  26. - 5th column: percentage of reads that support nucleotide **A** at this location.
  27. - 6th column: percentage of reads that support nucleotide **T** at this location.
  28. - 7th column: percentage of reads that support nucleotide **C** at this location.
  29. - 8th column: percentage of reads that support nucleotide **G** at this location.
  30. -----
  31. **Example**
  32. - The BLAT pslx results look like the following (tab separated with sequence at the end)::
  33. 30 0 0 0 0 0 0 0 + seq0 30 0 30 chr 4639675 4549207 4549237 1 30, 0, 4549207, cggacagcgccgccaccaacaaagccacca, cggacagcgccgccaccaacaaagccacca,
  34. 30 0 0 0 0 0 0 0 + seq1 30 0 30 chr 4639675 614777 614807 1 30, 0, 614777, aaaacaccggatgctccggcgctggcagat, aaaacaccggatgctccggcgctggcagat,
  35. 28 1 0 0 0 0 0 0 + seq2 30 0 29 chr 4639675 3289283 3289312 1 29, 0, 3289283, tttgcttttagtacaccggattcagaacc, tttgctttcagtacaccggattcagaacc,
  36. 30 0 0 0 0 0 0 0 + seq4 30 0 30 chr 4639675 2665584 2665614 1 30, 0, 2665584, cacgctacgtgcgcccccgcccagaaggcg, cacgctacgtgcgcccccgcccagaaggcg,
  37. The 14th column is the chromosome id, and the 16th and 17th columns shows the reads were mapped to chromosome start and end locations.
  38. - The report showed overall coverage of reads on each chromosome location (partial result)::
  39. +-------+----------+------+------+--------+------+--------+------+
  40. | title | location | ref. | cov. | A | T | C | G |
  41. +-------+----------+------+------+--------+------+--------+------+
  42. | chr | 614777 | A | 1 | A(100) | T(0) | C(0) | G(0) |
  43. | chr | 614778 | A | 1 | A(100) | T(0) | C(0) | G(0) |
  44. | chr | 614779 | A | 1 | A(100) | T(0) | C(0) | G(0) |
  45. +-------+----------+------+------+--------+------+--------+------+
  46. -----
  47. **Reference**
  48. **BLAT**: Kent, W James, BLAT--the BLAST-like alignment tool. (2002) Genome Research:12(4) 656-664.
  49. </help>
  50. </tool>