/tools/metag_tools/short_reads_figure_high_quality_length.xml

https://bitbucket.org/cistrome/cistrome-harvard/ · XML · 75 lines · 51 code · 24 blank · 0 comment · 0 complexity · d49253b2321a3a72089a25ccc3af9e9d MD5

<tool id="hist_high_quality_score" name="Histogram">
  <description> of high quality score reads </description>
  <command interpreter="python">short_reads_figure_high_quality_length.py $input1 $output1 $input2</command>
  <inputs>
    <page>
      <param name="input1" type="data" format="qualsolexa,qual454,txtseq.zip" label="Quality score file" help="No dataset? Read tip below"/>
      <param name="input2" type="integer" size="5" value="20" label="Quality score threshold" />
    </page>
  </inputs>
  <outputs>
    <data name="output1" format="pdf" />
  </outputs>
  <requirements>
    <requirement type="python-module">rpy</requirement>
  </requirements>
  <tests>
    <test>
      <param name="input1" value="solexa.qual" ftype="qualsolexa" />
      <param name="input2" value="5" />
      <output name="output1" file="solexa_high_quality_hist.pdf" ftype="pdf"/>
    </test>
    <test>
      <param name="input1" value="454.qual" ftype="qual454" />
      <param name="input2" value="5" />
      <output name="output1" file="454_high_quality_hist.pdf" ftype="pdf"/>
    </test>
  </tests>
<help>

.. class:: warningmark

To use this tool, your dataset needs to be in the *Quality Score* format. Click the pencil icon next to your dataset to set the datatype to *Quality Score* (see below for examples).

-----

**What it does**

This tool takes quality score files generated by Roche (454), Illumina (Solexa), or ABI SOLiD machines and builds a histogram of the lengths of high-quality segments within the reads, that is, runs of consecutive bases that pass the quality score threshold.

-----

**Examples of Quality Data**

Roche (454) or ABI SOLiD data::

    &gt;seq1
    23 33 34 25 28 28 28 32 23 34 27 4 28 28 31 21 28

Illumina (Solexa) data::

    -40 -40 40 -40 -40 -40 -40 40

-----

**Note**

- Quality score data::

    &gt;seq1
    23 33 34 25 28 28 28 32 23 34 27 4 28 28 31 21 28

- If the threshold is set to 20:

  - The low quality score 4 in the middle splits the read into two high-quality segments of lengths 11 and 5.

  - The histogram is built from these segment lengths (11, 5).

- For Illumina (Solexa) data, only the maximum of the 4 values given for each base is used.

</help>
</tool>
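<!--
The segmentation rule described in the Note of the help text can be sketched
as follows. This is a minimal illustration, not the code of
short_reads_figure_high_quality_length.py: the function names are
hypothetical, and treating a score equal to the threshold as high quality is
an assumption that the help text does not settle.

```python
def high_quality_segment_lengths(scores, threshold):
    """Return the lengths of maximal runs of scores that pass the threshold."""
    lengths = []
    run = 0
    for score in scores:
        # Assumption: a score equal to the threshold counts as high quality.
        if score >= threshold:
            run += 1
        else:
            if run:
                lengths.append(run)
            run = 0
    if run:
        lengths.append(run)
    return lengths


def collapse_solexa(values, group_size=4):
    """Keep only the maximum of each consecutive group of four values."""
    return [max(values[i:i + group_size])
            for i in range(0, len(values), group_size)]


# The seq1 example from the help text, with the threshold set to 20.
seq1 = [23, 33, 34, 25, 28, 28, 28, 32, 23, 34, 27, 4, 28, 28, 31, 21, 28]
print(high_quality_segment_lengths(seq1, 20))  # [11, 5]

# The Illumina (Solexa) example: two groups of four values.
solexa = [-40, -40, 40, -40, -40, -40, -40, 40]
print(collapse_solexa(solexa))  # [40, 40]
```

The lengths collected this way over all reads would be the numbers that the
histogram is drawn from.
-->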