PageRenderTime 36ms CodeModel.GetById 20ms RepoModel.GetById 0ms app.codeStats 0ms

/tools/ilmn_pacbio/smrtpipe_hybrid.xml

https://bitbucket.org/cistrome/cistrome-harvard/
XML | 59 lines | 47 code | 9 blank | 3 comment | 0 complexity | 586189e7fad02b485dba12cb6e51a687 MD5 | raw file
  1. <tool id="smrtpipe_hybrid" name="AHA" version="1.0.0">
  2. <description>Assemble contigs from a set of contigs and PacBio reads.</description>
  3. <command interpreter="python">
  4. smrtpipe_galaxy.py --nproc=24 --dat_extension=fasta --output=data/scaffold.fasta --galaxy_output=${outfile} ${iniFile}
  5. </command>
  6. <!--
  7. <command>cp ${iniFile} ${outfile}</command>
  8. -->
  9. <inputs>
  10. <param name="contigs" format="fasta" type="data" label="Starting Contigs"/>
  11. <param name="reads" format="fasta" type="data" label="PacBio Reads"/>
  12. <param name="schedule" type="text" value="6,3,75;6,3,75;5,3,75;5,3,75;6,2,75;6,2,75;5,2,75;5,2,75" label="Parameter Schedule" size="60"/>
  13. </inputs>
  14. <configfiles>
  15. <configfile name="iniFile">
  16. [input]
  17. assembled_contigs:${contigs}
  18. file:${reads}
  19. [HybridAssembly]
  20. instrumentModel=RS
  21. cleanup=False
  22. untangler=pacbio
  23. #set $schedule2 = $schedule.replace('X',';')
  24. paramSchedule=${schedule2}
  25. dontFillin=False
  26. longReadsAsStrobe=True
  27. exactQueryIds=True
  28. rm4Opts=-minMatch 7 -minFrac 0.1 -minPctIdentity 65 -bestn 10 -noSplitSubreads
  29. numberProcesses=16
  30. cluster=False
  31. minRepeatLength=100000
  32. </configfile>
  33. </configfiles>
  34. <outputs>
  35. <data name="outfile" format="fasta" label="Hybrid assembly contigs from ${on_string}"/>
  36. </outputs>
  37. <help>
  38. **What it does**
  39. The AHA assembly algorithm is an AMOS_-based pipeline
  40. for finishing bacterial-sized
  41. genomes using draft contigs and PacBio reads.
  42. .. _AMOS: http://sourceforge.net/apps/mediawiki/amos
  43. **Parameter list**
  44. Parameter schedule
  45. The parameter schedule is a semi-colon delimited list of triples. Each triple represents an iteration of hybrid assembly (alignment/scaffolding/gap-filling). The three paremeters for each iteration are the Z-score, number of reads required to define a link, and the minimum length of subreads used in links.
  46. **Output**
  47. FASTA file containing scaffolded and gap-filled contigs resulting from the
  48. hybrid assembly.
  49. </help>
  50. </tool>