AHA === AHA is the hybrid scaffolding approach found in SMRT Analysis. Contigs from a draft assembly generated by a different assembler can be joined using PacBio long reads. The current draft genome size limit for AHA in SMRT Analysis 1.4 is 160 Mbases and 20.000 contigs. On the SMRT Pipe command line fasta files can be used directly for long read coverage. If a mixture of fasta and bas.h5 files is available for PacBio long read coverage it is suggested to export filtered_subreads from the bas.h5 files and combine them with the other fasta file in order to use the following input.xml file. AHA requires a params.xml and an input.xml. params.xml ---------- Here is a params.xml:: 6,3,75;6,3,75;6,2,75;6,2,75 pacbio False True -minMatch 10 -minPctIdentity 70 -bestn 10 \ -noSplitSubreads 4 input.xml --------- The input.xml specifies the contigs and long reads used to scaffold them:: job.sh ------ The job.sh will contain the command to run the AHA hybrid assembly module using SMRT Pipe. :: smrtpipe.py --params=params.xml xml:input.xml Running AHA ----------- First ensure you have setup the SMRT Analysis environment:: source /opt/smrtanalysis/etc/setup.sh Afterwards, now run ``job.sh``:: source job.sh