LAST¶
LAST is a tool which finds similar regions in DNA sequences and aligns them.
LAST is available as a module on Apocrita.
Usage¶
To run the default version of LAST, simply load the last
module:
module load last
$ lastdb -h
Usage: lastdb [options] output-name fasta-sequence-file(s)
Prepare sequences for subsequent alignment with lastal.
Main Options:
-h, --help: show all options and their default settings, and exit
-p: interpret the sequences as proteins
-R: repeat-marking options (default=10)
-c: soft-mask lowercase letters (in reference and query sequences)
-u: seeding scheme (default: YASS for DNA, else exact-match seeds)
[...]
$ lastal -h
Usage: lastal [options] lastdb-name fasta-sequence-file(s)
Find and align similar sequences.
Cosmetic options:
-h, --help: show all options and their default settings, and exit
-V, --version: show version information, and exit
-v: be verbose: write messages about what lastal is doing
-f: output format: TAB, MAF, BlastTab, BlastTab+ (default=MAF)
[...]
A typical session with LAST involves preparing a database with
lastdb
and then analysing it with lastal
, for example:
lastdb -cR01 humdb humanMito.fa
lastal humdb fuguMito.fa > myalns.maf
Core Usage
To ensure that LAST uses the correct number of cores, the
-P${NSLOTS}
option should be used.
Example job¶
Serial job¶
Here is an example job running on 4 cores and 8GB of memory:
#!/bin/bash
#$ -cwd
#$ -j y
#$ -pe smp 4
#$ -l h_rt=1:0:0
#$ -l h_vmem=2G
module load last
lastdb -P${NSLOTS} -cR01 humdb humanMito.fa
lastal -P${NSLOTS} humdb fuguMito.fa > myalns.maf
When the job completes the output file (myalns.maf
) should
contain data about the sequence alignments. The tutorial
(referenced below) has information about how to interpret
the contents of this file.