Overview

Diffuser是一个基于分布式的生物序列特征生成网站,因此需要在每个单一节点安装所需的软件:ViennaRNA-2.4.8VSL2psipredspineX。本文以deepamp节点为例,分别安装和配置以上4个软件。

1. 安装ViennaRNA-2.4.8

该软件用于预测RNA序列结构,安装步骤如下:

wget https://www.tbi.univie.ac.at/RNA/download/sourcecode/2_4_x/ViennaRNA-2.4.8.tar.gz
tar -zxvf ViennaRNA-2.4.8.tar.gz
cd ViennaRNA-2.4.8
./configure
make
sudo make install

2. 安装VSL2

该软件用于生成预测的蛋白质序列的disorder information,只需下载后解压,无需安装:

wget http://www.dabi.temple.edu/disprot/download/VSL2.tar.gz
tar -xvf VSL2.tar.gz

3. 安装psipred

该软件用于预测Protein序列的二级结构,安装步骤如下:

wget http://bioinfadmin.cs.ucl.ac.uk/downloads/psipred/psipred.4.02.tar.gz
tar -xvf psipred.4.02.tar.gz

在解压成功后,还需进行简单的配置。首先,由于该软件部分代码需要tcsh支持,因此我们需要先进行安装:

sudo apt-get install tcsh

其次,该软件当中调用了blast,因此我们需要指定blast的安装路径和数据库路径,这里blast的我们用的是2.2.26版本而数据库我们使用的是uniref50。配置如下:

cd psipred
vim runpsipred

将下面的代码:

# The name of the BLAST data bank
set dbname = uniref90

# Where the NCBI programs have been installed
set ncbidir = /usr/local/bin

# Where the PSIPRED V4 programs have been installed
set execdir = ./bin

# Where the PSIPRED V4 data files have been installed
set datadir = ./data

改为:

# The name of the BLAST data bank
set dbname = /feagen/uniref/uniref50/uniref50

# Where the NCBI programs have been installed
set ncbidir = /var/www/cgi-bin/blast-2.2.26/bin

# Where the PSIPRED V4 programs have been installed
set execdir = /var/www/cgi-bin/psipred/bin

# Where the PSIPRED V4 data files have been installed
set datadir = /var/www/cgi-bin/psipred/data

4. 安装spineX

spineX的压缩包可以到http://sparks.informatics.iupui.edu/上下载,下载完成后,解压并进行如下配置:

tar -zxvf spineXpublic.tgz
cd spineXpublic
vim spX.pl

到指定行修改脚本文件,89行,指定blast安装路径:

$blastdir = '/var/www/cgi-bin-diffuser/blast-2.2.26';

108行,修改数据库路径:

system("$blastdir/bin/blastpgp -d /feagen/uniref/uniref50/uniref50 -j 3 -i $infl -Q $tmprf -a 4 > $workdir/_tmp2.$irnd")==0 or die " Aborting(spineX): Can't do psiblast\n";

128行,删除缓存文件:

system("rm -r $workdir")==0 or print "Couldn't remove temp. dir. $workdir\n";