一边亲胸一边摸进内裤里,黑人巨摘花第一次出血,高压开关机械特性测试仪

TransDecoder能夠從轉(zhuǎn)錄本序列中鑒定候選編碼區(qū)。這些轉(zhuǎn)錄本序列可以來(lái)自于Trinity的從頭組裝，或者來(lái)自于Cufflinks或者StringTie的組裝結(jié)果。

軟件安裝

從https://github.com/TransDecoder/TransDecoder/releases下載最新版的TransDecoder，以v5.5.0為例

mkdir -p ~/opt/biosoft && cd ~/opt/biosoft
wget https://github.com/TransDecoder/TransDecoder/archive/TransDecoder-v5.5.0.zip
unzip TransDecoder-v5.5.0.zip
mv TransDecoder-TransDecoder-v5.5.0 TransDecoder-v5.5.0

運(yùn)行TransDecoder

我們從cufflinks或stringtie輸出的gtf文件開(kāi)始分析流程，因此你會(huì)有兩個(gè)輸入文件

transcripts.gtf: 記錄預(yù)測(cè)轉(zhuǎn)錄本的GTF文件
genome.fasta: 參考基因組序列

第一步: 從GTF文件中提取FASTA序列

~/opt/biosoft/TransDecoder-v5.5.0/util/gtf_genome_to_cdna_fasta.pl transcripts.gtf genome.fasta > transcripts.fasta

第二步: 將GTF文件轉(zhuǎn)成GFF3格式

~/opt/biosoft/TransDecoder-v5.5.0/util/gtf_to_alignment_gff3.pl transcripts.gtf > transcripts.gff3

第三步: 預(yù)測(cè)轉(zhuǎn)錄本中長(zhǎng)的開(kāi)放閱讀框, 默認(rèn)是100個(gè)氨基酸，可以用-m修改

~/opt/biosoft/TransDecoder-v5.5.0/TransDecoder.LongOrfs -t transcripts.fasta

第四步: 使用DIAMOND對(duì)上一步輸出的transcripts.fasta.transdecoder.pep在蛋白數(shù)據(jù)庫(kù)中進(jìn)行搜索，尋找同源證據(jù)支持

# 下載數(shù)據(jù)并解壓縮
wget ftp://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/complete/uniprot_sprot.fasta.gz
gunzip uniprot_sprot.fasta.gz
# 建立索引
diamond makedb --in uniprot_sprot.fasta --db uniprot_sprot.fasta
# BLASTP比對(duì)
diamond blastp -d uniprot_sprot.fasta -q transcripts.fasta.transdecoder_dir/longest_orfs.pep --evalue 1e-5 --max-target-seqs 1 > blastp.outfmt6

關(guān)于DIAMOND的使用，參考這篇說(shuō)明DIAMOND: 超快的蛋白序列比對(duì)軟件

第五步: 預(yù)測(cè)可能的編碼區(qū)

~/opt/biosoft/TransDecoder-v5.5.0/TransDecoder.Predict \
    -t transcripts.fasta \
    --retain_blastp_hits blastp.outfmt6

第六步: 生成基于參考基因組的編碼區(qū)注釋文件

~/opt/biosoft/TransDecoder-v5.5.0/util/cdna_alignment_orf_to_genome_orf.pl \
     transcripts.fasta.transdecoder.gff3 \
     transcripts.gff3 \
     transcripts.fasta > transcripts.fasta.transdecoder.genome.gff3

最終輸出文件如下:

transcripts.fasta.transdecoder.pep: 最終預(yù)測(cè)的CDS對(duì)應(yīng)的蛋白序列
transcripts.fasta.transdecoder.cds: 最終預(yù)測(cè)的CDS序列
transcripts.fasta.transdecoder.gff3: 最終ORF對(duì)應(yīng)的GFF3
transcripts.fasta.transdecoder.bed: 以BED格式存放ORF位置信息
transcripts.fasta.transdecoder.genome.gff3: 基于參考基因組的GFF3文件

其中BED和GFF3可以放到IGV上展示，手動(dòng)檢查下結(jié)果

假如是Trinity從頭預(yù)測(cè)的轉(zhuǎn)錄本，沒(méi)有參考基因組，那么就運(yùn)行第三步，第四步和第五步

參考資料

https://github.com/TransDecoder/TransDecoder/wiki

三个男躁一个女,国精产品一区一手机的秘密,麦子交换系列最经典十句话,欧美国产综合欧美视频

使用TransDecoder尋找轉(zhuǎn)錄本中的編碼區(qū)

使用TransDecoder尋找轉(zhuǎn)錄本中的編碼區(qū)

軟件安裝

運(yùn)行TransDecoder

參考資料

三个男躁一个女,国精产品一区一手机的秘密,麦子交换系列最经典十句话,欧美 国产 综合 欧美 视频

使用TransDecoder尋找轉(zhuǎn)錄本中的編碼區(qū)

軟件安裝

運(yùn)行TransDecoder

參考資料

三个男躁一个女,国精产品一区一手机的秘密,麦子交换系列最经典十句话,欧美国产综合欧美视频