從fasta中提取或者過濾掉多個序列

本文轉載自查看原文 2018-03-21 19:19 1543 工具

Google了一下，現成的工具不多。

自己寫代碼也可以，就是速度肯定不快，而且每次寫也很麻煩。

偶然看到QIIME的filter_fasta.py有這個功能，從name list中提取多個序列。

filter_fasta.py -f extract_no_N_200.fasta -o remain.fasta -s out.list

[REQUIRED]

-f, --input_fasta_fp
Path to the input fasta file
-o, --output_fasta_fp
The output fasta filepath
[OPTIONAL]

-m, --otu_map
An OTU map where sequences ids are those which should be retained.
-s, --seq_id_fp
A list of sequence identifiers (or tab-delimited lines with a seq identifier in the first field) which should be retained.
-b, --biom_fp
A biom file where otu identifiers should be retained.
-a, --subject_fasta_fp
A fasta file where the seq ids should be retained.
-p, --seq_id_prefix
Keep seqs where seq_id starts with this prefix.
--sample_id_fp
Keep seqs where seq_id starts with a sample id listed in this file. Must be newline delimited and may not contain a header.
-n, --negate
Discard passed seq ids rather than keep passed seq ids. [default: False]
--mapping_fp
Mapping file path (for use with –valid_states). [default: None]
--valid_states
Description of sample ids to retain (for use with –mapping_fp). [default: None]

60w條序列瞬間就處理完了。　　

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 FastJson過濾掉不需要的返回字段 json過濾掉不需要的字段過濾掉map集合中key或value為空的值過濾掉Abp框架不需要記錄的日志 jmeter，測登錄，要不要過濾掉JS,CSS等請求？感覺過濾掉了壓出來的數據就不真實? Mybatis自動生成的BO對象繼承公共父類(BO中過濾掉公共屬性) 在ionic這個框架下(Angular JS)，對URL進行重寫，過濾掉URL中的#號 MySQL 大批量插入，如何過濾掉重復數據？ js方法傳遞包含反斜杠\的參數時，會把\過濾掉 Swift - 從相冊中選擇視頻（過濾掉照片，使用UIImagePickerController）