BIANA: Biologic Interactions and Network Analysis
Copyright (C) 2009 Javier Garcia-Garcia, Emre Guney, Baldo Oliva
This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by the
Free Software Foundation, either version 3 of the License, or (at your
option) any later version.
This program is distributed in the hope that it will be useful, but
WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General
Public License for more details.
You should have received a copy of the GNU General Public License
along with this program. If not, see
<http://www.gnu.org/licenses/>.
get_clustalw_alignment(sequencesList)
Executes ClustalW to compute the multiple sequence alignment of the
proteins in the list.

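As a rough illustration of what a wrapper of this kind might do, the sketch below writes sequences to a FASTA file and assembles a ClustalW command line. It is not BIANA's implementation: the helper names `write_fasta` and `build_clustalw_command`, the `(identifier, sequence)` tuple input, and the `clustalw` executable name are assumptions made for the example. The command is only constructed, never run.

```python
import os
import tempfile


def write_fasta(sequences, path, width=60):
    """Write (identifier, sequence) pairs to `path` in FASTA format."""
    with open(path, "w") as handle:
        for identifier, sequence in sequences:
            handle.write(">%s\n" % identifier)
            # Wrap residues at a fixed width, as is conventional for FASTA.
            for start in range(0, len(sequence), width):
                handle.write(sequence[start:start + width] + "\n")


def build_clustalw_command(fasta_path, alignment_path, executable="clustalw"):
    """Return the argument list for a ClustalW multiple alignment run.

    The executable name is an assumption; installations may call it
    `clustalw2` or use a full path instead.
    """
    return [
        executable,
        "-INFILE=" + fasta_path,
        "-OUTFILE=" + alignment_path,
        "-ALIGN",
    ]


# Hypothetical input: two short, made-up protein sequences.
workdir = tempfile.mkdtemp()
fasta_path = os.path.join(workdir, "input.fasta")
write_fasta([("protA", "MKTAYIAKQR"), ("protB", "MKTAHIAKQK")], fasta_path)
command = build_clustalw_command(fasta_path, os.path.join(workdir, "out.aln"))
```

On a machine where ClustalW is installed, the resulting list could be passed to `subprocess.run(command, check=True)`.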
get_cd_hit_clusters(fasta_file, output_path='./', sequence_identity_threshold=0.95)
Executes CD-HIT on the sequences FASTA file and saves the results
in output_path.

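The sketch below shows one way a CD-HIT run could be assembled and its cluster file read back. It is an assumption-laden illustration, not the BIANA source: `build_cd_hit_command` and `parse_clstr` are hypothetical helpers, and the sample cluster text is invented. The `-i`, `-o` and `-c` options are standard CD-HIT flags, and CD-HIT writes its clusters to a file named after the output file with a `.clstr` suffix.

```python
import re


def build_cd_hit_command(fasta_file, output_file, sequence_identity_threshold=0.95):
    """Return the argument list for a CD-HIT run (built here, not executed)."""
    return [
        "cd-hit",
        "-i", fasta_file,
        "-o", output_file,
        "-c", str(sequence_identity_threshold),
    ]


def parse_clstr(lines):
    """Group sequence identifiers by cluster from CD-HIT .clstr lines."""
    clusters = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">Cluster"):
            # A header line opens a new cluster.
            clusters.append([])
        else:
            # Member lines look like: "0<TAB>120aa, >seq1... *"
            match = re.search(r">(.+?)\.\.\.", line)
            if match and clusters:
                clusters[-1].append(match.group(1))
    return clusters


# Invented example of .clstr content, for illustration only.
sample = """>Cluster 0
0\t120aa, >seq1... *
1\t118aa, >seq2... at 96.61%
>Cluster 1
0\t87aa, >seq3... *
"""

command = build_cd_hit_command("proteins.fasta", "./clusters")
clusters = parse_clstr(sample.splitlines())
```

Returning clusters as plain lists of identifiers keeps the result easy to pass to a later per-cluster comparison step.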
blast_cd_hit_clusters(cd_hit_clusters_file, output_fd, dbaccess,
                      length_blast_db=None, effective_length_space_search=None)
Performs a BLAST between all the proteins belonging to the same
CD-HIT cluster.

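Comparing all proteins within a cluster amounts to enumerating the unordered pairs inside each cluster and never pairing members of different clusters. The sketch below illustrates that enumeration only. The helper `within_cluster_pairs` and the example identifiers are hypothetical, and this page does not say whether BIANA compares every pair or only each member against a cluster representative.

```python
from itertools import combinations


def within_cluster_pairs(clusters):
    """Yield every unordered pair of members that share a cluster."""
    for cluster in clusters:
        # combinations() never pairs an element with itself and
        # yields each pair once, so singleton clusters produce nothing.
        for first, second in combinations(cluster, 2):
            yield first, second


# Invented clusters of sequence identifiers.
clusters = [["seq1", "seq2", "seq4"], ["seq3"], ["seq5", "seq6"]]
pairs = list(within_cluster_pairs(clusters))
```

Each pair produced this way would correspond to one pairwise comparison, so a cluster of n members yields n(n-1)/2 comparisons.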
blast_sequence(blastDatabase, sequenceObject, temporalOutputPath=None)
Runs a BLAST search of the sequence sequenceObject against the database
"blastDatabase".

_calculate_similarities(dbaccess, length_blast_db=None,
                        effective_length_space_search=None, sequenceID_list=[],
                        fd_output_file=sys.stdout, representant=None)
Calculates the similarity between all proteins in the list
"cluster_sequences".

parse_blastall_output(fd_blastall_output, temporalOutputFile_fd=None,
                      return_only_ids=False, limit_to_sequenceIDs=Set([]))
"fd_blastall_output" is the output file descriptor of the BLAST process
(the input for this method).

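This page does not state which BLAST output format the parser expects. The sketch below assumes the 12-column tabular format (`-m 8` in blastall) and shows how options like `return_only_ids` and `limit_to_sequenceIDs` might behave. Their semantics are guessed from the parameter names, and `parse_blast_tabular`, `limit_to_ids` and the sample hits are hypothetical.

```python
FIELDS = (
    "query", "subject", "identity", "length", "mismatches", "gap_opens",
    "q_start", "q_end", "s_start", "s_end", "evalue", "bit_score",
)


def parse_blast_tabular(lines, return_only_ids=False, limit_to_ids=frozenset()):
    """Parse tabular BLAST hits from an iterable of lines.

    Assumed behaviour: an empty `limit_to_ids` means no filtering;
    `return_only_ids` returns subject identifiers instead of full hits.
    """
    results = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        columns = line.split("\t")
        if len(columns) != len(FIELDS):
            continue  # skip anything that is not a 12-column hit line
        hit = dict(zip(FIELDS, columns))
        if limit_to_ids and hit["subject"] not in limit_to_ids:
            continue
        if return_only_ids:
            results.append(hit["subject"])
        else:
            # Convert the numeric scores most callers need.
            for key in ("identity", "evalue", "bit_score"):
                hit[key] = float(hit[key])
            results.append(hit)
    return results


# Invented hit lines, for illustration only.
sample = [
    "q1\ts1\t98.50\t200\t3\t0\t1\t200\t1\t200\t1e-110\t395",
    "q1\ts2\t45.00\t180\t99\t2\t5\t184\t10\t189\t2e-30\t120",
]
ids = parse_blast_tabular(sample, return_only_ids=True)
filtered = parse_blast_tabular(sample, limit_to_ids={"s2"})
```

Because the function accepts any iterable of lines, an open file object or a process output stream could be passed in directly.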
parse_bl2seq_output(sequenceID_A, sequenceID_B, bl2seq_output=None, fd_output_file=None)