Tarea Seminario 6

Download as pdf or txt
Download as pdf or txt
You are on page 1of 6

Carla Merino Ruiz

71312422W

Nombre del organismo: Bradyrhizobium diazoefficiens USDA 110

Identificación del gen asignado (tamaño): blr0013 (534pb)

Proteína codificada (tamaño): putative transposase (177aa)

Secuencia del gen en formato FASTA

>bja:blr0013 no KO assigned | (GenBank) blr0013; ORF_ID:blr0013;


putative transposase (N)
atggaagcgcgcggctgtgaacacgccgacaagctgcgtcagctcgagttgactctggag
caggcacgctacgaggcaactcgcgctcctcggcgatacgaggctgtcgattcggacaat
cgccttgttgccggcgagctggagcggcgctggaacgaacgcctggtcgctgtccgcgag
cttgagggcaagcgcgacacgttgttggctacgccggagatgacgttgagcgacatcgat
cgcgatcggttacttgcgcttgattccgaactcaagagagcttgggaaagtccgggtgca
acagctgcgacgcgcaacaggatcatccgaagcttcatcaatgagatcgttgtgcgcatg
cgcgacgaggtgctggacttgattgtccactggcatggcggcgatcacacggcattgcag
gtgaggaggaaccgcaccggcgagcaccgcggcggatgtggtcgatctcgttcgtgtcct
cgcgcgtcagatgcccgacagcatcatcgccgcggttctcaaccgcgccagtag

Secuencia de la proteina en formato FASTA

>bja:blr0013 no KO assigned | (GenBank) blr0013; ORF_ID:blr0013;


putative transposase (A)

MEARGCEHADKLRQLELTLEQARYEATRAPRRYEAVDSDNRLVAGELERRWNERLVAVRE

LEGKRDTLLATPEMTLSDIDRDRLLALDSELKRAWESPGATAATRNRIIRSFINEIVVRM

RDEVLDLIVHWHGGDHTALQVRRNRTGEHRGGCGRSRSCPRASDARQHHRRGSQPRQ
Alineamiento de la secuencia de nucleótidos con la primera secuencia
de menos del 100% de identidad en BLAST de ácidos nucleicos
Query 1 ATGGAAGCGCGCGGCTGTGAACACGCCGACAAGCTGCGTCAGCTCGAGTTGACTCTGGAG 60
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 12282 ATGGAAGCGCGCGGCTGTGAACACGCCGACAAGCTGCGTCAGCTCGAGTTGACTCTGGAG 12341

Query 61 CAGGCACGCTACGAGGCAACTCGCGCTCCTCGGCGATACGAGGCTGTCGATTCGGACAAT 120


||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 12342 CAGGCACGCTACGAGGCAACTCGCGCTCCTCGGCGATACGAGGCTGTCGATTCGGACAAT 12401

Query 121 CGCCTTGTTGCCGGCGAGCTGGAGCGGCGCTGGAACGAACGCCTGGTCGCTGTCCGCGAG 180


||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 12402 CGCCTTGTTGCCGGCGAGCTGGAGCGGCGCTGGAACGAACGCCTGGTCGCTGTCCGCGAG 12461

Query 181 CTTGAGGGCAAGCGCGACACGTTGTTGGCTACGCCGGAGATGACGTTGAGCGACATCGAT 240


||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 12462 CTTGAGGGCAAGCGCGACACGTTGTTGGCTACGCCGGAGATGACGTTGAGCGACATCGAT 12521

Query 241 CGCGATCGGTTACTTGCGCTTGATTCCGAACTCAAGAGAGCTTGGGAAAGTCCGGGTGCA 300


||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 12522 CGCGATCGGTTACTTGCGCTTGATTCCGAACTCAAGAGAGCTTGGGAAAGTCCGGGTGCA 12581

Query 301 ACAGCTGCGACGCGCAACAGGATCATCCGAAGCTTCATCAATGAGATCGTTGTGCGCATG 360


||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 12582 ACAGCTGCGACGCGCAACAGGATCATCCGAAGCTTCATCAATGAGATCGTTGTGCGCATG 12641

Query 361 CGCGACGAGGTGCTGGACTTGATTGTCCACTGGCATGGCGGCGATCACACGGCATTGCAG 420


||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 12642 CGCGACGAGGTGCTGGACTTGATTGTCCACTGGCATGGCGGCGATCACACGGCATTGCAG 12701

Query 421 GTGAGGAGGAACCGCACCGGCGAGCACCGCGGCGGATGTGGTCGATCTCGTTCGTGTCCT 480


||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 12702 GTGAGGAGGAACCGCACCGGCGAGCACCGCGGCGGATGTGGTCGATCTCGTTCGTGTCCT 12761

Query 481 CGCGCGTCAGATGCCCGACAGCATCATCGCCGCGGTTCTCAACCGCGCCAGTAG 534


||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 12762 CGCGCGTCAGATGCCCGACAGCATCATCGCCGCGGTTCTCAACCGCGCCAGTAG 12815
Búsqueda de homologías de la secuencia de aminoácidos con otras
proteinas depositadas en la base de datos

Proteina 1

Nombre de la proteina y microorganismo: transposase [Bradyrhizobium]

Longitud de la proteina: 177aa

Porcentaje de identidad con nuestra proteina: 100%

Número de acceso: WP_011082845.1

Proteina 2

Nombre de la proteina y microorganismo: recombinase zinc beta ribbon domain-containing


protein Bradyrhizobium

Longitud de la proteina: 327 aa

Porcentaje de identidad con nuestra proteina:90%

Número de acceso: WP_247831379.1

Proteina 3

Nombre de la proteina y microorganismo: recombinase family protein Bradyrhizobium sp.


180

Longitud de la proteina: 689 aa

Porcentaje de identidad con nuestra proteina: 72%

Número de acceso: MCK1493538.1

Proteina 4

Nombre de la proteina y microorganismo: helix-turn-helix domain-containing protein,


partial Mesorhizobium silamurunense

Longitud de la proteina: 362 aa

Porcentaje de identidad con nuestra proteina: 61,18%

Número de acceso: WP_192257211.1


Secuencias en formato FASTA de las proteinas 1 a 4 en orden:

>WP_011082845.1
MEARGCEHADKLRQLELTLEQARYEATRAPRRYEAVDSDNRLVAGELERRWNERLVAVRELEGKRDTLLA
TPEMTLSDIDRDRLLALDSELKRAWESPGATAATRNRIIRSFINEIVVRMRDEVLDLIVHWHGGDHTALQ
VRRNRTGEHRGGCGRSRSCPRASDARQHHRRGSQPRQ

>WP_247831379.1
MIKDHHEGYITWAEFERNQRLITDNANGKSFMSRGSVRRGEGLLAGLLRCGHCGRKLHVAYNGTHSTVGR
YHCRGSQINHGGDPCISFGRLRADRAISAEVIARLQPLGVQAAVAVMEARGREHADKLRQLELTLEQARY
EATRARRRYEAVDPDNRLVAGELERRWNERLVAVRELEGERETLLATPEMTLSDIDRDRLLALGADLERA
WESPGATAATRKRIIRSLIHEIVVCIRDEVLDLIVHWHGGDHTALQVRKNRTGEHRWSTAADVVDLVRVL
ARQMPDSTIAAVLNRASKSTGRGNCVQITNDPQRATGIVLAMRSGKS

>MCK1493538.1
MSKITPEHLARQAVVYVRQSTADQVINNRESQRRQYGLADRARQLGWNEVVVIDDDLGRSGGGTARPGFE
KLLAAICEGRVGAVVSIEASRLARNGRDWHTLLEFCGLVGTLIVDEDGVYDPRHPNDRLLLGMKGTMSEM
ELSIFRQRSLEALKQKAHRGELFLNVAIGYLKVSHDRIEKDPDRRIKEALALVFTKFAEMQTLRQVHLWL
RQERITLPAVSHGPEGRHVEWKLPVYNTIYHILTNPIYAGAYAFGRSGSRVTIEAGRKRIVRGFRRERSD
WEVLIKDHHEGYITWAEFERNQRLITDNANGKSFMSRGSVRCGEALLAGLLRCGHCGRKLHVAYSGTHST
VGRYHCRGSQINHGGDPCISFGGLRVDRAISAEVIARLQPLGVKAALAAMEARGREHAEKLRQLELALEQ
ARYEATRARRHYEAVDPDHRLVAGELERRWNERLLAVRALEDERGAFLAKPETTLREGDRERLLALGSDL
ERAWNSTGATPATRKRIIRTVIREIVVRIHDEAIELVIHWQGGDHTALKTRKNRTGQHRWRTSADVIDLV
RVLARQMPDNTIAAVLNRAGKSTGRGNSWTRARVCHLRNQQAIAPYRECERAERGEVTLDEAAAALKVSP
STVRRLIAEQSLPAHQLCKGAPWVIKALDLEHPEVKKAAHARRFRRPSSGDLRQRELEL

>WP_192257211.1
AGLLRCGHCGRKLHVAYSGENGSSGRYHCRGGQLNHGGAPCISFGGMRIDRAIGAEVIERLQPFGVEAAI
NAVEARRIENAEKRRQIELALEQARYEAALARRRYEAVDPNNRLVAAELEHRWNERLLAARALEDERNVL
AAAPQSSLSATERDRLLALGADVERAWNSSGAPPATRKRIIRTLIDEIVVRIEEDALNLVIRWQGGDHTP
LRVRKNRAGQHRWGTDADVVELVAVLARQMPDQAIAAVLNRAGKKTGKGNGWTRSRVCFLRNHRRIPPYR
EGERAERGEVTLEETAKILNVSEATVRRMIQEKLLAARQYCKGAPWVIQNRDLDREDLQRIADARRSRRT
PSEDPRQNSLAL
Clustal omega

WP_192257211.1 ------------------------------------------------------------ 0

WP_011082845.1 ------------------------------------------------------------ 0

WP_247831379.1 ------------------------------------------------------------ 0

MCK1493538.1 MSKITPEHLARQAVVYVRQSTADQVINNRESQRRQYGLADRARQLGWNEVVVIDDDLGRS 60

WP_192257211.1 ------------------------------------------------------------ 0

WP_011082845.1 ------------------------------------------------------------ 0

WP_247831379.1 ------------------------------------------------------------ 0

MCK1493538.1 GGGTARPGFEKLLAAICEGRVGAVVSIEASRLARNGRDWHTLLEFCGLVGTLIVDEDGVY 120

WP_192257211.1 ------------------------------------------------------------ 0

WP_011082845.1 ------------------------------------------------------------ 0

WP_247831379.1 ------------------------------------------------------------ 0

MCK1493538.1 DPRHPNDRLLLGMKGTMSEMELSIFRQRSLEALKQKAHRGELFLNVAIGYLKVSHDRIEK 180

WP_192257211.1 ------------------------------------------------------------ 0

WP_011082845.1 ------------------------------------------------------------ 0

WP_247831379.1 ------------------------------------------------------------ 0

MCK1493538.1 DPDRRIKEALALVFTKFAEMQTLRQVHLWLRQERITLPAVSHGPEGRHVEWKLPVYNTIY 240

WP_192257211.1 ------------------------------------------------------------ 0

WP_011082845.1 ------------------------------------------------------------ 0

WP_247831379.1 -------------------------------------------MIKDHHEGYITWAEFER 17

MCK1493538.1 HILTNPIYAGAYAFGRSGSRVTIEAGRKRIVRGFRRERSDWEVLIKDHHEGYITWAEFER 300

WP_192257211.1 ---------------------------AGLLRCGHCGRKLHVAYSGENGSSGRYHCRGGQ 33

WP_011082845.1 ------------------------------------------------------------ 0

WP_247831379.1 NQRLITDNANGKSFMSRGSVRRGEGLLAGLLRCGHCGRKLHVAYNGTHSTVGRYHCRGSQ 77

MCK1493538.1 NQRLITDNANGKSFMSRGSVRCGEALLAGLLRCGHCGRKLHVAYSGTHSTVGRYHCRGSQ 360


WP_192257211.1 LNHGGAPCISFGGMRIDRAIGAEVIERLQPFGVEAAINAVEARRIENAEKRRQIELALEQ 93

WP_011082845.1 ---------------------------------------MEARGCEHADKLRQLELTLEQ 21

WP_247831379.1 INHGGDPCISFGRLRADRAISAEVIARLQPLGVQAAVAVMEARGREHADKLRQLELTLEQ 137

MCK1493538.1 INHGGDPCISFGGLRVDRAISAEVIARLQPLGVKAALAAMEARGREHAEKLRQLELALEQ 420

:*** *:*:* **:**:***

WP_192257211.1 ARYEAALARRRYEAVDPNNRLVAAELEHRWNERLLAARALEDERNVLAAAPQSSLSATER 153

WP_011082845.1 ARYEATRAPRRYEAVDSDNRLVAGELERRWNERLVAVRELEGKRDTLLATPEMTLSDIDR 81

WP_247831379.1 ARYEATRARRRYEAVDPDNRLVAGELERRWNERLVAVRELEGERETLLATPEMTLSDIDR 197

MCK1493538.1 ARYEATRARRHYEAVDPDHRLVAGELERRWNERLLAVRALEDERGAFLAKPETTLREGDR 480

*****: * *:***** ::****.***:******:*.* **.:* .: * *: :* :*

WP_192257211.1 DRLLALGADVERAWNSSGAPPATRKRIIRTLIDEIVVRIEEDALNLVIRWQGGDHTPLRV 213

WP_011082845.1 DRLLALDSELKRAWESPGATAATRNRIIRSFINEIVVRMRDEVLDLIVHWHGGDHTALQV 141

WP_247831379.1 DRLLALGADLERAWESPGATAATRKRIIRSLIHEIVVCIRDEVLDLIVHWHGGDHTALQV 257

MCK1493538.1 ERLLALGSDLERAWNSTGATPATRKRIIRTVIREIVVRIHDEAIELVIHWQGGDHTALKT 540

:*****.::::***:* ** ***:****:.* **** :.::.::*:::*:***** *:.

WP_192257211.1 RKNRAGQHRWGTDADVVELVAVLARQMPDQAIAAVLNRAGKKTGKGNGWTRSRVCFLRNH 273

WP_011082845.1 RRNRTGEHRGGCGRSRS---------CPRASDARQHHRRGSQPRQ--------------- 177

WP_247831379.1 RKNRTGEHRWSTAADVVDLVRVLARQMPDSTIAAVLNRASKSTGRGNCVQITNDPQRATG 317

MCK1493538.1 RKNRTGQHRWRTSADVIDLVRVLARQMPDNTIAAVLNRAGKSTGRGNSWTRARVCHLRNQ 600

*:**:*:** . * : * :* ... :

WP_192257211.1 RRIPPYREGERAERGEVTLEETAKILNVSEATVRRMIQEKLLAARQYCKGAPWVIQNRDL 333

WP_011082845.1 ------------------------------------------------------------ 177

WP_247831379.1 -IVLAMRSGKS------------------------------------------------- 327

MCK1493538.1 QAIAPYRECERAERGEVTLDEAAAALKVSPSTVRRLIAEQSLPAHQLCKGAPWVIKALDL 660

WP_192257211.1 DREDLQRIADARRSRRTPSEDPRQNSLAL 362

WP_011082845.1 ----------------------------- 177

WP_247831379.1 ----------------------------- 327

MCK1493538.1 EHPEVKKAAHARRFRRPSSGDLRQRELEL 689

You might also like