BLASTX 2.2.17 Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bphyem206i04 (1850 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 5,815,196 sequences; 2,006,227,497 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|NP_001045743.1| Os02g0125100 [Oryza sativa (japonica cultiva... 909 0.0 gb|EAY84290.1| hypothetical protein OsI_005523 [Oryza sativa (in... 890 0.0 ref|NP_567405.1| aconitase family protein / aconitate hydratase ... 796 0.0 gb|EDQ57380.1| predicted protein [Physcomitrella patens subsp. p... 791 0.0 emb|CAB40778.1| putative protein [Arabidopsis thaliana] >gi|7268... 783 0.0 >ref|NP_001045743.1| Os02g0125100 [Oryza sativa (japonica cultivar-group)] dbj|BAD07969.1| putative 3-isopropylmalate dehydratase large subunit [Oryza sativa (japonica cultivar-group)] dbj|BAF07657.1| Os02g0125100 [Oryza sativa (japonica cultivar-group)] Length = 514 Score = 909 bits (2349), Expect = 0.0 Identities = 450/512 (87%), Positives = 471/512 (91%), Gaps = 6/512 (1%) Frame = +3 Query: 102 ALSAVTRVVEQPTAWG-----AAFEMAPTQQ-LRANASLRRARPGXXXXXXXXXXXXXXX 263 ++SA + V + A+ AA +AP+QQ L+ S RRAR G Sbjct: 3 SISAASPVAGKAAAFAHKNELAAAAVAPSQQQLQRRVSGRRARSGRVRAVATPARAPRAP 62 Query: 264 XSTGSVKSAMTMTEKILARASERASLEPGENVWVDVDVLMTHDVCGPGTIGIFEREFGED 443 STGSVKSAMTMTEKILARASERASLEPGENVWVDVDVLMTHDVCGPGTIGIF+REFGED Sbjct: 63 SSTGSVKSAMTMTEKILARASERASLEPGENVWVDVDVLMTHDVCGPGTIGIFKREFGED 122 Query: 444 AKVWDREKVVIIPDHYIFTSDERANRNVDILRDFCMEQKIKYFYDIKDLSDFRANPDYKG 623 AKVWDREKVVIIPDHYIFTSDERANRNVDILRDFCMEQ IKYFYDIKDLS+F+ANPDYKG Sbjct: 123 AKVWDREKVVIIPDHYIFTSDERANRNVDILRDFCMEQNIKYFYDIKDLSNFKANPDYKG 182 Query: 624 VCHIALAQEGHCRPGEVLIGTDSHTCNAGAFGQFATGIGNTDAGFVMGTGKALLKVPPTI 803 VCH+ALAQEGHCRPGEVL+GTDSHTCNAGAFGQFATGIGNTDAGFVMGTGKALLKVPPTI Sbjct: 183 VCHVALAQEGHCRPGEVLLGTDSHTCNAGAFGQFATGIGNTDAGFVMGTGKALLKVPPTI 242 Query: 804 RFTLDGEMPPYLLAKDLILQIIGEISVSGATYKSMEFVGSTVDSLNMEERMTLCNMVIEA 983 RF LDGEMPPYLLAKDLILQIIGEISVSGATYKSMEFVGSTV+SLNMEERMTLCNMVIEA Sbjct: 243 RFVLDGEMPPYLLAKDLILQIIGEISVSGATYKSMEFVGSTVESLNMEERMTLCNMVIEA 302 Query: 984 GGKNGVVPADETTFKYLEGKTSVEYEPAYSDAQARFVSDYRFDVSKLEPVVAKPHSPDNR 1163 GGKNGVVPAD+TTF YLEGKTSVEYEP YSDAQARFVSDYRFDVSKLEPV+AKPHSPDNR Sbjct: 303 GGKNGVVPADQTTFNYLEGKTSVEYEPVYSDAQARFVSDYRFDVSKLEPVIAKPHSPDNR 362 Query: 1164 ALARECKDVKIDRVYIGSCTGGKTEDFFAAAKVFLASGKKVKVPTFLVPATQKVWMDVYS 1343 ALARECKDVKIDRVYIGSCTGGKTEDFFAAAKVFLASGKKVKVPTFLVPATQKVWMD+YS Sbjct: 363 ALARECKDVKIDRVYIGSCTGGKTEDFFAAAKVFLASGKKVKVPTFLVPATQKVWMDIYS 422 Query: 1344 IPVPGSGGKTCSQIFEEAGCDTPASPNCGACLGGPRDTYARMNEPKVCVATTNRNFPGRM 1523 IPVPG+GGKTCSQIFEEAGCDTPASP+CGACLGGPRDTYARMNEP VCV+TTNRNFPGRM Sbjct: 423 IPVPGAGGKTCSQIFEEAGCDTPASPSCGACLGGPRDTYARMNEPMVCVSTTNRNFPGRM 482 Query: 1524 GHKEGQIYLASPYTAAASALTGYVTDPRDFLM 1619 GHKEGQIYLASP+TAAASALTGYVTDPRDFLM Sbjct: 483 GHKEGQIYLASPFTAAASALTGYVTDPRDFLM 514 >gb|EAY84290.1| hypothetical protein OsI_005523 [Oryza sativa (indica cultivar-group)] gb|EAZ21573.1| hypothetical protein OsJ_005056 [Oryza sativa (japonica cultivar-group)] Length = 485 Score = 890 bits (2301), Expect = 0.0 Identities = 429/447 (95%), Positives = 442/447 (98%) Frame = +3 Query: 279 VKSAMTMTEKILARASERASLEPGENVWVDVDVLMTHDVCGPGTIGIFEREFGEDAKVWD 458 VKSAMTMTEKILARASERASLEPGENVWVDVDVLMTHDVCGPGTIGIF+REFGEDAKVWD Sbjct: 39 VKSAMTMTEKILARASERASLEPGENVWVDVDVLMTHDVCGPGTIGIFKREFGEDAKVWD 98 Query: 459 REKVVIIPDHYIFTSDERANRNVDILRDFCMEQKIKYFYDIKDLSDFRANPDYKGVCHIA 638 REKVVIIPDHYIFTSDERANRNVDILRDFCMEQ IKYFYDIKDLS+F+ANPDYKGVCH+A Sbjct: 99 REKVVIIPDHYIFTSDERANRNVDILRDFCMEQNIKYFYDIKDLSNFKANPDYKGVCHVA 158 Query: 639 LAQEGHCRPGEVLIGTDSHTCNAGAFGQFATGIGNTDAGFVMGTGKALLKVPPTIRFTLD 818 LAQEGHCRPGEVL+GTDSHTCNAGAFGQFATGIGNTDAGFVMGTGKALLKVPPTIRF LD Sbjct: 159 LAQEGHCRPGEVLLGTDSHTCNAGAFGQFATGIGNTDAGFVMGTGKALLKVPPTIRFVLD 218 Query: 819 GEMPPYLLAKDLILQIIGEISVSGATYKSMEFVGSTVDSLNMEERMTLCNMVIEAGGKNG 998 GEMPPYLLAKDLILQIIGEISVSGATYKSMEFVGSTV+SLNMEERMTLCNMVIEAGGKNG Sbjct: 219 GEMPPYLLAKDLILQIIGEISVSGATYKSMEFVGSTVESLNMEERMTLCNMVIEAGGKNG 278 Query: 999 VVPADETTFKYLEGKTSVEYEPAYSDAQARFVSDYRFDVSKLEPVVAKPHSPDNRALARE 1178 VVPAD+TTF YLEGKTSVEYEP YSDAQARFVSDYRFDVSKLEPV+AKPHSPDNRALARE Sbjct: 279 VVPADQTTFNYLEGKTSVEYEPVYSDAQARFVSDYRFDVSKLEPVIAKPHSPDNRALARE 338 Query: 1179 CKDVKIDRVYIGSCTGGKTEDFFAAAKVFLASGKKVKVPTFLVPATQKVWMDVYSIPVPG 1358 CKDVKIDRVYIGSCTGGKTEDFFAAAKVFLASGKKVKVPTFLVPATQKVWMD+YSIPVPG Sbjct: 339 CKDVKIDRVYIGSCTGGKTEDFFAAAKVFLASGKKVKVPTFLVPATQKVWMDIYSIPVPG 398 Query: 1359 SGGKTCSQIFEEAGCDTPASPNCGACLGGPRDTYARMNEPKVCVATTNRNFPGRMGHKEG 1538 +GGKTCSQIFEEAGCDTPASP+CGACLGGPRDTYARMNEP VCV+TTNRNFPGRMGHKEG Sbjct: 399 AGGKTCSQIFEEAGCDTPASPSCGACLGGPRDTYARMNEPMVCVSTTNRNFPGRMGHKEG 458 Query: 1539 QIYLASPYTAAASALTGYVTDPRDFLM 1619 QIYLASP+TAAASALTGYVTDPRDFLM Sbjct: 459 QIYLASPFTAAASALTGYVTDPRDFLM 485 >ref|NP_567405.1| aconitase family protein / aconitate hydratase family protein [Arabidopsis thaliana] gb|AAK76516.1| unknown protein [Arabidopsis thaliana] gb|AAM51226.1| unknown protein [Arabidopsis thaliana] Length = 509 Score = 796 bits (2055), Expect = 0.0 Identities = 372/450 (82%), Positives = 417/450 (92%) Frame = +3 Query: 267 STGSVKSAMTMTEKILARASERASLEPGENVWVDVDVLMTHDVCGPGTIGIFEREFGEDA 446 +TGSVK+ MTMTEKILARASE++ + PG+N+WV+VDVLMTHDVCGPG GIF+REFGE A Sbjct: 59 TTGSVKTGMTMTEKILARASEKSLVVPGDNIWVNVDVLMTHDVCGPGAFGIFKREFGEKA 118 Query: 447 KVWDREKVVIIPDHYIFTSDERANRNVDILRDFCMEQKIKYFYDIKDLSDFRANPDYKGV 626 KVWD EK+V+IPDHYIFT+D+RANRNVDI+R+ C EQ IKYFYDI DL +F+ANPDYKGV Sbjct: 119 KVWDPEKIVVIPDHYIFTADKRANRNVDIMREHCREQNIKYFYDITDLGNFKANPDYKGV 178 Query: 627 CHIALAQEGHCRPGEVLIGTDSHTCNAGAFGQFATGIGNTDAGFVMGTGKALLKVPPTIR 806 CH+ALAQEGHCRPGEVL+GTDSHTC AGAFGQFATGIGNTDAGFV+GTGK LLKVPPT+R Sbjct: 179 CHVALAQEGHCRPGEVLLGTDSHTCTAGAFGQFATGIGNTDAGFVLGTGKILLKVPPTMR 238 Query: 807 FTLDGEMPPYLLAKDLILQIIGEISVSGATYKSMEFVGSTVDSLNMEERMTLCNMVIEAG 986 F LDGEMP YL AKDLILQIIGEISV+GATYK+MEF G+T++SL+MEERMTLCNMV+EAG Sbjct: 239 FILDGEMPSYLQAKDLILQIIGEISVAGATYKTMEFSGTTIESLSMEERMTLCNMVVEAG 298 Query: 987 GKNGVVPADETTFKYLEGKTSVEYEPAYSDAQARFVSDYRFDVSKLEPVVAKPHSPDNRA 1166 GKNGV+P D TT Y+E +TSV +EP YSD A FV+DYRFDVSKLEPVVAKPHSPDNRA Sbjct: 299 GKNGVIPPDATTLNYVENRTSVPFEPVYSDGNASFVADYRFDVSKLEPVVAKPHSPDNRA 358 Query: 1167 LARECKDVKIDRVYIGSCTGGKTEDFFAAAKVFLASGKKVKVPTFLVPATQKVWMDVYSI 1346 LARECKDVKIDRVYIGSCTGGKTEDF AAAK+F A+G+KVKVPTFLVPATQKVWMDVY++ Sbjct: 359 LARECKDVKIDRVYIGSCTGGKTEDFMAAAKLFHAAGRKVKVPTFLVPATQKVWMDVYAL 418 Query: 1347 PVPGSGGKTCSQIFEEAGCDTPASPNCGACLGGPRDTYARMNEPKVCVATTNRNFPGRMG 1526 PVPG+GGKTC+QIFEEAGCDTPASP+CGACLGGP DTYAR+NEP+VCV+TTNRNFPGRMG Sbjct: 419 PVPGAGGKTCAQIFEEAGCDTPASPSCGACLGGPADTYARLNEPQVCVSTTNRNFPGRMG 478 Query: 1527 HKEGQIYLASPYTAAASALTGYVTDPRDFL 1616 HKEGQIYLASPYTAAASALTG V DPR+FL Sbjct: 479 HKEGQIYLASPYTAAASALTGRVADPREFL 508 >gb|EDQ57380.1| predicted protein [Physcomitrella patens subsp. patens] Length = 518 Score = 791 bits (2042), Expect = 0.0 Identities = 372/450 (82%), Positives = 413/450 (91%) Frame = +3 Query: 267 STGSVKSAMTMTEKILARASERASLEPGENVWVDVDVLMTHDVCGPGTIGIFEREFGEDA 446 STG+VK AMT TEKILA ASE+ L PGENVWV DVLMTHDVCGPGTIGIF++EFG++A Sbjct: 69 STGAVKQAMTATEKILANASEKTKLAPGENVWVKADVLMTHDVCGPGTIGIFKKEFGQNA 128 Query: 447 KVWDREKVVIIPDHYIFTSDERANRNVDILRDFCMEQKIKYFYDIKDLSDFRANPDYKGV 626 KVWDREK+V+IPDHYIFTSDERANRNVDILRDF EQ IKYFYDI D S+FRANPDYKGV Sbjct: 129 KVWDREKIVLIPDHYIFTSDERANRNVDILRDFAREQDIKYFYDITDRSNFRANPDYKGV 188 Query: 627 CHIALAQEGHCRPGEVLIGTDSHTCNAGAFGQFATGIGNTDAGFVMGTGKALLKVPPTIR 806 CH+ALAQEGHCRPGEVL GTDSHTCNAGAFGQFATGIGNTDAGF+MGTGK L+KVPPT+R Sbjct: 189 CHVALAQEGHCRPGEVLFGTDSHTCNAGAFGQFATGIGNTDAGFIMGTGKLLIKVPPTLR 248 Query: 807 FTLDGEMPPYLLAKDLILQIIGEISVSGATYKSMEFVGSTVDSLNMEERMTLCNMVIEAG 986 F LDGEMP YLLAKDLILQIIGEISV+GATY++MEFVG+ VD++ ME+RMTLCNMV+EAG Sbjct: 249 FVLDGEMPKYLLAKDLILQIIGEISVAGATYRAMEFVGTAVDAMTMEDRMTLCNMVVEAG 308 Query: 987 GKNGVVPADETTFKYLEGKTSVEYEPAYSDAQARFVSDYRFDVSKLEPVVAKPHSPDNRA 1166 GKNGVVPAD TT KYLEGKTS Y+ SD A F+ +YRFDVSKLEP+VAKPHSPDNR Sbjct: 309 GKNGVVPADATTAKYLEGKTSKPYQVFTSDGNASFLQEYRFDVSKLEPLVAKPHSPDNRG 368 Query: 1167 LARECKDVKIDRVYIGSCTGGKTEDFFAAAKVFLASGKKVKVPTFLVPATQKVWMDVYSI 1346 LARECKDVKIDRVYIGSCTGGKTEDF AAA++ SG+KVKVPTFLVPATQKVWMD+YS+ Sbjct: 369 LARECKDVKIDRVYIGSCTGGKTEDFLAAAELLAISGQKVKVPTFLVPATQKVWMDLYSL 428 Query: 1347 PVPGSGGKTCSQIFEEAGCDTPASPNCGACLGGPRDTYARMNEPKVCVATTNRNFPGRMG 1526 PVPG+ GKTC++IF++AGCDTPASP+C ACLGGPRDTYARMN+P+VCV+TTNRNFPGRMG Sbjct: 429 PVPGTDGKTCAEIFQQAGCDTPASPSCAACLGGPRDTYARMNDPQVCVSTTNRNFPGRMG 488 Query: 1527 HKEGQIYLASPYTAAASALTGYVTDPRDFL 1616 HKEGQIYLASPYTAAASALTG+VTDPR+FL Sbjct: 489 HKEGQIYLASPYTAAASALTGFVTDPREFL 518 >emb|CAB40778.1| putative protein [Arabidopsis thaliana] emb|CAB78385.1| putative protein [Arabidopsis thaliana] Length = 509 Score = 783 bits (2021), Expect = 0.0 Identities = 368/450 (81%), Positives = 412/450 (91%) Frame = +3 Query: 267 STGSVKSAMTMTEKILARASERASLEPGENVWVDVDVLMTHDVCGPGTIGIFEREFGEDA 446 +TGSVK+ MTMTEKILARASE++ + PG+N+WV+VDVLMTHDVCGPG GIF+REFGE A Sbjct: 59 TTGSVKTGMTMTEKILARASEKSLVVPGDNIWVNVDVLMTHDVCGPGAFGIFKREFGEKA 118 Query: 447 KVWDREKVVIIPDHYIFTSDERANRNVDILRDFCMEQKIKYFYDIKDLSDFRANPDYKGV 626 KVWD EK+V+IPDHYIFT+D+RANRNVDI+R+ C EQ IKYFYDI DL +F+ANPDYKGV Sbjct: 119 KVWDPEKIVVIPDHYIFTADKRANRNVDIMREHCREQNIKYFYDITDLGNFKANPDYKGV 178 Query: 627 CHIALAQEGHCRPGEVLIGTDSHTCNAGAFGQFATGIGNTDAGFVMGTGKALLKVPPTIR 806 CH+ALAQEGHCRPGEVL+GTDSHTC AGAFGQFATGIGNTDAGFV+GTGK LLKVPPT+R Sbjct: 179 CHVALAQEGHCRPGEVLLGTDSHTCTAGAFGQFATGIGNTDAGFVLGTGKILLKVPPTMR 238 Query: 807 FTLDGEMPPYLLAKDLILQIIGEISVSGATYKSMEFVGSTVDSLNMEERMTLCNMVIEAG 986 F LDGEMP YL AKDLILQIIGEISV+GATYK+MEF G+T++SL+MEERMTLCNMV+EAG Sbjct: 239 FILDGEMPSYLQAKDLILQIIGEISVAGATYKTMEFSGTTIESLSMEERMTLCNMVVEAG 298 Query: 987 GKNGVVPADETTFKYLEGKTSVEYEPAYSDAQARFVSDYRFDVSKLEPVVAKPHSPDNRA 1166 GKNGV+P D TT Y+E + P YSD A FV+DYRFDVSKLEPVVAKPHSPDNRA Sbjct: 299 GKNGVIPPDATTLNYVEACILSCFLPVYSDGNASFVADYRFDVSKLEPVVAKPHSPDNRA 358 Query: 1167 LARECKDVKIDRVYIGSCTGGKTEDFFAAAKVFLASGKKVKVPTFLVPATQKVWMDVYSI 1346 LARECKDVKIDRVYIGSCTGGKTEDF AAAK+F A+G+KVKVPTFLVPATQKVWMDVY++ Sbjct: 359 LARECKDVKIDRVYIGSCTGGKTEDFMAAAKLFHAAGRKVKVPTFLVPATQKVWMDVYAL 418 Query: 1347 PVPGSGGKTCSQIFEEAGCDTPASPNCGACLGGPRDTYARMNEPKVCVATTNRNFPGRMG 1526 PVPG+GGKTC+QIFEEAGCDTPASP+CGACLGGP DTYAR+NEP+VCV+TTNRNFPGRMG Sbjct: 419 PVPGAGGKTCAQIFEEAGCDTPASPSCGACLGGPADTYARLNEPQVCVSTTNRNFPGRMG 478 Query: 1527 HKEGQIYLASPYTAAASALTGYVTDPRDFL 1616 HKEGQIYLASPYTAAASALTG V DPR+FL Sbjct: 479 HKEGQIYLASPYTAAASALTGRVADPREFL 508