BLASTX 2.2.17

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= bphyem201b23
         (1438 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           5,815,196 sequences; 2,006,227,497 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

gb|EAZ10667.1| hypothetical protein OsJ_000492 [Oryza sativa (ja...   454   e-126
gb|EAY72663.1| hypothetical protein OsI_000510 [Oryza sativa (in...   450   e-124
emb|CAO69008.1| unnamed protein product [Vitis vinifera]              407   e-112
ref|NP_201127.2| unknown protein [Arabidopsis thaliana] >gi|4895...   369   e-100
emb|CAN78083.1| hypothetical protein [Vitis vinifera]                 355   6e-96

>gb|EAZ10667.1| hypothetical protein OsJ_000492 [Oryza sativa (japonica cultivar-group)]
          Length = 328

 Score = 454 bits (1168), Expect = e-126
 Identities = 237/312 (75%), Positives = 257/312 (82%)
 Frame = +3

Query: 162  LPPPQQTIEKLENMVDGGNYYEAQQMYKSTSARYIAFQRYSEALDIFQSGALIQLKHGQV 341
            LPPPQQTIEKLENMV  GNYYEAQQMYKST ARYIA Q+Y EALDI QSGAL+QLKHGQV
Sbjct: 19   LPPPQQTIEKLENMVAEGNYYEAQQMYKSTGARYIAAQKYLEALDILQSGALVQLKHGQV 78

Query: 342  TCGAELAVLFVDTLVKGRLPYNEETFDRIRKMYEAFPRISISDFLGXXXXXXGQKLSEAI 521
            TCG ELA++FVDTLVK  LPYNEETFDRIRKMY+AFPRIS+  FLG      GQKLSEAI
Sbjct: 79   TCGGELAIMFVDTLVKAALPYNEETFDRIRKMYDAFPRISVPHFLGDDYDDDGQKLSEAI 138

Query: 522  SAAKVRAESCSSFLKAAIRWSAEFGTSRNGSPELHVMLAEYIYSESPETDMTKVSSHFVH 701
            SAAKVR+ESCSSFL+AAIR  AE     N    + +  A+         DMTKVSSHFV
Sbjct: 139  SAAKVRSESCSSFLRAAIR--AEILCLFNCYSLMPLWFAKVRVDCRWMQDMTKVSSHFVR 196

Query: 702  GNDPKKFASMLVNFMGKCYPGEDDTAIARGVLTYLSQGNLRDANFLMYDTKEQLYSGVLV 881
            GNDPKKFASML NFMGKCYPGEDDTAIARGVL YLSQGNLRDAN LM + K+QL S  L
Sbjct: 197  GNDPKKFASMLANFMGKCYPGEDDTAIARGVLMYLSQGNLRDANLLMDELKDQLKSADLE 256

Query: 882  FPKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTDRDPVFEELLNEIAAIFYGMRRQN 1061
             PKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTD DPVFEELL+EIAA FYG+R Q+
Sbjct: 257  IPKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTDHDPVFEELLDEIAAKFYGIRSQS 316

Query: 1062 PLQGLFGEMFKI 1097
             L+GLFG+MF++
Sbjct: 317  ALEGLFGDMFRV 328

>gb|EAY72663.1| hypothetical protein OsI_000510 [Oryza sativa (indica cultivar-group)]
          Length = 328

 Score = 450 bits (1157), Expect = e-124
 Identities = 236/312 (75%), Positives = 256/312 (82%)
 Frame = +3

Query: 162  LPPPQQTIEKLENMVDGGNYYEAQQMYKSTSARYIAFQRYSEALDIFQSGALIQLKHGQV 341
            LPPPQQTIEKLENMV  GNYYEAQQMYKST ARYIA Q+Y EALDI QSGAL+QLKHGQV
Sbjct: 19   LPPPQQTIEKLENMVAEGNYYEAQQMYKSTGARYIAAQKYLEALDILQSGALVQLKHGQV 78

Query: 342  TCGAELAVLFVDTLVKGRLPYNEETFDRIRKMYEAFPRISISDFLGXXXXXXGQKLSEAI 521
            TCG ELA++FVDTLVK  LPYNEETFDRIRKMY AFPRIS+  FLG      GQKLSEAI
Sbjct: 79   TCGGELAIMFVDTLVKAALPYNEETFDRIRKMYSAFPRISVPHFLGDDYDDDGQKLSEAI 138

Query: 522  SAAKVRAESCSSFLKAAIRWSAEFGTSRNGSPELHVMLAEYIYSESPETDMTKVSSHFVH 701
            SAAKVR+ESCSSFL+AAIR  AE     N    + +  A+         DMTKVSSHFV
Sbjct: 139  SAAKVRSESCSSFLRAAIR--AETLCLFNCYSLMPLWFAKVRVDCRWMQDMTKVSSHFVR 196

Query: 702  GNDPKKFASMLVNFMGKCYPGEDDTAIARGVLTYLSQGNLRDANFLMYDTKEQLYSGVLV 881
            GNDPKKFASML NFMGKCYPGEDDTAIARGVL YLSQGNLRDAN LM + K+QL S  L
Sbjct: 197  GNDPKKFASMLANFMGKCYPGEDDTAIARGVLMYLSQGNLRDANLLMDELKDQLKSADLE 256

Query: 882  FPKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTDRDPVFEELLNEIAAIFYGMRRQN 1061
             PKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTDR+ VFEELL+EIAA FYG+R Q+
Sbjct: 257  IPKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTDREAVFEELLDEIAAKFYGIRSQS 316

Query: 1062 PLQGLFGEMFKI 1097
             L+GLFG+MF++
Sbjct: 317  ALEGLFGDMFRV 328

>emb|CAO69008.1| unnamed protein product [Vitis vinifera]
          Length = 350

 Score = 407 bits (1047), Expect = e-112
 Identities = 202/312 (64%), Positives = 248/312 (79%)
 Frame = +3

Query: 162  LPPPQQTIEKLENMVDGGNYYEAQQMYKSTSARYIAFQRYSEALDIFQSGALIQLKHGQV 341
            LPP Q+ I+KLE  V+ G+YY AQQMYKS SARY + +R  EALDI +SGA IQL+ GQV
Sbjct: 11   LPPAQEHIDKLEKTVNDGDYYGAQQMYKSISARYASAERCYEALDILESGACIQLEKGQV 70

Query: 342  TCGAELAVLFVDTLVKGRLPYNEETFDRIRKMYEAFPRISISDFLGXXXXXXGQKLSEAI 521
            TCGAELA+LFV+TLVKG+ PY++ T DR+RK+Y+ FP+IS+   L        QKLSEA+
Sbjct: 71   TCGAELAILFVETLVKGKFPYDDNTLDRVRKIYKNFPQISVPQQLEDDDDM--QKLSEAL 128

Query: 522  SAAKVRAESCSSFLKAAIRWSAEFGTSRNGSPELHVMLAEYIYSESPETDMTKVSSHFVH 701
             AAK R E CSSFLKAA++WSAEFG  R GSPE+H MLAEY+YSESPE DM+++S HFV
Sbjct: 129  GAAKTRVEVCSSFLKAAMKWSAEFGFHRQGSPEIHDMLAEYLYSESPELDMSRISLHFVR 188

Query: 702  GNDPKKFASMLVNFMGKCYPGEDDTAIARGVLTYLSQGNLRDANFLMYDTKEQLYSGVLV 881
            GN+P+KFAS LVNFMGKCYPGEDD AIAR VL YLS GNLRDAN+LM + K+Q+ S  L
Sbjct: 189  GNNPEKFASTLVNFMGKCYPGEDDLAIARAVLMYLSLGNLRDANYLMDEVKKQVESKELD 248

Query: 882  FPKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTDRDPVFEELLNEIAAIFYGMRRQN 1061
            +P++DL +FI YLL TL+RDA PLF  LRQ YK+S DR+P F ELL+EIA  FYG+RR+N
Sbjct: 249  YPESDLTEFIDYLLLTLQRDALPLFNMLRQSYKSSIDREPAFNELLDEIAEKFYGVRRRN 308

Query: 1062 PLQGLFGEMFKI 1097
            P+QG+FG+ FK+
Sbjct: 309  PMQGMFGDFFKV 320

>ref|NP_201127.2| unknown protein [Arabidopsis thaliana]
 gb|AAT47814.1| At5g63220 [Arabidopsis thaliana]
 dbj|BAD43990.1| putative protein [Arabidopsis thaliana]
          Length = 324

 Score = 369 bits (946), Expect = e-100
 Identities = 182/313 (58%), Positives = 233/313 (74%), Gaps = 1/313 (0%)
 Frame = +3

Query: 162  LPPPQQTIEKLENMVDGGNYYEAQQMYKSTSARYIAFQRYSEALDIFQSGALIQLKHGQV 341
            LPP Q+ I+KL  +++ GNYY A QMYKS SARY+  QR+SEALDI  SGA I+L+HG V
Sbjct: 10   LPPVQEHIDKLRKVIEEGNYYGALQMYKSISARYVTAQRFSEALDILFSGACIELEHGLV 69

Query: 342  TCGAELAVLFVDTLVKGRLPYNEETFDRIRKMYEAFPRISISDFLGXXXXXXG-QKLSEA 518
             CGA+LA+LFVDTLVK + P N+ET DRIR +++ FPR+ +   L         Q L E+
Sbjct: 70   NCGADLAILFVDTLVKAKSPCNDETLDRIRCIFKLFPRVPVPPHLVDVSDDEDVQNLQES 129

Query: 519  ISAAKVRAESCSSFLKAAIRWSAEFGTSRNGSPELHVMLAEYIYSESPETDMTKVSSHFV 698
            +  A+ R E+ +SFL+AAI+WSAEFG  R G PELH ML +Y+Y+E PE DM ++S HFV
Sbjct: 130  LGEARSRVENLTSFLRAAIKWSAEFGGPRTGYPELHAMLGDYLYTECPELDMVRISRHFV 189

Query: 699  HGNDPKKFASMLVNFMGKCYPGEDDTAIARGVLTYLSQGNLRDANFLMYDTKEQLYSGVL 878
               DP+KFASMLVNFMG+CYPGEDD AIAR VL YLS GN++DANF+M + K+Q  +
Sbjct: 190  RAEDPEKFASMLVNFMGRCYPGEDDLAIARAVLMYLSMGNMKDANFMMDEIKKQAETKNP 249

Query: 879  VFPKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTDRDPVFEELLNEIAAIFYGMRRQ 1058
               ++DLIQFI YLL TL+RDA PLF  LR KYK+S DRD +  ELL+EIA  FYG++R+
Sbjct: 250  ELSESDLIQFISYLLETLQRDALPLFNMLRVKYKSSIDRDQLLNELLDEIAERFYGVQRK 309

Query: 1059 NPLQGLFGEMFKI 1097
            NPLQG+FG++FK+
Sbjct: 310  NPLQGMFGDIFKM 322

>emb|CAN78083.1| hypothetical protein [Vitis vinifera]
          Length = 391

 Score = 355 bits (910), Expect = 6e-96
 Identities = 181/285 (63%), Positives = 219/285 (76%), Gaps = 8/285 (2%)
 Frame = +3

Query: 183  IEKLENMVDGGNYYEAQQMYKSTSARYIAFQRYSEALDIFQSGALIQLKHGQVTCGAELA 362
            I+KLE  V+ G+YY AQQMYKS SARY + +RY EALDI +SGA IQL+ GQVTCGAELA
Sbjct: 63   IDKLEKTVNDGDYYGAQQMYKSISARYASAERYYEALDILESGACIQLEKGQVTCGAELA 122

Query: 363  VLFVDTLVKGRLPYNEETFD--------RIRKMYEAFPRISISDFLGXXXXXXGQKLSEA 518
            +LFV+TLVKG+ PY++ T D        R+RK+Y+ FP+IS+   L        QKLSEA
Sbjct: 123  ILFVETLVKGKFPYDDNTLDYDKIFMLDRVRKIYKNFPQISVPQQLEDDDDM--QKLSEA 180

Query: 519  ISAAKVRAESCSSFLKAAIRWSAEFGTSRNGSPELHVMLAEYIYSESPETDMTKVSSHFV 698
            + AAK R E CSSFLKAA++WSAEFG  R GSPE+H MLAEY+YSESPE DM+++S HFV
Sbjct: 181  LGAAKTRVEVCSSFLKAAMKWSAEFGFHRQGSPEIHDMLAEYLYSESPELDMSRISLHFV 240

Query: 699  HGNDPKKFASMLVNFMGKCYPGEDDTAIARGVLTYLSQGNLRDANFLMYDTKEQLYSGVL 878
             GN+P+KFAS LVNFMGKCYPGEDD AIAR VL YLS GNLRDAN+LM + K+Q+ S  L
Sbjct: 241  RGNNPEKFASTLVNFMGKCYPGEDDLAIARAVLMYLSLGNLRDANYLMDEVKKQVESKEL 300

Query: 879  VFPKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTDRDPVFEE 1013
             +P++DL +FI YLL TL+RDA PLF  LRQ YK+S DR+P F E
Sbjct: 301  DYPESDLTEFIDYLLLTLQRDALPLFNMLRQSYKSSIDREPAFNE 345