BLASTX 2.2.17 Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bphylf049j09 (1854 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 5,815,196 sequences; 2,006,227,497 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|NP_001056296.1| Os05g0558800 [Oryza sativa (japonica cultiva... 729 0.0 gb|EAY98989.1| hypothetical protein OsI_020222 [Oryza sativa (in... 726 0.0 emb|CAI84658.1| hypothetical protein [Nicotiana tabacum] 200 3e-49 emb|CAO65474.1| unnamed protein product [Vitis vinifera] 194 2e-47 ref|NP_194620.1| unknown protein [Arabidopsis thaliana] >gi|3068... 180 3e-43 >ref|NP_001056296.1| Os05g0558800 [Oryza sativa (japonica cultivar-group)] gb|AAT85094.1| unknown protein [Oryza sativa (japonica cultivar-group)] dbj|BAF18210.1| Os05g0558800 [Oryza sativa (japonica cultivar-group)] gb|EAZ35246.1| hypothetical protein OsJ_018729 [Oryza sativa (japonica cultivar-group)] Length = 451 Score = 729 bits (1883), Expect = 0.0 Identities = 377/478 (78%), Positives = 392/478 (82%) Frame = +3 Query: 243 MEDENXXXXXXXXXXXXXXXKSKARDVPLEPKAEPQVEESSSKGGSQTPEASFVHYYQTN 422 M+DEN KSKARD PLEPKAEPQVEESSSKG SQTPEA FVHYYQTN Sbjct: 1 MDDENGLELSLGLSLGGTSGKSKARDAPLEPKAEPQVEESSSKGVSQTPEAPFVHYYQTN 60 Query: 423 AENQEHNSKQRHSPAAPPFGNFWGQTGGSSALVADGSNEPMSHQSQFPRYQDGWISNNNG 602 AENQEH+SKQRHSPAAPPFGNFWGQ G SS VADGSNE Sbjct: 61 AENQEHSSKQRHSPAAPPFGNFWGQPGSSSVPVADGSNE--------------------- 99 Query: 603 NNSEEQKLVSSKRKLLSEEISFQKRHHTAADEADAFSKSSDGGVKNAPISISTDDGSTGE 782 QK VSSKRKLLSEEISFQK+ +TAA++ DAFSKSSDGGVKNAPISISTDDGSTGE Sbjct: 100 -----QKPVSSKRKLLSEEISFQKKPNTAAEQPDAFSKSSDGGVKNAPISISTDDGSTGE 154 Query: 783 NEDVAESEAEGSNSWLVAQREDNAKGSVVNRGSDRKRSSDDAAVGFQGKRRPXXXXXXXX 962 NEDVAESEAEGSNSWLVAQRED+AKGSVVNRGSDRKRSSDDAAVGFQGKR+P Sbjct: 155 NEDVAESEAEGSNSWLVAQREDSAKGSVVNRGSDRKRSSDDAAVGFQGKRQPSFSGSESS 214 Query: 963 XXKLVHGNPLSLQASNVVVMPYQVPGQVSAPPSITNASNFPPVCTVQLMPPTNNGLAVQT 1142 KL GNPLSLQASNVV +PYQVP QVSAPPSITNASNF PVCTVQL PPTNNGLAV T Sbjct: 215 SGKLPQGNPLSLQASNVVAVPYQVPSQVSAPPSITNASNFTPVCTVQLRPPTNNGLAV-T 273 Query: 1143 MGGASQLAFGYPTVQLPTLETSSSWAFGAPPPAMSSFNGKDKAERAVTKQVDDGKKPQEA 1322 MG SQ+AFGYP VQLPTLETSSSWAFGAPP AMSSF KDK ERA Q DDGKK QEA Sbjct: 274 MGSTSQVAFGYPAVQLPTLETSSSWAFGAPPQAMSSFTAKDKVERAGISQADDGKKTQEA 333 Query: 1323 GASSSAHVEDEKKADRGSPLMGSGIRPGIAPNVKFGGSGSYPNLPWVSTTGTGPNGRTIS 1502 GASSSA VED+KK+DR PLMGS IRPGIAPNVKFGGSGSYP+LPWVSTTGTGPNGRTIS Sbjct: 334 GASSSALVEDDKKSDRALPLMGSAIRPGIAPNVKFGGSGSYPDLPWVSTTGTGPNGRTIS 393 Query: 1503 GVTYKFGRNEVKIVCACHGTHMIPEEFVRHASADAPGQENNTTLPAFPVGNQAASAQN 1676 GVTYKFGRNEVKIVCACHGTHM PEEF+RHASADAPGQEN+ TLPAFPVGNQAASAQN Sbjct: 394 GVTYKFGRNEVKIVCACHGTHMTPEEFMRHASADAPGQENSATLPAFPVGNQAASAQN 451 >gb|EAY98989.1| hypothetical protein OsI_020222 [Oryza sativa (indica cultivar-group)] Length = 451 Score = 726 bits (1875), Expect = 0.0 Identities = 376/478 (78%), Positives = 391/478 (81%) Frame = +3 Query: 243 MEDENXXXXXXXXXXXXXXXKSKARDVPLEPKAEPQVEESSSKGGSQTPEASFVHYYQTN 422 M+DEN KSKARD PLEPKAEPQVEESSSKG SQTPEA FVHYYQTN Sbjct: 1 MDDENGLELSLGLSLGGTSGKSKARDAPLEPKAEPQVEESSSKGVSQTPEAPFVHYYQTN 60 Query: 423 AENQEHNSKQRHSPAAPPFGNFWGQTGGSSALVADGSNEPMSHQSQFPRYQDGWISNNNG 602 AENQEH+SKQRHSPAAPPFGNFWGQ G SS VADGSNE Sbjct: 61 AENQEHSSKQRHSPAAPPFGNFWGQPGSSSVPVADGSNE--------------------- 99 Query: 603 NNSEEQKLVSSKRKLLSEEISFQKRHHTAADEADAFSKSSDGGVKNAPISISTDDGSTGE 782 QK VSSKRKLLSEEISFQK+ +TAA++ DAFSKSSDGGVKNAPISISTDDGSTGE Sbjct: 100 -----QKPVSSKRKLLSEEISFQKKPNTAAEQPDAFSKSSDGGVKNAPISISTDDGSTGE 154 Query: 783 NEDVAESEAEGSNSWLVAQREDNAKGSVVNRGSDRKRSSDDAAVGFQGKRRPXXXXXXXX 962 NEDVAESEAEGSNSWLVAQRED+AKGSVVNRGSDRKRSSDDAAVGFQGKR+P Sbjct: 155 NEDVAESEAEGSNSWLVAQREDSAKGSVVNRGSDRKRSSDDAAVGFQGKRQPSFSGSESS 214 Query: 963 XXKLVHGNPLSLQASNVVVMPYQVPGQVSAPPSITNASNFPPVCTVQLMPPTNNGLAVQT 1142 KL GNPLSLQASNVV +PYQVP QVSAPPSITNASNF PVCTVQL PPTNN LAV T Sbjct: 215 SGKLPQGNPLSLQASNVVAVPYQVPSQVSAPPSITNASNFTPVCTVQLRPPTNNELAV-T 273 Query: 1143 MGGASQLAFGYPTVQLPTLETSSSWAFGAPPPAMSSFNGKDKAERAVTKQVDDGKKPQEA 1322 MG SQ+AFGYP VQLPTLETSSSWAFGAPP AMSSF KDK ERA Q DDGKK QEA Sbjct: 274 MGSTSQVAFGYPAVQLPTLETSSSWAFGAPPQAMSSFTAKDKVERAGISQADDGKKTQEA 333 Query: 1323 GASSSAHVEDEKKADRGSPLMGSGIRPGIAPNVKFGGSGSYPNLPWVSTTGTGPNGRTIS 1502 GASSSA VED+KK+DR PLMGS IRPGIAPNVKFGGSGSYP+LPWVSTTGTGPNGRTIS Sbjct: 334 GASSSALVEDDKKSDRALPLMGSAIRPGIAPNVKFGGSGSYPDLPWVSTTGTGPNGRTIS 393 Query: 1503 GVTYKFGRNEVKIVCACHGTHMIPEEFVRHASADAPGQENNTTLPAFPVGNQAASAQN 1676 GVTYKFGRNEVKIVCACHGTHM PEEF+RHASADAPGQEN+ TLPAFPVGNQAASAQN Sbjct: 394 GVTYKFGRNEVKIVCACHGTHMTPEEFMRHASADAPGQENSATLPAFPVGNQAASAQN 451 >emb|CAI84658.1| hypothetical protein [Nicotiana tabacum] Length = 510 Score = 200 bits (508), Expect = 3e-49 Identities = 143/420 (34%), Positives = 209/420 (49%), Gaps = 55/420 (13%) Frame = +3 Query: 582 WISNNNGNNSEEQKL---VSSKRKLLSEEISFQKRHHTAADEADAFSKSSDGGVKNAPIS 752 W+ N++ E+ V KRK L E S QK+ AD K+ + + IS Sbjct: 97 WVQNDSRPVEVEEDRRADVGDKRKNLFRESSQQKKQEREGHHADTHDKT-----RTSHIS 151 Query: 753 ISTDDGSTGENEDVAESEAEGSNSWLVAQREDNAKGSVVNRGS---DRKRSSDDAAVGFQ 923 I+TD+GST ENEDVA+SE GS S + Q ++++K V + G ++ S A+ G + Sbjct: 152 ITTDEGSTAENEDVADSETVGSTSRQILQHDESSKRFVGSSGLAEVHKELRSVPASSGVE 211 Query: 924 --GKRR-PXXXXXXXXXXKLVHGNPLSLQASNVVVMPYQVPGQVSAPPSITNASNFPPVC 1094 G+RR + + P Q+ N++ +PY +P S S T+ +++P Sbjct: 212 LIGQRRFTISSEKDVKFGNIPYTIPFQGQSINIMNLPYSMPLN-SNTVSTTSTTSYPVPG 270 Query: 1095 TVQLMPPT--NNGLAVQTMGGASQLAFGYPTVQLPTLETSSSWAFGA------------P 1232 +QLM T + + + L FGY +VQLPTL+ + + P Sbjct: 271 VMQLMATTCVDRPPSHPVIPAYLPLMFGYSSVQLPTLDNDNLHGVASHLLQLHPSHGRGP 330 Query: 1233 PPAMSSFNGKDKAERAVT----KQVD----DGKKPQEAGASSSAHVEDEKKADRGS---- 1376 + +G + ++ A + K D DG+ + + H +E RG Sbjct: 331 LGSDKQKDGPNISQAAASSIPHKSSDSVQYDGRAMEHVKGNGRQHKAEETSNSRGEENVK 390 Query: 1377 --------------------PLMGSGIRPGIAPNVKFGGSGSYPNLPWVSTTGTGPNGRT 1496 P S IRPG+A ++KFGGSGSYPNLPWVSTTG GPNGRT Sbjct: 391 GSNISFRAKDPPDQPRAEAVPSEFSTIRPGLAADLKFGGSGSYPNLPWVSTTGPGPNGRT 450 Query: 1497 ISGVTYKFGRNEVKIVCACHGTHMIPEEFVRHASADAPGQENNTTLPAFPVGNQAASAQN 1676 ISGVTY++ +++IVCACHG+HM P++FVRHAS + QE T + +FP N AASAQ+ Sbjct: 451 ISGVTYRYSSTQIRIVCACHGSHMSPDDFVRHASVEQTSQEPGTGVSSFPSSNPAASAQS 510 >emb|CAO65474.1| unnamed protein product [Vitis vinifera] Length = 352 Score = 194 bits (493), Expect = 2e-47 Identities = 141/391 (36%), Positives = 190/391 (48%), Gaps = 47/391 (12%) Frame = +3 Query: 645 LLSEEISFQKRHHTAADEADAFSKSSDGGVKNAPISISTDDGSTGENEDVAESEAEGSNS 824 +L +E++ QK+H AD K+ K + ISI+T+DGST ENEDVAESE + S S Sbjct: 1 MLFDEVNHQKKHDREVHHADLHEKT-----KTSHISITTEDGSTAENEDVAESEVDVSTS 55 Query: 825 WLVAQREDNAKGSVVNRGSDRKRSSDDAAVGFQGKRRPXXXXXXXXXXKLVHGNPLSLQA 1004 L + +D +K V +D +AV QG++R L + P +Q+ Sbjct: 56 RLASHPDDGSKRFVHG-------VTDSSAVDLQGQKRFNFSSENEFKRNLAYCVPFPVQS 108 Query: 1005 SNVVVMPYQVPGQVSAPPSITNASNFPPVCTVQLMPPTNNGLAVQTMGGASQLAFGYPTV 1184 N+ V PY + + S P + PV T G L FGY V Sbjct: 109 VNINV-PYSLHAKESNP-----RAGIQPVNT-----------------GNLPLMFGYSPV 145 Query: 1185 QLPTLETSSSWAF------------GAPPPAMSSFNGKDKAERAVTKQVD---------- 1298 QLP L+ ++SW G PP N K +A + + Sbjct: 146 QLPMLDKNNSWGVASHSQQFHPPYAGKGPPNSDKHNDGLKISQAAVQAIPHNSPEASHYD 205 Query: 1299 ---------DGKK-PQEAGASSSAHVEDEKKADR-------GSPLM--------GSGIRP 1403 DGK+ P E G SS ED+ K + G+ S IRP Sbjct: 206 GRALELTKGDGKQHPTEEG--SSCQTEDDVKGNNVIFRSKDGTDQAITEVFSYESSAIRP 263 Query: 1404 GIAPNVKFGGSGSYPNLPWVSTTGTGPNGRTISGVTYKFGRNEVKIVCACHGTHMIPEEF 1583 GIA ++KFGGSGS+PNLPWVSTTG+ NG+TISGVTY++ R++++IVCACHG+HM PEEF Sbjct: 264 GIAADMKFGGSGSHPNLPWVSTTGS--NGKTISGVTYRYSRDQIRIVCACHGSHMSPEEF 321 Query: 1584 VRHASADAPGQENNTTLPAFPVGNQAASAQN 1676 V+HAS + E T L FP N A SAQ+ Sbjct: 322 VQHASEEHANPETGTGLAPFPSSNPATSAQS 352 >ref|NP_194620.1| unknown protein [Arabidopsis thaliana] ref|NP_849467.1| unknown protein [Arabidopsis thaliana] ref|NP_001078466.1| unknown protein [Arabidopsis thaliana] emb|CAB43905.1| putative protein [Arabidopsis thaliana] emb|CAB81479.1| putative protein [Arabidopsis thaliana] gb|AAL38791.1| unknown protein [Arabidopsis thaliana] gb|AAM20360.1| unknown protein [Arabidopsis thaliana] gb|AAM61641.1| unknown [Arabidopsis thaliana] Length = 425 Score = 180 bits (457), Expect = 3e-43 Identities = 146/476 (30%), Positives = 210/476 (44%), Gaps = 10/476 (2%) Frame = +3 Query: 243 MEDENXXXXXXXXXXXXXXXKSKARDVPLEPKAEPQVEESSSKGGSQTPEA-----SFVH 407 M+D+N K+K + A E ++GG ++ + +F+H Sbjct: 1 MDDDNGLELSLGLSCGGSTGKAKGNN---NNNAGSSSENYRAEGGDRSAKVIDDFKNFLH 57 Query: 408 YYQTNAENQEHNSKQRHSPAAPPFGNFWGQTGGSSALVADGSNEPMSHQSQFPRYQDGWI 587 S++ S PP NF+ + A+ S +P+ W+ Sbjct: 58 PTSQRPAEPSSGSQRSDSGQQPP-QNFFNDLSKAPTTEAEASTKPL------------WV 104 Query: 588 SNNNGNNSEEQKLVSSKRKLLSEEISFQKRHHTAADEADAFSKSSDGGVKNAPISISTDD 767 + E +K +KRK ++ K+ + D K + K + +S +TD+ Sbjct: 105 ED------ESRKEAGNKRKFGFPGMNDDKKKEKDSSHVDMHEKKT----KASHVSTATDE 154 Query: 768 GSTGENEDVAESEAEGSNSWLVAQREDNAKGSVVNRGSDRKRSSDDAAVGFQGKRRPXXX 947 GST ENEDVAESE G +S N VV +D + D G + Sbjct: 155 GSTAENEDVAESEVGGGSS-------SNHAKEVVRPPTDT--NIVDNLTGQRRSNHGGSG 205 Query: 948 XXXXXXXKLVHGNPLSLQASNVVV-MPYQVPGQVSAPPSITNASNFPPVCTVQLMPPTNN 1124 + + P ++ NVV MPY +P T S T L P N Sbjct: 206 TEEFTMRNMSYTVPFTVHPQNVVTSMPYSLP---------TKESGQHAAATSLLQPNAN- 255 Query: 1125 GLAVQTMGGASQLAFGYPTVQLPTLETSSSWAFGAPPPAMSSFNGKDKAERAVTKQVDDG 1304 G + FGY VQLP L+ S G + S F G+ + A K +G Sbjct: 256 -------AGNLPIMFGYSPVQLPMLDKDGSG--GIVALSQSPFAGRVPSNSATAK--GEG 304 Query: 1305 KKP-QEAGASSSAHVE---DEKKADRGSPLMGSGIRPGIAPNVKFGGSGSYPNLPWVSTT 1472 K+P E G+S A D + S I+PG+A +VKFGGSG+ PNLPWVSTT Sbjct: 305 KQPVAEEGSSEDASERPTGDNSNLNTAFSFDFSAIKPGMAADVKFGGSGARPNLPWVSTT 364 Query: 1473 GTGPNGRTISGVTYKFGRNEVKIVCACHGTHMIPEEFVRHASADAPGQENNTTLPA 1640 G+GP+GRTISGVTY++ N++KIVCACHG+HM PEEFVRHAS + E++ + A Sbjct: 365 GSGPHGRTISGVTYRYNANQIKIVCACHGSHMSPEEFVRHASEEYVSPESSMGMTA 420