BLASTX 2.2.17 Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bphylf042g20 (1255 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 5,815,196 sequences; 2,006,227,497 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|NP_001065493.1| Os10g0577700 [Oryza sativa (japonica cultiva... 609 e-172 gb|AAL58188.1|AC027037_10 unknown protein [Oryza sativa (japonic... 572 e-161 gb|EAY79638.1| hypothetical protein OsI_033597 [Oryza sativa (in... 420 e-116 emb|CAO63369.1| unnamed protein product [Vitis vinifera] 392 e-107 ref|NP_565805.1| shikimate kinase-related [Arabidopsis thaliana]... 392 e-107 >ref|NP_001065493.1| Os10g0577700 [Oryza sativa (japonica cultivar-group)] gb|ABB48026.1| expressed protein [Oryza sativa (japonica cultivar-group)] dbj|BAF27330.1| Os10g0577700 [Oryza sativa (japonica cultivar-group)] Length = 375 Score = 609 bits (1570), Expect = e-172 Identities = 302/319 (94%), Positives = 315/319 (98%) Frame = +3 Query: 201 RASVSASPAPAQDYEFTDGNGEVELRLDIGKLGIENSRDVFVDVDDTSLLIRAKSDGTLR 380 RASVS+S APA+DYEFTDG GEVELRLDIGKLGIENSRDVFVDVDDTSLL+RAKSDGTLR Sbjct: 57 RASVSSSTAPAKDYEFTDGGGEVELRLDIGKLGIENSRDVFVDVDDTSLLVRAKSDGTLR 116 Query: 381 TLMNVKQLFDRIKSSETIWFIDEDQLVANLKKVEQELKWPDIDESWESLTSGITQLLTGI 560 TL+NVKQLFDRIKSSETIWFIDEDQLV NLKKVEQELKWPDIDESWESLTSGITQLLTGI Sbjct: 117 TLINVKQLFDRIKSSETIWFIDEDQLVVNLKKVEQELKWPDIDESWESLTSGITQLLTGI 176 Query: 561 SVHIVGDSTDINEAVAKEIAEGIGYLPVCTSELLESATQKSIDTWVASEGVDSVAEAECV 740 SVHIVGDSTDINEAVAKEIAEGIGYLPVCTSELLESAT+KSID W+ASEGVDSVAEAECV Sbjct: 177 SVHIVGDSTDINEAVAKEIAEGIGYLPVCTSELLESATEKSIDKWLASEGVDSVAEAECV 236 Query: 741 VLESLSSHVRTVVATLGGKQGAASRFDKWQYLHAGFTVWLSVSDANDEASAKEEARRSVS 920 VLESLSSHVRTVVATLGGKQGAASRFDKWQYLHAGFTVWLSVSDA+DEASAKEEARRSVS Sbjct: 237 VLESLSSHVRTVVATLGGKQGAASRFDKWQYLHAGFTVWLSVSDASDEASAKEEARRSVS 296 Query: 921 AGNVAYAKADVVVKLGGWEPEYTRAVAQGCLVALKQLTLADKKLAGKKSLYIRLGCRGDW 1100 +GNVAYAKADVVVKLGGW+PEYTRAVAQGCLVALKQLTLADKKLAGKKSLY+RLGCRGDW Sbjct: 297 SGNVAYAKADVVVKLGGWDPEYTRAVAQGCLVALKQLTLADKKLAGKKSLYMRLGCRGDW 356 Query: 1101 PNLEPPGWDPESDAPPSNI 1157 PN+EPPGWDP+SDAPP+NI Sbjct: 357 PNIEPPGWDPDSDAPPTNI 375 >gb|AAL58188.1|AC027037_10 unknown protein [Oryza sativa (japonica cultivar-group)] Length = 336 Score = 572 bits (1473), Expect = e-161 Identities = 290/333 (87%), Positives = 302/333 (90%), Gaps = 28/333 (8%) Frame = +3 Query: 243 EFTDGNGEVELRLDIGKLGIENSRDVFVDVDDTSLLIRAKSDGTLRTLMNVKQLFDRIKS 422 +FTDG GEVELRLDIGKLGIENSRDVFVDVDDTSLL+RAKSDGTLRTL+NVKQLFDRIKS Sbjct: 4 KFTDGGGEVELRLDIGKLGIENSRDVFVDVDDTSLLVRAKSDGTLRTLINVKQLFDRIKS 63 Query: 423 SETIWFIDEDQLVANLKKVEQELKWPDIDESWESLTSGITQLLTGISVHIVGDSTDINEA 602 SETIWFIDEDQLV NLKKVEQELKWPDIDESWESLTSGITQLLTGISVHIVGDSTDINEA Sbjct: 64 SETIWFIDEDQLVVNLKKVEQELKWPDIDESWESLTSGITQLLTGISVHIVGDSTDINEA 123 Query: 603 VAKEIAEGIGYLPVCTSELLESATQKSIDTWVASEGVDSVAEAECVVLESLS-------- 758 VAKEIAEGIGYLPVCTSELLESAT+KSID W+ASEGVDSVAEAECVVLESLS Sbjct: 124 VAKEIAEGIGYLPVCTSELLESATEKSIDKWLASEGVDSVAEAECVVLESLSRLLLWATP 183 Query: 759 --------------------SHVRTVVATLGGKQGAASRFDKWQYLHAGFTVWLSVSDAN 878 SHVRTVVATLGGKQGAASRFDKWQYLHAGFTVWLSVSDA+ Sbjct: 184 SLLPLVDVLLSMLKRLFLPRSHVRTVVATLGGKQGAASRFDKWQYLHAGFTVWLSVSDAS 243 Query: 879 DEASAKEEARRSVSAGNVAYAKADVVVKLGGWEPEYTRAVAQGCLVALKQLTLADKKLAG 1058 DEASAKEEARRSVS+GNVAYAKADVVVKLGGW+PEYTRAVAQGCLVALKQLTLADKKLAG Sbjct: 244 DEASAKEEARRSVSSGNVAYAKADVVVKLGGWDPEYTRAVAQGCLVALKQLTLADKKLAG 303 Query: 1059 KKSLYIRLGCRGDWPNLEPPGWDPESDAPPSNI 1157 KKSLY+RLGCRGDWPN+EPPGWDP+SDAPP+NI Sbjct: 304 KKSLYMRLGCRGDWPNIEPPGWDPDSDAPPTNI 336 >gb|EAY79638.1| hypothetical protein OsI_033597 [Oryza sativa (indica cultivar-group)] Length = 968 Score = 420 bits (1080), Expect = e-116 Identities = 222/276 (80%), Positives = 230/276 (83%) Frame = +3 Query: 243 EFTDGNGEVELRLDIGKLGIENSRDVFVDVDDTSLLIRAKSDGTLRTLMNVKQLFDRIKS 422 +FTDG GEVELRLDIGKLGIENSRDVFVDVDDTSLL+RAKSDGTLRTL+NVKQLFDRIKS Sbjct: 731 KFTDGGGEVELRLDIGKLGIENSRDVFVDVDDTSLLVRAKSDGTLRTLINVKQLFDRIKS 790 Query: 423 SETIWFIDEDQLVANLKKVEQELKWPDIDESWESLTSGITQLLTGISVHIVGDSTDINEA 602 SETIWFIDEDQLV NLKKVEQELKWPDIDESWESLTSGITQLLTGISVHIVGDSTDINEA Sbjct: 791 SETIWFIDEDQLVVNLKKVEQELKWPDIDESWESLTSGITQLLTGISVHIVGDSTDINEA 850 Query: 603 VAKEIAEGIGYLPVCTSELLESATQKSIDTWVASEGVDSVAEAECVVLESLSSHVRTVVA 782 VAKEIAEGIG HVRTVVA Sbjct: 851 VAKEIAEGIG-------------------------------------------HVRTVVA 867 Query: 783 TLGGKQGAASRFDKWQYLHAGFTVWLSVSDANDEASAKEEARRSVSAGNVAYAKADVVVK 962 TLGGKQGAASRFDKWQYLHAGFTVWLSVSDA+DEASAKEEARRSVS+GNVAYAKADVVVK Sbjct: 868 TLGGKQGAASRFDKWQYLHAGFTVWLSVSDASDEASAKEEARRSVSSGNVAYAKADVVVK 927 Query: 963 LGGWEPEYTRAVAQGCLVALKQLTLADKKLAGKKSL 1070 LGGW+PEYTRAVAQGCLVALKQLTLADKKLAG+ S+ Sbjct: 928 LGGWDPEYTRAVAQGCLVALKQLTLADKKLAGEVSI 963 >emb|CAO63369.1| unnamed protein product [Vitis vinifera] Length = 371 Score = 392 bits (1008), Expect = e-107 Identities = 188/313 (60%), Positives = 249/313 (79%) Frame = +3 Query: 204 ASVSASPAPAQDYEFTDGNGEVELRLDIGKLGIENSRDVFVDVDDTSLLIRAKSDGTLRT 383 +++S +P+ +YEF+D + E+ELRL +G G +SRD+FVD +D+SL I K G+ T Sbjct: 55 STISVNPS---NYEFSDASSEMELRLQLGGGGTLSSRDIFVDAEDSSLKIGVKQSGSFIT 111 Query: 384 LMNVKQLFDRIKSSETIWFIDEDQLVANLKKVEQELKWPDIDESWESLTSGITQLLTGIS 563 L+ + +L+++IKSSETIW+IDEDQLV NLKK + +LKWPDI ESWESLT+G QLL G S Sbjct: 112 LVEINKLYEKIKSSETIWYIDEDQLVVNLKKQDPDLKWPDIVESWESLTAGAMQLLKGTS 171 Query: 564 VHIVGDSTDINEAVAKEIAEGIGYLPVCTSELLESATQKSIDTWVASEGVDSVAEAECVV 743 ++IVGDST+IN+ VA+E+A G+GY P+ T ELLE+ ++SID+WV ++G +SVAEAE V Sbjct: 172 IYIVGDSTEINDKVARELAVGLGYTPLNTKELLETFAKQSIDSWVTADGSESVAEAESAV 231 Query: 744 LESLSSHVRTVVATLGGKQGAASRFDKWQYLHAGFTVWLSVSDANDEASAKEEARRSVSA 923 LE+LSSHVR V+ATLGG GAA R DKW++L+AGFTVWLS S++ DE SAKEEARR + Sbjct: 232 LENLSSHVRAVIATLGGLHGAARRADKWRHLYAGFTVWLSQSESIDEESAKEEARRHIQE 291 Query: 924 GNVAYAKADVVVKLGGWEPEYTRAVAQGCLVALKQLTLADKKLAGKKSLYIRLGCRGDWP 1103 G++ Y+ ADVVVKL GW+ ++ + VAQ L ALKQL ++DKKL GKKSLYIRLGCRGDWP Sbjct: 292 GSLGYSNADVVVKLHGWDADHAKTVAQASLSALKQLIMSDKKLPGKKSLYIRLGCRGDWP 351 Query: 1104 NLEPPGWDPESDA 1142 +++PPGWDP + A Sbjct: 352 DIKPPGWDPSTGA 364 >ref|NP_565805.1| shikimate kinase-related [Arabidopsis thaliana] gb|AAL06514.1|AF412061_1 At2g35500/T32F12.12 [Arabidopsis thaliana] gb|AAL67100.1| At2g35500/T32F12.12 [Arabidopsis thaliana] gb|AAC36171.2| expressed protein [Arabidopsis thaliana] Length = 387 Score = 392 bits (1007), Expect = e-107 Identities = 191/313 (61%), Positives = 243/313 (77%) Frame = +3 Query: 210 VSASPAPAQDYEFTDGNGEVELRLDIGKLGIENSRDVFVDVDDTSLLIRAKSDGTLRTLM 389 +SA DYEFTDG EVELRL + I + +D+ VD D TSL ++ K +G L TL+ Sbjct: 70 LSAVSTSTIDYEFTDGGKEVELRLRLKTGEILSPKDISVDADGTSLAVKEKRNGLLITLL 129 Query: 390 NVKQLFDRIKSSETIWFIDEDQLVANLKKVEQELKWPDIDESWESLTSGITQLLTGISVH 569 LF++I SETIW+IDEDQLV N+KKV+ ELKWPDI ESWESLT+G+ QLL G S++ Sbjct: 130 ETNHLFEKIMPSETIWYIDEDQLVVNMKKVDGELKWPDIVESWESLTAGMMQLLKGASIY 189 Query: 570 IVGDSTDINEAVAKEIAEGIGYLPVCTSELLESATQKSIDTWVASEGVDSVAEAECVVLE 749 IVGDST+IN+ V++E+A G+GY P+ + ELLES ++++ID+W+ +EG DSVAEAE VLE Sbjct: 190 IVGDSTEINQKVSRELAVGLGYSPLDSKELLESFSKQTIDSWILAEGPDSVAEAESSVLE 249 Query: 750 SLSSHVRTVVATLGGKQGAASRFDKWQYLHAGFTVWLSVSDANDEASAKEEARRSVSAGN 929 SLSSHVRTVV+TLGGK GAA R D+W++L++GFTVW+S ++A DE SAKEEARRS Sbjct: 250 SLSSHVRTVVSTLGGKHGAAGRADQWRHLYSGFTVWVSQTEATDEESAKEEARRSKQERE 309 Query: 930 VAYAKADVVVKLGGWEPEYTRAVAQGCLVALKQLTLADKKLAGKKSLYIRLGCRGDWPNL 1109 + Y+ ADVVVKL GW+P + ++VAQ L ALKQL ++DK L GKKSLYIRLGCRGDWPN+ Sbjct: 310 IGYSNADVVVKLQGWDPTHAKSVAQASLSALKQLIISDKGLPGKKSLYIRLGCRGDWPNI 369 Query: 1110 EPPGWDPESDAPP 1148 +PPGWDP SD P Sbjct: 370 KPPGWDPSSDTGP 382