BLASTX 2.2.17 Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bphyem115a17 (982 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 5,815,196 sequences; 2,006,227,497 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|NP_001049150.1| Os03g0178200 [Oryza sativa (japonica cultiva... 363 7e-99 gb|AAO13463.1| Hypothetical protein [Oryza sativa (japonica cult... 233 1e-59 gb|ABR25393.1| yggt family protein [Oryza sativa (indica cultiva... 230 1e-58 emb|CAO43888.1| unnamed protein product [Vitis vinifera] 192 3e-47 ref|NP_680180.1| YGGT family protein [Arabidopsis thaliana] >gi|... 167 1e-39 >ref|NP_001049150.1| Os03g0178200 [Oryza sativa (japonica cultivar-group)] gb|ABF94280.1| YGGT family protein, expressed [Oryza sativa (japonica cultivar-group)] dbj|BAF11064.1| Os03g0178200 [Oryza sativa (japonica cultivar-group)] Length = 238 Score = 363 bits (933), Expect = 7e-99 Identities = 189/242 (78%), Positives = 205/242 (84%), Gaps = 6/242 (2%) Frame = +2 Query: 38 MASPNTDPPQHHASAPPLLLAIRHLPFPGAHRPSRALPAPDLTPLARRLDELAAAAAAHP 217 MAS N DPP+HH S PPLLLA+RHLPFPG HRP RALPAPDL PLA RL+ELAAAAAAHP Sbjct: 1 MASRNADPPRHHPSTPPLLLAMRHLPFPGVHRP-RALPAPDLAPLAARLEELAAAAAAHP 59 Query: 218 LLKPLFAFHSHLSTFSQRRRRTMVAMRQ----AACPLSGEHCFAAVLGGSVAGLVVANGI 385 LLKPLFAFHSHL+ FSQ RRR M MR+ CPLSGEHCFAAVLG SVAG+VV++GI Sbjct: 60 LLKPLFAFHSHLAAFSQSRRRAMATMRRRRTTGECPLSGEHCFAAVLGDSVAGVVVSSGI 119 Query: 386 NNFLNLYNTVLVVRLVLTWFPNTPPAIVAPLSTICDPYLNIFRGIIPPLGGTLDLSPILA 565 NNFL+LYNTVLVVRLVLTWFPNTPPAIVAPLSTICDPYLNIFRGIIPPLGGTLDLSPILA Sbjct: 120 NNFLSLYNTVLVVRLVLTWFPNTPPAIVAPLSTICDPYLNIFRGIIPPLGGTLDLSPILA 179 Query: 566 FLALNFFTSTAAALPAELPNSA--ASSSASSSSSVVQPDLTANQRKWMRRVRSGKSQETD 739 FL LN +STAAALPAELP+ A S A+SSSSV LTAN+RKWMRR+R KSQE + Sbjct: 180 FLVLNALSSTAAALPAELPDPAPPTSRGATSSSSV----LTANRRKWMRRIRPVKSQEGE 235 Query: 740 HD 745 + Sbjct: 236 EE 237 >gb|AAO13463.1| Hypothetical protein [Oryza sativa (japonica cultivar-group)] gb|EAY88768.1| hypothetical protein OsI_010001 [Oryza sativa (indica cultivar-group)] gb|EAZ25794.1| hypothetical protein OsJ_009277 [Oryza sativa (japonica cultivar-group)] Length = 157 Score = 233 bits (594), Expect = 1e-59 Identities = 124/163 (76%), Positives = 136/163 (83%), Gaps = 2/163 (1%) Frame = +2 Query: 263 SQRRRRTMVAMRQAACPLSGEHCFAAVLGGSVAGLVVANGINNFLNLYNTVLVVRLVLTW 442 + RRRRT CPLSGEHCFAAVLG SVAG+VV++GINNFL+LYNTVLVVRLVLTW Sbjct: 3 TMRRRRTT-----GECPLSGEHCFAAVLGDSVAGVVVSSGINNFLSLYNTVLVVRLVLTW 57 Query: 443 FPNTPPAIVAPLSTICDPYLNIFRGIIPPLGGTLDLSPILAFLALNFFTSTAAALPAELP 622 FPNTPPAIVAPLSTICDPYLNIFRGIIPPLGGTLDLSPILAFL LN +STAAALPAELP Sbjct: 58 FPNTPPAIVAPLSTICDPYLNIFRGIIPPLGGTLDLSPILAFLVLNALSSTAAALPAELP 117 Query: 623 NSA--ASSSASSSSSVVQPDLTANQRKWMRRVRSGKSQETDHD 745 + A S A+SSSSV LTAN+RKWMRR+R KSQE + + Sbjct: 118 DPAPPTSRGATSSSSV----LTANRRKWMRRIRPVKSQEGEEE 156 >gb|ABR25393.1| yggt family protein [Oryza sativa (indica cultivar-group)] Length = 169 Score = 230 bits (586), Expect = 1e-58 Identities = 121/172 (70%), Positives = 136/172 (79%), Gaps = 6/172 (3%) Frame = +2 Query: 248 HLSTFSQRRRRTMVAMRQ----AACPLSGEHCFAAVLGGSVAGLVVANGINNFLNLYNTV 415 HL+ FSQ RRR M MR+ CPLSGEHCFAAVLG SVAG+VV++GINNFL+LYNTV Sbjct: 1 HLAAFSQSRRRAMATMRRRRTTGECPLSGEHCFAAVLGDSVAGVVVSSGINNFLSLYNTV 60 Query: 416 LVVRLVLTWFPNTPPAIVAPLSTICDPYLNIFRGIIPPLGGTLDLSPILAFLALNFFTST 595 LVVRLVLTWFPNTPPAIVAPLSTICDPYLN FRGI+PPLGGTLDLSPILAFL L +ST Sbjct: 61 LVVRLVLTWFPNTPPAIVAPLSTICDPYLNFFRGILPPLGGTLDLSPILAFLVLYALSST 120 Query: 596 AAALPAELPN--SAASSSASSSSSVVQPDLTANQRKWMRRVRSGKSQETDHD 745 AALPA+LP+ S A SSSSV LTAN+RKWMRR+R +S + + Sbjct: 121 VAALPADLPDPPPPTSRGAPSSSSV----LTANRRKWMRRIRPVQSPDRQEE 168 >emb|CAO43888.1| unnamed protein product [Vitis vinifera] Length = 162 Score = 192 bits (488), Expect = 3e-47 Identities = 97/142 (68%), Positives = 115/142 (80%) Frame = +2 Query: 311 PLSGEHCFAAVLGGSVAGLVVANGINNFLNLYNTVLVVRLVLTWFPNTPPAIVAPLSTIC 490 PLS ++ A + G SVAG+VVANGI NFLN+YNT+L+VRLVLTWFPN+PPAIV+PLST+C Sbjct: 18 PLSNQNFAAILPGDSVAGIVVANGILNFLNIYNTLLIVRLVLTWFPNSPPAIVSPLSTLC 77 Query: 491 DPYLNIFRGIIPPLGGTLDLSPILAFLALNFFTSTAAALPAELPNSAASSSASSSSSVVQ 670 DPYLNIFRGIIPPLGGTLDLSPILAFL LN FTSTAAALPAELP + SS + + Sbjct: 78 DPYLNIFRGIIPPLGGTLDLSPILAFLVLNAFTSTAAALPAELPMAGVPQQIPSSHTRLF 137 Query: 671 PDLTANQRKWMRRVRSGKSQET 736 DLT Q+KWMRR+ +S+ + Sbjct: 138 -DLTTTQKKWMRRLCGNRSKSS 158 >ref|NP_680180.1| YGGT family protein [Arabidopsis thaliana] emb|CAC34485.1| putative protein [Arabidopsis thaliana] dbj|BAC43053.1| unknown protein [Arabidopsis thaliana] gb|AAO63329.1| At5g21920 [Arabidopsis thaliana] gb|AAO73904.1| expressed protein [Arabidopsis thaliana] Length = 251 Score = 167 bits (423), Expect = 1e-39 Identities = 93/140 (66%), Positives = 105/140 (75%), Gaps = 1/140 (0%) Frame = +2 Query: 326 HCFAAVL-GGSVAGLVVANGINNFLNLYNTVLVVRLVLTWFPNTPPAIVAPLSTICDPYL 502 H FAAVL G SVAGLVVANG+ NFLN+YNT+LVVRLVLTWFP+ PPAIV PLST+CDPYL Sbjct: 118 HGFAAVLPGDSVAGLVVANGLINFLNIYNTILVVRLVLTWFPSAPPAIVNPLSTLCDPYL 177 Query: 503 NIFRGIIPPLGGTLDLSPILAFLALNFFTSTAAALPAELPNSAASSSASSSSSVVQPDLT 682 NIFRG IPPLGG LDLSPILAFL LN FTS+A ALP ELP++ + S +SS Sbjct: 178 NIFRGFIPPLGG-LDLSPILAFLVLNAFTSSAMALPCELPSADGAVSPASS--------- 227 Query: 683 ANQRKWMRRVRSGKSQETDH 742 + KW+RR R S DH Sbjct: 228 --ETKWVRRRR--LSSHKDH 243