BLASTX 2.2.17

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= bphylf028j02
         (1481 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           5,815,196 sequences; 2,006,227,497 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EAZ05588.1| hypothetical protein OsI_026820 [Oryza sativa (in...   423   e-116
ref|NP_001060997.1| Os08g0151300 [Oryza sativa (japonica cultiva...   421   e-116
gb|EAZ05589.1| hypothetical protein OsI_026821 [Oryza sativa (in...   231   2e-76
ref|NP_176575.1| ATMYB103 (myb domain protein 103); DNA binding ...   263   2e-68
emb|CAO65913.1| unnamed protein product [Vitis vinifera]              258   8e-67

>gb|EAZ05588.1| hypothetical protein OsI_026820 [Oryza sativa (indica cultivar-group)]
          Length = 359

 Score = 423 bits (1087), Expect = e-116
 Identities = 245/375 (65%), Positives = 255/375 (68%), Gaps = 53/375 (14%)
 Frame = +1

Query: 172 MGHHSCCNQQKVKRGLWSPEEDEKLIRYITTHGYGCWSEVPEKAGLQRCGKSCRLRWINY 351
           MGHHSCCNQQKVKRGLWSPEEDEKLIRYITTHGYGCWSEVPEKAGLQRCGKSCRLRWINY
Sbjct: 1   MGHHSCCNQQKVKRGLWSPEEDEKLIRYITTHGYGCWSEVPEKAGLQRCGKSCRLRWINY 60

Query: 352 LRPDIRRGRFTAEEEKLIISLHAIVGNRWAHIASHLPGRTDNEIKNYWNSWIKKKIRKP- 528
           LRPDIRRGRFTAEEEKLIISLHAIVGNRWAHIASHLPGRTDNEIKNYWNSWIKKKIRKP
Sbjct: 61  LRPDIRRGRFTAEEEKLIISLHAIVGNRWAHIASHLPGRTDNEIKNYWNSWIKKKIRKPA 120

Query: 529 -----ANTTSPSN-PPCSTATSDMLAANLREVPTFSTTDH--QLDAIINQNL--TLPSKL 678
           A TTSP+N PPCSTATSD + P F DH QLDAIINQNL +LP KL
Sbjct: 121 AAAAAATTTSPNNPPPCSTATSD---HHHLPPPAFGGADHHLQLDAIINQNLISSLPPKL 177

Query: 679 AVGCGGGGGQDSP---TGLPHNCPLFMFDXXXXXXXXXXXXXXXG----------QHPFI 819
           A GG DSP GLPH+CPLFMFD HPFI
Sbjct: 178 A------GGDDSPPAVPGLPHHCPLFMFDTTTTGAGGAVSPPPPSSLIPTHLHHHHHPFI 231

Query: 820 ASFTAAMAY-APSYLPPLVDGM-GMG-AMDCVXXXXXXXXXXSDQAAATGMANGCF---- 978
           ASFTAAMA PSYLPPLVDGM MG AMDC AAA NG +
Sbjct: 232 ASFTAAMAADTPSYLPPLVDGMAAMGAAMDC-------SLEDGQTAAAMAATNGYYQHHQ 284

Query: 979 -------EQKQQEEEQLGH-------------DQWD-DEAQHLFMWDQE-LTPSNLEAMQ 1092
           E +++E+ QLGH QWD +EAQHL MWDQE LT SNLEAMQ
Sbjct: 285 KHQQLEIELEEEEQRQLGHHHHQHHHEHEHENHQWDEEEAQHLLMWDQEVLTSSNLEAMQ 344

Query: 1093 SGAPSLLFMGPNDHD 1137
            SGA SLLFMGPNDHD
Sbjct: 345  SGAHSLLFMGPNDHD 359

>ref|NP_001060997.1| Os08g0151300 [Oryza sativa (japonica cultivar-group)]
 dbj|BAC64999.1| myb transcription factor (ATMYB4)-like protein [Oryza sativa (japonica cultivar-group)]
 dbj|BAF22911.1| Os08g0151300 [Oryza sativa (japonica cultivar-group)]
 gb|EAZ41524.1| hypothetical protein OsJ_025007 [Oryza sativa (japonica cultivar-group)]
          Length = 359

 Score = 421 bits (1081), Expect = e-116
 Identities = 244/375 (65%), Positives = 254/375 (67%), Gaps = 53/375 (14%)
 Frame = +1

Query: 172 MGHHSCCNQQKVKRGLWSPEEDEKLIRYITTHGYGCWSEVPEKAGLQRCGKSCRLRWINY 351
           MGHHSCCNQQKVKRGLWSPEEDEKLIRYITTHGYGCWSEVPEKAGLQRCGKSCRLRWINY
Sbjct: 1   MGHHSCCNQQKVKRGLWSPEEDEKLIRYITTHGYGCWSEVPEKAGLQRCGKSCRLRWINY 60

Query: 352 LRPDIRRGRFTAEEEKLIISLHAIVGNRWAHIASHLPGRTDNEIKNYWNSWIKKKIRKP- 528
           LRPDIRRGRFTAEEEKLIISLHAIVGNRWAHIASHLPGRTDNEIKNYWNSWIKKKIRKP
Sbjct: 61  LRPDIRRGRFTAEEEKLIISLHAIVGNRWAHIASHLPGRTDNEIKNYWNSWIKKKIRKPA 120

Query: 529 -----ANTTSPSN-PPCSTATSDMLAANLREVPTFSTTDH--QLDAIINQNL--TLPSKL 678
           A TTSP+N PPCSTATSD + P F DH QLDAIINQNL +LP KL
Sbjct: 121 AAAAAATTTSPNNPPPCSTATSD---HHHLPPPAFGGADHHLQLDAIINQNLISSLPPKL 177

Query: 679 AVGCGGGGGQDSP---TGLPHNCPLFMFDXXXXXXXXXXXXXXXG----------QHPFI 819
           A G DSP GLPH+CPLFMFD HPFI
Sbjct: 178 AT------GDDSPPAVPGLPHHCPLFMFDTTTTGAGGAISPPPPSSLIPTHLHHHHHPFI 231

Query: 820 ASFTAAMAY-APSYLPPLVDGM-GMG-AMDCVXXXXXXXXXXSDQAAATGMANGCF---- 978
           ASFTAAMA PSYLPPLVDGM MG AMDC AAA NG +
Sbjct: 232 ASFTAAMAADTPSYLPPLVDGMAAMGAAMDC-------SLEDGQTAAAMAATNGYYQHHQ 284

Query: 979 -------EQKQQEEEQLGH-------------DQWD-DEAQHLFMWDQE-LTPSNLEAMQ 1092
           E +++E+ QLGH QWD +EAQHL MWDQE LT SNLEAMQ
Sbjct: 285 KHQQLEIELEEEEQRQLGHHHHQHHHEHEHENHQWDEEEAQHLLMWDQEVLTSSNLEAMQ 344

Query: 1093 SGAPSLLFMGPNDHD 1137
            SGA SLLFMGPNDHD
Sbjct: 345  SGAHSLLFMGPNDHD 359

>gb|EAZ05589.1| hypothetical protein OsI_026821 [Oryza sativa (indica cultivar-group)]
          Length = 324

 Score = 231 bits (588), Expect(2) = 2e-76
 Identities = 155/289 (53%), Positives = 167/289 (57%), Gaps = 53/289 (18%)
 Frame = +1

Query: 430 NRWAHIASHLPGRTDNEIKNYWNSWIKKKIRKP------ANTTSPSN-PPCSTATSDMLA 588
           ++WAHIASHLPGRTDNEIKNYWNSWIKKKIRKP A TTSP+N PPCST TSD
Sbjct: 52  DQWAHIASHLPGRTDNEIKNYWNSWIKKKIRKPAAAAAAATTTSPNNPPPCSTVTSD--- 108

Query: 589 ANLREVPTFSTTDH--QLDAIINQNL--TLPSKLAVGCGGGGGQDSP---TGLPHNCPLF 747
           + P F DH QLDAIINQNL +LP KLA G DSP GLPH+CPLF
Sbjct: 109 HHHLPPPAFGGADHHLQLDAIINQNLISSLPPKLAT------GDDSPPAVPGLPHHCPLF 162

Query: 748 MFDXXXXXXXXXXXXXXXG----------QHPFIASFTAAMAY-APSYLPPLVDGM-GMG 891
           MFD HPFIASFTAAMA PSYLPPLVDGM MG
Sbjct: 163 MFDTTTTGAGGAISPPPPSSLIPTHLHHHHHPFIASFTAAMAADTPSYLPPLVDGMAAMG 222

Query: 892 -AMDCVXXXXXXXXXXSDQAAATGMANGCF-----------EQKQQEEEQLGH------- 1014
           AMDC AAA NG + E +++E+ QLGH
Sbjct: 223 AAMDC-------SLEDGQTAAAMAATNGYYQHHQKHQQLEIELEEEEQRQLGHHHHQHHH 275

Query: 1015 ------DQWD-DEAQHLFMWDQE-LTPSNLEAMQSGAPSLLFMGPNDHD 1137
            QWD +EAQHL MWDQE LT SNLEAMQSGA SLLFMGPNDHD
Sbjct: 276  EHEHENHQWDEEEAQHLLMWDQEVLTSSNLEAMQSGAHSLLFMGPNDHD 324

 Score = 81.3 bits (199), Expect(2) = 2e-76
 Identities = 36/44 (81%), Positives = 36/44 (81%)
 Frame = +3

Query: 321 EELPVAVDQLPETGHQARAVHGGGGEADHQPPRHCWQQVGPYCQ 452
           EELP VDQLPETGH ARAVHGGGGEA HQP RHCWQQVG Q
Sbjct: 3   EELPAPVDQLPETGHPARAVHGGGGEAHHQPARHCWQQVGSCMQ 46

>ref|NP_176575.1| ATMYB103 (myb domain protein 103); DNA binding / transcription factor [Arabidopsis thaliana]
 gb|AAF25949.1|AF214116_1 putative transcription factor [Arabidopsis thaliana]
 gb|AAG52460.1|AC010852_17 putative MYB family transcription factor; 19087-20744 [Arabidopsis thaliana]
 gb|AAS10034.1| MYB transcription factor [Arabidopsis thaliana]
 dbj|BAE98761.1| putative MYB family transcription factor [Arabidopsis thaliana]
          Length = 370

 Score = 263 bits (673), Expect = 2e-68
 Identities = 122/151 (80%), Positives = 129/151 (85%)
 Frame = +1

Query: 172 MGHHSCCNQQKVKRGLWSPEEDEKLIRYITTHGYGCWSEVPEKAGLQRCGKSCRLRWINY 351
           MGHHSCCNQQKVKRGLWSPEEDEKLIRYITTHGYGCWSEVPEKAGLQRCGKSCRLRWINY
Sbjct: 1   MGHHSCCNQQKVKRGLWSPEEDEKLIRYITTHGYGCWSEVPEKAGLQRCGKSCRLRWINY 60

Query: 352 LRPDIRRGRFTAEEEKLIISLHAIVGNRWAHIASHLPGRTDNEIKNYWNSWIKKKIRKPA 531
           LRPDIRRGRF+ EEEKLIISLH +VGNRWAHIASHLPGRTDNEIKNYWNSWIKKKIRKP
Sbjct: 61  LRPDIRRGRFSPEEEKLIISLHGVVGNRWAHIASHLPGRTDNEIKNYWNSWIKKKIRKPH 120

Query: 532 NTTSPSNPPCSTATSDMLAANLREVPTFSTT 624
           + S P +T T + ++ STT
Sbjct: 121 HHYSRHQPSVTTVTLNADTTSIATTIEASTT 151

>emb|CAO65913.1| unnamed protein product [Vitis vinifera]
          Length = 304

 Score = 258 bits (659), Expect = 8e-67
 Identities = 133/195 (68%), Positives = 144/195 (73%)
 Frame = +1

Query: 172 MGHHSCCNQQKVKRGLWSPEEDEKLIRYITTHGYGCWSEVPEKAGLQRCGKSCRLRWINY 351
           MGHHSCCNQQKVKRGLWSPEEDEKLIRYITT+GYGCWSEVPEKAGLQRCGKSCRLRWINY
Sbjct: 1   MGHHSCCNQQKVKRGLWSPEEDEKLIRYITTYGYGCWSEVPEKAGLQRCGKSCRLRWINY 60

Query: 352 LRPDIRRGRFTAEEEKLIISLHAIVGNRWAHIASHLPGRTDNEIKNYWNSWIKKKIRKPA 531
           LRPDIRRGRFT EEEKLII+LH +VGNRWAHIASHLPGRTDNEIKNYWNSWIKKKIRK
Sbjct: 61  LRPDIRRGRFTPEEEKLIINLHGVVGNRWAHIASHLPGRTDNEIKNYWNSWIKKKIRK-- 118

Query: 532 NTTSPSNPPCSTATSDMLAANLREVPTFSTTDHQLDAIINQNLTLPSKLAVGCGGGGGQD 711
           PS P S+ T+ E QLD ++NQ+L + Q+
Sbjct: 119 ----PSAPLASSITN-------TEHSQLGYGSSQLD-MVNQDLMMKQP---------AQE 157

Query: 712 SPTGLPHNCPLFMFD 756
           P PLFMFD
Sbjct: 158 TLFSSP--APLFMFD 170