BLASTX 2.2.17 Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bphyem212d05 (1365 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 5,815,196 sequences; 2,006,227,497 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EAY75265.1| hypothetical protein OsI_003112 [Oryza sativa (in... 582 e-164 ref|NP_001043784.1| Os01g0662700 [Oryza sativa (japonica cultiva... 580 e-164 emb|CAN74179.1| hypothetical protein [Vitis vinifera] 492 e-137 emb|CAO66094.1| unnamed protein product [Vitis vinifera] 490 e-136 ref|NP_176255.2| naphthoate synthase, putative / dihydroxynaphth... 470 e-131 >gb|EAY75265.1| hypothetical protein OsI_003112 [Oryza sativa (indica cultivar-group)] Length = 331 Score = 582 bits (1499), Expect = e-164 Identities = 309/381 (81%), Positives = 317/381 (83%), Gaps = 4/381 (1%) Frame = +1 Query: 55 MDAAERRLARVTAHLLPF-LRLP---APPLAPSPAAATSSPASDSYRRVHSDVPSEPPEW 222 MDAA RRLARVTAHLLP L LP AP LAPSPAA SPASDSYRRVH DVPSEPPEW Sbjct: 1 MDAAGRRLARVTAHLLPSSLPLPLASAPTLAPSPAA---SPASDSYRRVHGDVPSEPPEW 57 Query: 223 RAATDESGKEFVDILYEKAVGEGIAKITINRPDRRNAFRPLTVKELMRAFNDARDDGSIG 402 RAATDESGK FVDILY+KAVGEGIAKITINRPDRRNAFRPLTVKELMRAF DARDD SIG Sbjct: 58 RAATDESGKGFVDILYDKAVGEGIAKITINRPDRRNAFRPLTVKELMRAFEDARDDSSIG 117 Query: 403 VIILSGKGTKAFCSGGDQALRDSDGYVDFDNFGRLNVLDLQVQIRRLPKPVIAMVAGYAV 582 VIIL+GKGT++FCSGGDQALRD+DGYVDFD+FGRLNVLDLQVQIRRLPKPVIAMVAGYAV Sbjct: 118 VIILTGKGTQSFCSGGDQALRDADGYVDFDSFGRLNVLDLQVQIRRLPKPVIAMVAGYAV 177 Query: 583 GGGHVLHMVCDLTIAADNAIFGQTGPKVGSFDAGYGSSIMSRLVGPKKAREMWFLSRFYT 762 GGGHVLHMVCDLTIAADNAIFGQTGPKVGSFDAGYGSSIMSRLVGPKKAREMWFLSRFYT Sbjct: 178 GGGHVLHMVCDLTIAADNAIFGQTGPKVGSFDAGYGSSIMSRLVGPKKAREMWFLSRFYT 237 Query: 763 ADEAERMGLVNVVVPVSA***TAEKKTTYERSSALFIKTVGWNMPHLRCLCTNHVLYALY 942 ADEA+RMGLVNVVVP Sbjct: 238 ADEADRMGLVNVVVP--------------------------------------------- 252 Query: 943 M*LAGLEGETVKWCRQILRNSPTAIRVLKSALNAADDGHAGLQELGGNATLIFYGTEEAK 1122 LA LE ETVKWCR+ILRNSPTAIRVLKSALNAADDGHAGLQELGGNATLIFYGTEEAK Sbjct: 253 --LADLERETVKWCRKILRNSPTAIRVLKSALNAADDGHAGLQELGGNATLIFYGTEEAK 310 Query: 1123 EGKNAYMERRRPDFSKFPRKP 1185 EGKNAYMERRRPDFSKFPRKP Sbjct: 311 EGKNAYMERRRPDFSKFPRKP 331 >ref|NP_001043784.1| Os01g0662700 [Oryza sativa (japonica cultivar-group)] dbj|BAB91741.1| putative naphthoate synthase menB [Oryza sativa (japonica cultivar-group)] dbj|BAF05698.1| Os01g0662700 [Oryza sativa (japonica cultivar-group)] gb|EAZ12987.1| hypothetical protein OsJ_002812 [Oryza sativa (japonica cultivar-group)] Length = 331 Score = 580 bits (1496), Expect = e-164 Identities = 308/381 (80%), Positives = 317/381 (83%), Gaps = 4/381 (1%) Frame = +1 Query: 55 MDAAERRLARVTAHLLPF-LRLP---APPLAPSPAAATSSPASDSYRRVHSDVPSEPPEW 222 MDAA RRLARVTAHLLP L LP AP LAPSPAA SPASDSYRRVH DVPSEPPEW Sbjct: 1 MDAAGRRLARVTAHLLPSSLPLPLASAPTLAPSPAA---SPASDSYRRVHGDVPSEPPEW 57 Query: 223 RAATDESGKEFVDILYEKAVGEGIAKITINRPDRRNAFRPLTVKELMRAFNDARDDGSIG 402 RAATDESGK FVDILY+KAVGEGIAKITINRPDRRNAFRPLTVKELMRAF DARDD SIG Sbjct: 58 RAATDESGKGFVDILYDKAVGEGIAKITINRPDRRNAFRPLTVKELMRAFEDARDDSSIG 117 Query: 403 VIILSGKGTKAFCSGGDQALRDSDGYVDFDNFGRLNVLDLQVQIRRLPKPVIAMVAGYAV 582 VIIL+GKGT++FCSGGDQALRD+DGYVDFD+FGRLNVLDLQVQIRRLPKPVIAMVAGYAV Sbjct: 118 VIILTGKGTQSFCSGGDQALRDADGYVDFDSFGRLNVLDLQVQIRRLPKPVIAMVAGYAV 177 Query: 583 GGGHVLHMVCDLTIAADNAIFGQTGPKVGSFDAGYGSSIMSRLVGPKKAREMWFLSRFYT 762 GGGHVLHMVCDLTIAADNAIFGQTGPKVGSFDAGYG+SIMSRLVGPKKAREMWFLSRFYT Sbjct: 178 GGGHVLHMVCDLTIAADNAIFGQTGPKVGSFDAGYGTSIMSRLVGPKKAREMWFLSRFYT 237 Query: 763 ADEAERMGLVNVVVPVSA***TAEKKTTYERSSALFIKTVGWNMPHLRCLCTNHVLYALY 942 ADEA+RMGLVNVVVP Sbjct: 238 ADEADRMGLVNVVVP--------------------------------------------- 252 Query: 943 M*LAGLEGETVKWCRQILRNSPTAIRVLKSALNAADDGHAGLQELGGNATLIFYGTEEAK 1122 LA LE ETVKWCR+ILRNSPTAIRVLKSALNAADDGHAGLQELGGNATLIFYGTEEAK Sbjct: 253 --LADLERETVKWCRKILRNSPTAIRVLKSALNAADDGHAGLQELGGNATLIFYGTEEAK 310 Query: 1123 EGKNAYMERRRPDFSKFPRKP 1185 EGKNAYMERRRPDFSKFPRKP Sbjct: 311 EGKNAYMERRRPDFSKFPRKP 331 >emb|CAN74179.1| hypothetical protein [Vitis vinifera] Length = 336 Score = 492 bits (1267), Expect = e-137 Identities = 260/379 (68%), Positives = 280/379 (73%), Gaps = 1/379 (0%) Frame = +1 Query: 52 DMDA-AERRLARVTAHLLPFLRLPAPPLAPSPAAATSSPASDSYRRVHSDVPSEPPEWRA 228 D++A A RRLA V HL P L P + TS+P DSYRRVH +VP+ W Sbjct: 8 DLNANARRRLASVAHHLTP-LHSTTPNCSSLGFHTTSAP--DSYRRVHGEVPTHDVTWNP 64 Query: 229 ATDESGKEFVDILYEKAVGEGIAKITINRPDRRNAFRPLTVKELMRAFNDARDDGSIGVI 408 A DESGK F DI+YEKAVGE IAKITINRP+RRNAFRP TVKEL+RAFNDARDD S+GVI Sbjct: 65 ACDESGKAFTDIIYEKAVGEAIAKITINRPERRNAFRPNTVKELIRAFNDARDDSSVGVI 124 Query: 409 ILSGKGTKAFCSGGDQALRDSDGYVDFDNFGRLNVLDLQVQIRRLPKPVIAMVAGYAVGG 588 I +GKGTKAFCSGGDQA R DGY D D+FGRLNVLDLQ+QIRRLPKPVIAMVAGYAVGG Sbjct: 125 IFTGKGTKAFCSGGDQAFRGRDGYADHDDFGRLNVLDLQMQIRRLPKPVIAMVAGYAVGG 184 Query: 589 GHVLHMVCDLTIAADNAIFGQTGPKVGSFDAGYGSSIMSRLVGPKKAREMWFLSRFYTAD 768 GHVLHMVCDLTIAADNAIFGQTGPKVGSFD GYGSSIMSRL+GPKKAREMWF +RFYTA Sbjct: 185 GHVLHMVCDLTIAADNAIFGQTGPKVGSFDXGYGSSIMSRLIGPKKAREMWFTARFYTAS 244 Query: 769 EAERMGLVNVVVPVSA***TAEKKTTYERSSALFIKTVGWNMPHLRCLCTNHVLYALYM* 948 EAE+MGLVN+VVP Sbjct: 245 EAEKMGLVNIVVP----------------------------------------------- 257 Query: 949 LAGLEGETVKWCRQILRNSPTAIRVLKSALNAADDGHAGLQELGGNATLIFYGTEEAKEG 1128 L LE ETVKWCR+ILRNSPTAIRVLKSALNA DDGHAGLQELGGNAT IFYGTEE EG Sbjct: 258 LENLEKETVKWCREILRNSPTAIRVLKSALNAVDDGHAGLQELGGNATFIFYGTEEGNEG 317 Query: 1129 KNAYMERRRPDFSKFPRKP 1185 K AY+ERR PDFSKFPR+P Sbjct: 318 KTAYLERRPPDFSKFPRRP 336 >emb|CAO66094.1| unnamed protein product [Vitis vinifera] Length = 336 Score = 490 bits (1261), Expect = e-136 Identities = 259/379 (68%), Positives = 280/379 (73%), Gaps = 1/379 (0%) Frame = +1 Query: 52 DMDA-AERRLARVTAHLLPFLRLPAPPLAPSPAAATSSPASDSYRRVHSDVPSEPPEWRA 228 D++A A RRLA V HL P L P + TS+P DSYRRVH +VP+ W Sbjct: 8 DLNANARRRLASVAHHLTP-LHSTTPNCSSLGFHTTSAP--DSYRRVHGEVPTHDVTWNP 64 Query: 229 ATDESGKEFVDILYEKAVGEGIAKITINRPDRRNAFRPLTVKELMRAFNDARDDGSIGVI 408 A DESGK F DI+YEKAVGE IAKITINRP+RRNAFRP TVKEL+RAFNDARDD S+GVI Sbjct: 65 ACDESGKAFTDIIYEKAVGEAIAKITINRPERRNAFRPNTVKELIRAFNDARDDSSVGVI 124 Query: 409 ILSGKGTKAFCSGGDQALRDSDGYVDFDNFGRLNVLDLQVQIRRLPKPVIAMVAGYAVGG 588 I +GKGTKAFCSGGDQA R DGY D D+FGRLNVLDLQ+QIRRLPKPVIAMVAGYAVGG Sbjct: 125 IFTGKGTKAFCSGGDQAFRGRDGYADHDDFGRLNVLDLQMQIRRLPKPVIAMVAGYAVGG 184 Query: 589 GHVLHMVCDLTIAADNAIFGQTGPKVGSFDAGYGSSIMSRLVGPKKAREMWFLSRFYTAD 768 GHVLHMVCDLTIAADNAIFGQTGPKVGSFD+GYGSSIMSRL+GPKKAREMWF +RFYTA Sbjct: 185 GHVLHMVCDLTIAADNAIFGQTGPKVGSFDSGYGSSIMSRLIGPKKAREMWFTARFYTAS 244 Query: 769 EAERMGLVNVVVPVSA***TAEKKTTYERSSALFIKTVGWNMPHLRCLCTNHVLYALYM* 948 EAE+MGLVN+VV Sbjct: 245 EAEKMGLVNIVVQ----------------------------------------------- 257 Query: 949 LAGLEGETVKWCRQILRNSPTAIRVLKSALNAADDGHAGLQELGGNATLIFYGTEEAKEG 1128 L LE ETVKWCR+ILRNSPTAIRVLKSALNA DDGHAGLQELGGNAT IFYGTEE EG Sbjct: 258 LENLEKETVKWCREILRNSPTAIRVLKSALNAVDDGHAGLQELGGNATFIFYGTEEGNEG 317 Query: 1129 KNAYMERRRPDFSKFPRKP 1185 K AY+ERR PDFSKFPR+P Sbjct: 318 KTAYLERRPPDFSKFPRRP 336 >ref|NP_176255.2| naphthoate synthase, putative / dihydroxynaphthoic acid synthetase, putative / DHNA synthetase, putative [Arabidopsis thaliana] Length = 337 Score = 470 bits (1210), Expect = e-131 Identities = 248/381 (65%), Positives = 279/381 (73%), Gaps = 3/381 (0%) Frame = +1 Query: 52 DMDAAERRLARVTAHLLPFLRLPAPPLAPSPAAATSSPASDSYRRVHSDVPSEPPEWRAA 231 ++ +A RRL+ VT HL+P PA A S ++S D + +VH +VP+ W+ Sbjct: 6 ELGSASRRLSVVTNHLIPIGFSPAR--ADSVELCSASSMDDRFHKVHGEVPTHEVVWKKT 63 Query: 232 T---DESGKEFVDILYEKAVGEGIAKITINRPDRRNAFRPLTVKELMRAFNDARDDGSIG 402 + KEFVDI+YEKA+ EGIAKITINRP+RRNAFRP TVKELMRAFNDARDD S+G Sbjct: 64 DFFGEGDNKEFVDIIYEKALDEGIAKITINRPERRNAFRPQTVKELMRAFNDARDDSSVG 123 Query: 403 VIILSGKGTKAFCSGGDQALRDSDGYVDFDNFGRLNVLDLQVQIRRLPKPVIAMVAGYAV 582 VIIL+GKGTKAFCSGGDQALR DGY D ++ GRLNVLDLQVQIRRLPKPVIAMVAGYAV Sbjct: 124 VIILTGKGTKAFCSGGDQALRTQDGYADPNDVGRLNVLDLQVQIRRLPKPVIAMVAGYAV 183 Query: 583 GGGHVLHMVCDLTIAADNAIFGQTGPKVGSFDAGYGSSIMSRLVGPKKAREMWFLSRFYT 762 GGGH+LHMVCDLTIAADNAIFGQTGPKVGSFDAGYGSSIMSRLVGPKKAREMWF++RFYT Sbjct: 184 GGGHILHMVCDLTIAADNAIFGQTGPKVGSFDAGYGSSIMSRLVGPKKAREMWFMTRFYT 243 Query: 763 ADEAERMGLVNVVVPVSA***TAEKKTTYERSSALFIKTVGWNMPHLRCLCTNHVLYALY 942 A EAE+MGL+N VVP Sbjct: 244 ASEAEKMGLINTVVP--------------------------------------------- 258 Query: 943 M*LAGLEGETVKWCRQILRNSPTAIRVLKSALNAADDGHAGLQELGGNATLIFYGTEEAK 1122 L LE ETVKWCR+ILRNSPTAIRVLK+ALNA DDGHAGLQ LGG+ATL+FYGTEEA Sbjct: 259 --LEDLEKETVKWCREILRNSPTAIRVLKAALNAVDDGHAGLQGLGGDATLLFYGTEEAT 316 Query: 1123 EGKNAYMERRRPDFSKFPRKP 1185 EG+ AYM RR PDFSKF R+P Sbjct: 317 EGRTAYMHRRPPDFSKFHRRP 337