BLASTX 2.2.17

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= bphyem207o16
         (1842 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           5,815,196 sequences; 2,006,227,497 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

gb|EAZ02384.1| hypothetical protein OsI_023616 [Oryza sativa (in...   635   e-180
gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-bind...   635   e-180
gb|EAZ09932.1| hypothetical protein OsI_031164 [Oryza sativa (in...   420   e-115
dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like p...   419   e-115
emb|CAO23259.1| unnamed protein product [Vitis vinifera]              400   e-109

>gb|EAZ02384.1| hypothetical protein OsI_023616 [Oryza sativa (indica cultivar-group)]
          Length = 551

 Score = 635 bits (1639), Expect = e-180
 Identities = 326/510 (63%), Positives = 383/510 (75%), Gaps = 17/510 (3%)
 Frame = +2

Query: 101 EASGVGFNLHHRFSPVVRRWAEARGHPGAWWPEARQ--SSTEYYSALSRHDRALFARRGL 274
           E SG+GF+LHHR+SP+V+RWAE RGH G WP + S EYYSALSRHD ALFARRGL
Sbjct: 23  EVSGLGFDLHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFARRGL 82

Query: 275 ADGNGLLTFADGNATV-FDGSLHYAEVAVGTPNATFLVALDTGSNLFWVPCDCKHCAPLA 451
           A G+GL+TFADGN T+ DGSLHYAEVAVGTPN TFLVALDTGS+LFWVPCDCK CAPL
Sbjct: 83  AQGDGLVTFADGNITLRLDGSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAPLG 142

Query: 452 NLT---GQGGPDLRPYSPRQSSTSKTVTCEHEFCKPPNACATRNSSCPYTVKYLSANTST 622
           NLT G GGP+LR YSP +SSTSKTVTC C PNACAT SSCPY V+Y ANTS+
Sbjct: 143 NLTAVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSS 202

Query: 623 SGVLVEDVLYLTREK-QGGGATGEVVKAPIVFGCGQEQTXXXXXXXXXXXXXXXXMGNVS 799
           SG LVEDVLYLTREK A G V+ P+VFGCGQ QT M VS
Sbjct: 203 SGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVS 262

Query: 800 VPNMLASSGLVASNSFSMCFSEDGIGRINFGDAGSRGQAETPFIVRNIHPAYNISITTIN 979
           VP++LAS+G+V SNSFSMCFS+DG+GRINFGD GS Q+ETPFIV++ H YNISIT+++
Sbjct: 263 VPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTHSYYNISITSMS 322

Query: 980 VENKSLPVEFTAVVDSGTSFTYLNDPAYTELATNFNSQIREKRANWSDSV-----PFEYC 1144
           V +K+LP+ F A+ DSGTSFTYLNDPAYT TNFN+QI E+RAN+S S PFEYC
Sbjct: 323 VGDKNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYC 382

Query: 1145 YGLSPDQKEVLIPDVSLTTRGGALFPVTRP-FVLIVDDTNGKVRVVGYCLAVQKSNISIN 1321
            Y LSPDQ V +P VSLTT GGA+FPVT P + + TNG++R++GYCLAV KS++ I+
Sbjct: 383  YSLSPDQTTVELPIVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLPID 442

Query: 1322 IIGQNFMTGLKVVFDRERSVLGWQKFDCYKNVRMAD--APERSPSPAPGPTA-AHLKPQE 1492
            IIGQNFMTGLKVVF+RE+SVLGWQKFDCYK+ +M D + SPSP+PGPT +PQE
Sbjct: 443  IIGQNFMTGLKVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPSPGPTTHVFPQPQE 502

Query: 1493 NDATNNRS-YPGAAPVPRPTGAGHVGRPAF 1579
            +D+ R+ PGAAPVPR + A GR F
Sbjct: 503  SDSPAGRTPIPGAAPVPRSSSAAAGGRAGF 532

>gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa]
 dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa (japonica cultivar-group)]
 gb|EAZ38300.1| hypothetical protein OsJ_021783 [Oryza sativa (japonica cultivar-group)]
          Length = 551

 Score = 635 bits (1639), Expect = e-180
 Identities = 326/510 (63%), Positives = 383/510 (75%), Gaps = 17/510 (3%)
 Frame = +2

Query: 101 EASGVGFNLHHRFSPVVRRWAEARGHPGAWWPEARQ--SSTEYYSALSRHDRALFARRGL 274
           E SG+GF+LHHR+SP+V+RWAE RGH G WP + S EYYSALSRHD ALFARRGL
Sbjct: 23  EVSGLGFDLHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFARRGL 82

Query: 275 ADGNGLLTFADGNATV-FDGSLHYAEVAVGTPNATFLVALDTGSNLFWVPCDCKHCAPLA 451
           A G+GL+TFADGN T+ DGSLHYAEVAVGTPN TFLVALDTGS+LFWVPCDCK CAPL
Sbjct: 83  AQGDGLVTFADGNITLRLDGSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAPLG 142

Query: 452 NLT---GQGGPDLRPYSPRQSSTSKTVTCEHEFCKPPNACATRNSSCPYTVKYLSANTST 622
           NLT G GGP+LR YSP +SSTSKTVTC C PNACAT SSCPY V+Y ANTS+
Sbjct: 143 NLTAVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSS 202

Query: 623 SGVLVEDVLYLTREK-QGGGATGEVVKAPIVFGCGQEQTXXXXXXXXXXXXXXXXMGNVS 799
           SG LVEDVLYLTREK A G V+ P+VFGCGQ QT M VS
Sbjct: 203 SGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVS 262

Query: 800 VPNMLASSGLVASNSFSMCFSEDGIGRINFGDAGSRGQAETPFIVRNIHPAYNISITTIN 979
           VP++LAS+G+V SNSFSMCFS+DG+GRINFGD GS Q+ETPFIV++ H YNISIT+++
Sbjct: 263 VPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTHSYYNISITSMS 322

Query: 980 VENKSLPVEFTAVVDSGTSFTYLNDPAYTELATNFNSQIREKRANWSDSV-----PFEYC 1144
           V +K+LP+ F A+ DSGTSFTYLNDPAYT TNFN+QI E+RAN+S S PFEYC
Sbjct: 323 VGDKNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYC 382

Query: 1145 YGLSPDQKEVLIPDVSLTTRGGALFPVTRP-FVLIVDDTNGKVRVVGYCLAVQKSNISIN 1321
            Y LSPDQ V +P VSLTT GGA+FPVT P + + TNG++R++GYCLAV KS++ I+
Sbjct: 383  YSLSPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLPID 442

Query: 1322 IIGQNFMTGLKVVFDRERSVLGWQKFDCYKNVRMAD--APERSPSPAPGPTA-AHLKPQE 1492
            IIGQNFMTGLKVVF+RE+SVLGWQKFDCYK+ +M D + SPSP+PGPT +PQE
Sbjct: 443  IIGQNFMTGLKVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPSPGPTTHVFPQPQE 502

Query: 1493 NDATNNRS-YPGAAPVPRPTGAGHVGRPAF 1579
            +D+ R+ PGAAPVPR + A GR F
Sbjct: 503  SDSPAGRTPIPGAAPVPRSSSAAAGGRAGF 532

>gb|EAZ09932.1| hypothetical protein OsI_031164 [Oryza sativa (indica cultivar-group)]
          Length = 732

 Score = 420 bits (1079), Expect = e-115
 Identities = 231/466 (49%), Positives = 296/466 (63%), Gaps = 5/466 (1%)
 Frame = +2

Query: 104 ASGVGFNLHHRFSPVVRRWAEARGHPGAWWPEARQSSTEYYSALSRHDRALFARRGLADG 283
           A + ++HHR+S VRRWA A P + EYY+AL+ HD G+ G
Sbjct: 24  AEALSLDVHHRYSAAVRRWAAAAAPP--------HGTAEYYAALAGHDGLRRRSLGVGGG 75

Query: 284 NG--LLTFADGNATVF---DGSLHYAEVAVGTPNATFLVALDTGSNLFWVPCDCKHCAPL 448
           G FADGN T G LHYA VA+GTPN TFLVALDTGS+LFWVPCDC CAPL
Sbjct: 76  GGGAEFAFADGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPL 135

Query: 449 ANLTGQGGPDLRPYSPRQSSTSKTVTCEHEFCKPPNACATRNSSCPYTVKYLSANTSTSG 628
           + G YSP QS+TS+ V C C NAC ++++SCPY+++YLS NTS+SG
Sbjct: 136 QS-PNYGSLKFDVYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSG 194

Query: 629 VLVEDVLYLTREKQGGGATGEVVKAPIVFGCGQEQTXXXXXXXXXXXXXXXXMGNVSVPN 808
           VLVEDVLYLT + A ++V API+FGCGQ QT M + SVP+
Sbjct: 195 VLVEDVLYLTSDS----AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPS 250

Query: 809 MLASSGLVASNSFSMCFSEDGIGRINFGDAGSRGQAETPFIVRNIHPAYNISITTINVEN 988
           +LAS GL A+NSFSMCF +DG GRINFGD GS Q ETP V +P YNI+IT I V +
Sbjct: 251 LLASKGL-AANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGS 309

Query: 989 KSLPVEFTAVVDSGTSFTYLNDPAYTELATNFNSQIREKRANWSDSVPFEYCYGLSPDQK 1168
           KS+ EF+A+VDSGTSFT L+DP YT++ ++F++QIR R S+PFE+CY +S +
Sbjct: 310 KSISTEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSAN-- 367

Query: 1169 EVLIPDVSLTTRGGALFPVTRPFVLIVDDTNGKVRVVGYCLAVQKSNISINIIGQNFMTG 1348
            ++ P+VSLT +GG++FPV P + I D+ VGYCLA+ KS +N+IG+NFM+G
Sbjct: 368  GIVHPNVSLTAKGGSIFPVNDPIITITDNA---FNPVGYCLAIMKSE-GVNLIGENFMSG 423

Query: 1349 LKVVFDRERSVLGWQKFDCYKNVRMADAPERSPSPAPGPTAAHLKP 1486
            LKVVFDRER VLGW+ F+CY + P +PSP+ P L P
Sbjct: 424  LKVVFDRERMVLGWKNFNCYNFDESSRLPV-NPSPSAVPPKPGLGP 468

>dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza sativa (japonica cultivar-group)]
          Length = 732

 Score = 419 bits (1077), Expect = e-115
 Identities = 230/466 (49%), Positives = 296/466 (63%), Gaps = 5/466 (1%)
 Frame = +2

Query: 104 ASGVGFNLHHRFSPVVRRWAEARGHPGAWWPEARQSSTEYYSALSRHDRALFARRGLADG 283
           A + ++HHR+S VRRWA A P + EYY+AL+ HD G+ G
Sbjct: 24  AEALSLDVHHRYSAAVRRWAAAAAPP--------HGTAEYYAALAGHDGLRRRSLGVGGG 75

Query: 284 NG--LLTFADGNATVF---DGSLHYAEVAVGTPNATFLVALDTGSNLFWVPCDCKHCAPL 448
           G FADGN T G LHYA VA+GTPN TFLVALDTGS+LFWVPCDC CAP
Sbjct: 76  GGGAEFAFADGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPF 135

Query: 449 ANLTGQGGPDLRPYSPRQSSTSKTVTCEHEFCKPPNACATRNSSCPYTVKYLSANTSTSG 628
           + G YSP QS+TS+ V C C NAC ++++SCPY+++YLS NTS+SG
Sbjct: 136 QS-PNYGSLKFDVYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSG 194

Query: 629 VLVEDVLYLTREKQGGGATGEVVKAPIVFGCGQEQTXXXXXXXXXXXXXXXXMGNVSVPN 808
           VLVEDVLYLT + A ++V API+FGCGQ QT M + SVP+
Sbjct: 195 VLVEDVLYLTSDS----AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPS 250

Query: 809 MLASSGLVASNSFSMCFSEDGIGRINFGDAGSRGQAETPFIVRNIHPAYNISITTINVEN 988
           +LAS GL A+NSFSMCF +DG GRINFGD GS Q ETP V +P YNI+IT I V +
Sbjct: 251 LLASKGL-AANSFSMCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGS 309

Query: 989 KSLPVEFTAVVDSGTSFTYLNDPAYTELATNFNSQIREKRANWSDSVPFEYCYGLSPDQK 1168
           KS+ EF+A+VDSGTSFT L+DP YT++ ++F++QIR R S+PFE+CY +S +
Sbjct: 310 KSISTEFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSAN-- 367

Query: 1169 EVLIPDVSLTTRGGALFPVTRPFVLIVDDTNGKVRVVGYCLAVQKSNISINIIGQNFMTG 1348
            ++ P+VSLT +GG++FPV P + I D+ VGYCLA+ KS +N+IG+NFM+G
Sbjct: 368  GIVHPNVSLTAKGGSIFPVNDPIITITDNA---FNPVGYCLAIMKSE-GVNLIGENFMSG 423

Query: 1349 LKVVFDRERSVLGWQKFDCYKNVRMADAPERSPSPAPGPTAAHLKP 1486
            LKVVFDRER VLGW+ F+CY + P +PSP+ P+ L P
Sbjct: 424  LKVVFDRERMVLGWKNFNCYNFDESSRLPV-NPSPSAVPSKPGLGP 468

>emb|CAO23259.1| unnamed protein product [Vitis vinifera]
          Length = 823

 Score = 400 bits (1028), Expect = e-109
 Identities = 213/457 (46%), Positives = 287/457 (62%), Gaps = 5/457 (1%)
 Frame = +2

Query: 119 FNLHHRFSPVVRRWAEARGH--PGAWWPEARQSSTEYYSALSRHDRALFARRGLADGNGL 292
           F +HHRFS V++W+E G+ P WP + S EYY+ L+ DRAL RR L+D +GL
Sbjct: 28  FQMHHRFSEPVKKWSEGAGNGFPAGNWPA--KGSFEYYAELAHRDRALRGRR-LSDIDGL 84

Query: 293 LTFADGNATVFDGSL---HYAEVAVGTPNATFLVALDTGSNLFWVPCDCKHCAPLANLTG 463
           LTF+DGN+T SL HY V++GTP FLVALDTGS+LFWVPCDC CAP T
Sbjct: 85  LTFSDGNSTFRISSLGFLHYTTVSLGTPGKKFLVALDTGSDLFWVPCDCSRCAPTEGTTY 144

Query: 464 QGGPDLRPYSPRQSSTSKTVTCEHEFCKPPNACATRNSSCPYTVKYLSANTSTSGVLVED 643
           +L Y+P+ SSTS+ VTC++ C N C S+CPY V Y+SA TSTSG+LVED
Sbjct: 145 ASDFELSIYNPKGSSTSRKVTCDNSLCAHRNRCLGTFSNCPYMVSYVSAETSTSGILVED 204

Query: 644 VLYLTREKQGGGATGEVVKAPIVFGCGQEQTXXXXXXXXXXXXXXXXMGNVSVPNMLASS 823
           VL+LT E E V+A + FGCGQ QT + +SVP++L+
Sbjct: 205 VLHLTTEDN----RQEFVEAYVTFGCGQVQTGSFLDIAAPNGLFGLGLEKISVPSILSKE 260

Query: 824 GLVASNSFSMCFSEDGIGRINFGDAGSRGQAETPFIVRNIHPAYNISITTINVENKSLPV 1003
           G A +SFSMCF DGIGRI+FGD GS Q ETPF + +HP YNI++T + V + +
Sbjct: 261 GFTA-DSFSMCFGPDGIGRISFGDKGSPDQEETPFNLNALHPTYNITVTQVRVGTTLIDL 319

Query: 1004 EFTAVVDSGTSFTYLNDPAYTELATNFNSQIREKRANWSDSVPFEYCYGLSPDQKEVLIP 1183
            +FTA+ DSGTSFTYL DP YT + +F+SQ ++ R +PFE+CY +SP + LIP
Sbjct: 320  DFTALFDSGTSFTYLVDPIYTNVLKSFHSQAQDSRRPPDSRIPFEFCYDMSPGENTSLIP 379

Query: 1184 DVSLTTRGGALFPVTRPFVLIVDDTNGKVRVVGYCLAVQKSNISINIIGQNFMTGLKVVF 1363
            +SLT +GG+ FPV P ++I + + YC+AV +S +NIIGQNFMTG +++F
Sbjct: 380  SMSLTMKGGSQFPVYDPIIIISSQSE-----LIYCMAVVRS-AELNIIGQNFMTGYRIIF 433

Query: 1364 DRERSVLGWQKFDCYKNVRMADAPERSPSPAPGPTAA 1474
            DRE+ VLGW++F+C ++ + P R + + P A
Sbjct: 434  DREKLVLGWKEFEC-DDIENSSVPIRPRATSVPPAVA 469

 Score = 272 bits (696), Expect = 5e-71
 Identities = 153/316 (48%), Positives = 195/316 (61%), Gaps = 2/316 (0%)
 Frame = +2

Query: 476 DLRPYSPRQSSTSKTVTCEHEFCKPPNACATRNSSCPYTVKYLSANTSTSGVLVEDVLYL 655
           D YSP SSTS V C C+ N C+ + +CPY + YLS TS++G LVED+L+L
Sbjct: 511 DFNIYSPNASSTSINVPCNSTLCQHKNQCSATDDTCPYQISYLSNGTSSTGFLVEDMLHL 570

Query: 656 -TREKQGGGATGEVVKAPIVFGCGQEQTXXXXXXXXXXXXXXXXMGNVSVPNMLASSGLV 832
           T + + G+ A I FGCG+ QT MG++SVP++LA GLV
Sbjct: 571 VTDDDESKGSD-----AQITFGCGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLV 625

Query: 833 ASNSFSMCFSEDGIGRINFGDAGSRGQAETPFIVRNIHPAYNISITTINVENKSLPVEFT 1012
           A +SFSMCF DG GRI+FGD GS GQ ETPF YNISIT I+V S + F
Sbjct: 626 A-DSFSMCFGNDGTGRISFGDEGSSGQEETPFNPSKSQLLYNISITQISVGGTSADLNFD 684

Query: 1013 AVVDSGTSFTYLNDPAYTELATNFNSQIREKRANWSDSVPFEYCYGLSPDQKEVLIPDVS 1192
            A+ DSGTSFTYLNDPAYT ++ +FN + ++KR++ +PFEYCY +S Q V P V+
Sbjct: 685  AIFDSGTSFTYLNDPAYTSISESFNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVN 744

Query: 1193 LTTRGGALFPVTRPFVLIVDDTNGKVRVVGYCLAVQKSNISINIIGQNFMTGLKVVFDRE 1372
            LT +GG F VT P ++IV G V YCL V KS INIIGQNFMTG +++FDRE
Sbjct: 745  LTMKGGDNFFVTDP-IVIVSIQGGYV----YCLGVVKSG-DINIIGQNFMTGYRIIFDRE 798

Query: 1373 RSVLGWQKFDC-YKNV 1417
            + VLGW K +C Y N+
Sbjct: 799  KMVLGWTKSNCEYMNM 814