BLASTX 2.2.17 Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bphyem113o01 (1188 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 5,815,196 sequences; 2,006,227,497 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAD09701.1| pentatricopeptide (PPR) repeat-containing protei... 537 e-151 gb|EAZ07659.1| hypothetical protein OsI_028891 [Oryza sativa (in... 535 e-150 gb|EAZ43361.1| hypothetical protein OsJ_026844 [Oryza sativa (ja... 418 e-115 emb|CAO66535.1| unnamed protein product [Vitis vinifera] 352 4e-95 gb|ABA18111.1| pentatricopeptide repeat protein [Arabidopsis are... 338 6e-91 >dbj|BAD09701.1| pentatricopeptide (PPR) repeat-containing protein-like [Oryza sativa (japonica cultivar-group)] dbj|BAD10436.1| pentatricopeptide (PPR) repeat-containing protein-like [Oryza sativa (japonica cultivar-group)] Length = 393 Score = 537 bits (1384), Expect = e-151 Identities = 278/351 (79%), Positives = 291/351 (82%), Gaps = 3/351 (0%) Frame = +1 Query: 10 PRR-AVSCPRTAPPDADAA--LMVARAEAGDFARAQSIWAQLLHSSTXXXXXXXXXXXXX 180 PRR CPR APP ADAA LM+A AEAGDFA A+S+WAQLLHSS Sbjct: 44 PRRDGAYCPRAAPPHADAAAALMLAHAEAGDFASARSMWAQLLHSSAAPRLRAAAPRLLP 103 Query: 181 XXXXXXXFDEILLAVRELSARDPAAAHGLYPLTVSCFGAAGELVLMEDTVQEMARLGLRV 360 DE LL VREL ARDP AA LYPL V+CFGAAGEL LMED V+EMAR GL V Sbjct: 104 AYARLGRCDEALLVVRELCARDPGAARALYPLAVTCFGAAGELALMEDAVREMARHGLPV 163 Query: 361 DSATGNAFVQHYAASGTVPQMEAAYRRLKRSGLLISADAIRAVASAYISQRKYYKLGEFV 540 DSATGNAFV HYAASGTVPQMEAAYRRLK S LL+S AIRA+ASAYIS RKYYKLGEFV Sbjct: 164 DSATGNAFVCHYAASGTVPQMEAAYRRLKASRLLVSVAAIRAMASAYISHRKYYKLGEFV 223 Query: 541 NDVGLGRRNAGNLLWNLYLLSFAANFKMKSLQRAFLEMVAAGVRPDLTTFNIRAAAFSKM 720 DVGLGRR GNLLWNLYLLSFAANFKMKSLQRAFL+MVAAG PDLTTFN+RA AFSKM Sbjct: 224 TDVGLGRRAGGNLLWNLYLLSFAANFKMKSLQRAFLDMVAAGFTPDLTTFNLRAVAFSKM 283 Query: 721 CMFWDLHLSAEHMRRDGVAPDLVTHGCFVDAYLERRLARNLTFAFDRLDGAGEPVVATDG 900 CMFWDLHL+A+HMRRDGVAPDLVTHGCFVDAYLERRLARNL FAFDRL GAGEPVVATD Sbjct: 284 CMFWDLHLTADHMRRDGVAPDLVTHGCFVDAYLERRLARNLNFAFDRL-GAGEPVVATDA 342 Query: 901 IVFEAFGKGGFHASSEALLEATGGKRRWTYYKLLGVYLRKQRRKNQIFWNY 1053 +VFEAFGKGGFHASSE LLEATGG+RRWTYYKLLGVYLRKQ RKNQIFWNY Sbjct: 343 VVFEAFGKGGFHASSEVLLEATGGERRWTYYKLLGVYLRKQHRKNQIFWNY 393 >gb|EAZ07659.1| hypothetical protein OsI_028891 [Oryza sativa (indica cultivar-group)] Length = 393 Score = 535 bits (1378), Expect = e-150 Identities = 277/351 (78%), Positives = 290/351 (82%), Gaps = 3/351 (0%) Frame = +1 Query: 10 PRR-AVSCPRTAPPDADAA--LMVARAEAGDFARAQSIWAQLLHSSTXXXXXXXXXXXXX 180 PRR CPR APP ADAA LM+A AEAGDFA A+S+WAQLLHSS Sbjct: 44 PRRDGAYCPRAAPPHADAAAALMLAHAEAGDFASARSMWAQLLHSSAAPRLRAAAPRLLP 103 Query: 181 XXXXXXXFDEILLAVRELSARDPAAAHGLYPLTVSCFGAAGELVLMEDTVQEMARLGLRV 360 DE LL VREL ARDP AA LYPL V+CFGAAGEL LMED V+EMAR GL V Sbjct: 104 AYARLGRCDEALLVVRELCARDPGAARALYPLAVTCFGAAGELALMEDAVREMARHGLPV 163 Query: 361 DSATGNAFVQHYAASGTVPQMEAAYRRLKRSGLLISADAIRAVASAYISQRKYYKLGEFV 540 DSATGNAFV HYAASGTVPQMEAAYRRLK S LL+S IRA+ASAYIS RKYYKLGEFV Sbjct: 164 DSATGNAFVCHYAASGTVPQMEAAYRRLKASRLLVSVADIRAMASAYISHRKYYKLGEFV 223 Query: 541 NDVGLGRRNAGNLLWNLYLLSFAANFKMKSLQRAFLEMVAAGVRPDLTTFNIRAAAFSKM 720 DVGLGRR GNLLWNLYLLSFAANFKMKSLQRAFL+MVAAG PDLTTFN+RA AFSKM Sbjct: 224 TDVGLGRRAGGNLLWNLYLLSFAANFKMKSLQRAFLDMVAAGFTPDLTTFNLRAVAFSKM 283 Query: 721 CMFWDLHLSAEHMRRDGVAPDLVTHGCFVDAYLERRLARNLTFAFDRLDGAGEPVVATDG 900 CMFWDLHL+A+HMRRDGVAPDLVTHGCFVDAYLERRLARNL FAFDRL GAGEPVVATD Sbjct: 284 CMFWDLHLTADHMRRDGVAPDLVTHGCFVDAYLERRLARNLNFAFDRL-GAGEPVVATDA 342 Query: 901 IVFEAFGKGGFHASSEALLEATGGKRRWTYYKLLGVYLRKQRRKNQIFWNY 1053 +VFEAFGKGGFHASSE LLEATGG+RRWTYYKLLGVYLRKQ RKNQIFWNY Sbjct: 343 VVFEAFGKGGFHASSEVLLEATGGERRWTYYKLLGVYLRKQHRKNQIFWNY 393 >gb|EAZ43361.1| hypothetical protein OsJ_026844 [Oryza sativa (japonica cultivar-group)] Length = 319 Score = 418 bits (1075), Expect = e-115 Identities = 217/279 (77%), Positives = 227/279 (81%) Frame = +1 Query: 217 LAVRELSARDPAAAHGLYPLTVSCFGAAGELVLMEDTVQEMARLGLRVDSATGNAFVQHY 396 L V L ARDP AA LYPL V+CFGAAGEL LMED V+EMAR GL VDSATGNAFV HY Sbjct: 67 LRVGALCARDPGAARALYPLAVTCFGAAGELALMEDAVREMARHGLPVDSATGNAFVCHY 126 Query: 397 AASGTVPQMEAAYRRLKRSGLLISADAIRAVASAYISQRKYYKLGEFVNDVGLGRRNAGN 576 AASGTVPQMEAAYRRLK S LL+S AIRA+ASAYIS RKYYKLGEFV D Sbjct: 127 AASGTVPQMEAAYRRLKASRLLVSVAAIRAMASAYISHRKYYKLGEFVTD---------- 176 Query: 577 LLWNLYLLSFAANFKMKSLQRAFLEMVAAGVRPDLTTFNIRAAAFSKMCMFWDLHLSAEH 756 MKSLQRAFL+MVAAG PDLTTFN+RA AFSKMCMFWDLHL+A+H Sbjct: 177 ---------------MKSLQRAFLDMVAAGFTPDLTTFNLRAVAFSKMCMFWDLHLTADH 221 Query: 757 MRRDGVAPDLVTHGCFVDAYLERRLARNLTFAFDRLDGAGEPVVATDGIVFEAFGKGGFH 936 MRRDGVAPDLVTHGCFVDAYLERRLARNL FAFDRL GAGEPVVATD +VFEAFGKGGFH Sbjct: 222 MRRDGVAPDLVTHGCFVDAYLERRLARNLNFAFDRL-GAGEPVVATDAVVFEAFGKGGFH 280 Query: 937 ASSEALLEATGGKRRWTYYKLLGVYLRKQRRKNQIFWNY 1053 ASSE LLEATGG+RRWTYYKLLGVYLRKQ RKNQIFWNY Sbjct: 281 ASSEVLLEATGGERRWTYYKLLGVYLRKQHRKNQIFWNY 319 >emb|CAO66535.1| unnamed protein product [Vitis vinifera] Length = 423 Score = 352 bits (902), Expect = 4e-95 Identities = 176/332 (53%), Positives = 234/332 (70%) Frame = +1 Query: 58 AALMVARAEAGDFARAQSIWAQLLHSSTXXXXXXXXXXXXXXXXXXXXFDEILLAVRELS 237 +ALM+ A+ G F +AQ++W ++++SS F E+ + ++S Sbjct: 94 SALMLCYADNGLFPKAQALWDEIINSS-FGPNIQIVSKLIDAYGKMGHFGEVTRILHQVS 152 Query: 238 ARDPAAAHGLYPLTVSCFGAAGELVLMEDTVQEMARLGLRVDSATGNAFVQHYAASGTVP 417 +RD H +Y L +SCFG G+L +ME+ ++EM G VDSATGNAF+++Y+ G++ Sbjct: 153 SRDFNFMHEVYSLAISCFGKGGQLEMMENALKEMVSRGFPVDSATGNAFIRYYSIFGSLT 212 Query: 418 QMEAAYRRLKRSGLLISADAIRAVASAYISQRKYYKLGEFVNDVGLGRRNAGNLLWNLYL 597 +MEAAY RLK+S +LI + IRA++ AYI ++KYY+LG+F+ DVGLGR+N GNLLWNL L Sbjct: 213 EMEAAYDRLKKSRILIEEEGIRAMSFAYIKEKKYYRLGQFLRDVGLGRKNVGNLLWNLLL 272 Query: 598 LSFAANFKMKSLQRAFLEMVAAGVRPDLTTFNIRAAAFSKMCMFWDLHLSAEHMRRDGVA 777 LS+AANFKMKSLQR FLEMV AG PDLTTFNIRA AFS+M +FWDLHLS EHM+ V Sbjct: 273 LSYAANFKMKSLQREFLEMVEAGFAPDLTTFNIRALAFSRMSLFWDLHLSLEHMQHVKVV 332 Query: 778 PDLVTHGCFVDAYLERRLARNLTFAFDRLDGAGEPVVATDGIVFEAFGKGGFHASSEALL 957 DLVT+GC VDAYL+RRL +NL FA +++ P+V+TD VFE GKG FH+SSEA L Sbjct: 333 ADLVTYGCVVDAYLDRRLGKNLDFALKKMNMDDSPLVSTDHFVFEVLGKGDFHSSSEAFL 392 Query: 958 EATGGKRRWTYYKLLGVYLRKQRRKNQIFWNY 1053 E+ +WTY KL+ YL+K+ R NQIFWNY Sbjct: 393 ESK-RNGKWTYRKLIATYLKKKYRSNQIFWNY 423 >gb|ABA18111.1| pentatricopeptide repeat protein [Arabidopsis arenosa] Length = 419 Score = 338 bits (866), Expect = 6e-91 Identities = 170/331 (51%), Positives = 231/331 (69%) Frame = +1 Query: 61 ALMVARAEAGDFARAQSIWAQLLHSSTXXXXXXXXXXXXXXXXXXXXFDEILLAVRELSA 240 ALM+ AE G RA++IW ++L+SS FDE+ ++++A Sbjct: 91 ALMLCFAENGFVLRARTIWDEILNSS-FVPDVFVVSKLISAYEQLGFFDEVAKITKDVAA 149 Query: 241 RDPAAAHGLYPLTVSCFGAAGELVLMEDTVQEMARLGLRVDSATGNAFVQHYAASGTVPQ 420 R +Y L +SCFG G+L LME ++EM G+ +DSAT NA V++++ GT+ + Sbjct: 150 RHSTLLPVVYSLAISCFGKNGQLELMEGVIEEMDSKGMSLDSATANAIVRYFSFFGTLDK 209 Query: 421 MEAAYRRLKRSGLLISADAIRAVASAYISQRKYYKLGEFVNDVGLGRRNAGNLLWNLYLL 600 +E AY RLK+ G++I + IRAV AY+ QRK+Y+L EF++DVGLGRRN GN+LWN LL Sbjct: 210 IEHAYGRLKKFGIVIEEEEIRAVLLAYLKQRKFYRLREFLSDVGLGRRNLGNMLWNSVLL 269 Query: 601 SFAANFKMKSLQRAFLEMVAAGVRPDLTTFNIRAAAFSKMCMFWDLHLSAEHMRRDGVAP 780 S+AA FKMKSLQR F+EM+ AG PDLTTFNIRA AFS+M +FWDLHL+ EHMRR + P Sbjct: 270 SYAAEFKMKSLQREFIEMLDAGFSPDLTTFNIRALAFSRMALFWDLHLTLEHMRRLNIVP 329 Query: 781 DLVTHGCFVDAYLERRLARNLTFAFDRLDGAGEPVVATDGIVFEAFGKGGFHASSEALLE 960 DLVT GC VDAY+++RLARNL F +++++ PVV TD + FE GKG FH SSEA+LE Sbjct: 330 DLVTFGCVVDAYMDKRLARNLEFVYNQMNLDDSPVVLTDPLAFEVLGKGDFHLSSEAVLE 389 Query: 961 ATGGKRRWTYYKLLGVYLRKQRRKNQIFWNY 1053 + ++ WTY KL+GVY++K+ R++QIFWNY Sbjct: 390 FS-TEKNWTYRKLIGVYVKKKLRRDQIFWNY 419