BLASTX 2.2.17 Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bphyem202h12 (1679 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 5,815,196 sequences; 2,006,227,497 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CAO15431.1| unnamed protein product [Vitis vinifera] 462 e-128 ref|NP_178323.2| pentatricopeptide (PPR) repeat-containing prote... 367 1e-99 dbj|BAD94934.1| hypothetical protein [Arabidopsis thaliana] 367 1e-99 gb|AAC97219.1| hypothetical protein [Arabidopsis thaliana] 298 1e-78 gb|EDQ72926.1| predicted protein [Physcomitrella patens subsp. p... 286 3e-75 >emb|CAO15431.1| unnamed protein product [Vitis vinifera] Length = 455 Score = 462 bits (1189), Expect = e-128 Identities = 235/398 (59%), Positives = 288/398 (72%), Gaps = 5/398 (1%) Frame = +1 Query: 388 DSSSVDCMXXXXXXXXXXXXVRVDDEHSALENSSRPSSPFDILMSQDVLPIEMARSRFLD 567 D S DCM V V+++ S LEN+ P+DIL D+ PIE AR+RF+ Sbjct: 60 DDSGADCMHESYRDSLPLHTVGVEEDRSGLENNGSSRGPYDILTINDISPIEAARARFMQ 119 Query: 568 LIVDHFIGENVIEVAESSG-LDCVQGNDKLNKRKQPEVRYEGDPRFALPLMYIANLYETL 744 +IVDHFI ++VIEVA+S + G DKLNKRK EV+YEGDPRF LPLMY+AN+YETL Sbjct: 120 IIVDHFIDDHVIEVADSEADYNGQSGQDKLNKRKSREVQYEGDPRFVLPLMYVANMYETL 179 Query: 745 VSDVNARLVPLIGSREKTIGVALEAAGGLYRKLSQKFPKKGTCSFRRRELATSHATRTKF 924 V++VN RL L G REKTIGVALEAAGGLYR+L++KFPKKG C+F+RRELATS TRT+F Sbjct: 180 VNEVNIRLASLNGIREKTIGVALEAAGGLYRRLAKKFPKKGPCTFKRRELATSIETRTRF 239 Query: 925 PELVVLEEKRVRFVVINGLVIIDRPNNMRMEDAEWFKRLTGRNEVAISSRDYKFYSPRHK 1104 PELV+ EEKRVRFVV+NGLVI+D+PN++ ++DAEWFKRLTGR+EVA+S+RDYKFYSPRHK Sbjct: 240 PELVIQEEKRVRFVVVNGLVIVDKPNSVPIDDAEWFKRLTGRDEVAVSARDYKFYSPRHK 299 Query: 1105 YRR-SQQPVFDIPGTTALSEDDNSPLVCSS-GFRPPNEMQNQHRSSSKRHIEQLENQPYL 1278 YRR + PV +IPG D SP + S+ GFR NE QNQ + SK H++ L Sbjct: 300 YRRVASNPVSNIPGLPTFPGTDTSPTMASAQGFRTVNEPQNQQATPSKHHMQSLSQ---F 356 Query: 1279 HLLDQPENDTIQQNQHSTHFPPIHQCTTASHLSDNPQQHQS-YLSQHIACMQ-AGQGHLG 1452 H + Q + I Q QHS HF HQC SHL + P HQS + QH+AC+Q GH+ Sbjct: 357 HPIHQNHHQPIHQTQHSAHFSHSHQCGPPSHLPEIPHGHQSPTIPQHMACLQPITGGHVS 416 Query: 1453 GRMNILPTSPAKFCDECGSPYLRATSKFCSECGTKRLG 1566 GR+++LPTSPAKFCDECG+PYLR TSKFCSECG KR G Sbjct: 417 GRLHVLPTSPAKFCDECGAPYLRETSKFCSECGGKRFG 454 >ref|NP_178323.2| pentatricopeptide (PPR) repeat-containing protein [Arabidopsis thaliana] Length = 1141 Score = 367 bits (943), Expect = 1e-99 Identities = 210/401 (52%), Positives = 260/401 (64%), Gaps = 7/401 (1%) Frame = +1 Query: 385 GDSSSVDCMXXXXXXXXXXXXVRVDDEHSALENSSRPSSPFDILMSQDVLPIEMARSRFL 564 GDSSS DCM + V++ S +EN S + +L +DV PIE AR RFL Sbjct: 763 GDSSSADCMHESYRNSMQ---IGVEEGGSNMENKG---SAYIMLNIEDVSPIEAARGRFL 816 Query: 565 DLIVDHFIGENVIEVAESS-GLDCVQGNDKLN---KRKQPEVRYEGDPRFALPLMYIANL 732 +I+D+FI ++VIEV ES D G N KRK + RYEGDP FALPLMYIANL Sbjct: 817 QIILDYFISQHVIEVCESKRDHDVDSGGRDSNSKVKRKSDDTRYEGDPSFALPLMYIANL 876 Query: 733 YETLVSDVNARLVPLIGSREKTIGVALEAAGGLYRKLSQKFPKKGTCSFRRRELATSHAT 912 YETLV + N RL L G R+KTIGVALEAAGGLYRKL++KFPKKGTC +RRRELATS T Sbjct: 877 YETLVGEANVRLASLNGIRDKTIGVALEAAGGLYRKLTKKFPKKGTCMYRRRELATSVET 936 Query: 913 RTKFPELVVLEEKRVRFVVINGLVIIDRPNNMRMEDAEWFKRLTGRNEVAISSRDYKFYS 1092 RT+FPELV+ EEKRVRFVV+NGL I+++P+++ +E+AEWFKRLTGRNEVAIS+RDYKFY Sbjct: 937 RTRFPELVIHEEKRVRFVVVNGLDIVEKPSDLPIEEAEWFKRLTGRNEVAISARDYKFYC 996 Query: 1093 PRHKYRRSQQPVFDIPGTTALSEDDNSPLVCSSGFRPPNEMQNQHRSSSKRHIEQLENQP 1272 PR K+RR Q V I G D+S L + GFR Q S SK H+ L +Q Sbjct: 997 PRRKHRRLQNSVSSINGLPTFPGIDSSTLANTQGFREDQSQQQHTPSPSKHHMSSLSHQF 1056 Query: 1273 Y--LHLLDQPENDTIQQNQH-STHFPPIHQCTTASHLSDNPQQHQSYLSQHIACMQAGQG 1443 + +H Q + +I Q+QH +TH+P S N Q +AC+Q G Sbjct: 1057 HQSIHQSHQ-HHQSIYQSQHAATHYP-----------SQNHQCDPELSHTQMACLQPLTG 1104 Query: 1444 HLGGRMNILPTSPAKFCDECGSPYLRATSKFCSECGTKRLG 1566 +++P SPAKFCD+CG+ YLR TSKFCSECG+KRLG Sbjct: 1105 G-----HVMPNSPAKFCDQCGAQYLRETSKFCSECGSKRLG 1140 >dbj|BAD94934.1| hypothetical protein [Arabidopsis thaliana] Length = 432 Score = 367 bits (943), Expect = 1e-99 Identities = 210/401 (52%), Positives = 260/401 (64%), Gaps = 7/401 (1%) Frame = +1 Query: 385 GDSSSVDCMXXXXXXXXXXXXVRVDDEHSALENSSRPSSPFDILMSQDVLPIEMARSRFL 564 GDSSS DCM + V++ S +EN S + +L +DV PIE AR RFL Sbjct: 54 GDSSSADCMHESYRNSMQ---IGVEEGGSNMENKG---SAYIMLNIEDVSPIEAARGRFL 107 Query: 565 DLIVDHFIGENVIEVAESS-GLDCVQGNDKLN---KRKQPEVRYEGDPRFALPLMYIANL 732 +I+D+FI ++VIEV ES D G N KRK + RYEGDP FALPLMYIANL Sbjct: 108 QIILDYFISQHVIEVCESKRDHDVDSGGRDSNSKVKRKSDDTRYEGDPSFALPLMYIANL 167 Query: 733 YETLVSDVNARLVPLIGSREKTIGVALEAAGGLYRKLSQKFPKKGTCSFRRRELATSHAT 912 YETLV + N RL L G R+KTIGVALEAAGGLYRKL++KFPKKGTC +RRRELATS T Sbjct: 168 YETLVGEANVRLASLNGIRDKTIGVALEAAGGLYRKLTKKFPKKGTCMYRRRELATSVET 227 Query: 913 RTKFPELVVLEEKRVRFVVINGLVIIDRPNNMRMEDAEWFKRLTGRNEVAISSRDYKFYS 1092 RT+FPELV+ EEKRVRFVV+NGL I+++P+++ +E+AEWFKRLTGRNEVAIS+RDYKFY Sbjct: 228 RTRFPELVIHEEKRVRFVVVNGLDIVEKPSDLPIEEAEWFKRLTGRNEVAISARDYKFYC 287 Query: 1093 PRHKYRRSQQPVFDIPGTTALSEDDNSPLVCSSGFRPPNEMQNQHRSSSKRHIEQLENQP 1272 PR K+RR Q V I G D+S L + GFR Q S SK H+ L +Q Sbjct: 288 PRRKHRRLQNSVSSINGLPTFPGIDSSTLANTQGFREDQSQQQHTPSPSKHHMSSLSHQF 347 Query: 1273 Y--LHLLDQPENDTIQQNQH-STHFPPIHQCTTASHLSDNPQQHQSYLSQHIACMQAGQG 1443 + +H Q + +I Q+QH +TH+P S N Q +AC+Q G Sbjct: 348 HQSIHQSHQ-HHQSIYQSQHAATHYP-----------SQNHQCDPELSHTQMACLQPLTG 395 Query: 1444 HLGGRMNILPTSPAKFCDECGSPYLRATSKFCSECGTKRLG 1566 +++P SPAKFCD+CG+ YLR TSKFCSECG+KRLG Sbjct: 396 G-----HVMPNSPAKFCDQCGAQYLRETSKFCSECGSKRLG 431 >gb|AAC97219.1| hypothetical protein [Arabidopsis thaliana] Length = 1107 Score = 298 bits (762), Expect = 1e-78 Identities = 163/279 (58%), Positives = 196/279 (70%), Gaps = 4/279 (1%) Frame = +1 Query: 385 GDSSSVDCMXXXXXXXXXXXXVRVDDEHSALENSSRPSSPFDILMSQDVLPIEMARSRFL 564 GDSSS DCM + V++ S +EN S + +L +DV PIE AR RFL Sbjct: 763 GDSSSADCMHESYRNSMQ---IGVEEGGSNMENKG---SAYIMLNIEDVSPIEAARGRFL 816 Query: 565 DLIVDHFIGENVIEVAESS-GLDCVQGNDKLN---KRKQPEVRYEGDPRFALPLMYIANL 732 +I+D+FI ++VIEV ES D G N KRK + RYEGDP FALPLMYIANL Sbjct: 817 QIILDYFISQHVIEVCESKRDHDVDSGGRDSNSKVKRKSDDTRYEGDPSFALPLMYIANL 876 Query: 733 YETLVSDVNARLVPLIGSREKTIGVALEAAGGLYRKLSQKFPKKGTCSFRRRELATSHAT 912 YETLV + N RL L G R+KTIGVALEAAGGLYRKL++KFPKKGTC +RRRELATS T Sbjct: 877 YETLVGEANVRLASLNGIRDKTIGVALEAAGGLYRKLTKKFPKKGTCMYRRRELATSVET 936 Query: 913 RTKFPELVVLEEKRVRFVVINGLVIIDRPNNMRMEDAEWFKRLTGRNEVAISSRDYKFYS 1092 RT+FPELV+ EEKRVRFVV+NGL I+++P+++ +E+AEWFKRLTGRNEVAIS+RDYKFY Sbjct: 937 RTRFPELVIHEEKRVRFVVVNGLDIVEKPSDLPIEEAEWFKRLTGRNEVAISARDYKFYC 996 Query: 1093 PRHKYRRSQQPVFDIPGTTALSEDDNSPLVCSSGFRPPN 1209 PR K+RR Q V I G D+S L + GFR PN Sbjct: 997 PRRKHRRLQNSVSSINGLPTFPGIDSSTLANTQGFREPN 1035 >gb|EDQ72926.1| predicted protein [Physcomitrella patens subsp. patens] Length = 411 Score = 286 bits (732), Expect = 3e-75 Identities = 170/373 (45%), Positives = 225/373 (60%), Gaps = 5/373 (1%) Frame = +1 Query: 463 EHSALENSSRPSSPFDILMSQDVLPIEMARSRFLDLIVDHFIGENVIEVAESSGLDCVQ- 639 E S ++ SP+ +L DV+PIE R+RFL LI+D+FI +V+ E+ + Sbjct: 75 EQSLTMDNEGSRSPYGVLTLNDVIPIESTRARFLQLIIDYFIRYHVVPATEAPEGNSYSP 134 Query: 640 -GNDKLNKRKQPEVRYEGDPRFALPLMYIANLYETLVSDVNARLVPLIGSREKTIGVALE 816 G DK KRK +V+YEGDPR+ LPL ++ANLYETL+ ++N RL + G +EKT GVALE Sbjct: 135 NGKDKSKKRKSRDVQYEGDPRYLLPLTFVANLYETLIREINQRLATIEGLQEKTFGVALE 194 Query: 817 AAGGLYRKLSQKFPKKGTCSFRRRELATSHATRTKFPELVVLEEKRVRFVVINGLVIIDR 996 AAGGLYR+L +KFPK T +F+RRE+A++ RTKFP+LV EEKRVRFVV++GL +++R Sbjct: 195 AAGGLYRRLVKKFPKSATMTFKRREMASALEARTKFPQLVTGEEKRVRFVVVHGLELVER 254 Query: 997 PNNMRMEDAEWFKRLTGRNEVAISSRDYKFYSPRHKYRRSQQPVFDIPGTTALSEDDNSP 1176 P N+ EDAEWFKRLTGR+E I DYK+++ R K+RR+ P + T L Sbjct: 255 P-NLSPEDAEWFKRLTGRHEAQIYESDYKYFAARVKHRRA--PHHTLSSMTLLQ------ 305 Query: 1177 LVCSSGFRPPNEMQNQHRSSSKRHIEQLENQPYLHLLDQPENDTIQQNQHSTHFPPIHQC 1356 V S F ++Q QH QL++Q + H P Q QH+ H PP Sbjct: 306 -VSSWAF---EQLQQQH---------QLQHQQHQH---SPHVQHQQLTQHTVHTPP---- 345 Query: 1357 TTASHLSDNPQQHQSYLSQHIACMQAGQGHLGGRMNILP---TSPAKFCDECGSPYLRAT 1527 D P H S +Q G +L I+P T P+K+CDECGS Y+R T Sbjct: 346 -HVPRDLDAPGMHGS-------VVQVGLNNL-WLAPIIPCSATGPSKYCDECGSAYIRET 396 Query: 1528 SKFCSECGTKRLG 1566 SKFCSECGTKRLG Sbjct: 397 SKFCSECGTKRLG 409