BLASTX 2.2.17

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= bphyem203c14
         (1673 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           5,815,196 sequences; 2,006,227,497 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EAZ33142.1| hypothetical protein OsJ_016625 [Oryza sativa (ja...   590   e-166
gb|EAY96831.1| hypothetical protein OsI_018064 [Oryza sativa (in...   588   e-166
emb|CAN62837.1| hypothetical protein [Vitis vinifera]                 390   e-106
ref|NP_194653.1| leucine-rich repeat family protein / extensin f...   386   e-105
ref|NP_179568.1| leucine-rich repeat family protein / extensin f...   384   e-104

>gb|EAZ33142.1| hypothetical protein OsJ_016625 [Oryza sativa (japonica
            cultivar-group)]
          Length = 402

 Score = 590 bits (1520), Expect = e-166
 Identities = 300/392 (76%), Positives = 324/392 (82%), Gaps = 2/392 (0%)
 Frame = +1

Query: 256  TSQLGLGAAFGVWINXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEYTALQALKAAIAE 435
            +S+L L AAFGVWIN EY ALQALKAA+ E
Sbjct: 14   SSRLALAAAFGVWINGAASSSPQSQ---------------------EYEALQALKAAVVE 52

Query: 436  DPHGALSSWQGPNVCAYKGVYCSAPPDGSXXXXXXXXGIDLNHANLRGTLPAAVSLLAHL 615
            DP GAL+SWQGPNVCAY+GVYCSAPPD + GIDLN ANLRGTLPAAVSLLAHL
Sbjct: 53   DPRGALASWQGPNVCAYRGVYCSAPPDDAAASGAVVAGIDLNRANLRGTLPAAVSLLAHL 112

Query: 616  TFLHLNSNRLAGAVPDTLRDLQYLTELDLSNNLFSGPFPAATLLMPSLVYLDLRFNGFSG 795
            TFLHLNSNRLAG PD+LRDLQYLTELDLSNNLFSGPFPAA LL+PSLVYLDLRFN FSG
Sbjct: 113  TFLHLNSNRLAGQPPDSLRDLQYLTELDLSNNLFSGPFPAAALLIPSLVYLDLRFNAFSG 172

Query: 796  ELPDEVFAKN-LDAIFLNNNQFEGQIPETLWSSPATVITLAYNRLIGPVPTAYGYGAGGR 972
            +P E FAK+ LDA+FLNNNQF+G+IPETLWSSPATVITLA NRL GPVP+AYGYG GR
Sbjct: 173  GIPAEAFAKSSLDALFLNNNQFDGEIPETLWSSPATVITLANNRLTGPVPSAYGYG--GR 230

Query: 973  VREVLFLNNKLTGCIPEALGFLPSIEVLDLSNNSLSGHLPSTLSCLSGIEVLNIAHNQFT 1152
            VREVLFLNNKLTGCIPE LGFLP+IEVLDLS NSLSGHLP TLSCL+GIEVLNIAHNQFT
Sbjct: 231  VREVLFLNNKLTGCIPEELGFLPTIEVLDLSYNSLSGHLPPTLSCLAGIEVLNIAHNQFT 290

Query: 1153 GELPNLVCDLKRITNLSVSFNFFSGISQDCDRLAGRSVFDFVGNCIPGRGLQRPQPECDG 1332
            GELP+LVCDLKRITNLSVSFNFFSGISQ C+RLAGRSVFDFVGNC+PGRGLQRP PECDG
Sbjct: 291  GELPDLVCDLKRITNLSVSFNFFSGISQHCNRLAGRSVFDFVGNCVPGRGLQRPPPECDG 350

Query: 1333 APGDGGLSCLR-IPGTRPVACGEAAVSIGIGI 1425
            PGDGGLSCLR IP TRPV C +A+VS+G+G+
Sbjct: 351  GPGDGGLSCLRSIPVTRPVPCAQASVSVGVGV 382

>gb|EAY96831.1| hypothetical protein OsI_018064 [Oryza sativa (indica
            cultivar-group)]
          Length = 406

 Score = 588 bits (1517), Expect = e-166
 Identities = 300/392 (76%), Positives = 323/392 (82%), Gaps = 2/392 (0%)
 Frame = +1

Query: 256  TSQLGLGAAFGVWINXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEYTALQALKAAIAE 435
            +S+L L AAFGVWIN EY ALQALKAA+ E
Sbjct: 14   SSRLALAAAFGVWINGAASSSPQSQ---------------------EYEALQALKAAVVE 52

Query: 436  DPHGALSSWQGPNVCAYKGVYCSAPPDGSXXXXXXXXGIDLNHANLRGTLPAAVSLLAHL 615
            DP GAL+SWQGPNVCAY+GVYCSAPPD + GIDLN ANLRGTLPAAVSLLAHL
Sbjct: 53   DPRGALASWQGPNVCAYRGVYCSAPPDDAAASGAVVAGIDLNRANLRGTLPAAVSLLAHL 112

Query: 616  TFLHLNSNRLAGAVPDTLRDLQYLTELDLSNNLFSGPFPAATLLMPSLVYLDLRFNGFSG 795
            TFLHLNSNRLAG PD+LRDLQYLTELDLSNNLFSGPFPAA LL+PSLVYLDLRFN FSG
Sbjct: 113  TFLHLNSNRLAGQPPDSLRDLQYLTELDLSNNLFSGPFPAAALLIPSLVYLDLRFNAFSG 172

Query: 796  ELPDEVFAKN-LDAIFLNNNQFEGQIPETLWSSPATVITLAYNRLIGPVPTAYGYGAGGR 972
            +P E FAK+ LDA+FLNNNQF+G+IPETLWSSPATVITLA NRL GPVP+AYGYG GR
Sbjct: 173  GIPAEAFAKSSLDALFLNNNQFDGEIPETLWSSPATVITLANNRLTGPVPSAYGYG--GR 230

Query: 973  VREVLFLNNKLTGCIPEALGFLPSIEVLDLSNNSLSGHLPSTLSCLSGIEVLNIAHNQFT 1152
            VREVLFLNNKLTGCIPE LGFLP+IEVLDLS NSLSGHLP TLSCL+GIEVLNIAHNQFT
Sbjct: 231  VREVLFLNNKLTGCIPEELGFLPTIEVLDLSYNSLSGHLPPTLSCLAGIEVLNIAHNQFT 290

Query: 1153 GELPNLVCDLKRITNLSVSFNFFSGISQDCDRLAGRSVFDFVGNCIPGRGLQRPQPECDG 1332
            GELP+LVCDLKRITNLSVSFNFFSGISQ CDRLAGRSVFDFVGNC+PGRGLQRP PECDG
Sbjct: 291  GELPDLVCDLKRITNLSVSFNFFSGISQHCDRLAGRSVFDFVGNCVPGRGLQRPPPECDG 350

Query: 1333 APGDGGLSCLR-IPGTRPVACGEAAVSIGIGI 1425
            GDGGLSCLR IP TRPV C +A+VS+G+G+
Sbjct: 351  GQGDGGLSCLRSIPVTRPVPCAQASVSVGVGV 382

>emb|CAN62837.1| hypothetical protein [Vitis vinifera]
          Length = 398

 Score = 390 bits (1001), Expect = e-106
 Identities = 190/335 (56%), Positives = 248/335 (74%), Gaps = 2/335 (0%)
 Frame = +1

Query: 397  YTALQALKAAIAEDPHGALSSWQGPNVCAYKGVYCSAPP-DGSXXXXXXXXGIDLNHANL 573
            + ALQA K+AI +DP L +W G NVCAY+GV+C+ P DGS GIDLN ANL
Sbjct: 56   HLALQAWKSAITDDPLKVLRTWVGSNVCAYRGVFCADPEEDGSGQTGPVVAGIDLNRANL 115

Query: 574  RGTLPAAVSLLAHLTFLHLNSNRLAGAVPDTLRDLQYLTELDLSNNLFSGPFPAATLLMP 753
            +GTL +S+L ++ LHL+ NR G VP++ R L L ELDLSNN FSGPFP TLLMP
Sbjct: 116  QGTLVKELSVLTDMSLLHLSGNRFTGTVPESFRYLLSLKELDLSNNHFSGPFPTVTLLMP 175

Query: 754  SLVYLDLRFNGFSGELPDEVFAKNLDAIFLNNNQFEGQIPETLWSSPATVITLAYNRLIG 933
            +L+YLD+RFN F+G +PD++F K LDAI +NNNQF+G++P L +SPA+VI LA N+ G
Sbjct: 176  NLIYLDIRFNNFAGPIPDDLFNKELDAIIINNNQFDGELPPNLGNSPASVINLANNKFSG 235

Query: 934  PVPTAYGYGAGGRVREVLFLNNKLTGCIPEALGFLPSIEVLDLSNNSLSGHLPSTLSCLS 1113
            +PT++ Y +++E+LFLNN+LTGCIPE +G +EVLDLS+NSL GHLP+++SCL
Sbjct: 236  NIPTSFAY-MNPKLKEILFLNNQLTGCIPEGVGMWDGMEVLDLSHNSLMGHLPNSISCLE 294

Query: 1114 GIEVLNIAHNQFTGELPNLVCDLKRITNLSVSFNFFSGISQDCDRLAGRSV-FDFVGNCI 1290
            IEVLN+AHN+ +G L +LVC L+ I NL+V++NFFSG Q+C +L R+V FDF NCI
Sbjct: 295  EIEVLNLAHNKLSGSLSDLVCSLRSIVNLTVAYNFFSGFGQECSKLFFRNVGFDFSVNCI 354

Query: 1291 PGRGLQRPQPECDGAPGDGGLSCLRIPGTRPVACG 1395
            PGR +QRPQP+C PG GGL+CLRIP RP+ CG
Sbjct: 355  PGRDMQRPQPDCSVIPG-GGLNCLRIPSARPLICG 388

>ref|NP_194653.1| leucine-rich repeat family protein / extensin family
            protein [Arabidopsis thaliana]
 emb|CAB79682.1| extensin-like protein [Arabidopsis thaliana]
 gb|AAM13857.1| putative extensin [Arabidopsis thaliana]
 gb|AAM51323.1| putative extensin [Arabidopsis thaliana]
          Length = 415

 Score = 386 bits (992), Expect = e-105
 Identities = 194/341 (56%), Positives = 246/341 (72%)
 Frame = +1

Query: 397  YTALQALKAAIAEDPHGALSSWQGPNVCAYKGVYCSAPPDGSXXXXXXXXGIDLNHANLR 576
            Y ALQ K+A+ EDP L +W G +VC+YKGV+CS S IDLNHANL+
Sbjct: 77   YNALQVWKSAMREDPSNVLKTWVGSDVCSYKGVFCSGQSITS---------IDLNHANLK 127

Query: 577  GTLPAAVSLLAHLTFLHLNSNRLAGAVPDTLRDLQYLTELDLSNNLFSGPFPAATLLMPS 756
            GTL ++LL+ L LHLNSNR +G +PD+ + L L ELDLSNN SGPFP TL +P+
Sbjct: 128  GTLVKDLALLSDLNILHLNSNRFSGQIPDSFKSLASLQELDLSNNKLSGPFPLVTLYIPN 187

Query: 757  LVYLDLRFNGFSGELPDEVFAKNLDAIFLNNNQFEGQIPETLWSSPATVITLAYNRLIGP 936
            LVYLDLRFN +G +P+E+F K LDAI LNNNQF G+IP L +SPA+VI LA NR G
Sbjct: 188  LVYLDLRFNSLTGFIPEELFNKRLDAILLNNNQFVGEIPRNLGNSPASVINLANNRFSGE 247

Query: 937  VPTAYGYGAGGRVREVLFLNNKLTGCIPEALGFLPSIEVLDLSNNSLSGHLPSTLSCLSG 1116
            +PT++G G RV+EVL LNN+LTGCIPE++G IEV D+S N+L GH+P T+SCLS
Sbjct: 248  IPTSFGL-TGSRVKEVLLLNNQLTGCIPESVGMFSEIEVFDVSYNALMGHVPDTISCLSA 306

Query: 1117 IEVLNIAHNQFTGELPNLVCDLKRITNLSVSFNFFSGISQDCDRLAGRSVFDFVGNCIPG 1296
            IE+LN+AHN+F+GE+P+LVC L+ + NL+V+FNFFSG S +C FDFVGNCIPG
Sbjct: 307  IEILNLAHNKFSGEVPDLVCSLRNLINLTVAFNFFSGFSSECSSRVSFG-FDFVGNCIPG 365

Query: 1297 RGLQRPQPECDGAPGDGGLSCLRIPGTRPVACGEAAVSIGI 1419
            R QRPQP+C G G G +SC RIP T+P+AC AA+S+G+
Sbjct: 366  RNSQRPQPDCSGYSG-GAMSCFRIP-TQPLAC--AAISVGL 402

>ref|NP_179568.1| leucine-rich repeat family protein / extensin family
            protein [Arabidopsis thaliana]
 gb|AAC62138.1| putative disease resistance protein [Arabidopsis thaliana]
 gb|AAO42156.1| putative disease resistance protein [Arabidopsis thaliana]
 gb|AAO50553.1| putative disease resistance protein [Arabidopsis thaliana]
          Length = 402

 Score = 384 bits (985), Expect = e-104
 Identities = 194/343 (56%), Positives = 248/343 (72%)
 Frame = +1

Query: 397  YTALQALKAAIAEDPHGALSSWQGPNVCAYKGVYCSAPPDGSXXXXXXXXGIDLNHANLR 576
            Y ALQ+ K+AI EDP G L +W G +VC+Y+GV+CS S IDLN ANL+
Sbjct: 72   YNALQSWKSAITEDPSGVLKTWVGEDVCSYRGVFCSGSSITS---------IDLNKANLK 122

Query: 577  GTLPAAVSLLAHLTFLHLNSNRLAGAVPDTLRDLQYLTELDLSNNLFSGPFPAATLLMPS 756
            GT+ +SLL+ LT LHLNSNR +G +PD+ ++L L ELDLSNN FSG FP TL +P+
Sbjct: 123  GTIVKDLSLLSDLTILHLNSNRFSGQIPDSFKNLDSLQELDLSNNRFSGSFPQVTLYIPN 182

Query: 757  LVYLDLRFNGFSGELPDEVFAKNLDAIFLNNNQFEGQIPETLWSSPATVITLAYNRLIGP 936
            LVYLDLRFN F+G +P+ +F K LDAI LNNNQF G+IP L S A+VI LA N+L G
Sbjct: 183  LVYLDLRFNNFTGSIPENLFNKQLDAILLNNNQFTGEIPGNLGYSTASVINLANNKLSGE 242

Query: 937  VPTAYGYGAGGRVREVLFLNNKLTGCIPEALGFLPSIEVLDLSNNSLSGHLPSTLSCLSG 1116
            +PT++G G +++EVLFLNN+LTGCIPE++G IEV D+S NSL GH+P T+SCLS
Sbjct: 243  IPTSFGI-TGSKLKEVLFLNNQLTGCIPESVGLFSDIEVFDVSFNSLMGHVPDTISCLSE 301

Query: 1117 IEVLNIAHNQFTGELPNLVCDLKRITNLSVSFNFFSGISQDCDRLAGRSVFDFVGNCIPG 1296
            IEVLN+ HN+F+G+LP+LVC L+ + NL+VSFNFFSG S C L+ FDF GNCIPG
Sbjct: 302  IEVLNLGHNKFSGDLPDLVCTLRNLINLTVSFNFFSGFSSQCSSLS--VGFDFTGNCIPG 359

Query: 1297 RGLQRPQPECDGAPGDGGLSCLRIPGTRPVACGEAAVSIGIGI 1425
            +G QRPQP+C PG G LSC RIP +P+ C AA+S+G+ +
Sbjct: 360  KGYQRPQPDCSAIPG-GQLSCFRIP-AQPLTC--AAISLGLKV 398
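
The per-alignment statistics in a report like this one (bit score, E-value, identities) can be pulled out with a short script. The following is a minimal sketch, not part of the BLAST output: the names `STATS_RE`, `parse_evalue`, and `parse_stats` are hypothetical, and the pattern assumes the `Score = ... Expect = ... Identities = ...` wording seen in this report. Note that this legacy format abbreviates E-values such as `1e-166` to `e-166`, so the leading digit has to be restored before numeric conversion.

```python
import re

# Matches the statistics that open each alignment in this legacy text format.
# \s* between fields tolerates both line breaks and single spaces.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\de.+-]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)"
)


def parse_evalue(text):
    """Convert an E-value string to float, restoring an omitted leading '1'."""
    return float("1" + text if text.startswith("e") else text)


def parse_stats(report):
    """Return one dict per alignment statistics block found in the report text."""
    hits = []
    for m in STATS_RE.finditer(report):
        ident, length = int(m["ident"]), int(m["length"])
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": parse_evalue(m["evalue"]),
            "identities": ident,
            "align_length": length,
            "identity_pct": 100.0 * ident / length,
        })
    return hits


if __name__ == "__main__":
    sample = (
        "Score = 590 bits (1520), Expect = e-166 "
        "Identities = 300/392 (76%), Positives = 324/392 (82%), Gaps = 2/392 (0%)"
    )
    for hit in parse_stats(sample):
        print(hit)
```

For anything beyond a quick check, a maintained BLAST parser is likely a better choice than a hand-written pattern, since the text layout differs between BLAST versions.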