BLASTX 2.2.17

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= bphylf051k03 (1650 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
5,815,196 sequences; 2,006,227,497 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

ref|NP_001045533.1| Os01g0971400 [Oryza sativa (japonica cultiva...   262    6e-68
dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa (ja...   262    6e-68
emb|CAO47540.1| unnamed protein product [Vitis vinifera]              242    6e-62
ref|NP_567983.1| XCP1 (XYLEM CYSTEINE PEPTIDASE 1); cysteine-typ...   229    4e-58
ref|NP_564126.1| XCP2 (XYLEM CYSTEINE PEPTIDASE 2); cysteine-typ...   225    8e-57

>ref|NP_001045533.1| Os01g0971400 [Oryza sativa (japonica cultivar-group)]
dbj|BAF07447.1| Os01g0971400 [Oryza sativa (japonica cultivar-group)]
Length = 368

Score = 262 bits (669), Expect = 6e-68
Identities = 148/234 (63%), Positives = 157/234 (67%), Gaps = 8/234 (3%)
Frame = +3

Query: 663 VATKTNAGSCWAFSTVAAVEGINQIVTGNLTALSEQELIXXXXXXXXXXXXXLMDYAFKY 842
V + GSCWAFSTVAAVEGIN IVTGNLT LSEQELI LMDYAF Y
Sbjct: 157 VKNQGQCGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLMDYAFSY 216

Query: 843 IAANGGLHTEEDYPYLMEEGTCERKRSEDGD------VVTITGYEDVPRNNEQALLKAHA 1004
IAANGGLHTEE YPYLMEEGTC R +E D VTI+GYEDVPRNNEQALLKA A
Sbjct: 217 IAANGGLHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALLKALA 276

Query: 1005 HQPVSVAIEASGRNFRFYSGKCMNKLE*NVSARSPLCRACRECLMTPLWDG-AGSRRD-G 1178
HQPVSVAIEASGRNF+FYSG ++DG G+R D G
Sbjct: 277 HQPVSVAIEASGRNFQFYSG--------------------------GVFDGPCGTRLDHG 310

Query: 1179 RRNVGYGTAAXXXXXXIIVKNSWGSHWGEKGYIRMRRGTGKREGLCGTNKMASY 1340
VGYGTA+ IIVKNSWGSHWGEKGYIRMRRGTGK +GLCG NKMASY
Sbjct: 311 VTAVGYGTAS-KGHDYIIVKNSWGSHWGEKGYIRMRRGTGKHDGLCGINKMASY 363

Score = 107 bits (266), Expect(2) = 5e-45
Identities = 53/77 (68%), Positives = 60/77 (77%), Gaps = 2/77 (2%)
Frame = +2

Query: 296 ETNKKVSTYWLGLNEFADLTHDEFKAVYLGVRTGLPRSRNTFSSNFRYEDV--VELPKAM 469
E NKK++ YWLGLNEFADLTHDEFKA YLG+ T P RN+ FRYE+V LPK +
Sbjct: 88 EENKKITGYWLGLNEFADLTHDEFKAAYLGL-TLTPARRNSNDQLFRYEEVEAASLPKEV 146

Query: 470 DWRKKGAVTDVKNQGQC 520
DWRKKGAVT+VKNQGQC
Sbjct: 147 DWRKKGAVTEVKNQGQC 163

Score = 99.4 bits (246), Expect(2) = 5e-45
Identities = 45/57 (78%), Positives = 53/57 (92%)
Frame = +1

Query: 121 PSDFSIVGYSEEDLVSHDRIMELFEKWLSKYRKAYASFEEKLKRFEVFKDNLKHIDE 291
PS+ SIVGYSEEDL SH+R+MELFEK+++KYRKAY+S EEKL+RFEVFKDNL HIDE
Sbjct: 32 PSELSIVGYSEEDLASHERLMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDE 88

>dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa (japonica cultivar-group)]
gb|EAY77396.1| hypothetical protein OsI_005243 [Oryza sativa (indica cultivar-group)]
gb|EAZ14987.1| hypothetical protein OsJ_004812 [Oryza sativa (japonica cultivar-group)]
Length = 365

Score = 262 bits (669), Expect = 6e-68
Identities = 148/234 (63%), Positives = 157/234 (67%), Gaps = 8/234 (3%)
Frame = +3

Query: 663 VATKTNAGSCWAFSTVAAVEGINQIVTGNLTALSEQELIXXXXXXXXXXXXXLMDYAFKY 842
V + GSCWAFSTVAAVEGIN IVTGNLT LSEQELI LMDYAF Y
Sbjct: 154 VKNQGQCGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLMDYAFSY 213

Query: 843 IAANGGLHTEEDYPYLMEEGTCERKRSEDGD------VVTITGYEDVPRNNEQALLKAHA 1004
IAANGGLHTEE YPYLMEEGTC R +E D VTI+GYEDVPRNNEQALLKA A
Sbjct: 214 IAANGGLHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALLKALA 273

Query: 1005 HQPVSVAIEASGRNFRFYSGKCMNKLE*NVSARSPLCRACRECLMTPLWDG-AGSRRD-G 1178
HQPVSVAIEASGRNF+FYSG ++DG G+R D G
Sbjct: 274 HQPVSVAIEASGRNFQFYSG--------------------------GVFDGPCGTRLDHG 307

Query: 1179 RRNVGYGTAAXXXXXXIIVKNSWGSHWGEKGYIRMRRGTGKREGLCGTNKMASY 1340
VGYGTA+ IIVKNSWGSHWGEKGYIRMRRGTGK +GLCG NKMASY
Sbjct: 308 VTAVGYGTAS-KGHDYIIVKNSWGSHWGEKGYIRMRRGTGKHDGLCGINKMASY 360

Score = 107 bits (266), Expect(2) = 5e-45
Identities = 53/77 (68%), Positives = 60/77 (77%), Gaps = 2/77 (2%)
Frame = +2

Query: 296 ETNKKVSTYWLGLNEFADLTHDEFKAVYLGVRTGLPRSRNTFSSNFRYEDV--VELPKAM 469
E NKK++ YWLGLNEFADLTHDEFKA YLG+ T P RN+ FRYE+V LPK +
Sbjct: 85 EENKKITGYWLGLNEFADLTHDEFKAAYLGL-TLTPARRNSNDQLFRYEEVEAASLPKEV 143

Query: 470 DWRKKGAVTDVKNQGQC 520
DWRKKGAVT+VKNQGQC
Sbjct: 144 DWRKKGAVTEVKNQGQC 160

Score = 99.4 bits (246), Expect(2) = 5e-45
Identities = 45/57 (78%), Positives = 53/57 (92%)
Frame = +1

Query: 121 PSDFSIVGYSEEDLVSHDRIMELFEKWLSKYRKAYASFEEKLKRFEVFKDNLKHIDE 291
PS+ SIVGYSEEDL SH+R+MELFEK+++KYRKAY+S EEKL+RFEVFKDNL HIDE
Sbjct: 29 PSELSIVGYSEEDLASHERLMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDE 85

>emb|CAO47540.1| unnamed protein product [Vitis vinifera]
Length = 352

Score = 242 bits (617), Expect = 6e-62
Identities = 132/220 (60%), Positives = 149/220 (67%), Gaps = 1/220 (0%)
Frame = +3

Query: 684 GSCWAFSTVAAVEGINQIVTGNLTALSEQELIXXXXXXXXXXXXXLMDYAFKYIAANGGL 863
GSCWAFSTVAAVEGINQIVTGNLT LSEQELI LMDYAF +IA+NGGL
Sbjct: 156 GSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGL 215

Query: 864 HTEEDYPYLMEEGTCERKRSEDGDVVTITGYEDVPRNNEQALLKAHAHQPVSVAIEASGR 1043
H E+DYPYLMEEGTCE ++ ED D+VTI+GYEDVP +E++LLKA AHQP+SVAIEASGR
Sbjct: 216 HKEDDYPYLMEEGTCEEQK-EDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGR 274

Query: 1044 NFRFYSGKCMNKLE*NVSARSPLCRACRECLMTPLWDGAGSRRD-GRRNVGYGTAAXXXX 1220
+F+FYSG N G+ D G VGYG++
Sbjct: 275 DFQFYSGGVFN-------------------------GPCGTELDHGVAAVGYGSS--KGL 307

Query: 1221 XXIIVKNSWGSHWGEKGYIRMRRGTGKREGLCGTNKMASY 1340
IIVKNSWG WGEKGYIRM+R TGK EGLCG NKMASY
Sbjct: 308 DYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLCGINKMASY 347

Score = 112 bits (280), Expect = 8e-23
Identities = 51/75 (68%), Positives = 63/75 (84%)
Frame = +2

Query: 296 ETNKKVSTYWLGLNEFADLTHDEFKAVYLGVRTGLPRSRNTFSSNFRYEDVVELPKAMDW 475
E NK+VS+YWLGLNEFADL+H+EFK+ YLG+R PRSR+ +S FRY DV +LP+++DW
Sbjct: 82 ERNKEVSSYWLGLNEFADLSHEEFKSKYLGLRAEFPRSRD-YSGEFRYRDVADLPESVDW 140

Query: 476 RKKGAVTDVKNQGQC 520
RKKGAVT VKNQG C
Sbjct: 141 RKKGAVTHVKNQGAC 155

Score = 77.0 bits (188), Expect = 4e-12
Identities = 38/70 (54%), Positives = 50/70 (71%)
Frame = +1

Query: 127 DFSIVGYSEEDLVSHDRIMELFEKWLSKYRKAYASFEEKLKRFEVFKDNLKHIDES*DQQ 306
DFSIVGYS EDL D+++ FE W+SK+ K Y S EEKL RFEVF++NL HIDE +++
Sbjct: 28 DFSIVGYSPEDLTCIDKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDER-NKE 86

Query: 307 ESEHLLAWSE 336
S + L +E
Sbjct: 87 VSSYWLGLNE 96

>ref|NP_567983.1| XCP1 (XYLEM CYSTEINE PEPTIDASE 1); cysteine-type peptidase [Arabidopsis thaliana]
sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 precursor (AtXCP1)
gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
Length = 355

Score = 229 bits (584), Expect = 4e-58
Identities = 125/231 (54%), Positives = 151/231 (65%), Gaps = 5/231 (2%)
Frame = +3

Query: 663 VATKTNAGSCWAFSTVAAVEGINQIVTGNLTALSEQELIXXXXXXXXXXXXXLMDYAFKY 842
V + GSCWAFSTVAAVEGINQI TGNL++LSEQELI LMDYAF+Y
Sbjct: 152 VKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQY 211

Query: 843 IAANGGLHTEEDYPYLMEEGTCERKRSEDGDVVTITGYEDVPRNNEQALLKAHAHQPVSV 1022
I + GGLH E+DYPYLMEEG C+ ++ ED + VTI+GYEDVP N++++L+KA AHQPVSV
Sbjct: 212 IISTGGLHKEDDYPYLMEEGICQEQK-EDVERVTISGYEDVPENDDESLVKALAHQPVSV 270

Query: 1023 AIEASGRNFRFY-----SGKCMNKLE*NVSARSPLCRACRECLMTPLWDGAGSRRDGRRN 1187
AIEASGR+F+FY +GKC L+ V+A
Sbjct: 271 AIEASGRDFQFYKGGVFNGKCGTDLDHGVAA----------------------------- 301

Query: 1188 VGYGTAAXXXXXXIIVKNSWGSHWGEKGYIRMRRGTGKREGLCGTNKMASY 1340
VGYG++ +IVKNSWG WGEKG+IRM+R TGK EGLCG NKMASY
Sbjct: 302 VGYGSS--KGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGINKMASY 350

Score = 99.0 bits (245), Expect(2) = 7e-36
Identities = 43/75 (57%), Positives = 57/75 (76%)
Frame = +2

Query: 296 ETNKKVSTYWLGLNEFADLTHDEFKAVYLGVRTGLPRSRNTFSSNFRYEDVVELPKAMDW 475
+ N ++++YWLGLNEFADLTH+EFK YLG+ + S+NFRY D+ +LPK++DW
Sbjct: 84 QRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDW 143

Query: 476 RKKGAVTDVKNQGQC 520
RKKGAV VK+QGQC
Sbjct: 144 RKKGAVAPVKDQGQC 158

Score = 76.6 bits (187), Expect(2) = 7e-36
Identities = 33/55 (60%), Positives = 45/55 (81%)
Frame = +1

Query: 127 DFSIVGYSEEDLVSHDRIMELFEKWLSKYRKAYASFEEKLKRFEVFKDNLKHIDE 291
DFSIVGY+ E L + D+++ELFE W+S++ KAY S EEK+ RFEVF++NL HID+
Sbjct: 30 DFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQ 84

>ref|NP_564126.1| XCP2 (XYLEM CYSTEINE PEPTIDASE 2); cysteine-type peptidase/ peptidase [Arabidopsis thaliana]
sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 precursor (AtXCP2)
gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356

Score = 225 bits (573), Expect = 8e-57
Identities = 125/231 (54%), Positives = 148/231 (64%), Gaps = 5/231 (2%)
Frame = +3

Query: 663 VATKTNAGSCWAFSTVAAVEGINQIVTGNLTALSEQELIXXXXXXXXXXXXXLMDYAFKY 842
V + + GSCWAFSTVAAVEGIN+IVTGNLT LSEQELI LMDYAF+Y
Sbjct: 153 VKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEY 212

Query: 843 IAANGGLHTEEDYPYLMEEGTCERKRSEDGDVVTITGYEDVPRNNEQALLKAHAHQPVSV 1022
I NGGL EEDYPY MEEGTCE ++ E + VTI G++DVP N+E++LLKA AHQP+SV
Sbjct: 213 IVKNGGLRKEEDYPYSMEEGTCEMQKDE-SETVTINGHQDVPTNDEKSLLKALAHQPLSV 271

Query: 1023 AIEASGRNFRFYS-----GKCMNKLE*NVSARSPLCRACRECLMTPLWDGAGSRRDGRRN 1187
AI+ASGR F+FYS G+C L+ V+A
Sbjct: 272 AIDASGREFQFYSGGVFDGRCGVDLDHGVAA----------------------------- 302

Query: 1188 VGYGTAAXXXXXXIIVKNSWGSHWGEKGYIRMRRGTGKREGLCGTNKMASY 1340
VGYG++ IIVKNSWG WGEKGYIR++R TGK EGLCG NKMAS+
Sbjct: 303 VGYGSS--KGSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCGINKMASF 351

Score = 99.4 bits (246), Expect(2) = 2e-39
Identities = 46/76 (60%), Positives = 58/76 (76%), Gaps = 1/76 (1%)
Frame = +2

Query: 296 ETNKKVSTYWLGLNEFADLTHDEFKAVYLGVRTGLPRSRNTFS-SNFRYEDVVELPKAMD 472
ETNKK +YWLGLNEFADL+H+EFK +YLG++T + R S + F Y DV +PK++D
Sbjct: 84 ETNKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVD 143

Query: 473 WRKKGAVTDVKNQGQC 520
WRKKGAV +VKNQG C
Sbjct: 144 WRKKGAVAEVKNQGSC 159

Score = 88.2 bits (217), Expect(2) = 2e-39
Identities = 39/56 (69%), Positives = 47/56 (83%)
Frame = +1

Query: 127 DFSIVGYSEEDLVSHDRIMELFEKWLSKYRKAYASFEEKLKRFEVFKDNLKHIDES 294
D+SIVGYS EDL SHD+++ELFE W+S + KAY + EEK RFEVFKDNLKHIDE+
Sbjct: 30 DYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDET 85