BLASTX 2.2.17 Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bphylf046j07 (2091 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 5,815,196 sequences; 2,006,227,497 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value sp|P02879|RICI_RICCO Ricin precursor [Contains: Ricin A chain (r... 439 e-121 emb|CAA26230.1| unnamed protein product [Ricinus communis] 436 e-120 gb|AAB22582.1| proricin A chain [Ricinus communis] 426 e-117 sp|P06750|AGGL_RICCO Agglutinin precursor (RCA) [Contains: Agglu... 407 e-111 gb|AAB22584.1| agglutinin I; proRCA I [Ricinus communis] 396 e-108 >sp|P02879|RICI_RICCO Ricin precursor [Contains: Ricin A chain (rRNA N-glycosidase); Linker peptide; Ricin B chain] emb|CAA26939.1| ricin precursor [Ricinus communis] emb|CAA37095.1| pre-propolypeptide (AA -35 to 541) [Ricinus communis] Length = 576 Score = 439 bits (1130), Expect = e-121 Identities = 258/592 (43%), Positives = 336/592 (56%), Gaps = 4/592 (0%) Frame = +3 Query: 90 GGTVLLILMLNLLVAPWLCFASSDGALPPATASENATTVLNKYYPQVRLSTVGLSAPRYE 269 GG ++I M VA WLCF S+ G T +N + K YP + +T G + Y Sbjct: 4 GGNTIVIWMY--AVATWLCFGSTSGW--SFTLEDN--NIFPKQYPIINFTTAGATVQSYT 57 Query: 270 NFIAAVRAALKK--DEIHGIPVLRCNPYPLLSERYLFVTLTNKAWYSITLKLDVTGAYFT 443 NFI AVR L D H IPVL +++R++ V L+N A S+TL LDVT AY Sbjct: 58 NFIRAVRGRLTTGADVRHEIPVLPNRVGLPINQRFILVELSNHAELSVTLALDVTNAYVV 117 Query: 444 AYRAGNYSCDIHKIISGLSVTCKYARGTSSHVLMEPSSTGSHPVDHLAEDL-EGVRWGTN 620 YRAGN + H + + + ++ LA +L E + G Sbjct: 118 GYRAGNSAYFFHPDNQEDAEAITHLFTDVQNRYTFAFGGNYDRLEQLAGNLRENIELGNG 177 Query: 621 ALDEAISSLYRFPLGMATIREWADGIRTCIMMITNAARFQYIERRMSAAIRHGNDETEDP 800 L+EAIS+LY + G + A CI MI+ AARFQYIE M IR+ DP Sbjct: 178 PLEEAISALYYYSTGGTQLPTLARSFIICIQMISEAARFQYIEGEMRTRIRYNRRSAPDP 237 Query: 801 SLHSLALRWRDLSAAIEESHQGVFAAPITVLRRSNEVLPVDSVRRAAPFIALMYCHCQKP 980 S+ +L W LS AI+ES+QG FA+PI + RR+ V V P IALM C P Sbjct: 238 SVITLENSWGRLSTAIQESNQGAFASPIQLQRRNGSKFSVYDVSILIPIIALMVYRCAPP 297 Query: 981 LKETQPLPWVDLFSDEIPMLIRSVVQGSADAPACKETEPTSRIVGPDGNCVDVRNGWYQD 1160 S + +LIR VV + +A C + EP RIVG +G CVDVR+G + + Sbjct: 298 P------------SSQFSLLIRPVVP-NFNADVCMDPEPIVRIVGRNGLCVDVRDGRFHN 344 Query: 1161 GAMVQLWPCKSYTAVNQLWTFKRDGTIRSNGNYCLTASGSTPGDYVMIFRCPDSPTDAVV 1340 G +QLWPCKS T NQLWT KRD TIRSNG CLT G +PG YVMI+ C + TDA Sbjct: 345 GNAIQLWPCKSNTDANQLWTLKRDNTIRSNGK-CLTTYGYSPGVYVMIYDCNTAATDATR 403 Query: 1341 WEVRDDGTIVS-KSGLVLSASSTASYTVLAVQTDNRSTGQSWTPTNDTNPFMAAIVGYHD 1517 W++ D+GTI++ +S LVL+A+S S T L VQT+ + Q W PTN+T PF+ IVG + Sbjct: 404 WQIWDNGTIINPRSSLVLAATSGNSGTTLTVQTNIYAVSQGWLPTNNTQPFVTTIVGLYG 463 Query: 1518 LCLQVDGEDVWVASCVGGKPEQAWALYPDGSIRPKQKQDGCLAPDAKNELKLVQVFPCDP 1697 LCLQ + VW+ C K EQ WALY DGSIRP+Q +D CL D+ +V++ C P Sbjct: 464 LCLQANSGQVWIEDCSSEKAEQQWALYADGSIRPQQNRDNCLTSDSNIRETVVKILSCGP 523 Query: 1698 TSSGQRWVFWSDGSILNLGTELVMDVRGSDPSLKQIIVYTATGNPNQKWAPM 1853 SSGQRW+F +DG+ILNL + LV+DVR SDPSLKQII+Y G+PNQ W P+ Sbjct: 524 ASSGQRWMFKNDGTILNLYSGLVLDVRASDPSLKQIILYPLHGDPNQIWLPL 575 >emb|CAA26230.1| unnamed protein product [Ricinus communis] Length = 565 Score = 436 bits (1122), Expect = e-120 Identities = 254/579 (43%), Positives = 330/579 (56%), Gaps = 4/579 (0%) Frame = +3 Query: 129 VAPWLCFASSDGALPPATASENATTVLNKYYPQVRLSTVGLSAPRYENFIAAVRAALKK- 305 VA WLCF S+ G T +N + K YP + +T G + Y NFI AVR L Sbjct: 4 VATWLCFGSTSGW--SFTLEDN--NIFPKQYPIINFTTAGATVQSYTNFIRAVRGRLTTG 59 Query: 306 -DEIHGIPVLRCNPYPLLSERYLFVTLTNKAWYSITLKLDVTGAYFTAYRAGNYSCDIHK 482 D H IPVL +++R++ V L+N A S+TL LDVT AY YRAGN + H Sbjct: 60 ADVRHDIPVLPNRVGLPINQRFILVELSNHAELSVTLALDVTNAYVVGYRAGNSAYFFHP 119 Query: 483 IISGLSVTCKYARGTSSHVLMEPSSTGSHPVDHLAEDL-EGVRWGTNALDEAISSLYRFP 659 + + + ++ LA +L E + G L+EAIS+LY + Sbjct: 120 DNQEDAEAITHLFTDVQNRYTFAFGGNYDRLEQLAGNLRENIELGNGPLEEAISALYYYS 179 Query: 660 LGMATIREWADGIRTCIMMITNAARFQYIERRMSAAIRHGNDETEDPSLHSLALRWRDLS 839 G + A CI MI+ AARFQYIE M IR+ DPS+ +L W LS Sbjct: 180 TGGTQLPTLARSFIICIQMISEAARFQYIEGEMRTRIRYNRRSAPDPSVITLENSWGRLS 239 Query: 840 AAIEESHQGVFAAPITVLRRSNEVLPVDSVRRAAPFIALMYCHCQKPLKETQPLPWVDLF 1019 AI+ES+QG FA+PI + RR+ V V P IALM C P Sbjct: 240 TAIQESNQGAFASPIQLQRRNGSKFSVYDVSILIPIIALMVYRCAPPP------------ 287 Query: 1020 SDEIPMLIRSVVQGSADAPACKETEPTSRIVGPDGNCVDVRNGWYQDGAMVQLWPCKSYT 1199 S + +LIR VV + +A C + EP RIVG +G CVDVR+G + +G +QLWPCKS T Sbjct: 288 SSQFSLLIRPVVP-NFNADVCMDPEPIVRIVGRNGLCVDVRDGRFHNGNAIQLWPCKSNT 346 Query: 1200 AVNQLWTFKRDGTIRSNGNYCLTASGSTPGDYVMIFRCPDSPTDAVVWEVRDDGTIVS-K 1376 NQLWT KRD TIRSNG CLT G +PG YVMI+ C + TDA W++ D+GTI++ + Sbjct: 347 DANQLWTLKRDNTIRSNGK-CLTTYGYSPGVYVMIYDCNTAATDATRWQIWDNGTIINPR 405 Query: 1377 SGLVLSASSTASYTVLAVQTDNRSTGQSWTPTNDTNPFMAAIVGYHDLCLQVDGEDVWVA 1556 S LVL+A+S S T L VQT+ + Q W PTN+T PF+ IVG + LCLQ + VW+ Sbjct: 406 SSLVLAATSGNSGTTLTVQTNIYAVSQGWLPTNNTQPFVTTIVGLYGLCLQANSGQVWIE 465 Query: 1557 SCVGGKPEQAWALYPDGSIRPKQKQDGCLAPDAKNELKLVQVFPCDPTSSGQRWVFWSDG 1736 C K EQ WALY DGSIRP+Q +D CL D+ +V++ C P SSGQRW+F +DG Sbjct: 466 DCSSEKAEQQWALYADGSIRPQQNRDNCLTSDSNIRETVVKILSCGPASSGQRWMFKNDG 525 Query: 1737 SILNLGTELVMDVRGSDPSLKQIIVYTATGNPNQKWAPM 1853 +ILNL + LV+DVR SDPSLKQII+Y G+PNQ W P+ Sbjct: 526 TILNLYSGLVLDVRRSDPSLKQIILYPLHGDPNQIWLPL 564 >gb|AAB22582.1| proricin A chain [Ricinus communis] Length = 541 Score = 426 bits (1094), Expect = e-117 Identities = 244/554 (44%), Positives = 318/554 (57%), Gaps = 4/554 (0%) Frame = +3 Query: 204 VLNKYYPQVRLSTVGLSAPRYENFIAAVRAALKK--DEIHGIPVLRCNPYPLLSERYLFV 377 + K YP + +T G + Y NFI AVR L D H IPVL +++R++ V Sbjct: 1 IFPKQYPIINFTTAGATVQSYTNFIRAVRGRLTTGADVRHDIPVLPNRVGLPINQRFILV 60 Query: 378 TLTNKAWYSITLKLDVTGAYFTAYRAGNYSCDIHKIISGLSVTCKYARGTSSHVLMEPSS 557 L+N A S+TL LDVT AY YRAGN + H + + + Sbjct: 61 ELSNHAELSVTLALDVTNAYVVGYRAGNSAYFFHPDNQEDAEAITHLFTDVQNRYTFAFG 120 Query: 558 TGSHPVDHLAEDL-EGVRWGTNALDEAISSLYRFPLGMATIREWADGIRTCIMMITNAAR 734 ++ LA +L E + G L+EAIS+LY + G + A CI MI+ AAR Sbjct: 121 GNYDRLEQLAGNLRENIELGNGPLEEAISALYYYSTGGTQLPTLARSFIICIQMISEAAR 180 Query: 735 FQYIERRMSAAIRHGNDETEDPSLHSLALRWRDLSAAIEESHQGVFAAPITVLRRSNEVL 914 FQYIE M IR+ DPS+ +L W LS AI+ES+QG FA+PI + RR+ Sbjct: 181 FQYIEGEMRTRIRYNRRSAPDPSVITLENSWGRLSTAIQESNQGAFASPIQLQRRNGSKF 240 Query: 915 PVDSVRRAAPFIALMYCHCQKPLKETQPLPWVDLFSDEIPMLIRSVVQGSADAPACKETE 1094 V V P IALM C P S + +LIR VV + +A C + E Sbjct: 241 SVYDVSILIPIIALMVYRCAPPP------------SSQFSLLIRPVVP-NFNADVCMDPE 287 Query: 1095 PTSRIVGPDGNCVDVRNGWYQDGAMVQLWPCKSYTAVNQLWTFKRDGTIRSNGNYCLTAS 1274 P RIVG +G CVDVR+G + +G +QLWPCKS T NQLWT KRD TIRSNG CLT Sbjct: 288 PIVRIVGRNGLCVDVRDGRFHNGNAIQLWPCKSNTDANQLWTLKRDNTIRSNGK-CLTTY 346 Query: 1275 GSTPGDYVMIFRCPDSPTDAVVWEVRDDGTIVS-KSGLVLSASSTASYTVLAVQTDNRST 1451 G +PG YVMI+ C + TDA W++ D+GTI++ +S LVL+A+S S T L VQT+ + Sbjct: 347 GYSPGVYVMIYDCNTAATDATRWQIWDNGTIINPRSSLVLAATSGNSGTTLTVQTNIYAV 406 Query: 1452 GQSWTPTNDTNPFMAAIVGYHDLCLQVDGEDVWVASCVGGKPEQAWALYPDGSIRPKQKQ 1631 Q W PTN+T PF+ IVG + LCLQ + VW+ C K EQ WALY DGSIRP+Q + Sbjct: 407 SQGWLPTNNTQPFVTTIVGLYGLCLQANSGQVWIEDCSSEKAEQQWALYADGSIRPQQNR 466 Query: 1632 DGCLAPDAKNELKLVQVFPCDPTSSGQRWVFWSDGSILNLGTELVMDVRGSDPSLKQIIV 1811 D CL D+ +V++ C P SSGQRW+F +DG+ILNL + LV+DVR SDPSLKQII+ Sbjct: 467 DNCLTSDSNIRETVVKILSCGPASSGQRWMFKNDGTILNLYSGLVLDVRRSDPSLKQIIL 526 Query: 1812 YTATGNPNQKWAPM 1853 Y G+PNQ W P+ Sbjct: 527 YPLHGDPNQIWLPL 540 >sp|P06750|AGGL_RICCO Agglutinin precursor (RCA) [Contains: Agglutinin A chain (rRNA N-glycosidase); Agglutinin B chain] gb|AAA33869.1| prepro-agglutinin Length = 564 Score = 407 bits (1045), Expect = e-111 Identities = 244/578 (42%), Positives = 319/578 (55%), Gaps = 3/578 (0%) Frame = +3 Query: 129 VAPWLCFASSDGALPPATASENATTVLNKYYPQVRLSTVGLSAPRYENFIAAVRAALKK- 305 VA WLCF S+ G T +N + K YP + +T + Y NFI AVR+ L Sbjct: 4 VATWLCFGSTSGW--SFTLEDN--NIFPKQYPIINFTTADATVESYTNFIRAVRSHLTTG 59 Query: 306 -DEIHGIPVLRCNPYPLLSERYLFVTLTNKAWYSITLKLDVTGAYFTAYRAGNYSCDIHK 482 D H IPVL +S+R++ V L+N A S+TL LDVT AY RAGN + H Sbjct: 60 ADVRHEIPVLPNRVGLPISQRFILVELSNHAELSVTLALDVTNAYVVGCRAGNSAYFFHP 119 Query: 483 IISGLSVTCKYARGTSSHVLMEPSSTGSHPVDHLAEDLEGVRWGTNALDEAISSLYRFPL 662 + + + ++ L E + GT L++AIS+LY + Sbjct: 120 DNQEDAEAITHLFTDVQNSFTFAFGGNYDRLEQLGGLRENIELGTGPLEDAISALYYYST 179 Query: 663 GMATIREWADGIRTCIMMITNAARFQYIERRMSAAIRHGNDETEDPSLHSLALRWRDLSA 842 I A CI MI+ AARFQYIE M IR+ DPS+ +L W LS Sbjct: 180 CGTQIPTLARSFMVCIQMISEAARFQYIEGEMRTRIRYNRRSAPDPSVITLENSWGRLST 239 Query: 843 AIEESHQGVFAAPITVLRRSNEVLPVDSVRRAAPFIALMYCHCQKPLKETQPLPWVDLFS 1022 AI+ES+QG FA+PI + RR+ V V P IALM C P S Sbjct: 240 AIQESNQGAFASPIQLQRRNGSKFNVYDVSILIPIIALMVYRCAPPP------------S 287 Query: 1023 DEIPMLIRSVVQGSADAPACKETEPTSRIVGPDGNCVDVRNGWYQDGAMVQLWPCKSYTA 1202 + +LIR VV + +A C + EP RIVG +G CVDV + DG +QLWPCKS T Sbjct: 288 SQFSLLIRPVVP-NFNADVCMDPEPIVRIVGRNGLCVDVTGEEFFDGNPIQLWPCKSNTD 346 Query: 1203 VNQLWTFKRDGTIRSNGNYCLTASGSTPGDYVMIFRCPDSPTDAVVWEVRDDGTIVS-KS 1379 NQLWT ++D TIRSNG CLT S S+P V+I+ C + A W++ D+ TI++ +S Sbjct: 347 WNQLWTLRKDSTIRSNGK-CLTISKSSPRQQVVIYNCSTATVGATRWQIWDNRTIINPRS 405 Query: 1380 GLVLSASSTASYTVLAVQTDNRSTGQSWTPTNDTNPFMAAIVGYHDLCLQVDGEDVWVAS 1559 GLVL+A+S S T L VQT+ + Q W PTN+T PF+ IVG + +CLQ + VW+ Sbjct: 406 GLVLAATSGNSGTKLTVQTNIYAVSQGWLPTNNTQPFVTTIVGLYGMCLQANSGKVWLED 465 Query: 1560 CVGGKPEQAWALYPDGSIRPKQKQDGCLAPDAKNELKLVQVFPCDPTSSGQRWVFWSDGS 1739 C K EQ WALY DGSIRP+Q +D CL DA + +V++ C P SSGQRW+F +DG+ Sbjct: 466 CTSEKAEQQWALYADGSIRPQQNRDNCLTTDANIKGTVVKILSCGPASSGQRWMFKNDGT 525 Query: 1740 ILNLGTELVMDVRGSDPSLKQIIVYTATGNPNQKWAPM 1853 ILNL LV+DVR SDPSLKQIIV+ GN NQ W P+ Sbjct: 526 ILNLYNGLVLDVRRSDPSLKQIIVHPFHGNLNQIWLPL 563 >gb|AAB22584.1| agglutinin I; proRCA I [Ricinus communis] Length = 540 Score = 396 bits (1017), Expect = e-108 Identities = 234/553 (42%), Positives = 307/553 (55%), Gaps = 3/553 (0%) Frame = +3 Query: 204 VLNKYYPQVRLSTVGLSAPRYENFIAAVRAALKK--DEIHGIPVLRCNPYPLLSERYLFV 377 + K YP + +T + Y NFI AVR+ L D H IPVL +S+R++ V Sbjct: 1 IFPKQYPIINFTTADATVESYTNFIRAVRSHLTTGADVRHEIPVLPNRVGLPISQRFILV 60 Query: 378 TLTNKAWYSITLKLDVTGAYFTAYRAGNYSCDIHKIISGLSVTCKYARGTSSHVLMEPSS 557 L+N A S+TL LDVT AY RAGN + H + + + Sbjct: 61 ELSNHAELSVTLALDVTNAYVVGCRAGNSAYFFHPDNQEDAEAITHLFTDVQNSFTFAFG 120 Query: 558 TGSHPVDHLAEDLEGVRWGTNALDEAISSLYRFPLGMATIREWADGIRTCIMMITNAARF 737 ++ L E + GT L++AIS+LY + I A CI MI+ AARF Sbjct: 121 GNYDRLEQLGGLRENIELGTGPLEDAISALYYYSTCGTQIPTLARSFMVCIQMISEAARF 180 Query: 738 QYIERRMSAAIRHGNDETEDPSLHSLALRWRDLSAAIEESHQGVFAAPITVLRRSNEVLP 917 QYIE M IR+ DPS+ +L W LS AI+ES+QG FA+PI + RR+ Sbjct: 181 QYIEGEMRTRIRYNRRSAPDPSVITLENSWGRLSTAIQESNQGAFASPIQLQRRNGSKFN 240 Query: 918 VDSVRRAAPFIALMYCHCQKPLKETQPLPWVDLFSDEIPMLIRSVVQGSADAPACKETEP 1097 V V P IALM C P S + +LIR VV + +A C + EP Sbjct: 241 VYDVSILIPIIALMVYRCAPPP------------SSQFSLLIRPVVP-NFNADVCMDPEP 287 Query: 1098 TSRIVGPDGNCVDVRNGWYQDGAMVQLWPCKSYTAVNQLWTFKRDGTIRSNGNYCLTASG 1277 RIVG +G CVDV + DG +QLWPCKS T NQLWT ++D TIRSNG CLT S Sbjct: 288 IVRIVGRNGLCVDVTGEEFFDGNPIQLWPCKSNTDWNQLWTLRKDSTIRSNGK-CLTISK 346 Query: 1278 STPGDYVMIFRCPDSPTDAVVWEVRDDGTIVS-KSGLVLSASSTASYTVLAVQTDNRSTG 1454 S+P V+I+ C + A W++ D+ TI++ +SGLVL+A+S S T L VQT+ + Sbjct: 347 SSPRQQVVIYNCSTATVGATRWQIWDNRTIINPRSGLVLAATSGNSGTKLTVQTNIYAVS 406 Query: 1455 QSWTPTNDTNPFMAAIVGYHDLCLQVDGEDVWVASCVGGKPEQAWALYPDGSIRPKQKQD 1634 Q W PTN+T PF+ IVG + +CLQ + VW+ C K EQ WALY DGSIRP+Q +D Sbjct: 407 QGWLPTNNTQPFVTTIVGLYGMCLQANSGKVWLEDCTSEKAEQQWALYADGSIRPQQNRD 466 Query: 1635 GCLAPDAKNELKLVQVFPCDPTSSGQRWVFWSDGSILNLGTELVMDVRGSDPSLKQIIVY 1814 CL DA + +V++ C P SSGQRW+F +DG+ILNL LV+DVR SDPSLKQIIV+ Sbjct: 467 NCLTTDANIKGTVVKILSCGPASSGQRWMFKNDGTILNLYNGLVLDVRRSDPSLKQIIVH 526 Query: 1815 TATGNPNQKWAPM 1853 GN NQ W P+ Sbjct: 527 PFHGNLNQIWLPL 539