BLASTX 2.2.17 Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bphyst015h15 (1410 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 5,815,196 sequences; 2,006,227,497 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CAO49803.1| unnamed protein product [Vitis vinifera] 558 e-157 gb|EAZ34923.1| hypothetical protein OsJ_018406 [Oryza sativa (ja... 539 e-151 gb|EAY98680.1| hypothetical protein OsI_019913 [Oryza sativa (in... 536 e-150 ref|NP_179037.2| PIP (proline iminopeptidase); prolyl aminopepti... 525 e-147 ref|NP_973454.1| PIP (proline iminopeptidase); prolyl aminopepti... 525 e-147 >emb|CAO49803.1| unnamed protein product [Vitis vinifera] Length = 331 Score = 558 bits (1437), Expect = e-157 Identities = 256/327 (78%), Positives = 292/327 (89%), Gaps = 2/327 (0%) Frame = +3 Query: 228 MDLSGQ--PLRRDLYPHIEPYDSGFLNVSGVHTIYYEQSGNPHGHPVVFLHGGPGAGTSP 401 MDL + L R+LYP IEPY SGFL VS +H+IY+EQSGNP+GHPVVF+HGGPG GTSP Sbjct: 1 MDLGKEVPELNRNLYPPIEPYSSGFLKVSDLHSIYWEQSGNPNGHPVVFIHGGPGGGTSP 60 Query: 402 GNRRFFDPEFYRIVLFDQRGAGRSTPHACLEENTTWDLVADIEKLREHLDIPEWQVFGGS 581 NR FFDP+FYRI+LFDQRGAG+STPHACL +NTTWDLV DIEKLREHL+IPEWQVFGGS Sbjct: 61 SNRTFFDPDFYRIILFDQRGAGKSTPHACLVDNTTWDLVNDIEKLREHLEIPEWQVFGGS 120 Query: 582 WGSTLALAYSQNHPDKVTGIVLRGIFLLRKKELDWFYEGGAAAIFPDAWEPFRDFIPEDE 761 WGSTLALAYSQ+HPDKVTG+VLRGIFLLRKKELDWFYEGGAAAI+PDAWEPFRD IPE+E Sbjct: 121 WGSTLALAYSQSHPDKVTGMVLRGIFLLRKKELDWFYEGGAAAIYPDAWEPFRDLIPENE 180 Query: 762 RNCFITAYSKRLTSSDTDVQVEAAKRWTMWEMMTAHLIQNHENIKRGDDDKFSLAFARIE 941 R+C I AY KRL S D + Q AA+ WT WEMMTAHL+ N ENIK+GDDDKFSLAFARIE Sbjct: 181 RDCLIDAYHKRLNSDDMETQYAAARAWTKWEMMTAHLLPNEENIKKGDDDKFSLAFARIE 240 Query: 942 NHYFVNKGFLSSDSYLLYNVDKTRHIKAFIVQGRYDVCCPMMSAWDLHKAWPEAEFKVVP 1121 NHYFVNKGF SDS+LL N++K RHI A IVQGRYD+CCP+M+AWDLHKAWPEA+FK+VP Sbjct: 241 NHYFVNKGFFPSDSFLLDNIEKIRHINATIVQGRYDMCCPIMTAWDLHKAWPEADFKIVP 300 Query: 1122 DAGHSANEVGVAAELVSANEKLKSMLR 1202 DAGHSANE+G+AAELV+ANEKLK++++ Sbjct: 301 DAGHSANELGIAAELVAANEKLKNIIK 327 >gb|EAZ34923.1| hypothetical protein OsJ_018406 [Oryza sativa (japonica cultivar-group)] Length = 294 Score = 539 bits (1389), Expect = e-151 Identities = 264/321 (82%), Positives = 271/321 (84%) Frame = +3 Query: 243 QPLRRDLYPHIEPYDSGFLNVSGVHTIYYEQSGNPHGHPVVFLHGGPGAGTSPGNRRFFD 422 QPLR+DLYP EPYD GFL VSGVHTIYYEQSGNP GHPVVFLHGGPGAGTSPGNRRFFD Sbjct: 11 QPLRKDLYPQTEPYDFGFLKVSGVHTIYYEQSGNPQGHPVVFLHGGPGAGTSPGNRRFFD 70 Query: 423 PEFYRIVLFDQRGAGRSTPHACLEENTTWDLVADIEKLREHLDIPEWQVFGGSWGSTLAL 602 PEF+RIVLFDQ VFGGSWGSTLAL Sbjct: 71 PEFFRIVLFDQ-------------------------------------VFGGSWGSTLAL 93 Query: 603 AYSQNHPDKVTGIVLRGIFLLRKKELDWFYEGGAAAIFPDAWEPFRDFIPEDERNCFITA 782 AYS++HPDKVTGIVLRGIFLLRKKELDWFYEGGAAAIFPDAWEPFRDFIPEDERNCFI A Sbjct: 94 AYSESHPDKVTGIVLRGIFLLRKKELDWFYEGGAAAIFPDAWEPFRDFIPEDERNCFIAA 153 Query: 783 YSKRLTSSDTDVQVEAAKRWTMWEMMTAHLIQNHENIKRGDDDKFSLAFARIENHYFVNK 962 YSKRLTSSD DVQ EAAKRWTMWEMMTAHLIQNHENIKRG+DDKFSLAFARIENHYFVNK Sbjct: 154 YSKRLTSSDADVQAEAAKRWTMWEMMTAHLIQNHENIKRGEDDKFSLAFARIENHYFVNK 213 Query: 963 GFLSSDSYLLYNVDKTRHIKAFIVQGRYDVCCPMMSAWDLHKAWPEAEFKVVPDAGHSAN 1142 GFL SDS+LL NVDK RHIKAFIVQGRYDVCCPMMSAWDLHKAWPEAEFK+VPDAGHSAN Sbjct: 214 GFLPSDSHLLDNVDKIRHIKAFIVQGRYDVCCPMMSAWDLHKAWPEAEFKMVPDAGHSAN 273 Query: 1143 EVGVAAELVSANEKLKSMLRK 1205 EVGVAAELVSANEKLKSM K Sbjct: 274 EVGVAAELVSANEKLKSMFTK 294 >gb|EAY98680.1| hypothetical protein OsI_019913 [Oryza sativa (indica cultivar-group)] Length = 294 Score = 536 bits (1381), Expect = e-150 Identities = 263/321 (81%), Positives = 270/321 (84%) Frame = +3 Query: 243 QPLRRDLYPHIEPYDSGFLNVSGVHTIYYEQSGNPHGHPVVFLHGGPGAGTSPGNRRFFD 422 Q LR+DLYP EPYD GFL VSGVHTIYYEQSGNP GHPVVFLHGGPGAGTSPGNRRFFD Sbjct: 11 QQLRKDLYPQTEPYDFGFLKVSGVHTIYYEQSGNPQGHPVVFLHGGPGAGTSPGNRRFFD 70 Query: 423 PEFYRIVLFDQRGAGRSTPHACLEENTTWDLVADIEKLREHLDIPEWQVFGGSWGSTLAL 602 PEF+RIVLFDQ VFGGSWGSTLAL Sbjct: 71 PEFFRIVLFDQ-------------------------------------VFGGSWGSTLAL 93 Query: 603 AYSQNHPDKVTGIVLRGIFLLRKKELDWFYEGGAAAIFPDAWEPFRDFIPEDERNCFITA 782 AYS++HPDKVTGIVLRGIFLLRKKELDWFYEGGAAAIFPDAWEPFRDFIPEDERNCFI A Sbjct: 94 AYSESHPDKVTGIVLRGIFLLRKKELDWFYEGGAAAIFPDAWEPFRDFIPEDERNCFIAA 153 Query: 783 YSKRLTSSDTDVQVEAAKRWTMWEMMTAHLIQNHENIKRGDDDKFSLAFARIENHYFVNK 962 YSKRLTSSD DVQ EAAKRWTMWEMMTAHLIQNHENIKRG+DDKFSLAFARIENHYFVNK Sbjct: 154 YSKRLTSSDADVQAEAAKRWTMWEMMTAHLIQNHENIKRGEDDKFSLAFARIENHYFVNK 213 Query: 963 GFLSSDSYLLYNVDKTRHIKAFIVQGRYDVCCPMMSAWDLHKAWPEAEFKVVPDAGHSAN 1142 GFL SDS+LL NVDK RHIKAFIVQGRYDVCCPMMSAWDLHKAWPEAEFK+VPDAGHSAN Sbjct: 214 GFLPSDSHLLDNVDKIRHIKAFIVQGRYDVCCPMMSAWDLHKAWPEAEFKMVPDAGHSAN 273 Query: 1143 EVGVAAELVSANEKLKSMLRK 1205 EVGVAAELVSANEKLKSM K Sbjct: 274 EVGVAAELVSANEKLKSMFTK 294 >ref|NP_179037.2| PIP (proline iminopeptidase); prolyl aminopeptidase [Arabidopsis thaliana] sp|P93732|PIP_ARATH Proline iminopeptidase (PIP) (Prolyl aminopeptidase) (PAP) gb|AAL24398.1| proline iminopeptidase [Arabidopsis thaliana] gb|AAM48009.1| proline iminopeptidase [Arabidopsis thaliana] Length = 380 Score = 525 bits (1353), Expect = e-147 Identities = 247/317 (77%), Positives = 275/317 (86%), Gaps = 1/317 (0%) Frame = +3 Query: 252 RRDLYPHIEPYDSGFLNVSGVHTIYYEQSGNPHGHPVVFLHGGPGAGTSPGNRRFFDPEF 431 +R LY IEPY SG L VS VHT+Y+EQSG P GHPVVFLHGGPG GT+P NRRFFDPEF Sbjct: 63 KRTLYAPIEPYSSGNLKVSDVHTLYWEQSGKPDGHPVVFLHGGPGGGTAPSNRRFFDPEF 122 Query: 432 YRIVLFDQRGAGRSTPHACLEENTTWDLVADIEKLREHLDIPEWQVFGGSWGSTLALAYS 611 YRIVLFDQRGAG+STPHACLEENTTWDLV DIEKLREHL IPEW VFGGSWGSTLALAYS Sbjct: 123 YRIVLFDQRGAGKSTPHACLEENTTWDLVNDIEKLREHLKIPEWLVFGGSWGSTLALAYS 182 Query: 612 QNHPDKVTGIVLRGIFLLRKKELDWFYEGGAAAIFPDAWEPFRDFIPEDER-NCFITAYS 788 Q+HPDKVTG+VLRGIFLLRKKE+DWFYEGGAAAI+PDAWE FRD IPE+ER + + AY Sbjct: 183 QSHPDKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWEEFRDLIPENERGSSLVDAYH 242 Query: 789 KRLTSSDTDVQVEAAKRWTMWEMMTAHLIQNHENIKRGDDDKFSLAFARIENHYFVNKGF 968 KRL S D ++Q AA+ WT WEMMTA+L N EN+++ +DDKFSLAFARIENHYFVNKGF Sbjct: 243 KRLNSDDLEIQYAAARAWTKWEMMTAYLRPNLENVQKAEDDKFSLAFARIENHYFVNKGF 302 Query: 969 LSSDSYLLYNVDKTRHIKAFIVQGRYDVCCPMMSAWDLHKAWPEAEFKVVPDAGHSANEV 1148 SDS+LL NVDK RHIK IVQGRYDVCCPMMSAWDLHKAWPEAE K+V DAGHSANE Sbjct: 303 FPSDSHLLDNVDKIRHIKTTIVQGRYDVCCPMMSAWDLHKAWPEAELKIVYDAGHSANEP 362 Query: 1149 GVAAELVSANEKLKSML 1199 G++AELV ANEK+K+++ Sbjct: 363 GISAELVVANEKMKALM 379 >ref|NP_973454.1| PIP (proline iminopeptidase); prolyl aminopeptidase [Arabidopsis thaliana] gb|AAD20113.1| proline iminopeptidase [Arabidopsis thaliana] Length = 329 Score = 525 bits (1353), Expect = e-147 Identities = 247/317 (77%), Positives = 275/317 (86%), Gaps = 1/317 (0%) Frame = +3 Query: 252 RRDLYPHIEPYDSGFLNVSGVHTIYYEQSGNPHGHPVVFLHGGPGAGTSPGNRRFFDPEF 431 +R LY IEPY SG L VS VHT+Y+EQSG P GHPVVFLHGGPG GT+P NRRFFDPEF Sbjct: 12 KRTLYAPIEPYSSGNLKVSDVHTLYWEQSGKPDGHPVVFLHGGPGGGTAPSNRRFFDPEF 71 Query: 432 YRIVLFDQRGAGRSTPHACLEENTTWDLVADIEKLREHLDIPEWQVFGGSWGSTLALAYS 611 YRIVLFDQRGAG+STPHACLEENTTWDLV DIEKLREHL IPEW VFGGSWGSTLALAYS Sbjct: 72 YRIVLFDQRGAGKSTPHACLEENTTWDLVNDIEKLREHLKIPEWLVFGGSWGSTLALAYS 131 Query: 612 QNHPDKVTGIVLRGIFLLRKKELDWFYEGGAAAIFPDAWEPFRDFIPEDER-NCFITAYS 788 Q+HPDKVTG+VLRGIFLLRKKE+DWFYEGGAAAI+PDAWE FRD IPE+ER + + AY Sbjct: 132 QSHPDKVTGLVLRGIFLLRKKEIDWFYEGGAAAIYPDAWEEFRDLIPENERGSSLVDAYH 191 Query: 789 KRLTSSDTDVQVEAAKRWTMWEMMTAHLIQNHENIKRGDDDKFSLAFARIENHYFVNKGF 968 KRL S D ++Q AA+ WT WEMMTA+L N EN+++ +DDKFSLAFARIENHYFVNKGF Sbjct: 192 KRLNSDDLEIQYAAARAWTKWEMMTAYLRPNLENVQKAEDDKFSLAFARIENHYFVNKGF 251 Query: 969 LSSDSYLLYNVDKTRHIKAFIVQGRYDVCCPMMSAWDLHKAWPEAEFKVVPDAGHSANEV 1148 SDS+LL NVDK RHIK IVQGRYDVCCPMMSAWDLHKAWPEAE K+V DAGHSANE Sbjct: 252 FPSDSHLLDNVDKIRHIKTTIVQGRYDVCCPMMSAWDLHKAWPEAELKIVYDAGHSANEP 311 Query: 1149 GVAAELVSANEKLKSML 1199 G++AELV ANEK+K+++ Sbjct: 312 GISAELVVANEKMKALM 328