BLASTX 2.2.17 Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bphyem212k10 (1407 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 5,815,196 sequences; 2,006,227,497 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa (jap... 530 e-148 gb|EAY88098.1| hypothetical protein OsI_009331 [Oryza sativa (in... 479 e-135 gb|EAZ25161.1| hypothetical protein OsJ_008644 [Oryza sativa (ja... 479 e-135 ref|NP_564109.1| oxidoreductase, 2OG-Fe(II) oxygenase family pro... 416 e-114 gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Ara... 414 e-114 >dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa (japonica cultivar-group)] Length = 310 Score = 530 bits (1364), Expect = e-148 Identities = 260/310 (83%), Positives = 270/310 (87%), Gaps = 6/310 (1%) Frame = +3 Query: 111 MAPSRPLMSGIRPPRVFPKRGGRTSPYDVXXXXXXXXXXXXXXXXXFGVISLPVSAPIAT 290 MAPSRPLM GIRPPRVFP RGGRTSP + FGV SLPVSAP A Sbjct: 1 MAPSRPLMRGIRPPRVFPTRGGRTSPLALALAALLLASALLLALIAFGVFSLPVSAPNAA 60 Query: 291 T------GGEAESTDSRPARPRGRRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEEC 452 T GG+AE D RP R R RRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEEC Sbjct: 61 TTDSAAAGGDAEPADPRPPRTRARRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEEC 120 Query: 453 EYLIGLAKPHMVKSTVVDSTTGKSEDSRVRTSSGMFLQRGQDKVIRAIEKRIADYTFIPV 632 +YLIGLAKPHMVKSTVVDSTTGKS+DSRVRTSSGMFLQRG+DKVIRAIEKRIADYTFIP+ Sbjct: 121 DYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPM 180 Query: 633 EHGEGLQVLHYEVGQKYEPHFDYFFDEFNTKNGGQRVATLLMYLSDVEEGGETIFPDANV 812 EHGEGLQVLHYEVGQKYEPHFDYF DE+NTKNGGQR+ATLLMYLSDVEEGGETIFPDANV Sbjct: 181 EHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRMATLLMYLSDVEEGGETIFPDANV 240 Query: 813 NNSSLPWYTELSECARRGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVINGNKWSST 992 N+SSLPWY ELSECAR+GLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVI GNKWSST Sbjct: 241 NSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSST 300 Query: 993 KWMHVHEYKA 1022 KWMHV EYKA Sbjct: 301 KWMHVREYKA 310 >gb|EAY88098.1| hypothetical protein OsI_009331 [Oryza sativa (indica cultivar-group)] Length = 387 Score = 479 bits (1233), Expect(2) = e-135 Identities = 238/286 (83%), Positives = 248/286 (86%), Gaps = 6/286 (2%) Frame = +3 Query: 111 MAPSRPLMSGIRPPRVFPKRGGRTSPYDVXXXXXXXXXXXXXXXXXFGVISLPVSAPIAT 290 MAPSRPLM GIRPPRVFP RGGRTSP + FGV SLPVSAP A Sbjct: 1 MAPSRPLMRGIRPPRVFPTRGGRTSPLALALAALLLASALLLTLIAFGVFSLPVSAPNAA 60 Query: 291 T------GGEAESTDSRPARPRGRRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEEC 452 T GG+AE D RP R R RRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEEC Sbjct: 61 TTDSAAAGGDAEPADPRPPRTRARRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEEC 120 Query: 453 EYLIGLAKPHMVKSTVVDSTTGKSEDSRVRTSSGMFLQRGQDKVIRAIEKRIADYTFIPV 632 +YLIGLAKPHMVKSTVVDSTTGKS+DSRVRTSSGMFLQRG+DKVIRAIEKRIADYTFIP+ Sbjct: 121 DYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPM 180 Query: 633 EHGEGLQVLHYEVGQKYEPHFDYFFDEFNTKNGGQRVATLLMYLSDVEEGGETIFPDANV 812 EHGEGLQVLHYEVGQKYEPHFDYF DE+NTKNGGQR+ATLLMYLSDVEEGGETIFPDANV Sbjct: 181 EHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRMATLLMYLSDVEEGGETIFPDANV 240 Query: 813 NNSSLPWYTELSECARRGLAVKPKMGDALLFWSMKPDATLDPLSLH 950 N+SSLPWY ELSECAR+GLAVKPKMGDALLFWSMKPDATLDPLSLH Sbjct: 241 NSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDPLSLH 286 Score = 28.9 bits (63), Expect(2) = e-135 Identities = 15/30 (50%), Positives = 18/30 (60%) Frame = +1 Query: 952 GAVLLLMGINGRQPSGCMSMSTKLELSKVV 1041 G VL NG QPSGCMS + + + S VV Sbjct: 309 GVVLSSKETNGHQPSGCMSSNPEEKESLVV 338 >gb|EAZ25161.1| hypothetical protein OsJ_008644 [Oryza sativa (japonica cultivar-group)] Length = 376 Score = 479 bits (1233), Expect(2) = e-135 Identities = 238/286 (83%), Positives = 248/286 (86%), Gaps = 6/286 (2%) Frame = +3 Query: 111 MAPSRPLMSGIRPPRVFPKRGGRTSPYDVXXXXXXXXXXXXXXXXXFGVISLPVSAPIAT 290 MAPSRPLM GIRPPRVFP RGGRTSP + FGV SLPVSAP A Sbjct: 1 MAPSRPLMRGIRPPRVFPTRGGRTSPLALALAALLLASALLLALIAFGVFSLPVSAPNAA 60 Query: 291 T------GGEAESTDSRPARPRGRRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEEC 452 T GG+AE D RP R R RRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEEC Sbjct: 61 TTDSAAAGGDAEPADPRPPRTRARRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEEC 120 Query: 453 EYLIGLAKPHMVKSTVVDSTTGKSEDSRVRTSSGMFLQRGQDKVIRAIEKRIADYTFIPV 632 +YLIGLAKPHMVKSTVVDSTTGKS+DSRVRTSSGMFLQRG+DKVIRAIEKRIADYTFIP+ Sbjct: 121 DYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPM 180 Query: 633 EHGEGLQVLHYEVGQKYEPHFDYFFDEFNTKNGGQRVATLLMYLSDVEEGGETIFPDANV 812 EHGEGLQVLHYEVGQKYEPHFDYF DE+NTKNGGQR+ATLLMYLSDVEEGGETIFPDANV Sbjct: 181 EHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRMATLLMYLSDVEEGGETIFPDANV 240 Query: 813 NNSSLPWYTELSECARRGLAVKPKMGDALLFWSMKPDATLDPLSLH 950 N+SSLPWY ELSECAR+GLAVKPKMGDALLFWSMKPDATLDPLSLH Sbjct: 241 NSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDPLSLH 286 Score = 28.9 bits (63), Expect(2) = e-135 Identities = 15/30 (50%), Positives = 18/30 (60%) Frame = +1 Query: 952 GAVLLLMGINGRQPSGCMSMSTKLELSKVV 1041 G VL NG QPSGCMS + + + S VV Sbjct: 298 GVVLSSKETNGHQPSGCMSSNPEEKESLVV 327 >ref|NP_564109.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana] gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein from Gallus gallus gi|212530. [Arabidopsis thaliana] gb|ABE02413.1| At1g20270 [Arabidopsis thaliana] Length = 287 Score = 416 bits (1068), Expect = e-114 Identities = 198/257 (77%), Positives = 221/257 (85%) Frame = +3 Query: 249 FGVISLPVSAPIATTGGEAESTDSRPARPRGRRDLSEGLGERGAQWTEVISWEPRAFVYH 428 FGV SLP++ E+ D R R + SEGLG+RG QWTEV+SWEPRAFVYH Sbjct: 37 FGVFSLPIN------NDESSPIDLSYFR-RAATERSEGLGKRGDQWTEVLSWEPRAFVYH 89 Query: 429 NFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSEDSRVRTSSGMFLQRGQDKVIRAIEKRI 608 NFLSKEECEYLI LAKPHMVKSTVVDS TGKS+DSRVRTSSG FL+RG+DK+I+ IEKRI Sbjct: 90 NFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGRDKIIKTIEKRI 149 Query: 609 ADYTFIPVEHGEGLQVLHYEVGQKYEPHFDYFFDEFNTKNGGQRVATLLMYLSDVEEGGE 788 ADYTFIP +HGEGLQVLHYE GQKYEPH+DYF DEFNTKNGGQR+AT+LMYLSDVEEGGE Sbjct: 150 ADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGE 209 Query: 789 TIFPDANVNNSSLPWYTELSECARRGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVI 968 T+FP AN+N SS+PWY ELSEC ++GL+VKP+MGDALLFWSM+PDATLDP SLHGGCPVI Sbjct: 210 TVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVI 269 Query: 969 NGNKWSSTKWMHVHEYK 1019 GNKWSSTKWMHV EYK Sbjct: 270 RGNKWSSTKWMHVGEYK 286 >gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana] Length = 287 Score = 414 bits (1064), Expect = e-114 Identities = 197/257 (76%), Positives = 221/257 (85%) Frame = +3 Query: 249 FGVISLPVSAPIATTGGEAESTDSRPARPRGRRDLSEGLGERGAQWTEVISWEPRAFVYH 428 FGV SLP++ E+ D R R + SEGLG+RG QWTEV+SWEPRAFVYH Sbjct: 37 FGVFSLPIN------NDESSPIDLSYFR-RAATERSEGLGKRGDQWTEVLSWEPRAFVYH 89 Query: 429 NFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSEDSRVRTSSGMFLQRGQDKVIRAIEKRI 608 NFLSKEECEYLI LAKPHMVKSTVVDS TGKS+DSRVRTSSG FL+RG+DK+I+ IEKRI Sbjct: 90 NFLSKEECEYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRRGRDKIIKTIEKRI 149 Query: 609 ADYTFIPVEHGEGLQVLHYEVGQKYEPHFDYFFDEFNTKNGGQRVATLLMYLSDVEEGGE 788 ADYTFIP +HGEGLQVLHYE GQKYEPH+DYF DEFNTKNGGQR+AT+LMYLSDVEEGGE Sbjct: 150 ADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGE 209 Query: 789 TIFPDANVNNSSLPWYTELSECARRGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVI 968 T+FP AN+N SS+PWY ELSEC ++GL+VKP+MGDALLFWSM+PDATLDP SLHGGCPVI Sbjct: 210 TVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVI 269 Query: 969 NGNKWSSTKWMHVHEYK 1019 GNKWSSTKW+HV EYK Sbjct: 270 RGNKWSSTKWIHVGEYK 286