BLASTX 2.2.17 Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bphylf039b08 (1317 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 5,815,196 sequences; 2,006,227,497 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAD09428.1| putative 3-methyladenine-DNA glycosylase [Oryza ... 543 e-152 gb|EAZ07434.1| hypothetical protein OsI_028666 [Oryza sativa (in... 482 e-134 ref|NP_973818.1| methyladenine glycosylase family protein [Arabi... 278 5e-73 gb|AAF81290.1|AC027656_7 Contains similarity to a putative DNA-3... 246 3e-63 emb|CAO66678.1| unnamed protein product [Vitis vinifera] 218 6e-55 >dbj|BAD09428.1| putative 3-methyladenine-DNA glycosylase [Oryza sativa (japonica cultivar-group)] dbj|BAD09657.1| putative 3-methyladenine-DNA glycosylase [Oryza sativa (japonica cultivar-group)] Length = 339 Score = 543 bits (1399), Expect = e-152 Identities = 267/338 (78%), Positives = 289/338 (85%), Gaps = 12/338 (3%) Frame = +1 Query: 52 MLTTTTHSRHH-AFEKSPSH--MKNI------DRKLQAMSHA-SKYLQRIYPLGIQRTXX 201 MLTT++HSRHH AFE+SP+H MKNI D AM+HA SKY+QRIYPLGIQR+ Sbjct: 1 MLTTSSHSRHHHAFERSPNHSMMKNIADRNKHDLLQSAMNHAASKYMQRIYPLGIQRSSS 60 Query: 202 XXXXXXXXXXXXXXXXXXXXXXXXWEPKVPLLYGGTFSPWGDVLVSLERRRE--DDKVSD 375 WEPKVPLLYGGTFSPWGDVLVSLERRRE DDKVSD Sbjct: 61 NLTLSSLSLSQNSNDSSLSSSNSSWEPKVPLLYGGTFSPWGDVLVSLERRREEDDDKVSD 120 Query: 376 HDVEGEDEEFDCSDPGSLHKCSWITKNSDEAYVQFHDECWGVPVYNDNRLFELLTLSGML 555 HDVEG +E+FDCS+PGSLH+CSWITKNSDEAYVQFHDECWGVPVYNDNRLFELL LSGML Sbjct: 121 HDVEGGEEDFDCSEPGSLHRCSWITKNSDEAYVQFHDECWGVPVYNDNRLFELLALSGML 180 Query: 556 IDHNWTEILKRRDVYRQAFADFDHNAVAKMDENDIAELSANKELKLAECRVRCIVENAKC 735 IDHNWTEILKRRD+YR+AFADFD + VAKMDEND+AE+S NKELKLAECRVRCI+ENAKC Sbjct: 181 IDHNWTEILKRRDMYREAFADFDPSTVAKMDENDVAEISGNKELKLAECRVRCIIENAKC 240 Query: 736 IQKVAKKFGSFSGYIWGHVNHRPMVGKYKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVI 915 IQKVAK+FGSFSGYIWGHVNHRP VG+YKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVI Sbjct: 241 IQKVAKEFGSFSGYIWGHVNHRPTVGRYKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVI 300 Query: 916 VYSFMQASGMVIDHLVDCFRFPECVRLAERSWGITNVA 1029 VYSFMQASG+VIDHLVDCFRFPEC+ LA+RSWGITNVA Sbjct: 301 VYSFMQASGIVIDHLVDCFRFPECLHLADRSWGITNVA 338 >gb|EAZ07434.1| hypothetical protein OsI_028666 [Oryza sativa (indica cultivar-group)] gb|EAZ43154.1| hypothetical protein OsJ_026637 [Oryza sativa (japonica cultivar-group)] Length = 304 Score = 482 bits (1241), Expect = e-134 Identities = 240/316 (75%), Positives = 257/316 (81%), Gaps = 9/316 (2%) Frame = +1 Query: 109 MKNI------DRKLQAMSHA-SKYLQRIYPLGIQRTXXXXXXXXXXXXXXXXXXXXXXXX 267 MKNI D AM+HA SKY+QRIYPLGIQR+ Sbjct: 1 MKNIADRNKHDLLQSAMNHAASKYMQRIYPLGIQRSSSNLTLSSLSLSQNSNDSSLSSSN 60 Query: 268 XXWEPKVPLLYGGTFSPWGDVLVSLERRRE--DDKVSDHDVEGEDEEFDCSDPGSLHKCS 441 WEPKVPLLYGGTFSPWGDVLVSLERRRE DDKVSDHDVEG +E+FDCS+PGSLH+CS Sbjct: 61 SSWEPKVPLLYGGTFSPWGDVLVSLERRREEDDDKVSDHDVEGGEEDFDCSEPGSLHRCS 120 Query: 442 WITKNSDEAYVQFHDECWGVPVYNDNRLFELLTLSGMLIDHNWTEILKRRDVYRQAFADF 621 WITKNSDEAYVQFHDECWGVPVYNDNRLFELL LSGMLIDHNWTEILKRRD+YR+AFADF Sbjct: 121 WITKNSDEAYVQFHDECWGVPVYNDNRLFELLALSGMLIDHNWTEILKRRDMYREAFADF 180 Query: 622 DHNAVAKMDENDIAELSANKELKLAECRVRCIVENAKCIQKVAKKFGSFSGYIWGHVNHR 801 D + VAKMDEND+AE+S NKELKLAECR VAK+FGSFSGYIWGHVNHR Sbjct: 181 DPSTVAKMDENDVAEISGNKELKLAECR-------------VAKEFGSFSGYIWGHVNHR 227 Query: 802 PMVGKYKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVIVYSFMQASGMVIDHLVDCFRFP 981 P VG+YKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVIVYSFMQASG+VIDHLVDCFRFP Sbjct: 228 PTVGRYKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVIVYSFMQASGIVIDHLVDCFRFP 287 Query: 982 ECVRLAERSWGITNVA 1029 EC+ LA+RSWGITNVA Sbjct: 288 ECLHLADRSWGITNVA 303 >ref|NP_973818.1| methyladenine glycosylase family protein [Arabidopsis thaliana] Length = 311 Score = 278 bits (712), Expect = 5e-73 Identities = 140/290 (48%), Positives = 193/290 (66%), Gaps = 3/290 (1%) Frame = +1 Query: 151 SKYLQRIYPLGIQRTXXXXXXXXXXXXXXXXXXXXXXXXXX---WEPKVPLLYGGTFSPW 321 +K+L+RIYP+ +QR+ E K+ L G Sbjct: 31 AKHLKRIYPITLQRSTSSSFSLSSISLSLSQNSTDSVSTDSNSTLEQKISLALG------ 84 Query: 322 GDVLVSLERRREDDKVSDHDVEGEDEEFDCSDPGSLHKCSWITKNSDEAYVQFHDECWGV 501 L+S RRE V + ++F+ SD +C+WITK SDE YV FHD+ WGV Sbjct: 85 ---LISSPHRREIF-VPKSIPQQLCQDFNSSDEPK--RCNWITKKSDEVYVMFHDQQWGV 138 Query: 502 PVYNDNRLFELLTLSGMLIDHNWTEILKRRDVYRQAFADFDHNAVAKMDENDIAELSANK 681 PVY+DN LFE L +SGML+D+NWTEILKR++ +R+AF +FD N VAKM E +IAE+++NK Sbjct: 139 PVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFREAFCEFDPNRVAKMGEKEIAEIASNK 198 Query: 682 ELKLAECRVRCIVENAKCIQKVAKKFGSFSGYIWGHVNHRPMVGKYKHHKYIPFRTPKSE 861 + L E RVRCIV+NAKCI KV +FGSFS ++WG ++++P++ K+K+ + +P R+PK+E Sbjct: 199 AIMLQESRVRCIVDNAKCITKVVNEFGSFSSFVWGFMDYKPIINKFKYSRNVPLRSPKAE 258 Query: 862 AVSKDLVRRGFRLVGPVIVYSFMQASGMVIDHLVDCFRFPECVRLAERSW 1011 +SKD+++RGFR VGPVIV+SFMQA+G+ IDHLVDCFR +CV LAER W Sbjct: 259 IISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDCFRHGDCVSLAERPW 308 >gb|AAF81290.1|AC027656_7 Contains similarity to a putative DNA-3-methyladenine glycosylase I F9E10.6 gi|6646756 from Arabidopsis thaliana BAC F9E10 gb|AC013258 Length = 298 Score = 246 bits (628), Expect = 3e-63 Identities = 129/290 (44%), Positives = 181/290 (62%), Gaps = 3/290 (1%) Frame = +1 Query: 151 SKYLQRIYPLGIQRTXXXXXXXXXXXXXXXXXXXXXXXXXX---WEPKVPLLYGGTFSPW 321 +K+L+RIYP+ +QR+ E K+ L G Sbjct: 31 AKHLKRIYPITLQRSTSSSFSLSSISLSLSQNSTDSVSTDSNSTLEQKISLALG------ 84 Query: 322 GDVLVSLERRREDDKVSDHDVEGEDEEFDCSDPGSLHKCSWITKNSDEAYVQFHDECWGV 501 L+S RRE V + ++F+ SD +C+WITK SDE YV FHD+ WGV Sbjct: 85 ---LISSPHRREIF-VPKSIPQQLCQDFNSSDEPK--RCNWITKKSDEVYVMFHDQQWGV 138 Query: 502 PVYNDNRLFELLTLSGMLIDHNWTEILKRRDVYRQAFADFDHNAVAKMDENDIAELSANK 681 PVY+DN LFE L +SGML+D+NWTEILKR++ +R+AF +FD N VAKM E +IAE+++NK Sbjct: 139 PVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFREAFCEFDPNRVAKMGEKEIAEIASNK 198 Query: 682 ELKLAECRVRCIVENAKCIQKVAKKFGSFSGYIWGHVNHRPMVGKYKHHKYIPFRTPKSE 861 + L E R V +FGSFS ++WG ++++P++ K+K+ + +P R+PK+E Sbjct: 199 AIMLQESR-------------VVNEFGSFSSFVWGFMDYKPIINKFKYSRNVPLRSPKAE 245 Query: 862 AVSKDLVRRGFRLVGPVIVYSFMQASGMVIDHLVDCFRFPECVRLAERSW 1011 +SKD+++RGFR VGPVIV+SFMQA+G+ IDHLVDCFR +CV LAER W Sbjct: 246 IISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDCFRHGDCVSLAERPW 295 >emb|CAO66678.1| unnamed protein product [Vitis vinifera] Length = 375 Score = 218 bits (556), Expect = 6e-55 Identities = 94/190 (49%), Positives = 139/190 (73%) Frame = +1 Query: 433 KCSWITKNSDEAYVQFHDECWGVPVYNDNRLFELLTLSGMLIDHNWTEILKRRDVYRQAF 612 +C+W+T N+D +Y+ FHDE WGVPV++D +LFELL LSG L + W IL +R ++R+ F Sbjct: 151 RCAWVTPNTDLSYIAFHDEEWGVPVHDDKKLFELLVLSGALAELTWPTILSKRHIFREVF 210 Query: 613 ADFDHNAVAKMDENDIAELSANKELKLAECRVRCIVENAKCIQKVAKKFGSFSGYIWGHV 792 ADFD AVAK++E + + ++E ++R I+ENA+ + KV +FGSF YIW V Sbjct: 211 ADFDPIAVAKLNEKKLMAPGSIASSLISELKLRGIIENARQMSKVIDEFGSFDEYIWSFV 270 Query: 793 NHRPMVGKYKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVIVYSFMQASGMVIDHLVDCF 972 NH+P+V ++++ +++P +TPK++ +SKDLVRRGFR VGP ++YSFMQ +G+ DHL+ CF Sbjct: 271 NHKPIVSRFRYPRHVPVKTPKADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLISCF 330 Query: 973 RFPECVRLAE 1002 RF +CV AE Sbjct: 331 RFQDCVTAAE 340