BLASTX 2.2.17 Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bphylf030e19 (1278 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 5,815,196 sequences; 2,006,227,497 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAD09428.1| putative 3-methyladenine-DNA glycosylase [Oryza ... 521 e-146 gb|EAZ07434.1| hypothetical protein OsI_028666 [Oryza sativa (in... 476 e-132 ref|NP_973818.1| methyladenine glycosylase family protein [Arabi... 284 1e-74 gb|AAF81290.1|AC027656_7 Contains similarity to a putative DNA-3... 251 6e-65 gb|EAZ01894.1| hypothetical protein OsI_023126 [Oryza sativa (in... 221 6e-56 >dbj|BAD09428.1| putative 3-methyladenine-DNA glycosylase [Oryza sativa (japonica cultivar-group)] dbj|BAD09657.1| putative 3-methyladenine-DNA glycosylase [Oryza sativa (japonica cultivar-group)] Length = 339 Score = 521 bits (1343), Expect = e-146 Identities = 259/325 (79%), Positives = 275/325 (84%), Gaps = 12/325 (3%) Frame = +1 Query: 79 AFEKNPNH--MKNI------DRKLQAMSHA-SKYLQRIYPLGIQRXXXXXXXXXXXXXXX 231 AFE++PNH MKNI D AM+HA SKY+QRIYPLGIQR Sbjct: 13 AFERSPNHSMMKNIADRNKHDLLQSAMNHAASKYMQRIYPLGIQRSSSNLTLSSLSLSQN 72 Query: 232 XXXXXXXXXXXXWEPKVPLVYGGTFNPWGDVLVSLDRRRE--DDKVSDQDVEG-EEEFDC 402 WEPKVPL+YGGTF+PWGDVLVSL+RRRE DDKVSD DVEG EE+FDC Sbjct: 73 SNDSSLSSSNSSWEPKVPLLYGGTFSPWGDVLVSLERRREEDDDKVSDHDVEGGEEDFDC 132 Query: 403 SEPGSLHRCSWITKNSDEAYVQFHDECWGVPVYNDNRLFELLALSGMLIDHNWTEILKRR 582 SEPGSLHRCSWITKNSDEAYVQFHDECWGVPVYNDNRLFELLALSGMLIDHNWTEILKRR Sbjct: 133 SEPGSLHRCSWITKNSDEAYVQFHDECWGVPVYNDNRLFELLALSGMLIDHNWTEILKRR 192 Query: 583 DMYREAFVDFDHNAVAKMDENDIAEISTNKELKLAECRVRCIVENAKCIQKVAKDFGSFS 762 DMYREAF DFD + VAKMDEND+AEIS NKELKLAECRVRCI+ENAKCIQKVAK+FGSFS Sbjct: 193 DMYREAFADFDPSTVAKMDENDVAEISGNKELKLAECRVRCIIENAKCIQKVAKEFGSFS 252 Query: 763 GYIWGHVNHRPMVGKYKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVIVYSFMQASGMVI 942 GYIWGHVNHRP VG+YKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVIVYSFMQASG+VI Sbjct: 253 GYIWGHVNHRPTVGRYKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVIVYSFMQASGIVI 312 Query: 943 DHLVDCFRFPECVRLAERSWGITNV 1017 DHLVDCFRFPEC+ LA+RSWGITNV Sbjct: 313 DHLVDCFRFPECLHLADRSWGITNV 337 >gb|EAZ07434.1| hypothetical protein OsI_028666 [Oryza sativa (indica cultivar-group)] gb|EAZ43154.1| hypothetical protein OsJ_026637 [Oryza sativa (japonica cultivar-group)] Length = 304 Score = 476 bits (1225), Expect = e-132 Identities = 241/315 (76%), Positives = 254/315 (80%), Gaps = 10/315 (3%) Frame = +1 Query: 103 MKNI------DRKLQAMSHA-SKYLQRIYPLGIQRXXXXXXXXXXXXXXXXXXXXXXXXX 261 MKNI D AM+HA SKY+QRIYPLGIQR Sbjct: 1 MKNIADRNKHDLLQSAMNHAASKYMQRIYPLGIQRSSSNLTLSSLSLSQNSNDSSLSSSN 60 Query: 262 XXWEPKVPLVYGGTFNPWGDVLVSLDRRRE--DDKVSDQDVEG-EEEFDCSEPGSLHRCS 432 WEPKVPL+YGGTF+PWGDVLVSL+RRRE DDKVSD DVEG EE+FDCSEPGSLHRCS Sbjct: 61 SSWEPKVPLLYGGTFSPWGDVLVSLERRREEDDDKVSDHDVEGGEEDFDCSEPGSLHRCS 120 Query: 433 WITKNSDEAYVQFHDECWGVPVYNDNRLFELLALSGMLIDHNWTEILKRRDMYREAFVDF 612 WITKNSDEAYVQFHDECWGVPVYNDNRLFELLALSGMLIDHNWTEILKRRDMYREAF DF Sbjct: 121 WITKNSDEAYVQFHDECWGVPVYNDNRLFELLALSGMLIDHNWTEILKRRDMYREAFADF 180 Query: 613 DHNAVAKMDENDIAEISTNKELKLAECRVRCIVENAKCIQKVAKDFGSFSGYIWGHVNHR 792 D + VAKMDEND+AEIS NKELKLAECR VAK+FGSFSGYIWGHVNHR Sbjct: 181 DPSTVAKMDENDVAEISGNKELKLAECR-------------VAKEFGSFSGYIWGHVNHR 227 Query: 793 PMVGKYKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVIVYSFMQASGMVIDHLVDCFRFP 972 P VG+YKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVIVYSFMQASG+VIDHLVDCFRFP Sbjct: 228 PTVGRYKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVIVYSFMQASGIVIDHLVDCFRFP 287 Query: 973 ECVRLAERSWGITNV 1017 EC+ LA+RSWGITNV Sbjct: 288 ECLHLADRSWGITNV 302 >ref|NP_973818.1| methyladenine glycosylase family protein [Arabidopsis thaliana] Length = 311 Score = 284 bits (726), Expect = 1e-74 Identities = 142/289 (49%), Positives = 192/289 (66%), Gaps = 3/289 (1%) Frame = +1 Query: 145 SKYLQRIYPLGIQRXXXXXXXXXXXXXXXXXXXXXXXXXXX---WEPKVPLVYGGTFNPW 315 +K+L+RIYP+ +QR E K+ L G Sbjct: 31 AKHLKRIYPITLQRSTSSSFSLSSISLSLSQNSTDSVSTDSNSTLEQKISLALG------ 84 Query: 316 GDVLVSLDRRREDDKVSDQDVEGEEEFDCSEPGSLHRCSWITKNSDEAYVQFHDECWGVP 495 L+S RRE + ++F+ S+ RC+WITK SDE YV FHD+ WGVP Sbjct: 85 ---LISSPHRREIFVPKSIPQQLCQDFNSSDEPK--RCNWITKKSDEVYVMFHDQQWGVP 139 Query: 496 VYNDNRLFELLALSGMLIDHNWTEILKRRDMYREAFVDFDHNAVAKMDENDIAEISTNKE 675 VY+DN LFE LA+SGML+D+NWTEILKR++ +REAF +FD N VAKM E +IAEI++NK Sbjct: 140 VYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFREAFCEFDPNRVAKMGEKEIAEIASNKA 199 Query: 676 LKLAECRVRCIVENAKCIQKVAKDFGSFSGYIWGHVNHRPMVGKYKHHKYIPFRTPKSEA 855 + L E RVRCIV+NAKCI KV +FGSFS ++WG ++++P++ K+K+ + +P R+PK+E Sbjct: 200 IMLQESRVRCIVDNAKCITKVVNEFGSFSSFVWGFMDYKPIINKFKYSRNVPLRSPKAEI 259 Query: 856 VSKDLVRRGFRLVGPVIVYSFMQASGMVIDHLVDCFRFPECVRLAERSW 1002 +SKD+++RGFR VGPVIV+SFMQA+G+ IDHLVDCFR +CV LAER W Sbjct: 260 ISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDCFRHGDCVSLAERPW 308 >gb|AAF81290.1|AC027656_7 Contains similarity to a putative DNA-3-methyladenine glycosylase I F9E10.6 gi|6646756 from Arabidopsis thaliana BAC F9E10 gb|AC013258 Length = 298 Score = 251 bits (642), Expect = 6e-65 Identities = 131/289 (45%), Positives = 180/289 (62%), Gaps = 3/289 (1%) Frame = +1 Query: 145 SKYLQRIYPLGIQRXXXXXXXXXXXXXXXXXXXXXXXXXXX---WEPKVPLVYGGTFNPW 315 +K+L+RIYP+ +QR E K+ L G Sbjct: 31 AKHLKRIYPITLQRSTSSSFSLSSISLSLSQNSTDSVSTDSNSTLEQKISLALG------ 84 Query: 316 GDVLVSLDRRREDDKVSDQDVEGEEEFDCSEPGSLHRCSWITKNSDEAYVQFHDECWGVP 495 L+S RRE + ++F+ S+ RC+WITK SDE YV FHD+ WGVP Sbjct: 85 ---LISSPHRREIFVPKSIPQQLCQDFNSSDEPK--RCNWITKKSDEVYVMFHDQQWGVP 139 Query: 496 VYNDNRLFELLALSGMLIDHNWTEILKRRDMYREAFVDFDHNAVAKMDENDIAEISTNKE 675 VY+DN LFE LA+SGML+D+NWTEILKR++ +REAF +FD N VAKM E +IAEI++NK Sbjct: 140 VYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFREAFCEFDPNRVAKMGEKEIAEIASNKA 199 Query: 676 LKLAECRVRCIVENAKCIQKVAKDFGSFSGYIWGHVNHRPMVGKYKHHKYIPFRTPKSEA 855 + L E R V +FGSFS ++WG ++++P++ K+K+ + +P R+PK+E Sbjct: 200 IMLQESR-------------VVNEFGSFSSFVWGFMDYKPIINKFKYSRNVPLRSPKAEI 246 Query: 856 VSKDLVRRGFRLVGPVIVYSFMQASGMVIDHLVDCFRFPECVRLAERSW 1002 +SKD+++RGFR VGPVIV+SFMQA+G+ IDHLVDCFR +CV LAER W Sbjct: 247 ISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDCFRHGDCVSLAERPW 295 >gb|EAZ01894.1| hypothetical protein OsI_023126 [Oryza sativa (indica cultivar-group)] Length = 426 Score = 221 bits (564), Expect = 6e-56 Identities = 99/185 (53%), Positives = 136/185 (73%) Frame = +1 Query: 424 RCSWITKNSDEAYVQFHDECWGVPVYNDNRLFELLALSGMLIDHNWTEILKRRDMYREAF 603 RC+W+T SD YV FHDE WGVPV++D RLFELL LSG L + W EILKRR ++RE F Sbjct: 187 RCAWVTPTSDPCYVIFHDEEWGVPVHDDRRLFELLVLSGALAELTWPEILKRRQLFREIF 246 Query: 604 VDFDHNAVAKMDENDIAEISTNKELKLAECRVRCIVENAKCIQKVAKDFGSFSGYIWGHV 783 VDFD A++K++E + + L+E ++R +VENA+ I K+ +FGSF Y WG + Sbjct: 247 VDFDPVAISKINEKKLVAPGSVANSLLSEQKLRAVVENARQILKIVDEFGSFDRYCWGFL 306 Query: 784 NHRPMVGKYKHHKYIPFRTPKSEAVSKDLVRRGFRLVGPVIVYSFMQASGMVIDHLVDCF 963 NH+P+V K+++ + +P ++PK++ +SKD+VRRGFR VGP I+YSFMQA+G+ DHLV CF Sbjct: 307 NHKPIVSKFRYPRQVPVKSPKADMISKDMVRRGFRGVGPTIIYSFMQAAGLTNDHLVSCF 366 Query: 964 RFPEC 978 RF EC Sbjct: 367 RFKEC 371