BLASTX 2.2.17 Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= bphyem126a22 (1169 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 5,815,196 sequences; 2,006,227,497 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_001012761.1| S1/P1 Nuclease [Tetrahymena thermophila SB21... 309 2e-82 ref|XP_001012760.1| S1/P1 Nuclease [Tetrahymena thermophila SB21... 281 8e-74 ref|XP_001448701.1| hypothetical protein GSPATT00016146001 [Para... 199 4e-49 ref|XP_001428481.1| hypothetical protein GSPATT00031314001 [Para... 196 2e-48 ref|XP_001432219.1| hypothetical protein GSPATT00034299001 [Para... 196 3e-48 >ref|XP_001012761.1| S1/P1 Nuclease [Tetrahymena thermophila SB210] gb|EAR92516.1| S1/P1 Nuclease [Tetrahymena thermophila SB210] Length = 324 Score = 309 bits (792), Expect = 2e-82 Identities = 156/324 (48%), Positives = 216/324 (66%), Gaps = 2/324 (0%) Frame = +3 Query: 48 MKSTQILVLLCLVSMCTCWWDVGHMMTAEVAR*ELNQRDPELMAYISQIATAYNALADDR 227 M++T +L +L L++ +CWWD GHMMTAE+A+ E+ R+ L I + T N L D R Sbjct: 1 MRNTFLLCIL-LIAGVSCWWDGGHMMTAEIAKQEILARNATLYEQIEKYVTILNPLCDAR 59 Query: 228 SQTFVQAASWADDVKDKAMDFMYGWHFIDRPINPDGLWAVTPESEMNSNSVTSMEKFRFE 407 SQ FVQAASWADD+KD AM+F YGWHF D+P NP GL+ + + NS+T +++ E Sbjct: 60 SQDFVQAASWADDIKDDAMNFWYGWHFYDKPENPQGLYVILDQDNQVYNSITGIKRAIQE 119 Query: 408 LQKKDYRPA*NS*KISMQKSFMMRMAVHVVGDIH*PLHNTNFFNNTYAKGDLGGNFQKIV 587 L +K Y P N+ IS+Q++ MMR+ +H+VGD+H PLHN N +N TY +GDLGGN +KI+ Sbjct: 120 LSRKYYLPLQNNLNISVQQAIMMRLLIHIVGDMHQPLHNVNMYNYTYTQGDLGGNKEKIL 179 Query: 588 TL*NKTLNLHSYFDSGAEVL*NIDRPLNATA*QFITNFAQELRGKYTRAYFGARMN*VTP 767 L ++ LHSYFDSGA L + RPL ++ A E R +Y R+YFG RMN TP Sbjct: 180 LLNKTSMILHSYFDSGATRLDSFPRPLTQEKLSNLSALAYEFRAQYPRSYFGQRMNVTTP 239 Query: 768 Q*WSDESYGIARDFIYAFIKNTNQITPDFDAKAKQITQEQVTIGGYRLADWLYDTFKNTT 947 + W+ ESY IA +FIY F+ TNQITP++D ++ +I ++Q+ +GGYRLAD L F+N T Sbjct: 240 EQWAQESYDIAHNFIYPFVTKTNQITPEWDTESYEIIKQQLALGGYRLADILLGIFQNQT 299 Query: 948 S-TNSTKSA-LQEETPSNLRRKFL 1013 + N TKS Q + SNLR+ + Sbjct: 300 APVNQTKSTNTQTNSTSNLRKTII 323 >ref|XP_001012760.1| S1/P1 Nuclease [Tetrahymena thermophila SB210] gb|EAR92515.1| S1/P1 Nuclease [Tetrahymena thermophila SB210] Length = 330 Score = 281 bits (718), Expect = 8e-74 Identities = 145/321 (45%), Positives = 204/321 (63%), Gaps = 14/321 (4%) Frame = +3 Query: 48 MKSTQILVLLCLVSMCTCWWDVGHMMTAEVAR*ELNQRDPELMAYISQIATAYNALADDR 227 MK+ I +L +VS WWD GHM+T EVA+ E+ RDP L I + T N L D R Sbjct: 1 MKTQSIFAVLLIVSSVFGWWDGGHMITVEVAKQEILARDPALYLKIEKYVTILNPLCDAR 60 Query: 228 SQTFVQAASWADDVKDKAMDFMYGWHFIDRPINPDGLWAVTPESEMNSNSVTSMEKFRFE 407 SQTFVQAASWADD+KD AM+F WHF ++PIN +GL+ V + +N+NS+ ++++ E Sbjct: 61 SQTFVQAASWADDIKDPAMNFWDKWHFFNKPINEEGLYVVLDQDSLNNNSINALKRCIQE 120 Query: 408 LQKKDYRPA*NS*KISMQKSFMMRMAVHVVGDIH*PLHNTNFFNNTYA--KGDLGGNFQK 581 LQK + P N IS+Q++ MMR +H+VGD+H PLHNTN FN T++ +GDLGGN + Sbjct: 121 LQKNNTTPINNPDNISVQQAIMMRYLIHIVGDMHQPLHNTNLFNYTFSTNQGDLGGNKEN 180 Query: 582 IVTL*NKTLNLHSYFDSGAEVL*NIDRPLNATA*QFITNFAQELRGKYTRAYFGARMN*V 761 ++ L ++ LH YFDSGA L + RPL+ Q +T+FA R +Y R++F R+N Sbjct: 181 VILLNGTSMVLHYYFDSGALRLADFSRPLSQEQEQQVTDFAASFRAQYPRSFFNERVNIT 240 Query: 762 TPQ*WSDESYGIARDFIYAFIKNTNQITPDFDAKAKQITQEQVTIGGYRLADWLYDTF-- 935 P+ W+ ESY IA IY ++K TN++TP++D ++ ++Q+ +GGYRLAD L F Sbjct: 241 LPEMWAQESYEIAVRDIYPYLKLTNKVTPEWDNLQYEMIKQQIALGGYRLADLLTSVFNP 300 Query: 936 ----------KNTTSTNSTKS 968 N+TSTNST S Sbjct: 301 PVPPTPVQDSNNSTSTNSTSS 321 >ref|XP_001448701.1| hypothetical protein GSPATT00016146001 [Paramecium tetraurelia strain d4-2] emb|CAK81304.1| unnamed protein product [Paramecium tetraurelia] Length = 306 Score = 199 bits (505), Expect = 4e-49 Identities = 106/293 (36%), Positives = 168/293 (57%), Gaps = 4/293 (1%) Frame = +3 Query: 69 VLLCLVSMCTCWWDVGHMMTAEVAR*ELNQRDPELMAYISQIATAYNALADDRSQTFVQA 248 +L+ + + CWWDVGHMMTA++A+ L P+++A+ + N+L D +S TF +A Sbjct: 5 LLITISYVVQCWWDVGHMMTAQIAKNYLKDNRPDVLAWADSLVQDLNSLTDGKSNTFAEA 64 Query: 249 ASWADDVKDKAMDFMYGWHFIDRPINPDGLWAVTPESEMNSNSVTSMEKFRFELQKKDYR 428 A W DD+K+ FM WH+ DRPINPDGL + N NS+ ++ + L + + Sbjct: 65 AVWMDDIKETGTAFMNDWHYTDRPINPDGLLIKLDDQLRNINSIYAINQAVSVL--TNTK 122 Query: 429 PA*NS*KISMQKSFMMRMAVHVVGDIH*PLHNTNFFNNTYAKGDLGGNFQKIVTL*NKTL 608 A N + +M K+ M+R+ +HV+GD+H PLH+T FFN++Y GD GGNF K+ + Sbjct: 123 TAKN--RHTMFKAQMIRVLLHVIGDMHQPLHDTTFFNSSYPNGDQGGNFMKVQLENGTLV 180 Query: 609 NLHSYFDSGAEVL*N----IDRPLNATA*QFITNFAQELRGKYTRAYFGARMN*VTPQ*W 776 NLHS++D+GA + RPL+ + +++ ++ E+ KY + ++ P W Sbjct: 181 NLHSFWDAGAFAFSPNNSFLVRPLSQSDSEYLNKWSLEVISKYQITKYN-NIDMTNPTVW 239 Query: 777 SDESYGIARDFIYAFIKNTNQITPDFDAKAKQITQEQVTIGGYRLADWLYDTF 935 + Y A F+Y I ++N D+ +A+Q +E + IGGYRLA L D F Sbjct: 240 TYVGYRQAVQFVYPMIASSNNYNKDYTQQAQQFCEENLAIGGYRLAQKLIDVF 292 >ref|XP_001428481.1| hypothetical protein GSPATT00031314001 [Paramecium tetraurelia strain d4-2] emb|CAK61083.1| unnamed protein product [Paramecium tetraurelia] Length = 306 Score = 196 bits (499), Expect = 2e-48 Identities = 105/304 (34%), Positives = 173/304 (56%), Gaps = 4/304 (1%) Frame = +3 Query: 81 LVSMCTCWWDVGHMMTAEVAR*ELNQRDPELMAYISQIATAYNALADDRSQTFVQAASWA 260 L+SM CWW+VGHMMTA++A+ L P+++A+ + +N+L D +S TF +AA W Sbjct: 9 LISMVYCWWEVGHMMTAQIAKNYLKDNRPDVLAWADSLVQDFNSLTDGKSNTFAEAAVWL 68 Query: 261 DDVKDKAMDFMYGWHFIDRPINPDGLWAVTPESEMNSNSVTSMEKFRFELQKKDYRPA*N 440 DD+K+ F+ WH+ DRPINPDGL + N NS+ ++ + L + + A N Sbjct: 69 DDIKETGTGFLNDWHYTDRPINPDGLLIKIDDQGRNINSIYAINQAVSVLTNQ--KTAKN 126 Query: 441 S*KISMQKSFMMRMAVHVVGDIH*PLHNTNFFNNTYAKGDLGGNFQKIVTL*NKTLNLHS 620 + ++ K+ M+R+ +HV+GDIH PLH+ F+N++Y GD GGNF KI +N HS Sbjct: 127 --RHTVFKAQMIRVLLHVIGDIHQPLHDVTFWNSSYPNGDAGGNFMKIQLSNGTLMNFHS 184 Query: 621 YFDSGAEVL*N----IDRPLNATA*QFITNFAQELRGKYTRAYFGARMN*VTPQ*WSDES 788 ++DSGA + RPL+ + Q++ +++EL K+ ++ + + + P W+ Sbjct: 185 FWDSGAVSFAPNNSFMARPLSQSDSQYLDKWSKELIAKFPKSKY-SNYDMTNPSVWTYLG 243 Query: 789 YGIARDFIYAFIKNTNQITPDFDAKAKQITQEQVTIGGYRLADWLYDTFKNTTSTNSTKS 968 + A+ F+Y I +N D++ +A +E ++IGGYRL L + + N K Sbjct: 244 FRQAQQFVYPMIATSNSYNSDYEKQAIAFCEENLSIGGYRLGAKLIEIYDQILQ-NEAKL 302 Query: 969 ALQE 980 +L E Sbjct: 303 SLNE 306 >ref|XP_001432219.1| hypothetical protein GSPATT00034299001 [Paramecium tetraurelia strain d4-2] emb|CAK64822.1| unnamed protein product [Paramecium tetraurelia] Length = 306 Score = 196 bits (498), Expect = 3e-48 Identities = 105/305 (34%), Positives = 169/305 (55%), Gaps = 4/305 (1%) Frame = +3 Query: 66 LVLLCLVSMCTCWWDVGHMMTAEVAR*ELNQRDPELMAYISQIATAYNALADDRSQTFVQ 245 L+LL + + CWWDVGHMMTA++A+ L P+ +A+ + N+L D +S TF + Sbjct: 4 LLLLAISYVVQCWWDVGHMMTAQIAKNYLKDNRPDTLAWADSLVQDLNSLTDGKSNTFAE 63 Query: 246 AASWADDVKDKAMDFMYGWHFIDRPINPDGLWAVTPESEMNSNSVTSMEKFRFELQKKDY 425 AA W DD+K+ FM WH+ DRPINPDGL + N NS+ ++ + L Sbjct: 64 AAVWMDDIKETGTSFMNDWHYTDRPINPDGLLIKIEDQNRNINSIYAINQAVSVLTNS-- 121 Query: 426 RPA*NS*KISMQKSFMMRMAVHVVGDIH*PLHNTNFFNNTYAKGDLGGNFQKIVTL*NKT 605 + A N + ++ K+ M+R+ +HV+GD+H PLH+T F+N++Y GD GGNF K+ Sbjct: 122 KTARN--RHTVFKAQMLRVLLHVIGDLHQPLHDTTFWNSSYPNGDQGGNFMKVQLENGTL 179 Query: 606 LNLHSYFDSGAEVL*N----IDRPLNATA*QFITNFAQELRGKYTRAYFGARMN*VTPQ* 773 +NLHS++D+GA + RPL+ + +++ ++ ++ KY + ++ P Sbjct: 180 VNLHSFWDAGAFAFSPNNSFLVRPLSQSDQEYLNKWSLDVIKKYQFTKY-INLDMTNPSV 238 Query: 774 WSDESYGIARDFIYAFIKNTNQITPDFDAKAKQITQEQVTIGGYRLADWLYDTFKNTTST 953 W+ Y A F+Y I +N D+ +A++ +E + IGGYRLA L D + S Sbjct: 239 WTYVGYRQAIQFVYPMIAGSNNYNKDYVKQAQEFCEENLAIGGYRLAQKLIDIYDQILSN 298 Query: 954 NSTKS 968 + S Sbjct: 299 EAKLS 303