BLASTX 2.2.17

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= bphyst015h17
         (1065 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           5,815,196 sequences; 2,006,227,497 total letters

Searching..................................................done

                                                                      Score     E
Sequences producing significant alignments:                           (bits)  Value

gb|EAY85293.1| hypothetical protein OsI_006526 [Oryza sativa (in...     311   4e-83
ref|NP_001046503.1| Os02g0266100 [Oryza sativa (japonica cultiva...     303   1e-80
ref|NP_567250.1| proline-rich family protein [Arabidopsis thalia...     165   5e-39
gb|AAO44073.1| At4g03120 [Arabidopsis thaliana] >gi|110743941|db...     163   2e-38
gb|AAD14440.1| putative C-type U1 snRNP [Arabidopsis thaliana] >...     156   3e-36

>gb|EAY85293.1| hypothetical protein OsI_006526 [Oryza sativa (indica cultivar-group)]
          Length = 238

 Score = 311 bits (798), Expect = 4e-83
 Identities = 165/263 (62%), Positives = 172/263 (65%), Gaps = 12/263 (4%)
 Frame = +2

Query: 68  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRNYYQQFEEQQTQSLIDQRIKEHLGQ 247
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVR YYQQFEEQQTQSLIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 248 AAAFQVGAPFNQHLLSFPGNVPRPRLPILPTPVMPHGVPQAPGAPLIPGVRPPILPALGV 427
           AAAFQVGAPFNQHLLSFPG VPRPRLPILPTP MP GVPQ PGAPL+PGVRPPILPA G+
Sbjct: 61  AAAFQVGAPFNQHLLSFPGGVPRPRLPILPTPGMPLGVPQVPGAPLMPGVRPPILPAPGI 120

Query: 428 PGKNAH*ADVLVL*KIYPSNSSFMEQDEMYCSRHVKGYPGAPTVPTIPQTGAP------- 586
           PG                         ++   + VKGYPGAP VPT+PQTGAP
Sbjct: 121 PG-------------------------QIQKQQCVKGYPGAPNVPTMPQTGAPPGSMPPG 155

Query: 587 -----TMLMQMAXXXXXXXXXXXTSGAPGAPIHNSAAPPAMYQTNXXXXXXXXXXXXXXX 751
                +M MQMA           TSGAPGAPI NS APPAMYQTN
Sbjct: 156 SMPPGSMPMQMAPLPRPPTLPPPTSGAPGAPIPNSGAPPAMYQTNPPQPAGPTSGAPPPV 215

Query: 752 XXXXXXXXXXXXFSYAQLSEGNH 820
                       FSYAQ  EGNH
Sbjct: 216 AAPPPAAPPQAPFSYAQPPEGNH 238

>ref|NP_001046503.1| Os02g0266100 [Oryza sativa (japonica cultivar-group)]
 dbj|BAD27897.1| putative u1 small nuclear ribonucleoprotein C [Oryza sativa (japonica cultivar-group)]
 dbj|BAF08417.1| Os02g0266100 [Oryza sativa (japonica cultivar-group)]
 gb|EAZ22511.1| hypothetical protein OsJ_005994 [Oryza sativa (japonica cultivar-group)]
          Length = 228

 Score = 303 bits (776), Expect = 1e-80
 Identities = 162/263 (61%), Positives = 166/263 (63%), Gaps = 12/263 (4%)
 Frame = +2

Query: 68  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRNYYQQFEEQQTQSLIDQRIKEHLGQ 247
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVR YYQQFEEQQTQSLIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 248 AAAFQVGAPFNQHLLSFPGNVPRPRLPILPTPVMPHGVPQAPGAPLIPGVRPPILPALGV 427
           AAAFQVGAPFNQHLLSFPG VPRPRLPILPTP MP GVPQ PGAPL+PGVRPPILPA G+
Sbjct: 61  AAAFQVGAPFNQHLLSFPGGVPRPRLPILPTPGMPLGVPQVPGAPLMPGVRPPILPAPGI 120

Query: 428 PGKNAH*ADVLVL*KIYPSNSSFMEQDEMYCSRHVKGYPGAPTVPTIPQTGAP------- 586
           P                                   GYPGAP VPT+PQTGAP
Sbjct: 121 P-----------------------------------GYPGAPNVPTMPQTGAPPGSMPPG 145

Query: 587 -----TMLMQMAXXXXXXXXXXXTSGAPGAPIHNSAAPPAMYQTNXXXXXXXXXXXXXXX 751
                +M MQMA           TSGAPGAPI NS APPAMYQTN
Sbjct: 146 SMPPGSMPMQMAPLPRPPTLPPPTSGAPGAPIPNSGAPPAMYQTNPPQPAGPTSGAPPPV 205

Query: 752 XXXXXXXXXXXXFSYAQLSEGNH 820
                       FSYAQ  EGNH
Sbjct: 206 SAPPPAAPPQAPFSYAQPPEGNH 228

>ref|NP_567250.1| proline-rich family protein [Arabidopsis thaliana]
 dbj|BAD93742.1| putative C-type U1 snRNP [Arabidopsis thaliana]
          Length = 207

 Score = 165 bits (417), Expect = 5e-39
 Identities = 93/142 (65%), Positives = 100/142 (70%), Gaps = 20/142 (14%)
 Frame = +2

Query: 68  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRNYYQQFEEQQTQSLIDQRIKEHLGQ 247
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVR YYQQFEEQQTQSLIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRIYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 248 AAAF-QVGAPFNQHLLSF--------PGNVPR-PRLPILPTPVM-------PHGVPQ--- 367
              + QVGA FNQH+L+         PG++P   R P+LP P+M       P GVPQ
Sbjct: 61  TGGYQQVGAVFNQHMLARPRPPMMLPPGSMPMGMRPPVLPRPMMPPQGYMPPPGVPQMMA 120

Query: 368 APGAPLIPGVRPPILPALGVPG 433
            PGAPL P   PP    L  PG
Sbjct: 121 PPGAPLPP---PPQNGILRPPG 139

>gb|AAO44073.1| At4g03120 [Arabidopsis thaliana]
 dbj|BAE99804.1| putative C-type U1 snRNP [Arabidopsis thaliana]
          Length = 207

 Score = 163 bits (412), Expect = 2e-38
 Identities = 92/142 (64%), Positives = 100/142 (70%), Gaps = 20/142 (14%)
 Frame = +2

Query: 68  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRNYYQQFEEQQTQSLIDQRIKEHLGQ 247
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVR Y+QQFEEQQTQSLIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRIYHQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 248 AAAF-QVGAPFNQHLLSF--------PGNVPR-PRLPILPTPVM-------PHGVPQ--- 367
              + QVGA FNQH+L+         PG++P   R P+LP P+M       P GVPQ
Sbjct: 61  TGGYQQVGAVFNQHMLARPRPPMMLPPGSMPMGMRPPVLPRPMMPPQGYMPPPGVPQMMA 120

Query: 368 APGAPLIPGVRPPILPALGVPG 433
            PGAPL P   PP    L  PG
Sbjct: 121 PPGAPLPP---PPQNGILRPPG 139

>gb|AAD14440.1| putative C-type U1 snRNP [Arabidopsis thaliana]
 emb|CAB77797.1| putative C-type U1 snRNP [Arabidopsis thaliana]
          Length = 112

 Score = 156 bits (394), Expect = 3e-36
 Identities = 78/105 (74%), Positives = 85/105 (80%), Gaps = 10/105 (9%)
 Frame = +2

Query: 68  MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRNYYQQFEEQQTQSLIDQRIKEHLGQ 247
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVR YYQQFEEQQTQSLIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRIYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 248 AAAF-QVGAPFNQHLLSF--------PGNVPR-PRLPILPTPVMP 352
              + QVGA FNQH+L+         PG++P   R P+LP P+MP
Sbjct: 61  TGGYQQVGAVFNQHMLARPRPPMMLPPGSMPMGMRPPVLPRPMMP 105