BLASTX 2.2.17

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= bphyst016b18
         (929 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           5,815,196 sequences; 2,006,227,497 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|EAZ26021.1| hypothetical protein OsJ_009504 [Oryza sativa (ja...   172   3e-41
ref|NP_001049340.1| Os03g0210400 [Oryza sativa (japonica cultiva...   172   3e-41
gb|EAY88999.1| hypothetical protein OsI_010232 [Oryza sativa (in...   168   4e-40
emb|CAO40560.1| unnamed protein product [Vitis vinifera]               91   8e-17
ref|NP_563700.1| hydroxyproline-rich glycoprotein family protein...    81   8e-14

>gb|EAZ26021.1| hypothetical protein OsJ_009504 [Oryza sativa (japonica cultivar-group)]
          Length = 799

 Score = 172 bits (436), Expect = 3e-41
 Identities = 88/127 (69%), Positives = 96/127 (75%)
 Frame = +2

Query: 2   TPEPTQAELASASDKEEISSIFLEFLDLFGDAQSIKKATTRHTNLFSRKRSILPSKKRRA 181
           T EPT+ E+ S +DKE+ISSIFLEFLDLFGDAQ+IKKAT RH   FSRKRS+L SKKRRA
Sbjct: 607 TAEPTEGEVTSLADKEDISSIFLEFLDLFGDAQAIKKATNRHLTHFSRKRSMLSSKKRRA 666

Query: 182 DDGIMSDRDKLAKTGDVTQPVMGTDPNAPNPPVWPATSEASXXXXXXXXXXXXXXXXXXT 361
           DD IMSDRDKLA+ GD TQPV+GTDPNA NPPVWPATSEAS                  T
Sbjct: 667 DDVIMSDRDKLARIGDGTQPVVGTDPNAHNPPVWPATSEASGQQWGAAYAPQATYPAYGT 726

Query: 362 YDYSHQM 382
           YDYSHQM
Sbjct: 727 YDYSHQM 733

>ref|NP_001049340.1| Os03g0210400 [Oryza sativa (japonica cultivar-group)]
 gb|ABF94590.1| Pre-mRNA processing protein prp39, putative, expressed [Oryza sativa (japonica cultivar-group)]
 dbj|BAF11254.1| Os03g0210400 [Oryza sativa (japonica cultivar-group)]
          Length = 789

 Score = 172 bits (436), Expect = 3e-41
 Identities = 88/127 (69%), Positives = 96/127 (75%)
 Frame = +2

Query: 2   TPEPTQAELASASDKEEISSIFLEFLDLFGDAQSIKKATTRHTNLFSRKRSILPSKKRRA 181
           T EPT+ E+ S +DKE+ISSIFLEFLDLFGDAQ+IKKAT RH   FSRKRS+L SKKRRA
Sbjct: 597 TAEPTEGEVTSLADKEDISSIFLEFLDLFGDAQAIKKATNRHLTHFSRKRSMLSSKKRRA 656

Query: 182 DDGIMSDRDKLAKTGDVTQPVMGTDPNAPNPPVWPATSEASXXXXXXXXXXXXXXXXXXT 361
           DD IMSDRDKLA+ GD TQPV+GTDPNA NPPVWPATSEAS                  T
Sbjct: 657 DDVIMSDRDKLARIGDGTQPVVGTDPNAHNPPVWPATSEASGQQWGAAYAPQATYPAYGT 716

Query: 362 YDYSHQM 382
           YDYSHQM
Sbjct: 717 YDYSHQM 723

>gb|EAY88999.1| hypothetical protein OsI_010232 [Oryza sativa (indica cultivar-group)]
          Length = 799

 Score = 168 bits (426), Expect = 4e-40
 Identities = 87/127 (68%), Positives = 94/127 (74%)
 Frame = +2

Query: 2   TPEPTQAELASASDKEEISSIFLEFLDLFGDAQSIKKATTRHTNLFSRKRSILPSKKRRA 181
           T EPT  E+ S +DKE+ISSIFLEFLDLFGDAQ+IKKAT RH   FS KRS+L SKKRRA
Sbjct: 607 TAEPTDGEVTSLADKEDISSIFLEFLDLFGDAQAIKKATNRHLTHFSWKRSMLSSKKRRA 666

Query: 182 DDGIMSDRDKLAKTGDVTQPVMGTDPNAPNPPVWPATSEASXXXXXXXXXXXXXXXXXXT 361
           DD IMSDRDKLA+ GD TQPV+GTDPNA NPPVWPATSEAS                  T
Sbjct: 667 DDVIMSDRDKLARIGDGTQPVVGTDPNAHNPPVWPATSEASGQQWGAAYAPQATYPAYGT 726

Query: 362 YDYSHQM 382
           YDYSHQM
Sbjct: 727 YDYSHQM 733

>emb|CAO40560.1| unnamed protein product [Vitis vinifera]
          Length = 754

 Score = 91.3 bits (225), Expect = 8e-17
 Identities = 53/110 (48%), Positives = 65/110 (59%), Gaps = 13/110 (11%)
 Frame = +2

Query: 11  PTQAELASASDKEEISSIFLEFLDLFGDAQSIKKATTRHTNLFSRKRSILPSKKRRADDG 190
           P     ASA+++EE+SSIFLEFLDLFGDAQSIKKA  RH  LF   RS    KKR A+D
Sbjct: 559 PESPNAASAAEREELSSIFLEFLDLFGDAQSIKKADDRHAKLFLHHRSTSELKKRHAEDF 618

Query: 191 IMSDRDKLAKT----GDVTQPVMGTDPNAPN---------PPVWPATSEA 301
           + SD+ KLAK+        Q +MG  P+A N         P  WP  ++A
Sbjct: 619 LASDKAKLAKSYSGVPSPAQSLMGAYPSAQNQWASGYGLQPQAWPQATQA 668

>ref|NP_563700.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana]
 gb|AAL07170.1| unknown protein [Arabidopsis thaliana]
 gb|AAM14126.1| unknown protein [Arabidopsis thaliana]
 gb|AAM65430.1| unknown [Arabidopsis thaliana]
          Length = 768

 Score = 81.3 bits (199), Expect = 8e-14
 Identities = 47/111 (42%), Positives = 62/111 (55%), Gaps = 12/111 (10%)
 Frame = +2

Query: 5   PEPTQAELASASDKEEISSIFLEFLDLFGDAQSIKKATTRHTNLFSRKRSILPSKKRRAD 184
           P+     +AS++++EE+S I++EFL +FGD +SIKKA  +H  LF   RS    KKR AD
Sbjct: 572 PDADAQNIASSTEREELSLIYIEFLGIFGDVKSIKKAEDQHVKLFYPHRSTSELKKRSAD 631

Query: 185 DGIMSDRDKLAKTGDVT---QPVMGTDPN---------APNPPVWPATSEA 301
           D + SDR K+AKT + T   QPV    PN         A  P  WP    A
Sbjct: 632 DFLASDRTKMAKTYNGTPPAQPVSNAYPNAQAQWSGGYAAQPQTWPPAQAA 682