BLASTX 1.5.4-Paracel [2003-06-05]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AF361819.F
(1647 letters)
Database: BlastDB/NCBI/blast/db/FASTA/2004_07_04_01_00_0/nr
1,895,298 sequences; 629,712,918 total letters
Searching...................................................done
Sequences producing significant alignments: Score (bits) E Value
gb|AAK32832.1| At1g09870 [Arabidopsis thaliana] 410 e-113
pir||B84497 hypothetical protein At2g11240 [imported] - Ara... 205 6e-69
pir||T04018 hypothetical protein F17A8.60 - Arabidopsis tha... 157 3e-58
pir||H84465 hypothetical protein At2g05200 [imported] - Ara... 144 4e-58
ref|NP_671791.1| expressed protein [Arabidopsis thaliana] 134 8e-30
pir||T01474 hypothetical protein T24H24.17 - Arabidopsis th... 132 6e-42
pir||C84528 hypothetical protein At2g15380 [imported] - Ara... 132 1e-38
dbj|BAB08714.1| non-LTR retroelement reverse transcriptase-... 124 9e-54
pir||G84721 hypothetical protein At2g31520 [imported] - Ara... 124 1e-34
pir||G84649 hypothetical protein At2g25550 [imported] - Ara... 124 1e-34
>gb|AAK32832.1| At1g09870 [Arabidopsis thaliana]
Length = 201
Score = 410 bits (1054), Expect = e-113
Identities = 201/201 (100%), Positives = 201/201 (100%)
Frame = +1
Query: 469 MFQAPGFSLQTHPISSNTFLVGLFPSFNDALLAKVSWRLLTKPSSMLAKVLLGKYCQSSS 648
MFQAPGFSLQTHPISSNTFLVGLFPSFNDALLAKVSWRLLTKPSSMLAKVLLGKYCQSSS
Sbjct: 1 MFQAPGFSLQTHPISSNTFLVGLFPSFNDALLAKVSWRLLTKPSSMLAKVLLGKYCQSSS 60
Query: 649 FLDCKVSNNASHGWRGICLGRDLLKTQLGRAIGTGKDSKIWQEPWISLSKPVTPMGPATQ 828
FLDCKVSNNASHGWRGICLGRDLLKTQLGRAIGTGKDSKIWQEPWISLSKPVTPMGPATQ
Sbjct: 61 FLDCKVSNNASHGWRGICLGRDLLKTQLGRAIGTGKDSKIWQEPWISLSKPVTPMGPATQ 120
Query: 829 FSQELTVADLICPSSNSKSGEYSAKTGYLVANLREERPLLNPPVVEFDWLREICFFIAAS 1008
FSQELTVADLICPSSNSKSGEYSAKTGYLVANLREERPLLNPPVVEFDWLREICFFIAAS
Sbjct: 121 FSQELTVADLICPSSNSKSGEYSAKTGYLVANLREERPLLNPPVVEFDWLREICFFIAAS 180
Query: 1009 QAQFGTNLPSRLHSALPKSRR 1071
QAQFGTNLPSRLHSALPKSRR
Sbjct: 181 QAQFGTNLPSRLHSALPKSRR 201
>pir||B84497 hypothetical protein At2g11240 [imported] - Arabidopsis
thaliana|gi|4263655|gb|AAD15377.1| putative non-LTR
retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 1044
Score = 205 bits (522), Expect = 2e-51
Identities = 103/209 (49%), Positives = 145/209 (69%)
Frame = +3
Query: 1017 IWDKSPFKTSFCSSQITSLRKGLETSKLLTNLPPTGLGDVPLLPWILWTIWISRNKRIFE 1196
+WD SPFKT+ +S+ITS+++GLE SKLL LPP G+G L WILW +W RNK IFE
Sbjct: 803 VWDLSPFKTTLQASRITSMKQGLEVSKLLVTLPPIGIGQGQLPIWILWNLWNCRNKLIFE 862
Query: 1197 KRQITNFEAVAQAVSQAREWCASQLVSPQSPQLRLPYQIEEIGLDTIRGFTDAAWKAETK 1376
++ I++ + ++Q++SQ+ EW +Q+ + +S + EI LDTI+ TDA+W+ ET
Sbjct: 863 QKHISSMDLISQSISQSTEWLGAQIQASKSKIVIPGISPSEIDLDTIQCSTDASWREETL 922
Query: 1377 EAGFGWHFSDFLCNTERHGRSSASNVRSPLMAEALAMLHAIHQARDLGYKKLSLASDSQQ 1556
+AGFGW F D + E H +++A N+RSPL+A+A A+ AI A DLG+KKL +ASDSQQ
Sbjct: 923 QAGFGWVFVDHSNHLESHHKAAAMNIRSPLLAKASALSLAIQHAADLGFKKLVVASDSQQ 982
Query: 1557 LIKALNLELQSKELYGILHDILSLSLTFD 1643
L+K LN E EL+GI+ DI LSL F+
Sbjct: 983 LVKVLNGEPHPMELHGIVFDISVLSLNFE 1011
Score = 167 bits (422), Expect = 8e-40
Identities = 72/104 (69%), Positives = 94/104 (90%)
Frame = +3
Query: 3 QKWVEWIMQCITSVSYSFLLNGAAQGSVKPQRGIRQGDPLSPYIFILCSEVLSGLCQKAQ 182
Q W+ WI+QCIT+VSYSFLLNG+AQG+V P+RG+RQGDPLSP++FI+CSEVLSGLC+KAQ
Sbjct: 426 QTWISWILQCITTVSYSFLLNGSAQGAVTPERGLRQGDPLSPFLFIICSEVLSGLCRKAQ 485
Query: 183 QNGDLLGVRVAKGSPRLKHLLFADDTMFFCKFDVQSCQNLMLIL 314
+G LLG+RV+KG+PR+ HLLFADDT+FFC+ D++SC+ + IL
Sbjct: 486 LDGSLLGLRVSKGNPRVNHLLFADDTIFFCRSDLKSCKTFLCIL 529
Score = 152 bits (385), Expect(2) = 6e-69
Identities = 66/112 (58%), Positives = 90/112 (79%)
Frame = +1
Query: 544 SFNDALLAKVSWRLLTKPSSMLAKVLLGKYCQSSSFLDCKVSNNASHGWRGICLGRDLLK 723
+FNDALLAK+SWR++ PS +L ++LLGKYC++SSFLDC V+ +SHGWRGIC G+DL+K
Sbjct: 686 NFNDALLAKLSWRIVQSPSCVLVRILLGKYCRTSSFLDCSVTAASSHGWRGICTGKDLIK 745
Query: 724 TQLGRAIGTGKDSKIWQEPWISLSKPVTPMGPATQFSQELTVADLICPSSNS 879
+QLG+ IG+G D+ +W EPW+SLS TPMGPA + + +TVA LIC ++ S
Sbjct: 746 SQLGKVIGSGLDTLVWNEPWLSLSTSSTPMGPALEQFKSMTVAQLICQTTKS 797
Score = 133 bits (334), Expect(2) = 6e-69
Identities = 62/77 (80%), Positives = 70/77 (90%)
Frame = +2
Query: 311 PELFGRKKKDMFNFIIDRIRQRAKSWSSRFLSTAGKATMLKAVLVAMPTYTMSCFKLPGS 490
P++FGRKK+D+FN I+DRIRQR+ SWSSRFLSTAGK TMLK+VL +MPTYTMSCFKL S
Sbjct: 580 PKMFGRKKRDLFNQIVDRIRQRSLSWSSRFLSTAGKTTMLKSVLASMPTYTMSCFKLLVS 639
Query: 491 LCKRIQSALTRFWWDSS 541
LCKRIQSALT FWWDSS
Sbjct: 640 LCKRIQSALTHFWWDSS 656
>pir||T04018 hypothetical protein F17A8.60 - Arabidopsis
thaliana|gi|4538901|emb|CAB39638.1| RNA-directed DNA
polymerase-like protein [Arabidopsis
thaliana]|gi|7267666|emb|CAB78094.1| RNA-directed DNA
polymerase-like protein [Arabidopsis thaliana]
Length = 1274
Score = 157 bits (396), Expect = 9e-37
Identities = 69/103 (66%), Positives = 87/103 (83%)
Frame = +3
Query: 6 KWVEWIMQCITSVSYSFLLNGAAQGSVKPQRGIRQGDPLSPYIFILCSEVLSGLCQKAQQ 185
KW+ W+MQC+ +VSYSFL+NG+ QGSV P RG+RQGDPLSPY+FILC+EVLSGLC+KAQ+
Sbjct: 542 KWIRWVMQCVCTVSYSFLINGSPQGSVVPSRGLRQGDPLSPYLFILCTEVLSGLCRKAQE 601
Query: 186 NGDLLGVRVAKGSPRLKHLLFADDTMFFCKFDVQSCQNLMLIL 314
G ++G+RVA+GSP++ HLLFADDTMFFCK + C L IL
Sbjct: 602 KGVMVGIRVARGSPQVNHLLFADDTMFFCKTNPTCCGALSNIL 644
Score = 130 bits (327), Expect(2) = 3e-58
Identities = 72/178 (40%), Positives = 97/178 (54%), Gaps = 37/178 (20%)
Frame = +1
Query: 565 AKVSWRLLTKPSSMLAKVLLGKYCQSSSFLDCKVSNN-ASHGWRGICLGRDLLKTQLGRA 741
AK+SWR+L +P S+L++VLLGKYC +SSF+DC S + ASHGWRGI GRDLL+ LG +
Sbjct: 801 AKLSWRILKEPHSLLSRVLLGKYCNTSSFMDCSASPSFASHGWRGILAGRDLLRKGLGWS 860
Query: 742 IGTGKDSKIWQEPWISLSKPVTPMGPATQFSQELTVADLICPSSNS-------------- 879
IG G +W E W+S S P TP+GP T+ +++L+V DLIC S
Sbjct: 861 IGQGDSINVWTEAWLSPSSPQTPIGPPTETNKDLSVHDLICHDVKSWNVEAIRKHLPQYE 920
Query: 880 ----------------------KSGEYSAKTGYLVANLREERPLLNPPVVEFDWLREI 987
KSGEY+ KTGY +A L ++F+W + I
Sbjct: 921 DQIRKITINALPLQDSLVWLPVKSGEYTTKTGYALAKLNS----FPASQLDFNWQKNI 974
Score = 124 bits (310), Expect = 8e-27
Identities = 76/218 (34%), Positives = 111/218 (50%)
Frame = +3
Query: 987 LFFHCSFASSIWDKSPFKTSFCSSQITSLRKGLETSKLLTNLPPTGLGDVPLLPWILWTI 1166
L C +A +W+ +P + + +S+ L +K + LPPTGLG PL PW+LW +
Sbjct: 1024 LMLLCPYAKKVWELAPVLFNPSEATHSSVALLLVDAKRMVALPPTGLGSAPLYPWLLWHL 1083
Query: 1167 WISRNKRIFEKRQITNFEAVAQAVSQAREWCASQLVSPQSPQLRLPYQIEEIGLDTIRGF 1346
W +RN+ IF+ + V +A+ AR W +QL+ P Y L F
Sbjct: 1084 WKARNRLIFDNHSCSEEGLVLKAILDARAWMEAQLLI-HHPSPISDYPSPTPNLKVTSCF 1142
Query: 1347 TDAAWKAETKEAGFGWHFSDFLCNTERHGRSSASNVRSPLMAEALAMLHAIHQARDLGYK 1526
DAAW + G GW D + +SS+S V S LMAE LA+ A+ A G +
Sbjct: 1143 VDAAW-TTSGYCGMGWFLQDPYKVKIKENQSSSSFVGSALMAETLAVHLALVDALSTGVR 1201
Query: 1527 KLSLASDSQQLIKALNLELQSKELYGILHDILSLSLTF 1640
+L++ SD ++LI LN EL G+LHDI LS++F
Sbjct: 1202 QLNVFSDCKELISLLNSGKSIVELRGLLHDIRELSVSF 1239
Score = 119 bits (299), Expect(2) = 3e-58
Identities = 54/76 (71%), Positives = 65/76 (85%)
Frame = +2
Query: 311 PELFGRKKKDMFNFIIDRIRQRAKSWSSRFLSTAGKATMLKAVLVAMPTYTMSCFKLPGS 490
PE FGR+K+D+F+ I+DRIRQR+ SWS RFLS+AGK +LKAVL +MP+Y M CFKLP S
Sbjct: 695 PEHFGRRKRDIFSSIVDRIRQRSHSWSIRFLSSAGKQILLKAVLSSMPSYAMMCFKLPAS 754
Query: 491 LCKRIQSALTRFWWDS 538
LCK+IQS LTRFWWDS
Sbjct: 755 LCKQIQSVLTRFWWDS 770
>pir||H84465 hypothetical protein At2g05200 [imported] - Arabidopsis
thaliana|gi|4755191|gb|AAD29058.1| putative non-LTR
retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1229
Score = 144 bits (363), Expect = 6e-33
Identities = 66/103 (64%), Positives = 81/103 (78%)
Frame = +3
Query: 9 WVEWIMQCITSVSYSFLLNGAAQGSVKPQRGIRQGDPLSPYIFILCSEVLSGLCQKAQQN 188
W++W+++C+TSVSYSFL+NG QG V P RG+RQGDPLSP +FILC+EVLSGLC +AQ+
Sbjct: 497 WIDWVLECVTSVSYSFLINGTPQGKVVPTRGLRQGDPLSPCLFILCTEVLSGLCTRAQRL 556
Query: 189 GDLLGVRVAKGSPRLKHLLFADDTMFFCKFDVQSCQNLMLILS 317
L GVRV+ PR+ HLLFADDTMFF K D +SC L ILS
Sbjct: 557 RQLPGVRVSINGPRVNHLLFADDTMFFSKSDPESCNKLSEILS 599
Score = 131 bits (330), Expect(2) = 4e-58
Identities = 69/178 (38%), Positives = 101/178 (55%), Gaps = 36/178 (20%)
Frame = +1
Query: 550 NDALLAKVSWRLLTKPSSMLAKVLLGKYCQSSSFLDCKVSNNASHGWRGICLGRDLLKTQ 729
ND+LLAK+ WRLL P S+L+++LLGKYC SSSF++CK+ + SHGWR I GR++LK
Sbjct: 757 NDSLLAKLGWRLLNSPESLLSRILLGKYCHSSSFMECKLPSQPSHGWRSIIAGREILKEG 816
Query: 730 LGRAIGTGKDSKIWQEPWISLSKPVTPMGPATQFSQELTVADLI---------------- 861
LG I G+ IW +PW+S+SKP+ P+GPA + Q+L V+ LI
Sbjct: 817 LGWLITNGEKVSIWNDPWLSISKPLVPIGPALREHQDLRVSALINQNTLQWDWNKIAVIL 876
Query: 862 -----------CPSSNS---------KSGEYSAKTGYLVANLREERPLLNPPVVEFDW 975
PSS KSG+Y++++GY +A++ + P +F+W
Sbjct: 877 PNYENLIKQLPAPSSRGVDKLAWLPVKSGQYTSRSGYGIASVAS----IPIPQTQFNW 930
Score = 118 bits (295), Expect(2) = 4e-58
Identities = 52/76 (68%), Positives = 65/76 (85%)
Frame = +2
Query: 311 PELFGRKKKDMFNFIIDRIRQRAKSWSSRFLSTAGKATMLKAVLVAMPTYTMSCFKLPGS 490
PE FGR+K+D+F IID+IRQ++ SW+SRFLS AGK MLKAVL +MP Y+MSCFKLP +
Sbjct: 649 PEHFGRRKRDIFGAIIDKIRQKSHSWASRFLSQAGKQVMLKAVLASMPLYSMSCFKLPSA 708
Query: 491 LCKRIQSALTRFWWDS 538
LC++IQS LTRFWWD+
Sbjct: 709 LCRKIQSLLTRFWWDT 724
Score = 90.1 bits (222), Expect = 1e-16
Identities = 66/220 (30%), Positives = 101/220 (45%), Gaps = 2/220 (0%)
Frame = +3
Query: 987 LFFHCSFASSIWDKSPFKTSFCSSQITSLRKGLETSKLLTNLPPTGLGDVPLLPWILWTI 1166
LFFHC FA+ +W+ +P + + +S+ L K LPPTG+ L PWI
Sbjct: 984 LFFHCEFAAQVWELAPLQETTVPPG-SSMLDALSLLKKAIILPPTGVTSAALFPWICG-- 1040
Query: 1167 WISRNKRIFEKRQITNFEAVAQAVSQAREWCASQLVSPQSPQLRLPYQIEEIGLDTIRGF 1346
I+ K + +A+ A W ++Q P++ + P + F
Sbjct: 1041 -------IYGK-----LGTMTRAILDALAWQSAQRCLPKTRNVVHPISQLPVLRSGYFCF 1088
Query: 1347 TDAAWKAETKEAGFGWHFSDFLCNTERHGRSSASNVRSP--LMAEALAMLHAIHQARDLG 1520
DAAW A++ AG GW F + SA R P L AEA A+ A+ A LG
Sbjct: 1089 VDAAWIAQSSLAGSGWVFQSATALEKETATYSAGCRRLPSALSAEAWAIKSALLHALQLG 1148
Query: 1521 YKKLSLASDSQQLIKALNLELQSKELYGILHDILSLSLTF 1640
L + SDS+ ++ AL + E+YG+L +I +L ++F
Sbjct: 1149 RTDLMVLSDSKSVVDALTSNISINEIYGLLMEIRALRVSF 1188
>ref|NP_671791.1| expressed protein [Arabidopsis thaliana]
Length = 315
Score = 134 bits (336), Expect = 8e-30
Identities = 75/208 (36%), Positives = 107/208 (51%)
Frame = +3
Query: 987 LFFHCSFASSIWDKSPFKTSFCSSQITSLRKGLETSKLLTNLPPTGLGDVPLLPWILWTI 1166
+ FHC FA +W +PF + S S+R+GL T L LPPTG+ PL PWI W++
Sbjct: 90 MLFHCQFAQLVWSPTPFARHWASVGAGSVREGLITGCKLACLPPTGIASGPLAPWICWSL 149
Query: 1167 WISRNKRIFEKRQITNFEAVAQAVSQAREWCASQLVSPQSPQLRLPYQIEEIGLDTIRGF 1346
W SRN+++F R T E + +A++ A+EW A+Q SP
Sbjct: 150 WKSRNQKVFSTRFFTPEETLLKAITNAKEWLAAQGKSP---------------------- 187
Query: 1347 TDAAWKAETKEAGFGWHFSDFLCNTERHGRSSASNVRSPLMAEALAMLHAIHQARDLGYK 1526
D +W+ + K AG GW F++ T H + NV SP MAE+LA + +A G
Sbjct: 188 -DGSWRNDLKAAGLGWTFTNPAGLTSHHS-ALCKNVSSPFMAESLACGAVVLEAVRAGAD 245
Query: 1527 KLSLASDSQQLIKALNLELQSKELYGIL 1610
L SD QQL+ A+N L E++GI+
Sbjct: 246 SFLLESDYQQLVVAINARLVLLEVHGII 273
>pir||T01474 hypothetical protein T24H24.17 - Arabidopsis
thaliana|gi|3377824|gb|AAC28197.1| contains similarity
to reverse transcriptases [Arabidopsis
thaliana]|gi|7267156|emb|CAB77868.1| putative reverse
transcriptase [Arabidopsis thaliana]
Length = 1077
Score = 132 bits (332), Expect(2) = 6e-42
Identities = 62/76 (81%), Positives = 68/76 (88%)
Frame = +2
Query: 311 PELFGRKKKDMFNFIIDRIRQRAKSWSSRFLSTAGKATMLKAVLVAMPTYTMSCFKLPGS 490
PELFGRKKKD+F I+DRI+QRA SWSSRFLS+AGK TMLK+VL MPTYTMSCF+LP S
Sbjct: 689 PELFGRKKKDLFTAIVDRIKQRALSWSSRFLSSAGKLTMLKSVLSTMPTYTMSCFQLPLS 748
Query: 491 LCKRIQSALTRFWWDS 538
LCKRIQS LTRFWWDS
Sbjct: 749 LCKRIQSTLTRFWWDS 764
Score = 77.0 bits (188), Expect = 1e-12
Identities = 42/71 (59%), Positives = 52/71 (73%)
Frame = +3
Query: 1431 GRSSASNVRSPLMAEALAMLHAIHQARDLGYKKLSLASDSQQLIKALNLELQSKELYGIL 1610
G SSAS+V SPL+AE L L + A + K +S ASDSQ ++KALN +LQ KEL+GIL
Sbjct: 978 GNSSASDVGSPLIAETLVTLADVRVACESDIKAISFASDSQIIVKALNQKLQVKELHGIL 1037
Query: 1611 HDILSLSLTFD 1643
HDILSLS +FD
Sbjct: 1038 HDILSLSSSFD 1048
Score = 76.6 bits (187), Expect = 2e-12
Identities = 36/55 (65%), Positives = 42/55 (75%)
Frame = +3
Query: 150 EVLSGLCQKAQQNGDLLGVRVAKGSPRLKHLLFADDTMFFCKFDVQSCQNLMLIL 314
EVLS +C KAQ+NG L G+RVA G PR+ HLLFADDTMFFC+ D +SC L IL
Sbjct: 584 EVLSRMCLKAQENGSLPGIRVAMGCPRVNHLLFADDTMFFCRVDPKSCLKLKSIL 638
Score = 63.2 bits (152), Expect(2) = 6e-42
Identities = 57/152 (37%), Positives = 74/152 (48%), Gaps = 4/152 (2%)
Frame = +1
Query: 544 SFNDALLAKVSWRLLTKPSSMLAKVLLGKYCQSSSFLDCKVSNNASHGWRGICLGRDLLK 723
+FN ALLAK+SWRLL PS +LAK+LLGKYC+ S+ LD + N L R LL
Sbjct: 795 TFNKALLAKLSWRLLQNPSCLLAKLLLGKYCKHSTLLDWTTNWNKE-------LIRQLLP 847
Query: 724 TQLGRAIGTGKDSKIWQEPWISLSKPVTPMGPATQFSQELTVADLICPSSNSKSGEYSAK 903
E I L KP + +G A +++ P+S SG Y+A
Sbjct: 848 V---------------YEKDILLLKP-SLLGAADRWAW--------LPTS---SGLYAAN 880
Query: 904 TGYLVANLREERPLLNPP----VVEFDWLREI 987
+GY A L+E L NPP V F+W I
Sbjct: 881 SGYFEA-LKETASLENPPNHDLVKNFNWRSNI 911
>pir||C84528 hypothetical protein At2g15380 [imported] - Arabidopsis
thaliana|gi|4544375|gb|AAD22286.1| putative non-LTR
retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 1311
Score = 132 bits (331), Expect = 3e-29
Identities = 84/193 (43%), Positives = 108/193 (55%)
Frame = +3
Query: 1062 ITSLRKGLETSKLLTNLPPTGLGDVPLLPWILWTIWISRNKRIFEKRQITNFEAVAQAVS 1241
+ S+ +GL + L LPP G+G PL PWI WTIW SRN+ IF K+ I +A+ +A+
Sbjct: 1108 LQSVTEGLIAAIKLICLPPIGIGLGPLAPWIFWTIWTSRNQLIFNKKSIPVVDALGRALR 1167
Query: 1242 QAREWCASQLVSPQSPQLRLPYQIEEIGLDTIRGFTDAAWKAETKEAGFGWHFSDFLCNT 1421
+ P P L + + TDA+WK + AG GW F D T
Sbjct: 1168 KL----------PFKPLLLI---------NPFICLTDASWKTDA-HAGLGWIFMDNNERT 1207
Query: 1422 ERHGRSSASNVRSPLMAEALAMLHAIHQARDLGYKKLSLASDSQQLIKALNLELQSKELY 1601
G + ++V SPLMAEALA L AI A L +S SDS L+KALN +LQSKEL+
Sbjct: 1208 FSKGSLAVAHVGSPLMAEALATLIAIRVAIGLALTHISFVSDSLTLVKALNSKLQSKELH 1267
Query: 1602 GILHDILSLSLTF 1640
GILHDIL+LS TF
Sbjct: 1268 GILHDILALSSTF 1280
Score = 100 bits (249), Expect(2) = 1e-38
Identities = 51/103 (49%), Positives = 63/103 (60%)
Frame = +3
Query: 6 KWVEWIMQCITSVSYSFLLNGAAQGSVKPQRGIRQGDPLSPYIFILCSEVLSGLCQKAQQ 185
+W+ WIMQC+ +VSYSFL NG A+G EVLSGLC A
Sbjct: 825 QWISWIMQCVCTVSYSFLFNGQAKG-----------------------EVLSGLCNVALS 861
Query: 186 NGDLLGVRVAKGSPRLKHLLFADDTMFFCKFDVQSCQNLMLIL 314
+G L+G+RV+K SPR+ HLLFADDTMFFC+ D +SC L IL
Sbjct: 862 HGSLVGLRVSKQSPRVNHLLFADDTMFFCRSDPKSCPKLKQIL 904
Score = 84.0 bits (206), Expect(2) = 1e-38
Identities = 37/45 (82%), Positives = 42/45 (93%)
Frame = +2
Query: 404 STAGKATMLKAVLVAMPTYTMSCFKLPGSLCKRIQSALTRFWWDS 538
STAGK TMLK+VL AMP+YTMSCFK+P +LCKRIQSALTRFWWD+
Sbjct: 920 STAGKLTMLKSVLSAMPSYTMSCFKIPNNLCKRIQSALTRFWWDT 964
Score = 35.8 bits (81), Expect = 2.9
Identities = 15/31 (48%), Positives = 20/31 (64%)
Frame = +1
Query: 781 WISLSKPVTPMGPATQFSQELTVADLICPSS 873
W+SL P PMGP T+ +Q V+DL+ P S
Sbjct: 973 WLSLISPQAPMGPPTEDTQNWLVSDLLFPGS 1003
>dbj|BAB08714.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
thaliana]
Length = 1197
Score = 124 bits (311), Expect(2) = 9e-54
Identities = 69/186 (37%), Positives = 92/186 (49%), Gaps = 36/186 (19%)
Frame = +1
Query: 538 FPSFNDALLAKVSWRLLTKPSSMLAKVLLGKYCQSSSFLDCKVSNNASHGWRGICLGRDL 717
F ++N ALLAK SWR+L PS +LA+VLLGKYCQ+ SFL ASHGWR + GRDL
Sbjct: 688 FKAYNVALLAKQSWRVLPHPSCLLARVLLGKYCQNESFLKALSPTFASHGWRDLLAGRDL 747
Query: 718 LKTQLGRAIGTGKDSKIWQEPWISLSKPVTPMGPATQFSQELTVADLICPSS-------- 873
L+ QLG IG K +WQ+ W++ K + P+GPA S++ V DL P+S
Sbjct: 748 LRKQLGWTIGNDKSVLVWQDAWLAHDKRIQPVGPAPLLSKDWKVKDLFLPNSVDWDEGKI 807
Query: 874 ----------------------------NSKSGEYSAKTGYLVANLREERPLLNPPVVEF 969
+K G Y+ KTGY + PV+ F
Sbjct: 808 EAAIPFVKDLVLAIRPSRAGGADKRIWLGAKDGLYTTKTGYKATLCDVDSDSSRDPVLSF 867
Query: 970 DWLREI 987
+W E+
Sbjct: 868 NWFDEV 873
Score = 118 bits (296), Expect = 3e-25
Identities = 75/219 (34%), Positives = 111/219 (50%), Gaps = 3/219 (1%)
Frame = +3
Query: 993 FHCSFASSIWDKSPFKTSFCSSQITSLRKGLETSKLLTNLPPTGLGDVPLLPWILWTIWI 1172
F+C F +W +P + T ++ GL K LP GLG L PWI WT+WI
Sbjct: 925 FNCMFVQRVWRLAPLEGIDLFLVFTDVKLGLSWLKKKKTLPSVGLGYCALYPWICWTLWI 984
Query: 1173 SRNKRIFEKRQITNFEAVAQAVSQAREWCASQLVSPQSP-QLR--LPYQIEEIGLDTIRG 1343
+RN+++F + E + +A+ A+EW +Q P S Q+R + + E L++
Sbjct: 985 TRNQKVFSNILFSELETINKAIQDAKEWQFAQ--EPNSRFQVRSSMGLRAEVFSLNSTVI 1042
Query: 1344 FTDAAWKAETKEAGFGWHFSDFLCNTERHGRSSASNVRSPLMAEALAMLHAIHQARDLGY 1523
TDAAW T AG GW +++ S +V SPL+AEAL + A+ A +LG
Sbjct: 1043 HTDAAWSVLTCSAGLGWTLKSPGIPLKKNSALS-EHVPSPLVAEALVVHSALWSAHNLGL 1101
Query: 1524 KKLSLASDSQQLIKALNLELQSKELYGILHDILSLSLTF 1640
+ L SD Q L A+ + E++GIL DI S S +F
Sbjct: 1102 HDVILKSDCQVLTMAIISQFPLSEIHGILQDITSSSSSF 1140
Score = 110 bits (276), Expect(2) = 9e-54
Identities = 51/76 (67%), Positives = 61/76 (80%)
Frame = +2
Query: 314 ELFGRKKKDMFNFIIDRIRQRAKSWSSRFLSTAGKATMLKAVLVAMPTYTMSCFKLPGSL 493
E FGR+KKD+F I+D+I+Q+A SWSSRFLS AGK ML++VL A+P Y MSCF LP SL
Sbjct: 585 EHFGRRKKDLFTSIVDKIKQKAISWSSRFLSGAGKQVMLQSVLTAIPVYAMSCFLLPLSL 644
Query: 494 CKRIQSALTRFWWDSS 541
C RIQS LTRFWWD +
Sbjct: 645 CDRIQSTLTRFWWDKT 660
Score = 38.9 bits (89), Expect = 0.35
Identities = 18/37 (48%), Positives = 24/37 (64%)
Frame = +3
Query: 27 QCITSVSYSFLLNGAAQGSVKPQRGIRQGDPLSPYIF 137
+C+++VSYSFL+NG G V P RG +G SP F
Sbjct: 498 KCVSTVSYSFLVNGVPMGEVIPSRGKSKGLQSSPADF 534
>pir||G84721 hypothetical protein At2g31520 [imported] - Arabidopsis
thaliana|gi|4582447|gb|AAD24831.1| putative non-LTR
retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 1524
Score = 124 bits (311), Expect = 6e-27
Identities = 54/99 (54%), Positives = 72/99 (72%)
Frame = +3
Query: 6 KWVEWIMQCITSVSYSFLLNGAAQGSVKPQRGIRQGDPLSPYIFILCSEVLSGLCQKAQQ 185
KW+ WIM + SV YS L+NG+ G + P RGIRQGDPLSPY+FILC ++LS L
Sbjct: 757 KWIGWIMAAVKSVHYSVLINGSPHGYITPTRGIRQGDPLSPYLFILCGDILSHLINGRAS 816
Query: 186 NGDLLGVRVAKGSPRLKHLLFADDTMFFCKFDVQSCQNL 302
+GDL GVR+ G+P + HL FADD++FFC+ +V++CQ L
Sbjct: 817 SGDLRGVRIGNGAPAITHLQFADDSLFFCQANVRNCQAL 855
Score = 104 bits (260), Expect(2) = 1e-34
Identities = 45/77 (58%), Positives = 58/77 (74%)
Frame = +2
Query: 311 PELFGRKKKDMFNFIIDRIRQRAKSWSSRFLSTAGKATMLKAVLVAMPTYTMSCFKLPGS 490
PE FGRKKK+MF +IIDR+++R +WS+RFLS AGK MLK+V +AMP Y MSCFKLP
Sbjct: 910 PEQFGRKKKEMFEYIIDRVKKRTSTWSARFLSPAGKEIMLKSVALAMPVYAMSCFKLPKG 969
Query: 491 LCKRIQSALTRFWWDSS 541
+ I+S L FWW+ +
Sbjct: 970 IVSEIESLLMNFWWEKA 986
Score = 66.6 bits (161), Expect(2) = 1e-34
Identities = 38/104 (36%), Positives = 59/104 (56%)
Frame = +1
Query: 547 FNDALLAKVSWRLLTKPSSMLAKVLLGKYCQSSSFLDCKVSNNASHGWRGICLGRDLLKT 726
FNDALLAK +WRL+ P+S+ A+V+ +Y + S LD KV S+GW + G LLK
Sbjct: 1017 FNDALLAKQAWRLIQYPNSLFARVMKARYFKDVSILDAKVRKQQSYGWASLLDGIALLKK 1076
Query: 727 QLGRAIGTGKDSKIWQEPWISLSKPVTPMGPATQFSQELTVADL 858
IG G++ +I + + S P P+ + +E+T+ +L
Sbjct: 1077 GTRHLIGDGQNIRIGLDNIVD-SHPPRPLNTEETY-KEMTINNL 1118
Score = 62.4 bits (150), Expect = 3e-08
Identities = 53/203 (26%), Positives = 89/203 (43%), Gaps = 9/203 (4%)
Frame = +3
Query: 993 FHCSFASSIWDKSPFKTSFCSSQITSLRKGLETSKLLTNLPPTGLGDV-PLLP-WILWTI 1166
F C FA+ W S +S +Q+ S S +L + T + D LLP W++W I
Sbjct: 1255 FTCPFATMAWWLSD--SSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRI 1312
Query: 1167 WISRNKRIFEKRQITNFEAVAQAVSQAREWCASQLVSPQSPQLRLPYQIEEIGLDT---- 1334
W +RN +F K + + + V A ++ +W + ++P QI E ++
Sbjct: 1313 WKARNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHKKTPSPTR--QIAENKIEWRNPP 1370
Query: 1335 ---IRGFTDAAWKAETKEAGFGWHFSDFLCNTERHGRSSASNVRSPLMAEALAMLHAIHQ 1505
++ DA + + EA GW + G ++ +PL AE A+L A+ Q
Sbjct: 1371 ATYVKCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAALQQ 1430
Query: 1506 ARDLGYKKLSLASDSQQLIKALN 1574
GY ++ + D Q LI +N
Sbjct: 1431 TWIRGYTQVFMEGDCQTLINLIN 1453
>pir||G84649 hypothetical protein At2g25550 [imported] - Arabidopsis
thaliana|gi|4432866|gb|AAD20714.1| putative non-LTR
retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 1750
Score = 124 bits (311), Expect = 6e-27
Identities = 54/99 (54%), Positives = 72/99 (72%)
Frame = +3
Query: 6 KWVEWIMQCITSVSYSFLLNGAAQGSVKPQRGIRQGDPLSPYIFILCSEVLSGLCQKAQQ 185
KW+ WIM + SV YS L+NG+ G + P RGIRQGDPLSPY+FILC ++LS L
Sbjct: 983 KWIGWIMAAVKSVHYSVLINGSPHGYITPTRGIRQGDPLSPYLFILCGDILSHLINGRAS 1042
Query: 186 NGDLLGVRVAKGSPRLKHLLFADDTMFFCKFDVQSCQNL 302
+GDL GVR+ G+P + HL FADD++FFC+ +V++CQ L
Sbjct: 1043 SGDLRGVRIGNGAPAITHLQFADDSLFFCQANVRNCQAL 1081
Score = 104 bits (260), Expect(2) = 1e-34
Identities = 45/77 (58%), Positives = 58/77 (74%)
Frame = +2
Query: 311 PELFGRKKKDMFNFIIDRIRQRAKSWSSRFLSTAGKATMLKAVLVAMPTYTMSCFKLPGS 490
PE FGRKKK+MF +IIDR+++R +WS+RFLS AGK MLK+V +AMP Y MSCFKLP
Sbjct: 1136 PEQFGRKKKEMFEYIIDRVKKRTSTWSARFLSPAGKEIMLKSVALAMPVYAMSCFKLPKG 1195
Query: 491 LCKRIQSALTRFWWDSS 541
+ I+S L FWW+ +
Sbjct: 1196 IVSEIESLLMNFWWEKA 1212
Score = 66.6 bits (161), Expect(2) = 1e-34
Identities = 38/104 (36%), Positives = 59/104 (56%)
Frame = +1
Query: 547 FNDALLAKVSWRLLTKPSSMLAKVLLGKYCQSSSFLDCKVSNNASHGWRGICLGRDLLKT 726
FNDALLAK +WRL+ P+S+ A+V+ +Y + S LD KV S+GW + G LLK
Sbjct: 1243 FNDALLAKQAWRLIQYPNSLFARVMKARYFKDVSILDAKVRKQQSYGWASLLDGIALLKK 1302
Query: 727 QLGRAIGTGKDSKIWQEPWISLSKPVTPMGPATQFSQELTVADL 858
IG G++ +I + + S P P+ + +E+T+ +L
Sbjct: 1303 GTRHLIGDGQNIRIGLDNIVD-SHPPRPLNTEETY-KEMTINNL 1344
Score = 63.2 bits (152), Expect = 2e-08
Identities = 53/203 (26%), Positives = 89/203 (43%), Gaps = 9/203 (4%)
Frame = +3
Query: 993 FHCSFASSIWDKSPFKTSFCSSQITSLRKGLETSKLLTNLPPTGLGDV-PLLP-WILWTI 1166
F C FA+ W S +S +Q+ S S +L + T + D LLP W++W I
Sbjct: 1481 FTCPFATMAWRLSD--SSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRI 1538
Query: 1167 WISRNKRIFEKRQITNFEAVAQAVSQAREWCASQLVSPQSPQLRLPYQIEEIGLDT---- 1334
W +RN +F K + + + V A ++ +W + ++P QI E ++
Sbjct: 1539 WKARNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHKKTPSPTR--QIAENKIEWRNPP 1596
Query: 1335 ---IRGFTDAAWKAETKEAGFGWHFSDFLCNTERHGRSSASNVRSPLMAEALAMLHAIHQ 1505
++ DA + + EA GW + G ++ +PL AE A+L A+ Q
Sbjct: 1597 ATYVKCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAALQQ 1656
Query: 1506 ARDLGYKKLSLASDSQQLIKALN 1574
GY ++ + D Q LI +N
Sbjct: 1657 TWIRGYTQVFMEGDCQTLINLIN 1679
Database: BlastDB/NCBI/blast/db/FASTA/2004_07_04_01_00_0/nr
Posted date: Jul 5, 2004 10:54 PM
Number of letters in database: 629,712,918
Number of sequences in database: 1,895,298
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
length of database: 629,712,918
effective HSP length: 131
effective length of database: 381,428,880
effective search space used: 159055842960
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)