BLASTX 1.4.13-Paracel [2002-12-12]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= PTA.CL.1.cl.162.Contig1
(2382 letters)
Database: BlastDB/NCBI/blast/db/2003_02_17_02_00_1/nr
1,339,046 sequences; 429,188,541 total letters
Searching...................................................done
                                                                 Score  E
Sequences producing significant alignments:                     (bits)  Value
ref|NP_563726.1| clathrin protein family; protein id: At1g...     1120  0.0
dbj|BAC43049.1| putative protein destination factor [Arabid...     246  1e-63
ref|NP_192174.1| unknown protein; protein id: At4g02650.1 [...     246  1e-63
pir||T05400 hypothetical protein F10M6.80 - Arabidopsis tha...     243  1e-62
ref|NP_567892.1| putative protein; protein id: At4g32285.1,...     243  1e-62
>ref|NP_563726.1| clathrin protein family; protein id: At1g05020.1 [Arabidopsis
thaliana]|gi|25406826|pir||B86184 hypothetical protein
[imported] - Arabidopsis
thaliana|gi|4056423|gb|AAC97997.1|AAC97997 Similar to
clathrin assembly protein gb|X68878 (AP180) from Rattus
norvegicus. EST gb|W43552 comes from this gene.
[Arabidopsis thaliana]|gi|26450013|dbj|BAC42127.1|
putative clathrin protein [Arabidopsis thaliana]
Length = 653
Score = 1120 bits (2898), Expect = 0.0
Identities = 577/653 (88%), Positives = 577/653 (88%)
Frame = -2
Query: 2060 MPSKLKKAIGAVKDQTSISLAKVANGATGGGDLTTLEVAILKATSHDEEVPIDDRLVTEI 1881
            MPSKLKKAIGAVKDQTSISLAKVANGATGGGDLTTLEVAILKATSHDEEVPIDDRLVTEI
Sbjct: 1    MPSKLKKAIGAVKDQTSISLAKVANGATGGGDLTTLEVAILKATSHDEEVPIDDRLVTEI 60
Query: 1880 LXXXXXXXXXXXXXXXXXXXXXXXXRNWIVALKSLVLVLRIFQDGDPYFPREVLHAMKRG 1701
            L                        RNWIVALKSLVLVLRIFQDGDPYFPREVLHAMKRG
Sbjct: 61   LGIISSKKSHAASCAAAIGRRIGRTRNWIVALKSLVLVLRIFQDGDPYFPREVLHAMKRG 120
Query: 1700 AKILNLSSFRDDSNSCPWDFTAFVRTFALYLDERLDCFLTGKLQRRYTNREQTGRISTNS 1521
            AKILNLSSFRDDSNSCPWDFTAFVRTFALYLDERLDCFLTGKLQRRYTNREQTGRISTNS
Sbjct: 121  AKILNLSSFRDDSNSCPWDFTAFVRTFALYLDERLDCFLTGKLQRRYTNREQTGRISTNS 180
Query: 1520 TTRSRFNPKAGIKSHEPAVRDMKPVMLLDKITYWQKLLDRAIATRPTGDAKANRLVKMSL 1341
            TTRSRFNPKAGIKSHEPAVRDMKPVMLLDKITYWQKLLDRAIATRPTGDAKANRLVKMSL
Sbjct: 181  TTRSRFNPKAGIKSHEPAVRDMKPVMLLDKITYWQKLLDRAIATRPTGDAKANRLVKMSL 240
Query: 1340 YAVMQESFDLYRDISDGLALLLDSFFHLQYQSCINAFQACVRASKQFEELNAFYDLSKSI 1161
            YAVMQESFDLYRDISDGLALLLDSFFHLQYQSCINAFQACVRASKQFEELNAFYDLSKSI
Sbjct: 241  YAVMQESFDLYRDISDGLALLLDSFFHLQYQSCINAFQACVRASKQFEELNAFYDLSKSI 300
Query: 1160 GIGRTSEYPSIQKISLELLETLQEFLKDQSSFPASSGLYXXXXXXXXXXXXXXXXXXXXS 981
            GIGRTSEYPSIQKISLELLETLQEFLKDQSSFPASSGLY                    S
Sbjct: 301  GIGRTSEYPSIQKISLELLETLQEFLKDQSSFPASSGLYPSPNSFLPPPPSSKDSAVSSS 360
Query: 980  LDFGDSTIDTSERYSDYGSFRSTSLEDLMSRTEAGTSSPPMSCHSEPYGGGRDDPNGNNF 801
            LDFGDSTIDTSERYSDYGSFRSTSLEDLMSRTEAGTSSPPMSCHSEPYGGGRDDPNGNNF
Sbjct: 361  LDFGDSTIDTSERYSDYGSFRSTSLEDLMSRTEAGTSSPPMSCHSEPYGGGRDDPNGNNF 420
Query: 800  DTVSTKSLPNNPSVSASNXXXXXXXXXDVSNTAEAEDVEDKKKQDDSKAETFDPWEALML 621
            DTVSTKSLPNNPSVSASN         DVSNTAEAEDVEDKKKQDDSKAETFDPWEALML
Sbjct: 421  DTVSTKSLPNNPSVSASNLILDLLSLDDVSNTAEAEDVEDKKKQDDSKAETFDPWEALML 480
Query: 620  RDDPXXXXXXXXXEPSTAEDHQRDSGNWLLALEETATQVQGNNSMAIVPFGLDDPMPAFQ 441
            RDDP         EPSTAEDHQRDSGNWLLALEETATQVQGNNSMAIVPFGLDDPMPAFQ
Sbjct: 481  RDDPKKKIETIEEEPSTAEDHQRDSGNWLLALEETATQVQGNNSMAIVPFGLDDPMPAFQ 540
Query: 440  AATDQYNPFLEEPVAQLATAGEPMITFGGLALTGFQPEPTFQVNVPDDFEPSSTPTFKAT 261
            AATDQYNPFLEEPVAQLATAGEPMITFGGLALTGFQPEPTFQVNVPDDFEPSSTPTFKAT
Sbjct: 541  AATDQYNPFLEEPVAQLATAGEPMITFGGLALTGFQPEPTFQVNVPDDFEPSSTPTFKAT 600
Query: 260  ETLPMKCDPXXXXXXXXXXXXXXENGGVNQQSVLQEQQIWLQNQKKIIAKHLS 102
            ETLPMKCDP              ENGGVNQQSVLQEQQIWLQNQKKIIAKHLS
Sbjct: 601  ETLPMKCDPFTTFESFGFGETFSENGGVNQQSVLQEQQIWLQNQKKIIAKHLS 653
>dbj|BAC43049.1| putative protein destination factor [Arabidopsis thaliana]
Length = 611
Score = 246 bits (628), Expect = 1e-63
Identities = 131/339 (38%), Positives = 209/339 (61%), Gaps = 5/339 (1%)
Frame = -2
Query: 2054 SKLKKAIGAVKDQTSISLAKVANGATGGGDLTTLEVAILKATSHDEEVPIDDRLVTEILX 1875
SKLK+AIGAVKDQTS+ LAKV ++ LT LE+A++KAT HD+ P +D+ + EIL
Sbjct: 4 SKLKRAIGAVKDQTSVGLAKVGGRSSS---LTELEIAVVKATRHDD-YPAEDKYIREILC 59
Query: 1874 XXXXXXXXXXXXXXXXXXXXXXXRNWIVALKSLVLVLRIFQDGDPYFPREVLHAMKRGAK 1695
+NW VALK+L+L+ R+ DGD + +E+ A +RG +
Sbjct: 60 LTSYSRNYVSACVATLSRRLNKTKNWSVALKTLILIQRLLTDGDRAYEQEIFFATRRGTR 119
Query: 1694 ILNLSSFRDDSNSCPWDFTAFVRTFALYLDERLDCFLTGKLQRRYTNRE-----QTGRIS 1530
+LN+S FRD S S WD++AFVRT+ALYLDERLD + G+ ++ + +G
Sbjct: 120 LLNMSDFRDASQSDSWDYSAFVRTYALYLDERLDYRMQGRRGKKKSGGGGGGDGDSGEED 179
Query: 1529 TNSTTRSRFNPKAGIKSHEPAVRDMKPVMLLDKITYWQKLLDRAIATRPTGDAKANRLVK 1350
+ T + KA + +P V +MK + +++ + Q+LLDR +A RPTG+AK NR+V
Sbjct: 180 DHRGTSNDIRSKAIVVKSKP-VAEMKTEKIFNRVQHLQQLLDRFLACRPTGNAKNNRVVI 238
Query: 1349 MSLYAVMQESFDLYRDISDGLALLLDSFFHLQYQSCINAFQACVRASKQFEELNAFYDLS 1170
+++Y +++ESF LY +I++ + +L++ F L I ++ R SKQF+EL+ FY
Sbjct: 239 VAMYPIVKESFQLYYNITEIMGVLIERFMELDIHDSIKVYEIFCRVSKQFDELDPFYGWC 298
Query: 1169 KSIGIGRTSEYPSIQKISLELLETLQEFLKDQSSFPASS 1053
K++ + R+SEYP ++KI+ + L+ + EF++D+S+ A +
Sbjct: 299 KNMAVARSSEYPELEKITQKKLDLMDEFIRDKSALAAQT 337
>ref|NP_192174.1| unknown protein; protein id: At4g02650.1 [Arabidopsis
thaliana]|gi|7486857|pir||T01084 hypothetical protein
T10P11.8 - Arabidopsis
thaliana|gi|3892046|gb|AAC78254.1|AAC78254 predicted
protein destination factor [Arabidopsis
thaliana]|gi|7269025|emb|CAB80758.1| predicted protein
destination factor [Arabidopsis thaliana]
Length = 676
Score = 246 bits (628), Expect = 1e-63
Identities = 131/339 (38%), Positives = 209/339 (61%), Gaps = 5/339 (1%)
Frame = -2
Query: 2054 SKLKKAIGAVKDQTSISLAKVANGATGGGDLTTLEVAILKATSHDEEVPIDDRLVTEILX 1875
SKLK+AIGAVKDQTS+ LAKV ++ LT LE+A++KAT HD+ P +D+ + EIL
Sbjct: 4 SKLKRAIGAVKDQTSVGLAKVGGRSSS---LTELEIAVVKATRHDD-YPAEDKYIREILC 59
Query: 1874 XXXXXXXXXXXXXXXXXXXXXXXRNWIVALKSLVLVLRIFQDGDPYFPREVLHAMKRGAK 1695
+NW VALK+L+L+ R+ DGD + +E+ A +RG +
Sbjct: 60 LTSYSRNYVSACVATLSRRLNKTKNWSVALKTLILIQRLLTDGDRAYEQEIFFATRRGTR 119
Query: 1694 ILNLSSFRDDSNSCPWDFTAFVRTFALYLDERLDCFLTGKLQRRYTNRE-----QTGRIS 1530
+LN+S FRD S S WD++AFVRT+ALYLDERLD + G+ ++ + +G
Sbjct: 120 LLNMSDFRDASQSDSWDYSAFVRTYALYLDERLDYRMQGRRGKKKSGGGGGGDGDSGEED 179
Query: 1529 TNSTTRSRFNPKAGIKSHEPAVRDMKPVMLLDKITYWQKLLDRAIATRPTGDAKANRLVK 1350
+ T + KA + +P V +MK + +++ + Q+LLDR +A RPTG+AK NR+V
Sbjct: 180 DHRGTSNDIRSKAIVVKSKP-VAEMKTEKIFNRVQHLQQLLDRFLACRPTGNAKNNRVVI 238
Query: 1349 MSLYAVMQESFDLYRDISDGLALLLDSFFHLQYQSCINAFQACVRASKQFEELNAFYDLS 1170
+++Y +++ESF LY +I++ + +L++ F L I ++ R SKQF+EL+ FY
Sbjct: 239 VAMYPIVKESFQLYYNITEIMGVLIERFMELDIHDSIKVYEIFCRVSKQFDELDPFYGWC 298
Query: 1169 KSIGIGRTSEYPSIQKISLELLETLQEFLKDQSSFPASS 1053
K++ + R+SEYP ++KI+ + L+ + EF++D+S+ A +
Sbjct: 299 KNMAVARSSEYPELEKITQKKLDLMDEFIRDKSALAAQT 337
>pir||T05400 hypothetical protein F10M6.80 - Arabidopsis
thaliana|gi|2864615|emb|CAA16962.1| putative protein
[Arabidopsis thaliana]|gi|7270132|emb|CAB79946.1|
putative protein [Arabidopsis thaliana]
Length = 842
Score = 243 bits (619), Expect = 1e-62
Identities = 141/363 (38%), Positives = 210/363 (57%), Gaps = 33/363 (9%)
Frame = -2
Query: 2060 MPSKLKKAIGAVKDQTSISLAKVANGATGGGDLTTLEVAILKATSHDEEVPIDDRLVTEI 1881
M ++KAIG VKDQTSI +AKVA+ LEVAI+KATSHD++ D+ + EI
Sbjct: 208 MALSMRKAIGVVKDQTSIGIAKVASNMA-----PDLEVAIVKATSHDDDQS-SDKYIREI 261
Query: 1880 LXXXXXXXXXXXXXXXXXXXXXXXXRNWIVALKSLVLVLRIFQDGDPYFPREVLHAMKRG 1701
L R+WIVALK+L+LV R+ +GDP F E+L+A +RG
Sbjct: 262 LSLTSLSRGYVHACVTSVSRRLKKTRDWIVALKALMLVHRLLNEGDPLFQEEILYATRRG 321
Query: 1700 AKILNLSSFRDDSNSCPWDFTAFVRTFALYLDERLDCFLTGKLQRR----YTNREQTGRI 1533
+ILN+S FRD+++S WD +AFVRT+A YLD+RL+ L + R ++ + G
Sbjct: 322 TRILNMSDFRDEAHSSSWDHSAFVRTYASYLDQRLELALFERRGRNGGGSSSSHQSNGDD 381
Query: 1532 STNSTTRSRFNP--------------------------KAGIKSHEPAV---RDMKPVML 1440
N + +P + G + + +V R+M P +
Sbjct: 382 GYNRSRDDFRSPPPRTYDYETGNGFGMPKRSRSFGDVNEIGAREEKKSVTPLREMTPERI 441
Query: 1439 LDKITYWQKLLDRAIATRPTGDAKANRLVKMSLYAVMQESFDLYRDISDGLALLLDSFFH 1260
K+ + Q+LLDR ++ RPTG AK +R++ +++Y V++ESF LY DI + LA+LLD FF
Sbjct: 442 FGKMGHLQRLLDRFLSCRPTGLAKNSRMILIAMYPVVKESFRLYADICEVLAVLLDKFFD 501
Query: 1259 LQYQSCINAFQACVRASKQFEELNAFYDLSKSIGIGRTSEYPSIQKISLELLETLQEFLK 1080
++Y C+ AF A A+KQ +EL AFY K G+ R+SEYP +Q+I+ +LLETL+EF++
Sbjct: 502 MEYTDCVKAFDAYASAAKQIDELIAFYHWCKDTGVARSSEYPEVQRITSKLLETLEEFVR 561
Query: 1079 DQS 1071
D++
Sbjct: 562 DRA 564
>ref|NP_567892.1| putative protein; protein id: At4g32285.1, supported by cDNA:
gi_18700098 [Arabidopsis thaliana]
Length = 635
Score = 243 bits (619), Expect = 1e-62
Identities = 141/363 (38%), Positives = 210/363 (57%), Gaps = 33/363 (9%)
Frame = -2
Query: 2060 MPSKLKKAIGAVKDQTSISLAKVANGATGGGDLTTLEVAILKATSHDEEVPIDDRLVTEI 1881
M ++KAIG VKDQTSI +AKVA+ LEVAI+KATSHD++ D+ + EI
Sbjct: 1 MALSMRKAIGVVKDQTSIGIAKVASNMA-----PDLEVAIVKATSHDDDQS-SDKYIREI 54
Query: 1880 LXXXXXXXXXXXXXXXXXXXXXXXXRNWIVALKSLVLVLRIFQDGDPYFPREVLHAMKRG 1701
L R+WIVALK+L+LV R+ +GDP F E+L+A +RG
Sbjct: 55 LSLTSLSRGYVHACVTSVSRRLKKTRDWIVALKALMLVHRLLNEGDPLFQEEILYATRRG 114
Query: 1700 AKILNLSSFRDDSNSCPWDFTAFVRTFALYLDERLDCFLTGKLQRR----YTNREQTGRI 1533
+ILN+S FRD+++S WD +AFVRT+A YLD+RL+ L + R ++ + G
Sbjct: 115 TRILNMSDFRDEAHSSSWDHSAFVRTYASYLDQRLELALFERRGRNGGGSSSSHQSNGDD 174
Query: 1532 STNSTTRSRFNP--------------------------KAGIKSHEPAV---RDMKPVML 1440
N + +P + G + + +V R+M P +
Sbjct: 175 GYNRSRDDFRSPPPRTYDYETGNGFGMPKRSRSFGDVNEIGAREEKKSVTPLREMTPERI 234
Query: 1439 LDKITYWQKLLDRAIATRPTGDAKANRLVKMSLYAVMQESFDLYRDISDGLALLLDSFFH 1260
K+ + Q+LLDR ++ RPTG AK +R++ +++Y V++ESF LY DI + LA+LLD FF
Sbjct: 235 FGKMGHLQRLLDRFLSCRPTGLAKNSRMILIAMYPVVKESFRLYADICEVLAVLLDKFFD 294
Query: 1259 LQYQSCINAFQACVRASKQFEELNAFYDLSKSIGIGRTSEYPSIQKISLELLETLQEFLK 1080
++Y C+ AF A A+KQ +EL AFY K G+ R+SEYP +Q+I+ +LLETL+EF++
Sbjct: 295 MEYTDCVKAFDAYASAAKQIDELIAFYHWCKDTGVARSSEYPEVQRITSKLLETLEEFVR 354
Query: 1079 DQS 1071
D++
Sbjct: 355 DRA 357
Database: BlastDB/NCBI/blast/db/2003_02_17_02_00_1/nr
Posted date: Feb 17, 2003 10:02 AM
Number of letters in database: 429,188,541
Number of sequences in database: 1,339,046
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
length of database: 429,188,541
effective HSP length: 131
effective length of database: 253,773,515
effective search space used: 167998066930
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)