BLASTX 1.4.13-Paracel [2002-12-12]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= PTA.CL.2.cl.256.Contig1
(2192 letters)
Database: BlastDB/NCBI/blast/db/2003_02_17_02_00_1/nr
1,339,046 sequences; 429,188,541 total letters
Searching...................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_173278.1| unknown protein; protein id: At1g18420.1 [... 287 0.0
ref|NP_188473.1| unknown protein; protein id: At3g18440.1, ... 143 e-100
ref|NP_179338.1| unknown protein; protein id: At2g17470.1 [... 119 5e-64
ref|NP_173919.1| hypothetical protein; protein id: At1g2548... 112 8e-64
ref|NP_564935.1| expressed protein; protein id: At1g68600.1... 116 4e-49
>ref|NP_173278.1| unknown protein; protein id: At1g18420.1 [Arabidopsis
thaliana]|gi|25370611|pir||B86318 protein F15H18.9
[imported] - Arabidopsis
thaliana|gi|6714301|gb|AAF25997.1|AC013354_16 F15H18.9
[Arabidopsis thaliana]
Length = 581
Score = 287 bits (735), Expect(5) = 0.0
Identities = 161/229 (70%), Positives = 165/229 (71%), Gaps = 33/229 (14%)
Frame = -3
Query: 588 K*GGALRHCAIMVM---------------------------GVTWLHSLRDPGTFQSLSQ 490
K GGALRHCAIMVM G+ LR G +
Sbjct: 345 KVGGALRHCAIMVMALHGCILSEIQAAEDRRREFRNELQRVGIEGAKVLRYIGESLKKME 404
Query: 489 KT------FYNLQILREI*DKSMQSNCRLLVNAKNWEIGNRPRVRDLTDEQKISNLDSDL 328
K Y + E + LLVNAKNWEIGNRPRVRDLTDEQKISNLDSDL
Sbjct: 405 KLNPIEDILYEIHQAAEELQSKIDKKSYLLVNAKNWEIGNRPRVRDLTDEQKISNLDSDL 464
Query: 327 SRILAHKSQSEATLRPPKNWDDVTTAANLSSATMLPYLQSRTMIHKQPSWPSRISITPGS 148
SRILAHKSQSEATLRPPKNWDDVTTAANLSSATMLPYLQSRTMIHKQPSWPSRISITPGS
Sbjct: 465 SRILAHKSQSEATLRPPKNWDDVTTAANLSSATMLPYLQSRTMIHKQPSWPSRISITPGS 524
Query: 147 MLQPPLGEPGKMYESASNLSLATFASLLIEFVARLENLVNAYDELSVKA 1
MLQPPLGEPGKMYESASNLSLATFASLLIEFVARLENLVNAYDELSVKA
Sbjct: 525 MLQPPLGEPGKMYESASNLSLATFASLLIEFVARLENLVNAYDELSVKA 573
Score = 181 bits (459), Expect(5) = 0.0
Identities = 88/89 (98%), Positives = 88/89 (98%)
Frame = -1
Query: 1178 VSAFFATYAKLYPTMKPYEYGFRVFLLTYCYVIVSGYKTGEFMETAVSRFLLIALGASVG 999
V AFFATYAKLYPTMKPYEYGFRVFLLTYCYVIVSGYKTGEFMETAVSRFLLIALGASVG
Sbjct: 182 VVAFFATYAKLYPTMKPYEYGFRVFLLTYCYVIVSGYKTGEFMETAVSRFLLIALGASVG 241
Query: 998 LIVNTCIYPIWAGEDLHNLVAKNFVNVAT 912
LIVNTCIYPIWAGEDLHNLVAKNFVNVAT
Sbjct: 242 LIVNTCIYPIWAGEDLHNLVAKNFVNVAT 270
Score = 135 bits (339), Expect(5) = 0.0
Identities = 66/77 (85%), Positives = 66/77 (85%)
Frame = -2
Query: 811 GCVNGYLECVAYDTIPSRILVYEAVAEDPVYSGYRSAVQYLFLVVGVIVKMSFASWEPPH 632
GCVNGYLECVAYDTIPSRILVYEAVAEDPVYSGYRSAVQ MSFASWEPPH
Sbjct: 274 GCVNGYLECVAYDTIPSRILVYEAVAEDPVYSGYRSAVQ---STSQEDTLMSFASWEPPH 330
Query: 631 GPYKSFRYPWALYVKVG 581
GPYKSFRYPWALYVKVG
Sbjct: 331 GPYKSFRYPWALYVKVG 347
Score = 126 bits (317), Expect(5) = 0.0
Identities = 63/63 (100%), Positives = 63/63 (100%)
Frame = -2
Query: 1624 SYFSDKITGVVKKLKDVLVTAWEMGTADPRKMIFSAKMGLALTLTSILIFFKIPGLELSG 1445
SYFSDKITGVVKKLKDVLVTAWEMGTADPRKMIFSAKMGLALTLTSILIFFKIPGLELSG
Sbjct: 61 SYFSDKITGVVKKLKDVLVTAWEMGTADPRKMIFSAKMGLALTLTSILIFFKIPGLELSG 120
Query: 1444 HYL 1436
HYL
Sbjct: 121 HYL 123
Score = 124 bits (311), Expect = 6e-27
Identities = 59/63 (93%), Positives = 60/63 (94%)
Frame = -1
Query: 1958 MGAPKLESFRRGSMFDGSFRRGSMFDGSFRQSMRDRLILQSRGYSNVNDDDKTSVRCCSY 1779
M APKLESFRRGSMFDGSFRRGSMFDGSFRQSMRDRLILQSRGYSNVNDDDKTSVRCCSY
Sbjct: 1 MAAPKLESFRRGSMFDGSFRRGSMFDGSFRQSMRDRLILQSRGYSNVNDDDKTSVRCCSY 60
Query: 1778 RLY 1770
+
Sbjct: 61 SYF 63
Score = 74.3 bits (181), Expect(5) = 0.0
Identities = 36/52 (69%), Positives = 37/52 (70%)
Frame = -3
Query: 1413 FCIGATFSKGCNRXXXXXXXXXXXXGMSWISEMTGNWADVFNAASIFVVGIY 1258
F IGATFSKGCNR GMSWISEMTGNWADVFNAASIFVV +
Sbjct: 135 FSIGATFSKGCNRGLGTLSAGGLALGMSWISEMTGNWADVFNAASIFVVAFF 186
>ref|NP_188473.1| unknown protein; protein id: At3g18440.1, supported by cDNA:
gi_20466393 [Arabidopsis
thaliana]|gi|11994107|dbj|BAB01110.1|
gb|AAF25997.1~gene_id:MYF24.16~strong similarity to
unknown protein [Arabidopsis
thaliana]|gi|20466394|gb|AAM20514.1| unknown protein
[Arabidopsis thaliana]|gi|23198100|gb|AAN15577.1| unknown
protein [Arabidopsis thaliana]
Length = 598
Score = 143 bits (360), Expect(5) = e-100
Identities = 65/86 (75%), Positives = 76/86 (87%)
Frame = -1
Query: 1169 FFATYAKLYPTMKPYEYGFRVFLLTYCYVIVSGYKTGEFMETAVSRFLLIALGASVGLIV 990
F AT+ KLYP+MK YEYGFRVFLLTYCY+++SG++TG+F+E A+SRFLLIALGA V L V
Sbjct: 181 FLATFMKLYPSMKAYEYGFRVFLLTYCYILISGFRTGQFIEVAISRFLLIALGAGVSLGV 240
Query: 989 NTCIYPIWAGEDLHNLVAKNFVNVAT 912
N IYPIWAGEDLHNLV KNF+NVAT
Sbjct: 241 NMFIYPIWAGEDLHNLVVKNFMNVAT 266
Score = 101 bits (252), Expect(5) = e-100
Identities = 48/81 (59%), Positives = 58/81 (71%)
Frame = -2
Query: 811 GCVNGYLECVAYDTIPSRILVYEAVAEDPVYSGYRSAVQYLFLVVGVIVKMSFASWEPPH 632
GCVNGYL C+ Y+ IPS+IL Y+A +EDPVY GYRSAV+ + MSFA WEPPH
Sbjct: 270 GCVNGYLRCLEYERIPSKILTYQA-SEDPVYKGYRSAVESTSQEESL---MSFAIWEPPH 325
Query: 631 GPYKSFRYPWALYVKVGWCIE 569
GPYKSF YPW YVK+ ++
Sbjct: 326 GPYKSFNYPWKNYVKLSGALK 346
Score = 97.4 bits (241), Expect(5) = e-100
Identities = 86/236 (36%), Positives = 112/236 (47%), Gaps = 40/236 (16%)
Frame = -3
Query: 588 K*GGALRHCAIMVMGVTW--LHSLRDPGTF-----QSLSQKTFYNLQILREI*DK----- 445
K GAL+HCA VM + L ++ P Q L + ++LRE+ +K
Sbjct: 340 KLSGALKHCAFTVMALHGCILSEIQAPEERRQVFRQELQRVGVEGAKLLRELGEKVKKME 399
Query: 444 --------------------SMQSNCRLLVNAKNWEIGNRPRVRDLTDEQKISNLDSDLS 325
+ LLVN++ WEIGNR ++ ++ +S DSD
Sbjct: 400 KLGPVDLLFEVHLAAEELQHKIDKKSYLLVNSECWEIGNRA-TKESEPQELLSLEDSDPP 458
Query: 324 R-----ILAHKSQSEATLRPPKNWDDVTTAANLSSATMLPYLQSRTMIHKQPSWPSRISI 160
I A KS SEA L P +W + N A L R KQ SWP+R+ +
Sbjct: 459 ENHAPPIYAFKSLSEAVLEIPPSWGE----KNHREA-----LNHRPTFSKQVSWPARLVL 509
Query: 159 TPGSMLQ---PPLGEPGKMYESASNLSLATFASLLIEFVARLENLVNAYDELSVKA 1
P PL E K YESAS LSLATFASLLIEFVARL+N+V+A+ ELS KA
Sbjct: 510 PPHLETTNGASPLVETTKTYESASALSLATFASLLIEFVARLQNVVDAFKELSQKA 565
Score = 72.0 bits (175), Expect(5) = e-100
Identities = 33/60 (55%), Positives = 45/60 (75%)
Frame = -2
Query: 1627 CSYFSDKITGVVKKLKDVLVTAWEMGTADPRKMIFSAKMGLALTLTSILIFFKIPGLELS 1448
C S+KI+GV KDV AWEMG +DPRK++FSAK+GLALT+ ++LIF++ P +LS
Sbjct: 56 CGNLSEKISGVYDDAKDVARKAWEMGVSDPRKIVFSAKIGLALTIVALLIFYQEPNPDLS 115
Score = 40.8 bits (94), Expect(5) = e-100
Identities = 20/50 (40%), Positives = 27/50 (54%)
Frame = -3
Query: 1413 FCIGATFSKGCNRXXXXXXXXXXXXGMSWISEMTGNWADVFNAASIFVVG 1264
F IGAT SKG NR GM+ +S + G+W ++F SIF +G
Sbjct: 131 FTIGATLSKGFNRALGTLSAGGLALGMAELSTLFGDWEEIFCTLSIFCIG 180
>ref|NP_179338.1| unknown protein; protein id: At2g17470.1 [Arabidopsis
thaliana]|gi|25370610|pir||E84552 hypothetical protein
At2g17470 [imported] - Arabidopsis
thaliana|gi|4914368|gb|AAD32904.1| unknown protein
[Arabidopsis thaliana]
Length = 538
Score = 119 bits (298), Expect(3) = 5e-64
Identities = 55/88 (62%), Positives = 68/88 (76%)
Frame = -1
Query: 1178 VSAFFATYAKLYPTMKPYEYGFRVFLLTYCYVIVSGYKTGEFMETAVSRFLLIALGASVG 999
++ F A+Y+KL+P MKPYEY FRVFLLT+C V+VSG TG+F TA RFL I +GA+
Sbjct: 130 LAGFIASYSKLHPAMKPYEYAFRVFLLTFCIVLVSGNNTGDFFSTAYYRFLFIVVGATTC 189
Query: 998 LIVNTCIYPIWAGEDLHNLVAKNFVNVA 915
L+VN I+PIWAGEDLH LVA NF +VA
Sbjct: 190 LVVNIFIFPIWAGEDLHKLVANNFKSVA 217
Score = 94.4 bits (233), Expect(3) = 5e-64
Identities = 41/80 (51%), Positives = 57/80 (71%)
Frame = -2
Query: 811 GCVNGYLECVAYDTIPSRILVYEAVAEDPVYSGYRSAVQYLFLVVGVIVKMSFASWEPPH 632
GCVNGYL+CV Y+ +PS+IL Y+ ++DP+YSGYRSA+Q ++ FA WEPPH
Sbjct: 222 GCVNGYLQCVEYERVPSKILTYQT-SDDPLYSGYRSAIQSTNQEESLL---DFAIWEPPH 277
Query: 631 GPYKSFRYPWALYVKVGWCI 572
GPY++F +PW YVK+ +
Sbjct: 278 GPYRTFNHPWKNYVKLSGAV 297
Score = 77.0 bits (188), Expect(3) = 5e-64
Identities = 72/229 (31%), Positives = 105/229 (45%), Gaps = 33/229 (14%)
Frame = -3
Query: 588 K*GGALRHCAIMVMGV----------------TWLHSLRDPGT---------------FQ 502
K GA+RHCA VM + + H L+ G +
Sbjct: 292 KLSGAVRHCAFTVMAIHGCILSEIQAAPEKRQAFRHELQRVGNEGAKVLRLIGEKVEKME 351
Query: 501 SLSQKTFYN-LQILREI*DKSMQSNCRLLVNAKNWEIGN-RPRVRDLTDEQKISNLDSDL 328
+L N +Q E + S LLVN+++W + + +E + + L
Sbjct: 352 NLGPGEILNDVQRAAEELQMKIDSKSYLLVNSESWAATKEKAEAEEYEEEAHETKVIKSL 411
Query: 327 SRILAHKSQSEATLRPPKNWDDVTTAANLSSATMLPYLQSRTMIHKQPSWPSRISITPGS 148
S+I S S + P + +D + + + ML + +WPS +S GS
Sbjct: 412 SQIWDTNSSSNN--QNPASGNDESQIWESTESMML---------RNRETWPS-VSFIGGS 459
Query: 147 MLQPPLGEPGKMYESASNLSLATFASLLIEFVARLENLVNAYDELSVKA 1
++ + K+YESAS+LSLATFASLLIEFVARLENLVNA++ELS KA
Sbjct: 460 VVNETVY---KVYESASSLSLATFASLLIEFVARLENLVNAFEELSTKA 505
Score = 38.1 bits (87), Expect = 0.57
Identities = 16/32 (50%), Positives = 24/32 (75%)
Frame = -2
Query: 1558 EMGTADPRKMIFSAKMGLALTLTSILIFFKIP 1463
E+G +D R++ F+ KMG+AL L S++IF K P
Sbjct: 31 ELGHSDRRRIFFAVKMGMALALCSVVIFLKEP 62
>ref|NP_173919.1| hypothetical protein; protein id: At1g25480.1 [Arabidopsis
thaliana]|gi|25370613|pir||A86385 hypothetical protein
F2J7.18 [imported] - Arabidopsis
thaliana|gi|12321496|gb|AAG50799.1|AC079281_1
hypothetical protein [Arabidopsis thaliana]
Length = 548
Score = 112 bits (280), Expect(3) = 8e-64
Identities = 55/88 (62%), Positives = 64/88 (72%)
Frame = -1
Query: 1178 VSAFFATYAKLYPTMKPYEYGFRVFLLTYCYVIVSGYKTGEFMETAVSRFLLIALGASVG 999
++ F A+Y KLYP MK YEY FRVFLLTYC V+VSG + +F TA RFLLI +GA +
Sbjct: 158 IAGFSASYLKLYPAMKSYEYAFRVFLLTYCIVLVSGNNSRDFFSTAYYRFLLILVGAGIC 217
Query: 998 LIVNTCIYPIWAGEDLHNLVAKNFVNVA 915
L VN I PIWAGEDLH LV KNF +VA
Sbjct: 218 LGVNIFILPIWAGEDLHKLVVKNFKSVA 245
Score = 98.2 bits (243), Expect(3) = 8e-64
Identities = 45/80 (56%), Positives = 58/80 (72%)
Frame = -2
Query: 811 GCVNGYLECVAYDTIPSRILVYEAVAEDPVYSGYRSAVQYLFLVVGVIVKMSFASWEPPH 632
GCVNGYL+CV Y+ IPS+IL Y+A ++DP+YSGYRS VQ ++ FA WEPPH
Sbjct: 250 GCVNGYLQCVEYERIPSKILTYQA-SDDPLYSGYRSVVQSTSQEDSLL---DFAVWEPPH 305
Query: 631 GPYKSFRYPWALYVKVGWCI 572
GPYK+F +PWA YVK+ +
Sbjct: 306 GPYKTFHHPWANYVKLSGAV 325
Score = 79.3 bits (194), Expect(3) = 8e-64
Identities = 71/228 (31%), Positives = 103/228 (45%), Gaps = 32/228 (14%)
Frame = -3
Query: 588 K*GGALRHCAIMVMGV----------------TWLHSLRDPGT---------------FQ 502
K GA+RHCA MVM + + L+ G +
Sbjct: 320 KLSGAVRHCAFMVMAMHGCILSEIQAAPEKRQAFRQELQRVGNEGAKVLRLFGEKVEKME 379
Query: 501 SLSQ-KTFYNLQILREI*DKSMQSNCRLLVNAKNWEIGNRPRVRDLTDEQKISNLDSDLS 325
LS ++Q E + SN LLVN+++W + + +Q D S
Sbjct: 380 KLSPGNVLKDVQRAAEELQMKIDSNSFLLVNSESW-AAMKEKAEAEEAQQNYHEAKDDES 438
Query: 324 RILAHKSQSEATLRPPKNWDDVTTAANLSSATMLPYLQSRTMIHKQPSWPSRISITPGSM 145
+++ SQ P + + + + + L M+ + +WPS +S GSM
Sbjct: 439 KVIQSLSQIWDNNNNPHHQNQ-----HAGNDSQLWISTESMMLRNRENWPS-VSFIGGSM 492
Query: 144 LQPPLGEPGKMYESASNLSLATFASLLIEFVARLENLVNAYDELSVKA 1
+ K+YESAS+LSLATFASLLIEFVARL+N+VNAY+ELS KA
Sbjct: 493 INE---IESKVYESASSLSLATFASLLIEFVARLQNIVNAYEELSTKA 537
Score = 48.5 bits (114), Expect = 4e-04
Identities = 25/53 (47%), Positives = 33/53 (62%)
Frame = -2
Query: 1621 YFSDKITGVVKKLKDVLVTAWEMGTADPRKMIFSAKMGLALTLTSILIFFKIP 1463
+ SD IT K L D+ +EMG +D RK+ FS KMG+AL L S +I+ K P
Sbjct: 38 FCSDGITASWKALYDIGAKLYEMGRSDRRKVYFSVKMGMALALCSFVIYLKEP 90
>ref|NP_564935.1| expressed protein; protein id: At1g68600.1, supported by cDNA:
gi_16648825 [Arabidopsis
thaliana]|gi|16648826|gb|AAL25603.1| At1g68600/F24J5_14
[Arabidopsis thaliana]|gi|22655352|gb|AAM98268.1|
At1g68600/F24J5_14 [Arabidopsis thaliana]
Length = 537
Score = 116 bits (291), Expect(3) = 4e-49
Identities = 55/88 (62%), Positives = 68/88 (76%)
Frame = -1
Query: 1178 VSAFFATYAKLYPTMKPYEYGFRVFLLTYCYVIVSGYKTGEFMETAVSRFLLIALGASVG 999
++ F A+Y KLY +MKPYEY FRVF LTYC V+VSG + +F+ TA R LLI LGA++
Sbjct: 147 LAGFGASYLKLYASMKPYEYAFRVFKLTYCIVLVSGNNSRDFLSTAYYRILLIGLGATIC 206
Query: 998 LIVNTCIYPIWAGEDLHNLVAKNFVNVA 915
L+VN ++PIWAGEDLH LVAKNF NVA
Sbjct: 207 LLVNVFLFPIWAGEDLHKLVAKNFKNVA 234
Score = 97.8 bits (242), Expect(3) = 4e-49
Identities = 45/80 (56%), Positives = 58/80 (72%)
Frame = -2
Query: 811 GCVNGYLECVAYDTIPSRILVYEAVAEDPVYSGYRSAVQYLFLVVGVIVKMSFASWEPPH 632
GCVNGYL+CV Y+ IPS+IL Y+A ++DP+YSGYRSAVQ ++ FA WEPPH
Sbjct: 239 GCVNGYLQCVEYERIPSKILTYQA-SDDPLYSGYRSAVQSTSQEDSLL---DFAIWEPPH 294
Query: 631 GPYKSFRYPWALYVKVGWCI 572
GPYK+F +PW YVK+ +
Sbjct: 295 GPYKTFNHPWKNYVKLSGAV 314
Score = 71.2 bits (173), Expect = 6e-11
Identities = 57/164 (34%), Positives = 86/164 (51%), Gaps = 5/164 (3%)
Frame = -3
Query: 477 NLQILREI*DKSMQSNCRLLVNAKNW----EIGNRPRVRDLTDEQKISNLDSDLSRILAH 310
++Q E + S LLVN+++W E R+ E K D ++++
Sbjct: 380 DVQRAAEALQMKIDSKSYLLVNSESWAAIKEQAEAEEARENDQEAK-----DDETKVIKS 434
Query: 309 KSQSEATLRPPKNWDDVTTAANLSSATMLPYLQSRTMIHK-QPSWPSRISITPGSMLQPP 133
SQ WD + S+ ++ + +M+ K + WPS +S G+++
Sbjct: 435 LSQI---------WDTNNNNNHQSNDQSQHWMSTESMMLKNREMWPS-MSFIDGTVVNEI 484
Query: 132 LGEPGKMYESASNLSLATFASLLIEFVARLENLVNAYDELSVKA 1
K+YESAS+LSLATFASLLIEFVARL+N+VNA++ELS KA
Sbjct: 485 ---ECKVYESASSLSLATFASLLIEFVARLQNIVNAFEELSTKA 525
Score = 37.4 bits (85), Expect = 0.96
Identities = 17/33 (51%), Positives = 23/33 (69%)
Frame = -2
Query: 1561 WEMGTADPRKMIFSAKMGLALTLTSILIFFKIP 1463
+ +G +D RK+ FS KMG+AL L S +IF K P
Sbjct: 47 YALGHSDRRKLYFSIKMGIALALCSFVIFLKEP 79
Score = 25.8 bits (55), Expect(3) = 4e-49
Identities = 10/11 (90%), Positives = 11/11 (99%)
Frame = -1
Query: 545 ALHGCILSEIQ 513
A+HGCILSEIQ
Sbjct: 323 AMHGCILSEIQ 333
Database: BlastDB/NCBI/blast/db/2003_02_17_02_00_1/nr
Posted date: Feb 17, 2003 10:02 AM
Number of letters in database: 429,188,541
Number of sequences in database: 1,339,046
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
length of database: 429,188,541
effective HSP length: 131
effective length of database: 253,773,515
effective search space used: 152010335485
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)