BLASTX 1.4.13-Paracel [2002-12-12]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= At1g02420.f
(1476 letters)
Database: BlastDB/NCBI/blast/db/2003_02_27_02_00_4/nr
1,347,713 sequences; 431,508,457 total letters
Searching...................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
ref|NP_171744.1| hypothetical protein; protein id: At1g0242...   925   0.0
gb|AAG00894.1|AC064879_12 Hypothetical protein [Arabidopsis...   923   0.0
gb|AAN77307.1| Hypothetical protein [Oryza sativa (japonica...   484   e-135
ref|NP_191695.1| putative protein; protein id: At3g61360.1,...   204   2e-51
pir||A96735 hypothetical protein F23N20.5 [imported] - Arab...   202   8e-51
>ref|NP_171744.1| hypothetical protein; protein id: At1g02420.1 [Arabidopsis thaliana]
Length = 491
Score = 925 bits (2390), Expect = 0.0
Identities = 466/491 (94%), Positives = 466/491 (94%)
Frame = +1
Query: 1 MMMILKPLSSHHVSNFRLSVSFLHSVALSDAKVPVEEEGDDAETVFRMINGSNLQVELKE 180
MMMILKPLSSHHVSNFRLSVSFLHSVALSDAKVPVEEEGDDAETVFRMINGSNLQVELKE
Sbjct: 1 MMMILKPLSSHHVSNFRLSVSFLHSVALSDAKVPVEEEGDDAETVFRMINGSNLQVELKE 60
Query: 181 SLSSSGIHLSKDLIDRVLKRVRFSHGNPIQTLEFYRYASAIRGFYHSSFSLDTMLYILGR 360
SLSSSGIHLSKDLIDRVLKRVRFSHGNPIQTLEFYRYASAIRGFYHSSFSLDTMLYILGR
Sbjct: 61 SLSSSGIHLSKDLIDRVLKRVRFSHGNPIQTLEFYRYASAIRGFYHSSFSLDTMLYILGR 120
Query: 361 NRKFDQIWELLIETKRKDRSLISPRTMQVVLGRVAKLCSVRQTVESFWKFKRLVPDFFDT 540
NRKFDQIWELLIETKRKDRSLISPRTMQVVLGRVAKLCSVRQTVESFWKFKRLVPDFFDT
Sbjct: 121 NRKFDQIWELLIETKRKDRSLISPRTMQVVLGRVAKLCSVRQTVESFWKFKRLVPDFFDT 180
Query: 541 ACFNALLRTLCQEKSMTDARNVYHSLKHQFQPDLQTFNILLSGWKXXXXXXXXXXXMKGK 720
ACFNALLRTLCQEKSMTDARNVYHSLKHQFQPDLQTFNILLSGWK MKGK
Sbjct: 181 ACFNALLRTLCQEKSMTDARNVYHSLKHQFQPDLQTFNILLSGWKSSEEAEAFFEEMKGK 240
Query: 721 GLKPDVVTYNSLIDVYCKDREIEKAYKLIDKMREEEETPDVITYTTVIGGLGLIGQPDKA 900
GLKPDVVTYNSLIDVYCKDREIEKAYKLIDKMREEEETPDVITYTTVIGGLGLIGQPDKA
Sbjct: 241 GLKPDVVTYNSLIDVYCKDREIEKAYKLIDKMREEEETPDVITYTTVIGGLGLIGQPDKA 300
Query: 901 REVLKEMKEYGCYPDVAAYNAAIRNFCIARRLGDADKLVDEMVKKGLSPNATTYNLFFRV 1080
REVLKEMKEYGCYPDVAAYNAAIRNFCIARRLGDADKLVDEMVKKGLSPNATTYNLFFRV
Sbjct: 301 REVLKEMKEYGCYPDVAAYNAAIRNFCIARRLGDADKLVDEMVKKGLSPNATTYNLFFRV 360
Query: 1081 LSLANDLGRSWELYVRMLGNECLPNTQSCMFLIKMFKRHEKVDMAMRLWEDMVVKGFGSY 1260
LSLANDLGRSWELYVRMLGNECLPNTQSCMFLIKMFKRHEKVDMAMRLWEDMVVKGFGSY
Sbjct: 361 LSLANDLGRSWELYVRMLGNECLPNTQSCMFLIKMFKRHEKVDMAMRLWEDMVVKGFGSY 420
Query: 1261 XXXXXXXXXXXXXXAKVEEAEKCLLEMVEKGHRPSNVSFKRIKLLMELANKHDEVNNLIQ 1440
AKVEEAEKCLLEMVEKGHRPSNVSFKRIKLLMELANKHDEVNNLIQ
Sbjct: 421 SLVSDVLLDLLCDLAKVEEAEKCLLEMVEKGHRPSNVSFKRIKLLMELANKHDEVNNLIQ 480
Query: 1441 KMAIFSTEIPR 1473
KMAIFSTEIPR
Sbjct: 481 KMAIFSTEIPR 491
>gb|AAG00894.1|AC064879_12 Hypothetical protein [Arabidopsis thaliana]|gi|25511622|pir||F86154
T6A9.11 protein - Arabidopsis thaliana
Length = 490
Score = 923 bits (2385), Expect = 0.0
Identities = 465/490 (94%), Positives = 465/490 (94%)
Frame = +1
Query: 4 MMILKPLSSHHVSNFRLSVSFLHSVALSDAKVPVEEEGDDAETVFRMINGSNLQVELKES 183
MMILKPLSSHHVSNFRLSVSFLHSVALSDAKVPVEEEGDDAETVFRMINGSNLQVELKES
Sbjct: 1 MMILKPLSSHHVSNFRLSVSFLHSVALSDAKVPVEEEGDDAETVFRMINGSNLQVELKES 60
Query: 184 LSSSGIHLSKDLIDRVLKRVRFSHGNPIQTLEFYRYASAIRGFYHSSFSLDTMLYILGRN 363
LSSSGIHLSKDLIDRVLKRVRFSHGNPIQTLEFYRYASAIRGFYHSSFSLDTMLYILGRN
Sbjct: 61 LSSSGIHLSKDLIDRVLKRVRFSHGNPIQTLEFYRYASAIRGFYHSSFSLDTMLYILGRN 120
Query: 364 RKFDQIWELLIETKRKDRSLISPRTMQVVLGRVAKLCSVRQTVESFWKFKRLVPDFFDTA 543
RKFDQIWELLIETKRKDRSLISPRTMQVVLGRVAKLCSVRQTVESFWKFKRLVPDFFDTA
Sbjct: 121 RKFDQIWELLIETKRKDRSLISPRTMQVVLGRVAKLCSVRQTVESFWKFKRLVPDFFDTA 180
Query: 544 CFNALLRTLCQEKSMTDARNVYHSLKHQFQPDLQTFNILLSGWKXXXXXXXXXXXMKGKG 723
CFNALLRTLCQEKSMTDARNVYHSLKHQFQPDLQTFNILLSGWK MKGKG
Sbjct: 181 CFNALLRTLCQEKSMTDARNVYHSLKHQFQPDLQTFNILLSGWKSSEEAEAFFEEMKGKG 240
Query: 724 LKPDVVTYNSLIDVYCKDREIEKAYKLIDKMREEEETPDVITYTTVIGGLGLIGQPDKAR 903
LKPDVVTYNSLIDVYCKDREIEKAYKLIDKMREEEETPDVITYTTVIGGLGLIGQPDKAR
Sbjct: 241 LKPDVVTYNSLIDVYCKDREIEKAYKLIDKMREEEETPDVITYTTVIGGLGLIGQPDKAR 300
Query: 904 EVLKEMKEYGCYPDVAAYNAAIRNFCIARRLGDADKLVDEMVKKGLSPNATTYNLFFRVL 1083
EVLKEMKEYGCYPDVAAYNAAIRNFCIARRLGDADKLVDEMVKKGLSPNATTYNLFFRVL
Sbjct: 301 EVLKEMKEYGCYPDVAAYNAAIRNFCIARRLGDADKLVDEMVKKGLSPNATTYNLFFRVL 360
Query: 1084 SLANDLGRSWELYVRMLGNECLPNTQSCMFLIKMFKRHEKVDMAMRLWEDMVVKGFGSYX 1263
SLANDLGRSWELYVRMLGNECLPNTQSCMFLIKMFKRHEKVDMAMRLWEDMVVKGFGSY
Sbjct: 361 SLANDLGRSWELYVRMLGNECLPNTQSCMFLIKMFKRHEKVDMAMRLWEDMVVKGFGSYS 420
Query: 1264 XXXXXXXXXXXXXAKVEEAEKCLLEMVEKGHRPSNVSFKRIKLLMELANKHDEVNNLIQK 1443
AKVEEAEKCLLEMVEKGHRPSNVSFKRIKLLMELANKHDEVNNLIQK
Sbjct: 421 LVSDVLLDLLCDLAKVEEAEKCLLEMVEKGHRPSNVSFKRIKLLMELANKHDEVNNLIQK 480
Query: 1444 MAIFSTEIPR 1473
MAIFSTEIPR
Sbjct: 481 MAIFSTEIPR 490
>gb|AAN77307.1| Hypothetical protein [Oryza sativa (japonica cultivar-group)]
Length = 531
Score = 484 bits (1245), Expect = e-135
Identities = 238/453 (52%), Positives = 319/453 (69%), Gaps = 3/453 (0%)
Frame = +1
Query: 121 DAETVFRMINGSNLQVELKESLSSSGIHLSKDLIDRVLKRVRFSHGNPIQTLEFYRYASA 300
DA+ V+RM+ + ++ +LS+SG+ +S L+D VL+R RF+HG+P++ L A
Sbjct: 43 DADAVYRMVTAAPTPSAMESALSASGVAISAPLLDLVLRRFRFAHGDPLRALSLLSLALD 102
Query: 301 IRGFYHSSFSLDTMLYILGRNRKFDQIWELLIETKRKDRSLISPRTMQVVLGRVAKLCSV 480
G S F+LDT LY+LGR R+F +W+LL ++R ++PRT VVLGRVAK+CSV
Sbjct: 103 RHGVAPSPFALDTALYVLGRARRFAHMWDLLRSSRRLVPDAVTPRTAMVVLGRVAKVCSV 162
Query: 481 RQTVESFWKFKRLVP---DFFDTACFNALLRTLCQEKSMTDARNVYHSLKHQFQPDLQTF 651
R+TV+SF + R++ D + FNALLRTLCQEKSM+DARNVYH+LK++F+ + QTF
Sbjct: 163 RETVDSFRRLSRMLRGRGDDQEGQLFNALLRTLCQEKSMSDARNVYHALKYEFKVNRQTF 222
Query: 652 NILLSGWKXXXXXXXXXXXMKGKGLKPDVVTYNSLIDVYCKDREIEKAYKLIDKMREEEE 831
NILLSGWK M+ G++PD+VTYNSLID +CK+R +E AYKL+D+MRE++
Sbjct: 223 NILLSGWKSAEDAEAFVAEMRELGVEPDLVTYNSLIDCHCKNRGVENAYKLLDEMREKDI 282
Query: 832 TPDVITYTTVIGGLGLIGQPDKAREVLKEMKEYGCYPDVAAYNAAIRNFCIARRLGDADK 1011
+PDVITYT++IGGLGLIGQPDKA+ +LKEM E GCYPDV AYN AIRNF IA+RLGDA
Sbjct: 283 SPDVITYTSLIGGLGLIGQPDKAKHLLKEMHELGCYPDVPAYNTAIRNFVIAKRLGDAFA 342
Query: 1012 LVDEMVKKGLSPNATTYNLFFRVLSLANDLGRSWELYVRMLGNECLPNTQSCMFLIKMFK 1191
L++EM KGL PNATTYNLFFR A D+G +W+LY RM C PNTQSCMF++++
Sbjct: 343 LMEEMASKGLMPNATTYNLFFRCYYWAYDIGSAWQLYERMRSEGCFPNTQSCMFIVRLCH 402
Query: 1192 RHEKVDMAMRLWEDMVVKGFGSYXXXXXXXXXXXXXXAKVEEAEKCLLEMVEKGHRPSNV 1371
RH +V A+ LW DMV GFGS+ K++EAE+C +M+E G +PSNV
Sbjct: 403 RHGRVAQALELWSDMVNNGFGSFTLVSDVLFDLLCDEGKLDEAERCFHQMIELGQKPSNV 462
Query: 1372 SFKRIKLLMELANKHDEVNNLIQKMAIFSTEIP 1470
+F+RIK+LM+LAN+ + + L +MA F P
Sbjct: 463 AFRRIKILMQLANREESIARLTAQMAQFGRLAP 495
>ref|NP_191695.1| putative protein; protein id: At3g61360.1, supported by cDNA:
gi_16604614 [Arabidopsis
thaliana]|gi|11358118|pir||T47928 hypothetical protein
T20K12.260 - Arabidopsis
thaliana|gi|6850903|emb|CAB71066.1| putative protein
[Arabidopsis thaliana]|gi|25054868|gb|AAN71923.1| unknown
protein [Arabidopsis thaliana]
Length = 498
Score = 204 bits (520), Expect = 2e-51
Identities = 127/426 (29%), Positives = 219/426 (50%), Gaps = 9/426 (2%)
Frame = +1
Query: 205 LSKDLIDRVLKRVRFSHGNPIQTLEFYRYASAIRGFYHSSFSLDTMLYILGRNRKFDQIW 384
LS + + VL R+ +H N ++ LEF++Y+ +S S + L+IL R R FDQ W
Sbjct: 64 LSPEFVSEVLGRLFAAHSNGLKALEFFKYSLKSSKSSPTSDSFEKTLHILARMRYFDQAW 123
Query: 385 ELLIETKRKDRSLISPRTMQVVLGRVAKLCSVRQTVESFWKF-KRLVPDFFDTACFNALL 561
L+ E ++ +L+S ++M ++L ++AK S +T+E+F K K + F FN LL
Sbjct: 124 ALMAEVRKDYPNLLSFKSMSILLCKIAKFGSYEETLEAFVKMEKEIFRKKFGVDEFNILL 183
Query: 562 RTLCQEKSMTDARNVYHSLKHQFQPDLQTFNILLSGWKXXXXXXXXXXX---MKGKGLKP 732
R C E+ M +AR+++ L +F PD++T NILL G+K M +G KP
Sbjct: 184 RAFCTEREMKEARSIFEKLHSRFNPDVKTMNILLLGFKEAGDVTATELFYHEMVKRGFKP 243
Query: 733 DVVTYNSLIDVYCKDREIEKAYKLIDKMREEEETPDVITYTTVIGGLGLIGQPDKAREVL 912
+ VTY ID +CK R +A +L + M + V TT+I G G+ KAR++
Sbjct: 244 NSVTYGIRIDGFCKKRNFGEALRLFEDMDRLDFDITVQILTTLIHGSGVARNKIKARQLF 303
Query: 913 KEMKEYGCYPDVAAYNAAIRNFCIARRLGDADKLVDEMVKKGLSPNATTYNLFFRVLSLA 1092
E+ + G PD AYNA + + + A K++ EM +KG+ P++ T++ F + +
Sbjct: 304 DEISKRGLTPDCGAYNALMSSLMKCGDVSGAIKVMKEMEEKGIEPDSVTFHSMFIGMMKS 363
Query: 1093 NDLGRS--WELYVRMLGNECLPNTQSCMFLIKMFKRHEKVDMAMRLWEDMVVKGFGSYXX 1266
+ G + E Y +M +P T + + L+K+F + +V++ + LW+ M+ KG+ +
Sbjct: 364 KEFGFNGVCEYYQKMKERSLVPKTPTIVMLMKLFCHNGEVNLGLDLWKYMLEKGYCPHGH 423
Query: 1267 XXXXXXXXXXXXAKVEEAEKCLLEMVEKGHRPSNVSFKRIKLLMELAN---KHDEVNNLI 1437
+ +A +C + VE+G S ++ ++ + N K +E+ I
Sbjct: 424 ALELLTTALCARRRANDAFECSWQTVERGRCVSEPVYRMLETSLSSNNELKKLEELKEEI 483
Query: 1438 QKMAIF 1455
QK+ F
Sbjct: 484 QKLHSF 489
>pir||A96735 hypothetical protein F23N20.5 [imported] - Arabidopsis
thaliana|gi|12323433|gb|AAG51696.1|AC016972_15
hypothetical protein; 31939-30407 [Arabidopsis
thaliana]|gi|15223927|ref|NP_177262.1| hypothetical
protein; protein id: At1g71060.1 [Arabidopsis thaliana]
Length = 510
Score = 202 bits (515), Expect = 8e-51
Identities = 132/476 (27%), Positives = 246/476 (50%), Gaps = 4/476 (0%)
Frame = +1
Query: 31 HHVSNFRLSVSFLHSVALSDAKVPVEEEGDDAETVFRMINGSNLQVELKESLSSSGIHLS 210
H SNF L SF H+ ++ + +V + DAE + +++ +++ L+ + + LS
Sbjct: 37 HKASNFTLYGSF-HASSV-ETQVSANDASQDAERICKILTKFT-DSKVETLLNEASVKLS 93
Query: 211 KDLIDRVLKRVRFSHGNPIQTLEFYRYASAIRGFYHSSFSLDTMLYILGRNRKFDQIWEL 390
LI+ VLK++ + + L +++A +GF H++ + + ++ LG+ ++F IW L
Sbjct: 94 PALIEEVLKKLSNAG---VLALSVFKWAENQKGFKHTTSNYNALIESLGKIKQFKLIWSL 150
Query: 391 LIETKRKDRSLISPRTMQVVLGRVAKLCSVRQTVESFWKFKRLVPDFFDTACFNALLRTL 570
+ + K K L+S T ++ R A+ V++ + +F K + +++ FN +L TL
Sbjct: 151 VDDMKAK--KLLSKETFALISRRYARARKVKEAIGAFHKMEEFGFKM-ESSDFNRMLDTL 207
Query: 571 CQEKSMTDARNVYHSLKHQ-FQPDLQTFNILLSGWKXXXXXXXXXXX---MKGKGLKPDV 738
+ +++ DA+ V+ +K + F+PD++++ ILL GW MK +G +PDV
Sbjct: 208 SKSRNVGDAQKVFDKMKKKRFEPDIKSYTILLEGWGQELNLLRVDEVNREMKDEGFEPDV 267
Query: 739 VTYNSLIDVYCKDREIEKAYKLIDKMREEEETPDVITYTTVIGGLGLIGQPDKAREVLKE 918
V Y +I+ +CK ++ E+A + ++M + P + ++I GLG + + A E +
Sbjct: 268 VAYGIIINAHCKAKKYEEAIRFFNEMEQRNCKPSPHIFCSLINGLGSEKKLNDALEFFER 327
Query: 919 MKEYGCYPDVAAYNAAIRNFCIARRLGDADKLVDEMVKKGLSPNATTYNLFFRVLSLAND 1098
K G + YNA + +C ++R+ DA K VDEM KG+ PNA TY++ L
Sbjct: 328 SKSSGFPLEAPTYNALVGAYCWSQRMEDAYKTVDEMRLKGVGPNARTYDIILHHLIRMQR 387
Query: 1099 LGRSWELYVRMLGNECLPNTQSCMFLIKMFKRHEKVDMAMRLWEDMVVKGFGSYXXXXXX 1278
++E+Y M C P + +++MF E++DMA+++W++M KG
Sbjct: 388 SKEAYEVYQTM---SCEPTVSTYEIMVRMFCNKERLDMAIKIWDEMKGKGVLPGMHMFSS 444
Query: 1279 XXXXXXXXAKVEEAEKCLLEMVEKGHRPSNVSFKRIKLLMELANKHDEVNNLIQKM 1446
K++EA + EM++ G RP F R+K + + D+V +L+ KM
Sbjct: 445 LITALCHENKLDEACEYFNEMLDVGIRPPGHMFSRLKQTLLDEGRKDKVTDLVVKM 500
Database: BlastDB/NCBI/blast/db/2003_02_27_02_00_4/nr
Posted date: Feb 27, 2003 4:02 PM
Number of letters in database: 431,508,457
Number of sequences in database: 1,347,713
Lambda     K      H
   0.318    0.135    0.401
Gapped
Lambda     K      H
   0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
length of database: 431,508,457
effective HSP length: 127
effective length of database: 260,348,906
effective search space used: 94767001784
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)