Nanligomycin biosynthetic gene cluster

Patent title: Nanligomycin biosynthetic gene cluster
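Technical field
The present invention relates to an antibiotic biosynthetic gene cluster, in particular the biosynthetic gene cluster of nanligomycin, a 26-membered macrolide antibiotic, and belongs to the field of gene technology.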
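Background art
Streptomyces and closely related actinomycetes are of great practical value because they produce a large number of natural antibiotics. Polyketides are one of the largest classes of these natural products and, owing to their considerable pharmaceutical value, are widely used in human medicine, veterinary medicine, and agriculture. Examples include the antibacterial antibiotic erythromycin, the antifungal antibiotic amphotericin B, the antiparasitic antibiotic avermectin, the tumor inhibitor rapamycin, and the antitumor antibiotic daunorubicin.
Polyketides are formed by polyketide synthases (PKSs). A modular PKS assembles simple carboxylic acid units into a polyketide through successive condensation reactions, in a manner similar to fatty acid biosynthesis; see David A. Hopwood and David H. Sherman, Molecular genetics of polyketides and its comparison to fatty acid biosynthesis, Annual Review of Genetics, 1990, 24: 37-66. Each module is responsible for only one condensation step during polyketide chain assembly and contains at least a β-ketoacyl synthase (KS) domain, an acyltransferase (AT) domain, and an acyl carrier protein (ACP) domain. A module may also contain a β-ketoacyl reductase (KR) domain, a dehydratase (DH) domain, and an enoyl reductase (ER) domain, which determine the reduction steps applied to the incorporated extender unit. In addition, a thioesterase (TE) domain is required to catalyze cyclization and release of the polyketide chain. Finally, the product undergoes modification steps such as hydroxylation, glycosylation, methylation, and acylation, which are crucial to the biological activity of most end products. The modular organization of PKSs is to some extent plastic, so that new polyketide derivatives can be obtained through genetic engineering, for example by changing the number of modules, altering the specificity of extender modules, or inserting or inactivating domains.
Nanligomycin is a 26-membered macrolide antibiotic produced by Streptomyces nanchangensis. It is a specific inhibitor of mitochondrial ATPase, has strong antifungal activity, and is also a potential immunosuppressant and antitumor agent. Its structure resembles that of oligomycin, another 26-membered macrolide antibiotic, but differs completely at C-14, C14-C15, and C-26, making it an entirely new structure. The composition and organization of its genes also differ markedly, so the cluster is an entirely new gene cluster.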

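Summary of the invention
The object of the present invention is to provide the nucleotide sequence, or its complementary sequence, of the nanligomycin biosynthetic gene cluster comprising 13 genes (Sequence 1), so that it can be used to create and develop compounds or proteins for medicine, industry, and agriculture. Seven of the genes encode a polyketide synthase (PKS) that comprises 17 modules with 79 domains in total and catalyzes biosynthesis of the nanligomycin polyketide aglycone. Two genes, nlmB and nlmOI, encode proteins involved in modification during nanligomycin biosynthesis and catalyze oxidation of the polyketide chain. Two genes, nlmTI and nlmTII, encode proteins involved in transposition of the nanligomycin biosynthetic gene cluster. One regulatory gene, nlmRI, is involved in regulating nanligomycin biosynthesis. One butyryl-CoA reductase gene, ccrA, is involved in biosynthesis of a nanligomycin precursor. These nucleotide sequences are located in Sequence 1 at the following positions: nlmA1 (5469-23915), nlmA2 (23938-38337), nlmA3 (38626-50133), nlmA4 (93935-82242), nlmA5 (82170-76564), nlmA6 (50289-64196), nlmA7 (64243-75024), nlmB (76408-75191), nlmOI (94138-94641), nlmTI (255-1208), nlmTII (98193-96412), nlmRI (2168-4990), and ccrA (94705-96048).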
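As an illustrative cross-check that is not part of the original disclosure, the gene positions listed above can be tabulated to infer each gene's strand and length. The sketch assumes that positions are 1-based and inclusive, and that a start position larger than the end position means the gene lies on the complementary strand; the names `GENES` and `describe` are hypothetical.

```python
# Gene positions in Sequence 1, copied from the text as (start, end).
GENES = {
    "nlmA1": (5469, 23915),
    "nlmA2": (23938, 38337),
    "nlmA3": (38626, 50133),
    "nlmA4": (93935, 82242),
    "nlmA5": (82170, 76564),
    "nlmA6": (50289, 64196),
    "nlmA7": (64243, 75024),
    "nlmB": (76408, 75191),
    "nlmOI": (94138, 94641),
    "nlmTI": (255, 1208),
    "nlmTII": (98193, 96412),
    "nlmRI": (2168, 4990),
    "ccrA": (94705, 96048),
}


def describe(start: int, end: int) -> tuple[str, int, int]:
    """Return (strand, length in bases, number of codons).

    Assumptions (not stated in the source): positions are 1-based and
    inclusive, and start > end marks a gene on the complementary strand.
    """
    strand = "+" if start < end else "-"
    length = abs(end - start) + 1
    return strand, length, length // 3


for name, (start, end) in GENES.items():
    strand, length, codons = describe(start, end)
    print(f"{name:7s} strand {strand}  {length:6d} bp  {codons:5d} codons")
```

Under these assumptions every listed gene has a length that is a multiple of three, which is consistent with complete open reading frames.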
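The invention also provides a route to new polyketide synthases by constructing recombinant vectors that combine at least one fragment of the polyketide synthase sequences in Sequence 1 with sequences from other polyketide synthase gene clusters.
The invention also provides a route to increasing nanligomycin yield in genetically engineered microorganisms.
The invention also provides a route to obtaining recombinant DNA vectors that contain at least part of the DNA sequence of Sequence 1.
The invention also provides a route to producing microorganisms in which nanligomycin biosynthetic genes are disrupted or duplicated, at least one of these genes comprising a nucleotide sequence from Sequence 1.
The complementary sequence of Sequence 1 can be derived at any time from the rules of DNA base pairing. The nucleotide sequence of Sequence 1, or part of it, can be obtained by polymerase chain reaction (PCR), by digesting the corresponding DNA with suitable restriction endonucleases, or by other suitable techniques. Using the nucleotide sequences or partial sequences provided by the invention, genes similar to the nanligomycin biosynthetic genes can be obtained from other organisms by PCR or by Southern hybridization with DNA containing the sequences of the invention as a probe.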
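The base-pairing rule mentioned above can be sketched in a few lines. This is a minimal illustration rather than the procedure used in the source; the function name `reverse_complement` is hypothetical, and the sketch assumes the input contains only the letters A, C, G, and T (in either case), optionally interrupted by the spaces and position numbers found in a sequence listing.

```python
# Translation table implementing Watson-Crick pairing: A<->T, C<->G.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, read 5' to 3'.

    Spaces, line breaks, and digits are dropped first, so a block
    pasted from a sequence listing can be passed in directly.
    """
    letters = "".join(ch for ch in seq if ch.isalpha())
    return letters.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("atggaacggt gggtgcagac 20"))
```

Reversing the complemented string matters because the two strands of DNA run in opposite directions; genes whose start position is larger than their end position would be read from this reversed strand.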
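Cloned genes or DNA fragments containing the nucleotide sequences provided by the invention, or at least part of them, can be used to obtain new nanligomycin derivatives by disrupting one or more steps of nanligomycin biosynthesis. Such DNA fragments or genes can also be used to increase the yield of nanligomycin or its derivatives.
Cloned DNA containing the nucleotide sequences provided by the invention, or at least part of them, can be used to locate further library plasmids in a Streptomyces nanchangensis genomic library. These library plasmids contain at least part of the sequences of the invention as well as previously uncloned DNA from adjacent regions of the Streptomyces nanchangensis genome.
The nucleotide sequences provided by the invention can be modified or mutated. Suitable approaches include insertion or substitution, PCR, error-prone PCR, site-specific mutagenesis, religation of different sequences, and mutagenesis by ultraviolet light or chemical agents.
The nucleotide sequences provided by the invention can be subjected to directed evolution (DNA shuffling) using different parts of the sequence or homologous sequences from other sources.
New polyketides can be produced by deleting or inactivating one or more polyketide synthase domains, modules, or genes from the same or a different polyketide synthase system, or by adding one or more polyketide synthase domains, modules, or genes.
Cloned genes containing the sequences of the invention, or at least part of them, can be expressed in heterologous hosts by means of a suitable expression system to obtain modified polyketide synthases, higher biological activity, or higher yield. Such hosts include Streptomyces, Escherichia coli, Bacillus, yeast, plants, and animals.
Fragments, domains, modules, or genes containing the nucleotide sequences of the invention, or at least part of them, can be used to construct polyketide synthase libraries, libraries of polyketide synthase derivatives, or combinatorial libraries.
The nucleotide sequences of the nanligomycin modification genes provide a route to nanligomycin derivatives through deletion or alteration of these modification genes.
Genes or gene clusters containing the nucleotide sequences of the invention, or at least part of them, can be expressed in heterologous hosts, and their function in the host's metabolic pathways can be studied by DNA chip technology.
Polypeptides containing the amino acid sequences of the invention, or at least part of them, may retain biological activity or even acquire new biological activity after one or more amino acids are removed or replaced, or may show improved yield, optimized protein kinetics, or other desired properties.
New proteins or enzymes, and in turn new polyketides or related products, can be obtained by deleting or joining the amino acid sequences of the invention using suitable techniques.
The amino acid sequences provided by the invention can be used to isolate the desired proteins and to prepare antibodies.
The amino acid sequences provided by the invention make it possible to predict the three-dimensional structure of the polyketide synthase.
The invention has substantive features and represents notable progress. The genes, proteins, and antibodies provided by the invention can also be used to identify and develop compounds or proteins for medicine, industry, and agriculture.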


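Figure 1. Chemical structures of nanligomycin (A) and oligomycin (B).
Figure 2. Organization of the nanligomycin biosynthetic gene cluster. In Figure 2, arrows A1-A7 represent the polyketide synthase genes, arrow B the cytochrome P450 gene, arrow OI the oxidase gene, arrows TI and TII the transposase genes, arrow RI the transcriptional regulator gene, and arrow ccrA the butyryl-CoA reductase gene.
Figure 3. Model of nanligomycin polyketide biosynthesis. In Figure 3, boxes represent type I PKS protein subunits, lines represent type I PKS modules, and circles represent individual domains. KS denotes a β-ketoacyl synthase domain, ATa an acyltransferase domain that loads an acetate extender unit, ATb an acyltransferase domain that loads a butyrate extender unit, ATp an acyltransferase domain that loads a propionate extender unit, KR a β-ketoacyl reductase domain, DH a dehydratase domain, ER an enoyl reductase domain, ACP an acyl carrier protein domain, and TE a thioesterase domain.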
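Detailed description of the embodiments
A DNA fragment from the erythromycin biosynthetic gene cluster was used as a probe to screen a genomic library of total DNA from Streptomyces nanchangensis by hybridization, yielding positive cosmids that contain homologous sequences. Four of these positive cosmids, 8D1, 6C2, 8G1, and 16C4, were selected for nucleotide sequencing by the shotgun method. The DNA was sheared by sonication with a 550 Sonic Dismembrator (Fisher Scientific), and fragments of 1.6-2.0 kb were recovered from 0.7% low-melting-point agarose, purified with the Geneclean II reagent kit (Bio 101, Inc.), and cloned into the SmaI site of pUC18 (pretreated with CIAP) to construct a series of sequencing subclones. Plasmid DNA of the sequencing subclones was prepared with the Prep 96 Plasmid Kit (Qiagen). Sequencing was performed automatically on 377 DNA Sequencers (PE/ABD) using BigDye Terminator Cycle Sequencing Kits (Applied Biosystem Division, Perkin Elmer). The universal sequencing primers were 5' GTA AAA CGA CGG CCA GT 3' (forward) and 5' GCG GAT AAC AAT TTC ACA CAG G 3' (reverse). ORF analysis was carried out with the online Frame-Plot 2.3.2 software provided by the server of Japan's National Institute of Health at http://watson.nih.go.jp/~jun/cgi-bin/frameplot.pl. Sequence homology comparisons were carried out with the PSI-BLAST software provided by the online server of the U.S. National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/BLAST).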
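For readers who want to sanity-check the two universal sequencing primers quoted above, the following sketch computes their length, GC fraction, and a rough melting temperature. The Wallace rule used here (2 °C per A/T, 4 °C per G/C) is a textbook approximation introduced for illustration; the source does not state how, or whether, melting temperatures were estimated. The function names are hypothetical.

```python
# Universal sequencing primers as quoted in the text (spaces removed).
FORWARD = "GTAAAACGACGGCCAGT"
REVERSE = "GCGGATAACAATTTCACACAGG"


def gc_fraction(primer: str) -> float:
    """Fraction of bases that are G or C."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def wallace_tm(primer: str) -> int:
    """Rough melting temperature in degrees C by the Wallace rule.

    Assumption: this approximation is only meaningful for short
    oligonucleotides and is not taken from the source document.
    """
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc


for name, primer in (("forward", FORWARD), ("reverse", REVERSE)):
    print(f"{name}: {len(primer)} nt, GC {gc_fraction(primer):.2f}, "
          f"Tm about {wallace_tm(primer)} C")
```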
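The invention is described in further detail below with reference to Figures 1, 2, and 3. The complete nanligomycin biosynthetic gene cluster of the invention comprises 13 genes, namely:
(1) seven polyketide synthase genes: nlmA1, nlmA2, nlmA3, nlmA4, nlmA5, nlmA6, and nlmA7;
(2) two nanligomycin modification genes: nlmB and nlmOI;
(3) two nanligomycin transposase genes: nlmTI and nlmTII;
(4) one nanligomycin regulatory gene: nlmRI;
(5) one nanligomycin precursor synthesis gene: ccrA.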
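Polyketide synthase genes
The following are the nucleotide sequences, or complementary sequences, and the corresponding amino acid sequences of the seven type I polyketide synthase open reading frames, nlmA1, nlmA2, nlmA3, nlmA4, nlmA5, nlmA6, and nlmA7, which encode the enzymes required for biosynthesis of the polyketide aglycone of nanligomycin, a 26-membered macrolide antibiotic, in Streptomyces nanchangensis NS3226.
Also provided are the nucleotide sequences, or complementary sequences, and the corresponding amino acid sequences of the modules and domains of the seven type I polyketide synthase open reading frames, namely the ketosynthase, acyltransferase, ketoreductase, dehydratase, enoyl reductase, acyl carrier protein, and thioesterase domains.
Seven genes in Sequence 1 (nlmA1-nlmA7) encode the polyketide synthase, which comprises 17 modules with 79 domains in total and catalyzes biosynthesis of the nanligomycin polyketide aglycone (Figures 2 and 3).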
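The totals of 17 modules and 79 domains can be reproduced by tallying the module compositions that this description gives for NlmA1 to NlmA7. The sketch below does that bookkeeping; the data structure `MODULES` is a hypothetical encoding, and the backbone carbon count assumes that the starter unit and each extender unit contribute two carbons to the main chain, as the C1-C34 numbering in the description implies.

```python
from collections import Counter

# (protein, module, starter/extender unit, domains), transcribed from
# the per-protein descriptions of NlmA1 to NlmA7 in this document.
MODULES = [
    ("NlmA1", "loading", "acetate", ["KS", "AT", "ACP"]),
    ("NlmA1", "1", "acetate", ["KS", "AT", "KR", "ACP"]),
    ("NlmA1", "2", "propionate", ["KS", "AT", "KR", "ACP"]),
    ("NlmA1", "3", "acetate", ["KS", "AT", "DH", "ER", "KR", "ACP"]),
    ("NlmA2", "4", "acetate", ["KS", "AT", "KR", "ACP"]),
    ("NlmA2", "5", "propionate", ["KS", "AT", "KR", "ACP"]),
    ("NlmA2", "6", "acetate", ["KS", "AT", "DH", "KR", "ACP"]),
    ("NlmA3", "7", "butyrate", ["KS", "AT", "DH", "ER", "KR", "ACP"]),
    ("NlmA3", "8", "acetate", ["KS", "AT", "DH", "KR", "ACP"]),
    ("NlmA4", "9", "acetate", ["KS", "AT", "DH", "KR", "ACP"]),
    ("NlmA4", "10", "acetate", ["KS", "AT", "DH", "ER", "KR", "ACP"]),
    ("NlmA5", "11", "propionate", ["KS", "AT", "DH", "KR", "ACP"]),
    ("NlmA6", "12", "propionate", ["KS", "AT", "KR", "ACP"]),
    ("NlmA6", "13", "propionate", ["KS", "AT", "KR", "ACP"]),
    ("NlmA6", "14", "propionate", ["KS", "AT", "KR", "ACP"]),
    ("NlmA7", "15", "propionate", ["KS", "AT", "KR", "ACP"]),
    ("NlmA7", "16", "acetate", ["KS", "AT", "DH", "KR", "ACP", "TE"]),
]

n_modules = len(MODULES)
n_domains = sum(len(domains) for _, _, _, domains in MODULES)
units = Counter(unit for _, _, unit, _ in MODULES)
# Assumption: every unit adds two carbons to the backbone.
backbone_carbons = 2 * n_modules

print(n_modules, "modules,", n_domains, "domains")
print(dict(units), "backbone carbons:", backbone_carbons)
```

Encoding the modules as data rather than prose makes it easy to recheck the totals if a domain assignment is revised.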
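NlmA1 comprises four modules: the loading module and modules 1, 2, and 3. The loading module contains three domains (KS-L, AT-L, ACP-L); it initiates polyketide chain synthesis by incorporating an acetate starter unit, which ultimately forms the C33-C34 portion of the nanligomycin carbon backbone. Module 1 contains four domains (KS1, AT1, KR1, ACP1) and incorporates an acetate extender unit that ultimately forms C31-C32 of the backbone. Module 2 contains four domains (KS2, AT2, KR2, ACP2) and incorporates a propionate extender unit that ultimately forms C29-C30, with a methyl branch at C30. Module 3 contains six domains (KS3, AT3, DH3, ER3, KR3, ACP3) and incorporates an acetate extender unit that ultimately forms C27-C28. The amino acid positions of the domains are shown in Table 1.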
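Table 1. Domains of polyketide synthase NlmA1 and their amino acid positions
Module            Domain    Amino acid position in Sequence 2
Loading module    KS-L      19-428
Loading module    AT-L      534-827
Loading module    ACP-L     914-985
Module 1          KS1       1006-1418
Module 1          AT1       1524-1819
Module 1          KR1       2104-2283
Module 1          ACP1      2372-2453
Module 2          KS2       2478-2902
Module 2          AT2       3009-3312
Module 2          KR2       3629-3808
Module 2          ACP2      3896-3974
Module 3          KS3       4012-4418
Module 3          AT3       4524-4814
Module 3          DH3       4875-5063
Module 3          ER3       5385-5689
Module 3          KR3       5699-5879
Module 3          ACP3      5980-6063
NlmA2 comprises three modules: modules 4, 5, and 6. Module 4 contains four domains (KS4, AT4, KR4, ACP4) and incorporates an acetate extender unit that ultimately forms C25-C26 of the nanligomycin carbon backbone. Module 5 contains four domains (KS5, AT5, KR5, ACP5) and incorporates a propionate extender unit that ultimately forms C23-C24, with a methyl branch at C24. Module 6 contains five domains (KS6, AT6, DH6, KR6, ACP6) and incorporates an acetate extender unit that ultimately forms C21-C22. The amino acid positions of the domains are shown in Table 2.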
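Table 2. Domains of polyketide synthase NlmA2 and their amino acid positions
Module      Domain    Amino acid position in Sequence 3
Module 4    KS4       33-450
Module 4    AT4       564-866
Module 4    KR4       1141-1292
Module 4    ACP4      1379-1446
Module 5    KS5       1467-1887
Module 5    AT5       2002-2301
Module 5    KR5       2625-2804
Module 5    ACP5      2912-2979
Module 6    KS6       3003-3429
Module 6    AT6       3542-3839
Module 6    DH6       3898-4050
Module 6    KR6       4357-4534
Module 6    ACP6      4662-4711
NlmA3 comprises two modules: modules 7 and 8. Module 7 contains six domains (KS7, AT7, DH7, ER7, KR7, ACP7) and incorporates a butyrate extender unit that ultimately forms C19-C20 of the nanligomycin carbon backbone, with an ethyl branch at C20. Module 8 contains five domains (KS8, AT8, DH8, KR8, ACP8) and incorporates an acetate extender unit that ultimately forms C17-C18. The amino acid positions of the domains are shown in Table 3.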
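Table 3. Domains of polyketide synthase NlmA3 and their amino acid positions
Module      Domain    Amino acid position in Sequence 4
Module 7    KS7       166-366
Module 7    AT7       476-779
Module 7    DH7       831-1020
Module 7    ER7       1343-1634
Module 7    KR7       1641-1824
Module 7    ACP7      1928-1995
Module 8    KS8       2026-2445
Module 8    AT8       2548-2852
Module 8    DH8       2908-3082
Module 8    KR8       3396-3578
Module 8    ACP8      3681-3748
NlmA4 comprises two modules: modules 9 and 10. Module 9 contains five domains (KS9, AT9, DH9, KR9, ACP9) and incorporates an acetate extender unit that ultimately forms C15-C16 of the nanligomycin carbon backbone. Module 10 contains six domains (KS10, AT10, DH10, ER10, KR10, ACP10) and incorporates an acetate extender unit that ultimately forms C13-C14. The amino acid positions of the domains are shown in Table 4.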
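Table 4. Domains of polyketide synthase NlmA4 and their amino acid positions
Module       Domain    Amino acid position in Sequence 5
Module 9     KS9       33-459
Module 9     AT9       585-886
Module 9     DH9       941-1125
Module 9     KR9       1423-1602
Module 9     ACP9      1711-1771
Module 10    KS10      1797-2224
Module 10    AT10      2324-2615
Module 10    DH10      2670-2853
Module 10    ER10      3171-3460
Module 10    KR10      3468-3647
Module 10    ACP10     3749-3806
NlmA5 comprises one module, module 11. Module 11 contains five domains (KS11, AT11, DH11, KR11, ACP11) and incorporates a propionate extender unit that ultimately forms C11-C12 of the nanligomycin carbon backbone, with a methyl branch at C12. The amino acid positions of the domains are shown in Table 5.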
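Table 5. Domains of polyketide synthase NlmA5 and their amino acid positions
Module       Domain    Amino acid position in Sequence 6
Module 11    KS11      33-459
Module 11    AT11      569-871
Module 11    DH11      925-1110
Module 11    KR11      1430-1610
Module 11    ACP11     1719-1780
NlmA6 comprises three modules: modules 12, 13, and 14. Module 12 contains four domains (KS12, AT12, KR12, ACP12) and incorporates a propionate extender unit that ultimately forms C9-C10 of the nanligomycin carbon backbone, with a methyl branch at C10. Module 13 contains four domains (KS13, AT13, KR13, ACP13) and incorporates a propionate extender unit that ultimately forms C7-C8, with a methyl branch at C8. Module 14 contains four domains (KS14, AT14, KR14, ACP14) and incorporates a propionate extender unit that ultimately forms C5-C6, with a methyl branch at C6. The amino acid positions of the domains are shown in Table 6.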
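Table 6. Domains of polyketide synthase NlmA6 and their amino acid positions
Module       Domain    Amino acid position in Sequence 7
Module 12    KS12      30-444
Module 12    AT12      554-856
Module 12    KR12      1170-1351
Module 12    ACP12     1459-1524
Module 13    KS13      1547-1964
Module 13    AT13      2072-2374
Module 13    KR13      2688-2865
Module 13    ACP13     2974-3041
Module 14    KS14      3064-3477
Module 14    AT14      3592-3894
Module 14    KR14      4199-4378
Module 14    ACP14     4486-4547
NlmA7 comprises two modules: modules 15 and 16. Module 15 contains four domains (KS15, AT15, KR15, ACP15) and incorporates a propionate extender unit that ultimately forms C3-C4 of the nanligomycin carbon backbone, with a methyl branch at C4. Module 16 contains six domains (KS16, AT16, DH16, KR16, ACP16, TE) and incorporates an acetate extender unit that ultimately forms C1-C2; the TE domain then catalyzes cyclization of the polyketide chain and its release from the polyketide synthase. The amino acid positions of the domains are shown in Table 7.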
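Table 7. Domains of polyketide synthase NlmA7 and their amino acid positions
Module       Domain    Amino acid position in Sequence 8
Module 15    KS15      32-458
Module 15    AT15      572-874
Module 15    KR15      1177-1357
Module 15    ACP15     1470-1535
Module 16    KS16      1556-1980
Module 16    AT16      2076-2363
Module 16    DH16      2423-2610
Module 16    KR16      2925-3104
Module 16    ACP16     3213-3280
Module 16    TE        3371-3588
Nanligomycin modification genes
The following are the nucleotide sequences, or complementary sequences, and the corresponding amino acid sequences of the two open reading frames, nlmB and nlmOI, that encode proteins involved in oxidative modification of the nanligomycin polyketide chain.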
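Sequence 1 contains two genes responsible for oxidative modification of nanligomycin, nlmB and nlmOI (Figure 2), which encode a cytochrome P450 and an oxidase, respectively. During nanligomycin biosynthesis they catalyze oxidation at C12 and C28, respectively, forming a hydroxyl group and a keto group at the corresponding positions. Their nucleotide positions, amino acid sequences, and functions are shown in Table 12.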
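Table 12. Nucleotide positions, amino acid sequences, and functions of the nanligomycin modification genes
Gene      Position in Sequence 1 (bases)    Corresponding amino acid sequence    Function
nlmB      76408-75191                       Sequence 9                           Cytochrome P450
nlmOI     94138-94641                       Sequence 10                          Oxidase
Nanligomycin transposase genes
The following are the nucleotide sequences, or complementary sequences, and the corresponding amino acid sequences of the two open reading frames, nlmTI and nlmTII (Figure 2), that encode proteins involved in transposition of the nanligomycin biosynthetic gene cluster. Their nucleotide positions, amino acid sequences, and functions are shown in Table 13.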
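Table 13. Nucleotide positions, amino acid sequences, and functions of the nanligomycin transposase genes
Gene      Position in Sequence 1 (bases)    Corresponding amino acid sequence    Function
nlmTI     255-1208                          Sequence 11                          Transposase
nlmTII    98193-96412                       Sequence 12                          Transposase
Nanligomycin regulatory gene
The following is the nucleotide sequence, or complementary sequence, and the corresponding amino acid sequence of the one open reading frame, nlmRI (Figure 2), that encodes a protein involved in regulating nanligomycin biosynthesis. Its nucleotide position, amino acid sequence, and function are shown in Table 14.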
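Table 14. Nucleotide position, amino acid sequence, and function of the nanligomycin regulatory gene
Gene      Position in Sequence 1 (bases)    Corresponding amino acid sequence    Function
nlmRI     2168-4990                         Sequence 13                          Transcriptional regulator
Nanligomycin precursor synthesis gene
The following is the nucleotide sequence, or complementary sequence, and the corresponding amino acid sequence of the open reading frame ccrA (Figure 2), which encodes butyryl-CoA reductase. Its nucleotide position, amino acid sequence, and function are shown in Table 15.
Table 15. Nucleotide position, amino acid sequence, and function of the nanligomycin precursor synthesis gene
Gene      Position in Sequence 1 (bases)    Corresponding amino acid sequence    Function
ccrA      94705-96048                       Sequence 14                          Butyryl-CoA reductase
Sequence 1 is the nucleotide sequence, or complementary sequence, of the nanligomycin biosynthetic gene cluster comprising 13 genes. It is 104096 bases long and contains seven genes encoding the polyketide synthase (nlmA1-nlmA7), two genes involved in modification during nanligomycin biosynthesis (nlmB, nlmOI), two genes involved in transposition of the nanligomycin biosynthetic gene cluster (nlmTI, nlmTII), one nanligomycin regulatory gene (nlmRI), and one nanligomycin precursor synthesis gene (ccrA).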
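Sequence 2 is the amino acid sequence of the type I polyketide synthase (NlmA1) encoded by the nlmA1 gene (bases 5469-23915 of Sequence 1).
Sequence 3 is the amino acid sequence of the type I polyketide synthase (NlmA2) encoded by the nlmA2 gene (bases 23938-38337 of Sequence 1).
Sequence 4 is the amino acid sequence of the type I polyketide synthase (NlmA3) encoded by the nlmA3 gene (bases 38626-50133 of Sequence 1).
Sequence 5 is the amino acid sequence of the type I polyketide synthase (NlmA4) encoded by the nlmA4 gene (bases 93935-82242 of Sequence 1).
Sequence 6 is the amino acid sequence of the type I polyketide synthase (NlmA5) encoded by the nlmA5 gene (bases 82170-76564 of Sequence 1).
Sequence 7 is the amino acid sequence of the type I polyketide synthase (NlmA6) encoded by the nlmA6 gene (bases 50289-64196 of Sequence 1).
Sequence 8 is the amino acid sequence of the type I polyketide synthase (NlmA7) encoded by the nlmA7 gene (bases 64243-75024 of Sequence 1).
Sequence 9 is the amino acid sequence of the cytochrome P450 (NlmB) encoded by the nlmB gene (bases 76408-75191 of Sequence 1).
Sequence 10 is the amino acid sequence of the oxidase (NlmOI) encoded by the nlmOI gene (bases 94138-94641 of Sequence 1).
Sequence 11 is the amino acid sequence of the transposase (NlmTI) encoded by the nlmTI gene (bases 255-1208 of Sequence 1).
Sequence 12 is the amino acid sequence of the transposase (NlmTII) encoded by the nlmTII gene (bases 98193-96412 of Sequence 1).
Sequence 13 is the amino acid sequence of the transcriptional regulator (NlmRI) encoded by the nlmRI gene (bases 2168-4990 of Sequence 1).
Sequence 14 is the amino acid sequence of the butyryl-CoA reductase (CcrA) encoded by the ccrA gene (bases 94705-96048 of Sequence 1).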
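Domain positions in this description are given in amino acids of each protein, whereas gene positions are given in bases of Sequence 1. The sketch below converts one to the other. It rests on assumptions the source does not spell out: that amino acid 1 of each protein corresponds to the first codon at the gene's start position, that positions are 1-based and inclusive, and that a start larger than the end marks a gene on the complementary strand. The function name `domain_to_bases` is hypothetical.

```python
def domain_to_bases(gene_start: int, gene_end: int,
                    aa_first: int, aa_last: int) -> tuple[int, int]:
    """Map an amino acid range to base positions in Sequence 1.

    The result follows the document's convention: for genes on the
    complementary strand the first number is larger than the second.
    """
    if gene_start < gene_end:
        # Forward strand: codon k covers bases start+3(k-1) .. start+3k-1.
        first = gene_start + 3 * (aa_first - 1)
        last = gene_start + 3 * aa_last - 1
    else:
        # Complementary strand: positions decrease along the gene.
        first = gene_start - 3 * (aa_first - 1)
        last = gene_start - 3 * aa_last + 1
    return first, last


# KS-L domain of NlmA1 (amino acids 19-428; gene at 5469-23915).
print(domain_to_bases(5469, 23915, 19, 428))
# KS9 domain of NlmA4 (amino acids 33-459; gene at 93935-82242).
print(domain_to_bases(93935, 82242, 33, 459))
```

A mapping of this kind is what one would need to design primers or restriction strategies targeting a single domain, for example when planning the domain replacements or inactivations described earlier.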
The gene sequences provided according to the invention are listed below.
SEQUENCE LISTING<110>上海交通大学<120>南寡霉素生物合成基因簇<160>14<170>PatentIn version 3.1<210>1<211>104096<212>DNA<213>南昌链霉菌NS3226(Streptomyces nlmchangensis n.sp.NS3226)<400>1atggaacggt gggtgcagac ctgcagacgg gaactcttgg accgcaccct gatctggaac 60caccggcacc tgctccacgc cctgcgcgag ttcgagcagt tctacaacgc acaccggccg 120caccagggca tcgcgaacgc cagaccgctg cacgccttac ccaggccgat cgacgatcct 180gagcagatca gccgtctcga catacgacgc cgcgatcgac tcggcgggat cctccacgag 240taccaacatg ccgcatgacc agcacggatg acattctcgg caagggcacg accatcatct 300cgcgacgatc gacagccgcg cgagagcacg ggggcgagcg ccttccaacg agactcccta 360caccctcgca caccacctcc tccagggcgg acgggttttc ggcaggcgca acgctgctga 420cctggcaccg tcgtctggtg cgcgcgagag aacccgacct gggggtacgt caggttccag 480ggcgagctgc gacggcttgg ccatcgggtt gccgccgcac tatccgccgc gctctgcgcc 540gctccggttt accgcccgca ccgcagcgcg cctcccagca gacgtggcgt tccttcctgc 600gctcccaggc ccatacgctg ctcgcctgcg acttcatgcg tgtggagacc gtcttcctca 660aacgtctcta cgtcttcttc gtcatggaga tcaagactcg gcgcgtccat gtcctgggcg 720tcaccgttcg ccctacgggc gcatgggtca cccagttcgc ccgcaacctg ctcaaggatc 780tcgaggagag ggctgggtgc ttccggttcc tcatccgtga ccgggacagc aagttcaccg 840ccgcgttcga cgccgtcttc gccgacaatg gcacagccgt catcccgacc ccgccgcaaa 900gccctcggtc caacgcgttc gccgaacgat ggatacgcac agcccgcgct gaatgcaccg 960accgaatcct catcaccggc gaacgacacc tccgtgccgt ccttaccacg tacgccgagc1020actacaacac cggacgggcc caccgcagcc tcgacctacg cgccccagac gaccgcccga1080gcgtcatccc cctgcctgct gcggtagtcc gacggcgacg gctacttggc ggcctgctca1140acgagtacca caccacgcca ccccaacgac ttctccatcc acaagaaaca cccagctcag1200cggcctgatc gggatattga cacccttcac gcctcctgct ctcggatcga cggcaggtcg1260tgcatgaaaa ccgcctacct gtcaccgctg aaccgctgca tgatgcgctt ggtcagctcg1320cctggcggcc ctggctgccg gcggacacag ccgacggctc cgccagcctt tcccggaaca1380ggtcgggtga aggagccgcc accgtgatac tgccgggttc cggctccttc agtaattcaa1440tgccggggga acgcacgggt ccgtccgcgc accagcatcc tcttgaggtc atgaagtaac1500aggggggtgg aaaaaggggg cgtcttttgt gcggtcatgg agcgctcgtt cgggaatgca1560cgctaccagc tgattctatg agtatgctgc ggattctcta 
tgcttcctct cgcgcgacga1620ccgatgggga acaatggaga tatagagcgg aagaaggaat tccgcagcgg tccgcgttag1680cgtccccgta ttcctgcgga ggccggcgtg cggaccggat accgcggctt gcgcaccgat1740ttttcgaagg gaccctgatg aacagcactg tgaccgcctg gaaggacgcc atgtgccggg1800agggtctcgg ggcgggcccg gaccacccgg cgggcgtggc ggacctcatc gccgaccttc1860cgatcgagga gggttccgcg gccgcggcca ccctttccac gcagccgttc ttcgactgct1920gcggctgacc cgctcgacca ccccgcggtc gccgcccgca cgggcggcca ccctgccgcc1980gacccgttcc cgcgcggatc ggcgccggtc tgcccgtaag aaacgggccg gacggatcgt2040ggcgtattca cggaatgtga cgactggaaa ggggggtaag ggtcgtgtcc aggtatgacg2100ctggatggtc tggcatatac ctcaaccgtg acgagcgatt ccagattcgg gcagaggggg2160tgatgggatg aaactctcgg aaccctccta ttatccggag attgtcgaac gctccgaaga2220aatctcgttg cttgcccaag acctcgcaaa caccaagcgc ggcgaaggtg ccgtggtcgt2280catccattct gggcccggag tcggtcgtac ggcactgctc gatgaattcc tgcggcagtc2340tgggaattcc ggggcccggg tgtgcgccgc cacgggatcg gccgcggaga ccggcaacga2400gttgggcgtc gtcacccagc tgttcccgga agacgggccg atcgccgctg cggtctggct2460ggcccgggcg ctcgacgacc accacggcga cccgtccccg gatgccgacc ggctcttcga2520catgctgcgc ggggagttcc ggcagggccc gctggtgctg gcggtcgacg acgttcagct2580ggccgacgcg gcgtctctgc ggttcctgct gcacctcata cgccggctgc gcaccactcc2640cgtgctgatc gtcctcactg agcccgtcgg atcgtgcgcc ctcccgctcg ctttccaggc2700ggaacttctc cggcatcccc ggtgccgacg tcttcggctg cagccgctgt ccgtggacgg2760cgtcacccgg atgatagagc cctacgtggc cgagaccgag gtggcgcggc tggccaccca2820gttccatgcc gtcagcggtg gcaaccccgt cctggtgcgc gggctgctgg ctgatcaccg2880ggccggacaa cggctggaag agcagggcat cggcgcacaa tacaacggat acccggcctt2940cactcaagcc gcactggtct cggcttaccg ggacgacccc gtacttttcg aggtcgtttg3000cggcattgcc gtcctcggcg agaacgcgtc tcccgccctc gtggcctgcc tggtcgaccg3060gggagccgat gtggtggccc gtgtcatgac cgcactgaac acggcaagcc tgctcaatgg3120ccccgccttc cgtagcccac tcgttgcgaa ggccctgctg gagctcctgg atgtggagac3180tcgcggagag ctgcaccggc gcgcggccga gctgctccat gccgacgcgg cacttcctgc3240cgacgtcgcg caccatctgc tcgctacccc gatcgccgaa tcctgggtgc tgccgacctt3300gctcgccgcg gccgagcagg ctgtccaggg 
cggcgggcag gacttcagac tcgactgcct3360
ccggctggca ggccgacagg cggcaaccga ggaggaacgt gccgccgtcg tcgccgcccg3420ggtccggatc ggctgggaga tcgatccccg gctgatcacc ccatggctcg gcgaactcgg3480cgccgcactc cgccgaggac acgtcggcag ctcggacgct gcctggaccg tcaaacactt3540tgtatggcac gaccatgtcg aggaggccgc cgacatcctt tccgcactga tggagcgaac3600cgaggagaac agcgacgcgc acgccgaact cgagatcgtc cggcattggg tgcggtacac3660ctgccccact ttactcgagg gatcggtgga cgcagatgcg ccctccctgt ccggtccgtt3720cccgcagcgg ttccaactga gaccggcctc gtacgccgtc gagatgcttg gccggctttt3780caccgagggc ccctgcgatc aggcggcggc catggccgag gagatccttc gcggctgccg3840gttcggtgag accaccgtcg aagctgtcga aggagccctg ctggtcctcg tctatgccga3900acgtcccggc cgggcactgc actggtgtga ggcgctgctg gagcaggcag gagatcaccc3960caccggcaca gccgctgcga tcctgagcag tattcgcgcc gaaatcgccc ttcggcaagg4020cgcattggaa gaggccgaga cgtacgcgga ccgggccctc aacgccatct cacggctggg4080ctggggggtg gccatcggct cgcccctggc cgtccgggta cgggccgcga tggccgcggg4140tcgcaccggc ctggcagggg cctggctgaa tcaggacgta ccccagggga tgttccgcac4200ccgccacgga ctgctgtaca tgcacgcacg cggtcattac cacctggcca ccgaccgccc4260gactgtcgct ctggaggact tcctgacctg tggccggctg gccaaggagt ggggcatgga4320cgtgcccaca tttctgccgt ggcgcacctc agccgcgctg gcccacctgg ccctgggcaa4380cggcagccgg gccagtgcct tggcacggga gcagctgacc cggcccggcg gcggctggcc4440gaggtgccgg gcggtgtcgc tgcgggtgct cgccgccacg agcgaactcg accgccgccc4500tgctctgtta cgtgagtcgg tcaatctgct ggagagctgt ggcgatcacg tggagttgct4560gcattcgctg gccgaccagt tccaggcgct gtccgaagcg ggggcacccg cgaaggcacg4620gattgcggcc cggcatgcca gaaccgtcgc cgacaattgt ggcacggaga cgctctttcg4680caggctgttc aaggaggagg tgcccgagga caccgacgaa tcggccgact tcgggcagga4740ccaccagggg tttgccagcc tgaccgacgc ggagcggcgg gtcaccgccc tggccgccct4800cgggtactcc aaccgggaga tcggacgcaa gctcttcatc accaagagca ccgtcgagca4860gcacctcacc cgggtctacc ggaagctcgg ggtacgcaac cgggccgacc tcggcgacct4920gctcgccggg atcaacctcg cagcccagcc ccaggtgatg ggcaggacgt cctcggccgc4980cgtcggctga ggacgcaccc cgcggccgac cgtctcgccg ctacaaagag ctgtgcgcgg5040tgctggggga ccggcgtttc agctcggacg ccaggccaca ctcgcactgc 
cgcgcaaccc5100ggagcaactc gccctgctgc acgaggatcg tccctgatca aggtgccatc gaggaactgc5160tgcgctacat cacgatcgtg cagaacggcg tggaacgcgt cgccacggca gcggtatgag5220cagcgatgcg ggggcacccg gcgcggggtg tccccgcacg ctgtcccggc ccctgccccc5280aggacaccga ctctgcccgt gacacggaaa gcgcaagtca gcaggggcgg aggccttacg5340gcggtcccag ctaaggggtc gcctagtggt tgaggctagg ggccgcccgc tcggatattc5400ggtgtgacct gcggccgggg tgccgcattg aggcgcgctt caggttccta gcgacgtaag5460ggaaacgcat ggcgggtgga tccgagtcag aggccgctga gttcacggcg cgatccgccc5520agccgatcgc ggtggtggga atggcgtgcc ggctgcccgg tgcggcggga ccggcggaat5580ttcgcgccat tctccgcagc ggtacggaag ctgtcggcgc cgccgccccg gatcgtccgt5640acgccccgcc gcggggtggc ttcctggact cggtggaccg tttcgacgcc ggattcttcg5700gtgtctcgcc gcgcgaggcc gcggtcatgg acccgcagca gcggctgatg ctggaactgt5760gctgggaagc actcgaagac tcaggcatcg tgcccgcccg cctcgacggt agcgacgccg5820gtgtcttcgt gggcgccatc accgacgact acgccgtact gtcccgggcc gccggcgtgg5880acgccgccac cccggagacg agcaccggcc tcaaccgggg catgatcgcc aaccgggtct5940cctacaggct gggcctgcgg ggtccgagct ttacggtcga ctcgggacag tcgtcgtccc6000tggtcgccgt gcacctggcc accgagagcc tgcgccgggg cgagtgctcc ctggctctgg6060ccggcggggt gaacctgatc ctcgcagagg acagcacggc cgccgtcgaa cgcttcgggg6120cgctctcccc ggacggccgc tgctacacct tcgacgctcg cgccaacggc tatgtgcgcg6180gcgagggcgg cggtgtcgtc gtcctcaagc ggctcactga cgcggtcgcg gacggcgacg6240acatcctgtg cgtgctcgcg ggcagcgcgg tgaacaacga cggtggcggc gaaggcctga6300ccgtacccga ccgccagggt caggaggccg tgctcaccgc cgcgtacgag caggcgggga6360tctccccgaa cgccgtcgga tacgtggaac tgcacggcac gggaacccct gccggtgacc6420ccgtggaggc cgcggccgtc ggtgccgtgc tcggcgcggg ccgcagtgcg gaacagccgc6480tgctggtcgg ttcggtgaag accaacatcg gccacctcga aggcgccgcc ggtatcgccg6540gactcctcaa ggccgtgctg accgtacgcc accgcgagat ccacgcaagc ctcaacttca6600ccacccccag cactcgcatc cccatgaccg agctgggcct gagtgtcaac acggcactgc6660gtccctggct gagcgaggcc ggcccgctga tcgtgggcgt cagctccttc ggcatggggg6720gcaccaactg tcatgtcgtc ctcacggaat ggcacggcgt cgcaccggtg accgcacccg6780gcatccgccc caacgggaca gcggtgcccc tcctcatcac 
cggccgggac gagcaggcgc6840tgcgcgacca ggcgcaccac ctgggccggc acctcgacga gcacggtccg ctgcgcctga6900aggacgtcgc ccacaccttg gccgccggcc gcacggcgtt cgagcacagg gccgtgctac6960tcgtccgcga gccgcaggac atgaccgacg gcctcgcccg gctcgccgac ggcacgcccg7020gcccggacct cgtacgcgcc accgcgacct gtagctccct cgccttcctg ttcaccggac7080agggcagcca gcgccccggc atgaccgccg agttgtacca gtcctcgtcc gagtacgcgg7140ccgccctcga cgaggtctgc gcccatctcg atccccagtt gcgggtgccc ttgcgggagg7200tactcttcgc cgcggaagga acggcggaag cggtcctgct cgaccgtacg gagttcaccc7260agcctgcact gttcgccgtc gaagtcgccc tcttccggtt cgcggagcac tgtggcctcg7320tcccacggct gctgctcggc cactccgtcg gcgaactggc cgcggcgcac gtcgccggcg7380tcctgtccct cgccgacgcc tgcagcctgg tcgccgcgcg cggccgactg atgcaggccc7440agccggccac cggggcgatg gcggccatcc aggctacaga gaaggaactt gcgccgttcc7500tcgacgagtc ggtggcggcg gccgccctga acggcccggc ttctaccgtc cttgcgggcg7560
acgaggaagc cgtcctggcc atcgccgcgc actgggcggc caagggccgc agaaccaagc7620ggctgagagt cagccacgcc ttccactcgc cgcacatgga cggcatgctc gaggagttcc7680accgggtcgc cgggcagctg accttcgagg ccccccgtgt cccgatcgtg tcgaacgaga7740cgggcgccct gctcaccgag gcggaagcgt gctcgccgga gtactgggta cggcaagccc7800gcgtgaccgt gcggttcctg gacggagtgc gcctgctgga ggagcagggt gtgaccaccc7860tgctcgaact cggccccgac ggcacgctgt cgtccctggc ccgggactgc ctgcgcggcg7920tcgacgccgt gtccgtgccc ctgctgcgcg gccgcaccga accggaggag gtggtcgccg7980ccctggccac cctccaggtc cgtggtgtgc cgatgcactg ggagcggctg gccaccgagg8040agggcgcccg gcgggtgccg ctgcccacat acccatttca gcggcggcgc cactggctgc8100ccgacctggt cgcccaggat tctgtgcctg cccccggccg ggctgccgga cagcggtccc8160gtcccgtcaa cgagccggcg ccgtcggcgc acgcaccgcg cggcgaccgt acgatgcggg8220agaccgtccg ggcagccgtg gcactggtgc tcgggcacga ctccccggac gacatccccg8280cgcacacgac gttcagggag ttggggctca gctccctgat gctggccgaa gtcggcgagc8340ggctcaccga ggcgaccggg cgccgggtcc ccacgaccct gctcttcgac cacccgactc8400cggacgcact cgtacgcgag ctgacgtccg ggggtgctga acggcccgcg gcgctcacca8460ctgctccctc ggcggcgcac gccgacgacc ccgtcgtggt cgtgggcatg gcctgccggc8520tgcccggagg gatccggtcg ccggaggagt tctggcagtt catggcggcg gacggcgacg8580ccatctctcc gctgcccacc gatcggggct gggccgtctc cggggacttc cccgccgagg8640gcggtttcct ggcggacgtg gccgggttcg acgcggcgtt tttcgggatc tcgccgcgtg8700aggcgttggc gatggatccg cagcagcggc tgctgctcga gacgtcgtgg gaggcgctgg8760agcgggccgg ggtggacgcg ctgtcgctgc gcggcagccg caccggcgtc ttcgtcggcg8820cgagcccctc ggaatacggc cccagactcc acgaaccttc gcaagccgac ggacacgtgc8880tgaccggtac ggcgcccagt gtgctgtccg gccgggtggc ctatgtgctg ggtcttgagg8940gtccggcgct gacggtggac acggcgtgct cgtcgtcgct ggtggcgctg catctggcgg9000cgcaggcgct gcggggcggt gagtgcgact tggccctcgc cggcggtgtg gcggtgatgg9060cgacggcggg catgttcgca gagttcgcgc ggcagggggg tttggctcgt gatggccggt9120gcaaggcgtt tgcggatggt gcggatggta ctgggtgggg tgagggtgtc ggggtgctgg9180tgctttcgcg tttgtcggag gcgcgtcggt gtggttacac ggtgttggcg gtggtgagtg9240gttcggcggt gaattcggat ggtgcgtcga atggtttgac ggcgccgaat 
ggtccgtcgc9300agcagcgggt gattcgtcag gcgttggcgt cggcggggtt gtcgccgggg gatgtggatg9360tggtggaggc gcatgggacg gggacggcgc tgggtgatcc gatcgaggcg caggcgttgc9420tggccacgta tgggcaggag cgtggggcgg ggcggccgtt gtatgtgggt tcggtgaagt9480cgaatattgg gcatgtgcag gcggctgcgg gtgtggcggg tgtgatcaag tctgtgctgg9540cgttgcggta tggggtgctg ccgcggacgc tgcatgtgga tgtgccgtcg cgggaggtgg9600actggtcggc gggtgcggtg gagttgctga ctgaggcggt ggagtggctg gcggggggcc9660gtccgcggcg ggtgggggtg tcggcgttcg ggatcagcgg taccaacgcc cacgtgatcc9720tggaggaggc gccggagggt gtcgaggaga gcgcggctgg tgaggttgcg ggtgtggtgc9780cgtgggtggt gtcggcgcgg tcggaggagg ggttgcgggc gcaggctgcc cggttggtgg9840agcatgtggt gggcgggtct gggctggggc cggtggatgt gggctggtcg ttggcccggt9900cgcgtgcggt gttggagcac cgggcggtgg tgttgggagg ggatggggag gagttggtgg9960cggggcttcg tgcgttgtgc gatggggtgt tggggccggg tgtggtgcgg ggtgtggctg 10020gtgatggtgg gacggcgttg ttgttcacgg gtcagggtgc gcagcgtgtg ggtatgggcc 10080gggagttgta tgaggcgttt ccggtgttcg cggcggcgtt tgatgcggtg tgtgccgggt 10140tcgaggggat gttgcccggg tcgttgcggg gtgttgtttt tggtgatggt ggcggggttg 10200tggaccgtac ggagtgggcg cagccggcgt tgtttgcgct ggaggtggcg ttgttcgagt 10260tggtcgtgtc gtggggtgtg cgggcggatg tgctggtggg tcactcggtt ggtgagttgg 10320tggcggctca tgtggcgggt gtgtggtcgt tggcggatgc gtgtcgggtg gtggcggcgc 10380ggggtcggtt gatgcaggcg ctgcccgttg gtggggcgat ggttgcggtg cgggtgggtg 10440agggggagtt gccggtgttg ccggaggggg tgtcggtggc ggcggtgaac gggccgcggt 10500cgttggttct ctccggggat gaggggccgg tgcttgagct ggcggcgcgg ctggccgggg 10560agggccggga taccaggcgg ttgagggtct cgcacgcgtt ccattcggcg cggatggagc 10620cgatgctcgc tgagtttgcg caggtgctgg cggcggtgga gttccgtgcg ccgcggatcc 10680cggtgatctc caacgtgacc ggtgaggtgg ccggcgagga gctgaccacg cccgagtact 10740gggtgcgtca ggtacgcgag gccgtccgct tcgccgacgg agtgaacacc gcacacggct 10800cgggcgtccg gcgttatctc gaactcggac ccgacggcgt cctgacctcc ctcgctcacg 10860acatactggc cgagcagggc atcgaccggg atgtggccgt cgtacccgcg ctccgccatg 10920accagcccga atcccgcacg ctgctgaccg ccctcggcca actgcacacc accggcatgg 10980acgtgggctg 
ggcggccttc ctcgcgccgt acggcgcccg caccgtcgag ctgcccacct 11040acgccttcga acaccaccgt tactggttgg accccgtcgc acccgcctcg gcacctgcgg 11100atcctctccg ctaccgcgcc gagtgggcga gtgtgccgga ctgcgccacg ccgtcgctga 11160gcggtgtcca ggccgtcgtc gtccccgcgg gcgggggcca cctggatgtc ctgccggacg 11220ttacggccgc cctccgggag cacggtgcgc ggaccgtgct ggtcgaggtc gacccggagc 11280gagccgatcg cgccgagatc gccgacgccc tgcgcgcggc gctcggcgag gaaggcggcg 11340gcgtggtgtc gctgctcgcc ctggaccgcg ggcccttcgc gggcgtcgcc gcgaccgctg 11400tgctgctgca ggccctcacc gggctcgacg gcggtggccg cctgtggtcg ctgacgcgtg 11460gcgcggtgtc ggtgagccgc tccgacgcgc tgaccgaccc cgggcaggcc caggtgtggg 11520ggatgggccg cgtcgcggca ctggagcacc ccgagcgctg gggcggcctc gtcgacctgc 11580ccaccgagct ggacgaccgg gcgcgggctc ggctgtgtgc cgtactgtcg ggcagcaccg 11640gtgaggatca ggtggccgtg cgggcggcgg ggctgtacgc ccggcggctg caccgcgtgg 11700cgccccgggt gcccaccacc gaggacgcgg gcgccgcctc cggccagggg gtgggcgacc 11760
gccgggcgta tacgtacggc accgtgctgg tgaccggcgg caccggcgcc ctgggcgcgc 11820acatcgccaa ctggctcgcc aggtccggca cccggcatgt actgctcacc agccgccgtg 11880gcccggacgc cgagggcgct gcggacctca ccgcgcggct gcgggagctg ggcaccgagg 11940tgaccgtcgc cgcatgcgac gtggccgacc ggcagcgcct ggcggacctg atcgccgcac 12000tgtcggcgga ccgaccgctg acgggtgtcg tgcacgcggc cggtgtcctc gacgacgggg 12060tgctcgactc gctcacccca gaccggttcg acgcggtcgc ccggcccaag gtgatcggcg 12120cccggcacct gcacgaactc acgcgcgatc tcgacctgtc cctgttcgtg atgttctcgt 12180ccgtcgtcgg cacggtcggc ctggccggac agggcaacta cgcggccgcc aacgcctacc 12240tggacgccct cgccgtgcac cgggcccagc acggcctgcc ggcaacggcg gtggcctggg 12300gctcctggtc cggcgctggc atggccggcg acacccgggc cgcccgtgac cggctggcgc 12360gcgccggcct ggcgcccctc gaccccgccg ccgccctcgc cgtgctcgac cgggtcatcg 12420ccgacggcga gaccgccgtg accgtcgccg atgtggactg ggagcggttc gcggccgggt 12480tcgcccctgg caggccgcac ccgctgctcg ccgggatccc cgagctatgg cacgcccggc 12540cgcaggagac cggccaggtc accgatgggc cggcggaccg gcttgccgga ctggcgggtg 12600acgaactgcg ccaggcgctc gacgacatgg tgaccgtgga ggtcgccgct gtgctggggt 12660tccgggccaa ggaccgggtg ccgaccgacc gcaccttcaa gtcgctcggc ttcgactcgc 12720tgatcggcgt ggagttccgc aaccggctcg ccgccgcact cggcaggcgg ctgccgccca 12780gcctgatcta cgaccacccc acgccaggca ggctggtaga gcacctggcc gccggagtgg 12840acggcggcga ccagccctcg accgtcggcg ggcgaccggt tgcccccaca cgcacccacg 12900acgaccccgt tgtgatcgtg tccgccgcct gccggttccc cggtggcgtg cgtaccccgg 12960aggacctgtg gcagctcgtc ctcgacggcg gcgacgccat cggccccttc ccggtggacc 13020ggggctggga cctcgaccgc ctctacgatc ccgaccccgg cgcgtcaggc accagttacg 13080tccgcgaggg cggtttcctc accggcgtgg cggacttcga cgcggtgttc ttcgggatct 13140cgccgcgtga ggcgctggcg atggatccgc agcagcggct gctgctcgag acctcgtggg 13200aggcgctgga gcgggccggc atcgtcccgg gctccctggc cggcagccgg accggcgtgt 13260tcgtcggctc caacggccag gactacgcga acctgctgca ctcctccgat gtcgaggggc 13320atgtgctgac cggcacggcc tccagcgtcc tgtccggccg catcgcctac accctggggc 13380tcgagggccc cgcgctgacc gtcgacaccg cctgctcctc ctcgctggtc gccctgcacc 13440tcgccgtcca 
ggcgctcagc tccggggagt gcgacctcgc gctcgcgggc ggtgtgaccg 13500tcatgtccgg atccgacata ttcgtggagt tctcccggca gcgcggcctg tccgccgacg 13560ggcgctgcaa ggccttcggc cccgacgctg acggcaccgg ctgggccgag ggcgtgggca 13620ccgtcgttct ggaacggctc tccgacgcgc gccgcctggg ccatgaggtg ctgggcgtcg 13680tgcgcggcac cgccgtcaat caggacggcg cctccaacgg gctcagcgcc cccagcgggc 13740gcgcgcagca gcgggttatc cgccaggcgc tggccgacgc cggctgcgca ccgtccgacg 13800tggacgcggt ggaggcgcac ggcaccggca cccggttggg cgaccccatc gaggcgcaag 13860ccctgctcac cacctacggt caggaccgcc ccgccgaccg gccgctgtat ctcggctcca 13920tcaagtcgaa tatcgggcac gcccaggccg ccgccggact ggccggcgtg ctgaagatgc 13980tgttcgcact gaggcacggg cagctcccga agaccctgca cgccccgcgg ccgaccccgg 14040aggtcgactg gtccgagggc gcggtcgccc tgctcaccga ggaccggccc tggccggccg 14100tcgaccggcc gcgccgcgct ggcgtctccg ccttcggcgt cagcggcacc aacgcccacg 14160tgatcctgga gcaggcgccc ccgtcggccg cctccgaccc ggcacccacc gttcggccgc 14220ccgcggtgga cagctccgtc cagccgtggg tgctgaccgc caggtcgggg gaggcgctgg 14280gcgcgctcgc ggaccgcttg cgcgaggcgg cacccggcgc ggtcccggcc gacgtcgcac 14340gctcccttgt gacgaccagg acgatctggg cggagcgcgc cgtgctgctc gccgacggcc 14400gtgacgagta cgcctccggg ctcgccgcgc tggccactgg agagggcgac gcgcgggtcg 14460tgcgcggcac cgccgacacc cgcggccggg tcgtcttcgt cttccccggc cagggcgcgc 14520agtgggccgg catggccgcc cggctgtggg agtcgtcgcc ggagttcgcg cggtggatgg 14580atcgctgtga caaggccctc ggggacctga ccgactggtc cctcgccgag gtgatccacc 14640aggccgacgg agcgcccgga ctggaccgcg tggacgtgct ccagccggcg tcctgggccg 14700tgagcgtctc gctggccgcc ctgtggcgtt cctgcggggt cgaaccggcc gccgtggtgg 14760ggcactcgca gggggagata gccgcggcgt gtgtggcggg tgccctctcg ctggaggacg 14820gcgccatgct ggtgacgctg cgcagccggc tcatccgcga ggagctgtcc gggcacggcg 14880gcatgatgtc ggtggccctg tccccggccg gcacggcgga ccgcatagcc tgctgggagg 14940gcaggatctg cgtcgcagcg cacaacagcc gccgctccac cgtcgtcgcc ggcgagccgg 15000cggcgctggc cgaactgctc gccgcctgcg aggcggacgg catacgggcc cgccgcatcc 15060ccgtggacta cgcttcccac tcaccgcagg tggagcggat cgagcggaag ctgaccgagc 15120tggcggccgg gatcgtgtcg 
cgctcctcgg agatcccctt ccattccacc gtgaccggta 15180ccaggctcca caccacgggc ctggacgccg ggtactggta ccgcaacctc cgcaagcccg 15240tgctgttcgg gccggtcacc gaggagctcc tcacccaggg ccacgacgtg ttcctggaga 15300tgagcccgca ccccgtactg gtgccggccg tgcaggaggc ctccgacgcg gtcaccgcga 15360cagccgccgc ggtgggcagc ctgcgccgag gcgacggcgg cccggaacgg ttcctgctct 15420cgctggccga ggccttcgtc cgcggtgccc acgtggactg ggcggccgtg ctgggcggca 15480ccggtacccg cctggtcgag ctgcccacct atcccttcca gcgcacgcgg ttctggcccg 15540agccggtcac cccggccacc gcgaccggcg gccaggacga tgcaccgctg tggcaggccg 15600tggagcgcgg cgacgtggcc gccgtggccg ccgaactggc tgtgccggac ggccggtcat 15660tgcgtgacct ggtgcccgcg ctgtccggtt ggcgccgccg ccggagggac tccgcgacgc 15720tcgacatctg gcgctaccgc gtcacctgga cacaggtgaa cctgcccgtg tcggccgccg 15780tgaccggcga ctggctgctg gtgaccgacg accccgacac ggcggtcccc cggtgggtga 15840gcgcggcgct cggcgagggc ctcgccaccg tggtgcggcc ggcggacgtc cccgcatggt 15900cgcgcacgcc ccagggcacc gggtggacgg gcgtggtgtc cctgctgggc ctcacagatc 15960
actcgcaccc gtgtcacccc gccctgtcga ctggggtggc cgcaaccgtg accctgctga 16020ccgcgctgcg agaggccggg atcgaggcac ctctgtggtg tctgaccagc ggcgccgtcg 16080gcaccggtgg cctggaccag gtcacggcac ccaaccaggc ccagctctgg gggctgggcc 16140gggtcgccgg gctggagacc cccgcgacct ggggcggact tgtcgacctg cccgccgaac 16200ccgacgagcg caccgcggcc ttgctgcggg ccgcgctcac cgccgacgga atcgagcagg 16260agtacgccct tcggccttca ggaccgtacg tacgtcggct ggttcgggcg cccctggcgg 16320gcgtggcggc gccgcgctcc tggcgcccgc gacccgacgg caccgtcgtc gtcaccggcg 16380gcaccggagc actgggcgcc agggtggccc gctggctcgc ccgcgcgggc gccgggcacc 16440tgctgctgac cagccggcgc ggcccggccg ccgacggtgc cgtcgaactc tccgaggaac 16500tgcgggcgct gggggccgaa gtgacgatca ccgcctgtga tgtcgccgac cgggcccagc 16560tggccgatgt gctggcagcc gtgcccacgg cgtttccggt cagcgcagtg atccacaccg 16620cgggcgtgag cggcaacgcc ccgctcgccg ggaccaccct cgccgagctg gccgaggtcg 16680tcgccgccaa ggccgccggc gcccgcaatc tggacgagct gctggccggc caggacctcg 16740acgcgttcgt gctgttctcg tccggagccg ccgtctgggg cagcgcgggc cagggcggat 16800atgccgcggc caacgcgtac gccgacgcgc tcgcggccga ccggcgccgg cgagggctcg 16860tcgccacgtc ggtggcctgg ggcagctggg ccggcggcgg catggtcgac gacgacctcg 16920cgcgtgagct ggcccgcggt ggcgttcgct cgatggaccc cgaccgggcg atcgccgctc 16980tccagcaggc gctcgaccac gacgagaccg cgctgacggt ctccgacatg gactgggccc 17040gcttcgccga gacattcacc gccgcccgcc cgcgcccgct gatcgacggc atcccggagg 17100ccgcccccgc ctcggccgaa ccggccggcg atatccccgg cctggccgcg cggctcgcgc 17160agctgcccga cggcgagcgc gaccgggaac tgttggatct ggtcaggaac gccgcggcgc 17220ttgccctcgg gcacacgggc accgagccca tcacaccgtc gaagccgttc aaggaactgg 17280gcttcgactc gctgaccgcg gtcgacctgc gcaaccggct gacggcggcc accgggctgc 17340ggctgcccgc caccctcgtc ttcgactacc ccacgccccg cgcggcggcc gacgcgttgc 17400gggccgtgct gttcgccgcg gacatgccgg tcgacacggc cgcacccgcc cggagcgcct 17460ccgcccgacc ggcggacgac gacccggtcg tcgtcgtggc gatggcctgc cggtatcccg 17520gcggggcgac aacacccgag aagttctggg acctgatcgc tgcgggcgag gacggaatcg 17580gaggctttcc caccgaccgt ggctgggaga tcggccccgg cgcggccttc tcccggaccg 17640gcggtttcct 
ggcggacgtg gccgggttcg acgcggcgtt tttcgggatc tcgccgcgtg 17700aggcgctggc gatggatccg cagcagcggc tgctgctcga gacgtcgtgg gaggcgctgg 17760agcgggccgg ggtggacgcg ctgtcgctgc gcggcagccg caccggcgtc ttcgtcggcg 17820cgagcccctc ggaatacggc accttggtcg cttccctgga gggaggccag gactatgccc 17880tcactggcgc cgtcggcagt gtgctgtccg gccgggtggc ctatgtgctg ggtcttgagg 17940gtccggcgct gacggtggac acggcgtgct cgtcgtcgct ggtggcgctg catctggcgg 18000cgcaggcgct gcggggcggt gagtgcgacc tggccctcgc cggcggtgtg gccgtgatgg 18060ccacccccaa cgccttcgac gccttcgcgc ggcagggggg tttggctcgt gatggccggt 18120gcaaggcgtt tgcggatggt gcggatggta ctgggtgggg tgagggtgtc ggggtgctgg 18180tgctttcgcg tttgtcggag gcgcgtcggt gtggttacac ggtgttggcg gtggtgagtg 18240gttcggcggt gaattcggat ggtgcgtcga atggtttgac ggcgccgaat ggtccgtcgc 18300agcagcgggt gattcgtcag gcgttggcgt cggcggggtt gtcgccgggg gatgtggatg 18360tggtggaggc gcatgggacg gggacggcgc tgggtgatcc gatcgaggcg caggcgttgc 18420tggccacgta tgggcaggag cgtggggcgg ggcggccgtt gtatgtgggt tcggtgaagt 18480cgaatattgg gcatgtgcag gcggctgcgg gtgtggcggg tgtgatcaag tctgtgctgg 18540cgttgcggta tggggtgctg ccgcggacgc tgcatgtgga tgtgccgtcg cgggaggtgg 18600actggtcggc gggtgcggtg gagttgctga ctgaggcggt ggagtggccg gcggggggcc 18660gtccgcggcg ggtgggggtg tcggcgttcg ggatcagcgg taccaacgcc cacgtgatcc 18720tggaggaggc gccggagggt gtcgaggaga gcgcggctgg tgaggttgcg ggtgtggtgc 18780cgtgggtggt gtcggcgcgg tcggaggagg ggttgcgggc gcaggctgcc cggttggtgg 18840agcatgtggt gggcgggtct gggctggggc cggtggatgt gggctggtcg ttggcccggt 18900cgcgtgcggt gttggagcac cgggcggtgg tgttgggagg ggatggggag gagttggtgg 18960cggggcttcg tgcgttgtgc gatggggtgt tggggccggg tgtggtgcgg ggtgtggctg 19020gtgatggtgg gacggcgttg ttgttcacgg gtcagggtgc gcagcgtgtg ggtatgggcc 19080gggagttgta tgaggcgttt ccggtgttcg cggcggcgtt tgatgcggtg tgtgccgggt 19140tcgaggggat gttgcccggg tcgttgcggg gtgttgtttt tggtgatggt ggcggggttg 19200tggaccgtac ggagtgggcg cagccggcgt tgtttgcgct ggaggtggcg ttgttcgagt 19260tggtcgtgtc gtggggtgtg cgggcggatg tgctggtggg tcactcggtt ggtgagttgg 19320tggcggctca tgtggcgggt 
gtgtggtcgt tggcggatgc gtgtcgggtg gtggcggcgc 19380ggggtcggtt gatgcaggcg ctgcccgttg gtggggcgat ggttgcggtg cgggtgggtg 19440agggggagtt gccggtgttg ccggaggggg tgtcggtggc ggcggtgaac gggccgcggt 19500cgttggttct ctccggggat gaggggccgg tgcttgagct ggcggcgcgg ctggccgggg 19560agggccggga taccaggcgg ttgagggtct cgcacgcttt ccattcggcg cggatggagc 19620cgatgctcgc tgagttcgcg caggtgctgg cggcggtgga gttccgtgcg ccgcggatcc 19680cggtgatctc caacgtgacc ggtgaggtgg ccggcgagga gctgaccacg cccgagtact 19740gggtgcgtca ggtacgcgag gccgtccgct tcgccgacgg agtgaacacc gcactgggcc 19800gaggtgtgga caagttcctg gagttggggc cgtcgggccc gctgaccgcg atggccgagg 19860aggtcatcga acacaccggc acccgagcgg tctgtgtccc cgtgctccgc gccggacgcc 19920ccgaggacgc caccctgctg cacgcgctcg cggccgtgtt cgtcaccggc gccacagtcg 19980gctggacggc tccgctcgcc ggtaccggag cgcgggccgt ggacctgccg acgtacgcct 20040tccagcacaa gcggtactgg ccgcagccgg cgaccgtcgg ccgggacctg gccgcggccg 20100ggctcgccga ggccgggcat ccgttgctga cggcctggct cccctcgccg gagggcgagg 20160
atgtgctgtg caccggccgg atctccctgg cgacgcatcc ctggctggct gaccatgcgg 20220tgctgggcac cgtgctcgtc cccggcaccg cgttcgtgga cctcgcctgc tgggcgggcc 20280accgagtggg gtgcggcgcg ctgcgtgaac tgaccctcgc cacgccgctg gcgctcgcac 20340aggacatggc ggtgcggctg cggttggtgc tcggcgcgcc cgacgacacc ggctgccgcc 20400cggtcgcgct gtactcgcag caggaaggcg cggacgaagg gacggacggg acgggctgga 20460cgcggcatgc cgagggcctg ctggccccgg gcggcgacgc gtccgtacag ccgcccacgg 20520acttcgagac ctggccggtg acgggctgcg agcccatccc actggacggt ttctacgaag 20580agctcgccga cgcgggcttc tcctacgggc cggtctttcg gggcctgcgg gccgcatggc 20640ggcgcggcgg ccaggtcttc gccgaggtga gtctgcccgc cgacgagacc ggtggcttcg 20700gcgtccatcc ggccctgctc gacgcggccc tgcacgcgct ggggccggtc tcacgggaca 20760cggacgagcc cggctcggcc cggctgccgt tctcctgggg cgaggtacgg gtgcacgcgg 20820ccggcgccga ccgcttgcgg gtctgtctgg tccgggccga ggacggcacc gtcacgttgc 20880atggcgcgga cgccgcgggc cggccggtgg taaccgtcgg ctcgctggtc ctgcgcccga 20940tctcgccaga gcgactgcac ggcggcgcag cggcttttga cgacgcgctc ttcaccactc 21000gctggatgcc gctgagcgtc gccgacggca tcgcatatcc cactgccgac tgcgtactgc 21060tcggtgaccc tctggaacgc gcctggcggc accaccccga cctcgactcg ttcgccgagg 21120cactcgcggc cggcaaggaa aaaccgggta cggtgctcgc tcgctgtccg cgggacatcg 21180cggccggcgt cgaccctgcc gaggcggccc ggcggtgtgc ggagtgggcg ctcgacctgc 21240tcaagcggtg gctggacgac gaccggctga cggactgtca tctcgtgatc ggcacccggc 21300acgcggtgac caccggcgcc gaggaccaga ccgccggccg gacggacgac cccgccgtgc 21360tcgcccagtc cacgcttctc ggcctggtcc gctcggccca gaccgagaac cccggccgtg 21420tcaccctggc cgacttcgac ggcaccgcac ccgacccggc gcacctcatc ctggccgtac 21480ggcaggcgga gccggaggtg gctgtgcgcg ccggccggct ttacgcccgc agactcaccc 21540gtccggacac cgggcgggcc ctggccgtcc cgccgggagc gggctcgtgg cggctggaga 21600gcaccgggcg cggcaccctg gacaacctgg cgctggttcc ctgcgcccag gccgaggagc 21660cgctgggcga gggcatggtg cggatcgccg tacgcgccgc cggggtgaac ttccgggacg 21720tcctgatcgt cctggacatg tatcccggcc gcgcggacct gggcaccgag tgtgccggag 21780tcgtggtgga gacggggcac ggcgtcaccg gactggtccc gggggaccgg gtgatgggca 21840tggtggccgg 
ggccttcgcg cctaccgccg tggtcgatca gcggttcctg gttcggatac 21900cggacggctg gtcctacgag acggccgctg ccatcccggt cgccttcttg accgcctact 21960acggcctggt cgacctggcc gggctgagtg cgggggagtc ggtgctcgta cacgccgccg 22020ccggcggggt gggcatggcc gccgtccagc tggcccggca cctgggcgcc gaaatgtacg 22080gcaccgcgag cgagccgaag tgggacacgc tgctcgacag cgggctggac cgcgcgcaca 22140tcgcctcctc acggacgacg gtcttcgccg actcggtgat ggaggcgacc gggggtgcgg 22200gcgtggacgt ggtgctgaac tcgctcgcgg gcgagttcgt tgacgcctcg ctgcgggccc 22260tgccgcgagg cggccggttc gtcgagatgg gcaagaccga cctgcgcgat cccgagcggg 22320tcgctgccga gcaccccggg gttcggtacc ggcccttcga cctgggtgag gccggcgcgg 22380accgcatcgc cgaggtcctt gcgcacctgg ccgagctctt cgcctccggt gagctcaccc 22440cgctgcccgt gaccgtctgg gacatccggg acgccccggc cgccttccgt gcgctcagcc 22500aggccgcact caccggcaag ggcgtactga ccgtccccgc cccttccttc gaggccggcg 22560agacggtgct gatcacgggc ggcacaggaa ccctgggcac cctgctggcc cggcacctgg 22620tgaccgagca cgggctgcgc cacgtcatcc tggccggacg ccgtggtacc gagaccgcgg 22680aggtgcggca cctgcgcggc gacgtggcgg aactcggtgc gcgcatcgag gtggtggcct 22740gtgacgccgg cgacgagcgg gccctgcgtc aggtgctgga cgccctcacc gccgagcacc 22800gcctcgcggg cgtcgtgcac gccgccggcg tcaccgacga cggggttgtg tccgccctgg 22860accgcggccg gctgtccgcc gtactccacc cgaaggtgcg cggagcgtgg aacctgcacc 22920ggctcaccgc aggctcggaa ctccggatgt tcgtgctgtt ctcctccgcc tcggccaccc 22980ttggcgcggc gggtcagggc aactacgccg cggccaacgc cttcctcgac gcgctcgccg 23040agcaccggca cgcccttggg ctgcccgcca ccagcctcgc ctgggggctg tgggagcagg 23100ccagcggaat gaccgggcgg ctcctcgacc gcgaccggca gcggatgagc cggtccggca 23160tcgtgcccct cagctccgcg cacggtctcg cgctgttcga cgccgcgcgg ctggccggcc 23220tccccacgct caccccggca cgcctggacc tggccgcgct tcgggtgcgg tacgcacacg 23280agcaggtgcc cgcggtcctt cgggaactcg tccgcgtccg gccgtccgca gccgaggacc 23340ccacgacagc cccggacacc acgaccgcac cgggaccgtc cggtgccatg acactggcgg 23400accggctggc aggactgtcc gccccggagc ggcagcggca cgtcctggac ctggtgcgtc 23460ggcacaccgc ggccgtactg ggccacggct cggccgacga tgtcgacccc gaccaggcgt 23520tcaaggccct cgggttcgac 
tcgctcaccg cggtcgaact gcgcaaccac ctgcggacgg 23580cgacctccct ggccgtcccc gcgaccctcg tcttcgacca tccgacaccg gccgcgctcg 23640ccgcgcatct cctggaactc gccgcaccgc cggaacggga cccggcgctc cgggtcatgg 23700gcggactcga ccggctcgag gccgacgtcg aggcgctggc ctccggcggc gccgggcacc 23760aggaggaggt ggccacgagg ctgcgccgcg tgctgcgacg cctggagtcg ggcccggggg 23820ccgcccactc cggcacggag gaaacctccc tcgacaccgc ctcggcgacg gaagtcctcg 23880ccttcatcga cagcgaattc ggcgatctcg cctagtacag gtacggagtt gatcgcagtg 23940gtcagtgacg acaagcttgt cgactacctg aagcgggtca ccgcggacct caaacggacc 24000cgtcagcggg tccacgagct ggagtcgggc agcgccgaac cgatcgctgt cgtggcgatg 24060gggtgccgct tccccggagg catcagctcc ccggaagacc tctgggagtt cgtgcgcctg 24120ggcagtgacg ccatctcgga gttccctacc gaccgtggct ggcacaccag ccggctgagc 24180gggaacttcc ggcgggccgg cggattcctt tatgacgcgg gcgacttcga cgcgggtctg 24240ttcgggatct cgccgcgcga ggcgctggcg atggacccgc agcagcggct gctcctcgag 24300acggcctggg agacgctgga gcgggccggt gtcgacccca cctcggtgcg gggcgccgac 24360
ggcggcgtgt tcatcggcat ggccgaccag aagtacggcc cccgcgacga cgaactgctc 24420ggtgaggtca ggggactcgt cctgaccggc acgaccagca gcgttgcctc aggccggatc 24480gcctactcac ttggtctgca agggcccgcg atcaccatcg acaccgcctg ctcgtcgtcg 24540ctggtcgcac tgcacctcgc ggtgcgctcc ctgcgggccg gtgagtgccc gttcgccctc 24600gtcggcggcg ccgcggtgat ggcggagccc accctgttcg cggagatggc cgagcagggt 24660ggcatggccg gagacggccg ctgcaaggcc ttcgcggccg ctgcggacgg caccggctgg 24720ggcgaaggtg tcggcgtact cctgctgcag ccgctgtcca ccgcacgcga gcagggcctg 24780cccgtcctgg ccaccgtacg cggctccgca gtcaaccagg acggagcctc caacggcctc 24840tcggccccca acggccccgc ccagtgccgc gtcatccgca aggcgctcgc cgacgcccag 24900ctcgtcgccg gccagatcga cgccgtggag gcccacggca ccggcaccgc gctcggcgac 24960ccgatcgagg cgcaggcgct gctggccaca tacggccagg accggcccgg cgacgagccc 25020ttgtggctcg gctcggtcaa gtcgaacatc gggcacaccc aggccgcggc cggtatggcc 25080ggcgtcatca agatggtgca ggcgatgcgg cacgggctgc tgccgcgcac cctgcacgtg 25140gacgaaccga cccctgaggc cgactggtcg gccggtgatg tgcggctgct gacggaggag 25200cgggaatggc cggacacggg acggccgcgg cgcgcggcgg tgtcgtcgtt cgggatcagc 25260ggcaccaacg cgcatgtcgt gctggaactg cccaccggca ctgtcgggga gccagccgat 25320gcggccgggc cggttccgga cccgtcggcc tgcgccccga ttccgtggct gctgtccgcc 25380gcgagcgccg acgccctgcg cgcgcaggcc cgcagactgc accgcttcgt ggacacaccc 25440ggtgccccgc gcccgatcga caccgccctg tcgctcacgg tcacccgggc ccgactcgac 25500caccgcgcca tcgtgttcgg caccgaccag gcagaactgc gggccggact gggggcattg 25560gccgcgggcg aaagcacccc gcggactgtg cacggacgga ccgtgccgag cgcgacgatc 25620gcgttcctgt tcaccgggca gggggcgcag cgggcgggca tgggccgggc ggcgtacgcc 25680gcgttccccg agttcgccgc ggcgttcgac gcggtgtgcg cggagctgga cggactgctg 25740ccccggccgc tgaagtcggt gctcttcgcc gagccgaact cggccgacgc cgcactggtc 25800gaccagaccc tgtacgccca gaccggcttg ttcgctttcg aagtggcgct gttccggctg 25860ctggaggagt ggggtgtccg accgggagtg ctgctggggc attccgtcgg cgagctggcc 25920gccgcgcacg tggcgggcgt ctggtcgctg ccggacgcct gccgtgtggt cgcggcgcgg 25980gcgcggctga tgcaggccct gccggaagac ggggcgatgc tgtcggtggc cgccagcgag 26040aagcatatcg 
ccgaactgct gggcgacctc gccgacgtgg atgtggccgc ggtcaacggc 26100ccggccgtca ccgtgctgtc ggggccgacg ggtgccgtgg cggacgtcgg ggagcggctg 26160gccggcgcgg ggctgcgcac gaaacacttg cgagtgagtc atgccttcca ctccgccctg 26220atggagccga tgctcgcgga attcgcacgc gagatcgccg acgtgacctt ccagcagccc 26280gagctgccga tcatctccaa tctgacgggc cagcaggccg acgcggccga gctgggctcc 26340gccgcctact gggtgcgcca ggtccgtggc accgtacggt tcgccgacgg tgtcggccga 26400ctcgccgcgc acggcgtcac cgcctgcctc gaactcggcc cggacggtgt gctgaccgcc 26460ctggcccgcg actgcctcac ggccgccgcc gatgtggcct tggtgcccgc actgaggcgc 26520gaccaggacg agccggccgc gctcctcgcc gcgctcgccg aactccatgt gcggggcgtc 26580gaggtggact gggccgcgat gctgaccgca cgcggcggcc ggcgcgcagc cctgcccacc 26640tacgccttcc agcgagagcg ctactggctg cccgccaccc cctccgtcgc ctccgccgtc 26700tccgcgcccg ccgagcaggc agaccggctg ctgtaccgcg tcggctggtc gccggtcacc 26760ggtttcgaca ccgaggcgag gccggagggc acctggctcg tcgtcgcttc cccggacgac 26820gagggccgcc gcgtcgccca ggcgctcggc ccgcacaccg tgctcgtggc ccacgacccc 26880gatgacccct ccgggtcggt ggcacggctg cgcggcgccc tgccggccga ccgcccggtg 26940accggggttc tggccctgcc ggagcagacc ggcgcggcgg ccgtggcggc ccagctcgca 27000ctccgcgagg ccctccgcga cgccgaagtg cgcgcccccc tgtggtgcgc gacccgtgcg 27060gcggtctcgg tcggaggtga ggccaccccc ggcgccgcac aggcaccgct gtggggattg 27120aacagggctc tggaaacctg cggggggatg gtggacctgc ctcagcggtt ggactcacgg 27180tccctggggc tgctggccgc ggccctcacc aacccggccg acgccgacga actcgccgta 27240cgcaccggtg ggctgttcgc ccggcggctg cacgccgtgc agcccgtgcc gcgcgctcct 27300cgcccgtggc gggccgacgg gaccgtcctc gtcaccggcg acgtcgagtc ggcaaccgat 27360gacctgctgc ggaggctgag cggtgacggc gagcgccccg tggtgctggc gcggcgaccc 27420ggcaccgccc tgcagaacgg ggccgcgggc gacgggtcgt gcaccgtcgt ggagtgggac 27480cccgcggccg gcgcacccga gacgccctcg ccggtgaccg ccgtcgtaca tctggacaac 27540atccagccgt ccgccccacg ggacgacgcc gaccctttgg ccctggctgc cgcagtggcg 27600gagcggctcc acaccgtcga ccggctgacc gagctgttcg gcaaccagga tctggacgcc 27660ttcgtgctgt tgtcctcggt tgccgggatc tggggcggcg ccgaggacgt cgttcacacc 27720gtcgtgcacg cggccctgga 
gtccgccgcc gaacgccggg ccgctgccgg cctgcgcggc 27780gcctgcgtag gctggggccc ctgggccggc gccggcgacg ggccggacgt gcccggactc 27840gtacccatgc gccccgagcc ggcactggcc gcgctgtggc acgcgctgga cgacgacgcg 27900gccgtcttcg ccgtcgccga cgtcgactgg ccgcggttcc acccggtcct caccagccgg 27960cgtccccggc ctgtcgtctc cggtctgccc gaggtacggg cgctcaggcc ggcgccatcg 28020gcggcgcccg ccgtcggcat ggacgtcacc gacctggaac accggctgcg ggacctggtg 28080ctcaccgagg ccgcaacggc gctcgggcat gccttccgcg actcgatgga cccgctgcgc 28140cccttccgcg acgccgggtt cgagtcgctc accgccgtgc gtttccgcga ccggatcgcc 28200tccgaaaccg ggctgaacct ctcggccacc ctcgtcttcg accaccccac gcccgaggcg 28260gtcgtggccc acctgctggc cgaactgacc ggggggcggc ccgacgaggc ggagcaggtc 28320agcacccgat cgcacgacga cccggtggtc atcatcggca tggcgtgccg ttaccccggc 28380ggggtcagcg accccgaggg cctgtgggaa ctggtccact ccggacgcga aggcatcggt 28440gacttcccca cggaccgcgg ctgggacctg gcggcgctgc gacgtgccgt cccccacctg 28500gccctcaggg ccggcttcct ccccgacgcc gccgccttcg acgccgcctt cttcgggatc 28560
tcgccgcgcg aggcactggc tatggatccg cagcagcggc tcctactcga agcctcgtgg 28620gaggctgtgg agaccgccgg catcgatccg gcgtcgctgc gcggcagccg caccggggtg 28680ttcgcgggcg tggccggctc cgattacggt gccgcactcg ccggttcgcg tgaggcagag 28740ggctatctga tgaccggaac ggccaccagc gtggtctccg gccggatcgc ctatgttttc 28800ggtctgcagg gcccggcgct caccatcgac acagcctgct cctcctcgct ggtggcgctt 28860cacacggccg tgggcgcgct gcgcaagggc gagtgcgacc tcgcctttgc caccggcgtc 28920gccgtcatct ccaccccgga cgctttcgtg gacttcgcca agcaggacgg gctggcggca 28980gacggccgct gcaaggcgtt cgccgtcggc gccgacggga ccaactgggc cgagggtgtg 29040ggtgtgctgc tcgtcgagcg gctctccgac gcccgccgca acggtcatcg tgtgctggcc 29100gtgctccgcg gcagcgcggt caactcggac ggcgcctcga acgggctcgc cgcacccaac 29160ggcggcgcgc agcagcgggt catccgccag gccctggcgg acgcagggct gaccgccccg 29220gacgtcgacg ccctggaggc gcacggcacc ggcacggcgc tcggcgaccc gatcgaggca 29280caggccgtgc tggccaccta cggccagggt cgccccgccg atcggccgct gtggctgggc 29340tcactgaagt cgaacatcgg ccacagcgcc gccgcagccg gtgtcggcgg cgtgatcaag 29400atggtggagg cgatgcggca cggagtcctg ccgccgaccc tgcacgccga cgagccgacc 29460cacgaggtgg actggtccgt gggcgcggtg gaactgctca ccacggcacg cgactggccc 29520gagaccgggc ggccgcgccg cgccgcggtg tcgtcgttcg gcgtcagcgg caccaacgcg 29580cacgtcatcc tggaacaggg ccccgacctg gccccgggcg gcgtgcccgg tgtccaggag 29640gaccccgcgc ccagggccgc gggaggatgt gccggcaacg ccgtcccctg gctgctgtcc 29700ggacgttccg cccgggctct gcgcgaccag gcggcccgtc tcgccgggca tctgacgcgc 29760ggtgacccgt cggccgaagc gatcggacac gcgctgctca cctcccgtac cgccttcgag 29820caccgggccg tcgtgctggg cggcggtacc gtcgatctcg tcgaaggact ggatgctctc 29880gccgccgggg aaccggcccc gtcggtggtc gccggcgcac ctcgtccgac cggccgtgga 29940cccgtcttcg tctttcccgg ccaaggtggt caatggtccg gcatggcgtc cgaactcctc 30000gacacctgcc cggccttcgc cgcccgctgg gccgagtgcg agcgtgcgtt cgcgccgcac 30060atggacgtct cgctcacgga ggcggtccgc gacgccgcgg ccctggagcg ggtcgatgtc 30120gtccagcccg tgctcttcgc ggtcatggtg tcgctggtcg aggtgtggcg ttcgtacggg 30180gtacggcctg ccgcggtgat cgggcactcg cagggcgaga tcgccgcggc ctgcgtcgcg 30240ggagcgctgt 
ccctcgacga tgccgcccgc gtcgtcgcgc tgcgcgccag ggcgctcggc 30300gtgctggccg gtgcgggcgg catggtctcc gtcgcactcc cgcccgccga gaccgagggc 30360tggctgcggc gttgggagga ccgcatctcc gtggctgcgg tcaacggccc ctcctccgtc 30420gtggtctcgg gtgaaccggc tgccctggag gaactggtgg agcaagcccg cacccgggac 30480gtccgggtgc gccgcatcga ggtcgactac gcctcgcact cggcacaggt ggcccgtatc 30540gaggacgagg tcctccgact gctggaaccg attcggccgc ggacgtccga ggtccccttc 30600ttctccaccg tctccacgca gtggcaggac accaccgcga tggacgccgc ctattggtac 30660cgcaacctgc gcgatccggt gctgttcgcc ccgtccgtcg gcgcgctcgt cgaccagggg 30720cacacggtgt tcgtagaggt cagccctcac ccggtgctca cctccggcct gctggagacc 30780gctgaacgcg ccgacgtgga cctgacggtc accggcaccc tgcgccgggg tgagggcggg 30840ctcgcccgga tgcgtgcctc gctcgccgaa ctgtgggtgc acggcacgcc cgtcgactgg 30900tcggccgcct tcgacccggc cccggcgggg ccggtgccgt tgcccactta cgccttccag 30960cgcgaccgct attggcccga tccgcgcccg gcgtccgccg acccggtgta cgagaccttc 31020tggcgggcgg tggacgaggc ggacctgccc gcgctgaccg gaaccctggg cgtcaccgac 31080gaccagccgt tgcgcgaggt gctgcccgcc ctgtcggcct ggcggcgcag ccgtacggaa 31140caagcggtca cagacagctg gcgctaccgc gtgtgctgga agcggctgcc ggacgcggcc 31200cccgccgaac tgcccggcac ctggctcctg gtgaccaccg agggcgccgc cggggacccg 31260tccgccgctg ccgccctgca ggcagtgcgg gacgctgccg ggcacaccgt gacccttgct 31320gtcgacagcg acgacgagcc ggcttcactc gcggcggcgc tgcgcgagac gctgcgggga 31380acgcatccgg ccggcgtggt cacactgacc ggcacggacg tctcaccgca cccggtcagc 31440ccggtcgtcc cggtgggcac ggccctcacc gtcaccctgc tccaagccct ggacgcggcc 31500gacgtcgatg cgccgctgtg gtgcctgaca cgcggcgccg tcgccaccga cgacgacacc 31560gccgggcccg gcagcccgct ccagtcggcg ctctgggccc tgggccggat cgcggccgtg 31620gagtctcccg gcaactgggg cggtctcgtc gacctgccgg acaccttcga cgacagcgcc 31680gcgcggcgac tggtgtcggt cctcgccagc ctggacggcg aggatcaggt ggcgctgcgc 31740gtctccggcg cgtacggccg tcggctgatg cgcgccaacc ccactgcctc acccggctcc 31800ggctggcgcc cgcgcggcac cgtgctggtg accggcggca ccggagcgct cggcgggcgc 31860gtcgcccgct ggctcgcccg ggacggcgcg gagcatatcg tgctcgccag ccgccgcggc 31920tcgcaggcac cgggagtcga 
cgacctggtg gccgaactga gcggcctcgg ggcccaggtg 31980acggtggact cctgcgacct gagcgtggcc tcggaagcgt tcgcgctggt cgaccggata 32040cagcgcgacg gcgaccggat cggggcagtc atccacaccg cgggagcggg tggcctcgga 32100ccgctcgtcg acgcgggact ggacgacatg gagctggcca tggccggcaa ggtcgccggc 32160atcgacaacc tggagcgggc gctggacgac caacagctcg acgcggtcgt ctacttctcc 32220tccatcagcg cgtcctgggg cgccggtgac cacggcatct acgcggcggc caacgccgtc 32280ctggacgccc gcgccgaggc ccggcgtgcg gccggcgtgc acaccgtgtc ggtggcctgg 32340gcgccctggg gcggaggcgg catgatcgac gacccggccg tggcggacac actgaaccgc 32400atgggcctgc ctctggtgga ccccgacctc gcgatcagtg gtctcgccac gatcctcgcc 32460gagggggagg agtcgctgct gctggtggac gtggactggg gcaggttcat cccccagttc 32520accctgcgcc gccccagccg cctgttcgac gaactgcccg aggcacgggc ggcggaggcc 32580gacacggggc ctgccaaggc cgacgcccct tccccgctgg ccggtcggct ggccgggctg 32640agcaaggcca agcgcgccac ggcgctgcgc gacctcgtac gcgagcacgt cgccgcggtg 32700ctgggccaca acgacccggc ggccgtcgat gccggccggg cgctgaagga cctcggcttc 32760
gactcactga cggcggtgga actgcgtgac cgactgagca ccgtggccgc aatgcgcctc 32820ccggccaccc tggtcttcga ccatcccacc atcgccgaac tggccgactt cctggcccgt 32880ggcctggagc cggagacggc ccggccgacc gccgcacccg ccaccgtcgt acgcgtcgac 32940caggacgagc cggtcgccat cgtcgccatg gcctgccgct accccggcga catcgcctcc 33000gcagaggagc tgtggcgtgc tgtccgcgac gagaaggacc tgatcagccc attccccatc 33060aaccggggct ggccggttga ccgactgctg gacgccgatc ccgaccggcc cggcaccagc 33120tacgtcgacc acggcggatt cctgcacgac gccggtgact tcgaccccgg cttcttcggt 33180atctcgccgc gagaggcaca ggccatggac ccgcagcagc ggctgctgct cgaatcgtcc 33240tgggaagtac tggaacgcgc cggtatggtc ccgaagtccc tgcggggcag ccggaccggg 33300gtatacgtcg gtctgaccga ccaggcctac ggcactcgcc tgcgcggatc cttggacggc 33360atggagggct tcctcgtcag cgcgtcgtcc aacgtggcct ccggccggat ctcgtactcg 33420ctcgggctcc agggccctgc gctcaccgtg gacacggcct gctcgtcctc gctggtggcc 33480ctgcacctgg ccacccaggc gctgcgcaac ggcgagtgcg acctcgccat cgcgggcgcc 33540gccaccgtca tgccggaccc cacctccttc atggccttca gccggcagcg cggactggcc 33600gccgacggcc gctgcaagcc gttcgccgcc gccgcggacg ggttctccct cggcgagggt 33660gtcggcgtcc tgctggtgga gcggctgtcc gacgcccgcc ggctcggaca ccccgtgctg 33720gcgctgatcc gtggctccgc ggtgaaccag gacggggcct ccaacggcat caccgcgccc 33780aacggccctt cccaggagcg tgtcatccgg caggccctgg tcaacgccgc gctgcccgcc 33840tccgcggtgg acgtggtgga ggcgcacggg accggcacca ccctcggcga cccgatcgag 33900gcgcaggcgc tgctggccac gtacggccag gaccggcctg ccgaccggcc gctgcgactg 33960ggctcggtca agtcgaactt cgggcacacg caggccgcgg ccggcatggc cggtgttatc 34020aagatggtgc aggccatgcg gcacgagttg atgccgcgca ccctgcacgt ggacgcgccc 34080agcccgcacg tggactggag ctccggggcg gtggagcttc tcgccgaggc gcgaccgtgg 34140ccgcggggtg atgagccgcg ccgtgctggg gtctccgcct tcgggatcag cggcaccaac 34200gcgcatgtcg tcctggaaga ggcatcgcag gagccgacgc ccgacgggag cgccggggcg 34260ccggatacgc cggatacgcc ggacgcgccg gtcgaggcgg acaccggccg tcccctgccg 34320ctcgtcgtct cggcccgcac cccggacgca ctgcgcgacc aggccgcccg cctgaccgcc 34380ctcctggacc gggaagaaca cccggtctcc gacctcgcct actcgcttgc cacggcccgc 34440ggtgtgttgg 
accgggccgc ggtcgtcgtc gccgcggacc cggacgaact gcgccggaac 34500ctggccgacc tgaccacgag agcggtcgcc gagcggcggg ccgagggcgg cctcgccttc 34560ctcttcaccg gtcagggcgc ccagcgcgcc ggcatgggac gctccctgta cgacgccttc 34620cccgagttcg ccgcggcctt cgacgaggtg tgcgcggaac tcgaccggca cctgccccgc 34680ccgctgcgca ccgtcgtgtg ggccgagccc gggacggacg aggccgcgct gctcgaccag 34740accctgtaca cccagaccgg tttgttcgcc gtcgaggtgg cactgttccg gctgctggag 34800cactggggcg tacggcccga cgccctgctc ggccactcgg tcggcgagct cgccgccgcc 34860cacttggccg gcgtgtggtc gaccgaggac gcggcccggg tggtcgccgc ccgcgcccgg 34920ctgatgcagg aactgccgga gggcggcgcg atgctgtccg tcgctgcggc cggggacgag 34980gtgtccgccg tgctcggcga cgcgtccgcc gaagtcgccg tcgctgcggt caacggcccc 35040gcgtcgctgg tcctgtccgg caccgaggag tccgtgacgg ccgccggcgc ccggctcgcc 35100gaggcggggc tgcgcaccaa gcgacttacc gtcagccatg ccttccactc gtccctcatg 35160gaaccgatgc tcgccgcgta cgagcatgaa ctcgcccagg tcgccttcgc cgagccggcg 35220ttgcccgtcg tctccaacct caccggggag gtggccggcg ccgagctgtg cgaacccgcc 35280tactgggtga ggcaggtgcg gcaggccgtg cggttcgcgg acggggtgcg caccgtgctc 35340gacgagggcg tgaccaccct cctggaactg ggcccggatg gcgtcctgac cgccatggcg 35400caggagtcgg ccggggagcg ggccaccggt atcgccgccc agcgccggga ccgtgaccag 35460gtgcggaccc tgctcaccgc actcggcagg ctccacgtgc gcaccgaacg cgtggactgg 35520gccgcgttct tccgcggcac cggggcccgc cgggtggacc tgcccaccta cgccttccag 35580cgccggcgct actggctgga cacgtcgtcg ggtggtgccg aggcactggc cggcgcgggc 35640ctggcgggta ccggacaccc gctgttgacg gcgtccgcga cgctgcccgg gacgggcgag 35700tccctcttca gcggcagcct gcccggggcc ccggacggac ggcccctgtc gggcggcgag 35760atcctcgaac tggtgctgtg ggcgggtggg aacttcggct gccaccggat cgccggactc 35820gatgtcgccg gatcggtgcc ccacgctccc caggcgccgc tgcaactggt ggtggcagca 35880cccgacgagt ccggaaaccg ggccttcacg ctccacctgg gcccggtggg cggcccgcac 35940ggcccggtgg agggcccctg gacgcggatc gcccacggcg tactcggcgg cacccccacc 36000cccctgccgc cggagcccgg caccgcggcc tggccgccgg ccgacgccga gcccgtcgga 36060gccgacctcg tctggcgccg ggaggacgag ttgttcgccg aactggagct ggccgaacgg 36120aacgcggcgg acgtcgaccg 
tttcgccctg caccccgggc tgctggcgga ggtgatggag 36180ctgatcgccg gactggccgg agaaccggtc cacttcaccg gggtgacccg gtacgcgacc 36240ggcgccaccg tgctgcgcgt ccatctgacc cgcgtcgccc ccgacaccgt caccgcgctg 36300ctgacggacg cggaaggcga accggtgctc tcggtggacc gggtccaggt ccgtgccgac 36360ggtgcggcgg ccgtgcgctc ggccacggcc gccgcaccgg acgccctgta cgagctgacc 36420tggacaccgg tcggcgccga ggccctccca ccggacaccg gctgggcagt ggtgggcgtc 36480cccgccggcg acctggccaa ggtcctggag gcgcagggcg ccgaggtggc aacccaccct 36540gacctggcgt ccctcggcag cacggccgac cgcggtgaca tgcctggtct tgtcgtcctg 36600tccgtggaga cggcacccgg cgcacccctg gagtccgcgc gactgaccgt tcaccacacc 36660ctgcgtctgg tcggggaact cctcgcggac acccagctca ccggcacccg gttcgccttc 36720gtcacgaggg cgtccgtgtc caccggcgac ggcgcggcgg tcgacccggc gcaggcggcg 36780gtccgcgggc tgctgctctc cgcccaggcc gagcacccgg accggttcgt cgtcgtcgac 36840ctgggcggcc gggaggagga cgccgatctg ctcacggcgg ccgtcggcac ctccctggcg 36900gcggctgagc cgcacctcgc gatccgcgac ggccggctgc tcgtgccgcg gctggcccgg 36960
gtcaccgagc caccgcaggc ctttgcagcc gggcccgagg agcacggcac ggtgctggtc 37020accggcgcca ccggaggcat cggcaccaag atcgtgccgc acctggtggc cgagcacggc 37080gtgcgcaggc tgctgctgct cagccggaag ggccctgacg acccccgcgc ggccgaactg 37140ggccgtgagc tggcagcgta cggcgccgag gcgacgttca cggcctgtga catcgctgac 37200cgcgcggcac tcgaggccgt cctggccgag gtgccggccg agcacccggt gaccgcggtc 37260gtgcacatcg ccggagtcgt ggacgacggc gtgctcacca cactgagccc cgagcgcgtc 37320gacaccgtgc tgcggcccaa ggccgaggcc gcgcagcacc tgcacgagct gaccgccggc 37380cttgagcttt cccatttcgt gctgttctcc tccggcgtcg gcgtgctcgg gggcgccggg 37440caggccaact acgcggcggc caacgccttc ctggacgcgc tggcgcagac cagacaggct 37500gccgggctcc cggcgtcctc gctcgcctgg ggcctgtggg agaccgacat gggcatgtcg 37560gcgcgcctgt ccgaggtcga ccgccgtcga atggcccagg cgggcgtcct cgccctcacc 37620ccgcagcagg gcatcgcgct gttcgaccgg gcgtggaact ccggtgcggc gaccctggta 37680ccgatgagcc tggacacggc tgtgctgcgc aggaaggcag ccgactccgc cctgcccgcc 37740ccgttccgcg cactggtccg cacaccgctg cgccgggccg ccgccggccc cgcacaggcg 37800gcgggacagt ccttcgcgca gcggctggca gagcagcccg ggagcagtcg caggcggctg 37860ctgctggagt tgatccagcg acaggtgggc accgtgctgg actacggggc cgacaccctg 37920ctcgatgccc ggcgcacctt ccgggagctg ggcttcgact cactcaccgc ggtggagctg 37980cgcaatcgcc tggtcgccgc gacgggcgtc cagctttccg ccgcgctggt cttcgaccac 38040cccacggcgg acgcgctcgc cgaatacctg gagagcaagg tcctgcggtc acaggtcggg 38100gcgcccctgc cggtgctcac ccagctcgac cacctggagg ccgcgctcgc ggcgcccccg 38160gccgacaccg ccacccgcga gcagatcgcg gcccggttgc gcgccctggc ctccacctgg 38220agcgcccagc ccgacgacgg ccatggagcc gatgacggcg acatcagcag caagctcgat 38280tccgccacgg atgaagagct gttcgacttc atcagcgggg aattcggaga ggactgagtc 38340cgatggccaa cgagcagcag ctgcgcgact acctcaagcg ggccggcgcc gaactgcacc 38400gtacgcgtcg gcgcctggcc gacgtggagg cacggagcac cgagccggtc gcgatcatcg 38460gaatggcctg ccgctatccc ggcggggcga gcacccccga ggacctttgg cggcttgtga 38520tcgaggagac cgacgcgatc ggcccctacc ccaccgaccg gggctgggac ctggacggct 38580tctaccaccc cgaccccggc aaccccggca cctgctacgc ggacggtggc gggttcatcg 38640acgacatcgc 
ctcgttcgac gcggccttct ttggcatctc cccgcgtgag gcgcaggcca 38700tggaccctca gcagcgcctg ctcctcgaga cctgctggga agcgctggaa caggccggtc 38760tcgacatcca cgcgttacgc ggcagccgta ccggagtgtt cgccggcctc agccagcagg 38820actacggcac tctgctggcc gccgcaccgg gcgggctgga cggctacgcc gccaccggca 38880cctccaacag cgtcctgtcc ggccgcatct cgtacgtcct gggcctggag ggccccgccg 38940tcaccgtcga caccgcctgc tcctcctcac tggtggccct gcacctcgcc gtgcaggcgc 39000tgcgcaacgg cgagtgcgac ctcgcgctgg cgggcggggc gacgacgttg tccacctccg 39060ccgtccacct ggccctgtcc ggtcagcgcg cactggcacc cgacggccgc tccaaggcgt 39120tctcggcggc ggccgacggt gccggatgga gcgagggagt cggtgtcctc gccgtcgagc 39180ggctgtccga cgcccgccgg ctcgggccac cgggtcctcg cggttctgcg gggcagcgcc 39240gtcaaccagg acggcgcgtc caacgggctg accgcgccca acggcccctc gcagcagcga 39300gtcatccgcc aggcgctggc caacgcgggc ctcacacccg ccgacgtgga catcgtcgag 39360gcgcacggca ccgggaccag tctcggcgac ccgatcgagg cggatgcgct gctgtccacc 39420tacggccagg ccaggccggc cgaccggccg ctgtggctgg gctcgctgaa gtccaacatc 39480gggcacagcg gagccgcggc cggggtggcc ggcgtgatca agatggtgca ggctctgcgg 39540cacggcgtca tgcccaggac gctgcatgcc gaggaaccca ccccgaacgt cgactggtcc 39600tccggcgccg tggaactgct caaccgggcg cgcgactggc ccgcctccgg cacgcgccgc 39660cgggccgccg tctcctcctt cggtatcagc ggcaccaacg cgcatgtcat cctcgaagag 39720gctccgcagg acagtggtcc ggagaccggc gacgaggcgg acccatcacc cgagggaacc 39780ccctggccgc tgctgccgtg ggtgctgtcc gcgcgcagcg aacacgccct gcgcggccag 39840gcccgcgccc tgcacacgca cctgctggcc catcccgaac cggccgacac cgacgtggca 39900ctctcgctcg ccaccacccg gaccggtctt gagtaccggg ccgccgtcct ggccgccgac 39960cgggatggat tcctgaacgc gctggaggcc ctcgccgacg accgccccac caacggggta 40020ctgcgcggaa ccgccgccga gggcaaggcc gtgttcgtct tccccggcca gggcgcgcag 40080tggaccggca tggcccggga actcctcgac acctcgccgg tgttcgcggc caaggcggcc 40140gagtgcgccg cggccatcga ggagttcgtg gacttcaagg tcctggacgt gctgcgcgac 40200gagcccggcg ccgcgtccat ggaccgcatc gaggtcgtcc agcccgtgct gttcaccgtc 40260atggtgtcgc tggccgagct gtggcggtcc ttcggcatcc agcccgacgc cgtggtcggc 40320agctcccagg gcgagatcgc 
cgccgcccac gtcgcgggcg ggctgaccct cgaggacgca 40380gcccgtgtga tctgcctgcg cagccgcttg ctggccgaga ccctggtcgg caagggcgcc 40440gtcgcgtcgg tggcgctccc cgccgaccag gtgcgcgagc ggctgcggcg ctgggacggc 40500cggctgtccg tcgccggcgt gaacgggccc cgcctcgtcg cggtggccgg ggacgacgcc 40560gcgctcgccg agttcgtcga ggagtgcgcc cgcgacgaca tccgcgccag gaccgtggcc 40620gccaccgtgc cgacgcactg cgccctcgtc gacccgctgc gcgagcggct gctggaactg 40680ctggccccgg tccggccgcg caccggcacc gtgccgctgt actcgacggt gaccgggggc 40740ctgctggaca ccgccaccat ggacgccggc tactggtacg acaacacacg ggcgccggtg 40800ctcttcgaac ccgtcgtccg caccctgctc gccgagggac accacgcttt cgtcgagtcc 40860agcgcgcatc cggtgctggc catggccgtc gagcagacgg tcgatgccac aggggcgccg 40920ggcgtcgtcg tggagtccct gcggcgggac gagggcggcc ccggccggat gctgacctcc 40980ctgaccaagg cccacctggg cggcgtccgc gtcgactggc ccacggtgtt cgccggcacc 41040ggcgcccgca ccgtggatct gcccacctat gccttccagc gcacccggta ctgggccgag 41100accgccgatc gcaccggcga cgtgggctcg gtcggcctgt cgccggtaga ccacccgctg 41160
ctcggcgccc tggtccggat ggccgacggc gacggggccg tgctgaccgg acggctctcc 41220ttgcacactc acggctggct ggccgatcac ggcgtggccg accaggtgat cttccccggc 41280accggcttcg tggagctggc ggtgcttgcc ggagaccagg tcggctgcgg ccggatcgag 41340gaactgaccc tgcacacccc gctcgtcgtg ccccgcaccg gcgcgctcgt cgtccaggtg 41400aacgtccagg cggccgatga caccggagca cgcgcgctcg gcgtgtactc ccgccctgac 41460gacgccggcg ccgacatggt ctggacccgg cacgcctccg gtgtcctcgt ccccgaggac 41520accgtggacg cggaggatac cgacgggctc agcggcgtct ggccccccga gggcgccgag 41580cccgtggcca tctccggtct ctacgacggc atggccgcgg ccggctacca gtacgggccc 41640gggttccgcg gtctgagccg ggcctggcac ctcgacggcg acgtctacgc cgaggtggcg 41700ctgcccgccg accagacgtc agcggcggaa cgctacggcc tgcaccccgc cctcttcgac 41760gccgccctgc acgccatgtt cacctgggac ggcgacgacg gcggcggggt cggcatgccg 41820ttctcctgga ccggggttcg gctgcacgcc accggctgcg cacggctgcg ggtacggctc 41880gcccggcgcg gcgagagcga cttcacggtg acactgacgg acgaggccgg tgaccccgtc 41940gtatcggtcg actccctggt cgtgcgcagg atgaccgggg ccgcgccgga caccgtacgc 42000accgacacgc tctaccggct cgactggaag accgtccggg ccggggagga gacatccgcg 42060ccccgctgcg tcctgctggg caccgacccg ctgggcgtcg ccgccgccct gccaggcacc 42120gcgcgcgtgg ccgacgtcga gcgactcgcc gagctggccg ccgcgggcgg ccccgtcacc 42180gcactgctgc ccgtcgccgg cgacggctcc gccgagcgga tcggcgatcc ggtgatcgac 42240accgtcgccg tgctgcagtc ctggatcgcc gacggccggc tcgacgacac ccggctcgtc 42300gtgctcaccc ggggcgcggt ggcgaccgcc ccaagggagg acgtcaccga cctcgccgcc 42360gccggcgtct ggggcctgat gcgctcggcg cagaacgagc atcccgggcg cttcggcctc 42420atcgacctcg acaccgccga atcctccacc gcggcgctcg gcacggcgct cgcctcggag 42480gaggagcaac tcgcgctgcg cgacggagta ctgcgcgggc cgagcctgac ccggtgggac 42540ccgggcacga ccatcctgcc ccccgccggc gagagcgcct ggcggctgga gaacacccgg 42600cccggcacga tcgagggcct ggacgcggcc ccctgcccgg agcttctcgc ccccctcgga 42660ccccggcagg tacggatcgc cgtccgcgcc gccggcatca acttcaagga cgtggtcgtc 42720gccctcgacc tggtgcccgg actgaccggc ctgggcggcg aggtcgccgg tgtgatcacc 42780gccgtcggcg ctgaggtcac ctaccaccgg gtcggcgacc aggtgttcgg cctggccacc 42840gaggtcttcg 
gcccggtgac cgtcgccgac gaacgcaccg tccaccggat acccgatggc 42900tggaccttcg aggaggccgc ctccgtcgcc gtcacctata tgacggccta ctacggactt 42960gtcgacctcg gcggcctgcg cgccgggcag agcgtgctca tccacgccgg agccggcggc 43020gtcggcagcg ctgccgtcca gctggcccgc cacctcggag ccgaggtcta cgcgacggcc 43080agccccggta agtggggcgc cctgcgcgcc cagggcctgg acggcgcgca catcgccaac 43140tcgcggaccc tcgacttcga gcagtggttc ctgcactcca ccgacggccg gggcatggac 43200gtcgtactcg actgtctggc aggcgagttc gtcgacgcgg gcctcagact gctgccccgc 43260ggcgggcact tcctggagat gggcaagacc gacaagcgcg atgccgaaca ggtcggcgcg 43320gcccacccgg gcgtcgtcta ccgggcgtac gacctgccgg aagccggtcc cgaccgcatc 43380cacgagatgc tggtcaccct caccgggctg ttcgaggacg gtgtcctgcg gccaccgcac 43440gtcaacgcct gggacatccg cgacgcccgg gccgccttcc gggctctgag ccaggccgcg 43500ctcgtcggca aggccgtgct cacccttccc ggcgtcccgt tctcccccca cggcacggtc 43560ctgatcaccg gtggcaccgg tatgctcggc gcactgctcg cccgccacct ggtcaccgcg 43620cacaacgtga ccagcctgct gctcaccagc cggcgcggcc ccgacgcccc gggtgccgcg 43680gagctcacgg cggaactgac cgaggccggt gcccgggtgg acgtcgtcgc gtgcgacgtc 43740gccgaccgcg atcagctggc cgctctgctc gccggcattc ccgccgagcg cccgctgacc 43800gccgtactgc acaccgcggc ggccctggac gacggtctcg tcgagtccct caccgccgag 43860cgcacccgcg ccgttctgcg ccccaaggtc gacggtgccg tccaactgca cgaactcacc 43920cgcgacctcg acctcggcgc gttcgtgctg ttctcgtccc tggccggcac catgggcgcc 43980cccggccagg gcaactacgc cgccgccaat gtgatgctcg acgccctggc cgcacaccgg 44040cgggcccagg ggctgcccgg gctatccctc gcctggggct tctgggatca gcgcagtgag 44100atgtccggca acctcgatga ccgcgacata cagcgcatga gccgcggcgg catcgtgccc 44160atgagcagcg aggagggcct cgccacattc gacctggcct gccgcaccga ccgcgctcaa 44220ctggtccccg cgaggctcga ccccgctgcg ctcgccggga ccaccggtcg ggttcctccc 44280gtcatgcgag ccctgatacc cgctcccgcc cggcgttccg gacgccgctc cgccgaggcc 44340ggggacgact cgctgcgcgc acggctggtc ccgctcaccg gcaccgaacg cacgcgcatc 44400ctgctccagt tggtccgctc gaatgccgcc accgtgcttg gccacactga ccccgacgcg 44460gtcggcgcgg ccacaccctt ccgtgaactc ggcttcgact cgctgaccgc cgttgagttc 44520cgcaaccggc tcaccggcgc 
cgtcggcttc cggctccctg tcaccgtggt cttcgaccac 44580cccacccccg gcgcactgac cgacttcctc gccgccgaac tcctcggtgg cctggacgaa 44640accgacgccc cggccggtcc gtcccgcgcc acgcccgcgg ccgtcgcccg caccgatgaa 44700gaacccctgg tgatcgtcgg catggcctgc cgctacccgg gcggtatctc caccccggag 44760gagctgtggg acttcgtcct cgcggagcgc gacgccatct ccggcttccc ggaggaccgc 44820ggctggcgcc gcgagcggtc cgccgacggc tccgcgccgc agcagggcgg gttcctcgac 44880cgcgtcgcgg agttcgacgc cgcgttcttc ggcatctccc cccgcgaggc actgaccatg 44940gacccgcagc agcggctgct gctggagacc tcctgggagg ccctcgaacg cgccggaatc 45000gcgccgggta ccctgcgcgg cagtcgtact ggcatcttcg tcggtgccgc cgcctccggg 45060tacaccagtc tgttccgccg cggctcggaa gccctcgccg gatacggcgt gaccggcgcc 45120tccaccagtg tggtgtccgg acgcgtggcc tacgtgctgg gcctggaggg gcccgccgtc 45180accgtggaca cggcctgctc gtcctcgttg gtcgccctgc acaccgccgc gctgtcactg 45240cgcgcgggcg actgcgacct tgccctcgcc ggcggtgtcg ccgtgatgac cagtccgttc 45300ctcttcgacg acttcgccag gcagggcggc ctctcgcccg acggccgctg caaggccttc 45360
gccggttccg ccgacggcac cggctgggcg gagggcaccg gcatggtcct cctcgaacgt 45420ctctccgacg cccgccggaa cgggcatccg gtcctcgcgg tgctgcgcgg cagcgccgtc 45480aaccaggatg gcgcctccaa cgggctgacc gcgcccaacg ggccgtcaca gcaacgggtg 45540atcaggcagg cactcgacag ggccgggctc acccctgcgg acatcgatgc cgtcgaggcg 45600cacggcaccg gaaccgtcct cggtgacccc atcgaggcac aggccgtcct cgccacctac 45660ggacgggacc gggatccgga ccgccccgtg ctgctcggtt ccctgaagtc caacatcggt 45720cacagccagg ccgctgccgg tatcggcgga gtgatcaaga cggtgcaggc cctgctccac 45780ggcatcctgc cccgtagcct gcacatcgac gagccgaccc cgcacgtcga ctggtccgcg 45840ggcgccgtcg atctgctcac cgagacccgc tcctggccgg ccacggacca tccccgccgg 45900gccggtgtgt cctccttcgg cgtcagcggc accaacgccc acgccatcct cgaacaggcc 45960accgagcccg agcccccgat cgtcgatcag gcgcccctgc ccgtcactcc gtggctgctg 46020tccgggcacg acgaacaagg cctgcgcgct caggccgaaa cactcgtgag ctggttgcgg 46080gaacagccgg agggctctgt caccgacatc ggccacgccc tggccacccg ccgggccgca 46140ctggaacacc gagcagccct gccggtcacc gatcgggacg aggcgctcgc ccggctcgcc 46200gagttcgcgg ccggccgcgt ccccgacggg ctgctgcgcg gcacggccca agagggctgc 46260ctcgccctgc tcttcgccgg acagggcacg cagcggcccg gcatggggcg tgacctgtac 46320gcggccttcc ccgcgttcgc ccacgccttc gacgaggcct gcgcacatct cgaccccctg 46380ctcggacggc ccttgcgtga caccgtgttc accgccgagg ccgccgaact cgaccggacc 46440gccatcaccc agcccgccct cttcgccctc gaagtggccc tgtaccggct gctggagtcc 46500tggggcgtgg agccggaata cgtcctcggc cactccgtcg gcgagatcgc cgccgcccat 46560gcggccgggg tcctcgacct gcccgacgcc gcccggctgg tggccgcccg ggggcgcctg 46620atgcaggccc tgccgcccgg cggagccatg ctggccgtgc aggtcgggga aacggaggcc 46680accgaggcgc tcggcgcggt gctcggcgag agggcggcca ccgtggacct ggccgccgtc 46740aacggccccc actcggtggt gttctccggc accgctcgat ccgtggacgc gctcgacgcg 46800cacttcaccg cgcggggtcg gcggacccgc cggctcaccg tgagccacgc cttccactcg 46860ccgctcatgg aaccgatgct cgacgagttc gccgaactgg tgtcccggct gaccttcgcc 46920gcgcccagga tccccgttgt ctccgatctc accggatccg ttctcggcgc gggcgatctc 46980gccgaccctc gccactgggt aaggcatgcc cggcacaccg tccgcttcgc agacggcatc 47040gacaccctcg 
tcggcgcagg cgtcaccgac ttcctggaac tgggcccgga cgccacgctc 47100gccacgatgg ccgaggactg cttcgccacc gcccccaccg gcgtgtgcac ttcgctgctg 47160cgccgtgacg gatcggaacc ggtcaccctg ctgatggccc tggcccgtgc ccatgtgcac 47220ggcgtcaccg tcgactggaa ggccgtcctc gccggcaccg gcgctcggtg ggtggacctg 47280ccgacctacg ccttccagcg ggagtcctac tggcccgcgg agtccacggc cggacgcagc 47340gacccatcct cggccggctt cgacgacacc gggcaccccc tgctcggcgc catggtcggc 47400gcagccggcg gcgacgtcct gttcaccggc gagctctcgc tggccgccca gccctggctg 47460gctgaccacc gcgtcctgga cgccgtcctg tttcccggca ccggcttcct cgaactcgcc 47520tcctgggcgg gcagccgcct ggacgccggc gacctggagg aactggtcgt ccaccgcccg 47580ctggtgctgc ccgaacacgg cggcgtcacc gtgcaggtgg tcgtcggcga ggccaccgat 47640gaggaccgca ggccggtcgc cgtctactcc cgcgccgccg acgacgccgg atggaccagg 47700catgcggagg gactgctcgc caccggaccc gcagcccagc cggccgaccc gtcggcccac 47760tggccgccgc agggcgccga gcgcgtcgac ctggacgagt tctacgccgg tctggccgac 47820gccgggaccg cctacgggcc ggtgttccag ggcctcaccg cggtctggcg gctggacggc 47880gagatctacg ccgacgtggc gctgcccgcg caggcggccg acgacgccag gggcttcgga 47940gttcaccccg cactgctgga cgccgctctg cacaccctcg cgttcctgcc cggcgccgac 48000cggagcagcg gcccgttcct gccgttcgcc tggcgggacg tcaccgtccc cggccccggc 48060gccacctctt gccggatccg cctcacgccc ggcaacggaa ccgacgaggt ggccgctacc 48120ctctgggacg gcgacggtcg gccgctcgca gccgtgggcg gactgagcct gcgcagcgtc 48180tcccgcactc aactgggcac gtccgcggtc gcgtcgtccc tgttccgtat ggactggaca 48240cccgcctcac agcccagggc cgtcggcgcc ccgacggtcc gctgggccgt ggtgggcccg 48300gacgcccccg gaacacccga catcgaccac tacgccgacc tcgtggccct gcgccggcac 48360ctcgccgacg gcggcccggt acccgaccag gtactcctgc cgtgtgcccc ctccgccggc 48420ggcgccgacg ccggcgcagc ccgcgacgcc gtgcacgcgg cgctgcacac tctgcgtacc 48480tgggcggagg acgagcactt cgccaagagc cggctggtgc tgtgcacccg cggcgccgtc 48540gtggcacaac cgggcgaagg cgtacgcgac ctggcgcacg ccgcggtctg gggcctcgcc 48600cgcagcgcac agctggaaca ccccgaccgg ttcgtcctgg tcgacctcga caccggcacc 48660acgctcgacg acctcacccg gtcgcagctc ctggcccgga ccgagtccac cgacgcggcc 48720cagttcgcga tccgcggcgc 
cctgaccctc gtaccggccg tcacccgtca ggccggacag 48780gtcccggcgc cggaagcacc gtggccggcc gacggcacca ccctgatcac cggagccggc 48840ggcatgatcg gcggactgct cgcccggcat ctcgtccgcg aacacggcgt acgccacctg 48900ctactcctcg gccgccgcgg cgaggacacc ccgggcatgg ccgagctgcg ccgggaactc 48960accgacgcgg gagccgacgt ccacgtcacc gcctgcgacg ccgccgaccg ggaggccctg 49020gctgccgtac tcggccgcat cccgtccact gcccccttga ccgccgtcgt gcacgccgcg 49080ggcgtcgtcg acgacggggt gctcggctcg gtcactgacg agcaggtcga ccgcgtccta 49140cgccccaaga tcgacgccgc ggtgaacctg caccacctca ccgcccccct cggtctgcgc 49200gccttcgtcg tctgctcctc cctcgccgga gccctcggcg gcggcggcca gtccgcctac 49260gccgctgcca acgcctacct ggacgccctg tgcctgcgac ggcgggccga tggcctgccc 49320gcgctctcgc tcgcctgggg tccttgggag agcagcgccg gcatgaccgc ccagctcgcc 49380gcggccgacc tgcgacgcat ctcccgcgcg ggcatgcagc cgctcacccc ggacgacggg 49440ttggccctct tcgacgcggc ccacgccacc ggggaagcgg tgctgctgcc cttccgcttc 49500gaacccggcg gcctgtccac cgccgaccgg gcgtccctgc cccccgccct gcgccccctg 49560
gtgccccgac cccgacgccg gcctggcgac cccgtccccg gcctgtccgg tctccgcgac 49620cgcctgcgcc ccctgtcgca ggacgaccgg accggcgccc tggagaatct cgtccgcgcc 49680gaggtggcct cggtgctcgc cctgccttcg gcggacgcgg taccggtcac caaggcgttc 49740aagaccctcg gcttcgactc cctgatggcc gtcgacctcc gcaatcgcct cagcgccctc 49800accggtgtca ggctccccgc gaccctggtc ttcgaccacc ccaccccacg ggccctggcc 49860acccgcctgc tcaccggcat ggaactggac accgccaccg ccaccgaccc ggccctgctc 49920gccctgcgcg aactcgaaac cgcggtccgc tcgatggcgc ccggtgccga cgaccgcgga 49980gcgatggcga cccggctgcg ggtgctgctg acagcgctcg aggagaccgc ggacgacacc 50040gacggtgcgg acacggacgg cgataccgac ctcgactcgg tgagcaccga ggaactcgtc 50100aacctgctcg gcgacgagtt cggcctcacc tgagaaccac ccctgcctgc accacccgac 50160ccgaacttag gggtgttcgc ggtcctgaac tggggccggg atccgcgtcc tggcccccta 50220gcctgcaaac aggcctgtcc ttgcgcattg acgaaacacc tgagtgggag ttgagcatga 50280gcagttccat gtcggagatc gtcgacgcgc tgcgggcctc actgctggag aacgagcggc 50340tgcgccagca gaaccagcgg ctcagcgcgg catcctcgga gcccctcgcc atcgtgggca 50400tcggctgccg ctatcccggc ggagtccgtg ataccgaggg cctgtggcag ctcatcgccg 50460agggccgtga cgccatgtcg gacttcccca ccgaccgtgg atgggaggac cgggatgtcc 50520ccgccgcccg caccggcgct ttcctccacg acgcgggcga cttcgacccc gcgttcttcc 50580gcatctcgcc gcgggaggcg atggcgatgg acccccagca gcggctgctg ctggaaacct 50640cctgggaagc cctcgaacgc gccggtatcg acccggtctc gctcaagggc agccgcaccg 50700gcgtgttcat cggcggcgcc ccccaggagt acggcgcgct cgtgatgaac tcagcccagg 50760gcgccggagg ctacgcactc accggcgccc ccggcagtgt cctgtccggc cggatctcct 50820acgtgctggg cctggagggc ccggcggtca cggtggatac cgcgtgctcg tcctccctcg 50880tcgccctgca cctcgcgatc aagtcgctgc gcaccggcga gtgcgacctc gcgctggccg 50940gcggcgttct tgtcctgatc acgccgacca tcttcaccga gttctccgcc accggcggat 51000cggccggcga cggccgctgc aaggcgttct cctcggacgc ggacggcacc ggctggggcg 51060aaggcgcggg cgtcctcgcg atccaacgcc tgtccgacgc gcgccgggac ggcaaccccg 51120tcctcgcggt gatccgcggg tcggccgtga accaggacgg tgcgtcgaac ggtctgagtg 51180ctccgaacgg tccgtcgcag cagcgggtga tccggcaggc gatcgccaat gccgggctga 51240ccctcgcgga 
cgtcgacatg gtcgaggcgc atggcaccgg caccacgctc ggcgacccca 51300tcgaagccga ggcgctgctc gccacctacg gccaggaacg gcacgacggc cggcctctct 51360ggctcggcac cctcaagtcg aacgtcggtc acacccaggc tgcggccggc atctccggcg 51420tcatcaaggc cgccctcgcg ctccagcacg gcatcatgcc caagacgctg cacgtggacg 51480agccgacgcc ggaggtcgac tggtcggcgg gtgcggtgga gctgctgacc gaggcacgtc 51540agtggccgga gaccgggcag ccgcgtcgcg tgggtgtgtc gtccttcggg atcagcggca 51600cgaacgccca cgtcatcctg gagcaggccc ccgaggccgc cccggcggaa caggcggacg 51660gggacgcccc ggcggagctg ccggtgacac cgtgggtggt caccggccgg aacgaggcgg 51720cgctgcgcga gcaggccgca cggctgctgg accacctcac gcagcagccc gacctgagcc 51780cgcgggacgt gggcttctcg ctggtaggga cacgctcggc gttcgagcag cgcgcggtcg 51840tgctgggcgg cgacatggcg gcgctgaccg agggggtccg cgccctggcg gcccaggagc 51900cgaacaccca tgtgatcgcc ggcacggccg aggtccgcag cggcatcgtc ttcgtgttcc 51960cgggtcaggg gtcgcagtgg gttggtatgg ggagggagtt gtgggatgcc tcgccggtgt 52020tcgcggagtc gatggtggcg tgtgagcgtg cgctggcgcc gttcgtggac tggtcgctga 52080aggatgtggt gttccggggc gcggaggatc cgctgtgggc ccgtgtggat gtggtgcagc 52140cggtgttgtg ggcggtgatg gtgtcgctgg ctgcggtgtg gcggtccttc ggggtggagc 52200ctgttgcggt ggtggggcat tcgcagggtg aggtggcggc tgcgtgtgtg gctggtgggt 52260tgtcgttgga ggatggtgcg cgggtggttg cggtgcggtc gcggctggtg cgggagaagt 52320tgtcgggtct gggtgggatg ggttcggtgg cgcttcctgt ggaggcggtg gaggtgcgtc 52380tgggccggtt cgggggccgg gtcggggtgg cggcggtgaa cgggccgacg tcggtggtgg 52440tctccggtga ggtcgaggcg ctggacgcgc tgctggcgga gtgtgaggag gcgggggtgc 52500gggcccgtcg tatcgcggtg gactacgcct cgcattcggc gcaggtggat gcgctcaccg 52560acgacctgct ggcggagctg gccgagttgc ggccgcagtc ctcgtcggtg gcgttctatt 52620cgacggtgac cggtgagcgg ttggacacgg ccgggctgga cgccaggtac tgggtgacga 52680acctgcgcga gcgggtcaac ttcgagcccg tcacgcgact gctggccgag aagggggccg 52740gtgtcttcgt cgagtccagc ccgcacccgg tgctgacggt cgcggtgacg gagaccggcg 52800aggccgcgga ccggtcggtg gtggccgtgg gttcgctgcg gcgcgaggag ggcggtcttc 52860ggcggttcct ggcatcgctg gccgaggcgt acgtcgctgg tgtcccggtg gactggtcgg 52920tgacgttcgc cgggagtggc 
gcccgtcggg tggacctgcc cacctacgcc ttccagcacc 52980aacgctactg gctggacgac gtggtgttgc cggggcaggg cggtggcggt tcgtccgatc 53040cggcggacgc ggcgttctgg ggcgccgtcg agcgcgcgga cgccgagagt gtggtctcgc 53100tggtggacgg ggcggacgcg caggtgtggg agtcggtgct tccggcgttg tcggcctggc 53160gcaaggggcg tcgtacgcag tcgacgctcg actcgtggcg gtaccggacg gtgtggcgtt 53220cggtgacggt gtcgtcggcg gcttcgctat gtggtgtgtg gctggtggtc agctctggtc 53280cgggtgctcc ggtggagcag gtcacgctgg cgctgacggc tgcgggggct gaggtgcggg 53340tgctggatgt gcctgtggag cgtggggctt tggcggagtg gtttgccgaa gcgggtgagg 53400tcgcgggtgt ggtgtcgctg ctggcgtggg acgaggatga ggcgttggcg tcgtcgctcg 53460cgttggtgca ggcgcatggg gatgccgggt tgtcggcgcc ggtgtgggtg ctgacgcggg 53520gtgcggcggc tgtgggctcg gatgatgccg tatgcgcgac gcagacgtcg ctgtgggcgt 53580ggggtcaggt cgtcggcttg gagctgcccg ctgtgtgggg cggtctggtg gacgttcctg 53640ccgagtggga tgggcgggtg tcgtccgcgc tggctgcggt gctggcggct ggtgagggcg 53700aggaccaggt cgcggtgcgg tcctcgggtg tgtacgcgcg tcgtctggtg tgggcgccgc 53760
tgggcgcggg tgcggctgcg gtgcgggagt tcaagccgca gggcaccgtg ctgatcaccg 53820gtggcaccgg tggtgtcggc ggtcatctgg cgcgctggct ggcgagggag ggcgccgagc 53880acctgctgct ggtcaaccgc actggtgaag gagctgctga acttctcgaa gagctgcgtg 53940gctcgggtgc ggaggtgacg gtggcggcgt gtgatgtgac cgatcgggcg gctttggcgg 54000aactgcttgc tggaatccct gccgaacgtc ctttgaccgc cgtgttccat gctgcggggg 54060tcgcgggcta cggtctggtc cgcgaactgg acgtggcgga tctggatgtc gagatggccg 54120ccaggaccct cggtgcccgt catcttgacg agctgaccgc cgaactcggc ctggatctgg 54180atgcgttcgt ggtgttctcc acgggggctt cggtgtgggg gagtgcgggg aacggggcga 54240atgcggctgc gggtggttat ctggatggtc tgatccgtgg tcgtcgggcg cgtgggctgg 54300tgggttcgtc ggtgtcgtgg ggtggctggg gggccacggc tatggcggtg ggggagacgg 54360cggagcggtt gtcgcgtcgt ggggtgcggt tgctggagcc ggagttggcg gttcgggcgt 54420tgcgtcaggt gctggagcag gatgaggtgt cggtgacggt ggccgacctg gactggtcgt 54480tgttcacgcc ggggtacgcg atggcgcggc gccggccgct gatcgaggac atccccgaag 54540ccgcccgggc actgcgtgac atcaccgaga ctgacgagac ccaggacgcg gcggccggag 54600gactgcggga gcggctggcc gggctggcgg agtcggagca gcaggcgttg ctgctggggc 54660tggtgcgggg tgaggccgcg caggtgctgg cgcacgggtc gacggcggag atcacgccga 54720gcaggccgtt caaggagctc gggttcgact cgctgaccgg gatggagctg cgcaaccgac 54780tgtccaaggc caccggactc cggctgcccg ccaccctcgt cttcgactac cccaacctcc 54840agcaactggc ttccctgctg cgtacggcgc tcatcgacgg tcttccgggg gccggcgccg 54900tcgcgacgac ggtccggctg gtggacgacg aaccgctcgc aatcatcggt atggcctgcc 54960gctacccggg tgacgtccgc gatcccgagg acctgtggcg actggtctcc gaaggccgcg 55020acgaactgtc ggacttcccc accgaccgcg gctgggaacg ttggggtacg cccgcggtcg 55080gtcaggccgg attcctgcac gaggccgggg acttcgacgc tgccttcttc gggatctcgc 55140cccgtgaggc cgcgagcatg gacccgcagc agcggcttct gctggaggtg tcgtgggagg 55200ccttcgagca ggccggcatc gacccctggt cgctgcgcaa cagccccacc ggggtcttcg 55260tcggcggcgg cccgcaggac tatcccacgg tgctgatggg ctcggccgag gccgccagcg 55320gctacggcat gaccggcgcg ctcggcagtg tgatgtccgg ccgggtctcc tacatgctgg 55380gcctggaggg gccggcggtc acggtggaca ccgcgtgctc gtcctccctg gtcgccctcc 55440acctggccgc 
gcagtccctg cacaacggcg agtgcggtct ggccgtggcc ggcggcgtga 55500ccatcatggc cacgccgggc gcgttcctcg ggttcgacac gttgggcggc ttggctgagg 55560acggccgctg caaggccttc gcggcgtccg cggacggcac cggctgggcc gaaggcgtcg 55620gcatggtcgt cctcgagcgc ctgtcggacg cgcgccgcaa cgggcacgag gtgctggcgg 55680tggtccgcgg gtcggccgtg aaccaggacg gtgcgtcgaa cggtctgagt gctccgaacg 55740gtccgtcgca gcagcgggtg atccgccagg cgctggcgaa cgccggcctg tccgccgcgg 55800acgtcgacat ggtcgaggcg catggcaccg gcaccacgct cggcgacccc atcgaggcgc 55860aggcgctgct ggccacctac ggccaggacc gcccggccga ccggccgctg tggctcggct 55920cggtgaagtc caacttcggt cacacgggtg ccgccgccgg tgtcgcgggc gtcatcaagt 55980ccgtactggc gctgcggcac ggactgatgc cgaagaccct gcatgtcgac gagccgacgc 56040ctgaggtcga ctggtcggcg ggtgcggtgg agctgctgac cgaggcacgt cagtggccgg 56100agacggagca gccgcgtcgc gtgggtgtgt cgtccttcgg gatcagcggg acgaacgcgc 56160acttgatcct ggaggaggct ccgcaggccg cggccgtgga ggacgagcgg gacgggtccg 56220tggccccggt gtcgtcgccg gtggtgccgt gggtcgtgtc gggccgctcg gagaccgcgt 56280tgcgggcgca ggcggcgcga ctggcggagc atctggcgca gcggccggaa gcgggcgcgc 56340tggacgtggg cttctcgctg gtggagtcgc ggtcggcgtt cgaacagcgt gcggtggtgc 56400tgggcgcgga ccgggaggag ttgctggccg gggtacgcgc ggtgggggag ggcgcccagg 56460cgtccggtgt ggtcaccggg cgggccgctc aatccggtgt ggtgttcgtg ttcccgggtc 56520aggggtcgca gtgggttggt atggggaggg agttgtggga tgcctcgccg gtgttcgcgg 56580agtcgatggt ggcgtgtgag cgtgcgctgg cgccgttcgt ggactggtcg ctgaaggatg 56640tggtgttccg gggcgcggag gatccgctgt gggcccgtgt ggatgtggtg cagccggtgt 56700tgtgggcggt gatggtgtcg ctggctgcgg tgtggcggtc cttcggggtg gagcctgttg 56760cggtggtggg gcattcgcag ggtgaggtgg cggctgcgtg tgtggctggt gggttgtcgt 56820tggaggatgg tgcgcgggtg gtcgcggtgc ggtcgcggct ggtgcgggag aagttgtcgg 56880gtctgggtgg gatgggttcg gtggcgcttc ctgtggaggc ggtggaggtg cgtctgggcc 56940ggttcggggg ccgggtcggg gtggcggcgg tgaacgggcc gacgtcggtg gtggtctccg 57000gtgaggtcga ggcgctggac gcgctgctgg cggagtgtga ggaggcgggg gtgcgggccc 57060gtcgtatcgc ggtggactac gcctcgcatt cggcgcaggt ggatgcgctc accgacgacc 57120tgctggcgga gctggccgag 
ttgcggccgc agtcctcgtc ggtggcgttc tattcgacgg 57180tgaccggtga gcggttggac acggccgggc tggacgccag gtactgggtg acgaacctgc 57240gcgagcgggt caacttcgag ccggtgacac gtctgctggc cgaacgggaa caccaattct 57300tcgtcgagtc cagcccgcac ccggtgctga cggtcgcggt gacggagacc ggcgaggccg 57360cggaccggtc ggtggtggcc gtgggttcgc tgcggcgcga ggagggcggc gtccagcgcc 57420tgttgacgtc gctggccgag gcgtacgtcg ctggggtgcc cgtcgactgg tcgaagacct 57480tccacggcac cggtgcccag tccgtggacc tgcccaccta cgccttccag caccagcact 57540actggctgga cgacgtggtg ttgccggggc agggcggtgg cggttcgtcc gatccggcgg 57600acgcggcgtt ctggggcgcc gtcgagcgcg cggacatcga cagcgtggcc tcgatcgtcg 57660acggggtcga ccagcaggcc tgggaaagcg tcgtcccggc gctgtcggcc tggcgcaagg 57720ggcgtcagga gcgagcgcta ctggattcct ggcggtaccg gacggtgtgg cgttcggtga 57780cggtgtcgtc ggcggcttcg ctatgtggtg tgtggctggt ggtcagctct ggtccgggtg 57840ctccggtgga gcaggtcacg ctggcgctga cggctgcggg ggctgaggtg cgggtgctgg 57900atgtgcctgt ggagcgtggg gctttggcgg agtggtttgc cgaagcgggt gaggtcgcgg 57960
gtgtggtgtc gctgctggcg tgggacgagg atgaggcgtt ggcgtcgtcg ctcgcgttgg 58020tgcaggcgca tggggatgcc gggttgtcgg cgccggtgtg ggtgctgacg cggggtgcgg 58080cggctgtggg ctcggatgat gccgtatgcg cgacgcagac gtcgctgtgg gcgtggggtc 58140aggtcgtcgg cttggagctg cccgctgtgt ggggcggtct ggtggacgtt cctgccgagt 58200gggatgggcg ggtgtcgtcc gcgctggctg cggtgctggc ggctggtgag ggcgaggacc 58260aggtcgcggt gcggtcctcg ggtgtgtacg cgcgtcgtct ggtgtgggcg ccgctgggcg 58320cgggtgcggc tgcggtgcgg gagttcaagc cgcagggcac cgtgctgatc accggtggca 58380ccggtggtgt cggcggtcat ctggcgcgct ggctggcgag ggagggcgcc gagcacctgc 58440tgctggtcaa ccgcactggt gaaggagctg ctgaacttct cgaagagctg cgtggctcgg 58500gtgcggaggt gacggtggcg gcgtgtgatg tgaccgatcg ggcggctttg gcggaactgc 58560ttgctggaat ccctgccgaa cgtcctttga ccgccgtgtt ccatgctgcg ggggtcgcgg 58620gctacggtct ggtccgcgaa ctggacgcgg cggatctgga tgccgagatg gccgccaaga 58680ccctcggtgc ccgtcatctt gacgagctga ccgccgaact cggcctggac ctggaggcgt 58740tcgttctctt ttcctccggc gccgctgtgt ggggaagtgc gggaagcggt ggttacgcgg 58800cggcgaacgg gtacttggat ggtctggcgc aggagcgtcg ggcgcgtggt ctggcggcga 58860cgtcggtgtc gtggggcaac tggaaggaca ccggtctggc gaccgatacg accgcggagc 58920agttggcacg tctcggtgtc cggccgatgg atccggcgct ggcggtagcg gccctccggc 58980aggtgctgga gcacgacgag atcgcgctga ccgtgaccga catggactgg gcgcgcttcg 59040cccccggcta cacgctggcc cgccgccgcc cgctgatcga ggacatcccc gaagccaccc 59100gcgcgctcag cgaggactcc gccgacccgg cgaacgacat ggccggagcc gccctgcggg 59160ccgagctgga aggactgggc cgggccgagc agctcgccgt gctcatggac ctggtgcgta 59220gtgaggtcac ccgcatcctg gcgggtgcct ccgcggccga catcacgccg gagaggccgt 59280tcaaggagct cgggttcgac tcgctgaccg cgatggaact gcgcaacctg ctcaccatcg 59340ccaccggact gcgcctgccc gccacccttg tcttcgacta ccccaatccg cgacagcttg 59400ccgcccatct gtgcgacgaa ctgatcggcg ttggcgcgga tcccgtgggg gccgacgtcg 59460tcgtacgcgg ctcgtccgac gaaccgctgg ccgtcgtcgg catggcctgc cgttacgcgg 59520gcggcgtgtc gacccccgag gacctgtggc agatggtggc ggagaacagg gaagggctca 59580ccgacgtccc ctcctatcga gggtgggagg ggtggaacgt cgccagcctt cgtcgcgccg 59640gcttcctgca 
cgaggcgggt gacttcgacg ccggtttctt cgggatctcg ccgcgtgagg 59700ccgcgaccat ggacccgcag cagcggcttc tgctggaggt gtcgtgggag gccgtggagc 59760gggccggtat cgaccccaag tcgctgcggg gcagtgacac aggcgttttc gtgggcggta 59820cggccgtcga gtacggcgca ctgctgatga actcgccgac cggccagggc tacgcagtca 59880ccagctcctc cggcagcgtc ttgtcgggtc gtgtctccta caccctcggc ctggaagggc 59940ccgccgtcag cgtggacacg gcatgctcgt cctccctcgt cgccctgcac ctcgccgccc 60000aggcgttgcg caacggcgag tgcggcctcg cgctcaccgg tggtgtcggt ctgatggcca 60060cacctggcgg gttcgtggag ttcgacacgc ttggcggact gtcgtccgac ggccatacca 60120aagcctttgc agcgtccgcc gacggtatcg gctggggcga aggcgtcggc atgatcgtgc 60180tggaacgttt gtcggacgcg cgccgcaacg ggcacgaggt gctggcggtg gtccgcgggt 60240cggcggtcaa ccaggacggt gcgtcgaacg gtctgagtgc tccgaacggt ccgtcgcagc 60300agcgggtgat ccggcaggcg gtcgccaatg ccgggctgac cctcgcggac atcgacatgg 60360tcgaggcgca cggcaccggc accacgctcg gcgaccccat cgaggcgcag gccctgctga 60420acacctacgg tcaggaacga cacgacggcc aaccgctgtg gctcggctcc gtgaaaacca 60480acatcgggca cacgggcgct gccgcgggtg tggcgggcat catcaagtcc gtcctcgccc 60540tgcgcaacgg cgtcatgccg atgaccctga acgtggacgg gccgacaccg aaggtcgact 60600ggtcggcggg agcggtggag ttgctgaccc aggggcggga atggccccag acggaccgta 60660cgcggcgtgc gggtgtgtcc tcgttcggga tcagcggcac caacgcccat gtgatcatcg 60720aggaggcacc cccggccgag gaacccccgg cccagcccgg gaccgacctt ccggcggccc 60780ccgcactcgc gacaccggtc gttccgtggg tgttctccgg acggtcgaac ggagccctgc 60840gcggccaggc cgagcgcctg tcagcactgg cggagaacga acccggcctc gacctcaccg 60900acgcggcgtt ctccctggcg acgacgcgag ccagtctgga acaccgcgcc gtggtgctcg 60960gccgtgacac gtcggaaatg ctcgacggcc tgcgcgggct caccgcacag ggctcggtcg 61020ccggcgtggt ctccggtgtt accgctgccg acagccgtgc tgtctttgtg tttcctggtc 61080aggggtcgca gtgggtgggg atggggcggg agttgtggga ggtttcgtct gtttttgctg 61140agtcgatggt ggcgtgtgag cgggcgttgg tgccgtttgt ggattggtcg ttgcgggatg 61200tggtgttcgg gggtgggggt gatgggttgt gggagcgggt ggatgtggtg cagccggtgt 61260tgtgggcggt gatggtgtcg ttggcggcgg tgtggcggtc gtttggtgtg gagccggctg 61320cggtggtggg gcattcgcag 
ggtgaggttg cggcggcgtg tgtggcgggg gggttgtcgt 61380tggaggatgg tgcccgggtg gtggcggtgc gttcgcgtct ggtgcgggat gggttgtcgg 61440ggcggggtgg gatggtgtcg gtggggttgt cggtgggtga ggtggaggag tggttggccg 61500ggttgggggg tcgggtgggg gtggcggcgg tgaatgggcc gtcgtcggtg gtggtttcgg 61560gtgaggcgga ggtgttggag gggttgttgg cggggtttga gggtgcgggg gtgcgggcgc 61620gtcggatcgc ggtggattat gcgtcgcatt cggtgcaggt ggatgcgctc ggtgatgatc 61680tgctggcggg gctggcgggt attcggccgg tgtcgtcgtc ggtggcgttc tattcgacgg 61740tgtccgggga gcggatggac acggcggggc tggatgcggg gtactgggtg gcgaatttgc 61800gggagcgggt gttgttcgag ccggtggtgc ggatgctggt ggagcggggc agtgcggtgt 61860tcgtggagtc cagtccgcat ccggtgcttg ccatggcggt ccaggagacc ggtgaggctg 61920tgggccggtc ggtggtcgcg gtggggtcgt tgcggcggga tgacggcggt gctggacggt 61980ttttggcgtc gttggcggag gcgtatgtgg tgggtgcgcc ggtggactgg tcggtgttgt 62040tcgcgggcgc gggtgcgcgg cgggtggatc tgccgacgta tgccttccag caccagcgct 62100actggctgga gggtgtcacc gtcggaggcg agccccagga cacggtggag gatgacacgg 62160
atgccgcgtt ctgggacgcc gtggagcgcg agagcctgtc cgacctcgct gaggtactcg 62220acgtctccga tgccggcgct gcggccgagg cctggctgcc cacgctgtcg gcctggcgca 62280agggccgccg taggcagatg accctcgatt cgtggcgcta ccggactact tggcgcgcgt 62340acagcctgcc ctcaggaacc cgcctgtcgg ggatgtgggt ggtggtggct tctggtgggg 62400atgcgccggt ggtggaggtg cggcgggcgt tggaggcggc tggtgcggag gtgtccgttc 62460gggaggttct cgacggtgtg gcactcgcgg atgtgtcggg tgtggtgtcg ttgctggcgt 62520gggatgaggg gtccgcgttg gagtcgatgt tgcggttggt gcgggcggtt ggtggtggtg 62580aggtgccgtt gtgggtgctg acgcggggtg ccgcggtggt gggtgtggat gatccggtgt 62640cggcggtgca gtcgcaggtg tgggcgttgg ggcaggtggt ggggttggaa cagccccagg 62700gttggggtgg tctggtggat gttcccgggg tgtgggatga gcgggtggcg tcgttgttgg 62760ctggtgtgct ggcggctggt gagggtgagg atcaggtcgc ggtgcgttcg tcgggtgtgt 62820atgggcgtcg tctggtgcgt gctccgcttg gtgggagtcc ggtgccggtg cgggagtggg 62880gtccgtcggg cacggtcctg gtcaccggtg gtactggtgg gatcggtggg catctggcgc 62940ggtggctggc gaaggagggt gccgagcacc tgttgttggt cagccgtggt gagcgggccc 63000agggtgcggc cgaactggtc gaggaggtgc gcgggctggg cgcggaggtg acggtcgccg 63060cgtgtgatgt gaccgaccgg gcggctctcg cggaactgct cgccgagcat cccgtcacct 63120cgatcttcca caccgccggg atcgccgcgc acggcttcct gaccgacctc gacccggctg 63180agctcgggga ccagatgggg gcccgtgtgg tcggggcgcg tcacctggat gagctgtccg 63240ttgagttggg cttggatctg gatgcgttcg tggtgttctc cacgggggct tcggtgtggg 63300ggagtgcggg gaacggggcg aatgcggctg cgggtggtta tctggatggt ctgatccgtg 63360gtcgtcgggc gcgtgggctg gtgggttcgt cggtgtcgtg gggtggctgg ggggccacgg 63420ctatggcggt gggggagacg gcggagcggt tgtcgcgtcg tggggtgcgg ttgctggagc 63480cggagttggc ggttcgggcg ttgcgtcagg tgctggagca ggatgaggtg tcggtgacgg 63540tggccgacct ggactggtcg ttgttcacgc cggggtacgc gatggcgcgg cgccggccgc 63600tgatcgagga catccccgaa gccgcccggg cactgcgtga catcaccgag actgacgaga 63660cccaggacgc ggcggccgga ggactgcggg agcggctggc cgggctggcg gagtcggagc 63720agcaggcgtt gctgctgggg ctggtgcggg gtgaggccgc gcaggtgctg gcgcacgggt 63780cgacggcgga gatcacgccg agcaggccgt tcaaggagct cgggttcgac tcgctgaccg 63840ggatggagct 
gcgcaaccga ctgtccaagg ccaccggact ccggctgccc gccaccctcg 63900tcttcgacta ccccaacccg caacgcgtca ccgatctctt gctcaccgat ctcgaccagc 63960aggatggccg accgggcatc gccgacgttc tcgacatcaa gcgggaactg tcccggatcg 64020gtgaggcact cgagggcgtc gcacccgatc aacaggcccg tgaggacatc gtcgcccacc 64080ttcgcgatct gatcacccag ctcagcgcta ccgagcagca cggtgccacc gatctcgaag 64140ccgccacgga cgacgagatc ttcgacttca tcgaccgcga cctaggcgtg tcctgaacag 64200gcacctgccg ggttttcaac tgcttcggag tggggtttca cgatgaccga ggacaaactt 64260cgtacctatc tgcgcagggt tacggccgaa ctgcagcaga cccgccagca gctcaaggac 64320agccaggacc gagggcggga gccgctcgcc atcgtgggaa tggcctgtcg acttcccggc 64380ggggccgact cgccggagca actgtggcag atggtgaggg acggcgccga cggggtgggc 64440ggattcccgg acgaccgcgg ctgggacctt acctcgctcc tcagcgacga tcccgaccgt 64500ccgggcacga cgtacaccca ggagggcgcg ttcctgaagg gggcgggtga cttcgacgcc 64560gggctcttcg gtatctcgcc gcgtgaggcc gcgaccatgg acccgcagca gcgactgctt 64620ctggagacct cgtgggaggc gttggaacgg gccgggatcg acccgcactc gctgcggggc 64680agccggaccg gggtattcgt cggcggtacg gccatcgagc acatcgtcaa gctgatgaac 64740tcgccgaccg atcaggggta cgccatcacc ggcggctcgg ggagcatcat gtccggccgg 64800atctcctacg tcctgggctt ggaagggccg gcggtcacca tcgacaccgc gtgctcctcg 64860tctctcgtcg cactgcactc ggccgtacag tcgctccggc agggtgactg ctctctggcg 64920ctggccggcg gcgttgcggt gatggccaca ccctctgcct tcgtgacctt cgcccggcag 64980cgcggactgg ccgcagacgg ccgctgcaaa gcgttctccg acgacgcgga cgggatcggc 65040tggggtgaag gcgtcgccgt cgtgctgctg gaacgtctgt cggacgcgcg gcgcaatggg 65100catgaggtgc tggcggtggt ccgtgggtcg gcggtcaacc aggatggtgc gtcgaacggt 65160ctgagtgctc cgaacggtcc gtcgcagcag cgggtgatcc ggcaggcggt cgccaatgcc 65220gggctgaccc tcgcggacgt cgacatggtc gaggcgcacg gcacggggac cacgctcggc 65280gaccccatcg aggcgcaggc cctgctgaac acctacggtc aggaacgaca cgacggccaa 65340ccgctgtggc tcggctcgct gaaatcgaac atcgcacaca cccaaggcgt ctcaggcgtc 65400gccggcgtca tcaagaccgt gctggccctg cgccacggca ttctgcccaa aaccctgcat 65460gtgggcgagc ggagcagcca ggttgactgg tccgtcggcg cggtggaact gctcactgag 65520gcacgggagt ggccggagac 
ggggcgtccg cggcgggcgg gtgtgtcgtc gttcgggatc 65580agcggcacca acgtacacgt gatcatcgaa caggccccgc aggaagagtc tgccgagcca 65640cggacggacg aggcgccctc gttggagtcc cccttcgcca cgaagcccgc cacactgccc 65700tggctgatct ccggcaacac cgaggccgca ctgcgtgaac aggccgcccg cctgcgggcc 65760cacctcaatg cccaccccgg cctcgcggca gccgacatcg gtcactccct gctgacgagc 65820cgcaccagat tcgcccaccg cgcggtgctg ttgaccgagc aggacggcga ccggcgcacc 65880gcactgaccg ccctcgccga cggactcgac gcccccggcc tgattcgagg caccggtgac 65940actggcgcgg gtgtggtgtt tgtgtttcct ggtcaggggt cgcagtgggt ggggatgggg 66000cgggagttgt gggaagtctc gtctgtgttt gctgagtcga tggtggcgtg tgagcgggcg 66060ttggcgccgt ttgtggggtg gtctttgcgg gatgtggtgt tcgagggtgg gggtgagggg 66120ctgtggggtc gggtggatgt ggtgcagccg gtgttgtggg cggtgatggt gtcgcttgct 66180gcggtgtggc ggtcgtttgg tgtggagccg gtgggggtgg tggggcattc gcagggtgag 66240gtggcggcgg cgtgtgtggc cgggggcttg tcgctggagg acggcgcccg ggtggtggcg 66300gttcggtcac gcctggtggg agagaggctg tccgggcggg gcgggatggt gtcggtgacg 66360
ttgccggtgg cccaggtgga ggagtggctg gcgggctctg ggggccgggt tggggtggcg 66420gcggtgaacg ggccgtcgtc ggtggtggtc tcgggtgagg tggaggcgct ggacggcctg 66480ctggtcgagc tcgatggcgc gggggtgcgg gcgcgccgga tcgcggtgga ctacgcctcg 66540cattcggcgc aggtggatgc gctcaacgat gatctcctgg cggggttggc ggacattcgg 66600ccggtgtcgt cgccggtggc gttctactcg acggtgaccg gcgagcggat ggacacggca 66660gggctggacg ctgcgtattg ggcggcgaat ctgcgggagc gggtgttgtt cgagccggtg 66720gtccggacgc ttgccgagct ggagcaccag gtgtttgtgg agtccagtcc gcatccggtg 66780cttgcgatgg cggtccagga gacgttggag agcgcgtccg gggccggtgc tgcagtgggg 66840tcgctgcggc gggacgatgg cggtgctgga cggttcttgg cgtcgttggc ggaggcgtat 66900gttgcggggg cgccggtgga ctggtcggtg ttgttcgagg gtacgggtac gcggcgggtg 66960gatctgccga cgtatgcctt ccagcaccag cgttactggc tcgaagacgc ttccgcaccg 67020ggtgcggagg gtgtggtgga tccggtggat gcggcgttct ggggtgcggt agagcgagcg 67080gatgtgcagg gtgttgcggc acttgtggat ggttcggtgc cgggtgtgtg ggagccggtg 67140gtgccggtgc tgtcggcctg gcgcaagggg cgtgaagaac ggtcggtcct ggattcgtgg 67200cgttaccgga ctacttggcg tgcgttcagc ctgccctcag gaacccggct gtcggggatg 67260tggctggtgg tggcttccgg tggggatgcg ccggtggatg aggtgcggca ggcgcttgag 67320gcggctggtg cggaggtgtg tgttcgggcg gatctcgacg gtgcggcact ggcgggtgtg 67380tcgggtgtgg tgtcgttgtt ggcgtgggat gaggggtcgg cggtggtgtc gacggtgggg 67440ttggtgcagg cgtgtggcgg tggtggtgag gtgccgttgt gggtgttgac gcggggtgct 67500gcggtggtgg gtgtggatga tccggtgtcg gcggtgcagt cgcaggtgtg ggcgttgggg 67560caggtggtgg ggttggagca gcccggtggt tggggtggtc tggtggatgt tcccggggtg 67620tgggatgagc gggtggcgtc cttgttggcc ggtgtgctgg cggctggtgg gggtgaggat 67680caggtggcgg tgcgttcgtc gggtgcgtac gggcgtcgtc tggtgcgtgc tccactgggt 67740gcgagcccgg tgcgggtgcg ggagtggagt ccgtcgggca cagcgctggt caccggtggt 67800acgggtggga tcggtgggca tctggcgcgt tggttggcga gggagggtgt cgggcatctg 67860ctgctggtca gccgccgtgg tccggaggcc gagggcgtgg ccgagctggt cgaggagctg 67920ggcggcctgg gtgtggaggt gacggttgtc gcgtgtgatg tgaccgatcg ggcggctctc 67980gcggaactgc tcgccacaat ccccgccgag tatcccctca cgagcgtgtt ccatgctgcg 68040gggatcgcgg 
gttacggtct ggttcgcgaa ctggatgccg cggggctgga tgccgagatg 68100gccgccaaga ctctcggtgc ccgtcatctc gacgagctga ccgccgaact tggcctggat 68160ctggatgcgt tcgtggtgtt ctcctccggt gccgctgtgt gggggagtgc cggtagcggt 68220ggttacgcgg cggcgaacgc gtatctggat ggtctggcgc gggagcgccg ggcgcgtggt 68280ctggtggcga catcggtgtc gtggggcaac tggaagaaca ccggtctggc gaccgacacc 68340accgcggagc agctgacgcg catcggtgtc cggccgatgg agccggagtt ggcggttcgg 68400gcgttgcggc aggcgctgga gcaggacgag gtgtcaatga cggtggccga catggactgg 68460tcgttgttca cgccggggta cgcgctggcc cgccgccgtc cgctgatcga ggagatcccc 68520gaagccgccc gcgcgctcag cgaggactcc gccgacccgg cgaacgacac ggtcggtggc 68580gactccccct tgcggcagtc cctcgccgca ctgaccgagt ccgagcagca cgaacggctc 68640ctcggtgcgg tccgtacgga agcggcggct gttctcaccc actcgacgac cgacgagatc 68700acggccggca agccgttccg tgacttggga ttcgactccc tgaccgcgat ggaactgcgc 68760aaccggctca acgccgccac cggactccgc ctgcccgcca cgatcgtctt cgactacccc 68820acgccccgcc ggctcgcagg acacctgcac gacaagctct tcgacagtgg tgccgaggtc 68880gcgcttccgc agctgcgggc aacggacgac gacccgatcg tgatcgtggg catggcctgc 68940cgcttccccg gcggggtgcg cggtcccgag gacctgtgga ggctgctcgc cgaggggcgc 69000gacgagatga cggagttccc cgcggaccgg ggctggcaag gaccggccat gaacgccttc 69060gtggaggagt tcggcggcgc ccgacaaggt gccttcctcg cggacgcggc ggagttcgac 69120gctgcgttct tcgggatctc gccgcgtgag gcgcgggcga tggatccgca gcagcggctg 69180ctgcttgaga cctcctggga ggtgcttgaa cgcgccggct acgacccggt ctccctgcgc 69240ggcagccgca ccggcgtgtt tgtcggcggt acgccgcagg aatacacgac ggtcctcatg 69300aactcggccg aggccggtag cggctacgcg ctcaccggta cctccggcag cgtgatgtcg 69360ggccgggtcg cctacaccct gggcctggag ggaccggccg tgacgattga cacggcgtgt 69420tcgtcctcgc ttgtcacgct gcatctggcg gcgcaggcgc tgcgaggcgg agagtgtgac 69480ctcgccctgg tcggtggcgt gacggtcatg gccacacccg gggcctttgt ggagttcgcc 69540cgacagggcg gtctggcggg agacgggcgg tgcaaggcgt tcgccgcggg tgccgacggc 69600accggctggg gcgagggcgt cgggatgctg gccgtccagc ggctctcgga cgcggtgcgg 69660gacggacgtc gggtgctggc ggtggtgcgg ggctcggcgg tgaactccga cggtgcgtcg 69720aacgggctga cagcgccgaa 
cggtccgtcg cagcagcggg tgatccggca ggcgttggcc 69780tcggcggggc tttcggcggc ggatgtcgat gtggtggagg ggcacgggac gggtacggcg 69840ctgggtgatc cgatcgaggc gcaggcgctg ctggccacct acggtcagga ccgtccggcg 69900gaccggccgt tgtggctcgg ttcggtgaag tccaacatcg gacacaccca gtacgccgcc 69960ggagtcgccg gtgtgatcaa ggccgtactc gcgctccagc accgtctgct gccgaagacg 70020ctgcatgtgg aggagccgac gccggaggtg gactggtcgt cgggtgcggt gggagtgctg 70080acagaggcgc gggagtggcc ggagacggga cgtccgcggc gtgcgggggt gtcggcgttc 70140gggatcagcg ggacgaacgc gcacgtgatt ctggagcagg ctccggaagc cgtagaggag 70200agcgcgtctg gtgagaccgg ttcggtgctg gtgccgtggg tgatctcggc gcggtcggag 70260caggcgttgc gagagcaggc gcggcggctg gccggacacc tgcgcgcaca tgacctgcgc 70320cccgtcgatg tggggttctc gctggccacg acacgggcgg ggctggagca ccgggcggtg 70380ctggtgggac gggagacgtc ggagttcctg gcccagctgg agacggtggc cggggacggg 70440ccggtgtcgg agggcgggac ggcgtttctg ttctccgggc agggctcgca gcgggcgggg 70500atgggcaggg aattgtatga ggcatatccg gtgttcgcgg ccgctttcga tgaggtgtgc 70560
gggcatctgg acgtgctcct ggagcgtccg gtgaaggaag tggtcttcgc cggtggcaag 70620gcgctggacc ggacggtgtt cacccaggcg ggtctgtttg cgcttgaggt ggcgttgttc 70680gagctggtgg gttcgtgggg ggtgcgggcg gatgtgctgc tggggcactc catcggcgag 70740ctggccgcgg cgtacgcggc gggcgtgtgg tcgctcgagg acgcgtgccg ggtggtggcg 70800gcgcggggcc ggctgatgca ggccctgccg gagggcgggg tcatggtcgc ggtggaagcc 70860gcggaggagg agctgcccca gttgccggcg ggggtgtcgg tggcggcggt gaacgggccg 70920cgttcgctgg tgctctccgg cgacgacgaa ccggtgaccg cgctcgcgca gaccttcgcg 70980gggcagggcc ggcgcaccag acggctgacc gtgagccacg ccttccactc cgcgtggatg 71040gagccgatgc tggcggactt cgccgaggtg ctgggctccg tggagttccg tgcaccgcgc 71100atccctgtgg tgtccaacgt gaccgggcag gtcgcgggcg aggagctggc cacccctgat 71160tactgggtgc ggcatgtgcg ggaggcggtc cgattcgctg acggggtgac caccgtgctg 71220gggcggggtg tcgacaagtt cctggagctg ggcccgggtg gcgcactgac cgcgatggcc 71280gaggaggcgc tggaccacac cggtaccgac gccgtctgcg cccccgtcct gcaccccgag 71340catcccgaag cgtcgagcgc cgtccgtggc ctcggacgga tctacgccgt cggcgccccg 71400gccgactggt ccgcgctctt cgccggtacc ggcgcacgcc gtgtcgacct gcccacctac 71460gccttccaac gacggcgctt ctggctcgac tcgctcgcta ccggtagcgg cgatccggcg 71520agcctcggac tcacgaccac cggtcatccg ctgctcggcg ccggcgtgag gctgcccgat 71580tcggacggct tcctgttcac cggcagactt tctctggcca cgcagccgtg gatcgcccag 71640cacgcgctgc tgggcaccgc gctgctgcct ggtaccgcgt tcgtggagct ggcgctgcgc 71700gccggcgccg agtcgggctg cgaggtgatc gaggaactca ccctggaagc ccccctggtg 71760ttggaggagc atggcggtcg cgcggtccac gtgacggtcg gcgggctcga cgagtccggc 71820cggcgcacga tcacgctcca ctcacggccc gacggcgcgg acgacgacga gtcctggctt 71880cggcacgcca ccggcgtact ggtcgagcgg cgcgagacgg agtccgccga tgcgccgacg 71940gagggtgtgt ggccgcccga cggcgccaca cagatctccg tccaggactt ctacccggac 72000atggccgagg ccggattcac ctacgggccg gtcttccagg gcctgcgagt cctgtggagc 72060aaggacggcg agctgttcgc cgaggttcgg ctgccggacg aggcgggcga ggcgggcgat 72120gagggcagcg ggttcggtgt gcacccggca ctgctggacg cggccctgca gcccctcgcc 72180ctcagtgtcc tcggcgggac ggacggccgg caaccggtca agggcggcat gcccttcgtc 72240tggaccgggg 
tccggctgca cgccacccac gccacggtcg cccgggtcaa gctggccccg 72300gtgggacgca gcgaggtgtc cgtcgtggtg accgacgact cggggctgcc gatcgccacg 72360gtcgactcgc tggccatgcg cgacccgatt ctggaacagt tcactgcctc cgcgccccgg 72420caggatgcgc tgttcggcgt gcggtggacg cccatacccc tcgcggcgca cgctgagccc 72480ggtgagtggg cgatgctcgg cttcgacccg ctggagatcc gccagcgtct cgtcgaggcc 72540ggcctcaccg gtacgccgta tctcgatccg cagtccctga tcgacaccgt ggaatcgggc 72600aagcccgttc cgccagtcgt ggcggtgtcc tgcttcggcg gtgggggcag taccgtcaca 72660gccactcacg aggccgtcgg acgggctctg ggagtgcttc agcactggct cgcggacgcc 72720cgcctcatga gttcccggct ggttctactg acccgaggtg cggttccggc cgtcgacacc 72780gaccggatcg aggacctggc ggcctcggcc gtctggggtc tggtgcgggc ggcccagtcc 72840gagcatccgg accggatcgt gctcatcgac ctcgatgacg accccacgtc gtaccgggcg 72900ctgcccgcgg ccctcggcac cggtgaacca caactcgccc tgcgcacggg cgccgccagc 72960gcgcctcgcc tggcccggca caccggcgcg ccggaggtca ccccgggctt cggccctgac 73020ggcaccgtgc tggtcaccgg gggcaccggg gcgctcggcg cggtcgtcgc ccggcacctc 73080gcggccgcgc atggcgtccg gcacctggta ctggccagcc gcagcggagc cgaagcttca 73140ggcgcggacg cgctgctggc cgacctgacc gagctgggcg ccgacgccac gatcgtggcc 73200tgcgacgtct cggaccgcgc cgcgctggcc gctctgctgg acgccatccc agccgagcgg 73260ccgctgaccg gcgtcgtgca cacggcgggg gtactggcgg acgggacagt cgagtccctc 73320accccggacc aggccgacac ggtgctgcgg gccaaggccg acgcggcctg gcatctgcac 73380gaactgaccg cgctcacgcc ggtgcgggag ttcgtcctct tctcctccgc cgccggactg 73440ctgggcagtc aggggcaggg caactacgcg gccgccaacg ccttcctgga cgccctcgcc 73500gcccaccggc gagccgcggg actggccggt acctcgctgg cctggggctg gtgggacctg 73560cccggcggca tggccgcgga cctcggccgt gccgaacgcg cccggatggc ccgtggtggg 73620ctcaccccct tcacagccga gaccggaatg gacgccttcg accagaccct cgccgccggc 73680accgagcccc tgctcgtccc gatgcgtatg aacaccgcgg tggcgcgggc ttcggccggg 73740cagcagatac cgtcggtgct gcgcgggctg gtccgggccc cccggcgacg ggccgtccga 73800tcggacgagg ggagcgcctc gcggctgcgc gagcggctgg ccggagcgaa cgcggacgag 73860cggctggcca tgctcaccga gctggtccgt gtcgaggccg ctcaggtgct cgggcacagc 73920ggggccgagg ccgtcgagga 
cggcagcagc ttcgccgagc tgggcttcga ctcgctcacc 73980tcggtcgagt tgcgcaaccg catcggcgag cgcacaggac tgcggctggc gtccacggtc 74040gtcttcgacc accccacacc ggccgccctc gccgccgaac tcggtgaccg gctgggcgat 74100acggccgact tcgtgtcggc cgcgcagccg tccgaggccc ccggagccgg cggctccggc 74160gtcgagacga ccgcggacac ggcggtgatc aacggggtgg aggcgctcta ccggcgctcc 74220atcgagctgg gccggctcga cctggggcac agcgtgctga agaactcggt cgacctgcgg 74280gcgagtttct ccgttcccga cgaggtccgg aatggaccgg agctcgtcag gctcgtcgag 74340ggagcacagc acccgaagat catctgcttc ccgtcgcagt cggtgtgggc gagcaaccag 74400gaactggtcg gcatggccgt accgctgcgc ggagtccgtg acctgtggtc cctgatgctc 74460cccggcttcg tgaccggcca gcccgtcgcc gccgatgtgg acgcggcggc cgagtacgcc 74520gtacgactca tcgaagaact ggtccaggac gagcccttcg tcctggccgg gcgttcctcc 74580ggcggcagga tcgcccatga ggtcgccgtc aggctggagg gacgaggccg tgccccgaag 74640ggactggtgc tgatcgacag ctacatggcc ggctatgagg cgacttccta catcacgccg 74700gtgatggagt ccaaggccct ggagctggag aaggacttcg gtcagatgac cgggacccgg 74760
ctcaccgcga tggccgccta cttcgccatg ttcgaggcat ggcagcctga ggagacctcg 74820gttccgacgc tgctggtgcg ggcttcggag cgttacggca tcgagccggg gcaggagcag 74880cccccggccg aggaatggca gtccgcctgg ccgctgccgc acgacgcgat cgacgtgccg 74940ggtaaccact actccatgat cgaaggcagc ggggacgtca cggcggcggc cgtgcaccgg 75000tggctggtgg agcgtgacgc gtaggaccgc tcaccacgac gggccgtgct ccggcaacgg 75060gagcatggcc cgtcgcacgc gtgcggaggc ggcgccgccg acgccggacc cgccggacga 75120aagaagacga cgggcccagc aggtgtcggc ctgctgggcc cgtcgtccgt gcggtggggc 75180ggatgccgtg tcaccaggtg atgggcaggc tctccagtcc gcggaggatg gaagccggct 75240tcagccggag ttccgactcg ggcaccgcga gccgtacctc cggcatccgc tccatgatgg 75300cccggaacgc ctcctggagt tcgatacggg cgagctgcgc gccgaggcag tagtggatgc 75360ccgtactgaa ggccaggtgc gggttctgct cacgcgccag gtccagccgg tccccgtcct 75420cgaacacgtc ggggtcacga ttggcggtgg cgaccgcggg caggaccacg acaccggcgg 75480gcagcacctt gccgttgctc agctccacct ccgcggtggt gagccggggg gtgatgccgc 75540cggtcgcggt gagcgggacg aaccgcagca gctcgtcgat ggccttcgga agcgcctcgg 75600ggttcgcccg cagcttgtcg aactcctcgg gatggtgcag cagggtgagc aggaacatgc 75660tgatcaggtt ggcggtcgtt tcgtgacccg cgctgaggat gccgatactc agcgtgatga 75720tctcacgctc ggtgagcgta ctgtcttctt cttcgctgac cgcgatcagc tcgctgatca 75780tgtcgtcggc gggcttctgc cgcttgaccg cgatgaggtc accgaagtag ttgaccaacg 75840cgaccgtcgc cgcttccttc tccgcgacct gatgccagtc gccgagcaac gcgttcgacc 75900aggcgtgaaa cgtgtcctgg tcgccggccg gcacgccgag gagctcgcag accacgcgga 75960ccggcagcgg aacggcgaag ttcttcacca gatccaccgg acggggcagg gtctggagct 76020cgtcgaggag ttcgaccacc agctccacga tccggggccg cagctgctcc acccgccgtg 76080cggtgaaagc cttgctgacc agcttgcgca gccgggtgtg ctccggcggg tccatgccga 76140cgagagattc gttcatcagc ttgccggtct cggtctccga catggcggcg gccgcggtcg 76200cgatgacccg gctgctgaac cgggggtcca ggagtacctt gcggacatcg gcgtgcttgg 76260tcaccatcca gccggtgatg ccgtcgggaa acttcacctc gaccacggac tcgccgtccc 76320ggacctcggc cagctccggg ggcagttcac agaccgaggg cgggtccggg aacgggaacg 76380gtatgggttc ggaaggcgct tcggccatgg atggctctcc agattcgtga gggtttctcg 76440ggcgcggcgg 
aacgacgcgt gggggtgggc aggccgacct tcctcgcagg ctatgcacga 76500tcggccccta caccctcccc ctagctcgcc ccactcgcgt gacgtgcccg gtaccgagcc 76560ccgtcaccgc gtgctggtac tcctggccat cgcgagcagc gccgtcacgt ccatggactt 76620gatctcctcg ctgcggtcgg ccacctccgg ctccgccctc tcgtcgccgg tcagcgccag 76680cagcgcctcc agcagaccgt gttcgcgcag ccggtccagt gggacggaca tcagggcgcg 76740gcgtacggcc gcttcctgct cggcctcggg cgtcgtggcg ctctgcgctc cgtcgggggc 76800cagccgggcg accagcaggg ccaccagtgc gccgagcgtc ggatagtcga agaccaggct 76860gacggggagt cgcagaccgg tggccgcgtt gagccggttg cgcagttcca cggcggtcag 76920cgaggtgaag ccgaggtcgc ggaacggccg gtcggtctcg atctggtccg aggacgcgtg 76980ccccagcacg gcggcgatcc ggtcacggac cagcgccagc agcgtctcct gacgttcggc 77040gggccccatc tccgcgagcc ggtgccgcag ttggctctcc ggggcctggc tcccggtctg 77100cgccgaccgc tgggccggga cccggaccag gccgcgcagc actggcggca gggagccggc 77160ggcggcctgc gcgcgcaggg cggcgctgtt gaggcggacg gggaacacga cgggctcggc 77220gcggcgccag cccaggtcga acaaggcgag gccctggtcc gccggcattg cggacacccc 77280gcccatcgga gagcccgccg aggccccctc ggcgaggtgg ctgtccatac cccgctcggt 77340cgcccacagc ccccagccca gtgaggtggc cgcgagcccg ttgcggcgtc tgtactgggc 77400gagagagtcc aggaaggtgt tggcggccgc gtagttgccc tgtccggtgc cgccgagggt 77460gcccgcgatc gaggagaaca gcacgaaggc ggacaggtcc agctccgctg tcagctcatg 77520gagatgtacc gcggcgtcca ccttggcgcg gaacacccgg tcgatcagct ccggcgccag 77580cgaggccagc agcgcattct cggcagtgcc cgcgcagtgc accaccgcgg tcagcggatg 77640cccggcggaa accccgtcga gtacggccgc cagcgcagcg cggtccgaca cgtcgcaggc 77700caccagcgtc accgtggccc ccagctcctc caagtcgtgg acgagttcac cggcgccctc 77760ggcatccgga ccctgtcggc tgagcagcag cagatggtgt acgccgtgca gctttgcgag 77820acgggtggcg accagcgcgc cgatcccacc ggtcgccccg gtcaccagca ccgtgccgga 77880cgggtcgagc gcgtcgagct ccgtcccggg cacggtgtcc gggacgtcgg cggccggcgg 77940ccggttgagg cgaggcatca gcgcgacccc gtcgcgcagg gcgagttgag cctcccccga 78000cgcgaccgcg gcccgcaccg cccgcccgga ctcgacgctc ccgtcggtgt ccagcaggac 78060cagccggccg ggattctccg actggatcga acgcaccagc ccccacagcg gcgcggagcc 78120gagcccggcc acccggtcat 
cggctgagat gcccacggcg tccgtggtcg tcaccaccag 78180cggtatcgag gcgagccgct cgtcgagttc ctctgagacc caggtgcgcg ccgcctccag 78240caccagcccg gcgcgttccc gggtaagcgc cgggagctcc gcgaaccccg gaccggccgg 78300ttcaccgtcg tccggtgtga gtacgacgaa cgcgggcacc ggatcacccg cggcgaccga 78360cgcgctcagc gccagcaggt ccgggtggtg cacggcaccg ctgtcggggc ctgcccatgc 78420cggcgccgtg cccaccaccg cccacgccgc cgtcttgacc gtgggcagcg gcaccggtgt 78480ccacaccatg ctcagcagcg aggaatgcgt ggcggtccgc ccggaccctg gatcgccgtc 78540ggtgaggtcg gccggccgca gcagcagcga accgatcgag acgaccgggg tgcccgtgcc 78600gtccacggca cgcagcgaca cggcgctctt gcccgctggg gagagggcta cccgcagggc 78660ggtggcgccg gtggcctgca gggacaccgc gttccaggcg aacgggcgca gcggtgcgct 78720cggggcgtcc agttcaccga ggaagcccac cgcgtgcagg gccgcgtcca gcagcgcggg 78780gtgcagcagg aactcgcccg ccgcctcgtg cagctcctgc ggcagctcga tctcggcgaa 78840gacctcccgg ccacgggacc aaacggcccg cagcccccgg aaggcgggcc cgtaggcgag 78900cgggccgccc gccaggtcct cgtacaggct gccgatgtcg atccgctcgg cgccgcgcgg 78960
cggccagtcg gccggttccg ccttcggtgc cggaccggcc gggcacagga cacccgtggc 79020gtgccgcgtc cactcggcgg ccgtgaactc ttcgtccgtg cgggcgaaga agccgacttc 79080gcgccgcccg gcgtcgtccg gggcgcccac agtgagctgc aggtccacac cgccggtctc 79140gggcagcgtg agcggagtct gaagggtcag ctcctcgatg taggggcact cggcgagatc 79200gcccgccacg gcggccagtt ccacgaacgc ggttcccggc agcggaaccg tccccgccac 79260tctgtggtca gccagccagg gatggtcacg cagcgaaaga cggccggtca ggacgaggcc 79320gtcggactcg gcgggctcca cggcggcccc gagcagcgga tgcccggggg aacgcagacc 79380ggcggcagtc acatcaccgg atccggacac cggcatacgc agccagtagc gctggtgctg 79440gaaggcgtac gtgggcaggt ccaccggccg tgcaccggtg ccctcgaaca ccaccgacca 79500gttcacgggc gcgcccgcga cgtacgcctc ggccagtgac gtgagcagcc gcccggcacc 79560cgcgtcgtcg cgtcgtatgg tcccggacac atgggccgag acccccgcac cgtcgatgat 79620ctcctggagg gccatcgcga cgacagggtg aggactgacc tccagcaggg tccggtgtcc 79680ctcgtcgagc gtggcccgca ctgcggcctc gaaggcgacg gtctcccgca tgttccgcag 79740ccagtacccg gcatccagcc ccgccgtgtc catccgctcc ccggacaccg tcgaatagaa 79800cgccaccgac gacgacaccg gccgaatacc cgccagcccc gccagcagat catcaccgag 79860cgcatccacc tgcaccgaat gcgacgcata atccaccgcg atccgacgcg cccgcacccc 79920cgcaccctca aaccccgcca acaacccctc caacacctcc gcctcacccg aaaccaccac 79980cgacgacggc ccattcaccg ccgccacccc cacccgaccc cccaacccgg ccaaccactc 80040ctccacctca cccaccgaca accccaccga caccatccca ccccgccccg acagcctctc 80100tcccaccagg cgtgaccgaa ccgccaccac ccgggcacca tcctccaacg acaacccccc 80160cgccacacac gccgccgcaa cctcaccctg cgaatgcccc accaccgcag ccggctccac 80220accaaacgac cgccacaccg ccgccaacga caccatcacc gcccacaaca ccggctgcac 80280cacatccacc cgctcccaca acccatcacc cccacccccg aacaccacat cccgcaacga 80340ccaatccaca aacggcacca acgcccgctc acacgccacc atcgactcag caaaaacaga 80400cgaaacctcc cacaactccc gccccatccc cacccactgc gacccctgac caggaaacac 80460aaagaccaca cccataccgc cgtcaccagc agcgcgccca ctcaccaccc cggcccctgg 80520cacgccctcg gccaccgacc gcaaccccgc caacaactcc tcccggtcac cccccagcac 80580caccgcccgt tgctcaaacg ccgaccgcgt ccccaccagc gagaaaccca catccaccgg 80640acccagcccc 
ggccgctcca caagccactc caacaaccgg ccagcctgcg cccgcaacgc 80700cccctcacca cgagccgaca ccacccacgg caccgacccc atcagcacag gacgatccgc 80760ctcgtccacc tccaccggct ccgcctccgg agcttgctcc aggatcacat gcgcgttggt 80820accgctgacg ccgaaggagg acaccccggc tcggcgcgga cgccccgtct ccggccactc 80880ccgcgcctcg gtcagcagcc gtaccgcacc cgaggaccag tcgacgtgcg gtgtcggctc 80940gtccacatgc agcgtcttcg gcatgacgcc gtgccgcagc gccatgaccg tcttgatcac 81000accgctgacg ccggccgcgt actgggtgtg accgacgttc gacttgatgg aacccagcca 81060cagcggccgg tcggccgggc ggtcctggcc gtaggtggcc agcagggccc gtgcctcgat 81120ggggtcgccg agcgtcgtac ccgtgccgtg cgcctcgacc atgtcgacgt ccgcggtggc 81180caggccggcg ttgtccagga cccgcaggat gaggtggcgc tgggccggcc cgctgggcgc 81240ggtgaggccg ttgctggcac cgtcctggtt gatgccggtt tctcgcacca cggccagcac 81300cgggtgcccg gtccgctggg cgtcggagag ccgctgaaga accagtacgc cgacgccctc 81360gccccagccg gtgccgtcgg cgtccgagga gaacgccttg cagcggccgt cggatgccag 81420cccgccctgc cgggagaact ccgggaacgc gccgggcgtc gccagtactg tgaccccgcc 81480ggccaaggcc agcgagatct cgtccttgcg cagcgcctgc accgccagat gaagggagac 81540cagcgacgac gagcacgccg tgtcgacggt gaccgcggga ccttccaggc cgagcgtgta 81600cgccacccgg ccggagagga cgctgctgga tgtgccggtc atcagatagc cgtccagcgc 81660gtgcggcgac gcggcgagca ggccggcgaa atctccgctc gtgccgccga agaacacccc 81720ggtgtcactg ccccgcaagg accgtggatc gatcctggcg cgctccagcg cctcccagga 81780ggatgccaag agcaaccgct gctgcggatc ggtggtgagc gcctcgcgtg gtgcgatgcc 81840gaagaaggcc gcgtcgaact cgccggcgca gcgcaggaaa ccaccttcca gcggggtcga 81900ggtgccggcc cccgcggcat cggtttcgta cagcccgtcc aggttccagc cgcggtcgtc 81960ggggaagccc gccacggcgt ccgcttcctc cgacagcatc cgccacacgt cttccggcgt 82020gtccgcaccg cccggcagcc ggcacgccat gccgatgatt gcgatcggct cgcgggcccg 82080gtcctcgacc tcacgcagcc ggcgctgaag cccacgtgcc tcggtgacgg cccgcttgag 82140gtattcgcgg tacttctcgt cgtcggacat gacgttctca gctccttggt gatgtgttcc 82200cggcggaatc gcgccgatcg ggcaagacct gacccggccg gtcagagacc gaagccctgg 82260tcgatcgcct cgaagaggtc gtcggtggtc gcggtggcga ggtcgtccag ggcggccggt 82320ccggtatccg gctcacgcca 
ctccgacagc agcccgcgca gccgtcgcac gacccggtcg 82380cggtcgccgt tgtccgcgtc gatcgtcttg agggcagcct ccagggcgtc cacctccgcg 82440agcgcggcct cgacgccgga cagcccggcc gggacgagag cggtgagcag ttccgtggcg 82500agcgcggtcg gcgtcgggtg ttcgaagacg agcgtggccg gcagacgcag tccggtggcg 82560gcaccgagcc ggttgcggaa ctccacggcg gtcagcgagc tgaagccgag ctcactgaag 82620gaccgctccg ccgtgaccgc ggccaccgag gcgtgcccca gcaccgtcgc cgcgtgggag 82680cgcaccagtt ccagcaccct gcggtgctgc tcgacggcgg gcagccgcgc caggcgcgtg 82740cgcagcccgg aaccgttgct gccggaccgc gccgtacgcc gcgaggtccc ttgcaccaga 82800cctcggagga gatgcggcac ctcgtccgcg ccggaccgca gcgtcgacgg cgtgagccgt 82860gtgatcacct gcaacggcac gtccgccccg caggccaggt cgaagagggc gagcgcctcc 82920gcgtcctcca gcggtaccac gccggctcgt tccatgcggt gcagctcttt gtcgttgagg 82980tgtccggtca tggtgctgcg ggtgttccac aggccccagg cgagggaggt ggcggggagg 83040ccgtgggtgc ggcggtggtg ggcgagggcg tcgaggtagg tgttggcggc ggcgtagttg 83100gcttgtcctg gggcgccgag ttgtccggcg gcggaggagt agaggatgaa ggtggtgacg 83160
gggtggttga gggtgaggtg gtgcaggtgg gtggctgcgt cgatcttggg gcgcaggacg 83220ttgtggagtt gggtgttggt gagggtggtg atcgttgcgt cgtcgaggat gccggcggtg 83280tggatgacgg tggtcagggt gtggtgggag aggagtgcgg cgagttggtc ggggtcggcg 83340gtgtcgcagg cggtgatggt gacgtgggcg ccggccttgg tgagttcggc cgtgagttcg 83400gtggcgccgg gggcgtcggg tccgcgtcgg ctggtgagca gcaggtcggt gacgccgtgc 83460tggtggacga ggtggcgggc gaggagagcg ccgagggtgc cggtgccgcc ggtgatcagg 83520gcggtgccgt tcggatcgat cccccgcggt acggtgagga cggccgggcc ggccggcgtc 83580gcccggttcg catgccggga tgcctcccgg gcacgccgca ggtcgcggac ggtgatccgc 83640ggcggcgtca gcttcccgtc gtggcacagc gacatcaccg cgaccagcat ttcccggatc 83700cggtcgggat cggcctccgc caggtcgtag gtccggtagg ccacgcccgc gtgggcctcg 83760gccacctgcg ccgggtcgcg cctgtcggcc ctgcccgtct ccaggaaccg gccgccgcgc 83820ggcagcaggc gcagcccggc gtccacggac tcaccggcca gacagttcag gacgacgtcc 83880accccgcagc cgccgcttgt ctccatgaac cggtgttcga actccggcgt tcgggagtcc 83940gcgatgtgtg catcgtccag cccgtacttc ctcaacgtcg gccacttgcc ggtggcggcg 84000gtgccataca cctcggctcc cagttggtgg gccagctgca cggccgccag ctgggagccg 84060tccgccgcgt cgtgtaccag caccgacatc ccgggctgta cgtccgccag gtccaccagg 84120ccgtagtaag ggatgagata ggtgagggga acggccgccg cctgctcgta ggaccagccg 84180tcgggcatgg gggccaggca gcggtggtcg tacaccgcga agggcccgga cgagccggag 84240cgcagtgcca gcacccggtc gcccacggcc agatcggtca cctcggcacc cgtctcctgc 84300accgtgccgg ccgcgtagcc gctgatgccc ttgctcccct cggccgggtc gagcgaggcc 84360acgacgtcct ggaagtcgag cccgaccgcg tgcaccgcga tccggacctg accgggcccg 84420agcgggtcca gaacctccgg gcagggcacc agcgccatgt cctcgatcga gccgcggcct 84480gtgctctcca gccgccacgg caccggcccg gacggcggca gcaacgccgg atgcgaacgt 84540gcccgggcca gccgaggcgc ggcgactgtg ccgcgacgaa cggcgatccg gggctcggca 84600caggccagcg cgtccccgaa gacctctcgt gacgtgtcca gcccgtccag gtctaccagg 84660aagaagcggt ccggatgctc ggtctgcgcc gagcccacca gaccccagct ggtggagttc 84720gccaagtcgg tgacgtcctc ggcgtcgccc gcggcgaccg cgcccgaggt gacgaggacc 84780aggcgcgcgg aggcgaaccg gtcatcggcc agccaggcac ggatcacgcc gagcgtcgcc 84840tcggccgccg 
cgtgcgcgtc agcggcgggg tcatccgaga cacggggggc gagacagcgg 84900accacgatac ccgggaccgg accgttcgcg gccagttcct cgaaggtggc cgccacccgc 84960agtgccgcgc cggtcagccc gaaggggtcg tcgccgacga gggcatagga ctgagcggcc 85020gccacgggca gagccgtcca gtcgagatgg agcagtgcct gctccagcgt gccgcccgag 85080ggctggtcga acacggcggg tttggtcacc acggatccca ccgagacgac cggacgcccg 85140gcctcgtcgg tgaccagcag ggcgagctgg tcccccgagg gggtaagacg gacccgagcc 85200gtcgtcgcgc cggacgcgta cagccgcacc gcggcccagg aggacggcag ccgtacgccc 85260tcgtcccgcc cgccgacaaa cgccatggcg tgcagcgcgg catccagcag ggccgggtgg 85320acgccgtagc gcgtgccctc ctcccgctcc gactcgggca accggacctc ggcgaacagc 85380tcgtcatgct gccgccacag cgcgtgcagg cactggaagg cgggtccgta gtggtagccg 85440ctcgcggcga gctcctggta gacaccgtcc gtctcgactg cctctgcccc ggccggcggc 85500caggacggca ggggagccgg ctctgcgggc gcgtcggcac tcagggtgcc ggtcgcgtgc 85560agcgtccagt cgccgtcact ggtgcgggaa tgcagggtga ccgtgcagcg gcccgagtca 85620ctgtccgggg tggcccggac ctgcacggtg acctcgtcct cggggccggc gcccagcaac 85680agtggtgtgt gcggggtcag tacctccaaa tacgggcggt tcaggagatc gcccgcccgg 85740atcaccaggt cgacgaaggc gctgccgggc agcaggacct ggcccaagag cgcgtggtcg 85800agcagccatg gctgggcgcg cccggacaac cggccggtca gcagcacccc ctcgccgtcg 85860gccagcgtca cagccgctcc cagcagcgga tggtctgccg cctgaagccc catcgcggtc 85920aggtcaccgg cgccacgacg gtgttccagc cagaaccggc gtcgctggaa ggcgtaggtg 85980ggcaggtcga cgcggcgtgc gccggttccg gtgaagaacg cggaccagtc ggccggggcg 86040ccggcggcgt agatccggcc gagggcatgg acgacgctct gcgcttcggg gcgttcggga 86100tgcaggatgg gggtgcagac ggtgccggtg ccggtgtggt ccagcgtctc ctcggccatc 86160gcggtcaggg aacctcccgg acccagttcc aggaacttgt cgaccccgcg ggagtgggcg 86220gtggtgaccc cgtcggcgaa acggaccgct tcccgcacat gccgtaccca gtaggcaggg 86280gtggccagtt cctcgtccgc gatctgcccg gtcacgttgg acaccaccgc aatccgcggc 86340cgacggaact ccaccccagc cagtacctcg gcgaactgag ccagcatcgg ctccatccgc 86400gccgagtgga aagcatgacc gaccgccagc cgcttgatcc gccgcccccg cccggccagc 86460tcctgcgcaa cagcggtgac cggctcctcg tccccggaca gcaccagtga acgcggcccg 86520ttcaccgccg ccaccgacac 
ccccgccggc agctccggca actcctcctc cgccgcctgg 86580accgcgacca tcaccccacc cgacggcaac gcctgcatca gccggccccg cgccgccacg 86640acccggcacg cgtcctgaag cgaccacacc cccgccacat acgccgccgc cagctcaccg 86700atcgaatgcc ccagcagcac atcggcccgc accccccacg agcccaccag ctcgaacaga 86760gccacctcca acgcgaacag accagcctgg gtccattcgg tccggtccag cacctcacca 86820ccggcgaaga ccgcatcgcg caacgacccg ggcagcatcc cgtcgaaccc ggcgcacacc 86880tcgtcgaagg cagccgcgaa caccgggaac gcctcataca gctcccggcc catactcgcc 86940cgctgcgagc cctgcccgga gaacaggaac gccgtcccgc catccgtgac cgacccgccg 87000gccacccggc cctcggccag cgcctccaac tgtgccagga gctcggacgt ctcccgcccg 87060accagcaccg cccggtgccc cagccccgcc cgcgccgccg ccagcgaaaa ccccacatcc 87120accggacgca gcccatgccg cgcaacatgc ccggccaggt tccgtgcctg ctcccgcagc 87180gcctgcgccg accgcgccga gatcacccac ggcaccagca ccgaaccggt ctcaccaggc 87240gcgctctcct cgacagtctc cggcgcctgc tccaggatca cgtgcgcgtt ggtcccgctg 87300atcccgaacg ccgacacccc cgcacgacgc ggacgccccg tctccggcca ctcccgcgcc 87360
tgggccagca actccaccgc acccgaagac cagtccacct caggagtcgg ctcctccaca 87420tgcagcgtcc tgggcagcac cccgtgccgc atcgccagca ccgacttgat cacacccgcc 87480acacccgcgg ccgcctgcgt atgaccgatg ttcgacttta ccgaccccag ccacaacggc 87540cggtccaccg gacgatcctg cccatacgtg gccagcagcg cctgagcctc gatcggatca 87600cccagcgccg tacccgtccc atgcgcctcc accacatcga catccaccgc ggccagcccc 87660gcgctcgcca gcgcctgacc gatcacccgc tgctgcgacg gaccgttcgg cgccgtcaga 87720ccattcgacg caccgtcctg gttgaccgcc gatccacgca ccaccgccag caccgggtgg 87780ccgtggcgac gcgcatccga caaccgttcc accagcagca tcccgacgcc ctcgccccag 87840ccggtgccgt cggcaccggc ggcgaacgcc ttgcaccgcc cgtccggcgc caagccgccc 87900tgccgggaca gttccacgaa caccccgggc gtcgccatca cggtcacccc cccggccagg 87960gccaggtcgc actcaccgcc ccgcagcgcc tgagccgcaa gatgcagcgc caccagcgac 88020gacgagcacg ccgtgtccag cgtgaccgcg ggtccctcca gacctagcgt gaaggccacc 88080cggccggagg cgacactgcc cgcggacccg gtgccgatga acccttcgac ctcgtcgggc 88140agggtgccgg ggccggaacc gtagtcgtgg tacatcaatc cggcgaacac gccggtccgg 88200ctgccgcgca ccgcgcgcgg atcgatgccg gcccgctcca gcgcctccca cgacgtctcc 88260agcaacagcc gctgctgcgg atccatcgcc agcgcctcac gcggcgagat cccgaagaac 88320tccgcgtcga agtcggcggc ctcgtacagg aacccgcccc gccgcacata cgacgtcccc 88380ggccgggcca gttccgggtc gtacagctcc tccacatccc agccgcggtc ctcgggcatg 88440ccggagatcg cgtctcgctc ggcggccacc aactcccata gcgcatcggg agtagccacc 88500ccgccggggt agcggcaggc catgcccacg atcacgatcg gatcgtcgtc cgtcgccctc 88560agctccggca gggcgacctc ggcaccacgg tcgaaaagct tctcgtgcag gtgttcggcg 88620agccgctggg gcgtggggtg gtcgaacacc agggtggcgg gcaacccgag cccggtcgcg 88680gcccccaccc cgttacgcag ctcgatcgcg gccagcgagt cgaagcccag ctcccggaag 88740ctccggcgcg cttccagacc gtcggtgttc gcatgcccca acgcccgggc cgcgtgcgtc 88800cgtacgaggt cgagcagcag tcgcaggcgt tccgcctcgg gcagctctcg cagcccggcc 88860gcgaacgtca tacccgcata ggccccctcc gctgcgttgg ccgcccggcg caccgggacc 88920cgcacgaacc cacgcagaag cggcggcaaa gtgcccgccg cggcctggga gcgcagcgcg 88980ccgggctcca gcggcagtgg cacggcgacc ggcgcgtcca gcccgactgc cgcgtcgagc 89040agcgccatcc 
cctgctcttc ggagataggt accacgccgg ctcgttccat gcggtgcagc 89100tctttgtcgt tgaggtgtcc ggtcatggtg ctgcgggtgt tccacaggcc ccaggcgagg 89160gaggtggcgg ggaggccgtg ggtgcggcgg tggtgggcga gggcgtcgag gtaggtgttg 89220gcggcggcgt agttggcttg tcctggggcg ccgagttgtc cggcggcgga ggagtagagg 89280atgaaggtgg tgacggggtg gttgagggtg aggtggtgca ggtgggtggc tgcgtcgatc 89340ttggggcgca ggacgttgtg gagttgggtg ttggtgaggg tggtgatcgt tgcgtcgtcg 89400aggatgccgg cggtgtggat gacggtggtc agggtgtggt gggagaggag tgcggcgagt 89460tggtcggggt cggcggtgtc gcaggcggtg atggtgacgt gggcgccggc cttggtgagt 89520tcggccgtga gttcggtggc gccgggggcg tcgggtccgc gtcggctggt gagcagcagg 89580tcggtgacgc cgtgctggtg gacgaggtgg cgggcgagga gagcgccgag ggtgccggtg 89640ccgccggtga tcagggcggt gccgttcgga tcgatccccc gcggcggccg agggcttccc 89700gccgatgccc gcgtcagccg cggcgccgtg gccctgccct cccgcagtgc cagccgtggc 89760tcgtccaccc cgaccgtcga agccaacgcc gccagggact cggccgtacc gtcgaggtcc 89820acgagcgcga tccggccggg atgttcggtc accgccgagc tgagcaaacc ccacaccgcc 89880gcctgtaccg ggtccggtgc gtcgccatcc gccacggccg ccgcgttcca ggtcaccatc 89940accagccgcg cggtgccgag ccgttcgccg gcgagccagg accgcagcag agccagggcc 90000cgctcggcgg cctcgtgcgc cgcgtcggcg tcgacggagc cgtcagacgt gccgaccgtg 90060ccgacgatga agtccggcac ctcggccccc gcctccaccg cggcttcgag cccggccaga 90120tcggcatgga cgtcggcgtg gggcaggccg gccagcggcc ccggacccag cacagcccag 90180cgccccggcg tctcgcagcc gccgaccggg acccaggacg gcatgagcag cgcgtccacg 90240gccccgtcca gggcctcgac cgctccggcg ggccgcaccg ccagcgactc caccgtggcc 90300acggtgcgcc cggccgcgtc ggcgaagagc agtgcgaggg tgtcgggacc agtgggggcc 90360agccgtaccc gcaggaacga ggccccggcc gcgtgcagcg acaccccgcg ccacgcgaac 90420ggcagcctgg cccgctgctc cctgccgaac tcacccggca cgaatgccga cgcgtgcagc 90480gcggcgtccg caagcgccgg atgcaggcag aactccccgg cgtcaccggt gacgtcctcg 90540ggcagccgga cctcggcgta cacctcctcg ccgtgccgcc acaccgcgcg cagcccctgg 90600aaggccggcc cgtagcgcac accggccgcc gcgaagtccg cgtagcagcc atcggtgtcc 90660agtttctcgg cccccgccgg cggccacacc gtcagctcct ccgcggaagc cgtacggacc 90720gcggtgaggg tgcccccggc 
gtgccgcgtc cacacgtcgt cggcggaccc ctcgggctgc 90780gtgtacagcc cgaacccgcg gtcgccggag gcgtccgggg agtccacggc gagctgaacg 90840cgcagcccac cgccgcgcgg cagcgccagc ggaacctcca gaacgaggtc tgcgacatgg 90900ttgagcccga cgtgccgggc cgcccacagc gccagttcgg ccacggcggt gcccggcagc 90960aggacagcgt cgtcgattgc gtgatccgcc agccacggct gcgcctcaac ggacagcctg 91020ccggtgaaca gacggccgcc gccgaccgcc ggcaccagcg tggcccccag cagcggatgt 91080tccgcggcga ccaggccggc cgcgcccatg tctcccgccc cgggggcggt ctccagccag 91140taccgggtcc cctggaaggc gtacgtcggc agcggcaccc gcgacgtcgg gcgaccgccg 91200aagaaggtcc gccagtcgat cggcacaccg gccgtgtgag cctgggccag catggccgtc 91260agtgtgcgga cctcggggcg ctgacggcgc agtgccgggg cgaccaccgt gggggtaccg 91320gaggcggcga gggtctcctc ggtcatggac gtgagcgcgg cgtccgcacc cagctccagg 91380aacacggtga cctcctcggc gacgagggtg tgcaccacgt cgtggaaccg gacgggctcg 91440cgtgcctgcc gagcccagta cgccggatcc gcccagtcgg cggcgtccag gacggtgccc 91500gtcacactgg acacgaccgg gatgcgaggc ggctcgaacg acagccgtcc ggccacctcc 91560
ccgagcggct cgaccaccgg atccatcagc ggggagtgga aggcgtggct caccggtatc 91620cgccgggtcc tgcgacccag ctccgtgaag tgcgccgcga tcccggtgac gacgtcctcg 91680tcccccgaca cgaccacgga accgggcccg ttgacggccg cgacaccggc caccgactcg 91740tgcccggcca gcagcggcag tacctcgtcg gccgtggccg ccaccgacac catcgccccg 91800ccctcgggca gctcctgcat cagccgcccc cgggtggcga ccagttcaca ggcgtcggcc 91860agcgacagca tgcccgacac atgcgtggcg gtcagttcgc cgaccgagtg cccggacagc 91920agccgcggac gcacccccca ctcctccagc agccggaaca gtgccacccc gagcgcgaac 91980gtcgccgact gggtgtacag cgtccggtcc agcagccgtg cctcggcgct gttcgcgtcg 92040gccagcagca ggggcagcag cggcatgtcc agccgcgcgc ccagttcggc acagacctcg 92100tccagagccc gggcgtacgc tgggaaggtc tcgtacaact ccctgcccat gccgggccgc 92160tgggcaccct ggccggtgaa caggaaggcg cagcgcacct cgtccgcgac cccgtccacc 92220agcccggcag gacgctcccc gcgcgccagc tcggcgagcc cgcccagcag ttcctcgcgg 92280ctctccgtga gcaacacggc gcggtgctcg aaggcggtac gggtggtcgc cagcgcggcc 92340gccaggtcac cgagcggcag atccggacgg tcgagcagat gggcgtgcag ccgggcggcc 92400tgggcgggca aggccgcggg cgtggccgca gacaacggca cgggcacgac cggcagcacg 92460gcggaaacca tcgaatcgga cacgtcggcc cgctccgctc cgtgagccgg aagatctgcg 92520cccttcgccg tatcggtcgg tggatcggca gtgggcggct cctccaggat gacgtgcgcg 92580ttggtgccgc tgatcccgaa cgaggacaca ccggcccgcc gaggacgccc ggttcgcggc 92640catgcatggt tctccgtcag cagctccacc gtgcccgctg accagtccac ttcgggcgtc 92700ggccggtcga cgtgcagcgt ccggggcaac tgctggtggc gcatcgccat gaccatcttg 92760atgattccgg cagcgcccgc ggcggcctgg gtgtgaccga tattggactt gacggagccg 92820agccgcagag ggctgctgtc cgggcgctcc cggccgtagg tggcgatcag ggcctgcgcc 92880tcgatcgggt cgccgagcgt cgtgccggtg ccgtgcgctt ccaccacgtc gatgtccgcg 92940aagcccagcc gggccgaggc cagcgcctgc tcgatgactc gctcctgcga ggggccgttg 93000ggggaggtca gcccgttgga cgcgccgtcc tggttgaccg cgctgccgcg gacgatcgcc 93060aggacgtcgt gacccaggcg acgggcgtcc gagagccgct cgactacgag caacgcggag 93120ccctcgcccc agccgacccc gtcggcggcg gccgcgaacg ccttgcaccg gccgtcggtt 93180gccaggccgc gctgacggga gttctccacg aaggtggcag gggtggccat gacggtcgcg 93240ccgcctgcca 
gcgccaggtc gcattccccg tcgcgcagcg cctgcaccgc gaggtgcagc 93300gccaccagcg acgacgagca ggcggtgtcg agggtgacgg ccggaccctc cagctccagg 93360aagtaggaga tccggcccga ggcgacgctg ccggccatgc cggtgctcaa atagccctcg 93420acctcgtcgg gcagggtgcc gggaccggac gcgtagtcgt ggtacatcag cccggtgaag 93480atgccggtcc gtgagccggc cagcgaccgc gggtccactc cggtgttctc cagcgcctcc 93540caggcggtct ccagcagcag gcgctgctgc ggatccatcg ccagcgcctc acgcggcgag 93600atcccgaaga actccgcgtc aaactcggcg gccccgcgca ggaacccgcc ctgggtggca 93660taggtggtgc ccggccggtc ggggtcgacg tcgtagagtt caccgagcgc ccagccgcgg 93720tccgacggca tgtcggtgac ggcgtcctcg ccccgcagga caaggtccca cagctcttcg 93780ggcgaaccga caccgccggg cagtcggcag gccatgccga cgattgctat gggctcggtt 93840tcgccggcct cgagcttctt gatgcgtcga tgggcctgcc gcagatcggc tgccaacttc 93900ttgaggtaat caaccagctg cgcttcgttc gtcacctgag aacctgcctg agagattggc 93960aaaccgcgcc cttcggggcg aagctacgaa cctcaccccc ctaaccgcct cccctcagcc 94020accccggaag gtgtggatgg gcgcatatgg tcgggtaggg gttggcggta ggggcgcccc 94080ctgcctagcc tctgcatgaa ttcccgtgcc gtgccaagga ctggagtata acgagcaatg 94140ggcgttttcg agcaggaagc cgcagaatca acgggggaga aatttgtcag gcccgcggcg 94200ccggaaagga tgcgtgacct cgactttctg ctcggtgatt ttcgtgtgga atggacgaac 94260ttcaccgcag acccgcccgt gaagggcacg gctgcttgga acaccgtgtc gaccttcgcc 94320ggtcacgcgt acgagatgac ccagctggta ccgaaagacg acctcactgg ccgcttcgtc 94380atccagtggg tcgagtcgga gtcgtcattc tccggctatt attacgacga ctggggaaac 94440cgcaccctgc tgaccgcgaa gggatggcag gatgggtacc tttccttcac aggtgaatgc 94500atcgggtttg gccgctggtt cctgctcaaa gagcggtacc aggttatcga cgagaaccac 94560tacctgaaat gcggattcat cagattcgag gcagacggcg aatgggttcc tgcggacgag 94620gtccactgct accgcgtctg aacatgtcga accacccggg aaatcgacgc tcgggttcct 94680gactcccggg aaggtgaacc aaccatgact ctgctgtccg aagcggtacg cgcgggtgcg 94740tcgccacagg aactggagcg ggcggaaccg cccagggagt acaccgccgc gtacatccac 94800tccgaggaca cccggatgtt cgagggggtc gcggacaagg acgtgcgcaa gtcgctgcgg 94860gtcggccggg tgccgatgcc tgaactggcg ccggacgagg tgctggtcgc cgtcatggcc 94920agtgccgtca actacaacac 
cgtgtggtcg gcgatcttcg agccgctgcc caccttccgc 94980tttctgaggc agttcgccgc gcagggcggc tgggcctcgc ggcacgacct tccctaccac 95040gtgctgggct cggacggcgc cggcgtggtg gtgcgcacgg ggcccggggt gcggcactgg 95100aagaccggcg accacgtggt ggtcagctgc gtccaggccg acgaccagga agcggccacg 95160caggcagacg ggatgctcgg cgccgagcag cgcatatggg gcttcgagac caacttcgga 95220ggcctcgccc attacgcggt ggtccgggcc agtcagctga tccccaagcc cggccatctc 95280agttgggagg aggcagcctg caacccgctg tgcggaggta cggcgtaccg gatgctggtc 95340ggtgaccgtg gcgcccggct taagcagggt gagatcgtgc tgatctgggg tgcggccggc 95400ggcctcggcg cctacgcggt gcaactggtc aagaacggcg gcggcatccc agtcggtgtc 95460gtcagctccc ccgccaaggc ggaggcggct cggcggctcg gctgcgacgt ggtgatcgac 95520cgtcaggaga tcggtctcga cgaccgtacg gcgtacgacc cggccgcggt gatcgagaca 95580ggcaagcagc tggggcgcat catccggcgg gaggtggggg aagacccgca catcgtcttc 95640gagcacgtcg gccggtccac cttcccggtc tccgtttttg cggtacgccg cggcggcacg 95700gtggtgacct gcggctcgag cacgggttat cagcacacct acgacaaccg ctacctgtgg 95760
atgaagctga agcggattat cggcagccac gccgccaacc ttcaggagca gtgggaactg 95820aaccgactgg tgtcccgcgg ccaaatcgtg ccgacccttt ccgcggtcta ccctctggcg 95880gaggtggctg cggccacccg gtcggttcag accaaccgcc acataggaaa ggtcggtgtt 95940ttgtgtctgg ccgaggcacc cgggcagggc gtcaccgacc ccgccctgcg tgcccgggtg 96000ggcgaggagc gcctcagcct cctccgcgac ctttctccca ctgcctgagc cagggaagag 96060gtggtcgagg acctcgcggc gatgctgccg atgacgcgtt cgcagtgggc gttcatccga 96120ggcgccgggg gcgctcttga tcacctccag ttcctccgcc ttgaaggccg catcgaacgc 96180gtccccgtac ttggcgtcgc gatcgcgcag caggaaccgc agggactcca tgcgtacgcc 96240gaggccggcg gcgaggttcc gccggtggtg aggaactcac gccaggttgg actggagtga 96300cgcggtgccg ggtcgatgcc cgctgcgttc aggataacga tctacaacac actccattcg 96360cccagctcaa cgccactcgt gaacactgcc cgatcctgtt gcccattgtg gttatgcggt 96420gagggtgtgc tcgagtcggg tgaggtggct ggtgcgggtg cggtccaggg ggcggtcgtt 96480ccaccaggca ttgaggcgga tcaggttgag tgcgacggcg gagtagatgt gttcgagatg 96540ggtcttcgcc aggcctcggt agcgagcgcg gcgtgtgccg gtgacagcgg tggcctgccg 96600gatggtgccc tcgatgccgg aacgtaaggc gtagtcggtg ttccagtcct tggtcttctg 96660ctgggtgcgg gtgtgccgga gtgcctcggt catctgtctg aggtggaggg agagttggcg 96720gcggttcttc ttcgcggtgg tgcactgggg tttgaagggg caggggatgc agtcgagggc 96780ggcgaagctg accacggtct tggggatacc ttcgctcacg acggggttcc aggtggcgct 96840ggtatgtccg gcggggcagg tggccttccc ggcttcccgg tcgatggtga agtcggtggc 96900agcgaagcct gcctgggctt tggcctggcg ggaggtgtcc agcaggaccg gcgtgatcag 96960tgcgattccg taggtcttca ccgagccgtg gatgagttcg gcggtggcgt agccggagtc 97020gggatagtgc tcatcgggaa gcagcccgcg ttgctggagt gcgtggtgaa tggcattgag 97080tgtcttgctg tccggcactg tggagtgggt ggtggcgatg ttcgtgatca ggttcgggtg 97140tgtgcgtgcc ttctcgggag cggaggtgca ggtttcgctg atgtggagct tgtagccgtt 97200ccagaacatg tcgcgtttgg cggaccagcg agcatcggtg tcataaggcg aggacagacg 97260taggtggccg ggcgggcggc cgtcaccgcc ctcatcggtc ttctcccgcc gcttgacgac 97320ctcgcggccg cctcgggtga tggtgcgggt gtagttctgc acaagcacac accacaggac 97380ctgcaccgcg ggcagctcgc gcagccagac cggagagctg gagtggtaga ccgcgcccag 97440cagggcgaag 
ccgtcccgcg cgaagtccac ggccagtttc tgctgcctgg cccgggaagt 97500gggcaggcgc cagctgtcca cccgcggtcc gtaccgcctg ctccacgagg ccacgtccac 97560tgcctgtgcg acccagtccg gacccgcgca ggtcagcgct tccagtgcgg cgcgaaccgc 97620ttccccggcc agctccagcc ggttcaggtc ccgcaccgcc gcgaccacgt gggtggagtc 97680cgtgcgctgc ttgcctcctg cggccagcag gccttgttcg gtcagcctgg ccaccaacaa 97740gtccagtacc ttctcttcca ggccatgggc ggcaacccgg ctacggaact gggacaggac 97800actgaagtcg aagccgggat cctccaggcc cagtccgagc gcgtaggacc acgagagttt 97860gtcccgcacc gcctcggcag cctgccggtc agtcaggttc tccgccatct gcagcaccgt 97920gaccaatgcc aagcggcccg gtgaccagcc acgcggcccc gtcaccgcga acgcttccgc 97980gaactcggca tccgcgaaca actcaccgag ccgatcacgc accacgaccg gcaacggcac 98040ctgacgaccg gagtacttcg cccgcaccgc ccgagccacc tccggcgccg gctccggcca 98100cgaccgcggc tccatcggca cgagcccctc ccactccgtg accgggaacg agaaggaccg 98160gccaccagac cagcctgccg caacaatcaa caccccgacc agcacaaaca gcaaatgggc 98220aacaggatcg ggcagcgttg aaccatggca ggatcgacac agacgccgcg ggtaacctgc 98280tggtttccct gccgatgccg cgctttgacg ggcggatcgt gctcgccgtg gacgtctccc 98340cgtggctgcg ctcggacgcg gcctgctcgc ctgagcgact gctctgtcac gtccacggcc 98400gttctcggga ggccgcgcag atcatcccgg gttggccgta ctccttcatc gccgcgctga 98460caccggaccg tacttcgtgg acgcagatcc tggacgtggt ccggctcggg cccgccgacg 98520atgccgcggc cgtcaccgcc gaccagcttc gggctgtggt cgagaggctg atcgcggccg 98580gtcaatggca gccgcgaggc catgtcgatc ggggtgccgc ccgcgacgaa cctgaatgtg 98640ggacatccgg tcatggcgtc gaagatccgg ccgatgccgt cggacagctt gctcgcgctg 98700tcgctgtaca gctcctcggc gccgtccgtg aggtacagtc tcgccgcccc cgggccgtcc 98760cgttgaacct cactgtccac ccctctccgt agctcgcatt cgatgcgaga aaatcgcatc 98820gaatgcgagg cggcagcgaa accgcagtcg tccatccgga cgagtgacag cggctgacta 98880tcgggtctcg ccagccgcta ccgtcgcgcg acgcaacgtt ccggacatcc ctgttcagcg 98940aggtgttgtc cgccctcgcc ggtgccgaca cagcaactgt cgccagttcg catccgatgc 99000aattggcggc gcccgtgccg tttgcgctgg gccgaccacc ggtgtctcat caggtggcca 99060gtgctgcctg ggtgaagcgc gctattcctc ggctggggtt ggggcttggg ggagacgtag 99120ttggaagcat gctccgaggg 
tccggggggt gagggtgagg gtgcctttgt ggcggtgggc 99180caggtcgcgg gcgatggcga ggcccaggcc ggtgccgccg tggtcgcggg agcgggcgtc 99240gtcgagtcgg acgaagcgtt cgaagatgcg ctcggcgtct tcagtgggca cgcctggtcc 99300gtcgtcgtgc accgtgaggt cgacccaagc gtcctggttt cggatggtga tgtggatgcg 99360gtgtgcggcg tggcgggcgg cgttgtcgat gaggttgcgc agtagtcgtt cgtattcgtc 99420ggggtttccg tgtgcgtgtg cgggggcggt gctgtcgcag gtgagggtca gcggtcgttc 99480ggtgaggggg tattgctcgg tcagccggga ggcgagggct gtcaggtcga cggtttcggg 99540gccggctgtg ggggtgcggg tgtcgaggcg ggcgaggagc agcaggtctt cggcgagggc 99600gtggaggcgg cgggtctggc gtgcggcggt ggtgaccgcg gcgggccagt cggtgcgctc 99660cgggtaggcg agcgcgactt ccaggctggc cagcagtgtg gtgagggggc tgcgtagttc 99720gtgggcggcg tccgcgacga agcggcgttg ctgggcggcg gcgctgtcga ggcgttggag 99780ggtggtgttg atggtggtgg ccagggcggt gatctcgtgt cccgtggcgg ggacggtgac 99840gcgttcgcgg gggtcgctcg cggtgaccga ggcggtgagg acgcggatgg cttcgaccgg 99900ccgcagcgcg atgcggacgg cgaagtaggc gacggcggcg atcagtacga ggctgacgag 99960
cccggctcgc agcagcaggc ggtcggtggc ctcggtgatt gtttcggcga tgtcctcggc 100020tgcgtgcggc agcaccacca catagacccg cagttgggcg tcggcggcga cacccagagc 100080ggcgactctg tcgctgctca gttcatcggc cctgacatcg tggtacatga ccaggtaggt 100140cccaccgtcc ttgccgaacc ggtccccggg ctcggagtcg gggcgcgccg gcatgcgcgt 100200gggaatggtc gtatagcccc agcctgacga ctcgtcatct ggcggggcgg gcaactcggc 100260cttgggcggg gcgggcagca catggcgggt gccgggatcg aactcctcca tgcctccgcc 100320gtaggcgaca gcaccgcggc ggtcggtcgc gacgacctcg tacggcacag tgcttcggcg 100380aacaggaacc acaccctcct ccacctgatc gacgagagcc cggaattgtg cctgcgcctg 100440cccttcggcg atctgcgtgc tctcgcggta gacgtcgtgg tgtacccacc agccgatgcc 100500gaccaggatg acggcggcgg ccgaggcggc ggccagggcc gtgcgggctc gtaccgaacg 100560cggccaccag cggcgtcgcg gcccggtcgc gttcacggtc gtccaccagc ggcgtcgcgg 100620ctcagtcgcg ttcacggtcg tccaccagtc ggtagccggt accccgcacg gtctgcaggg 100680actgccggtg gaacgcggcg tccaccttct tgcgtagcgc gctgacgcgt gcctccacca 100740ggttgggatc ctcggcttcg tcgggccacg cgtgatagag cagatccgtt ttggagaccg 100800cctggcccgc ccggcgggcc agcagctcca gcacggcgaa ctcccggggg tgtgagttcc 100860acccggaccc cggcccggcg gcagacccgg ccggcgacat ccagcgagag gtcgcccacg 100920gcaaggacgg gcggggcgac cgtggcggct cgtctgacca gggcccgcag ccgtgcgacg 100980agcaccacgt aggagaaggg cttggccagg tagtcatcgg cccccgtgtc cagggcctcc 101040gcctgatccc actccccgtc cttggcggtg aggaccagga tgggggtcgc gttgttctcc 101100cggcgcaact gggcgcagac cttgtagccg ttgagtccgg gcagcatcaa gtccaggacg 101160accagggcgt attcgccggt ccgggccatc cacagtccct gccggccgtc atgggcgagg 101220tcgacgctgt agccctcggc ggtcagaccg gtgtgcaggg tgtgggcaag gtccacctcg 101280tcttcgacca ccagaatgcg catgcggtgc agcctcgcac agcgccggcc cgccttcctg 101340atcggccggt caggttggat cagcatccgg tcaacgaggt cccggcatgc tggccgagtc 101400tcttcgccac tactgaaagg ccctgccacc gatgtccgtt gctgaacgtg cccccgccgc 101460ggccaggacg gtttcccctc ccgcacgcgc ccgtcgtcag tcaccgctcc ggcctgtgac 101520cgatggcggt ccccagccgc gggcgcgtct gcggtgacgc gggggtgctg agccgagttg 101580gtcaagctca agggcacgac gggccatcat cacgccacaa gcgatgggga 
ggtccgagcg 101640ggcgagcgaa agcgaaccct tgcccttacg agccgacatg aggaacaccg cgtcgatggc 101700gaagcggtcg gcggcagatt tacgcctgac cagcagaaac ccagtgatta cacccggaag 101760acaacaagtt atcagatagc ctccaggggg agacttgcgc agacaagtga aaagagcgtg 101820cgcggccacg atcgccacgg cggcagccgt ggccctggcg gcggacatga ccagcccggc 101880gtcggcggag cccgagcgta cggccggtga ccaggccgta cagaccacgc ccaagcaccg 101940cgtcaccctg atcaccggcg accgtgtcgt cgtcgacgcc aagggccgcg tcatcggcct 102000ggagcgggcc aagggccgcg aggggatacc cgtccagatc cgcaaggccg acgggcacac 102060cctcgtcgtg ccggccgacg cggcacggct gatagccgac ggcagactcg accagcggct 102120cttcgacgtc accgagctca acaagtcggc caaccgcaag gcccagaagc agggcctcaa 102180gctgatcgtc ggctacagcg gcacggccgc cgcggcgaag gcggacgtcc gggaggccgg 102240cgacaccaag gtccgcagga ccctgaagtc gctgaacgcg gacgcggtgc tgacgcccaa 102300gggcgacgcg cccgacctgt gggccgcggt caccgacacg ccgtccggcg gcgcgaagac 102360cgcctccggc atcgcccacg tgtggctcga cggggtccgc aaggccagcc tcgacaagtc 102420cgttccgcag atcggcgccg acaaggcgtg ggccgccggg tacaacggca agggcgtcaa 102480gatcgccgtc ctcgacaccg gtgtcgacgc gacccacccg gacctcaagg agcaggtggt 102540cggagagaag aacttctcca cgtcccccga cgcgaccgac aagtacggcc acggcacgca 102600cgtcgcgtcc atcgcggccg gtacgggagc caagtcggcg ggcaagtaca agggcgtcgc 102660accgggcgcc aagctgctca acggcaaggt gctcggcgac gacggctccg gcgacgactc 102720cggcatcctc gccggcatgg agtgggcggt cgagcagggc gccgacgtcg tgaacctcag 102780cctcggcggc ggggacaccc ccgacatcga cccgcttgag gcccaggtca acaagctgtc 102840caaggagaag ggcgtcctct tcgccatcgc cgcgggcaac gacggcgact tcggcgagca 102900gacgatcggc tccccgggca gcgcggaggc cgcgctcacc gtgggcgccg tcgacgacac 102960cgacaagctg gcctcgttct ccagcacggg ccccggcctc gacgggcaga tcaagcccga 103020cgtgaccgca cccggtgtgg acaccacggc cgcctcggcc ccgggcagcg tcatagccca 103080ggaggtcggc gagaagccgc ccggctacgt gagcatctcg ggtacgtcga tggccacccc 103140gcatgtcgcg ggcgccgcgg cgatcctgaa gcagcagcac cccgactgga cgtacaccca 103200gctcaagggt gcgctgaccg gctccgcgaa gggcggcaag tacacgccgt tccagcaggg 103260ttcgggtccg gagtccaggg tcgacaatgg 
cacagtcaag ccagatccgg tgggtcgctc 103320gatcctcgaa cggtcgggtg agtcggtcgg cccgtccatg cacgtggctc gcacaccgac 103380gacaagccgg tcaccgacaa ggtcacgtac aagaacctcg ggaagaccga tgtcacgctg 103440accctcgcgg tgacggccac cgacccgaag gggcaggccg caccggccgg cttcttcacg 103500ctcggcacca agacgctgac cgtcccggcg ggcggctcgg cctccgccga cctcacggtc 103560aacacgaagc agggcggcac gctcgacggc gcctactccg cgtacgtgac cgccaccggc 103620ggcggccaga gcgtacgcac ggcggcgacg gtgcagcgcg aggtggagtc gtacgacgtc 103680acgctcaagt tcatcgaccg tgacggcaac ccggcgaagt actacaacgc cgaactggac 103740ggtgtcaccg ggctcgcaca gggcaagtgg tactcgccct acgacgagtc cggcaccgtc 103800aaggtccgct ttcccaaggg cggttacatc ttcaactcgg ccgtccacgt cgacccggat 103860gaccccgcca agggtttcga ctgggtgacg cagccgaagc tgagcatcac caagaaggcc 103920acgatcacgg tggacgcgcg gaccgcgaag ccggtggaca tcaccgtgcc cgacgcggcg 103980gcgaagtcgg aggtcgctac gccgttgtac accgtcggcg tgccggacgg cagcaactcg 104040tacggctggt ggctggactc gtacgccaac ttccgtaccg cgcacgccgg tccgca 104096
<210>2<211>6148<212>PRT<213>Nanchang Streptomyces NS3226 (Streptomyces nanchangensis n.sp.NS3226)<400>2
Met Ala Gly Gly Ser Glu Ser Glu Ala Ala Glu Phe Thr Ala Arg Ser1 5 10 15Ala Gln Pro Ile Ala Val Val Gly Met Ala Cys Arg Leu Pro Gly Ala20 25 30Ala Gly Pro Ala Glu Phe Arg Ala Ile Leu Arg Ser Gly Thr Glu Ala35 40 45Val Gly Ala Ala Ala Pro Asp Arg Pro Tyr Ala Pro Pro Arg Gly Gly50 55 60Phe Leu Asp Ser Val Asp Arg Phe Asp Ala Gly Phe Phe Gly Val Ser65 70 75 80Pro Arg Glu Ala Ala Val Met Asp Pro Gln Gln Arg Leu Met Leu Glu85 90 95Leu Cys Trp Glu Ala Leu Glu Asp Ser Gly Ile Val Pro Ala Arg Leu100 105 110Asp Gly Ser Asp Ala Gly Val Phe Val Gly Ala Ile Thr Asp Asp Tyr115 120 125Ala Val Leu Ser Arg Ala Ala Gly Val Asp Ala Ala Thr Pro Glu Thr130 135 140Ser Thr Gly Leu Asn Arg Gly Met Ile Ala Asn Arg Val Ser Tyr Arg145 150 155 160Leu Gly Leu Arg Gly Pro Ser Phe Thr Val Asp Ser Gly Gln Ser Ser165 170 175Ser Leu Val Ala Val His Leu Ala Thr Glu Ser Leu Arg Arg Gly Glu180 185 190Cys Ser Leu Ala Leu Ala Gly Gly Val Asn Leu Ile Leu Ala Glu Asp195 200 205Ser Thr Ala Ala Val Glu Arg Phe Gly Ala Leu Ser Pro Asp Gly Arg210 215 220Cys Tyr Thr Phe Asp Ala Arg Ala Asn Gly Tyr Val Arg Gly Glu Gly225 230 235 240Gly Gly Val Val Val Leu Lys Arg Leu Thr Asp Ala Val Ala Asp Gly245 250 255Asp Asp Ile Leu Cys Val Leu Ala Gly Ser Ala Val Asn Asn Asp Gly260 265 270Gly Gly Glu Gly Leu Thr Val Pro Asp Arg Gln Gly Gln Glu Ala Val275 280 285Leu Thr Ala Ala Tyr Glu Gln Ala Gly Ile Ser Pro Asn Ala Val Gly290 295 300Tyr Val Glu Leu His Gly Thr Gly Thr Pro Ala Gly Asp Pro Val Glu305 310 315 320Ala Ala Ala Val Gly Ala Val Leu Gly Ala Gly Arg Ser Ala Glu Gln325 330 335Pro Leu Leu Val Gly Ser Val Lys Thr Asn Ile Gly His Leu Glu Gly
340 345 350Ala Ala Gly Ile Ala Gly Leu Leu Lys Ala Val Leu Thr Val Arg His355 360 365Arg Glu Ile His Ala Ser Leu Asn Phe Thr Thr Pro Ser Thr Arg Ile370 375 380Pro Met Thr Glu Leu Gly Leu Ser Val Asn Thr Ala Leu Arg Pro Trp385 390 395 400Leu Ser Glu Ala Gly Pro Leu Ile Val Gly Val Ser Ser Phe Gly Met405 410 415Gly Gly Thr Asn Cys His Val Val Leu Thr Glu Trp His Gly Val Ala420 425 430Pro Val Thr Ala Pro Gly Ile Arg Pro Asn Gly Thr Ala Val Pro Leu435 440 445Leu Ile Thr Gly Arg Asp Glu Gln Ala Leu Arg Asp Gln Ala His His450 455 460Leu Gly Arg His Leu Asp Glu His Gly Pro Leu Arg Leu Lys Asp Val465 470 475 480Ala His Thr Leu Ala Ala Gly Arg Thr Ala Phe Glu His Arg Ala Val485 490 495Leu Leu Val Arg Glu Pro Gln Asp Met Thr Asp Gly Leu Ala Arg Leu500 505 510Ala Asp Gly Thr Pro Gly Pro Asp Leu Val Arg Ala Thr Ala Thr Cys515 520 525Ser Ser Leu Ala Phe Leu Phe Thr Gly Gln Gly Ser Gln Arg Pro Gly530 535 540Met Thr Ala Glu Leu Tyr Gln Ser Ser Ser Glu Tyr Ala Ala Ala Leu545 550 555 560Asp Glu Val Cys Ala His Leu Asp Pro Gln Leu Arg Val Pro Leu Arg565 570 575Glu Val Leu Phe Ala Ala Glu Gly Thr Ala Glu Ala Val Leu Leu Asp580 585 590Arg Thr Glu Phe Thr Gln Pro Ala Leu Phe Ala Val Glu Val Ala Leu595 600 605Phe Arg Phe Ala Glu His Cys Gly Leu Val Pro Arg Leu Leu Leu Gly610 615 620His Ser Val Gly Glu Leu Ala Ala Ala His Val Ala Gly Val Leu Ser625 630 635 640Leu Ala Asp Ala Cys Ser Leu Val Ala Ala Arg Gly Arg Leu Met Gln645 650 655Ala Gln Pro Ala Thr Gly Ala Met Ala Ala Ile Gln Ala Thr Glu Lys660 665 670Glu Leu Ala Pro Phe Leu Asp Glu Ser Val Ala Ala Ala Ala Leu Asn675 680 685Gly Pro Ala Ser Thr Val Leu Ala Gly Asp Glu Glu Ala Val Leu Ala690 695 700Ile Ala Ala His Trp Ala Ala Lys Gly Arg Arg Thr Lys Arg Leu Arg705 710 715 720
Val Ser His Ala Phe His Ser Pro His Met Asp Gly Met Leu Glu Glu725 730 735Phe His Arg Val Ala Gly Gln Leu Thr Phe Glu Ala Pro Arg Val Pro740 745 750Ile Val Ser Asn Glu Thr Gly Ala Leu Leu Thr Glu Ala Glu Ala Cys755 760 765Ser Pro Glu Tyr Trp Val Arg Gln Ala Arg Val Thr Val Arg Phe Leu770 775 780Asp Gly Val Arg Leu Leu Glu Glu Gln Gly Val Thr Thr Leu Leu Glu785 790 795 800Leu Gly Pro Asp Gly Thr Leu Ser Ser Leu Ala Arg Asp Cys Leu Arg805 810 815Gly Val Asp Ala Val Ser Val Pro Leu Leu Arg Gly Arg Thr Glu Pro820 825 830Glu Glu Val Val Ala Ala Leu Ala Thr Leu Gln Val Arg Gly Val Pro835 840 845Met His Trp Glu Arg Leu Ala Thr Glu Glu Gly Ala Arg Arg Val Pro850 855 860Leu Pro Thr Tyr Pro Phe Gln Arg Arg Arg His Trp Leu Pro Asp Leu865 870 875 880Val Ala Gln Asp Ser Val Pro Ala Pro Gly Arg Ala Ala Gly Gln Arg885 890 895Ser Arg Pro Val Asn Glu Pro Ala Pro Ser Ala His Ala Pro Arg Gly900 905 910Asp Arg Thr Met Arg Glu Thr Val Arg Ala Ala Val Ala Leu Val Leu915 920 925Gly His Asp Ser Pro Asp Asp Ile Pro Ala His Thr Thr Phe Arg Glu930 935 940Leu Gly Leu Ser Ser Leu Met Leu Ala Glu Val Gly Glu Arg Leu Thr945 950 955 960Glu Ala Thr Gly Arg Arg Val Pro Thr Thr Leu Leu Phe Asp His Pro965 970 975Thr Pro Asp Ala Leu Val Arg Glu Leu Thr Ser Gly Gly Ala Glu Arg980 985 990Pro Ala Ala Leu Thr Thr Ala Pro Ser Ala Ala His Ala Asp Asp Pro995 10001005Val Val Val Val Gly Met Ala Cys Arg Leu Pro Gly Gly Ile Arg101010151020Ser Pro Glu Glu Phe Trp Gln Phe Met Ala Ala Asp Gly Asp Ala102510301035Ile Ser Pro Leu Pro Thr Asp Arg Gly Trp Ala Val Ser Gly Asp104010451050Phe Pro Ala Glu Gly Gly Phe Leu Ala Asp Val Ala Gly Phe Asp105510601065Ala Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu Ala Met Asp107010751080
Pro Gln Gln Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu1085 1090 1095Arg Ala Gly Val Asp Ala Leu Ser Leu Arg Gly Ser Arg Thr Gly1100 1105 1110Val Phe Val Gly Ala Ser Pro Ser Glu Tyr Gly Pro Arg Leu His11151120 1125Glu Pro Ser Gln Ala Asp Gly His Val Leu Thr Gly Thr Ala Pro11301135 1140Ser Val Leu Ser Gly Arg Val Ala Tyr Val Leu Gly Leu Glu Gly1145 1150 1155Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala1160 1165 1170Leu His Leu Ala Ala Gln Ala Leu Arg Gly Gly Glu Cys Asp Leu1175 1180 1185Ala Leu Ala Gly Gly Val Ala Val Met Ala Thr Ala Gly Met Phe1190 1195 1200Ala Glu Phe Ala Arg Gln Gly Gly Leu Ala Arg Asp Gly Arg Cys12051210 1215Lys Ala Phe Ala Asp Gly Ala Asp Gly Thr Gly Trp Gly Glu Gly1220 1225 1230Val Gly Val Leu Val Leu Ser Arg Leu Ser Glu Ala Arg Arg Cys1235 1240 1245Gly Tyr Thr Val Leu Ala Val Val Ser Gly Ser Ala Val Asn Ser1250 1255 1260Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln1265 1270 1275Gln Arg Val Ile Arg Gln Ala Leu Ala Ser Ala Gly Leu Ser Pro1280 1285 1290Gly Asp Val Asp Val Val Glu Ala His Gly Thr Gly Thr Ala Leu1295 1300 1305Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln1310 1315 1320Glu Arg Gly Ala Gly Arg Pro Leu Tyr Val Gly Ser Val Lys Ser1325 1330 1335Asn Ile Gly His Val Gln Ala Ala Ala Gly Val Ala Gly Val Ile1340 1345 1350Lys Ser Val Leu Ala Leu Arg Tyr Gly Val Leu Pro Arg Thr Leu1355 1360 1365His Val Asp Val Pro Ser Arg Glu Val Asp Trp Ser Ala Gly Ala1370 1375 1380Val Glu Leu Leu Thr Glu Ala Val Glu Trp Leu Ala Gly Gly Arg1385 1390 1395Pro Arg Arg Val Gly Val Ser Ala Phe Gly Ile Ser Gly Thr Asn1400 1405 1410Ala His Val Ile Leu Glu Glu Ala Pro Glu Gly Val Glu Glu Ser1415 1420 1425Ala Ala Gly Glu Val Ala Gly Val Val Pro Trp Val Val Ser Ala
1430 1435 1440Arg Ser Glu Glu Gly Leu Arg Ala Gln Ala Ala Arg Leu Val Glu1445 1450 1455His Val Val Gly Gly Ser Gly Leu Gly Pro Val Asp Val Gly Trp1460 1465 1470Ser Leu Ala Arg Ser Arg Ala Val Leu Glu His Arg Ala Val Val1475 1480 1485Leu Gly Gly Asp Gly Glu Glu Leu Val Ala Gly Leu Arg Ala Leu1490 1495 1500Cys Asp Gly Val Leu Gly Pro Gly Val Val Arg Gly Val Ala Gly1505 1510 1515Asp Gly Gly Thr Ala Leu Leu Phe Thr Gly Gln Gly Ala Gln Arg1520 1525 1530Val Gly Met Gly Arg Glu Leu Tyr Glu Ala Phe Pro Val Phe Ala1535 1540 1545Ala Ala Phe Asp Ala Val Cys Ala Gly Phe Glu Gly Met Leu Pro1550 1555 1560Gly Ser Leu Arg Gly Val Val Phe Gly Asp Gly Gly Gly Val Val1565 1570 1575Asp Arg Thr Glu Trp Ala Gln Pro Ala Leu Phe Ala Leu Glu Val1580 1585 1590Ala Leu Phe Glu Leu Val Val Ser Trp Gly Val Arg Ala Asp Val1595 1600 1605Leu Val Gly His Ser Val Gly Glu Leu Val Ala Ala His Val Ala1610 1615 1620Gly Val Trp Ser Leu Ala Asp Ala Cys Arg Val Val Ala Ala Arg1625 1630 1635Gly Arg Leu Met Gln Ala Leu Pro Val Gly Gly Ala Met Val Ala1640 1645 1650Val Arg Val Gly Glu Gly Glu Leu Pro Val Leu Pro Glu Gly Val1655 1660 1665Ser Val Ala Ala Val Asn Gly Pro Arg Ser Leu Val Leu Ser Gly1670 1675 1680Asp Glu Gly Pro Val Leu Glu Leu Ala Ala Arg Leu Ala Gly Glu1685 1690 1695Gly Arg Asp Thr Arg Arg Leu Arg Val Ser His Ala Phe His Ser1700 1705 1710Ala Arg Met Glu Pro Met Leu Ala Glu Phe Ala Gln Val Leu Ala1715 1720 1725Ala Val Glu Phe Arg Ala Pro Arg Ile Pro Val Ile Ser Asn Val1730 1735 1740Thr Gly Glu Val Ala Gly Glu Glu Leu Thr Thr Pro Glu Tyr Trp1745 1750 1755Val Arg Gln Val Arg Glu Ala Val Arg Phe Ala Asp Gly Val Asn1760 1765 1770Thr Ala His Gly Ser Gly Val Arg Arg Tyr Leu Glu Leu Gly Pro1775 1780 1785
Asp Gly Val Leu Thr Ser Leu Ala His Asp Ile Leu Ala Glu Gln1790 1795 1800Gly Ile Asp Arg Asp Val Ala Val Val Pro Ala Leu Arg His Asp1805 1810 1815Gln Pro Glu Ser Arg Thr Leu Leu Thr Ala Leu Gly Gln Leu His1820 1825 1830Thr Thr Gly Met Asp Val Gly Trp Ala Ala Phe Leu Ala Pro Tyr1835 1840 1845Gly Ala Arg Thr Val Glu Leu Pro Thr Tyr Ala Phe Glu His His1850 1855 1860Arg Tyr Trp Leu Asp Pro Val Ala Pro Ala Ser Ala Pro Ala Asp1865 1870 1875Pro Leu Arg Tyr Arg Ala Glu Trp Ala Ser Val Pro Asp Cys Ala1880 1885 1890Thr Pro Ser Leu Ser Gly Val Gln Ala Val Val Val Pro Ala Gly1895 1900 1905Gly Gly His Leu Asp Val Leu Pro Asp Val Thr Ala Ala Leu Arg1910 1915 1920Glu His Gly Ala Arg Thr Val Leu Val Glu Val Asp Pro Glu Arg1925 1930 1935Ala Asp Arg Ala Glu Ile Ala Asp Ala Leu Arg Ala Ala Leu Gly1940 1945 1950Glu Glu Gly Gly Gly Val Val Ser Leu Leu Ala Leu Asp Arg Gly1955 1960 1965Pro Phe Ala Gly Val Ala Ala Thr Ala Val Leu Leu Gln Ala Leu1970 1975 1980Thr Gly Leu Asp Gly Gly Gly Arg Leu Trp Ser Leu Thr Arg Gly1985 1990 1995Ala Val Ser Val Ser Arg Ser Asp Ala Leu Thr Asp Pro Gly Gln2000 2005 2010Ala Gln Val Trp Gly Met Gly Arg Val Ala Ala Leu Glu His Pro2015 2020 2025Glu Arg Trp Gly Gly Leu Val Asp Leu Pro Thr Glu Leu Asp Asp2030 2035 2040Arg Ala Arg Ala Arg Leu Cys Ala Val Leu Ser Gly Ser Thr Gly2045 2050 2055Glu Asp Gln Val Ala Val Arg Ala Ala Gly Leu Tyr Ala Arg Arg2060 2065 2070Leu His Arg Val Ala Pro Arg Val Pro Thr Thr Glu Asp Ala Gly2075 2080 2085Ala Ala Ser Gly Gln Gly Val Gly Asp Arg Arg Ala Tyr Thr Tyr2090 2095 2100Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly Ala His2105 2110 2115Ile Ala Asn Trp Leu Ala Arg Ser Gly Thr Arg His Val Leu Leu
Thr Ser Arg Arg Gly Pro Asp Ala Glu Gly Ala Ala Asp Leu Thr2135 2140 2145Ala Arg Leu Arg Glu Leu Gly Thr Glu Val Thr Val Ala Ala Cys2150 2155 2160Asp Val Ala Asp Arg Gln Arg Leu Ala Asp Leu Ile Ala Ala Leu2165 2170 2175Ser Ala Asp Arg Pro Leu Thr Gly Val Val His Ala Ala Gly Val2180 2185 2190Leu Asp Asp Gly Val Leu Asp Ser Leu Thr Pro Asp Arg Phe Asp2195 2200 2205Ala Val Ala Arg Pro Lys Val Ile Gly Ala Arg His Leu His Glu2210 2215 2220Leu Thr Arg Asp Leu Asp Leu Ser Leu Phe Val Met Phe Ser Ser2225 2230 2235Val Val Gly Thr Val Gly Leu Ala Gly Gln Gly Asn Tyr Ala Ala2240 2245 2250Ala Asn Ala Tyr Leu Asp Ala Leu Ala Val His Arg Ala Gln His2255 2260 2265Gly Leu Pro Ala Thr Ala Val Ala Trp Gly Ser Trp Ser Gly Ala2270 2275 2280Gly Met Ala Gly Asp Thr Arg Ala Ala Arg Asp Arg Leu Ala Arg2285 2290 2295Ala Gly Leu Ala Pro Leu Asp Pro Ala Ala Ala Leu Ala Val Leu2300 2305 2310Asp Arg Val Ile Ala Asp Gly Glu Thr Ala Val Thr Val Ala Asp2315 2320 2325Val Asp Trp Glu Arg Phe Ala Ala Gly Phe Ala Pro Gly Arg Pro2330 2335 2340His Pro Leu Leu Ala Gly Ile Pro Glu Leu Trp His Ala Arg Pro2345 2350 2355Gln Glu Thr Gly Gln Val Thr Asp Gly Pro Ala Asp Arg Leu Ala2360 2365 2370Gly Leu Ala Gly Asp Glu Leu Arg Gln Ala Leu Asp Asp Met Val2375 2380 2385Thr Val Glu Val Ala Ala Val Leu Gly Phe Arg Ala Lys Asp Arg2390 2395 2400Val Pro Thr Asp Arg Thr Phe Lys Ser Leu Gly Phe Asp Ser Leu2405 2410 2415Ile Gly Val Glu Phe Arg Asn Arg Leu Ala Ala Ala Leu Gly Arg2420 2425 2430Arg Leu Pro Pro Ser Leu Ile Tyr Asp His Pro Thr Pro Gly Arg2435 2440 2445Leu Val Glu His Leu Ala Ala Gly Val Asp Gly Gly Asp Gln Pro2450 2455 2460Ser Thr Val Gly Gly Arg Pro Val Ala Pro Thr Arg Thr His Asp2465 2470 2475Asp Pro Val Val Ile Val Ser Ala Ala Cys Arg Phe Pro Gly Gly
2480 2485 2490Val Arg Thr Pro Glu Asp Leu Trp Gln Leu Val Leu Asp Gly Gly2495 2500 2505Asp Ala Ile Gly Pro Phe Pro Val Asp Arg Gly Trp Asp Leu Asp2510 2515 2520Arg Leu Tyr Asp Pro Asp Pro Gly Ala Ser Gly Thr Ser Tyr Val2525 2530 2535Arg Glu Gly Gly Phe Leu Thr Gly Val Ala Asp Phe Asp Ala Val2540 2545 2550Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln2555 2560 2565Gln Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala2570 2575 2580Gly Ile Val Pro Gly Ser Leu Ala Gly Ser Arg Thr Gly Val Phe2585 2590 2595Val Gly Ser Asn Gly Gln Asp Tyr Ala Asn Leu Leu His Ser Ser2600 2605 2610Asp Val Glu Gly His Val Leu Thr Gly Thr Ala Ser Ser Val Leu2615 2620 2625Ser Gly Arg Ile Ala Tyr Thr Leu Gly Leu Glu Gly Pro Ala Leu2630 2635 2640Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu2645 2650 2655Ala Val Gln Ala Leu Ser Ser Gly Glu Cys Asp Leu Ala Leu Ala2660 2665 2670Gly Gly Val Thr Val Met Ser Gly Ser Asp Ile Phe Val Glu Phe2675 2680 2685Ser Arg Gln Arg Gly Leu Ser Ala Asp Gly Arg Cys Lys Ala Phe2690 1695 2700Gly Pro Asp Ala Asp Gly Thr Gly Trp Ala Glu Gly Val Gly Thr2705 2710 2715Val Val Leu Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly His Glu2720 2725 2730Val Leu Gly Val Val Arg Gly Thr Ala Val Asn Gln Asp Gly Ala2735 2740 2745Ser Asn Gly Leu Ser Ala Pro Ser Gly Arg Ala Gln Gln Arg Val2750 2755 2760Ile Arg Gln Ala Leu Ala Asp Ala Gly Cys Ala Pro Ser Asp Val2765 2770 2775Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro2780 2785 2790Ile Glu Ala Gln Ala Leu Leu Thr Thr Tyr Gly Gln Asp Arg Pro2795 2800 2805Ala Asp Arg Pro Leu Tyr Leu Gly Ser Ile Lys Ser Asn Ile Gly2810 2815 2820His Ala Gln Ala Ala Ala Gly Leu Ala Gly Val Leu Lys Met Leu
Phe Ala Leu Arg His Gly Gln Leu Pro Lys Thr Leu His Ala Pro2840 2845 2850Arg Pro Thr Pro Glu Val Asp Trp Ser Glu Gly Ala Val Ala Leu2855 2860 2865Leu Thr Glu Asp Arg Pro Trp Pro Ala Val Asp Arg Pro Arg Arg2870 2875 2880Ala Gly Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His Val2885 2890 2895Ile Leu Glu Gln Ala Pro Pro Ser Ala Ala Ser Asp Pro Ala Pro2900 2905 2910Thr Val Arg Pro Pro Ala Val Asp Ser Ser Val Gln Pro Trp Val2915 2920 2925Leu Thr Ala Arg Ser Gly Glu Ala Leu Gly Ala Leu Ala Asp Arg2930 2935 2940Leu Arg Glu Ala Ala Pro Gly Ala Val Pro Ala Asp Val Ala Arg2945 2950 2955Ser Leu Val Thr Thr Arg Thr Ile Trp Ala Glu Arg Ala Val Leu2960 2965 2970Leu Ala Asp Gly Arg Asp Glu Tyr Ala Ser Gly Leu Ala Ala Leu2975 2980 2985Ala Thr Gly Glu Gly Asp Ala Arg Val Val Arg Gly Thr Ala Asp2990 2995 3000Thr Arg Gly Arg Val Val Phe Val Phe Pro Gly Gln Gly Ala Gln3005 3010 3015Trp Ala Gly Met Ala Ala Arg Leu Trp Glu Ser Ser Pro Glu Phe3020 3025 3030Ala Arg Trp Met Asp Arg Cys Asp Lys Ala Leu Gly Asp Leu Thr3035 3040 3045Asp Trp Ser Leu Ala Glu Val Ile His Gln Ala Asp Gly Ala Pro3050 3055 3060Gly Leu Asp Arg Val Asp Val Leu Gln Pro Ala Ser Trp Ala Val3065 3070 3075Ser Val Ser Leu Ala Ala Leu Trp Arg Ser Cys Gly Val Glu Pro3080 3085 3090Ala Ala Val Val Gly His Ser Gln Gly Glu Ile Ala Ala Ala Cys3095 3100 3105Val Ala Gly Ala Leu Ser Leu Glu Asp Gly Ala Met Leu Val Thr3110 3115 3120Leu Arg Ser Arg Leu Ile Arg Glu Glu Leu Ser Gly His Gly Gly3125 3130 3135Met Met Ser Val Ala Leu Ser Pro Ala Gly Thr Ala Asp Arg Ile3140 3145 3150Ala Cys Trp Glu Gly Arg Ile Cys Val Ala Ala His Asn Ser Arg3155 3160 3165Arg Ser Thr Val Val Ala Gly Glu Pro Ala Ala Leu Ala Glu Leu3170 3175 3180
Leu Ala Ala Cys Glu Ala Asp Gly Ile Arg Ala Arg Arg Ile Pro3185 3190 3195Val Asp Tyr Ala Ser His Ser Pro Gln Val Glu Arg Ile Glu Arg3200 3205 3210Lys Leu Thr Glu Leu Ala Ala Gly Ile Val Ser Arg Ser Ser Glu3215 3220 3225Ile Pro Phe His Ser Thr Val Thr Gly Thr Arg Leu His Thr Thr3230 3235 3240Gly Leu Asp Ala Gly Tyr Trp Tyr Arg Asn Leu Arg Lys Pro Val3245 3250 3255Leu Phe Gly Pro Val Thr Glu Glu Leu Leu Thr Gln Gly His Asp3260 3265 3270Val Phe Leu Glu Met Ser Pro His Pro Val Leu Val Pro Ala Val3275 3280 3285Gln Glu Ala Ser Asp Ala Val Thr Ala Thr Ala Ala Ala Val Gly3290 3295 3300Ser Leu Arg Arg Gly Asp Gly Gly Pro Glu Arg Phe Leu Leu Ser3305 3310 3315Leu Ala Glu Ala Phe Val Arg Gly Ala His Val Asp Trp Ala Ala3320 3325 3330Val Leu Gly Gly Thr Gly Thr Arg Leu Val Glu Leu Pro Thr Tyr3335 3340 3345Pro Phe Gln Arg Thr Arg Phe Trp Pro Glu Pro Val Thr Pro Ala3350 3355 3360Thr Ala Thr Gly Gly Gln Asp Asp Ala Pro Leu Trp Gln Ala Val3365 3370 3375Glu Arg Gly Asp Val Ala Ala Val Ala Ala Glu Leu Ala Val Pro3380 3385 3390Asp Gly Arg Ser Leu Arg Asp Leu Val Pro Ala Leu Ser Gly Trp3395 3400 3405Arg Arg Arg Arg Arg Asp Ser Ala Thr Leu Asp Ile Trp Arg Tyr3410 3415 3420Arg Val Thr Trp Thr Gln Val Asn Leu Pro Val Ser Ala Ala Val3425 3430 3435Thr Gly Asp Trp Leu Leu Val Thr Asp Asp Pro Asp Thr Ala Val3440 3445 3450Pro Arg Trp Val Ser Ala Ala Leu Gly Glu Gly Leu Ala Thr Val3455 3460 3465Val Arg Pro Ala Asp Val Pro Ala Trp Ser Arg Thr Pro Gln Gly3470 3475 3480Thr Gly Trp Thr Gly Val Val Ser Leu Leu Gly Leu Thr Asp His3485 3490 3495Ser His Pro Cys His Pro Ala Leu Ser Thr Gly Val Ala Ala Thr3500 3505 3510Val Thr Leu Leu Thr Ala Leu Arg Glu Ala Gly Ile Glu Ala Pro3515 3520 3525Leu Trp Cys Leu Thr Ser Gly Ala Val Gly Thr Gly Gly Leu Asp
3530 35353540Gln Val Thr Ala Pro Asn Gln Ala Gln Leu Trp Gly Leu Gly Arg3545 3550 3555Val Ala Gly Leu Glu Thr Pro Ala Thr Trp Gly Gly Leu Val Asp3560 3565 3570Leu Pro Ala Glu Pro Asp Glu Arg Thr Ala Ala Leu Leu Arg Ala3575 3580 3585Ala Leu Thr Ala Asp Gly Ile Glu Gln Glu Tyr Ala Leu Arg Pro3590 3595 3600Ser Gly Pro Tyr Val Arg Arg Leu Val Arg Ala Pro Leu Ala Gly3605 3610 3615Val Ala Ala Pro Arg Ser Trp Arg Pro Arg Pro Asp Gly Thr Val3620 3625 3630Val Val Thr Gly Gly Thr Gly Ala Leu Gly Ala Arg Val Ala Arg3635 3640 3645Trp Leu Ala Arg Ala Gly Ala Gly His Leu Leu Leu Thr Ser Arg3650 3655 3660Arg Gly Pro Ala Ala Asp Gly Ala Val Glu Leu Ser Glu Glu Leu3665 3670 3675Arg Ala Leu Gly Ala Glu Val Thr Ile Thr Ala Cys Asp Val Ala3680 3685 3690Asp Arg Ala Gln Leu Ala Asp Val Leu Ala Ala Val Pro Thr Ala3695 3700 3705Phe Pro Val Ser Ala Val Ile His Thr Ala Gly Val Ser Gly Asn3710 3715 3720Ala Pro Leu Ala Gly Thr Thr Leu Ala Glu Leu Ala Glu Val Val3725 3730 3735Ala Ala Lys Ala Ala Gly Ala Arg Asn Leu Asp Glu Leu Leu Ala3740 3745 3750Gly Gln Asp Leu Asp Ala Phe Val Leu Phe Ser Ser Gly Ala Ala3755 3760 3765Val Trp Gly Ser Ala Gly Gln Gly Gly Tyr Ala Ala Ala Asn Ala3770 3775 3780Tyr Ala Asp Ala Leu Ala Ala Asp Arg Arg Arg Arg Gly Leu Val3785 3790 3795Ala Thr Ser Val Ala Trp Gly Ser Trp Ala Gly Gly Gly Met Val3800 3805 3810Asp Asp Asp Leu Ala Arg Glu Leu Ala Arg Gly Gly Val Arg Ser3815 3820 3825Met Asp Pro Asp Arg Ala Ile Ala Ala Leu Gln Gln Ala Leu Asp3830 3835 3840Phe Ala Glu Thr Phe Thr Ala Ala Arg Pro Arg Pro Leu Ile Asp3860 3865 3870Gly Ile Pro Glu Ala Ala Pro Ala Ser Ala Glu Pro Ala Gly Asp3875 3880 3885
Ile Pro Gly Leu Ala Ala Arg Leu Ala Gln Leu Pro Asp Gly Glu3890 3895 3900Arg Asp Arg Glu Leu Leu Asp Leu Val Arg Asn Ala Ala Ala Leu3905 3910 3915Ala Leu Gly His Thr Gly Thr Glu Pro Ile Thr Pro Ser Lys Pro3920 3925 3930Phe Lys Glu Leu Gly Phe Asp Ser Leu Thr Ala Val Asp Leu Arg3935 3940 3945Asn Arg Leu Thr Ala Ala Thr Gly Leu Arg Leu Pro Ala Thr Leu39503955 3960Val Phe Asp Tyr Pro Thr Pro Arg Ala Ala Ala Asp Ala Leu Arg3965 3970 3975Ala Val Leu Phe Ala Ala Asp Met Pro Val Asp Thr Ala Ala Pro3980 3985 3990Ala Arg Ser Ala Ser Ala Arg Pro Ala Asp Asp Asp Pro Val Val3995 4000 4005Val Val Ala Met Ala Cys Arg Tyr Pro Gly Gly Ala Thr Thr Pro4010 4015 4020Glu Lys Phe Trp Asp Leu Ile Ala Ala Gly Glu Asp Gly Ile Gly4040 4045 4050Phe Ser Arg Thr Gly Gly Phe Leu Ala Asp Val Ala Gly Phe Asp4055 4060 4065Ala Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu Ala Met Asp4070 4075 4080Pro Gln Gln Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu4085 4090 4095Arg Ala Gly Val Asp Ala Leu Ser Leu Arg Gly Ser Arg Thr Gly4100 4105 4110Val Phe Val Gly Ala Ser Pro Ser Glu Tyr Gly Thr Leu Val Ala4115 4120 4125Ser Leu Glu Gly Gly Gln Asp Tyr Ala Leu Thr Gly Ala Val Gly4130 4135 4140Ser Val Leu Ser Gly Arg Val Ala Tyr Val Leu Gly Leu Glu Gly4145 4150 4155Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala4160 4165 4170Leu His Leu Ala Ala Gln Ala Leu Arg Gly Gly Glu Cys Asp Leu4175 4180 4185Ala Leu Ala Gly Gly Val Ala Val Met Ala Thr Pro Asn Ala Phe4190 4195 4200Asp Ala Phe Ala Arg Gln Gly Gly Leu Ala Arg Asp Gly Arg Cys4205 4225 4215Lys Ala Phe Ala Asp Gly Ala Asp Gly Thr Gly Trp Gly Glu Gly4220 4225 4230
Val Gly Val Leu Val Leu Ser Arg Leu Ser Glu Ala Arg Arg Cys4235 4240 4245Gly Tyr Thr Val Leu Ala Val Val Ser Gly Ser Ala Val Asn Ser4250 4255 4260Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln4265 4270 4275Gln Arg Val Ile Arg Gln Ala Leu Ala Ser Ala Gly Leu Ser Pro4280 4285 4290Gly Asp Val Asp Val Val Glu Ala His Gly Thr Gly Thr Ala Leu4295 4300 4305Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln4310 4315 4320Glu Arg Gly Ala Gly Arg Pro Leu Tyr Val Gly Ser Val Lys Ser4325 4330 4335Asn Ile Gly His Val Gln Ala Ala Ala Gly Val Ala Gly Val Ile4340 4345 4350Lys Ser Val Leu Ala Leu Arg Tyr Gly Val Leu Pro Arg Thr Leu4355 4360 4365His Val Asp Val Pro Ser Arg Glu Val Asp Trp Ser Ala Gly Ala4370 4375 4380Val Glu Leu Leu Thr Glu Ala Val Glu Trp Pro Ala Gly Gly Arg4385 4390 4395Pro Arg Arg Val Gly Val Ser Ala Phe Gly Ile Ser Gly Thr Asn4400 4405 4410Ala His Val Ile Leu Glu Glu Ala Pro Glu Gly Val Glu Glu Ser4415 4420 4425Ala Ala Gly Glu Val Ala Gly Val Val Pro Trp Val Val Ser Ala4430 4435 4440Arg Ser Glu Glu Gly Leu Arg Ala Gln Ala Ala Arg Leu Val Glu4445 4450 4455His Val Val Gly Gly Ser Gly Leu Gly Pro Val Asp Val Gly Trp4460 4465 4470Ser Leu Ala Arg Ser Arg Ala Val Leu Glu His Arg Ala Val Val4475 4480 4485Leu Gly Gly Asp Gly Glu Glu Leu Val Ala Gly Leu Arg Ala Leu4490 4495 4500Cys Asp Gly Val Leu Gly Pro Gly Val Val Arg Gly Val Ala Gly4505 4510 4515Asp Gly Gly Thr Ala Leu Leu Phe Thr Gly Gln Gly Ala Gln Arg4520 4525 4530Val Gly Met Gly Arg Glu Leu Tyr Glu Ala Phe Pro Val Phe Ala4535 4540 4545Ala Ala Phe Asp Ala Val Cys Ala Gly Phe Glu Gly Met Leu Pro4550 4555 4560Gly Ser Leu Arg Gly Val Val Phe Gly Asp Gly Gly Gly Val Val4565 4570 4575Asp Arg Thr Glu Trp Ala Gln Pro Ala Leu Phe Ala Leu Glu Val
4580 4585 4590Ala Leu Phe Glu Leu Val Val Ser Trp Gly Val Arg Ala Asp Val4595 4600 4605Leu Val Gly His Ser Val Gly Glu Leu Val Ala Ala His Val Ala4610 4615 4620Gly Val Trp Ser Leu Ala Asp Ala Cys Arg Val Val Ala Ala Arg4625 4630 4635Gly Arg Leu Met Gln Ala Leu Pro Val Gly Gly Ala Met Val Ala4640 4645 4650Val Arg Val Gly Glu Gly Glu Leu Pro Val Leu Pro Glu Gly Val4654 4660 4665Ser Val Ala Ala Val Asn Gly Pro Arg Ser Leu Val Leu Ser Gly4670 4675 4680Asp Glu Gly Pro Val Leu Glu Leu Ala Ala Arg Leu Ala Gly Glu4685 4690 4695Gly Arg Asp Thr Arg Arg Leu Arg Val Ser His Ala Phe His Ser4700 4705 4710Ala Arg Met Glu Pro Met Leu Ala Glu Phe Ala Gln Val Leu Ala4715 4720 4725Ala Val Glu Phe Arg Ala Pro Arg Ile Pro Val Ile Ser Asn Val4730 4735 4740Thr Gly Glu Val Ala Gly Glu Glu Leu Thr Thr Pro Glu Tyr Trp4745 4750 4755Val Arg Gln Val Arg Glu Ala Val Arg Phe Ala Asp Gly Val Asn4760 4765 4770Thr Ala Leu Gly Arg Gly Val Asp Lys Phe Leu Glu Leu Gly Pro4775 4780 4785Ser Gly Pro Leu Thr Ala Met Ala Glu Glu Val Ile Glu His Thr4790 4795 4800Gly Thr Arg Ala Val Cys Val Pro Val Leu Arg Ala Gly Arg Pro4805 4810 4815Glu Asp Ala Thr Leu Leu His Ala Leu Ala Ala Val Phe Val Thr4835 4840 4845Gly Ala Thr Val Gly Trp Thr Ala Pro Leu Ala Gly Thr Gly Ala4835 4840 4845Arg Ala Val Asp Leu Pro Thr Tyr Ala Phe Gln His Lys Arg Tyr4850 4855 4860Trp Pro Gln Pro Ala Thr Val Gly Arg Asp Leu Ala Ala Ala Gly4865 4870 4875Leu Ala Glu Ala Gly His Pro Leu Leu Thr Ala Trp Leu Pro Ser4880 4885 1890Pro Glu Gly Glu Asp Val Leu Cys Thr Gly Arg Ile Ser Leu Ala4895 4900 4905Thr His Pro Trp Leu Ala Asp His Ala Val Leu Gly Thr Val Leu4910 4915 4920Val Pro Gly Thr Ala Phe Val Asp Leu Ala Cys Trp Ala Gly His4925 4930 4935
Arg Val Gly Cys Gly Ala Leu Arg Glu Leu Thr Leu Ala Thr Pro4940 4945 4950Leu Ala Leu Ala Gln Asp Met Ala Val Arg Leu Arg Leu Val Leu4955 4960 4965Gly Ala Pro Asp Asp Thr Gly Cys Arg Pro Val Ala Leu Tyr Ser4970 4975 4980Gln Gln Glu Gly Ala Asp Glu Gly Thr Asp Gly Thr Gly Trp Thr4985 4990 4995Arg His Ala Glu Gly Leu Leu Ala Pro Gly Gly Asp Ala Ser Val5000 5005 5010Gln Pro Pro Thr Asp Phe Glu Thr Trp Pro Val Thr Gly Cys Glu5015 5020 5025Pro Ile Pro Leu Asp Gly Phe Tyr Glu Glu Leu Ala Asp Ala Gly5030 5035 5040Phe Ser Tyr Gly Pro Val Phe Arg Gly Leu Arg Ala Ala Trp Arg5045 5050 5055Arg Gly Gly Gln Val Phe Ala Glu Val Ser Leu Pro Ala Asp Glu5060 5065 5070Thr Gly Gly Phe Gly Val His Pro Ala Leu Leu Asp Ala Ala Leu5075 5080 5085His Ala Leu Gly Pro Val Ser Arg Asp Thr Asp Glu Pro Gly Ser5090 5095 5100Ala Arg Leu Pro Phe Ser Trp Gly Glu Val Arg Val His Ala Ala5105 5110 5115Gly Ala Asp Arg Leu Arg Val Cys Leu Val Arg Ala Glu Asp Gly5120 5125 5130Thr Val Thr Leu His Gly Ala Asp Ala Ala Gly Arg Pro Val Val5135 5140 5145Thr Val Gly Ser Leu Val Leu Arg Pro Ile Ser Pro Glu Arg Leu5150 5155 5160His Gly Gly Ala Ala Ala Phe Asp Asp Ala Leu Phe Thr Thr Arg5165 5170 5175Trp Met Pro Leu Ser Val Ala Asp Gly Ile Ala Tyr Pro Thr Ala5180 5185 5190Asp Cys Val Leu Leu Gly Asp Pro Leu Glu Arg Ala Trp Arg His5195 5200 5205His Pro Asp Leu Asp Ser Phe Ala Glu Ala Leu Ala Ala Gly Lys5210 5215 5220Glu Lys Pro Gly Thr Val Leu Ala Arg Cys Pro Arg Asp Ile Ala5225 5230 5235Ala Gly Val Asp Pro Ala Glu Ala Ala Arg Arg Cys Ala Glu Trp5240 5245 5250Ala Leu Asp Leu Leu Lys Arg Trp Leu Asp Asp Asp Arg Leu Thr5255 5260 5265Asp Cys His Leu Val Ile Gly Thr Arg His Ala Val Thr Thr Gly
Ala Glu Asp Gln Thr Ala Gly Arg Thr Asp Asp Pro Ala Val Leu5285 5290 5295Ala Gln Ser Thr Leu Leu Gly Leu Val Arg Ser Ala Gln Thr Glu5300 5305 5310Asn Pro Gly Arg Val Thr Leu Ala Asp Phe Asp Gly Thr Ala Pro5315 5320 5325Asp Pro Ala His Leu Ile Leu Ala Val Arg Gln Ala Glu Pro Glu5330 5335 5340Val Ala Val Arg Ala Gly Arg Leu Tyr Ala Arg Arg Leu Thr Arg5345 5350 5355Pro Asp Thr Gly Arg Ala Leu Ala Val Pro Pro Gly Ala Gly Ser5360 5365 5370Trp Arg Leu Glu Ser Thr Gly Arg Gly Thr Leu Asp Asn Leu Ala5375 5380 5385Leu Val Pro Cys Ala Gln Ala Glu Glu Pro Leu Gly Glu Gly Met5390 5395 5400Val Arg Ile Ala Val Arg Ala Ala Gly Val Asn Phe Arg Asp Val5405 5410 5415Leu Ile Val Leu Asp Met Tyr Pro Gly Arg Ala Asp Leu Gly Thr5420 5425 5430Glu Cys Ala Gly Val Val Val Glu Thr Gly His Gly Val Thr Gly5435 5440 5445Leu Val Pro Gly Asp Arg Val Met Gly Met Val Ala Gly Ala Phe5450 5455 5460Ala Pro Thr Ala Val Val Asp Gln Arg Phe Leu Val Arg Ile Pro5465 5470 5475Asp Gly Trp Ser Tyr Glu Thr Ala Ala Ala Ile Pro Val Ala Phe5480 5485 5490Leu Thr Ala Tyr Tyr Gly Leu Val Asp Leu Ala Gly Leu Ser Ala5495 5500 5505Gly Glu Ser Val Leu Val His Ala Ala Ala Gly Gly Val Gly Met5510 5515 5520Ala Ala Val Gln Leu Ala Arg His Leu Gly Ala Glu Met Tyr Gly5525 5530 5535Thr Ala Ser Glu Pro Lys Trp Asp Thr Leu Leu Asp Ser Gly Leu5540 5545 5550Asp Arg Ala His Ile Ala Ser Ser Arg Thr Thr Val Phe Ala Asp5555 5560 5565Ser Val Met Glu Ala Thr Gly Gly Ala Gly Val Asp Val Val Leu5570 5575 5580Asn Ser Leu Ala Gly Glu Phe Val Asp Ala Ser Leu Arg Ala Leu5585 5590 5595Pro Arg Gly Gly Arg Phe Val Glu Met Gly Lys Thr Asp Leu Arg5600 5605 5610Asp Pro Glu Arg Val Ala Ala Glu His Pro Gly Val Arg Tyr Arg5615 5620 5625Pro Phe Asp Leu Gly Glu Ala Gly Ala Asp Arg Ile Ala Glu Val
5630 5635 5640Leu Ala His Leu Ala Glu Leu Phe Ala Ser Gly Glu Leu Thr Pro5645 5650 5655Leu Pro Val Thr Val Trp Asp Ile Arg Asp Ala Pro Ala Ala Phe5660 5665 5670Arg Ala Leu Ser Gln Ala Ala Leu Thr Gly Lys Gly Val Leu Thr5675 5680 5685Val Pro Ala Pro Ser Phe Glu Ala Gly Glu Thr Val Leu Ile Thr5690 5695 5700Gly Gly Thr Gly Thr Leu Gly Thr Leu Leu Ala Arg His Leu Val5705 5710 5715Thr Glu His Gly Leu Arg His Val Ile Leu Ala Gly Arg Arg Gly5720 5725 5730Thr Glu Thr Ala Glu Val Arg His Leu Arg Gly Asp Val Ala Glu5735 5740 5745Leu Gly Ala Arg Ile Glu Val Val Ala Cys Asp Ala Gly Asp Glu5750 5755 5760Arg Ala Leu Arg Gln Val Leu Asp Ala Leu Thr Ala Glu His Arg5765 5770 5775Leu Ala Gly Val Val His Ala Ala Gly Val Thr Asp Asp Gly Val5780 5785 5790Val Ser Ala Leu Asp Arg Gly Arg Leu Ser Ala Val Leu His Pro5795 5800 5805Lys Val Arg Gly Ala Trp Asn Leu His Arg Leu Thr Ala Gly Ser5810 5815 5820Glu Leu Arg Met Phe Val Leu Phe Ser Ser Ala Ser Ala Thr Leu5825 5830 5835Gly Ala Ala Gly Gln Gly Asn Tyr Ala Ala Ala Asn Ala Phe Leu5840 5845 5850Asp Ala Leu Ala Glu His Arg His Ala Leu Gly Leu Pro Ala Thr5855 5860 5865Ser Leu Ala Trp Gly Leu Trp Glu Gln Ala Ser Gly Met Thr Gly5870 5875 5880Arg Leu Leu Asp Arg Asp Arg Gln Arg Met Ser Arg Ser Gly Ile5885 5890 5895Val Pro Leu Ser Ser Ala His Gly Leu Ala Leu Phe Asp Ala Ala5900 5905 5910Arg Leu Ala Gly Leu Pro Thr Leu Thr Pro Ala Arg Leu Asp Leu5915 5920 5925Ala Ala Leu Arg Val Arg Try Ala His Glu Gln Val Pro Ala Val5930 5935 5940Leu Arg Glu Leu Val Arg Val Arg Pro Ser Ala Ala Glu Asp Pro5945 5950 5955Thr Thr Ala Pro Asp Thr Thr Thr Ala Pro Gly Pro Ser Gly Ala5960 5965 5970Met Thr Leu Ala Asp Arg Leu Ala Gly Leu Ser Ala Pro Glu Arg5975 5980 5985
Gln Arg His Val Leu Asp Leu Val Arg Arg His Thr Ala Ala Val5990 5995 6000Leu Gly His Gly Ser Ala Asp Asp Val Asp Pro Asp Gln Ala Phe6005 6010 6015Lys Ala Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn6020 6025 6030His Leu Arg Thr Ala Thr Ser Leu Ala Val Pro Ala Thr Leu Val6035 6040 6045Phe Asp His Pro Thr Pro Ala Ala Leu Ala Ala His Leu Leu Glu6050 6055 6060Leu Ala Ala Pro Pro Glu Arg Asp Pro Ala Leu Arg Val Met Gly6065 6070 6075Gly Leu Asp Arg Leu Glu Ala Asp Val Glu Ala Leu Ala Ser Gly6080 6085 6090Gly Ala Gly His Gln Glu Glu Val Ala Thr Arg Leu Arg Arg Val6095 6100 6105Leu Arg Arg Leu Glu Ser Gly Pro Gly Ala Ala His Ser Gly Thr6110 6115 6120Glu Glu Thr Ser Leu Asp Thr Ala Ser Ala Thr Glu Val Leu Ala6125 6130 6135Phe Ile Asp Ser Glu Phe Gly Asp Leu Ala6140 6145
<210>3<211>4799<212>PRT<213>Nanchang Streptomyces NS3226 (Streptomyces nanchangensis n.sp.NS3226)<400>3
Val Val Ser Asp Asp Lys Leu Val Asp Tyr Leu Lys Arg Val Thr Ala1 5 10 15Asp Leu Lys Arg Thr Arg Gln Arg Val His Glu Leu Glu Ser Gly Ser20 25 30Ala Glu Pro Ile Ala Val Val Ala Met Gly Cys Arg Phe Pro Gly Gly35 40 45Ile Ser Ser Pro Glu Asp Leu Trp Glu Phe Val Arg Leu Gly Ser Asp50 55 60Ala Ile Ser Glu Phe Pro Thr Asp Arg Gly Trp His Thr Ser Arg Leu65 70 75 80Ser Gly Asn Phe Arg Arg Ala Gly Gly Phe Leu Tyr Asp Ala Gly Asp85 90 95Phe Asp Ala Gly Leu Phe Gly Ile Ser Pro Arg Glu Ala Leu Ala Met100 105 110Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr Ala Trp Glu Thr Leu Glu115 120 125Arg Ala Gly Val Asp Pro Thr Ser Val Arg Gly Ala Asp Gly Gly Val130 135 140Phe Ile Gly Met Ala Asp Gln Lys Tyr Gly Pro Arg Asp Asp Glu Leu145 150 155 160
Leu Gly Glu Val Arg Gly Leu Val Leu Thr Gly Thr Thr Ser Ser Val165 170 175Ala Ser Gly Arg Ile Ala Tyr Ser Leu Gly Leu Gln Gly Pro Ala Ile180 185 190Thr Ile Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala195 200 205Val Arg Ser Leu Arg Ala Gly Glu Cys Pro Phe Ala Leu Val Gly Gly210 215 220Ala Ala Val Met Ala Glu Pro Thr Leu Phe Ala Glu Met Ala Glu Gln225 230 235 240Gly Gly Met Ala Gly Asp Gly Arg Cys Lys Ala Phe Ala Ala Ala Ala245 250 255Asp Gly Thr Gly Trp Gly Glu Gly Val Gly Val Leu Leu Leu Gln Pro260 265 270Leu Ser Thr Ala Arg Glu Gln Gly Leu Pro Val Leu Ala Thr Val Arg275 280 285Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro290 295 300Asn Gly Pro Ala Gln Cys Arg Val Ile Arg Lys Ala Leu Ala Asp Ala305 310 315 320Gln Leu Val Ala Gly Gln Ile Asp Ala Val Glu Ala His Gly Thr Gly325 330 335Thr Ala Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr340 345 350Gly Gln Asp Arg Pro Gly Asp Glu Pro Leu Trp Leu Gly Ser Val Lys355 360 365Ser Asn Ile Gly His Thr Gln Ala Ala Ala Gly Met Ala Gly Val Ile370 375 380Lys Met Val Gln Ala Met Arg His Gly Leu Leu Pro Arg Thr Leu His385 390 395 400Val Asp Glu Pro Thr Pro Glu Ala Asp Trp Ser Ala Gly Asp Val Arg405 410 415Leu Leu Thr Glu Glu Arg Glu Trp Pro Asp Thr Gly Arg Pro Arg Arg420 425 430Ala Ala Val Ser Ser Phe Gly Ile Ser Gly Thr Asn Ala His Val Val435 440 445Leu Glu Leu Pro Thr Gly Thr Val Gly Glu Pro Ala Asp Ala Ala Gly450 455 460Pro Val Pro Asp Pro Ser Ala Cys Ala Pro Ile Pro Trp Leu Leu Ser465 470 475 480Ala Ala Ser Ala Asp Ala Leu Arg Ala Gln Ala Arg Arg Leu His Arg485 490 495Phe Val Asp Thr Pro Gly Ala Pro Arg Pro Ile Asp Thr Ala Leu Ser500 505 510Leu Thr Val Thr Arg Ala Arg Leu Asp His Arg Ala Ile Val Phe Gly515 520 525Thr Asp Gln Ala Glu Leu Arg Ala Gly Leu Gly Ala Leu Ala Ala Gly
530 535 540Glu Ser Thr Pro Arg Thr Val His Gly Arg Thr Val Pro Ser Ala Thr545 550 555 560Ile Ala Phe Leu Phe Thr Gly Gln Gly Ala Gln Arg Ala Gly Met Gly565 570 575Arg Ala Ala Tyr Ala Ala Phe Pro Glu Phe Ala Ala Ala Phe Asp Ala580 585 590Val Cys Ala Glu Leu Asp Gly Leu Leu Pro Arg Pro Leu Lys Ser Val595 600 605Leu Phe Ala Glu Pro Asn Ser Ala Asp Ala Ala Leu Val Asp Gln Thr610 615 620Leu Tyr Ala Gln Thr Gly Leu Phe Ala Phe Glu Val Ala Leu Phe Arg625 630 635 640Leu Leu Glu Glu Trp Gly Val Arg Pro Gly Val Leu Leu Gly His Ser645 650 655Val Gly Glu Leu Ala Ala Ala His Val Ala Gly Val Trp Ser Leu Pro660 665 670Asp Ala Cys Arg Val Val Ala Ala Arg Ala Arg Leu Met Gln Ala Leu675 680 685Pro Glu Asp Gly Ala Met Leu Ser Val Ala Ala Ser Glu Lys His Ile690 695 700Ala Glu Leu Leu Gly Asp Leu Ala Asp Val Asp Val Ala Ala Val Asn705 710 715 720Gly Pro Ala Val Thr Val Leu Ser Gly Pro Thr Gly Ala Val Ala Asp725 730 735Val Gly Glu Arg Leu Ala Gly Ala Gly Leu Arg Thr Lys His Leu Arg740 745 750Val Ser His Ala Phe His Ser Ala Leu Met Glu Pro Met Leu Ala Glu755 760 765Phe Ala Arg Glu Ile Ala Asp Val Thr Phe Gln Gln Pro Glu Leu Pro770 775 780Ile Ile Ser Asn Leu Thr Gly Gln Gln Ala Asp Ala Ala Glu Leu Gly785 790 795 800Ser Ala Ala Tyr Trp Val Arg Gln Val Arg Gly Thr Val Arg Phe Ala805 810 815Asp Gly Val Gly Arg Leu Ala Ala His Gly Val Thr Ala Cys Leu Glu820 825 830Leu Gly Pro Asp Gly Val Leu Thr Ala Leu Ala Arg Asp Cys Leu Thr835 840 845Ala Ala Ala Asp Val Ala Leu Val Pro Ala Leu Arg Arg Asp Gln Asp850 855 860Glu Pro Ala Ala Leu Leu Ala Ala Leu Ala Glu Leu His Val Arg Gly865 870 875 880Val Glu Val Asp Trp Ala Ala Met Leu Thr Ala Arg Gly Gly Arg Arg885 890 895Ala Ala Leu Pro Thr Tyr Ala Phe Gln Arg Glu Arg Tyr Trp Leu Pro900 905 910
Ala Thr Pro Ser Val Ala Ser Ala Val Ser Ala Pro Ala Glu Gln Ala915 920 925Asp Arg Leu Leu Tyr Arg Val Gly Trp Ser Pro Val Thr Gly Phe Asp930 935 940Thr Glu Ala Arg Pro Glu Gly Thr Trp Leu Val Val Ala Ser Pro Asp945 950 955 960Asp Glu Gly Arg Arg Val Ala Gln Ala Leu Gly Pro His Thr Val Leu965 970 975Val Ala His Asp Pro Asp Asp Pro Ser Gly Ser Val Ala Arg Leu Arg980 985 990Gly Ala Leu Pro Ala Asp Arg Pro Val Thr Gly Val Leu Ala Leu Pro995 1000 1005Glu Gln Thr Gly Ala Ala Ala Val Ala Ala Gln Leu Ala Leu Arg1010 1015 1020Glu Ala Leu Arg Asp Ala Glu Val Arg Ala Pro Leu Trp Cys Ala1025 1030 1035Thr Arg Ala Ala Val Ser Val Gly Gly Glu Ala Thr Pro Gly Ala1040 1045 1050Ala Gln Ala Pro Leu Trp Gly Leu Asn Arg Ala Leu Glu Thr Cys1055 1060 1065Gly Gly Met Val Asp Leu Pro Gln Arg Leu Asp Ser Arg Ser Leu1070 1075 1080Gly Leu Leu Ala Ala Ala Leu Thr Asn Pro Ala Asp Ala Asp Glu1085 1090 1095Leu Ala Val Arg Thr Gly Gly Leu Phe Ala Arg Arg Leu His Ala1100 1105 1110Val Gln Pro Val Pro Arg Ala Pro Arg Pro Trp Arg Ala Asp Gly1115 1120 1125Thr Val Leu Val Thr Gly Asp Val Glu Ser Ala Thr Asp Asp Leu1130 1135 1140Leu Arg Arg Leu Ser Gly Asp Gly Glu Arg Pro Val Val Leu Ala1145 1150 1155Arg Arg Pro Gly Thr Ala Leu Gln Asn Gly Ala Ala Gly Asp Gly1160 1165 1170Ser Cys Thr Val Val Glu Trp Asp Pro Ala Ala Gly Ala Pro Glu1175 1180 1185Thr Pro Ser Pro Val Thr Ala Val Val His Leu Asp Asn Ile Gln1190 1195 1200Pro Ser Ala Pro Arg Asp Asp Ala Asp Pro Leu Ala Leu Ala Ala1205 1210 1215Ala Val Ala Glu Arg Leu His Thr Val Asp Arg Leu Thr Glu Leu1220 1225 1230Phe Gly Asn Gln Asp Leu Asp Ala Phe Val Leu Leu Ser Ser Val1235 1240 1245Ala Gly Ile Trp Gly Gly Ala Glu Asp Val Val His Thr Val Val1250 1255 1260
His Ala Ala Leu Glu Ser Ala Ala Glu Arg Arg Ala Ala Ala Gly1265 1270 1275Leu Arg Gly Ala Cys Val Gly Trp Gly Pro Trp Ala Gly Ala Gly1280 1285 1290Asp Gly Pro Asp Val Pro Gly Leu Val Pro Met Arg Pro Glu Pro1295 1300 1305Ala Leu Ala Ala Leu Trp His Ala Leu Asp Asp Asp Ala Ala Val1310 1315 1320Phe Ala Val Ala Asp Val Asp Trp Pro Arg Phe His Pro Val Leu1325 1330 1335Thr Ser Arg Arg Pro Arg Pro Val Val Ser Gly Leu Pro Glu Val1340 1345 1350Arg Ala Leu Arg Pro Ala Pro Ser Ala Ala Pro Ala Val Gly Met1355 1360 1365Asp Val Thr Asp Leu Glu His Arg Leu Arg Asp Leu Val Leu Thr1370 1375 1380Glu Ala Ala Thr Ala Leu Gly His Ala Phe Arg Asp Ser Met Asp1385 1390 1395Pro Leu Arg Pro Phe Arg Asp Ala Gly Phe Glu Ser Leu Thr Ala1400 1405 1410Val Arg Phe Arg Asp Arg Ile Ala Ser Glu Thr Gly Leu Asn Leu1415 1420 1425Ser Ala Thr Leu Val Phe Asp His Pro Thr Pro Glu Ala Val Val1430 1435 1440Ala His Leu Leu Ala Glu Leu Thr Gly Gly Arg Pro Asp Glu Ala1445 1450 1455Glu Gln Val Ser Thr Arg Ser His Asp Asp Pro Val Val Ile Ile1460 1465 1470Gly Met Ala Cys Arg Tyr Pro Gly Gly Val Ser Asp Pro Glu Gly1475 1480 1485Leu Trp Glu Leu Val His Ser Gly Arg Glu Gly Ile Gly Asp Phe1490 1495 1500Pro Thr Asp Arg Gly Trp Asp Leu Ala Ala Leu Arg Arg Ala Val1505 1510 1515Pro His Leu Ala Leu Arg Ala Gly Phe Leu Pro Asp Ala Ala Ala1520 1525 1530Phe Asp Ala Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu Ala1535 1540 1545Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Ala Ser Trp Glu Ala1550 1555 1560Val Glu Thr Ala Gly Ile Asp Pro Ala Ser Leu Arg Gly Ser Arg1565 1570 1575Thr Gly Val Phe Ala Gly Val Ala Gly Ser Asp Tyr Gly Ala Ala1580 1585 1590Leu Ala Gly Ser Arg Glu Ala Glu Gly Tyr Leu Met Thr Gly Thr1595 1600 1605Ala Thr Ser Val Val Ser Gly Arg Ile Ala Tyr Val Phe Gly Leu
1610 1615 1620Gln Gly Pro Ala Leu Thr Ile Asp Thr Ala Cys Ser Ser Ser Leu1625 1630 1635Val Ala Leu His Thr Ala Val Gly Ala Leu Arg Lys Gly Glu Cys1640 1645 1650Asp Leu Ala Phe Ala Thr Gly Val Ala Val Ile Ser Thr Pro Asp1655 1660 1665Ala Phe Val Asp Phe Ala Lys Gln Asp Gly Leu Ala Ala Asp Gly1670 1675 1680Arg Cys Lys Ala Phe Ala Val Gly Ala Asp Gly Thr Asn Trp Ala1685 1690 1695Glu Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg1700 1705 1710Arg Asn Gly His Arg Val Leu Ala Val Leu Arg Gly Ser Ala Val1715 1720 1725Asn Ser Asp Gly Ala Ser Asn Gly Leu Ala Ala Pro Asn Gly Gly1730 1735 1740Ala Gln Gln Arg Val Ile Arg Gln Ala Leu Ala Asp Ala Gly Leu1745 1750 1755Thr Ala Pro Asp Val Asp Ala Leu Glu Ala His Gly Thr Gly Thr1760 1765 1770Ala Leu Gly Asp Pro Ile Glu Ala Gln Ala Val Leu Ala Thr Tyr1775 1780 1785Gly Gln Gly Arg Pro Ala Asp Arg Pro Leu Trp Leu Gly Ser Leu1790 1795 1800Lys Ser Asn Ile Gly His Ser Ala Ala Ala Ala Gly Val Gly Gly1805 1810 1815Val Ile Lys Met Val Glu Ala Met Arg His Gly Val Leu Pro Pro1820 1825 1830Thr Leu His Ala Asp Glu Pro Thr His Glu Val Asp Trp Ser Val1835 1840 1845Gly Ala Val Glu Leu Leu Thr Thr Ala Arg Asp Trp Pro Glu Thr1850 1855 1860Gly Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Val Ser Gly1865 1870 1875Thr Asn Ala His Val Ile Leu Glu Gln Gly Pro Asp Leu Ala Pro1880 1885 1890Gly Gly Val Pro Gly Val Gln Glu Asp Pro Ala Pro Arg Ala Ala1895 1900 1905Gly Gly Cys Ala Gly Asn Ala Val Pro Trp Leu Leu Ser Gly Arg1910 1915 1920Ser Ala Arg Ala Leu Arg Asp Gln Ala Ala Arg Leu Ala Gly His1925 1930 1935Leu Thr Arg Gly Asp Pro Ser Ala Glu Ala Ile Gly His Ala Leu1940 1945 1950Leu Thr Ser Arg Thr Ala Phe Glu His Arg Ala Val Val Leu Gly1955 1960 1965
Gly Gly Thr Val Asp Leu Val Glu Gly Leu Asp Ala Leu Ala Ala1970 1975 1980Gly Glu Pro Ala Pro Ser Val Val Ala Gly Ala Pro Arg Pro Thr1985 1990 1995Gly Arg Gly Pro Val Phe Val Phe Pro Gly Gln Gly Gly Gln Trp2000 2005 2010Ser Gly Met Ala Ser Glu Leu Leu Asp Thr Cys Pro Ala Phe Ala2015 2020 2025Ala Arg Trp Ala Glu Cys Glu Arg Ala Phe Ala Pro His Met Asp2030 2035 2040Val Ser Leu Thr Glu Ala Val Arg Asp Ala Ala Ala Leu Glu Arg2045 2050 2055Val Asp Val Val Gln Pro Val Leu Phe Ala Val Met Val Ser Leu2060 2065 2070Val Glu Val Trp Arg Ser Tyr Gly Val Arg Pro Ala Ala Val Ile2075 2080 2085Gly His Ser Gln Gly Glu Ile Ala Ala Ala Cys Val Ala Gly Ala2090 2095 2100Leu Ser Leu Asp Asp Ala Ala Arg Val Val Ala Leu Arg Ala Arg2105 2110 2115Ala Leu Gly Val Leu Ala Gly Ala Gly Gly Met Val Ser Val Ala2120 2125 2130Leu Pro Pro Ala Glu Thr Glu Gly Trp Leu Arg Arg Trp Glu Asp2135 2140 2145Arg Ile Ser Val Ala Ala Val Asn Gly Pro Ser Ser Val Val Val2150 2155 2160Ser Gly Glu Pro Ala Ala Leu Glu Glu Leu Val Glu Gln Ala Arg2165 2170 2175Thr Arg Asp Val Arg Val Arg Arg Ile Glu Val Asp Tyr Ala Ser2180 2185 2190His Ser Ala Gln Val Ala Arg Ile Glu Asp Glu Val Leu Arg Leu2195 2200 2205Leu Glu Pro Ile Arg Pro Arg Thr Ser Glu Val Pro Phe Phe Ser2210 2215 2220Thr Val Ser Thr Gln Trp Gln Asp Thr Thr Ala Met Asp Ala Ala2225 2230 2235Tyr Trp Tyr Arg Asn Leu Arg Asp Pro Val Leu Phe Ala Pro Ser2240 2245 2250Val Gly Ala Leu Val Asp Gln Gly His Thr Val Phe Val Glu Val2255 2260 2265Ser Pro His Pro Val Leu Thr Ser Gly Leu Leu Glu Thr Ala Glu2270 2275 2280Arg Ala Asp Val Asp Leu Thr Val Thr Gly Thr Leu Arg Arg Gly2285 2290 2295Glu Gly Gly Leu Ala Arg Met Arg Ala Ser Leu Ala Glu Leu Trp2300 2305 2310
Val His Gly Thr Pro Val Asp Trp Ser Ala Ala Phe Asp Pro Ala2315 2320 2325Pro Ala Gly Pro Val Pro Leu Pro Thr Tyr Ala Phe Gln Arg Asp2330 2335 2340Arg Tyr Trp Pro Asp Pro Arg Pro Ala Ser Ala Asp Pro Val Tyr2345 2350 2355Glu Thr Phe Trp Arg Ala Val Asp Glu Ala Asp Leu Pro Ala Leu2360 2365 2370Thr Gly Thr Leu Gly Val Thr Asp Asp Gln Pro Leu Arg Glu Val2375 2380 2385Leu Pro Ala Leu Ser Ala Trp Arg Arg Ser Arg Thr Glu Gln Ala2390 2395 2400Val Thr Asp Ser Trp Arg Tyr Arg Val Cys Trp Lys Arg Leu Pro2405 2410 2415Asp Ala Ala Pro Ala Glu Leu Pro Gly Thr Trp Leu Leu Val Thr2420 2425 2430Thr Glu Gly Ala Ala Gly Asp Pro Ser Ala Ala Ala Ala Leu Gln2435 2440 2445Ala Val Arg Asp Ala Ala Gly His Thr Val Thr Leu Ala Val Asp2450 2455 2460Ser Asp Asp Glu Pro Ala Ser Leu Ala Ala Ala Leu Arg Glu Thr2465 2470 2475Leu Arg Gly Thr His Pro Ala Gly Val Val Thr Leu Thr Gly Thr2480 2485 2490Asp Val Ser Pro His Pro Val Ser Pro Val Val Pro Val Gly Thr2495 2500 2505Ala Leu Thr Val Thr Leu Leu Gln Ala Leu Asp Ala Ala Asp Val2510 2515 2520Asp Ala Pro Leu Trp Cys Leu Thr Arg Gly Ala Val Ala Thr Asp2525 2530 2535Asp Asp Thr Ala Gly Pro Gly Ser Pro Leu Gln Ser Ala Leu Trp2540 2545 2550Ala Leu Gly Arg Ile Ala Ala Val Glu Ser Pro Gly Asn Trp Gly2555 2560 2565Gly Leu Val Asp Leu Pro Asp Thr Phe Asp Asp Ser Ala Ala Arg2570 2575 2580Arg Leu Val Ser Val Leu Ala Ser Leu Asp Gly Glu Asp Gln Val2585 2590 2595Ala Leu Arg Val Ser Gly Ala Tyr Gly Arg Arg Leu Met Arg Ala2600 2605 2610Asn Pro Thr Ala Ser Pro Gly Ser Gly Trp Arg Pro Arg Gly Thr2615 2620 2625Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly Gly Arg Val Ala2630 2635 2640Arg Arg Gly Ser Gln Ala Pro Gly Val Asp Asp Leu Val Ala Glu
2660 2665 2670Leu Ser Gly Leu Gly Ala Gln Val Thr Val Asp Ser Cys Asp Leu2675 2680 2685Ser Val Ala Ser Glu Ala Phe Ala Leu Val Asp Arg Ile Gln Arg2690 2695 2700Asp Gly Asp Arg Ile Gly Ala Val Ile His Thr Ala Gly Ala Gly2705 2710 2715Gly Leu Gly Pro Leu Val Asp Ala Gly Leu Asp Asp Met Glu Leu2720 2725 2730Ala Met Ala Gly Lys Val Ala Gly Ile Asp Asn Leu Glu Arg Ala2735 2740 2745Leu Asp Asp Gln Gln Leu Asp Ala Val Val Tyr Phe Ser Ser Ile2750 2755 2760Ser Ala Ser Trp Gly Ala Gly Asp His Gly Ile Tyr Ala Ala Ala2765 2770 2775Asn Ala Val Leu Asp Ala Arg Ala Glu Ala Arg Arg Ala Ala Gly2780 2785 2790Val His Thr Val Ser Val Ala Trp Ala Pro Trp Gly Gly Gly Gly2795 2800 2805Met Ile Asp Asp Pro Ala Val Ala Asp Thr Leu Asn Arg Met Gly2810 2815 2820Leu Pro Leu Val Asp Pro Asp Leu Ala Ile Ser Gly Leu Ala Thr2825 2830 2835Ile Leu Ala Glu Gly Glu Glu Ser Leu Leu Leu Val Asp Val Asp2840 2845 2850Trp Gly Arg Phe Ile Pro Gln Phe Thr Leu Arg Arg Pro Ser Arg2855 2860 2865Leu Phe Asp Glu Leu Pro Glu Ala Arg Ala Ala Glu Ala Asp Thr2870 2875 2880Gly Pro Ala Lys Ala Asp Ala Pro Ser Pro Leu Ala Gly Arg Leu2885 2890 2895Ala Gly Leu Ser Lys Ala Lys Arg Ala Thr Ala Leu Arg Asp Leu2900 2905 2910Val Arg Glu His Val Ala Ala Val Leu Gly His Asn Asp Pro Ala2915 2920 2925Ala Val Asp Ala Gly Arg Ala Leu Lys Asp Leu Gly Phe Asp Ser2930 2935 2940Leu Thr Ala Val Glu Leu Arg Asp Arg Leu Ser Thr Val Ala Ala2945 2950 2955Met Arg Leu Pro Ala Thr Leu Val Phe Asp His Pro Thr Ile Ala2960 2965 2970Glu Leu Ala Asp Phe Leu Ala Arg Gly Leu Glu Pro Glu Thr Ala2975 2980 2985Arg Pro Thr Ala Ala Pro Ala Thr Val Val Arg Val Asp Gln Asp2990 2995 3000Glu Pro Val Ala Ile Val Ala Met Ala Cys Arg Tyr Pro Gly Asp3005 3010 3015
Ile Ala Ser Ala Glu Glu Leu Trp Arg Ala Val Arg Asp Glu Lys3020 3025 3030Asp Leu Ile Ser Pro Phe Pro Ile Asn Arg Gly Trp Pro Val Asp3035 3040 3045Arg Leu Leu Asp Ala Asp Pro Asp Arg Pro Gly Thr Ser Tyr Val3050 3055 3060Asp His Gly Gly Phe Leu His Asp Ala Gly Asp Phe Asp Pro Gly3065 3070 3075Phe Phe Gly Ile Ser Pro Arg Glu Ala Gln Ala Met Asp Pro Gln3080 3085 3090Gln Arg Leu Leu Leu Glu Ser Ser Trp Glu Val Leu Glu Arg Ala3095 3100 3105Gly Met Val Pro Lys Ser Leu Arg Gly Ser Arg Thr Gly Val Tyr3110 3115 3120Val Gly Leu Thr Asp Gln Ala Tyr Gly Thr Arg Leu Arg Gly Ser3125 3130 3135Leu Asp Gly Met Glu Gly Phe Leu Val Ser Ala Ser Ser Asn Val3140 3145 3150Ala Ser Gly Arg Ile Ser Tyr Ser Leu Gly Leu Gln Gly Pro Ala3155 3160 3165Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His3170 3175 3180Leu Ala Thr Gln Ala Leu Arg Asn Gly Glu Cys Asp Leu Ala Ile3185 3190 3195Ala Gly Ala Ala Thr Val Met Pro Asp Pro Thr Ser Phe Met Ala3200 3205 3210Phe Ser Arg Gln Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Pro3215 3220 3225Phe Ala Ala Ala Ala Asp Gly Phe Ser Leu Gly Glu Gly Val Gly3230 3235 3240Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly His3245 3250 3255Pro Val Leu Ala Leu Ile Arg Gly Ser Ala Val Asn Gln Asp Gly3260 3265 3270Ala Ser Asn Gly Ile Thr Ala Pro Asn Gly Pro Ser Gln Glu Arg3275 3280 3285Val Ile Arg Gln Ala Leu Val Asn Ala Ala Leu Pro Ala Ser Ala3290 3295 3300Val Asp Val Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp3305 3310 3315Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Gln Asp Arg3320 3325 3330Pro Ala Asp Arg Pro Leu Arg Leu Gly Ser Val Lys Ser Asn Phe3335 3340 3345Gly His Thr Gln Ala Ala Ala Gly Met Ala Gly Val Ile Lys Met3350 3355 3360
Val Gln Ala Met Arg His Glu Leu Met Pro Arg Thr Leu His Val3365 3370 3375Asp Ala Pro Ser Pro His Val Asp Trp Ser Ser Gly Ala Val Glu3380 3385 3390Leu Leu Ala Glu Ala Arg Pro Trp Pro Arg Gly Asp Glu Pro Arg3395 3400 3405Arg Ala Gly Val Ser Ala Phe Gly Ile Ser Gly Thr Asn Ala His3410 3415 3420Val Val Leu Glu Glu Ala Ser Gln Glu Pro Thr Pro Asp Gly Ser3425 3430 3435Ala Gly Ala Pro Asp Thr Pro Asp Thr Pro Asp Ala Pro Val Glu3440 3445 3450Ala Asp Thr Gly Arg Pro Leu Pro Leu Val Val Ser Ala Arg Thr3455 3460 3465Pro Asp Ala Leu Arg Asp Gln Ala Ala Arg Leu Thr Ala Leu Leu3470 3475 3480Asp Arg Glu Glu His Pro Val Ser Asp Leu Ala Tyr Ser Leu Ala3485 3490 3495Thr Ala Arg Gly Val Leu Asp Arg Ala Ala Val Val Val Ala Ala3500 3505 3510Asp Pro Asp Glu Leu Arg Arg Asn Leu Ala Asp Leu Thr Thr Arg3515 3520 3525Ala Val Ala Glu Arg Arg Ala Glu Gly Gly Leu Ala Phe Leu Phe3530 3535 3540Thr Gly Gln Gly Ala Gln Arg Ala Gly Met Gly Arg Ser Leu Tyr3545 3550 3555Asp Ala Phe Pro Glu Phe Ala Ala Ala Phe Asp Glu Val Cys Ala3560 3565 3570Glu Leu Asp Arg His Leu Pro Arg Pro Leu Arg Thr Val Val Trp3575 3580 3585Ala Glu Pro Gly Thr Asp Glu Ala Ala Leu Leu Asp Gln Thr Leu3590 3595 3600Tyr Thr Gln Thr Gly Leu Phe Ala Val Glu Val Ala Leu Phe Arg3605 3610 3615Leu Leu Glu His Trp Gly Val Arg Pro Asp Ala Leu Leu Gly His3620 3625 3630Ser Val Gly Glu Leu Ala Ala Ala His Leu Ala Gly Val Trp Ser3635 3640 3645Thr Glu Asp Ala Ala Arg Val Val Ala Ala Arg Ala Arg Leu Met3650 3655 3660Gln Glu Leu Pro Glu Gly Gly Ala Met Leu Ser Val Ala Ala Ala3665 3670 3675Gly Asp Glu Val Ser Ala Val Leu Gly Asp Ala Ser Ala Glu Val3680 3685 3690Ala Val Ala Ala Val Asn Gly Pro Ala Ser Leu Val Leu Ser Gly3695 3700 3705Thr Glu Glu Ser Val Thr Ala Ala Gly Ala Arg Leu Ala Glu Ala
3710 3715 3720Gly Leu Arg Thr Lys Arg Leu Thr Val Ser His Ala Phe His Ser3725 3730 3735Ser Leu Met Glu Pro Met Leu Ala Ala Tyr Glu His Glu Leu Ala3740 3745 3750Gln Val Ala Phe Ala Glu Pro Ala Leu Pro Val Val Ser Asn Leu3755 3760 3765Thr Gly Glu Val Ala Gly Ala Glu Leu Cys Glu Pro Ala Tyr Trp3770 3775 3780Val Arg Gln Val Arg Gln Ala Val Arg Phe Ala Asp Gly Val Arg3785 3790 3795Thr Val Leu Asp Glu Gly Val Thr Thr Leu Leu Glu Leu Gly Pro3800 3805 3810Asp Gly Val Leu Thr Ala Met Ala Gln Glu Ser Ala Gly Glu Arg3815 3820 3825Ala Thr Gly Ile Ala Ala Gln Arg Arg Asp Arg Asp Gln Val Arg3830 3835 3840Thr Leu Leu Thr Ala Leu Gly Arg Leu His Val Arg Thr Glu Arg3845 3850 3855Val Asp Trp Ala Ala Phe Phe Arg Gly Thr Gly Ala Arg Arg Val3860 3865 3870Asp Leu Pro Thr Tyr Ala Phe Gln Arg Arg Arg Tyr Trp Leu Asp3875 3880 3885Thr Ser Ser Gly Gly Ala Glu Ala Leu Ala Gly Ala Gly Leu Ala3890 3895 3900Gly Thr Gly His Pro Leu Leu Thr Ala Ser Ala Thr Leu Pro Gly3905 3910 3915Thr Gly Glu Ser Leu Phe Ser Gly Ser Leu Pro Gly Ala Pro Asp3920 3925 3930Gly Arg Pro Leu Ser Gly Gly Glu Ile Leu Glu Leu Val Leu Trp3935 3940 3945Ala Gly Gly Asn Phe Gly Cys His Arg Ile Ala Gly Leu Asp Val3950 3955 3960Ala Gly Ser Val Pro His Ala Pro Gln Ala Pro Leu Gln Leu Val3965 3970 3975Val Ala Ala Pro Asp Glu Ser Gly Asn Arg Ala Phe Thr Leu His3980 3985 3990Leu Gly Pro Val Gly Gly Pro His Gly Pro Val Glu Gly Pro Trp3995 4000 4005Thr Arg Ile Ala His Gly Val Leu Gly Gly Thr Pro Thr Pro Leu4010 4015 4020Pro Pro Glu Pro Gly Thr Ala Ala Trp Pro Pro Ala Asp Ala Glu4025 4030 4035Pro Val Gly Ala Asp Leu Val Trp Arg Arg Glu Asp Glu Leu Phe4040 4045 4050Ala Glu Leu Glu Leu Ala Glu Arg Asn Ala Ala Asp Val Asp Arg4055 4060 4065
Phe Ala Leu His Pro Gly Leu Leu Ala Glu Val Met Glu Leu Ile4070 4075 4080Ala Gly Leu Ala Gly Glu Pro Val His Phe Thr Gly Val Thr Arg4085 4090 4095Tyr Ala Thr Gly Ala Thr Val Leu Arg Val His Leu Thr Arg Val4100 4105 4110Ala Pro Asp Thr Val Thr Ala Leu Leu Thr Asp Ala Glu Gly Glu4115 4120 4125Pro Val Leu Ser Val Asp Arg Val Gln Val Arg Ala Asp Gly Ala4130 4135 4140Ala Ala Val Arg Ser Ala Thr Ala Ala Ala Pro Asp Ala Leu Tyr4145 4150 4155Glu Leu Thr Trp Thr Pro Val Gly Ala Glu Ala Leu Pro Pro Asp4160 4165 4170Thr Gly Trp Ala Val Val Gly Val Pro Ala Gly Asp Leu Ala Lys4175 4180 4185Val Leu Glu Ala Gln Gly Ala Glu Val Ala Thr His Pro Asp Leu4190 4195 4200Ala Ser Leu Gly Ser Thr Ala Asp Arg Gly Asp Met Pro Gly Leu4205 4210 4215Val Val Leu Ser Val Glu Thr Ala Pro Gly Ala Pro Leu Glu Ser4220 4225 4230Ala Arg Leu Thr Val His His Thr Leu Arg Leu Val Gly Glu Leu4235 4240 4245Leu Ala Asp Thr Gln Leu Thr Gly Thr Arg Phe Ala Phe Val Thr4250 4255 4260Arg Ala Ser Val Ser Thr Gly Asp Gly Ala Ala Val Asp Pro Ala4265 4270 4275Gln Ala Ala Val Arg Gly Leu Leu Leu Ser Ala Gln Ala Glu His4280 4285 4290Pro Asp Arg Phe Val Val Val Asp Leu Gly Gly Arg Glu Glu Asp4295 4300 4305Ala Asp Leu Leu Thr Ala Ala Val Gly Thr Ser Leu Ala Ala Ala4310 4315 4320Glu Pro His Leu Ala Ile Arg Asp Gly Arg Leu Leu Val Pro Arg4325 4330 4335Leu Ala Arg Val Thr Glu Pro Pro Gln Ala Phe Ala Ala Gly Pro4340 4345 4350Glu Glu His Gly Thr Val Leu Val Thr Gly Ala Thr Gly Gly Ile4355 4360 4365Gly Thr Lys Ile Val Pro His Leu Val Ala Glu His Gly Val Arg4370 4375 4380Arg Leu Leu Leu Leu Ser Arg Lys Gly Pro Asp Asp Pro Arg Ala4385 4390 4395Ala Glu Leu Gly Arg Glu Leu Ala Ala Tyr Gly Ala Glu Ala Thr4400 4405 4410
Phe Thr Ala Cys Asp Ile Ala Asp Arg Ala Ala Leu Glu Ala Val4415 4420 4425Leu Ala Glu Val Pro Ala Glu His Pro Val Thr Ala Val Val His4430 4435 4440Ile Ala Gly Val Val Asp Asp Gly Val Leu Thr Thr Leu Ser Pro4445 4450 4455Glu Arg Val Asp Thr Val Leu Arg Pro Lys Ala Glu Ala Ala Gln4460 4465 4470His Leu His Glu Leu Thr Ala Gly Leu Glu Leu Ser His Phe Val4475 4480 4485Leu Phe Ser Ser Gly Val Gly Val Leu Gly Gly Ala Gly Gln Ala4490 4495 4500Asn Tyr Ala Ala Ala Asn Ala Phe Leu Asp Ala Leu Ala Gln Thr4505 4510 4515Arg Gln Ala Ala Gly Leu Pro Ala Ser Ser Leu Ala Trp Gly Leu4520 4525 4530Trp Glu Thr Asp Met Gly Met Ser Ala Arg Leu Ser Glu Val Asp4535 4540 4545Arg Arg Arg Met Ala Gln Ala Gly Val Leu Ala Leu Thr Pro Gln4550 4555 4560Gln Gly Ile Ala Leu Phe Asp Arg Ala Trp Asn Ser Gly Ala Ala4565 4570 4575Thr Leu Val Pro Met Ser Leu Asp Thr Ala Val Leu Arg Arg Lys4580 4585 4590Ala Ala Asp Ser Ala Leu Pro Ala Pro Phe Arg Ala Leu Val Arg4595 4600 4605Thr Pro Leu Arg Arg Ala Ala Ala Gly Pro Ala Gln Ala Ala Gly4610 4615 4620Gln Ser Phe Ala Gln Arg Leu Ala Glu Gln Pro Gly Ser Ser Arg4625 4630 4635Arg Arg Leu Leu Leu Glu Leu Ile Gln Arg Gln Val Gly Thr Val4640 4645 4650Leu Asp Tyr Gly Ala Asp Thr Leu Leu Asp Ala Arg Arg Thr Phe4655 4660 4665Arg Glu Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn4670 4675 4680Arg Leu Val Ala Ala Thr Gly Val Gln Leu Ser Ala Ala Leu Val4685 4690 4695Phe Asp His Pro Thr Ala Asp Ala Leu Ala Glu Tyr Leu Glu Ser4700 4705 4710Lys Val Leu Arg Ser Gln Val Gly Ala Pro Leu Pro Val Leu Thr4715 4720 4725Gln Leu Asp His Leu Glu Ala Ala Leu Ala Ala Pro Pro Ala Asp4730 4735 4740Thr Ala Thr Arg Glu Gln Ile Ala Ala Arg Leu Arg Ala Leu Ala4745 4750 4755Ser Thr Trp Ser Ala Gln Pro Asp Asp Gly His Gly Ala Asp Asp
4760 4765 4770Gly Asp Ile Ser Ser Lys Leu Asp Ser Ala Thr Asp Glu Glu Leu4775 4780 4785Phe Asp Phe Ile Ser Gly Glu Phe Gly Glu Asp4790 4795<210>4<211>3835<212>PRT<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)<400>1Val Ala Gly Ser Ser Thr Thr Ser Pro Arg Ser Thr Arg Pro Ser Leu1 5 10 15Ala Ser Pro Arg Val Arg Arg Arg Pro Trp Thr Leu Ser Ser Ala Cys20 25 30Ser Ser Arg Pro Ala Gly Lys Arg Trp Asn Arg Pro Val Ser Thr Ser35 40 45Thr Arg Tyr Ala Ala Ala Val Pro Glu Cys Ser Pro Ala Ser Ala Ser50 55 60Arg Thr Thr Ala Leu Cys Trp Pro Pro His Arg Ala Gly Trp Thr Ala65 70 75 80Thr Pro Pro Pro Ala Pro Pro Thr Ala Ser Cys Pro Ala Ala Ser Arg85 90 95Thr Ser Trp Ala Trp Arg Ala Pro Pro Ser Pro Ser Thr Pro Pro Ala100 105 110Pro Pro His Trp Trp Pro Cys Thr Ser Pro Cys Arg Arg Cys Ala Thr115 120 125Ala Ser Ala Thr Ser Arg Trp Arg Ala Gly Arg Arg Arg Cys Pro Pro130 135 140Pro Pro Ser Thr Trp Pro Cys Pro Val Ser Ala His Trp His Pro Thr145 150 155 160Ala Ala Pro Arg Arg Ser Arg Arg Arg Pro Thr Val Pro Asp Gly Ala165 170 175Arg Glu Ser Val Ser Ser Pro Ser Ser Gly Cys Pro Thr Pro Ala Gly180 185 190Ser Gly His Arg Val Leu Ala Val Leu Arg Gly Ser Ala Val Asn Gln195 200 205Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gln Gln210 215 220Arg Val Ile Arg Gln Ala Leu Ala Asn Ala Gly Leu Thr Pro Ala Asp225 230 235 240Val Asp Ile Val Glu Ala His Gly Thr Gly Thr Ser Leu Gly Asp Pro245 250 255Ile Glu Ala Asp Ala Leu Leu Ser Thr Tyr Gly Gln Ala Arg Pro Ala260 265 270Asp Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn Ile Gly His Ser275 280 285Gly Ala Ala Ala Gly Val Ala Gly Val Ile Lys Met Val Gln Ala Leu290 295 300
Arg His Gly Val Met Pro Arg Thr Leu His Ala Glu Glu Pro Thr Pro305 310 315 320Asn Val Asp Trp Ser Ser Gly Ala Val Glu Leu Leu Asn Arg Ala Arg325 330 335Asp Trp Pro Ala Ser Gly Thr Arg Arg Arg Ala Ala Val Ser Ser Phe340 345 350Gly Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Glu Ala Pro Gln355 360 365Asp Ser Gly Pro Glu Thr Gly Asp Glu Ala Asp Pro Ser Pro Glu Gly370 375 380Thr Pro Trp Pro Leu Leu Pro Trp Val Leu Ser Ala Arg Ser Glu His385 390 395 400Ala Leu Arg Gly Gln Ala Arg Ala Leu His Thr His Leu Leu Ala His405 410 415Pro Glu Pro Ala Asp Thr Asp Val Ala Leu Ser Leu Ala Thr Thr Arg420 425 430Thr Gly Leu Glu Tyr Arg Ala Ala Val Leu Ala Ala Asp Arg Asp Gly435 440 445Phe Leu Asn Ala Leu Glu Ala Leu Ala Asp Asp Arg Pro Thr Asn Gly450 455 460Val Leu Arg Gly Thr Ala Ala Glu Gly Lys Ala Val Phe Val Phe Pro465 470 475 480Gly Gln Gly Ala Gln Trp Thr Gly Met Ala Arg Glu Leu Leu Asp Thr485 490 495Ser Pro Val Phe Ala Ala Lys Ala Ala Glu Cys Ala Ala Ala Ile Glu500 505 510Glu Phe Val Asp Phe Lys Val Leu Asp Val Leu Arg Asp Glu Pro Gly515 520 525Ala Ala Ser Met Asp Arg Ile Glu Val Val Gln Pro Val Leu Phe Thr530 535 540Val Met Val Ser Leu Ala Glu Leu Trp Arg Ser Phe Gly Ile Gln Pro545 550 555 560Asp Ala Val Val Gly Ser Ser Gln Gly Glu Ile Ala Ala Ala His Val565 570 575Ala Gly Gly Leu Thr Leu Glu Asp Ala Ala Arg Val Ile Cys Leu Arg580 585 590Ser Arg Leu Leu Ala Glu Thr Leu Val Gly Lys Gly Ala Val Ala Ser595 600 605Val Ala Leu Pro Ala Asp Gln Val Arg Glu Arg Leu Arg Arg Trp Asp610 615 620Gly Arg Leu Ser Val Ala Gly Val Asn Gly Pro Arg Leu Val Ala Val625 630 635 640Ala Gly Asp Asp Ala Ala Leu Ala Glu Phe Val Glu Glu Cys Ala Arg645 650 655Asp Asp Ile Arg Ala Arg Thr Val Ala Ala Thr Val Pro Thr His Cys660 665 670
Ala Leu Val Asp Pro Leu Arg Glu Arg Leu Leu Glu Leu Leu Ala Pro675 680 685Val Arg Pro Arg Thr Gly Thr Val Pro Leu Tyr Ser Thr Val Thr Gly690 695 700Gly Leu Leu Asp Thr Ala Thr Met Asp Ala Gly Tyr Trp Tyr Asp Asn705 710 715 720Thr Arg Ala Pro Val Leu Phe Glu Pro Val Val Arg Thr Leu Leu Ala725 730 735Glu Gly His His Ala Phe Val Glu Ser Ser Ala His Pro Val Leu Ala740 745 750Met Ala Val Glu Gln Thr Val Asp Ala Thr Gly Ala Pro Gly Val Val755 760 765Val Glu Ser Leu Arg Arg Asp Glu Gly Gly Pro Gly Arg Met Leu Thr770 775 780Ser Leu Thr Lys Ala His Leu Gly Gly Val Arg Val Asp Trp Pro Thr785 790 795 800Val Phe Ala Gly Thr Gly Ala Arg Thr Val Asp Leu Pro Thr Tyr Ala805 810 815Phe Gln Arg Thr Arg Tyr Trp Ala Glu Thr Ala Asp Arg Thr Gly Asp820 825 830Val Gly Ser Val Gly Leu Ser Pro Val Asp His Pro Leu Leu Gly Ala835 840 845Leu Val Arg Met Ala Asp Gly Asp Gly Ala Val Leu Thr Gly Arg Leu850 855 860Ser Leu His Thr His Gly Trp Leu Ala Asp His Gly Val Ala Asp Gln865 870 875 880Val Ile Phe Pro Gly Thr Gly Phe Val Glu Leu Ala Val Leu Ala Gly885 890 895Asp Gln Val Gly Cys Gly Arg Ile Glu Glu Leu Thr Leu His Thr Pro900 905 910Leu Val Val Pro Arg Thr Gly Ala Leu Val Val Gln Val Asn Val Gln915 920 925Ala Ala Asp Asp Thr Gly Ala Arg Ala Leu Gly Val Tyr Ser Arg Pro930 935 940Asp Asp Ala Gly Ala Asp Met Val Trp Thr Arg His Ala Ser Gly Val945 950 955 960Leu Val Pro Glu Asp Thr Val Asp Ala Glu Asp Thr Asp Gly Leu Ser965 970 975Gly Val Trp Pro Pro Glu Gly Ala Glu Pro Val Ala Ile Ser Gly Leu980 985 990Tyr Asp Gly Met Ala Ala Ala Gly Tyr Gln Tyr Gly Pro Gly Phe Arg995 1000 1005Gly Leu Ser Arg Ala Trp His Leu Asp Gly Asp Val Tyr Ala Glu1010 1015 1020Val Ala Leu Pro Ala Asp Gln Thr Ser Ala Ala Glu Arg Tyr Gly1025 1030 1035Leu His Pro Ala Leu Phe Asp Ala Ala Leu His Ala Met Phe Thr
1040 1045 1050Trp Asp Gly Asp Asp Gly Gly Gly Val Gly Met Pro Phe Ser Trp1055 1060 1065Thr Gly Val Arg Leu His Ala Thr Gly Cys Ala Arg Leu Arg Val1070 1075 1080Arg Leu Ala Arg Arg Gly Glu Ser Asp Phe Thr Val Thr Leu Thr1085 1090 1095Asp Glu Ala Gly Asp Pro Val Val Ser Val Asp Ser Leu Val Val1100 1105 1110Arg Arg Met Thr Gly Ala Ala Pro Asp Thr Val Arg Thr Asp Thr1115 1120 1125Leu Tyr Arg Leu Asp Trp Lys Thr Val Arg Ala Gly Glu Glu Thr1130 1135 1140Ser Ala Pro Arg Cys Val Leu Leu Gly Thr Asp Pro Leu Gly Val1145 1150 1155Ala Ala Ala Leu Pro Gly Thr Ala Arg Val Ala Asp Val Glu Arg1160 1165 1170Leu Ala Glu Leu Ala Ala Ala Gly Gly Pro Val Thr Ala Leu Leu1175 1180 1185Pro Val Ala Gly Asp Gly Ser Ala Glu Arg Ile Gly Asp Pro Val1190 1195 1200Ile Asp Thr Val Ala Val Leu Gln Ser Trp Ile Ala Asp Gly Arg1205 1210 1215Leu Asp Asp Thr Arg Leu Val Val Leu Thr Arg Gly Ala Val Ala1220 1225 1230Thr Ala Pro Arg Glu Asp Val Thr Asp Leu Ala Ala Ala Gly Val1235 1240 1245Trp Gly Leu Met Arg Ser Ala Gln Asn Glu His Pro Gly Arg Phe1250 1255 1260Gly Leu Ile Asp Leu Asp Thr Ala Glu Ser Ser Thr Ala Ala Leu1265 1270 1275Gly Thr Ala Leu Ala Ser Glu Glu Glu Gln Leu Ala Leu Arg Asp1280 1285 1290Gly Val Leu Arg Gly Pro Ser Leu Thr Arg Trp Asp Pro Gly Thr1295 1300 1305Thr Ile Leu Pro Pro Ala Gly Glu Ser Ala Trp Arg Leu Glu Asn1310 1315 1320Thr Arg Pro Gly Thr Ile Glu Gly Leu Asp Ala Ala Pro Cys Pro1325 1330 1335Glu Leu Leu Ala Pro Leu Gly Pro Arg Gln Val Arg Ile Ala Val1340 1345 1350Arg Ala Ala Gly Ile Asn Phe Lys Asp Val Val Val Ala Leu Asp1355 1360 1365Leu Val Pro Gly Leu Thr Gly Leu Gly Gly Glu Val Ala Gly Val1370 1375 1380Ile Thr Ala Val Gly Ala Glu Val Thr Tyr His Arg Val Gly Asp1385 1390 1395
Gln Val Phe Gly Leu Ala Thr Glu Val Phe Gly Pro Val Thr Val1400 1405 1410Ala Asp Glu Arg Thr Val His Arg Ile Pro Asp Gly Trp Thr Phe1415 1420 1425Glu Glu Ala Ala Ser Val Ala Val Thr Tyr Met Thr Ala Tyr Tyr1430 1435 1440Gly Leu Val Asp Leu Gly Gly Leu Arg Ala Gly Gln Ser Val Leu1445 1450 1455Ile His Ala Gly Ala Gly Gly Val Gly Ser Ala Ala Val Gln Leu1460 1465 1470Ala Arg His Leu Gly Ala Glu Val Tyr Ala Thr Ala Ser Pro Gly1475 1480 1485Lys Trp Gly Ala Leu Arg Ala Gln Gly Leu Asp Gly Ala His Ile1490 1495 1500Ala Asn Ser Arg Thr Leu Asp Phe Glu Gln Trp Phe Leu His Ser1505 1510 1515Thr Asp Gly Arg Gly Met Asp Val Val Leu Asp Cys Leu Ala Gly1520 1525 1530Glu Phe Val Asp Ala Gly Leu Arg Leu Leu Pro Arg Gly Gly His1535 1540 1545Phe Leu Glu Met Gly Lys Thr Asp Lys Arg Asp Ala Glu Gln Val1550 1555 1560Gly Ala Ala His Pro Gly Val Val Tyr Arg Ala Tyr Asp Leu Pro1565 1570 1575Glu Ala Gly Pro Asp Arg Ile His Glu Met Leu Val Thr Leu Thr1580 1585 1590Gly Leu Phe Glu Asp Gly Val Leu Arg Pro Pro His Val Asn Ala1595 1600 1605Trp Asp Ile Arg Asp Ala Arg Ala Ala Phe Arg Ala Leu Ser Gln1610 1615 1620Ala Ala Leu Val Gly Lys Ala Val Leu Thr Leu Pro Gly Val Pro1625 1630 1635Phe Ser Pro His Gly Thr Val Leu Ile Thr Gly Gly Thr Gly Met1640 1645 1650Leu Gly Ala Leu Leu Ala Arg His Leu Val Thr Ala His Asn Val1655 1660 1665Thr Ser Leu Leu Leu Thr Ser Arg Arg Gly Pro Asp Ala Pro Gly1670 1675 1680Ala Ala Glu Leu Thr Ala Glu Leu Thr Glu Ala Gly Ala Arg Val1685 1690 1695Asp Val Val Ala Cys Asp Val Ala Asp Arg Asp Gln Leu Ala Ala1700 1705 1710Leu Leu Ala Gly Ile Pro Ala Glu Arg Pro Leu Thr Ala Val Leu1715 1720 1725His Thr Ala Ala Ala Leu Asp Asp Gly Leu Val Glu Ser Leu Thr1730 1735 1740
Ala Glu Arg Thr Arg Ala Val Leu Arg Pro Lys Val Asp Gly Ala1745 1750 1755Val Gln Leu His Glu Leu Thr Arg Asp Leu Asp Leu Gly Ala Phe1760 1765 1770Val Leu Phe Ser Ser Leu Ala Gly Thr Met Gly Ala Pro Gly Gln1775 1780 1785Gly Asn Tyr Ala Ala Ala Asn Val Met Leu Asp Ala Leu Ala Ala1790 1795 1800His Arg Arg Ala Gln Gly Leu Pro Gly Leu Ser Leu Ala Trp Gly1805 1810 1815Phe Trp Asp Gln Arg Ser Glu Met Ser Gly Asn Leu Asp Asp Arg1820 1825 1830Asp Ile Gln Arg Met Ser Arg Gly Gly Ile Val Pro Met Ser Ser1835 1840 1845Glu Glu Gly Leu Ala Thr Phe Asp Leu Ala Cys Arg Thr Asp Arg1850 1855 1860Ala Gln Leu Val Pro Ala Arg Leu Asp Pro Ala Ala Leu Ala Gly1865 1870 1875Thr Thr Gly Arg Val Pro Pro Val Met Arg Ala Leu Ile Pro Ala1880 1885 1890Pro Ala Arg Arg Ser Gly Arg Arg Ser Ala Glu Ala Gly Asp Asp1895 1900 1905Ser Leu Arg Ala Arg Leu Val Pro Leu Thr Gly Thr Glu Arg Thr1910 1915 1920Arg Ile Leu Leu Gln Leu Val Arg Ser Asn Ala Ala Thr Val Leu1925 1930 1935Gly His Thr Asp Pro Asp Ala Val Gly Ala Ala Thr Pro Phe Arg1940 1945 1950Glu Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Phe Arg Asn Arg1955 1960 1965Leu Thr Gly Ala Val Gly Phe Arg Leu Pro Val Thr Val Val Phe1970 1975 1980Asp His Pro Thr Pro Gly Ala Leu Thr Asp Phe Leu Ala Ala Glu1985 1990 1995Leu Leu Gly Gly Leu Asp Glu Thr Asp Ala Pro Ala Gly Pro Ser2000 2005 2010Arg Ala Thr Pro Ala Ala Val Ala Arg Thr Asp Glu Glu Pro Leu2015 2020 2025Val Ile Val Gly Met Ala Cys Arg Tyr Pro Gly Gly Ile Ser Thr2030 2035 2040Pro Glu Glu Leu Trp Asp Phe Val Leu Ala Glu Arg Asp Ala Ile2045 2050 2055Ser Gly Phe Pro Glu Asp Arg Gly Trp Arg Arg Glu Arg Ser Ala2060 2065 2070Asp Gly Ser Ala Pro Gln Gln Gly Gly Phe Leu Asp Arg Val Ala2075 2080 2085Glu Phe Asp Ala Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu
2090 2095 2100Thr Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr Ser Trp Glu2105 2110 2115Ala Leu Glu Arg Ala Gly Ile Ala Pro Gly Thr Leu Arg Gly Ser2120 2125 2130Arg Thr Gly Ile Phe Val Gly Ala Ala Ala Ser Gly Tyr Thr Ser2135 2140 2145Leu Phe Arg Arg Gly Ser Glu Ala Leu Ala Gly Tyr Gly Val Thr2150 2155 2160Gly Ala Ser Thr Ser Val Val Ser Gly Arg Val Ala Tyr Val Leu2165 2170 2175Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser2180 2185 2190Ser Leu Val Ala Leu His Thr Ala Ala Leu Ser Leu Arg Ala Gly2195 2200 2205Asp Cys Asp Leu Ala Leu Ala Gly Gly Val Ala Val Met Thr Ser2210 2215 2220Pro Phe Leu Phe Asp Asp Phe Ala Arg Gln Gly Gly Leu Ser Pro2225 2230 2235Asp Gly Arg Cys Lys Ala Phe Ala Gly Ser Ala Asp Gly Thr Gly2240 2245 2250Trp Ala Glu Gly Thr Gly Met Val Leu Leu Glu Arg Leu Ser Asp2255 2260 2265Ala Arg Arg Asn Gly His Pro Val Leu Ala Val Leu Arg Gly Ser2270 2275 2280Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn2285 2290 2295Gly Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Leu Asp Arg Ala2300 2305 2310Gly Leu Thr Pro Ala Asp Ile Asp Ala Val Glu Ala His Gly Thr2315 2320 2325Gly Thr Val Leu Gly Asp Pro Ile Glu Ala Gln Ala Val Leu Ala2330 2335 2340Thr Tyr Gly Arg Asp Arg Asp Pro Asp Arg Pro Val Leu Leu Gly2345 2350 2355Ser Leu Lys Ser Asn Ile Gly His Ser Gln Ala Ala Ala Gly Ile2360 2365 2370Gly Gly Val Ile Lys Thr Val Gln Ala Leu Leu His Gly Ile Leu2375 2380 2385Pro Arg Ser Leu His Ile Asp Glu Pro Thr Pro His Val Asp Trp2390 2395 2400Ser Ala Gly Ala Val Asp Leu Leu Thr Glu Thr Arg Ser Trp Pro2405 2410 2415Ala Thr Asp His Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Val2420 2425 2430Ser Gly Thr Asn Ala His Ala Ile Leu Glu Gln Ala Thr Glu Pro2435 2440 2445
Glu Pro Pro Ile Val Asp Gln Ala Pro Leu Pro Val Thr Pro Trp2450 2455 2460Leu Leu Ser Gly His Asp Glu Gln Gly Leu Arg Ala Gln Ala Glu2465 2470 2475Thr Leu Val Ser Trp Leu Arg Glu Gln Pro Glu Gly Ser Val Thr2480 2485 2490Asp Ile Gly His Ala Leu Ala Thr Arg Arg Ala Ala Leu Glu His2495 2500 2505Arg Ala Ala Leu Pro Val Thr Asp Arg Asp Glu Ala Leu Ala Arg2510 2515 2520Leu Ala Glu Phe Ala Ala Gly Arg Val Pro Asp Gly Leu Leu Arg2525 2530 2535Gly Thr Ala Gln Glu Gly Cys Leu Ala Leu Leu Phe Ala Gly Gln2540 2545 2550Gly Thr Gln Arg Pro Gly Met Gly Arg Asp Leu Tyr Ala Ala Phe2555 2560 2565Pro Ala Phe Ala His Ala Phe Asp Glu Ala Cys Ala His Leu Asp2570 2575 2580Pro Leu Leu Gly Arg Pro Leu Arg Asp Thr Val Phe Thr Ala Glu2585 2590 2595Ala Ala Glu Leu Asp Arg Thr Ala Ile Thr Gln Pro Ala Leu Phe2600 2605 2610Ala Leu Glu Val Ala Leu Tyr Arg Leu Leu Glu Ser Trp Gly Val2630 2635 2640Ala His Ala Ala Gly Val Leu Asp Leu Pro Asp Ala Ala Arg Leu2645 2650 2655Val Ala Ala Arg Gly Arg Leu Met Gln Ala Leu Pro Pro Gly Gly2660 2665 2670Ala Met Leu Ala Val Gln Val Gly Glu Thr Glu Ala Thr Glu Ala2675 2680 2685Leu Gly Ala Val Leu Gly Glu Arg Ala Ala Thr Val Asp Leu Ala2690 2695 2700Ala Val Asn Gly Pro His Ser Val Val Phe Ser Gly Thr Ala Arg2705 2710 2715Ser Val Asp Ala Leu Asp Ala His Phe Thr Ala Arg Gly Arg Arg2720 2725 2730Thr Arg Arg Leu Thr Val Ser His Ala Phe His Ser Pro Leu Met2735 2740 2745Glu Pro Met Leu Asp Glu Phe Ala Glu Leu Val Ser Arg Leu Thr2750 2755 2760Phe Ala Ala Pro Arg Ile Pro Val Val Ser Asp Leu Thr Gly Ser2765 2770 2775Val Leu Gly Ala Gly Asp Leu Ala Asp Pro Arg His Trp Val Arg2780 2785 2790
His Ala Arg His Thr Val Arg Phe Ala Asp Gly Ile Asp Thr Leu2795 2800 2805Val Gly Ala Gly Val Thr Asp Phe Leu Glu Leu Gly Pro Asp Ala2810 2815 2820Thr Leu Ala Thr Met Ala Glu Asp Cys Phe Ala Thr Ala Pro Thr2825 2830 2835Gly Val Cys Thr Ser Leu Leu Arg Arg Asp Gly Ser Glu Pro Val2840 2845 2850Thr Leu Leu Met Ala Leu Ala Arg Ala His Val His Gly Val Thr2855 2860 2865Val Asp Trp Lys Ala Val Leu Ala Gly Thr Gly Ala Arg Trp Val2870 2875 2880Asp Leu Pro Thr Tyr Ala Phe Gln Arg Glu Ser Tyr Trp Pro Ala2885 2890 2895Glu Ser Thr Ala Gly Arg Ser Asp Pro Ser Ser Ala Gly Phe Asp2900 2905 2910Gly Asp Val Leu Phe Thr Gly Glu Leu Ser Leu Ala Ala Gln Pro2930 2935 2940Trp Leu Ala Asp His Arg Val Leu Asp Ala Val Leu Phe Pro Gly2945 2950 2955Thr Gly Phe Leu Glu Leu Ala Ser Trp Ala Gly Ser Arg Leu Asp2960 2965 2970Ala Gly Asp Leu Glu Glu Leu Val Val His Arg Pro Leu Val Leu2975 2980 2985Pro Glu His Gly Gly Val Thr Val Gln Val Val Val Gly Glu Ala2990 2995 3000Thr Asp Glu Asp Arg Arg Pro Val Ala Val Tyr Ser Arg Ala Ala3005 3010 3015Asp Asp Ala Gly Trp Thr Arg His Ala Glu Gly Leu Leu Ala Thr3020 3025 3030Gly Pro Ala Ala Gln Pro Ala Asp Pro Ser Ala His Trp Pro Pro3035 3040 3045Gln Gly Ala Glu Arg Val Asp Leu Asp Glu Phe Tyr Ala Gly Leu3050 3055 3060Ala Asp Ala Gly Thr Ala Tyr Gly Pro Val Phe Gln Gly Leu Thr3065 3070 3075Ala Val Trp Arg Leu Asp Gly Glu Ile Tyr Ala Asp Val Ala Leu3080 3085 3090Pro Ala Gln Ala Ala Asp Asp Ala Arg Gly Phe Gly Val His Pro3095 3100 3105Ala Leu Leu Asp Ala Ala Leu His Thr Leu Ala Phe Leu Pro Gly3110 3115 3120Ala Asp Arg Ser Ser Gly Pro Phe Leu Pro Phe Ala Trp Arg Asp3125 3130 3135Val Thr Val Pro Gly Pro Gly Ala Thr Ser Cys Arg Ile Arg Leu
3140 3145 3150Thr Pro Gly Asn Gly Thr Asp Glu Val Ala Ala Thr Leu Trp Asp3155 3160 3165Gly Asp Gly Arg Pro Leu Ala Ala Val Gly Gly Leu Ser Leu Arg3170 3175 3180Ser Val Ser Arg Thr Gln Leu Gly Thr Ser Ala Val Ala Ser Ser3185 3190 3195Leu Phe Arg Met Asp Trp Thr Pro Ala Ser Gln Pro Arg Ala Val3200 3205 3210Gly Ala Pro Thr Val Arg Trp Ala Val Val Gly Pro Asp Ala Pro3215 3220 3225Gly Thr Pro Asp Ile Asp His Tyr Ala Asp Leu Val Ala Leu Arg3230 3235 3240Arg His Leu Ala Asp Gly Gly Pro Val Pro Asp Gln Val Leu Leu3245 3250 3255Pro Cys Ala Pro Ser Ala Gly Gly Ala Asp Ala Gly Ala Ala Arg3260 3265 3270Asp Ala Val His Ala Ala Leu His Thr Leu Arg Thr Trp Ala Glu3275 3280 3285Asp Glu His Phe Ala Lys Ser Arg Leu Val Leu Cys Thr Arg Gly3290 3295 3300Ala Val Val Ala Gln Pro Gly Glu Gly Val Arg Asp Leu Ala His3305 3310 3315Ala Ala Val Trp Gly Leu Ala Arg Ser Ala Gln Leu Glu His Pro3320 3325 3330Asp Arg Phe Val Leu Val Asp Leu Asp Thr Gly Thr Thr Leu Asp3335 3340 3345Asp Leu Thr Arg Ser Gln Leu Leu Ala Arg Thr Glu Ser Thr Asp3350 3355 3360Ala Ala Gln Phe Ala Ile Arg Gly Ala Leu Thr Leu Val Pro Ala3365 3370 3375Val Thr Arg Gln Ala Gly Gln Val Pro Ala Pro Glu Ala Pro Trp3380 3385 3390Pro Ala Asp Gly Thr Thr Leu Ile Thr Gly Ala Gly Gly Met Ile3395 3400 3405Gly Gly Leu Leu Ala Arg His Leu Val Arg Glu His Gly Val Arg3410 3415 3420His Leu Leu Leu Leu Gly Arg Arg Gly Glu Asp Thr Pro Gly Met3425 3430 3435Ala Glu Leu Arg Arg Glu Leu Thr Asp Ala Gly Ala Asp Val His3440 3445 3450Val Thr Ala Cys Asp Ala Ala Asp Arg Glu Ala Leu Ala Ala Val3455 3460 3465Leu Gly Arg Ile Pro Ser Thr Ala Pro Leu Thr Ala Val Val His3470 3475 3480Ala Ala Gly Val Val Asp Asp Gly Val Leu Gly Ser Val Thr Asp3485 3490 3495
Glu Gln Val Asp Arg Val Leu Arg Pro Lys Ile Asp Ala Ala Val3500 3505 3510Asn Leu His His Leu Thr Ala Pro Leu Gly Leu Arg Ala Phe Val3515 3520 3525Val Cys Ser Ser Leu Ala Gly Ala Leu Gly Gly Gly Gly Gln Ser3530 3535 3540Ala Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Ala Leu Cys Leu Arg3545 3550 3555Arg Arg Ala Asp Gly Leu Pro Ala Leu Ser Leu Ala Trp Gly Pro3560 3565 3570Trp Glu Ser Ser Ala Gly Met Thr Ala Gln Leu Ala Ala Ala Asp3575 3580 3585Leu Arg Arg Ile Ser Arg Ala Gly Met Gln Pro Leu Thr Pro Asp3590 3595 3600Asp Gly Leu Ala Leu Phe Asp Ala Ala His Ala Thr Gly Glu Ala3605 3610 3615Val Leu Leu Pro Phe Arg Phe Glu Pro Gly Gly Leu Ser Thr Ala3620 3625 3630Asp Arg Ala Ser Leu Pro Pro Ala Leu Arg Pro Leu Val Pro Arg3635 3640 3645Pro Arg Arg Arg Pro Gly Asp Pro Val Pro Gly Leu Ser Gly Leu3650 3655 3660Arg Asp Arg Leu Arg Pro Leu Ser Gln Asp Asp Arg Thr Gly Ala3665 3670 3675Leu Glu Asn Leu Val Arg Ala Glu Val Ala Ser Val Leu Ala Leu3680 3685 3690Pro Ser Ala Asp Ala Val Pro Val Thr Lys Ala Phe Lys Thr Leu3695 3700 3705Gly Phe Asp Ser Leu Met Ala Val Asp Leu Arg Asn Arg Leu Ser3710 3715 3720Ala Leu Thr Gly Val Arg Leu Pro Ala Thr Leu Val Phe Asp His3725 3730 3735Pro Thr Pro Arg Ala Leu Ala Thr Arg Leu Leu Thr Gly Met Glu3740 3745 3750Leu Asp Thr Ala Thr Ala Thr Asp Pro Ala Leu Leu Ala Leu Arg3755 3760 3765Glu Leu Glu Thr Ala Val Arg Ser Met Ala Pro Gly Ala Asp Asp3770 3775 3780Arg Gly Ala Met Ala Thr Arg Leu Arg Val Leu Leu Thr Ala Leu3785 3790 3795Glu Glu Thr Ala Asp Asp Thr Asp Gly Ala Asp Thr Asp Gly Asp3800 3805 3810Thr Asp Leu Asp Ser Val Ser Thr Glu Glu Leu Val Asn Leu Leu3815 3820 3825Gly Asp Glu Phe Gly Leu Thr3830 3835
<210>5<211>3897<212>PRT<213>Streptomyces nanchangensis n.sp. NS3226<400>1Val Thr Asn Glu Ala Gln Leu Val Asp Tyr Leu Lys Lys Leu Ala Ala1 5 10 15Asp Leu Arg Gln Ala His Arg Arg Ile Lys Lys Leu Glu Ala Gly Glu20 25 30Thr Glu Pro Ile Ala Ile Val Gly Met Ala Cys Arg Leu Pro Gly Gly35 40 45Val Gly Ser Pro Glu Glu Leu Trp Asp Leu Val Leu Arg Gly Glu Asp50 55 60Ala Val Thr Asp Met Pro Ser Asp Arg Gly Trp Ala Leu Gly Glu Leu65 70 75 80Tyr Asp Val Asp Pro Asp Arg Pro Gly Thr Thr Tyr Ala Thr Gln Gly85 90 95Gly Phe Leu Arg Gly Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly Ile100 105 110Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu Leu115 120 125Glu Thr Ala Trp Glu Ala Leu Glu Asn Thr Gly Val Asp Pro Arg Ser130 135 140Leu Ala Gly Ser Arg Thr Gly Ile Phe Thr Gly Leu Met Tyr His Asp145 150 155 160Tyr Ala Ser Gly Pro Gly Thr Leu Pro Asp Glu Val Glu Gly Tyr Leu165 170 175Ser Thr Gly Met Ala Gly Ser Val Ala Ser Gly Arg Ile Ser Tyr Phe180 185 190Leu Glu Leu Glu Gly Pro Ala Val Thr Leu Asp Thr Ala Cys Ser Ser195 200 205Ser Leu Val Ala Leu His Leu Ala Val Gln Ala Leu Arg Asp Gly Glu210 215 220Cys Asp Leu Ala Leu Ala Gly Gly Ala Thr Val Met Ala Thr Pro Ala225 230 235 240Thr Phe Val Glu Asn Ser Arg Gln Arg Gly Leu Ala Thr Asp Gly Arg245 250 255Cys Lys Ala Phe Ala Ala Ala Ala Asp Gly Val Gly Trp Gly Glu Gly260 265 270Ser Ala Leu Leu Val Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly275 280 285His Asp Val Leu Ala Ile Val Arg Gly Ser Ala Val Asn Gln Asp Gly290 295 300Ala Ser Asn Gly Leu Thr Ser Pro Asn Gly Pro Ser Gln Glu Arg Val305 310 315 320Ile Glu Gln Ala Leu Ala Ser Ala Arg Leu Gly Phe Ala Asp Ile Asp325 330 335Val Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu
340 345 350Ala Gln Ala Leu Ile Ala Thr Tyr Gly Arg Glu Arg Pro Asp Ser Ser355 360 365Pro Leu Arg Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr Gln Ala370 375 380Ala Ala Gly Ala Ala Gly Ile Ile Lys Met Val Met Ala Met Arg His385 390 395 400Gln Gln Leu Pro Arg Thr Leu His Val Asp Arg Pro Thr Pro Glu Val405 410 415Asp Trp Ser Ala Gly Thr Val Glu Leu Leu Thr Glu Asn His Ala Trp420 425 430Pro Arg Thr Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ile435 440 445Ser Gly Thr Asn Ala His Val Ile Leu Glu Glu Pro Pro Thr Ala Asp450 455 460Pro Pro Thr Asp Thr Ala Lys Gly Ala Asp Leu Pro Ala His Gly Ala465 470 475 480Glu Arg Ala Asp Val Ser Asp Ser Met Val Ser Ala Val Leu Pro Val485 490 495Val Pro Val Pro Leu Ser Ala Ala Thr Pro Ala Ala Leu Pro Ala Gln500 505 510Ala Ala Arg Leu His Ala His Leu Leu Asp Arg Pro Asp Leu Pro Leu515 520 525Gly Asp Leu Ala Ala Ala Leu Ala Thr Thr Arg Thr Ala Phe Glu His530 535 540Arg Ala Val Leu Leu Thr Glu Ser Arg Glu Glu Leu Leu Gly Gly Leu545 550 555 560Ala Glu Leu Ala Arg Gly Glu Arg Pro Ala Gly Leu Val Asp Gly Val565 570 575Ala Asp Glu Val Arg Cys Ala Phe Leu Phe Thr Gly Gln Gly Ala Gln580 585 590Arg Ala Leu Asp Glu Val Cys Ala Glu Leu Gly Ala Arg Leu Asp Met610 615 620Pro Leu Leu Pro Leu Leu Leu Ala Asp Ala Asn Ser Ala Glu Ala Arg625 630 635 640Leu Leu Asp Arg Thr Leu Tyr Thr Gln Ser Ala Thr Phe Ala Leu Gly645 650 655Val Ala Leu Phe Arg Leu Leu Glu Glu Trp Gly Val Arg Pro Arg Leu660 665 670Leu Ser Gly His Ser Val Gly Glu Leu Thr Ala Thr His Val Ser Gly675 680 685Met Leu Ser Leu Ala Asp Ala Cys Glu Leu Val Ala Thr Arg Gly Arg690 695 700Leu Met Gln Glu Leu Pro Glu Gly Gly Ala Met Val Ser Val Ala Ala705 710 715 720
Thr Ala Asp Glu Val Leu Pro Leu Leu Ala Gly His Glu Ser Val Ala725 730 735Gly Val Ala Ala Val Asn Gly Pro Gly Ser Val Val Val Ser Gly Asp740 745 750Glu Asp Val Val Thr Gly Ile Ala Ala His Phe Thr Glu Leu Gly Arg755 760 765Arg Thr Arg Arg Ile Pro Val Ser His Ala Phe His Ser Pro Leu Met770 775 780Asp Pro Val Val Glu Pro Leu Gly Glu Val Ala Gly Arg Leu Ser Phe785 790 795 800Glu Pro Pro Arg Ile Pro Val Val Ser Ser Val Thr Gly Thr Val Leu805 810 815Asp Ala Ala Asp Trp Ala Asp Pro Ala Tyr Trp Ala Arg Gln Ala Arg820 825 830Glu Pro Val Arg Phe His Asp Val Val His Thr Leu Val Ala Glu Glu835 840 845Val Thr Val Phe Leu Glu Leu Gly Ala Asp Ala Ala Leu Thr Ser Met850 855 860Thr Glu Glu Thr Leu Ala Ala Ser Gly Thr Pro Thr Val Val Ala Pro865 870 875 880Ala Leu Arg Arg Gln Arg Pro Glu Val Arg Thr Leu Thr Ala Met Leu885 890 895Ala Gln Ala His Thr Ala Gly Val Pro Ile Asp Trp Arg Thr Phe Phe900 905 910Gly Gly Arg Pro Thr Ser Arg Val Pro Leu Pro Thr Tyr Ala Phe Gln915 920 925Gly Thr Arg Tyr Trp Leu Glu Thr Ala Pro Gly Ala Gly Asp Met Gly930 935 940Ala Ala Gly Leu Val Ala Ala Glu His Pro Leu Leu Gly Ala Thr Leu945 950 955 960Val Pro Ala Val Gly Gly Gly Arg Leu Phe Thr Gly Arg Leu Ser Val965 970 975Glu Ala Gln Pro Trp Leu Ala Asp His Ala Ile Asp Asp Ala Val Leu980 985 990Leu Pro Gly Thr Ala Val Ala Glu Leu Ala Leu Trp Ala Ala Arg His995 1000 1005Val Gly Leu Asn His Val Ala Asp Leu Val Leu Glu Val Pro Leu1010 1015 1020Ala Leu Pro Arg Gly Gly Gly Leu Arg Val Gln Leu Ala Val Asp1025 1030 1035Ser Pro Asp Ala Ser Gly Asp Arg Gly Phe Gly Leu Tyr Thr Gln1040 1045 1050Pro Glu Gly Ser Ala Asp Asp Val Trp Thr Arg His Ala Gly Gly1055 1060 1065Thr Leu Thr Ala Val Arg Thr Ala Ser Ala Glu Glu Leu Thr Val1070 1075 1080
Trp Pro Pro Ala Gly Ala Glu Lys Leu Asp Thr Asp Gly Cys Tyr1085 1090 1095Ala Asp Phe Ala Ala Ala Gly Val Arg Tyr Gly Pro Ala Phe Gln1100 1105 1110Gly Leu Arg Ala Val Trp Arg His Gly Glu Glu Val Tyr Ala Glu1115 1120 1125Val Arg Leu Pro Glu Asp Val Thr Gly Asp Ala Gly Glu Phe Cys1130 1135 1140Leu His Pro Ala Leu Ala Asp Ala Ala Leu His Ala Ser Ala Phe1145 1150 1155Val Pro Gly Glu Phe Gly Arg Glu Gln Arg Ala Arg Leu Pro Phe1160 1165 1170Ala Trp Arg Gly Val Ser Leu His Ala Ala Gly Ala Ser Phe Leu1175 1180 1185Arg Val Arg Leu Ala Pro Thr Gly Pro Asp Thr Leu Ala Leu Leu1190 1195 1200Phe Ala Asp Ala Ala Gly Arg Thr Val Ala Thr Val Glu Ser Leu1205 1210 1215Ala Val Arg Pro Ala Gly Ala Val Glu Ala Leu Asp Gly Ala Val1220 1225 1230Asp Ala Leu Leu Met Pro Ser Trp Val Pro Val Gly Gly Cys Glu1235 1240 1245Thr Pro Gly Arg Trp Ala Val Leu Gly Pro Gly Pro Leu Ala Gly1250 1255 1260Leu Pro His Ala Asp Val His Ala Asp Leu Ala Gly Leu Glu Ala1265 1270 1275Ala Val Glu Ala Gly Ala Glu Val Pro Asp Phe Ile Val Gly Thr1280 1285 1290Val Gly Thr Ser Asp Gly Ser Val Asp Ala Asp Ala Ala His Glu1295 1300 1305Ala Ala Glu Arg Ala Leu Ala Leu Leu Arg Ser Trp Leu Ala Gly1310 1315 1320Glu Arg Leu Gly Thr Ala Arg Leu Val Met Val Thr Trp Asn Ala1325 1330 1335Ala Ala Val Ala Asp Gly Asp Ala Pro Asp Pro Val Gln Ala Ala1340 1345 1350Val Trp Gly Leu Leu Ser Ser Ala Val Thr Glu His Pro Gly Arg1355 1360 1365Ile Ala Leu Val Asp Leu Asp Gly Thr Ala Glu Ser Leu Ala Ala1370 1375 1380Leu Ala Ser Thr Val Gly Val Asp Glu Pro Arg Leu Ala Leu Arg1385 1390 1395Glu Gly Arg Ala Thr Ala Pro Arg Leu Thr Arg Ala Ser Ala Gly1400 1405 1410Ser Pro Arg Pro Pro Arg Gly Ile Asp Pro Asn Gly Thr Ala Leu1415 1420 1425Ile Thr Gly Gly Thr Gly Thr Leu Gly Ala Leu Leu Ala Arg His
1430 1435 1440Leu Val His Gln His Gly Val Thr Asp Leu Leu Leu Thr Ser Arg1445 1450 1455Arg Gly Pro Asp Ala Pro Gly Ala Thr Glu Leu Thr Ala Glu Leu1460 1465 1470Thr Lys Ala Gly Ala His Val Thr Ile Thr Ala Cys Asp Thr Ala1475 1480 1485Asp Pro Asp Gln Leu Ala Ala Leu Leu Ser His His Thr Leu Thr1490 1495 1500Thr Val Ile His Thr Ala Gly Ile Leu Asp Asp Ala Thr Ile Thr1505 1510 1515Thr Leu Thr Asn Thr Gln Leu His Asn Val Leu Arg Pro Lys Ile1520 1525 1530Asp Ala Ala Thr His Leu His His Leu Thr Leu Asn His Pro Val1535 1540 1545Thr Thr Phe Ile Leu Tyr Ser Ser Ala Ala Gly Gln Leu Gly Ala1550 1555 1560Pro Gly Gln Ala Asn Tyr Ala Ala Ala Asn Thr Tyr Leu Asp Ala1565 1570 1575Leu Ala His His Arg Arg Thr His Gly Leu Pro Ala Thr Ser Leu1580 1585 1590Ala Trp Gly Leu Trp Asn Thr Arg Ser Thr Met Thr Gly His Leu1595 1600 1605Asn Asp Lys Glu Leu His Arg Met Glu Arg Ala Gly Val Val Pro1610 1615 1620Ile Ser Glu Glu Gln Gly Met Ala Leu Leu Asp Ala Ala Val Gly1625 1630 1635Leu Asp Ala Pro Val Ala Val Pro Leu Pro Leu Glu Pro Gly Ala1640 1645 1650Leu Arg Ser Gln Ala Ala Ala Gly Thr Leu Pro Pro Leu Leu Arg1655 1660 1665Gly Phe Val Arg Val Pro Val Arg Arg Ala Ala Asn Ala Ala Glu1670 1675 1680Gly Ala Tyr Ala Gly Met Thr Phe Ala Ala Gly Leu Arg Glu Leu1685 1690 1695Pro Glu Ala Glu Arg Leu Arg Leu Leu Leu Asp Leu Val Arg Thr1700 1705 1710His Ala Ala Arg Ala Leu Gly His Ala Asn Thr Asp Gly Leu Glu1715 1720 1725Ala Arg Arg Ser Phe Arg Glu Leu Gly Phe Asp Ser Leu Ala Ala1730 1735 1740Ile Glu Leu Arg Asn Gly Val Gly Ala Ala Thr Gly Leu Gly Leu1745 1750 1755Pro Ala Thr Leu Val Phe Asp His Pro Thr Pro Gln Arg Leu Ala1760 1765 1770Glu His Leu His Glu Lys Leu Phe Asp Arg Gly Ala Glu Val Ala1775 1780 1785
Leu Pro Glu Leu Arg Ala Thr Asp Asp Asp Pro Ile Val Ile Val1790 1795 1800Gly Met Ala Cys Arg Tyr Pro Gly Gly Val Ala Thr Pro Asp Ala1805 1810 1815Leu Trp Glu Leu Val Ala Ala Glu Arg Asp Ala Ile Ser Gly Met1820 1825 1830Pro Glu Asp Arg Gly Trp Asp Val Glu Glu Leu Tyr Asp Pro Glu1835 1840 1845Leu Ala Arg Pro Gly Thr Ser Tyr Val Arg Arg Gly Gly Phe Leu1850 1855 1860Tyr Glu Ala Ala Asp Phe Asp Ala Glu Phe Phe Gly Ile Ser Pro1865 1870 1875Arg Glu Ala Leu Ala Met Asp Pro Gln Gln Arg Leu Leu Leu Glu1880 1885 1890Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly Ile Asp Pro Arg Ala1895 1900 1905Val Arg Gly Ser Arg Thr Gly Val Phe Ala Gly Leu Met Tyr His1910 1915 1920Asp Tyr Gly Ser Gly Pro Gly Thr Leu Pro Asp Glu Val Glu Gly1925 1930 1935Phe Ile Gly Thr Gly Ser Ala Gly Ser Val Ala Ser Gly Arg Val1940 1945 1950Ala Phe Thr Leu Gly Leu Glu Gly Pro Ala Val Thr Leu Asp Thr1955 1960 1965Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Ala Gln Ala1970 1975 1980Leu Arg Gly Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr1985 1990 1995Val Met Ala Thr Pro Gly Val Phe Val Glu Leu Ser Arg Gln Gly2000 2005 2010Gly Leu Ala Pro Asp Gly Arg Cys Lys Ala Phe Ala Ala Gly Ala2015 2020 2025Asp Gly Thr Gly Trp Gly Glu Gly Val Gly Met Leu Leu Val Glu2030 2035 2040Arg Leu Ser Asp Ala Arg Arg His Gly His Pro Val Leu Ala Val2045 2050 2055Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu2060 2065 2070Thr Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Gly Gln Ala2075 2080 2085Leu Ala Ser Ala Gly Leu Ala Ala Val Asp Val Asp Val Val Glu2090 2095 2100Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro Ile Glu Ala Gln2105 2110 2115Ala Leu Leu Ala Thr Tyr Gly Gln Asp Arg Pro Val Asp Arg Pro2120 2125 2130
Leu Trp Leu Gly Ser Val Lys Ser Asn Ile Gly His Thr Gln Ala2135 2140 2145Ala Ala Gly Val Ala Gly Val Ile Lys Ser Val Leu Ala Met Arg2150 2155 2160His Gly Val Leu Pro Arg Thr Leu His Val Glu Glu Pro Thr Pro2165 2170 2175Glu Val Asp Trp Ser Ser Gly Ala Val Glu Leu Leu Ala Gln Ala2180 2185 2190Arg Glu Trp Pro Glu Thr Gly Arg Pro Arg Arg Ala Gly Val Ser2195 2200 2205Ala Phe Gly Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln2210 2215 2220Ala Pro Glu Thr Val Glu Glu Ser Ala Pro Gly Glu Thr Gly Ser2225 2230 2235Val Leu Val Pro Trp Val Ile Ser Ala Arg Ser Ala Gln Ala Leu2240 2245 2250Arg Glu Gln Ala Arg Asn Leu Ala Gly His Val Ala Arg His Gly2255 2260 2265Leu Arg Pro Val Asp Val Gly Phe Ser Leu Ala Ala Ala Arg Ala2270 2275 2280Gly Leu Gly His Arg Ala Val Leu Val Gly Arg Glu Thr Ser Glu2285 2290 2295Leu Leu Ala Gln Leu Glu Ala Leu Ala Glu Gly Arg Val Ala Gly2300 2305 2310Gly Ser Val Thr Asp Gly Gly Thr Ala Phe Leu Phe Ser Gly Gln2315 2320 2325Gly Ser Gln Arg Ala Ser Met Gly Arg Glu Leu Tyr Glu Ala Phe2330 2335 2340Pro Val Phe Ala Ala Ala Phe Asp Glu Val Cys Ala Gly Phe Asp2345 2350 2355Gly Met Leu Pro Gly Ser Leu Arg Asp Ala Val Phe Ala Gly Gly2360 2365 2370Glu Val Leu Asp Arg Thr Glu Trp Thr Gln Ala Gly Leu Phe Ala2375 2380 2385Leu Glu Val Ala Leu Phe Glu Leu Val Gly Ser Trp Gly Val Arg2390 2395 2400Ala Asp Val Leu Leu Gly His Ser Ile Gly Glu Leu Ala Ala Ala2405 2410 2415Tyr Val Ala Gly Val Trp Ser Leu Gln Asp Ala Cys Arg Val Val2420 2425 2430Ala Ala Arg Gly Arg Leu Met Gln Ala Leu Pro Ser Gly Gly Val2435 2440 2445Met Val Ala Val Gln Ala Ala Glu Glu Glu Leu Pro Glu Leu Pro2450 2455 2460Ala Gly Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Leu Val2465 2470 2475Leu Ser Gly Asp Glu Glu Pro Val Thr Ala Val Ala Gln Glu Leu
2480 2485 2490Ala Gly Arg Gly Arg Arg Ile Lys Arg Leu Ala Val Gly His Ala2495 2500 2505Phe His Ser Ala Arg Met Glu Pro Met Leu Ala Gln Phe Ala Glu2510 2515 2520Val Leu Ala Gly Val Glu Phe Arg Arg Pro Arg Ile Ala Val Val2525 2530 2535Ser Asn Val Thr Gly Gln Ile Ala Asp Glu Glu Leu Ala Thr Pro2540 2545 2550Ala Tyr Trp Val Arg His Val Arg Glu Ala Val Arg Phe Ala Asp2555 2560 2565Gly Val Thr Thr Ala His Ser Arg Gly Val Asp Lys Phe Leu Glu2570 2575 2580Leu Gly Pro Gly Gly Ser Leu Thr Ala Met Ala Glu Glu Thr Leu2585 2590 2595Asp His Thr Gly Thr Gly Thr Val Cys Thr Pro Ile Leu His Pro2600 2605 2610Glu Arg Pro Glu Ala Gln Ser Val Val His Ala Leu Gly Arg Ile2615 2620 2625Tyr Ala Ala Gly Ala Pro Ala Asp Trp Ser Ala Phe Phe Thr Gly2630 2635 2640Thr Gly Ala Arg Arg Val Asp Leu Pro Thr Tyr Ala Phe Gln Arg2645 2650 2655Arg Arg Phe Trp Leu Glu His Arg Arg Gly Ala Gly Asp Leu Thr2660 2665 2670Ala Met Gly Leu Gln Ala Ala Asp His Pro Leu Leu Gly Ala Ala2675 2680 2685Val Thr Leu Ala Asp Gly Glu Gly Val Leu Leu Thr Gly Arg Leu2690 2695 2700Ser Gly Arg Ala Gln Pro Trp Leu Leu Asp His Ala Leu Leu Gly2705 2710 2715Gln Val Leu Leu Pro Gly Ser Ala Phe Val Asp Leu Val Ile Arg2720 2725 2730Ala Gly Asp Leu Leu Asn Arg Pro Tyr Leu Glu Val Leu Thr Pro2735 2740 2745His Thr Pro Leu Leu Leu Gly Ala Gly Pro Glu Asp Glu Val Thr2750 2755 2760Val Gln Val Arg Ala Thr Pro Asp Ser Asp Ser Gly Arg Cys Thr2765 2770 2775Val Thr Leu His Ser Arg Thr Ser Asp Gly Asp Trp Thr Leu His2780 2785 2790Ala Thr Gly Thr Leu Ser Ala Asp Ala Pro Ala Glu Pro Ala Pro2795 2800 2805Leu Pro Ser Trp Pro Pro Ala Gly Ala Glu Ala Val Glu Thr Asp2810 2815 2820Gly Val Tyr Gln Glu Leu Ala Ala Ser Gly Tyr His Tyr Gly Pro2825 2830 2835
Ala Phe Gln Cys Leu His Ala Leu Trp Arg Gln His Asp Glu Leu2840 2845 2850Phe Ala Glu Val Arg Leu Pro Glu Ser Glu Arg Glu Glu Gly Thr2855 2860 2865Arg Tyr Gly Val His Pro Ala Leu Leu Asp Ala Ala Leu His Ala2870 2875 2880Met Ala Phe Val Gly Gly Arg Asp Glu Gly Val Arg Leu Pro Ser2885 2890 2895Ser Trp Ala Ala Val Arg Leu Tyr Ala Ser Gly Ala Thr Thr Ala2900 2905 2910Arg Val Arg Leu Thr Pro Ser Gly Asp Gln Leu Ala Leu Leu Val2915 2920 2925Thr Asp Glu Ala Gly Arg Pro Val Val Ser Val Gly Ser Val Val2930 2935 2940Thr Lys Pro Ala Val Phe Asp Gln Pro Ser Gly Gly Thr Leu Glu2945 2950 2955Gln Ala Leu Leu His Leu Asp Trp Thr Ala Leu Pro Val Ala Ala2960 2965 2970Ala Gln Ser Tyr Ala Leu Val Gly Asp Asp Pro Phe Gly Leu Thr2975 2980 2985Gly Ala Ala Leu Arg Val Ala Ala Thr Phe Glu Glu Leu Ala Ala2990 2995 3000Asn Gly Pro Val Pro Gly Ile Val Val Arg Cys Leu Ala Pro Arg3005 3010 3015Val Ser Asp Asp Pro Ala Ala Asp Ala His Ala Ala Ala Glu Ala3020 3025 3030Thr Leu Gly Val Ile Arg Ala Trp Leu Ala Asp Asp Arg Phe Ala3035 3040 3045Ser Ala Arg Leu Val Leu Val Thr Ser Gly Ala Val Ala Ala Gly3050 3055 3060Asp Ala Glu Asp Val Thr Asp Leu Ala Asn Ser Thr Ser Trp Gly3065 3070 3075Leu Val Gly Ser Ala Gln Thr Glu His Pro Asp Arg Phe Phe Leu3080 3085 3090Val Asp Leu Asp Gly Leu Asp Thr Ser Arg Glu Val Phe Gly Asp3095 3100 3105Ala Leu Ala Cys Ala Glu Pro Arg Ile Ala Val Arg Arg Gly Thr3110 3115 3120Val Ala Ala Pro Arg Leu Ala Arg Ala Arg Ser His Pro Ala Leu3125 3130 3135Leu Pro Pro Ser Gly Pro Val Pro Trp Arg Leu Glu Ser Thr Gly3140 3145 3150Leu Asp Pro Leu Gly Pro Gly Gln Val Arg Ile Ala Val His Ala3170 3175 3180
Val Gly Leu Asp Phe Gln Asp Val Val Ala Ser Leu Asp Pro Ala3185 3190 3195Glu Gly Ser Lys Gly Ile Ser Gly Tyr Ala Ala Gly Thr Val Gln3200 3205 3210Glu Thr Gly Ala Glu Val Thr Asp Leu Ala Val Gly Asp Arg Val3215 3220 3225Leu Ala Leu Arg Ser Gly Ser Ser Gly Pro Phe Ala Val Tyr Asp3230 3235 3240His Arg Cys Leu Ala Pro Met Pro Asp Gly Trp Ser Tyr Glu Gln3245 3250 3255Ala Ala Ala Val Pro Leu Thr Tyr Leu Ile Pro Tyr Tyr Gly Leu3260 3265 3270Val Asp Leu Ala Asp Val Gln Pro Gly Met Ser Val Leu Val His3275 3280 3285Asp Ala Ala Asp Gly Ser Gln Leu Ala Ala Val Gln Leu Ala His3290 3295 3300Gln Leu Gly Ala Glu Val Tyr Gly Thr Ala Ala Thr Gly Lys Trp3305 3310 3315Pro Thr Leu Arg Lys Tyr Gly Leu Asp Asp Ala His Ile Ala Asp3320 3325 3330Ser Arg Thr Pro Glu Phe Glu His Arg Phe Met Glu Thr Ser Gly3335 3340 3345Gly Cys Gly Val Asp Val Val Leu Asn Cys Leu Ala Gly Glu Ser3350 3355 3360Val Asp Ala Gly Leu Arg Leu Leu Pro Arg Gly Gly Arg Phe Leu3365 3370 3375Glu Thr Gly Arg Ala Asp Arg Arg Asp Pro Ala Gln Val Ala Glu3380 3385 3390Ala His Ala Gly Val Ala Tyr Arg Thr Tyr Asp Leu Ala Glu Ala3395 3400 3405Asp Pro Asp Arg Ile Arg Glu Met Leu Val Ala Val Met Ser Leu3410 3415 3420Cys His Asp Gly Lys Leu Thr Pro Pro Arg Ile Thr Val Arg Asp3425 3430 3435Leu Arg Arg Ala Arg Glu Ala Ser Arg His Ala Asn Arg Ala Thr3440 3445 3450Pro Ala Gly Pro Ala Val Leu Thr Val Pro Arg Gly Ile Asp Pro3455 3460 3465Asn Gly Thr Ala Leu Ile Thr Gly Gly Thr Gly Thr Leu Gly Ala3470 3475 3480Leu Leu Ala Arg His Leu Val His Gln His Gly Val Thr Asp Leu3485 3490 3495Leu Leu Thr Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Thr Glu3500 3505 3510Leu Thr Ala Glu Leu Thr Lys Ala Gly Ala His Val Thr Ile Thr3515 3520 3525Ala Cys Asp Thr Ala Asp Pro Asp Gln Leu Ala Ala Leu Leu Ser
3530 3535 3540His His Thr Leu Thr Thr Val Ile His Thr Ala Gly Ile Leu Asp3545 3550 3555Asp Ala Thr Ile Thr Thr Leu Thr Asn Thr Gln Leu His Asn Val3560 3565 3570Leu Arg Pro Lys Ile Asp Ala Ala Thr His Leu His His Leu Thr3575 3580 3585Leu Asn His Pro Val Thr Thr Phe Ile Leu Tyr Ser Ser Ala Ala3590 3595 3600Gly Gln Leu Gly Ala Pro Gly Gln Ala Asn Tyr Ala Ala Ala Asn3605 3610 3615Thr Tyr Leu Asp Ala Leu Ala His His Arg Arg Thr His Gly Leu3620 3625 3630Pro Ala Thr Ser Leu Ala Trp Gly Leu Trp Asn Thr Arg Ser Thr3635 3640 3645Met Thr Gly His Leu Asn Asp Lys Glu Leu His Arg Met Glu Arg3650 3655 3660Ala Gly Val Val Pro Leu Glu Asp Ala Glu Ala Leu Ala Leu Phe3665 3670 3675Asp Leu Ala Cys Gly Ala Asp Val Pro Leu Gln Val Ile Thr Arg3680 3685 3690Leu Thr Pro Ser Thr Leu Arg Ser Gly Ala Asp Glu Val Pro His3695 3700 3705Leu Leu Arg Gly Leu Val Gln Gly Thr Ser Arg Arg Thr Ala Arg3710 3715 3720Ser Gly Ser Asn Gly Ser Gly Leu Arg Thr Arg Leu Ala Arg Leu3725 3730 3735Pro Ala Val Glu Gln His Arg Arg Val Leu Glu Leu Val Arg Ser3740 3745 3750His Ala Ala Thr Val Leu Gly His Ala Ser Val Ala Ala Val Thr3755 3760 3765Ala Glu Arg Ser Phe Ser Glu Leu Gly Phe Ser Ser Leu Thr Ala3770 3775 3780Val Glu Phe Arg Asn Arg Leu Gly Ala Ala Thr Gly Leu Arg Leu3785 3790 3795Pro Ala Thr Leu Val Phe Glu His Pro Thr Pro Thr Ala Leu Ala3800 3805 3810Thr Glu Leu Leu Thr Ala Leu Val Pro Ala Gly Leu Ser Gly Val3815 3820 3825Glu Ala Ala Leu Ala Glu Val Asp Ala Leu Glu Ala Ala Leu Lys3830 3835 3840Thr Ile Asp Ala Asp Asn Gly Asp Arg Asp Arg Val Val Arg Arg3845 3850 3855Leu Arg Gly Leu Leu Ser Glu Trp Arg Glu Pro Asp Thr Gly Pro3860 3865 3870Ala Ala Leu Asp Asp Leu Ala Thr Ala Thr Thr Asp Asp Leu Phe3875 3880 3885
Glu Ala Ile Asp Gln Gly Phe Gly Leu3890 3895<210>6<211>1868<212>PRT<213>Streptomyces nanchangensis n.sp. NS3226<400>1Met Ser Asp Asp Glu Lys Tyr Arg Glu Tyr Leu Lys Arg Ala Val Thr1 5 10 15Glu Ala Arg Gly Leu Gln Arg Arg Leu Arg Glu Val Glu Asp Arg Ala20 25 30Arg Glu Pro Ile Ala Ile Ile Gly Met Ala Cys Arg Leu Pro Gly Gly35 40 45Ala Asp Thr Pro Glu Asp Val Trp Arg Met Leu Ser Glu Glu Ala Asp50 55 60Ala Val Ala Gly Phe Pro Asp Asp Arg Gly Trp Asn Leu Asp Gly Leu65 70 75 80Tyr Glu Thr Asp Ala Ala Gly Ala Gly Thr Ser Thr Pro Leu Glu Gly85 90 95Gly Phe Leu Arg Cys Ala Gly Glu Phe Asp Ala Ala Phe Phe Gly Ile100 105 110Ala Pro Arg Glu Ala Leu Thr Thr Asp Pro Gln Gln Arg Leu Leu Leu115 120 125Ala Ser Ser Trp Glu Ala Leu Glu Arg Ala Arg Ile Asp Pro Arg Ser130 135 140Leu Arg Gly Ser Asp Thr Gly Val Phe Phe Gly Gly Thr Ser Gly Asp145 150 155 160Phe Ala Gly Leu Leu Ala Ala Ser Pro His Ala Leu Asp Gly Tyr Leu165 170 175Met Thr Gly Thr Ser Ser Ser Val Leu Ser Gly Arg Val Ala Tyr Thr180 185 190Leu Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser195 200 205Ser Leu Val Ser Leu His Leu Ala Val Gln Ala Leu Arg Lys Asp Glu210 215 220Ile Ser Leu Ala Leu Ala Gly Gly Val Thr Val Leu Ala Thr Pro Gly225 230 235 240Ala Phe Pro Glu Phe Ser Arg Gln Gly Gly Leu Ala Ser Asp Gly Arg245 250 255Cys Lys Ala Phe Ser Ser Asp Ala Asp Gly Thr Gly Trp Gly Glu Gly260 265 270Val Gly Val Leu Val Leu Gln Arg Leu Ser Asp Ala Gln Arg Thr Gly275 280 285His Pro Val Leu Ala Val Val Arg Glu Thr Gly Ile Asn Gln Asp Gly290 295 300Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gln Arg His Leu305 310 315 320
Ile Leu Arg Val Leu Asp Asn Ala Gly Leu Ala Thr Ala Asp Val Asp325 330 335Met Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu340 345 350Ala Arg Ala Leu Leu Ala Thr Tyr Gly Gln Asp Arg Pro Ala Asp Arg355 360 365Pro Leu Trp Leu Gly Ser Ile Lys Ser Asn Val Gly His Thr Gln Tyr370 375 380Ala Ala Gly Val Ser Gly Val Ile Lys Thr Val Met Ala Leu Arg His385 390 395 400Gly Val Met Pro Lys Thr Leu His Val Asp Glu Pro Thr Pro His Val405 410 415Asp Trp Ser Ser Gly Ala Val Arg Leu Leu Thr Glu Ala Arg Glu Trp420 425 430Pro Glu Thr Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Val435 440 445Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Ala Pro Glu Ala Glu450 455 460Pro Val Glu Val Asp Glu Ala Asp Arg Pro Val Leu Met Gly Ser Val465 470 475 480Pro Trp Val Val Ser Ala Arg Gly Glu Gly Ala Leu Arg Ala Gln Ala485 490 495Gly Arg Leu Leu Glu Trp Leu Val Glu Arg Pro Gly Leu Gly Pro Val500 505 510Asp Val Gly Phe Ser Leu Val Gly Thr Arg Ser Ala Phe Glu Gln Arg515 520 525Ala Val Val Leu Gly Gly Asp Arg Glu Glu Leu Leu Ala Gly Leu Arg530 535 540Ser Val Ala Glu Gly Val Pro Gly Ala Gly Val Val Ser Gly Arg Ala545 550 555 560Ala Gly Asp Gly Gly Met Gly Val Val Phe Val Phe Pro Gly Gln Gly565 570 575Ser Gln Trp Val Gly Met Gly Arg Glu Leu Trp Glu Val Ser Ser Val580 585 590Phe Ala Glu Ser Met Val Ala Cys Glu Arg Ala Leu Val Pro Phe Val595 600 605Asp Trp Ser Leu Arg Asp Val Val Phe Gly Gly Gly Gly Asp Gly Leu610 615 620Trp Glu Arg Val Asp Val Val Gln Pro Val Leu Trp Ala Val Met Val625 630 635 640Ser Leu Ala Ala Val Trp Arg Ser Phe Gly Val Glu Pro Ala Ala Val645 650 655Val Gly His Ser Gln Gly Glu Val Ala Ala Ala Cys Val Ala Gly Gly660 665 670Leu Ser Leu Glu Asp Gly Ala Arg Val Val Ala Val Arg Ser Arg Leu675 680 685Val Gly Glu Arg Leu Ser Gly Arg Gly Gly Met Val Ser Val Gly Leu
690 695 700Ser Val Gly Glu Val Glu Glu Trp Leu Ala Gly Leu Gly Gly Arg Val705 710 715 720Gly Val Ala Ala Val Asn Gly Pro Ser Ser Val Val Val Ser Gly Glu725 730 735Ala Glu Val Leu Glu Gly Leu Leu Ala Gly Phe Glu Gly Ala Gly Val740 745 750Arg Ala Arg Arg Ile Ala Val Asp Tyr Ala Ser His Ser Val Gln Val755 760 765Asp Ala Leu Gly Asp Asp Leu Leu Ala Gly Leu Ala Gly Ile Arg Pro770 775 780Val Ser Ser Ser Val Ala Phe Tyr Ser Thr Val Ser Gly Glu Arg Met785 790 795 800Asp Thr Ala Gly Leu Asp Ala Gly Tyr Trp Leu Arg Asn Met Arg Glu805 810 815Thr Val Ala Phe Glu Ala Ala Val Arg Ala Thr Leu Asp Glu Gly His820 825 830Arg Thr Leu Leu Glu Val Ser Pro His Pro Val Val Ala Met Ala Leu835 840 845Gln Glu Ile Ile Asp Gly Ala Gly Val Ser Ala His Val Ser Gly Thr850 855 860Ile Arg Arg Asp Asp Ala Gly Ala Gly Arg Leu Leu Thr Ser Leu Ala865 870 875 880Glu Ala Tyr Val Ala Gly Ala Pro Val Asn Trp Ser Val Val Phe Glu885 890 895Gly Thr Gly Ala Arg Pro Val Asp Leu Pro Thr Tyr Ala Phe Gln His900 905 910Gln Arg Tyr Trp Leu Arg Met Pro Val Ser Gly Ser Gly Asp Val Thr915 920 925Ala Ala Gly Leu Arg Ser Pro Gly His Pro Leu Leu Gly Ala Ala Val930 935 940Glu Pro Ala Glu Ser Asp Gly Leu Val Leu Thr Gly Arg Leu Ser Leu945 950 955 960Arg Asp His Pro Trp Leu Ala Asp His Arg Val Ala Gly Thr Val Pro965 970 975Leu Pro Gly Thr Ala Phe Val Glu Leu Ala Ala Val Ala Gly Asp Leu980 985 990Ala Glu Cys Pro Tyr Ile Glu Glu Leu Thr Leu Gln Thr Pro Leu Thr995 1000 1005Leu Pro Glu Thr Gly Gly Val Asp Leu Gln Leu Thr Val Gly Ala1010 1015 1020Pro Asp Asp Ala Gly Arg Arg Glu Val Gly Phe Phe Ala Arg Thr1025 1030 1035Asp Glu Glu Phe Thr Ala Ala Glu Trp Thr Arg His Ala Thr Gly1040 1045 1050Val Leu Cys Pro Ala Gly Pro Ala Pro Lys Ala Glu Pro Ala Asp1055 1060 1065
Trp Pro Pro Arg Gly Ala Glu Arg Ile Asp Ile Gly Ser Leu Tyr1070 1075 1080Glu Asp Leu Ala Gly Gly Pro Leu Ala Tyr Gly Pro Ala Phe Arg1085 1090 1095Gly Leu Arg Ala Val Trp Ser Arg Gly Arg Glu Val Phe Ala Glu1100 1105 1110Ile Glu Leu Pro Gln Glu Leu His Glu Ala Ala Gly Glu Phe Leu1115 1120 1125Leu His Pro Ala Leu Leu Asp Ala Ala Leu His Ala Val Gly Phe1130 1135 1140Leu Gly Glu Leu Asp Ala Pro Ser Ala Pro Leu Arg Pro Phe Ala1145 1150 1155Trp Asn Ala Val Ser Leu Gln Ala Thr Gly Ala Thr Ala Leu Arg1160 1165 1170Val Ala Leu Ser Pro Ala Gly Lys Ser Ala Val Ser Leu Arg Ala1175 1180 1185Val Asp Gly Thr Gly Thr Pro Val Val Ser Ile Gly Ser Leu Leu1190 1195 1200Leu Arg Pro Ala Asp Leu Thr Asp Gly Asp Pro Gly Ser Gly Arg1205 1210 1215Thr Ala Thr His Ser Ser Leu Leu Ser Met Val Trp Thr Pro Val1220 1225 1230Pro Leu Pro Thr Val Lys Thr Ala Ala Trp Ala Val Val Gly Thr1235 1240 1245Ala Pro Ala Trp Ala Gly Pro Asp Ser Gly Ala Val His His Pro1250 1255 1260Asp Leu Leu Ala Leu Ser Ala Ser Val Ala Ala Gly Asp Pro Val1265 1270 1275Pro Ala Phe Val Val Leu Thr Pro Asp Asp Gly Glu Pro Ala Gly1280 1285 1290Pro Gly Phe Ala Glu Leu Pro Ala Leu Thr Arg Glu Arg Ala Gly1295 1300 1305Leu Val Leu Glu Ala Ala Arg Thr Trp Val Ser Glu Glu Leu Asp1310 1315 1320Glu Arg Leu Ala Ser Ile Pro Leu Val Val Thr Thr Thr Asp Ala1325 1330 1335Val Gly Ile Ser Ala Asp Asp Arg Val Ala Gly Leu Gly Ser Ala1340 1345 1350Pro Leu Trp Gly Leu Val Arg Ser Ile Gln Ser Glu Asn Pro Gly1355 1360 1365Arg Leu Val Leu Leu Asp Thr Asp Gly Ser Val Glu Ser Gly Arg1370 1375 1380Ala Val Arg Ala Ala Val Ala Ser Gly Glu Ala Gln Leu Ala Leu1385 1390 1395Arg Asp Gly Val Ala Leu Met Pro Arg Leu Asn Arg Pro Pro Ala1400 1405 1410
Ala Asp Val Pro Asp Thr Val Pro Gly Thr Glu Leu Asp Ala Leu1415 1420 1425Asp Pro Ser Gly Thr Val Leu Val Thr Gly Ala Thr Gly Gly Ile1430 1435 1440Gly Ala Leu Val Ala Thr Arg Leu Ala Lys Leu His Gly Val His1445 1450 1455His Leu Leu Leu Leu Ser Arg Gln Gly Pro Asp Ala Glu Gly Ala1460 1465 1470Gly Glu Leu Val His Asp Leu Glu Glu Leu Gly Ala Thr Val Thr1475 1480 1485Leu Val Ala Cys Asp Val Ser Asp Arg Ala Ala Leu Ala Ala Val1490 1495 1500Leu Asp Gly Val Ser Ala Gly His Pro Leu Thr Ala Val Val His1505 1510 1515Cys Ala Gly Thr Ala Glu Asn Ala Leu Leu Ala Ser Leu Ala Pro1520 1525 1530Glu Leu Ile Asp Arg Val Phe Arg Ala Lys Val Asp Ala Ala Val1535 1540 1545His Leu His Glu Leu Thr Ala Glu Leu Asp Leu Ser Ala Phe Val1550 1555 1560Leu Phe Ser Ser Ile Ala Gly Thr Leu Gly Gly Thr Gly Gln Gly1565 1570 1575Asn Tyr Ala Ala Ala Asn Thr Phe Leu Asp Ser Leu Ala Gln Tyr1580 1585 1590Arg Arg Arg Asn Gly Leu Ala Ala Thr Ser Leu Gly Trp Gly Leu1595 1600 1605Trp Ala Thr Glu Arg Gly Met Asp Ser His Leu Ala Glu Gly Ala1610 1615 1620Ser Ala Gly Ser Pro Met Gly Gly Val Ser Ala Met Pro Ala Asp1625 1630 1635Gln Gly Leu Ala Leu Phe Asp Leu Gly Trp Arg Arg Ala Glu Pro1640 1645 1650Val Val Phe Pro Val Arg Leu Asn Ser Ala Ala Leu Arg Ala Gln1655 1660 1665Ala Ala Ala Gly Ser Leu Pro Pro Val Leu Arg Gly Leu Val Arg1670 1675 1680Val Pro Ala Gln Arg Ser Ala Gln Thr Gly Ser Gln Ala Pro Glu1685 1690 1695Ser Gln Leu Arg His Arg Leu Ala Glu Met Gly Pro Ala Glu Arg1700 1705 1710Gln Glu Thr Leu Leu Ala Leu Val Arg Asp Arg Ile Ala Ala Val1715 1720 1725Leu Gly His Ala Ser Ser Asp Gln Ile Glu Thr Asp Arg Pro Phe1730 1735 1740Arg Asp Leu Gly Phe Thr Ser Leu Thr Ala Val Glu Leu Arg Asn1745 1750 1755Arg Leu Asn Ala Ala Thr Gly Leu Arg Leu Pro Val Ser Leu Val
1760 1765 1770Phe Asp Tyr Pro Thr Leu Gly Ala Leu Val Ala Leu Leu Val Ala1775 1780 1785Arg Leu Ala Pro Asp Gly Ala Gln Ser Ala Thr Thr Pro Glu Ala1790 1795 1800Glu Gln Glu Ala Ala Val Arg Arg Ala Leu Met Ser Val Pro Leu1805 1810 1815Asp Arg Leu Arg Glu His Gly Leu Leu Glu Ala Leu Leu Ala Leu1820 1825 1830Thr Gly Asp Glu Arg Ala Glu Pro Glu Val Ala Asp Arg Ser Glu1835 1840 1845Glu Ile Lys Ser Met Asp Val Thr Ala Leu Leu Ala Met Ala Arg1850 1855 1860Ser Thr Ser Thr Arg1865<210>7<211>4635<212>PRT<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)<400>1Met Ser Glu Ile Val Asp Ala Leu Arg Ala Ser Leu Leu Glu Asn Glu1 5 10 15Arg Leu Arg Gln Gln Asn Gln Arg Leu Ser Ala Ala Ser Ser Glu Pro20 25 30Leu Ala Ile Val Gly Ile Gly Cys Arg Tyr Pro Gly Gly Val Arg Asp35 40 45Thr Glu Gly Leu Trp Gln Leu Ile Ala Glu Gly Arg Asp Ala Met Ser50 55 60Asp Phe Pro Thr Asp Arg Gly Trp Glu Asp Arg Asp Val Pro Ala Ala65 70 75 80Arg Thr Gly Ala Phe Leu His Asp Ala Gly Asp Phe Asp Pro Ala Phe85 90 95Phe Arg Ile Ser Pro Arg Glu Ala Met Ala Met Asp Pro Gln Gln Arg100 105 110Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly Ile Asp115 120 125Pro Val Ser Leu Lys Gly Ser Arg Thr Gly Val Phe Ile Gly Gly Ala130 135 140Pro Gln Glu Tyr Gly Ala Leu Val Met Asn Ser Ala Gln Gly Ala Gly145 150 155 160Gly Tyr Ala Leu Thr Gly Ala Pro Gly Ser Val Leu Ser Gly Arg Ile165 170 175Ser Tyr Val Leu Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala180 185 190Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Ile Lys Ser Leu Arg195 200 205Thr Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Leu Val Leu Ile210 215 220
Thr Pro Thr Ile Phe Thr Glu Phe Ser Ala Thr Gly Gly Ser Ala Gly225 230 235 240Asp Gly Arg Cys Lys Ala Phe Ser Ser Asp Ala Asp Gly Thr Gly Trp245 250 255Gly Glu Gly Ala Gly Val Leu Ala Ile Gln Arg Leu Ser Asp Ala Arg260 265 270Arg Asp Gly Asn Pro Val Leu Ala Val Ile Arg Gly Ser Ala Val Asn275 280 285Gln Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Gly Pro Ser Gln290 295 300Gln Arg Val Ile Arg Gln Ala Ile Ala Asn Ala Gly Leu Thr Leu Ala305 310 315 320Asp Val Asp Met Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp325 330 335Pro Ile Glu Ala Glu Ala Leu Leu Ala Thr Tyr Gly Gln Glu Arg His340 345 350Asp Gly Arg Pro Leu Trp Leu Gly Thr Leu Lys Ser Asn Val Gly His355 360 365Thr Gln Ala Ala Ala Gly Ile Ser Gly Val Ile Lys Ala Ala Leu Ala370 375 380Leu Gln His Gly Ile Met Pro Lys Thr Leu His Val Asp Glu Pro Thr385 390 395 400Pro Glu Val Asp Trp Ser Ala Gly Ala Val Glu Leu Leu Thr Glu Ala405 410 415Arg Gln Trp Pro Glu Thr Gly Gln Pro Arg Arg Val Gly Val Ser Ser420 425 430Phe Gly Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Ala Pro435 440 445Glu Ala Ala Pro Ala Glu Gln Ala Asp Gly Asp Ala Pro Ala Glu Leu450 455 460Pro Val Thr Pro Trp Val Val Thr Gly Arg Asn Glu Ala Ala Leu Arg465 470 475 480Glu Gln Ala Ala Arg Leu Leu Asp His Leu Thr Gln Gln Pro Asp Leu485 490 495Ser Pro Arg Asp Val Gly Phe Ser Leu Val Gly Thr Arg Ser Ala Phe500 505 510Glu Gln Arg Ala Val Val Leu Gly Gly Asp Met Ala Ala Leu Thr Glu515 520 525Gly Val Arg Ala Leu Ala Ala Gln Glu Pro Asn Thr His Val Ile Ala530 535 540Gly Thr Ala Glu Val Arg Ser Gly Ile Val Phe Val Phe Pro Gly Gln545 550 555 560Gly Ser Gln Trp Val Gly Met Gly Arg Glu Leu Trp Asp Ala Ser Pro565 570 575Val Phe Ala Glu Ser Met Val Ala Cys Glu Arg Ala Leu Ala Pro Phe580 585 590
Val Asp Trp Ser Leu Lys Asp Val Val Phe Arg Gly Ala Glu Asp Pro595 600 605Leu Trp Ala Arg Val Asp Val Val Gln Pro Val Leu Trp Ala Val Met610 615 620Val Ser Leu Ala Ala Val Trp Arg Ser Phe Gly Val Glu Pro Val Ala625 630 635 640Val Val Gly His Ser Gln Gly Glu Val Ala Ala Ala Cys Val Ala Gly645 650 655Gly Leu Ser Leu Glu Asp Gly Ala Arg Val Val Ala Val Arg Ser Arg660 665 670Leu Val Arg Glu Lys Leu Ser Gly Leu Gly Gly Met Gly Ser Val Ala675 680 685Leu Pro Val Glu Ala Val Glu Val Arg Leu Gly Arg Phe Gly Gly Arg690 695 700Val Gly Val Ala Ala Val Asn Gly Pro Thr Ser Val Val Val Ser Gly705 710 715 720Glu Val Glu Ala Leu Asp Ala Leu Leu Ala Glu Cys Glu Glu Ala Gly725 730 735Val Arg Ala Arg Arg Ile Ala Val Asp Tyr Ala Ser His Ser Ala Gln740 745 750Val Asp Ala Leu Thr Asp Asp Leu Leu Ala Glu Leu Ala Glu Leu Arg755 760 765Pro Gln Ser Ser Ser Val Ala Phe Tyr Ser Thr Val Thr Gly Glu Arg770 775 780Leu Asp Thr Ala Gly Leu Asp Ala Arg Tyr Trp Val Thr Asn Leu Arg785 790 795 800Glu Arg Val Asn Phe Glu Pro Val Thr Arg Leu Leu Ala Glu Lys Gly805 810 815Ala Gly Val Phe Val Glu Ser Ser Pro His Pro Val Leu Thr Val Ala820 825 830Val Thr Glu Thr Gly Glu Ala Ala Asp Arg Ser Val Val Ala Val Gly835 840 845Ser Leu Arg Arg Glu Glu Gly Gly Leu Arg Arg Phe Leu Ala Ser Leu850 855 860Ala Glu Ala Tyr Val Ala Gly Val Pro Val Asp Trp Ser Val Thr Phe865 870 875 880Ala Gly Ser Gly Ala Arg Arg Val Asp Leu Pro Thr Tyr Ala Phe Gln885 890 895His Gln Arg Tyr Trp Leu Asp Asp Val Val Leu Pro Gly Gln Gly Gly900 905 910Gly Gly Ser Ser Asp Pro Ala Asp Ala Ala Phe Trp Gly Ala Val Glu915 920 925Arg Ala Asp Ala Glu Ser Val Val Ser Leu Val Asp Gly Ala Asp Ala930 935 940Gln Val Trp Glu Ser Val Leu Pro Ala Leu Ser Ala Trp Arg Lys Gly945 950 955 960Arg Arg Thr Gln Ser Thr Leu Asp Ser Trp Arg Tyr Arg Thr Val Trp
965 970 975Arg Ser Val Thr Val Ser Ser Ala Ala Ser Leu Cys Gly Val Trp Leu980 985 990Val Val Ser Ser Gly Pro Gly Ala Pro Val Glu Gln Val Thr Leu Ala995 10001005Leu Thr Ala Ala Gly Ala Glu Val Arg Val Leu Asp Val Pro Val1010 1015 1020Glu Arg Gly Ala Leu Ala Glu Trp Phe Ala Glu Ala Gly Glu Val1025 1030 1035Ala Gly Val Val Ser Leu Leu Ala Trp Asp Glu Asp Glu Ala Leu1040 1045 1050Ala Ser Ser Leu Ala Leu Val Gln Ala His Gly Asp Ala Gly Leu055 1060 1065Ser Ala Pro Val Trp Val Leu Thr Arg Gly Ala Ala Ala Val Gly1070 1075 1080Ser Asp Asp Ala Val Cys Ala Thr Gln Thr Ser Leu Trp Ala Trp1085 1090 1095Gly Gln Val Val Gly Leu Glu Leu Pro Ala Val Trp Gly Gly Leu1100 1105 1110Val Asp Val Pro Ala Glu Trp Asp Gly Arg Val Ser Ser Ala Leu1115 1120 1125Ala Ala Val Leu Ala Ala Gly Glu Gly Glu Asp Gln Val Ala Val1130 1135 1140Arg Ser Ser Gly Val Tyr Ala Arg Arg Leu Val Trp Ala Pro Leu1145 1150 1155Gly Ala Gly Ala Ala Ala Val Arg Glu Phe Lys Pro Gln Gly Thr1160 1165 1170Val Leu Ile Thr Gly Gly Thr Gly Gly Val Gly Gly His Leu Ala1175 1180 1185Arg Trp Leu Ala Arg Glu Gly Ala Glu His Leu Leu Leu Val Asn1190 1195 1200Arg Thr Gly Glu Gly Ala Ala Glu Leu Leu Glu Glu Leu Arg Gly1205 1210 1215Ser Gly Ala Glu Val Thr Val Ala Ala Cys Asp Val Thr Asp Arg1220 1225 1230Ala Ala Leu Ala Glu Leu Leu Ala Gly Ile Pro Ala Glu Arg Pro1235 1240 1245Leu Thr Ala Val Phe His Ala Ala Gly Val Ala Gly Tyr Gly Leu1250 1255 1260Val Arg Glu Leu Asp Val Ala Asp Leu Asp Val Glu Met Ala Ala1265 1270 1275Arg Thr Leu Gly Ala Arg His Leu Asp Glu Leu Thr Ala Glu Leu1280 1285 1290Gly Leu Asp Leu Asp Ala Phe Val Val Phe Ser Thr Gly Ala Ser1295 1300 1305Val Trp Gly Ser Ala Gly Asn Gly Ala Asn Ala Ala Ala Gly Gly1310 1315 1320
Tyr Leu Asp Gly Leu Ile Arg Gly Arg Arg Ala Arg Gly Leu Val1325 1330 1335Gly Ser Ser Val Ser Trp Gly Gly Trp Gly Ala Thr Ala Met Ala1340 1345 1350Val Gly Glu Thr Ala Glu Arg Leu Ser Arg Arg Gly Val Arg Leu1355 1360 1365Leu Glu Pro Glu Leu Ala Val Arg Ala Leu Arg Gln Val Leu Glu1370 1375 1380Gln Asp Glu Val Ser Val Thr Val Ala Asp Leu Asp Trp Ser Leu1385 1390 1395Phe Thr Pro Gly Tyr Ala Met Ala Arg Arg Arg Pro Leu Ile Glu1400 1405 1410Asp Ile Pro Glu Ala Ala Arg Ala Leu Arg Asp Ile Thr Glu Thr1415 1420 1425Asp Glu Thr Gln Asp Ala Ala Ala Gly Gly Leu Arg Glu Arg Leu1430 1435 1440Ala Gly Leu Ala Glu Ser Glu Gln Gln Ala Leu Leu Leu Gly Leu1445 1450 1455Val Arg Gly Glu Ala Ala Gln Val Leu Ala His Gly Ser Thr Ala1460 1465 1470Glu Ile Thr Pro Ser Arg Pro Phe Lys Glu Leu Gly Phe Asp Ser1475 1480 1485Leu Thr Gly Met Glu Leu Arg Asn Arg Leu Ser Lys Ala Thr Gly1490 1495 1500Leu Arg Leu Pro Ala Thr Leu Val Phe Asp Tyr Pro Asn Leu Gln1505 1510 1515Gln Leu Ala Ser Leu Leu Arg Thr Ala Leu Ile Asp Gly Leu Pro1520 1525 1530Gly Ala Gly Ala Val Ala Thr Thr Val Arg Leu Val Asp Asp Glu1535 1540 1545Pro Leu Ala Ile Ile Gly Met Ala Cys Arg Tyr Pro Gly Asp Val1550 1555 1560Arg Asp Pro Glu Asp Leu Trp Arg Leu Val Ser Glu Gly Arg Asp1565 1570 1575Glu Leu Ser Asp Phe Pro Thr Asp Arg Gly Trp Glu Arg Trp Gly1580 1585 1590Thr Pro Ala Val Gly Gln Ala Gly Phe Leu His Glu Ala Gly Asp1595 1600 1605Phe Asp Ala Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Ala Ser1610 1615 1620Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Val Ser Trp Glu Ala1625 1630 1635Phe Glu Gln Ala Gly Ile Asp Pro Trp Ser Leu Arg Asn Ser Pro1640 1645 1650Thr Gly Val Phe Val Gly Gly Gly Pro Gln Asp Tyr Pro Thr Val1655 1660 1665
Leu Met Gly Ser Ala Glu Ala Ala Ser Gly Tyr Gly Met Thr Gly1670 1675 1680Ala Leu Gly Ser Val Met Ser Gly Arg Val Ser Tyr Met Leu Gly1685 1690 1695Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser1700 1705 1710Leu Val Ala Leu His Leu Ala Ala Gln Ser Leu His Asn Gly Glu1715 1720 1725Cys Gly Leu Ala Val Ala Gly Gly Val Thr Ile Met Ala Thr Pro1730 1735 1740Gly Ala Phe Leu Gly Phe Asp Thr Leu Gly Gly Leu Ala Glu Asp1745 1750 1755Gly Arg Cys Lys Ala Phe Ala Ala Ser Ala Asp Gly Thr Gly Trp1760 1765 1770Ala Glu Gly Val Gly Met Val Val Leu Glu Arg Leu Ser Asp Ala1775 1780 1785Arg Arg Asn Gly His Glu Val Leu Ala Val Val Arg Gly Ser Ala1790 1795 1800Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn Gly1805 1810 1815Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Leu Ala Asn Ala Gly1820 1825 1830Leu Ser Ala Ala Asp Val Asp Met Val Glu Ala His Gly Thr Gly1835 1840 1845Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala Thr1850 1855 1860Tyr Gly Gln Asp Arg Pro Ala Asp Arg Pro Leu Trp Leu Gly Ser1865 1870 1875Val Lys Ser Asn Phe Gly His Thr Gly Ala Ala Ala Gly Val Ala1880 1885 1890Gly Val Ile Lys Ser Val Leu Ala Leu Arg His Gly Leu Met Pro1895 1900 1905Lys Thr Leu His Val Asp Glu Pro Thr Pro Glu Val Asp Trp Ser1910 1915 1920Ala Gly Ala Val Glu Leu Leu Thr Glu Ala Arg Gln Trp Pro Glu1925 1930 1935Thr Glu Gln Pro Arg Arg Val Gly Val Ser Ser Phe Gly Ile Ser1940 1945 1950Gly Thr Asn Ala His Leu Ile Leu Glu Glu Ala Pro Gln Ala Ala1955 1960 1965Ala Val Glu Asp Glu Arg Asp Gly Ser Val Ala Pro Val Ser Ser1970 1975 1980Pro Val Val Pro Trp Val Val Ser Gly Arg Ser Glu Thr Ala Leu1985 1990 1995Arg Ala Gln Ala Ala Arg Leu Ala Glu His Leu Ala Gln Arg Pro2000 2005 2010Glu Ala Gly Ala Leu Asp Val Gly Phe Ser Leu Val Glu Ser Arg
2015 2020 2025Ser Ala Phe Glu Gln Arg Ala Val Val Leu Gly Ala Asp Arg Glu2030 2035 2040Glu Leu Leu Ala Gly Val Arg Ala Val Gly Glu Gly Ala Gln Ala2045 2050 2055Ser Gly Val Val Thr Gly Arg Ala Ala Gln Ser Gly Val Val Phe2060 2065 2070Val Phe Pro Gly Gln Gly Ser Gln Trp Val Gly Met Gly Arg Glu2075 2080 2085Leu Trp Asp Ala Ser Pro Val Phe Ala Glu Ser Met Val Ala Cys2090 2095 2100Glu Arg Ala Leu Ala Pro Phe Val Asp Trp Ser Leu Lys Asp Val2105 2110 2115Val Phe Arg Gly Ala Glu Asp Pro Leu Trp Ala Arg Val Asp Val2120 2125 2130Val Gln Pro Val Leu Trp Ala Val Met Val Ser Leu Ala Ala Val2135 2140 2145Trp Arg Ser Phe Gly Val Glu Pro Val Ala Val Val Gly His Ser2150 2155 2160Gln Gly Glu Val Ala Ala Ala Cys Val Ala Gly Gly Leu Ser Leu2165 2170 2170Glu Asp Gly Ala Arg Val Val Ala Val Arg Ser Arg Leu Val Arg2180 2185 2190Glu Lys Leu Ser Gly Leu Gly Gly Met Gly Ser Val Ala Leu Pro2195 2200 2205Val Glu Ala Val Glu Val Arg Leu Gly Arg Phe Gly Gly Arg Val2210 2215 2220Gly Val Ala Ala Val Asn Gly Pro Thr Ser Val Val Val Ser Gly2225 2230 2235Glu Val Glu Ala Leu Asp Ala Leu Leu Ala Glu Cys Glu Glu Ala2240 2245 2250Gly Val Arg Ala Arg Arg Ile Ala Val Asp Tyr Ala Ser His Ser2255 2260 2265Ala Gln Val Asp Ala Leu Thr Asp Asp Leu Leu Ala Glu Leu Ala2270 2275 2280Glu Leu Arg Pro Gln Ser Ser Ser Val Ala Phe Tyr Ser Thr Val2285 2290 2295Thr Gly Glu Arg Leu Asp Thr Ala Gly Leu Asp Ala Arg Tyr Trp2300 2305 2310Val Thr Asn Leu Arg Glu Arg Val Asn Phe Glu Pro Val Thr Arg2315 2320 2325Leu Leu Ala Glu Arg Glu His Gln Phe Phe Val Glu Ser Ser Pro2330 2335 2340His Pro Val Leu Thr Val Ala Val Thr Glu Thr Gly Glu Ala Ala2345 2350 2355Asp Arg Ser Val Val Ala Val Gly Ser Leu Arg Arg Glu Glu Gly2360 2365 2370
Gly Val Gln Arg Leu Leu Thr Ser Leu Ala Glu Ala Tyr Val Ala2375 2380 2385Gly Val Pro Val Asp Trp Ser Lys Thr Phe His Gly Thr Gly Ala2390 2395 2400Gln Ser Val Asp Leu Pro Thr Tyr Ala Phe Gln His Gln His Tyr2405 2410 2415Trp Leu Asp Asp Val Val Leu Pro Gly Gln Gly Gly Gly Gly Ser2420 2425 2430Ser Asp Pro Ala Asp Ala Ala Phe Trp Gly Ala Val Glu Arg Ala2435 2440 2445Asp Ile Asp Ser Val Ala Ser Ile Val Asp Gly Val Asp Gln Gln2450 2455 2460Ala Trp Glu Ser Val Val Pro Ala Leu Ser Ala Trp Arg Lys Gly2465 2470 2475Arg Gln Glu Arg Ala Leu Leu Asp Ser Trp Arg Tyr Arg Thr Val2480 2485 2490Trp Arg Ser Val Thr Val Ser Ser Ala Ala Ser Leu Cys Gly Val2495 2500 2505Trp Leu Val Val Ser Ser Gly Pro Gly Ala Pro Val Glu Gln Val2510 2515 2520Thr Leu Ala Leu Thr Ala Ala Gly Ala Glu Val Arg Val Leu Asp2525 2530 2535Val Pro Val Glu Arg Gly Ala Leu Ala Glu Trp Phe Ala Glu Ala2540 2545 2550Gly Glu Val Ala Gly Val Val Ser Leu Leu Ala Trp Asp Glu Asp2555 2560 2565Glu Ala Leu Ala Ser Ser Leu Ala Leu Val Gln Ala His Gly Asp2570 2575 2580Ala Gly Leu Ser Ala Pro Val Trp Val Leu Thr Arg Gly Ala Ala2585 2590 2595Ala Val Gly Ser Asp Asp Ala Val Cys Ala Thr Gln Thr Ser Leu2600 2605 2610Trp Ala Trp Gly Gln Val Val Gly Leu Glu Leu Pro Ala Val Trp2615 2620 2625Gly Gly Leu Val Asp Val Pro Ala Glu Trp Asp Gly Arg Val Ser2630 2635 2640Ser Ala Leu Ala Ala Val Leu Ala Ala Gly Glu Gly Glu Asp Gln2645 2650 2655Val Ala Val Arg Ser Ser Gly Val Tyr Ala Arg Arg Leu Val Trp2660 2665 2670Ala Pro Leu Gly Ala Gly Ala Ala Ala Val Arg Glu Phe Lys Pro2675 2680 2685Gln Gly Thr Val Leu Ile Thr Gly Gly Thr Gly Gly Val Gly Gly2690 2695 2700His Leu Ala Arg Trp Leu Ala Arg Glu Gly Ala Glu His Leu Leu2705 2710 2715
Leu Val Asn Arg Thr Gly Glu Gly Ala Ala Glu Leu Leu Glu Glu2720 2725 2730Leu Arg Gly Ser Gly Ala Glu Val Thr Val Ala Ala Cys Asp Val2735 2740 2745Thr Asp Arg Ala Ala Leu Ala Glu Leu Leu Ala Gly Ile Pro Ala2750 2755 2760Glu Arg Pro Leu Thr Ala Val Phe His Ala Ala Gly Val Ala Gly2765 2770 2775Tyr Gly Leu Val Arg Glu Leu Asp Ala Ala Asp Leu Asp Ala Glu2780 2785 2790Met Ala Ala Lys Thr Leu Gly Ala Arg His Leu Asp Glu Leu Thr2795 2800 2805Ala Glu Leu Gly Leu Asp Leu Glu Ala Phe Val Leu Phe Ser Ser2810 2815 2820Gly Ala Ala Val Trp Gly Ser Ala Gly Ser Gly Gly Tyr Ala Ala2825 2830 2835Ala Asn Gly Tyr Leu Asp Gly Leu Ala Gln Glu Arg Arg Ala Arg2840 2845 2850Gly Leu Ala Ala Thr Ser Val Ser Trp Gly Asn Trp Lys Asp Thr2855 2860 2865Gly Leu Ala Thr Asp Thr Thr Ala Glu Gln Leu Ala Arg Leu Gly2870 2875 2880Val Arg Pro Met Asp Pro Ala Leu Ala Val Ala Ala Leu Arg Gln2885 2890 2895Val Leu Glu His Asp Glu Ile Ala Leu Thr Val Thr Asp Met Asp2900 2905 2910Trp Ala Arg Phe Ala Pro Gly Tyr Thr Leu Ala Arg Arg Arg Pro2915 2920 2925Leu Ile Glu Asp Ile Pro Glu Ala Thr Arg Ala Leu Ser Glu Asp2930 2935 2940Ser Ala Asp Pro Ala Asn Asp Met Ala Gly Ala Ala Leu Arg Ala2945 2950 2955Glu Leu Glu Gly Leu Gly Arg Ala Glu Gln Leu Ala Val Leu Met2960 2965 2970Asp Leu Val Arg Ser Glu Val Thr Arg Ile Leu Ala Gly Ala Ser2975 2980 2985Ala Ala Asp Ile Thr Pro Glu Arg Pro Phe Lys Glu Leu Gly Phe2990 2995 3000Asp Ser Leu Thr Ala Met Glu Leu Arg Asn Leu Leu Thr Ile Ala3005 3010 3015Thr Gly Leu Arg Leu Pro Ala Thr Leu Val Phe Asp Tyr Pro Asn3020 3025 3030Pro Arg Gln Leu Ala Ala His Leu Cys Asp Glu Leu Ile Gly Val3035 3040 3045Gly Ala Asp Pro Val Gly Ala Asp Val Val Val Arg Gly Ser Ser3050 3055 3060Asp Glu Pro Leu Ala Val Val Gly Met Ala Cys Arg Tyr Ala Gly
3065 3070 3075Gly Val Ser Thr Pro Glu Asp Leu Trp Gln Met Val Ala Glu Asn3080 3085 3090Arg Glu Gly Leu Thr Asp Val Pro Ser Tyr Arg Gly Trp Glu Gly3095 3100 3105Trp Asn Val Ala Ser Leu Arg Arg Ala Gly Phe Leu His Glu Ala3110 3115 3120Gly Asp Phe Asp Ala Gly Phe Phe Gly Ile Ser Pro Arg Glu Ala3125 3130 3135Ala Thr Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Val Ser Trp3140 3145 3150Glu Ala Val Glu Arg Ala Gly Ile Asp Pro Lys Ser Leu Arg Gly3155 3160 3165Ser Asp Thr Gly Val Phe Val Gly Gly Thr Ala Val Glu Tyr Gly3170 3175 3180Ala Leu Leu Met Asn Ser Pro Thr Gly Gln Gly Tyr Ala Val Thr3185 3190 3195Ser Ser Ser Gly Ser Val Leu Ser Gly Arg Val Ser Tyr Thr Leu3200 3205 3210Gly Leu Glu Gly Pro Ala Val Ser Val Asp Thr Ala Cys Ser Ser3215 3220 3225Ser Leu Val Ala Leu His Leu Ala Ala Gln Ala Leu Arg Asn Gly3230 3235 3240Glu Cys Gly Leu Ala Leu Thr Gly Gly Val Gly Leu Met Ala Thr3245 3250 3255Pro Gly Gly Phe Val Glu Phe Asp Thr Leu Gly Gly Leu Ser Ser3260 3265 3270Asp Gly His Thr Lys Ala Phe Ala Ala Ser Ala Asp Gly Ile Gly3275 3280 3285Trp Gly Glu Gly Val Gly Met Ile Val Leu Glu Arg Leu Ser Asp3290 3295 3300Ala Arg Arg Asn Gly His Glu Val Leu Ala Val Val Arg Gly Ser3305 3310 3315Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Ser Ala Pro Asn3320 3325 3330Gly Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Val Ala Asn Ala3335 3340 3345Gly Leu Thr Leu Ala Asp Ile Asp Met Val Glu Ala His Gly Thr3350 3355 3360Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Asn3365 3370 3375Thr Tyr Gly Gln Glu Arg His Asp Gly Gln Pro Leu Trp Leu Gly3380 3385 3390Ser Val Lys Thr Asn Ile Gly His Thr Gly Ala Ala Ala Gly Val3395 3400 3405Ala Gly Ile Ile Lys Ser Val Leu Ala Leu Arg Asn Gly Val Met3410 3415 3420
Pro Met Thr Leu Asn Val Asp Gly Pro Thr Pro Lys Val Asp Trp3425 3430 3435Ser Ala Gly Ala Val Glu Leu Leu Thr Gln Gly Arg Glu Trp Pro3440 3445 3450Gln Thr Asp Arg Thr Arg Arg Ala Gly Val Ser Ser Phe Gly Ile3455 3460 3465Ser Gly Thr Asn Ala His Val Ile Ile Glu Glu Ala Pro Pro Ala3470 3475 3480Glu Glu Pro Pro Ala Gln Pro Gly Thr Asp Leu Pro Ala Ala Pro3485 3490 3495Ala Leu Ala Thr Pro Val Val Pro Trp Val Phe Ser Gly Arg Ser3500 3505 3510Asn Gly Ala Leu Arg Gly Gln Ala Glu Arg Leu Ser Ala Leu Ala3515 3520 3525Glu Asn Glu Pro Gly Leu Asp Leu Thr Asp Ala Ala Phe Ser Leu3530 3535 3540Ala Thr Thr Arg Ala Ser Leu Glu His Arg Ala Val Val Leu Gly3545 3550 3555Arg Asp Thr Ser Glu Met Leu Asp Gly Leu Arg Gly Leu Thr Ala3560 3565 3570Gln Gly Ser Val Ala Gly Val Val Ser Gly Val Thr Ala Ala Asp3575 3580 3585Ser Arg Ala Val Phe Val Phe Pro Gly Gln Gly Ser Gln Trp Val3590 3595 3600Gly Met Gly Arg Glu Leu Trp Glu Val Ser Ser Val Phe Ala Glu3605 3610 3615Ser Met Val Ala Cys Glu Arg Ala Leu Val Pro Phe Val Asp Trp3620 3625 3630Ser Leu Arg Asp Val Val Phe Gly Gly Gly Gly Asp Gly Leu Trp3635 3640 3645Glu Arg Val Asp Val Val Gln Pro Val Leu Trp Ala Val Met Val3650 3655 3660Ser Leu Ala Ala Val Trp Arg Ser Phe Gly Val Glu Pro Ala Ala3665 3670 3675Val Val Gly His Ser Gln Gly Glu Val Ala Ala Ala Cys Val Ala3680 3685 3690Gly Gly Leu Ser Leu Glu Asp Gly Ala Arg Val Val Ala Val Arg3695 3700 3705Ser Val Gly Leu Ser Val Gly Glu Val Glu Glu Trp Leu Ala Gly3725 3730 3735Leu Gly Gly Arg Val Gly Val Ala Ala Val Asn Gly Pro Ser Ser3740 3745 3750Val Val Val Ser Gly Glu Ala Glu Val Leu Glu Gly Leu Leu Ala3755 3760 3765
Gly Phe Glu Gly Ala Gly Val Arg Ala Arg Arg Ile Ala Val Asp3770 3775 3780Tyr Ala Ser His Ser Val Gln Val Asp Ala Leu Gly Asp Asp Leu3785 3790 3795Leu Ala Gly Leu Ala Gly Ile Arg Pro Val Ser Ser Ser Val Ala3800 3805 3810Phe Tyr Ser Thr Val Ser Gly Glu Arg Met Asp Thr Ala Gly Leu3815 3820 3825Asp Ala Gly Tyr Trp Val Ala Asn Leu Arg Glu Arg Val Leu Phe3830 3835 3840Glu Pro Val Val Arg Met Leu Val Glu Arg Gly Ser Ala Val Phe3845 3850 3855Val Glu Ser Ser Pro His Pro Val Leu Ala Met Ala Val Gln Glu3860 3865 3870Thr Gly Glu Ala Val Gly Arg Ser Val Val Ala Val Gly Ser Leu3875 3880 3885Arg Arg Asp Asp Gly Gly Ala Gly Arg Phe Leu Ala Ser Leu Ala3890 3895 3900Glu Ala Tyr Val Val Gly Ala Pro Val Asp Trp Ser Val Leu Phe3905 3910 3915Ala Gly Ala Gly Ala Arg Arg Val Asp Leu Pro Thr Tyr Ala Phe3920 3925 3930Gln His Gln Arg Tyr Trp Leu Glu Gly Val Thr Val Gly Gly Glu3935 3940 3945Pro Gln Asp Thr Val Glu Asp Asp Thr Asp Ala Ala Phe Trp Asp3950 3955 3960Ala Val Glu Arg Glu Ser Leu Ser Asp Leu Ala Glu Val Leu Asp3965 3970 3975Val Ser Asp Ala Gly Ala Ala Ala Glu Ala Trp Leu Pro Thr Leu3980 3985 3990Ser Ala Trp Arg Lys Gly Arg Arg Arg Gln Met Thr Leu Asp Ser3995 4000 4005Trp Arg Tyr Arg Thr Thr Trp Arg Ala Tyr Ser Leu Pro Ser Gly4010 4015 4020Thr Arg Leu Ser Gly Met Trp Val Val Val Ala Ser Gly Gly Asp4025 4030 4035Ala Pro Val Val Glu Val Arg Arg Ala Leu Glu Ala Ala Gly Ala4040 4045 4050Glu Val Ser Val Arg Glu Val Leu Asp Gly Val Ala Leu Ala Asp4055 4060 4065Val Ser Gly Val Val Ser Leu Leu Ala Trp Asp Glu Gly Ser Ala4070 4075 4080Leu Glu Ser Met Leu Arg Leu Val Arg Ala Val Gly Gly Gly Glu4085 4090 4095Val Pro Leu Trp Val Leu Thr Arg Gly Ala Ala Val Val Gly Val4100 4105 4110Asp Asp Pro Val Ser Ala Val Gln Ser Gln Val Trp Ala Leu Gly
4115 4120 4125Gln Val Val Gly Leu Glu Gln Pro Gln Gly Trp Gly Gly Leu Val4130 4135 4140Asp Val Pro Gly Val Trp Asp Glu Arg Val Ala Ser Leu Leu Ala4145 4150 4155Gly Val Leu Ala Ala Gly Glu Gly Glu Asp Gln Val Ala Val Arg4160 4165 4170Ser Ser Gly Val Tyr Gly Arg Arg Leu Val Arg Ala Pro Leu Gly4175 4180 4185Gly Ser Pro Val Pro Val Arg Glu Trp Gly Pro Ser Gly Thr Val4190 4195 4200Leu Val Thr Gly Gly Thr Gly Gly Ile Gly Gly His Leu Ala Arg4205 4210 4230Trp Leu Ala Lys Glu Gly Ala Glu His Leu Leu Leu Val Ser Arg4220 4225 4230Gly Glu Arg Ala Gln Gly Ala Ala Glu Leu Val Glu Glu Val Arg4235 4240 4245Gly Leu Gly Ala Glu Val Thr Val Ala Ala Cys Asp Val Thr Asp4250 4255 4260Arg Ala Ala Leu Ala Glu Leu Leu Ala Glu His Pro Val Thr Ser4265 4270 4275Ile Phe His Thr Ala Gly Ile Ala Ala His Gly Phe Leu Thr Asp4280 4285 4290Leu Asp Pro Ala Glu Leu Gly Asp Gln Met Gly Ala Arg Val Val4295 4300 4305Gly Ala Arg His Leu Asp Glu Leu Ser Val Glu Leu Gly Leu Asp4310 4315 4320Leu Asp Ala Phe Val Val Phe Ser Thr Gly Ala Ser Val Trp Gly4325 4330 4335Ser Ala Gly Asn Gly Ala Asn Ala Ala Ala Gly Gly Tyr Leu Asp4340 4345 4350Gly Leu Ile Arg Gly Arg Arg Ala Arg Gly Leu Val Gly Ser Ser4355 4360 4365Val Ser Trp Gly Gly Trp Gly Ala Thr Ala Met Ala Val Gly Glu4370 4375 4380Thr Ala Glu Arg Leu Ser Arg Arg Gly Val Arg Leu Leu Glu Pro4385 4390 4395Glu Leu Ala Val Arg Ala Leu Arg Gln Val Leu Glu Gln Asp Glu4400 4405 4410Val Ser Val Thr Val Ala Asp Leu Asp Trp Ser Leu Phe Thr Pro4415 4420 4425Gly Tyr Ala Met Ala Arg Arg Arg Pro Leu Ile Glu Asp Ile Pro4430 4435 4440Glu Ala Ala Arg Ala Leu Arg Asp Ile Thr Glu Thr Asp Glu Thr4445 4450 4455Gln Asp Ala Ala Ala Gly Gly Leu Arg Glu Arg Leu Ala Gly Leu4460 4465 4470
Ala Glu Ser Glu Gln Gln Ala Leu Leu Leu Gly Leu Val Arg Gly4475 4480 4485Glu Ala Ala Gln Val Leu Ala His Gly Ser Thr Ala Glu Ile Thr4490 4495 4500Pro Ser Arg Pro Phe Lys Glu Leu Gly Phe Asp Ser Leu Thr Gly4505 4510 4515Met Glu Leu Arg Asn Arg Leu Ser Lys Ala Thr Gly Leu Arg Leu4520 4525 4530Pro Ala Thr Leu Val Phe Asp Tyr Pro Asn Pro Gln Arg Val Thr4535 4540 4545Asp Leu Leu Leu Thr Asp Leu Asp Gln Gln Asp Gly Arg Pro Gly4550 4555 4560Ile Ala Asp Val Leu Asp Ile Lys Arg Glu Leu Ser Arg Ile Gly4565 4570 4575Glu Ala Leu Glu Gly Val Ala Pro Asp Gln Gln Ala Arg Glu Asp4580 4585 4590Ile Val Ala His Leu Arg Asp Leu Ile Thr Gln Leu Ser Ala Thr4595 4600 4605Glu Gln His Gly Ala Thr Asp Leu Glu Ala Ala Thr Asp Asp Glu4610 4615 4620Ile Phe Asp Phe Ile Asp Arg Asp Leu Gly Val Ser4625 4630 4635<210>8<211>3593<212>PRT<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)<400>1Met Thr Glu Asp Lys Leu Arg Thr Tyr Leu Arg Arg Val Thr Ala Glu1 5 10 15Leu Gln Gln Thr Arg Gln Gln Leu Lys Asp Ser Gln Asp Arg Gly Arg20 25 30Glu Pro Leu Ala Ile Val Gly Met Ala Cys Arg Leu Pro Gly Gly Ala35 40 45Asp Ser Pro Glu Gln Leu Trp Gln Met Val Arg Asp Gly Ala Asp Gly50 55 60Val Gly Gly Phe Pro Asp Asp Arg Gly Trp Asp Leu Thr Ser Leu Leu65 70 75 80Ser Asp Asp Pro Asp Arg Pro Gly Thr Thr Tyr Thr Gln Glu Gly Ala85 90 95Phe Leu Lys Gly Ala Gly Asp Phe Asp Ala Gly Leu Phe Gly Ile Ser100 105 110Pro Arg Glu Ala Ala Thr Met Asp Pro Gln Gln Arg Leu Leu Leu Glu115 120 125Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly Ile Asp Pro His Ser Leu130 135 140Arg Gly Ser Arg Thr Gly Val Phe Val Gly Gly Thr Ala Ile Glu His145 150 155 160
Ile Val Lys Leu Met Asn Ser Pro Thr Asp Gln Gly Tyr Ala Ile Thr165 170 175Gly Gly Ser Gly Ser Ile Met Ser Gly Arg Ile Ser Tyr Val Leu Gly180 185 190Leu Glu Gly Pro Ala Val Thr Ile Asp Thr Ala Cys Ser Ser Ser Leu195 200 205Val Ala Leu His Ser Ala Val Gln Ser Leu Arg Gln Gly Asp Cys Ser210 215 220Leu Ala Leu Ala Gly Gly Val Ala Val Met Ala Thr Pro Ser Ala Phe225 230 235 240Val Thr Phe Ala Arg Gln Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys245 250 255Ala Phe Ser Asp Asp Ala Asp Gly Ile Gly Trp Gly Glu Gly Val Ala260 265 270Val Val Leu Leu Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Glu275 280 285Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser290 295 300Asn Gly Leu Ser Ala Pro Asn Gly Pro Ser Gln Gln Arg Val Ile Arg305 310 315 320Gln Ala Val Ala Asn Ala Gly Leu Thr Leu Ala Asp Val Asp Met Val325 330 335Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro Ile Glu Ala Gln340 345 350Ala Leu Leu Asn Thr Tyr Gly Gln Glu Arg His Asp Gly Gln Pro Leu355 360 365Trp Leu Gly Ser Leu Lys Ser Asn Ile Ala His Thr Gln Gly Val Ser370 375 380Gly Val Ala Gly Val Ile Lys Thr Val Leu Ala Leu Arg His Gly Ile385 390 395 400Leu Pro Lys Thr Leu His Val Gly Glu Arg Ser Ser Gln Val Asp Trp405 410 415Ser Val Gly Ala Val Glu Leu Leu Thr Glu Ala Arg Glu Trp Pro Glu420 425 430Thr Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ile Ser Gly435 440 445Thr Asn Val His Val Ile Ile Glu Gln Ala Pro Gln Glu Glu Ser Ala450 455 460Glu Pro Arg Thr Asp Glu Ala Pro Ser Leu Glu Ser Pro Phe Ala Thr465 470 475 480Lys Pro Ala Thr Leu Pro Trp Leu Ile Ser Gly Asn Thr Glu Ala Ala485 490 495Leu Arg Glu Gln Ala Ala Arg Leu Arg Ala His Leu Asn Ala His Pro500 505 510Gly Leu Ala Ala Ala Asp Ile Gly His Ser Leu Leu Thr Ser Arg Thr515 520 525Arg Phe Ala His Arg Ala Val Leu Leu Thr Glu Gln Asp Gly Asp Arg
530 535 540Arg Thr Ala Leu Thr Ala Leu Ala Asp Gly Leu Asp Ala Pro Gly Leu545 550 555 560Ile Arg Gly Thr Gly Asp Thr Gly Ala Gly Val Val Phe Val Phe Pro565 570 575Gly Gln Gly Ser Gln Trp Val Gly Met Gly Arg Glu Leu Trp Glu Val580 585 590Ser Ser Val Phe Ala Glu Ser Met Val Ala Cys Glu Arg Ala Leu Ala595 600 605Pro Phe Val Gly Trp Ser Leu Arg Asp Val Val Phe Glu Gly Gly Gly610 615 620Glu Gly Leu Trp Gly Arg Val Asp Val Val Gln Pro Val Leu Trp Ala625 630 635 640Val Met Val Ser Leu Ala Ala Val Trp Arg Ser Phe Gly Val Glu Pro645 650 655Val Gly Val Val Gly His Ser Gln Gly Glu Val Ala Ala Ala Cys Val660 665 670Ala Gly Gly Leu Ser Leu Glu Asp Gly Ala Arg Val Val Ala Val Arg675 680 685Ser Arg Leu Val Gly Glu Arg Leu Ser Gly Arg Gly Gly Met Val Ser690 695 700Val Thr Leu Pro Val Ala Gln Val Glu Glu Trp Leu Ala Gly Ser Gly705 710 715 720Gly Arg Val Gly Val Ala Ala Val Asn Gly Pro Ser Ser Val Val Val725 730 735Ser Gly Glu Val Glu Ala Leu Asp Gly Leu Leu Val Glu Leu Asp Gly740 745 750Ala Gly Val Arg Ala Arg Arg Ile Ala Val Asp Tyr Ala Ser His Ser755 760 765Ala Gln Val Asp Ala Leu Asn Asp Asp Leu Leu Ala Gly Leu Ala Asp770 775 780Ile Arg Pro Val Ser Ser Pro Val Ala Phe Tyr Ser Thr Val Thr Gly785 790 795 800Glu Arg Met Asp Thr Ala Gly Leu Asp Ala Ala Tyr Trp Ala Ala Asn805 810 815Leu Arg Glu Arg Val Leu Phe Glu Pro Val Val Arg Thr Leu Ala Glu820 825 830Leu Glu His Gln Val Phe Val Glu Ser Ser Pro His Pro Val Leu Ala835 840 845Met Ala Val Gln Glu Thr Leu Glu Ser Ala Ser Gly Ala Gly Ala Ala850 855 860Val Gly Ser Leu Arg Arg Asp Asp Gly Gly Ala Gly Arg Phe Leu Ala865 870 875 880Ser Leu Ala Glu Ala Tyr Val Ala Gly Ala Pro Val Asp Trp Ser Val885 890 895Leu Phe Glu Gly Thr Gly Thr Arg Arg Val Asp Leu Pro Thr Tyr Ala900 905 910
Phe Gln His Gln Arg Tyr Trp Leu Glu Asp Ala Ser Ala Pro Gly Ala915 920 925Glu Gly Val Val Asp Pro Val Asp Ala Ala Phe Trp Gly Ala Val Glu930 935 940Arg Ala Asp Val Gln Gly Val Ala Ala Leu Val Asp Gly Ser Val Pro950 955 960Gly Val Trp Glu Pro Val Val Pro Val Leu Ser Ala Trp Arg Lys Gly965 970 975Arg Glu Glu Arg Ser Val Leu Asp Ser Trp Arg Tyr Arg Thr Thr Trp980 985 990Arg Ala Phe Ser Leu Pro Ser Gly Thr Arg Leu Ser Gly Met Trp Leu995 10001005Val Val Ala Ser Gly Gly Asp Ala Pro Val Asp Glu Val Arg Gln1010 1015 1020Ala Leu Glu Ala Ala Gly Ala Glu Val Cys Val Arg Ala Asp Leu1025 1030 1035Asp Gly Ala Ala Leu Ala Gly Val Ser Gly Val Val Ser Leu Leu1040 1045 1050Ala Trp Asp Glu Gly Ser Ala Val Val Ser Thr Val Gly Leu Val1055 1060 1065Gln Ala Cys Gly Gly Gly Gly Glu Val Pro Leu Trp Val Leu Thr1070 1075 1080Arg Gly Ala Ala Val Val Gly Val Asp Asp Pro Val Ser Ala Val1085 1090 1095Gln Ser Gln Val Trp Ala Leu Gly Gln Val Val Gly Leu Glu Gln1100 1105 1110Pro Gly Gly Trp Gly Gly Leu Val Asp Val Pro Gly Val Trp Asp1115 1120 1125Glu Arg Val Ala Ser Leu Leu Ala Gly Val Leu Ala Ala Gly Gly1130 1135 1140Gly Glu Asp Gln Val Ala Val Arg Ser Ser Gly Ala Tyr Gly Arg1145 1150 1155Arg Leu Val Arg Ala Pro Leu Gly Ala Ser Pro Val Arg Val Arg1160 1165 1170Glu Trp Ser Pro Ser Gly Thr Ala Leu Val Thr Gly Gly Thr Gly1175 1180 1185Gly Ile Gly Gly His Leu Ala Arg Trp Leu Ala Arg Glu Gly Val1190 1195 1200Gly His Leu Leu Leu Val Ser Arg Arg Gly Pro Glu Ala Glu Gly1205 1210 1215Val Ala Glu Leu Val Glu Glu Leu Gly Gly Leu Gly Val Glu Val1220 1225 1230Thr Val Val Ala Cys Asp Val Thr Asp Arg Ala Ala Leu Ala Glu1235 1240 1245Leu Leu Ala Thr Ile Pro Ala Glu Tyr Pro Leu Thr Ser Val Phe1250 1255 1260
His Ala Ala Gly Ile Ala Gly Tyr Gly Leu Val Arg Glu Leu Asp1265 1270 1275Ala Ala Gly Leu Asp Ala Glu Met Ala Ala Lys Thr Leu Gly Ala1280 1285 1290Arg His Leu Asp Glu Leu Thr Ala Glu Leu Gly Leu Asp Leu Asp1295 1300 1305Ala Phe Val Val Phe Ser Ser Gly Ala Ala Val Trp Gly Ser Ala1310 1315 1320Gly Ser Gly Gly Tyr Ala Ala Ala Asn Ala Tyr Leu Asp Gly Leu1325 1330 1335Ala Arg Glu Arg Arg Ala Arg Gly Leu Val Ala Thr Ser Val Ser1340 1345 1350Trp Gly Asn Trp Lys Asn Thr Gly Leu Ala Thr Asp Thr Thr Ala1355 1360 1365Glu Gln Leu Thr Arg Ile Gly Val Arg Pro Met Glu Pro Glu Leu1370 1370 1380Ala Val Arg Ala Leu Arg Gln Ala Leu Glu Gln Asp Glu Val Ser1385 1390 1395Met Thr Val Ala Asp Met Asp Trp Ser Leu Phe Thr Pro Gly Tyr1400 1405 1410Ala Leu Ala Arg Arg Arg Pro Leu Ile Glu Glu Ile Pro Glu Ala1415 1420 1425Ala Arg Ala Leu Ser Glu Asp Ser Ala Asp Pro Ala Asn Asp Thr1430 1435 1440Val Gly Gly Asp Ser Pro Leu Arg Gln Ser Leu Ala Ala Leu Thr1445 1450 1455Glu Ser Glu Gln His Glu Arg Leu Leu Gly Ala Val Arg Thr Glu1460 1465 1470Ala Ala Ala Val Leu Thr His Ser Thr Thr Asp Glu Ile Thr Ala1475 1480 1485Gly Lys Pro Phe Arg Asp Leu Gly Phe Asp Ser Leu Thr Ala Met1490 1495 1500Glu Leu Arg Asn Arg Leu Asn Ala Ala Thr Gly Leu Arg Leu Pro1505 1510 1515Ala Thr Ile Val Phe Asp Tyr Pro Thr Pro Arg Arg Leu Ala Gly1520 1525 1530His Leu His Asp Lys Leu Phe Asp Ser Gly Ala Glu Val Ala Leu1535 1540 1545Pro Gln Leu Arg Ala Thr Asp Asp Asp Pro Ile Val Ile Val Gly1550 1555 1560Met Ala Cys Arg Phe Pro Gly Gly Val Arg Gly Pro Glu Asp Leu1565 1570 1575Trp Arg Leu Leu Ala Glu Gly Arg Asp Glu Met Thr Glu Phe Pro1580 1585 1590Ala Asp Arg Gly Trp Gln Gly Pro Ala Met Asn Ala Phe Val Glu1595 1600 1605Glu Phe Gly Gly Ala Arg Gln Gly Ala Phe Leu Ala Asp Ala Ala
1610 1615 1620Glu Phe Asp Ala Ala Phe Phe Gly Ile Ser Pro Arg Glu Ala Arg1625 1630 1635Ala Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Thr Ser Trp Glu1640 1645 1650Val Leu Glu Arg Ala Gly Tyr Asp Pro Val Ser Leu Arg Gly Ser1655 1660 1665Arg Thr Gly Val Phe Val Gly Gly Thr Pro Gln Glu Tyr Thr Thr1670 1675 1680Val Leu Met Asn Ser Ala Glu Ala Gly Ser Gly Tyr Ala Leu Thr1685 1690 1695Gly Thr Ser Gly Ser Val Met Ser Gly Arg Val Ala Tyr Thr Leu1700 1705 1710Gly Leu Glu Gly Pro Ala Val Thr Ile Asp Thr Ala Cys Ser Ser1715 1720 1725Ser Leu Val Thr Leu His Leu Ala Ala Gln Ala Leu Arg Gly Gly1730 1735 1740Glu Cys Asp Leu Ala Leu Val Gly Gly Val Thr Val Met Ala Thr1745 1750 1755Pro Gly Ala Phe Val Glu Phe Ala Arg Gln Gly Gly Leu Ala Gly1760 1765 1770Asp Gly Arg Cys Lys Ala Phe Ala Ala Gly Ala Asp Gly Thr Gly1775 1780 1785Trp Gly Glu Gly Val Gly Met Leu Ala Val Gln Arg Leu Ser Asp1790 1795 1800Ala Val Arg Asp Gly Arg Arg Val Leu Ala Val Val Arg Gly Ser1805 1810 1815Ala Val Asn Ser Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn1820 1825 1830Gly Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Leu Ala Ser Ala1835 1840 1845Gly Leu Ser Ala Ala Asp Val Asp Val Val Glu Gly His Gly Thr1850 1855 1860Gly Thr Ala Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala1865 1870 1875Thr Thr Gly Gln Asp Arg Pro Ala Asp Arg Pro Leu Trp Leu Gly1880 1885 1890Ser Val Lys Ser Asn Ile Gly His Thr Gln Tyr Ala Ala Gly Val1895 1900 1905Ala Gly Val Ile Lys Ala Val Leu Ala Leu Gln His Arg Leu Leu1910 1915 1920Pro Lys Thr Leu His Val Glu Glu Pro Thr Pro Glu Val Asp Trp1925 1930 1935Ser Ser Gly Ala Val Gly Val Leu Thr Glu Ala Arg Glu Trp Pro1940 1945 1950Glu Thr Gly Arg Pro Arg Arg Ala Gly Val Ser Ala Phe Gly Ile1955 1960 1965
Ser Gly Thr Asn Ala His Val Ile Leu Glu Gln Ala Pro Glu Ala1970 1975 1980Val Glu Glu Ser Ala Ser Gly Glu Thr Gly Ser Val Leu Val Pro1985 1990 1995Trp Val Ile Ser Ala Arg Ser Glu Gln Ala Leu Arg Glu Gln Ala2000 2005 2010Arg Arg Leu Ala Gly His Leu Arg Ala His Asp Leu Arg Pro Val2015 2020 2025Asp Val Gly Phe Ser Leu Ala Thr Thr Arg Ala Gly Leu Glu His2030 2035 2040Arg Ala Val Leu Val Gly Arg Glu Thr Ser Glu Phe Leu Ala Gln2045 2050 2055Leu Glu Thr Val Ala Gly Asp Gly Pro Val Ser Glu Gly Gly Thr2060 2065 2070Ala Phe Leu Phe Ser Gly Gln Gly Ser Gln Arg Ala Gly Met Gly2075 2080 2085Arg Glu Leu Tyr Glu Ala Tyr Pro Val Phe Ala Ala Ala Phe Asp2090 2095 2100Glu Val Cys Gly His Leu Asp Val Leu Leu Glu Arg Pro Val Lys2105 2110 2115Glu Val Val Phe Ala Gly Gly Lys Ala Leu Asp Arg Thr Val Phe2120 2125 2130Thr Gln Ala Gly Leu Phe Ala Leu Glu Val Ala Leu Phe Glu Leu2135 2140 2145Val Gly Ser Trp Gly Val Arg Ala Asp Val Leu Leu Gly His Ser2150 2155 2160Ile Gly Glu Leu Ala Ala Ala Tyr Ala Ala Gly Val Trp Ser Leu2165 2170 2175Glu Asp Ala Cys Arg Val Val Ala Ala Arg Gly Arg Leu Met Gln2180 2185 2190Ala Leu Pro Glu Gly Gly Val Met Val Ala Val Glu Ala Ala Glu2195 2200 2205Glu Glu Leu Pro Gln Leu Pro Ala Gly Val Ser Val Ala Ala Val2210 2215 2220Asn Gly Pro Arg Ser Leu Val Leu Ser Gly Asp Asp Glu Pro Val2225 2230 2235Thr Ala Leu Ala Gln Thr Phe Ala Gly Gln Gly Arg Arg Thr Arg2240 2245 2250Arg Leu Thr Val Ser His Ala Phe His Ser Ala Trp Met Glu Pro2255 2260 2265Met Leu Ala Asp Phe Ala Glu Val Leu Gly Ser Val Glu Phe Arg2270 2275 2280Ala Pro Arg Ile Pro Val Val Ser Asn Val Thr Gly Gln Val Ala2285 2290 2295Gly Glu Glu Leu Ala Thr Pro Asp Tyr Trp Val Arg His Val Arg2300 2305 2310
Glu Ala Val Arg Phe Ala Asp Gly Val Thr Thr Val Leu Gly Arg2315 2320 2325Gly Val Asp Lys Phe Leu Glu Leu Gly Pro Gly Gly Ala Leu Thr2330 2335 2340Ala Met Ala Glu Glu Ala Leu Asp His Thr Gly Thr Asp Ala Val2345 2350 2355Cys Ala Pro Val Leu His Pro Glu His Pro Glu Ala Ser Ser Ala2360 2365 2370Val Arg Gly Leu Gly Arg Ile Tyr Ala Val Gly Ala Pro Ala Asp2375 2380 2385Trp Ser Ala Leu Phe Ala Gly Thr Gly Ala Arg Arg Val Asp Leu2390 2395 2400Pro Thr Tyr Ala Phe Gln Arg Arg Arg Phe Trp Leu Asp Ser Leu2405 2410 2415Ala Thr Gly Ser Gly Asp Pro Ala Ser Leu Gly Leu Thr Thr Thr2420 2425 2430Gly His Pro Leu Leu Gly Ala Gly Val Arg Leu Pro Asp Ser Asp2435 2440 2445Gly Phe Leu Phe Thr Gly Arg Leu Ser Leu Ala Thr Gln Pro Trp2450 2455 2460Ile Ala Gln His Ala Leu Leu Gly Thr Ala Leu Leu Pro Gly Thr2465 2470 2475Ala Phe Val Glu Leu Ala Leu Arg Ala Gly Ala Glu Ser Gly Cys2480 2485 2490Glu Val lle Glu Glu Leu Thr Leu Glu Ala Pro Leu Val Leu Glu2495 2500 2505Glu His Gly Gly Arg Ala Val His Val Thr Val Gly Gly Leu Asp2510 2515 2520Glu Ser Gly Arg Arg Thr Ile Thr Leu His Ser Arg Pro Asp Gly2525 2530 2435Ala Asp Asp Asp Glu Ser Trp Leu Arg His Ala Thr Gly Val Leu2540 2545 2550Val Glu Arg Arg Glu Thr Glu Ser Ala Asp Ala Pro Thr Glu Gly2555 2560 2565Val Trp Pro Pro Asp Gly Ala Thr Gln Ile Ser Val Gln Asp Phe2570 2575 2580Tyr Pro Asp Met Ala Glu Ala Gly Phe Thr Tyr Gly Pro Val Phe2585 2590 2595Gln Gly Leu Arg Val Leu Trp Ser Lys Asp Gly Glu Leu Phe Ala2600 2605 2610Glu Val Arg Leu Pro Asp Glu Ala Gly Glu Ala Gly Asp Glu Gly2615 2620 2625Ser Gly Phe Gly Val His Pro Ala Leu Leu Asp Ala Ala Leu Gln2630 2635 2640Pro Leu Ala Leu Ser Val Leu Gly Gly Thr Asp Gly Arg Gln Pro2645 2650 2655Val Lys Gly Gly Met Pro Phe Val Trp Thr Gly Val Arg Leu His
2660 2665 2670Ala Thr His Ala Thr Val Ala Arg Val Lys Leu Ala Pro Val Gly2675 2680 2685Arg Ser Glu Val Ser Val Val Val Thr Asp Asp Ser Gly Leu Pro2690 2695 2700Ile Ala Thr Val Asp Ser Leu Ala Met Arg Asp Pro Ile Leu Glu2705 2710 2715Gln Phe Thr Ala Ser Ala Pro Arg Gln Asp Ala Leu Phe Gly Val2720 2725 2730Arg Trp Thr Pro Ile Pro Leu Ala Ala His Ala Glu Pro Gly Glu2735 2740 2745Trp Ala Met Leu Gly Phe Asp Pro Leu Glu Ile Arg Gln Arg Leu2750 2755 2760Val Glu Ala Gly Leu Thr Gly Thr Pro Tyr Leu Asp Pro Gln Sar2765 2770 2775Leu Ile Asp Thr Val Glu Ser Gly Lys Pro Val Pro Pro Val Val2780 2785 2790Ala Val Ser Cys Phe Gly Gly Gly Gly Ser Thr Val Thr Ala Thr2795 2800 2805His Glu Ala Val Gly Arg Ala Leu Gly Val Leu Gln His Trp Leu2810 2815 2820Ala Asp Ala Arg Leu Met Ser Ser Arg Leu Val Leu Leu Thr Arg2825 2830 2835Gly Ala Val Pro Ala Val Asp Thr Asp Arg Ile Glu Asp Leu Ala2840 2845 2850Ala Ser Ala Val Trp Gly Leu Val Arg Ala Ala Gln Ser Glu His2855 2860 2865Pro Asp Arg Ile Val Leu Ile Asp Leu Asp Asp Asp Pro Thr Ser2870 2875 2880Tyr Arg Ala Leu Pro Ala Ala Leu Gly Thr Gly Glu Pro Gln Leu2885 2890 2895Ala Leu Arg Thr Gly Ala Ala Ser Ala Pro Arg Leu Ala Arg His2900 2905 2910Thr Gly Ala Pro Glu Val Thr Pro Gly Phe Gly Pro Asp Gly Thr2915 2920 2925Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly Ala Val Val Ala2930 2935 2940Arg His Leu Ala Ala Ala His Gly Val Arg His Leu Val Leu Ala2945 2950 2955Ser Arg Ser Gly Ala Glu Ala Ser Gly Ala Asp Ala Leu Leu Ala2960 2965 2970Asp Leu Thr Glu Leu Gly Ala Asp Ala Thr Ile Val Ala Cys Asp2975 2980 2985Val Ser Asp Arg Ala Ala Leu Ala Ala Leu Leu Asp Ala Ile Pro2990 2995 3000Ala Glu Arg Pro Leu Thr Gly Val Val His Thr Ala Gly Val Leu3005 3010 3015
Ala Asp Gly Thr Val Glu Ser Leu Thr Pro Asp Gln Ala Asp Thr3020 3025 3030Val Leu Arg Ala Lys Ala Asp Ala Ala Trp His Leu His Glu Leu3035 3040 3045Thr Ala Leu Thr Pro Val Arg Glu Phe Val Leu Phe Ser Ser Ala3050 3055 3060Ala Gly Leu Leu Gly Ser Gln Gly Gln Gly Asn Tyr Ala Ala Ala30653070 3075Asn Ala Phe Leu Asp Ala Leu Ala Ala His Arg Arg Ala Ala Gly3080 3085 3090Leu Ala Gly Thr Ser Leu Ala Trp Gly Trp Trp Asp Leu Pro Gly3095 3100 3105Gly Met Ala Ala Asp Leu Gly Arg Ala Glu Arg Ala Arg Met Ala3110 3115 3120Arg Gly Gly Leu Thr Pro Phe Thr Ala Glu Thr Gly Met Asp Ala3125 3130 3135Phe Asp Gln Thr Leu Ala Ala Gly Thr Glu Pro Leu Leu Val Pro3140 3145 3150Met Arg Met Asn Thr Ala Val Ala Arg Ala Ser Ala Gly Gln Gln3155 3160 3165Ile Pro Ser Val Leu Arg Gly Leu Val Arg Ala Pro Arg Arg Arg3170 3175 3180Ala Val Arg Ser Asp Glu Gly Ser Ala Ser Arg Leu Arg Glu Arg3185 3190 3195Leu Ala Gly Ala Asn Ala Asp Glu Arg Leu Ala Met Leu Thr Glu3200 3205 3210Leu Val Arg Val Glu Ala Ala Gln Val Leu Gly His Ser Gly Ala3215 3220 3225Glu Ala Val Glu Asp Gly Ser Ser Phe Ala Glu Leu Gly Phe Asp3230 3235 3240Ser Leu Thr Ser Val Glu Leu Arg Asn Arg Ile Gly Glu Arg Thr3245 3250 3255Gly Leu Arg Leu Ala Ser Thr Val Val Phe Asp His Pro Thr Pro3260 3265 3270Ala Ala Leu Ala Ala Glu Leu Gly Asp Arg Leu Gly Asp Thr Ala3275 3280 3285Asp Phe Val Ser Ala Ala Gln Pro Ser Glu Ala Pro Gly Ala Gly3290 3295 3300Gly Ser Gly Val Glu Thr Thr Ala Asp Thr Ala Val Ile Asn Gly3305 3310 3315Val Glu Ala Leu Tyr Arg Arg Ser Ile Glu Leu Gly Arg Leu Asp3320 3325 3330Leu Gly His Ser Val Leu Lys Asn Ser Val Asp Leu Arg Ala Ser3335 3340 3345Phe Ser Val Pro Asp Glu Val Arg Asn Gly Pro Glu Leu Val Arg3350 3355 3360
Leu Val Glu Gly Ala Gln His Pro Lys Ile Ile Cys Phe Pro Ser3365 3370 3375Gln Ser Val Trp Ala Ser Asn Gln Glu Leu Val Gly Met Ala Val3380 3385 3390Pro Leu Arg Gly Val Arg Asp Leu Trp Ser Leu Met Leu Pro Gly3395 3400 3405Phe Val Thr Gly Gln Pro Val Ala Ala Asp Val Asp Ala Ala Ala3410 3415 3420Glu Tyr Ala Val Arg Leu Ile Glu Glu Leu Val Gln Asp Glu Pro3425 3430 3435Phe Val Leu Ala Gly Arg Ser Ser Gly Gly Arg Ile Ala His Glu3440 34453450Val Ala Val Arg Leu Glu Gly Arg Gly Arg Ala Pro Lys Gly Leu3455 3460 3465Val Leu Ile Asn Ser Tyr Met Ala Gly Tyr Glu Ala Thr Ser Tyr3470 3475 3480Ile Thr Pro Val Met Glu Ser Lys Ala Leu Glu Leu Glu Lys Asp3485 3490 3495Phe Gly Gln Met Thr Gly Thr Arg Leu Thr Ala Met Ala Ala Tyr3500 3505 3510Phe Ala Met Phe Glu Ala Trp Gln Pro Glu Glu Thr Ser Val Pro3515 3520 3525Thr Leu Leu Val Arg Ala Ser Glu Arg Tyr Gly Ile Glu Pro Gly3530 3535 3540Gln Glu Gln Pro Pro Ala Glu Glu Trp Gln Ser Ala Trp Pro Leu3545 3550 3555Pro His Asp Ala Ile Asp Val Pro Gly Asn His Tyr Ser Met Ile3560 3565 3570Glu Gly Ser Gly Asp Val Thr Ala Ala Ala Val His Arg Trp Leu3575 3580 3585Val Glu Arg Asp Ala3590<210>9<211>405<212>PRT<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)<400>1Met Ala Glu Ala Pro Ser Glu Pro Ile Pro Phe Pro Phe Pro Asp Pro1 5 10 15Pro Ser Val Cys Glu Leu Pro Pro Glu Leu Ala Glu Val Arg Asp Gly20 25 30Glu Ser Val Val Glu Val Lys Phe Pro Asp Gly Ile Thr Gly Trp Met35 40 45Val Thr Lys His Ala Asp Val Arg Lys Val Leu Leu Asp Pro Arg Phe50 55 60Ser Ser Arg Val Ile Ala Thr Ala Ala Ala Ala Met Ser Glu Thr Glu65 70 75 80Thr Gly Lys Leu Met Asn Glu Ser Leu Val Gly Met Asp Pro Pro Glu
85 90 95His Thr Arg Leu Arg Lys Leu Val Ser Lys Ala Phe Thr Ala Arg Arg100 105 110Val Glu Gln Leu Arg Pro Arg Ile Val Glu Leu Val Val Glu Leu Leu115 120 125Asp Glu Leu Gln Thr Leu Pro Arg Pro Val Asp Leu Val Lys Asn Phe130 135 140Ala Val Pro Leu Pro Val Arg Val Val Cys Glu Leu Leu Gly Val Pro145 150 155 160Ala Gly Asp Gln Asp Thr Phe His Ala Trp Ser Asn Ala Leu Leu Gly165 170 175Asp Trp His Gln Val Ala Glu Lys Glu Ala Ala Thr Val Ala Leu Val180 185 190Asn Tyr Phe Gly Asp Leu Ile Ala Val Lys Arg Gln Lys Pro Ala Asp195 200 205Asp Met Ile Ser Glu Leu Ile Ala Val Ser Glu Glu Glu Asp Ser Thr210 215 220Leu Thr Glu Arg Glu Ile Ile Thr Leu Ser Ile Gly Ile Leu Ser Ala225 230 235 240Gly His Glu Thr Thr Ala Asn Leu Ile Ser Met Phe Leu Leu Thr Leu245 250 255Leu His His Pro Glu Glu Phe Asp Lys Leu Arg Ala Asn Pro Glu Ala260 265 270Leu Pro Lys Ala Ile Asp Glu Leu Leu Arg Phe Val Pro Leu Thr Ala275 280 285Thr Gly Gly Ile Thr Pro Arg Leu Thr Thr Ala Glu Val Glu Leu Ser290 295 300Asn Gly Lys Val Leu Pro Ala Gly Val Val Val Leu Pro Ala Val Ala305 310 315 320Thr Ala Asn Arg Asp Pro Asp Val Phe Glu Asp Gly Asp Arg Leu Asp325 330 335Leu Ala Arg Glu Gln Asn Pro His Leu Ala Phe Ser Thr Gly Ile His340 345 350Tyr Cys Leu Gly Ala Gln Leu Ala Arg Ile Glu Leu Gln Glu Ala Phe355 360 365Arg Ala Ile Met Glu Arg Met Pro Glu Val Arg Leu Ala Val Pro Glu370 375 380Ser Glu Leu Arg Leu Lys Pro Ala Ser Ile Leu Arg Gly Leu Glu Ser385 390 395 400Leu Pro Ile Thr Trp405<210>10<211>167<212>PRT<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)<400>1Met Gly Val Phe Glu Gln Glu Ala Ala Glu Ser Thr Gly Glu Lys Phe1 5 10 15
Val Arg Pro Ala Ala Pro Glu Arg Met Arg Asp Leu Asp Phe Leu Leu20 25 30Gly Asp Phe Arg Val Glu Trp Thr Asn Phe Thr Ala Asp Pro Pro Val35 40 45Lys Gly Thr Ala Ala Trp Asn Thr Val Ser Thr Phe Ala Gly His Ala50 55 60Tyr Glu Met Thr Gln Leu Val Pro Lys Asp Asp Leu Thr Gly Arg Phe65 70 75 80Val Ile Gln Trp Val Glu Ser Glu Ser Ser Phe Ser Gly Tyr Tyr Tyr85 90 95Asp Asp Trp Gly Asn Arg Thr Leu Leu Thr Ala Lys Gly Trp Gln Asp100 105 110Gly Tyr Leu Ser Phe Thr Gly Glu Cys Ile Gly Phe Gly Arg Trp Phe115 120 125Leu Leu Lys Glu Arg Tyr Gln Val Ile Asp Glu Asn His Tyr Leu Lys130 135 140Cys Gly Phe Ile Arg Phe Glu Ala Asp Gly Glu Trp Val Pro Ala Asp145 150 155 160Glu Val His Cys Tyr Arg Val165<210>11<211>317<212>PRT<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)<400>1Met Thr Ser Thr Asp Asp Ile Leu Gly Lys Gly Thr Thr Ile Ile Ser1 5 10 15Arg Arg Ser Thr Ala Ala Arg Glu His Gly Gly Glu Arg Leu Pro Thr20 25 30Arg Leu Pro Thr Pro Ser His Thr Thr Ser Ser Arg Ala Asp Gly Phe35 40 45Ser Ala Gly Ala Thr Leu Leu Thr Trp His Arg Arg Leu Val Arg Ala50 55 60Arg Glu Pro Asp Leu Gly Val Arg Gln Val Pro Gly Arg Ala Ala Thr65 70 75 80Ala Trp Pro Ser Gly Cys Arg Arg Thr Ile Arg Arg Ala Leu Arg Arg85 90 95Ser Gly Leu Pro Pro Ala Pro Gln Arg Ala Ser Gln Gln Thr Trp Arg100 105 110Ser Phe Leu Arg Ser Gln Ala His Thr Leu Leu Ala Cys Asp Phe Met115 120 125Arg Val Glu Thr Val Phe Leu Lys Arg Leu Tyr Val Phe Phe Val Met130 135 140Glu Ile Lys Thr Arg Arg Val His Val Leu Gly Val Thr Val Arg Pro145 150 155 160Thr Gly Ala Trp Val Thr Gln Phe Ala Arg Asn Leu Leu Lys Asp Leu165 170 175
Glu Glu Arg Ala Gly Cys Phe Arg Phe Leu Ile Arg Asp Arg Asp Ser180 185 190Lys Phe Thr Ala Ala Phe Asp Ala Val Phe Ala Asp Asn Gly Thr Ala195 200 205Val Ile Pro Thr Pro Pro Gln Ser Pro Arg Ser Asn Ala Phe Ala Glu210 215 220Arg Trp Ile Arg Thr Ala Arg Ala Glu Cys Thr Asp Arg Ile Leu Ile225 230 235 240Thr Gly Glu Arg His Leu Arg Ala Val Leu Thr Thr Tyr Ala Glu His245 250 255Tyr Asn Thr Gly Arg Ala His Arg Ser Leu Asp Leu Arg Ala Pro Asp260 265 270Asp Arg Pro Ser Val Ile Pro Leu Pro Ala Ala Val Val Arg Arg Arg275 280 285Arg Leu Leu Gly Gly Leu Leu Asn Glu Tyr His Thr Thr Pro Pro Gln290 295 300Arg Leu Leu His Pro Gln Glu Thr Pro Ser Ser Ala Ala305 310 315<210>12<211>593<212>PRT<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)<400>1Val Leu Ile Val Ala Ala Gly Trp Ser Gly Gly Arg Ser Phe Ser Phe1 5 10 15Pro Val Thr Glu Trp Glu Gly Leu Val Pro Met Glu Pro Arg Ser Trp20 25 30Pro Glu Pro Ala Pro Glu Val Ala Arg Ala Val Arg Ala Lys Tyr Ser35 40 45Gly Arg Gln Val Pro Leu Pro Val Val Val Arg Asp Arg Leu Gly Glu50 55 60Leu Phe Ala Asp Ala Glu Phe Ala Glu Ala Phe Ala Val Thr Gly Pro65 70 75 80Arg Gly Trp Ser Pro Gly Arg Leu Ala Leu Val Thr Val Leu Gln Met85 90 95Ala Glu Asn Leu Thr Asp Arg Gln Ala Ala Glu Ala Val Arg Asp Lys100 105 110Leu Ser Trp Ser Tyr Ala Leu Gly Leu Gly Leu Glu Asp Pro Gly Phe115 120 125Asp Phe Ser Val Leu Ser Gln Phe Arg Ser Arg Val Ala Ala His Gly130 135 140Leu Glu Glu Lys Val Leu Asp Leu Leu Val Ala Arg Leu Thr Glu Gln145 150 155 160Gly Leu Leu Ala Ala Gly Gly Lys Gln Arg Thr Asp Ser Thr His Val165 170 175Val Ala Ala Val Arg Asp Leu Asn Arg Leu Glu Leu Ala Gly Glu Ala180 185 190Val Arg Ala Ala Leu Glu Ala Leu Thr Cys Ala Gly Pro Asp Trp Val
195 200 205Ala Gln Ala Val Asp Val Ala Ser Trp Ser Arg Arg Tyr Gly Pro Arg210 215 220Val Asp Ser Trp Arg Leu Pro Thr Ser Arg Ala Arg Gln Gln Lys Leu225 230 235 240Ala Val Asp Phe Ala Arg Asp Gly Phe Ala Leu Leu Gly Ala Val Tyr245 250 255His Ser Ser Ser Pro Val Trp Leu Arg Glu Leu Pro Ala Val Gln Val260 265 270Leu Trp Cys Val Leu Val Gln Asn Tyr Thr Arg Thr Ile Thr Arg Gly275 280 285Gly Arg Glu Val Val Lys Arg Arg Glu Lys Thr Asp Glu Gly Gly Asp290 295 300Gly Arg Pro Pro Gly His Leu Arg Leu Ser Ser Pro Tyr Asp Thr Asp305 310 315Ala Arg Trp Ser Ala Lys Arg Asp Met Phe Trp Asn Gly Tyr Lys Leu325 330 335His Ile Ser Glu Thr Cys Thr Ser Ala Pro Glu Lys Ala Arg Thr His340 345 350Pro Asn Leu Ile Thr Asn Ile Ala Thr Thr His Ser Thr Val Pro Asp355 360 365Ser Lys Thr Leu Asn Ala Ile His His Ala Leu Gln Gln Arg Gly Leu370 375 380Leu Pro Asp Glu His Tyr Pro Asp Ser Gly Tyr Ala Thr Ala Glu Leu385 390 395 400Ile His Gly Ser Val Lys Thr Tyr Gly Ile Ala Leu Ile Thr Pro Val405 410 415Leu Leu Asp Thr Ser Arg Gln Ala Lys Ala Gln Ala Gly Phe Ala Ala420 425 430Thr Asp Phe Thr Ile Asp Arg Glu Ala Gly Lys Ala Thr Cys Pro Ala435 440 445Gly His Thr Ser Ala Thr Trp Asn Pro Val Val Ser Glu Gly Ile Pro450 455 460Lys Thr Val Val Ser Phe Ala Ala Leu Asp Cys Ile Pro Cys Pro Phe465 470 475 480Lys Pro Gln Cys Thr Thr Ala Lys Lys Asn Arg Arg Gln Leu Ser Leu485 490 495His Leu Arg Gln Met Thr Glu Ala Leu Arg His Thr Arg Thr Gln Gln500 505 510Lys Thr Lys Asp Trp Asn Thr Asp Tyr Ala Leu Arg Ser Gly Ile Glu515 520 525Gly Thr Ile Arg Gln Ala Thr Ala Val Thr Gly Thr Arg Arg Ala Arg530 535 540Tyr Arg Gly Leu Ala Lys Thr His Leu Glu His Ile Tyr Ser Ala Val545 550 555 560Ala Leu Asn Leu Ile Arg Leu Asn Ala Trp Trp Asn Asp Arg Pro Leu565 570 575
Asp Arg Thr Arg Thr Ser His Leu Thr Arg Leu Glu His Thr Leu Thr580 585 590Ala<210>13<211>940<212>PRT<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)<400>1Met Lys Leu Ser Glu Pro Ser Tyr Tyr Pro Glu Ile Val Glu Arg Ser1 5 10 15Glu Glu Ile Ser Leu Leu Ala Gln Asp Leu Ala Asn Thr Lys Arg Gly20 25 30Glu Gly Ala Val Val Val Ile His Ser Gly Pro Gly Val Gly Arg Thr35 40 45Ala Leu Leu Asp Glu Phe Leu Arg Gln Ser Gly Asn Ser Gly Ala Arg50 55 60Val Cys Ala Ala Thr Gly Ser Ala Ala Glu Thr Gly Asn Glu Leu Gly65 70 75 80Val Val Thr Gln Leu Phe Pro Glu Asp Gly Pro Ile Ala Ala Ala Val85 90 95Trp Leu Ala Arg Ala Leu Asp Asp His His Gly Asp Pro Ser Pro Asp100 105 110Ala Asp Arg Leu Phe Asp Met Leu Arg Gly Glu Phe Arg Gln Gly Pro115 120 125Leu Val Leu Ala Val Asp Asp Val Gln Leu Ala Asp Ala Ala Ser Leu130 135 140Arg Phe Leu Leu His Leu Ile Arg Arg Leu Arg Thr Thr Pro Val Leu145 150 155 160Ile Val Leu Thr Glu Pro Val Gly Ser Cys Ala Leu Pro Leu Ala Phe165 170 175Gln Ala Glu Leu Leu Arg His Pro Arg Cys Arg Arg Leu Arg Leu Gln180 185 190Pro Leu Ser Val Asp Gly Val Thr Arg Met Ile Glu Pro Tyr Val Ala195 200 205Glu Thr Glu Val Ala Arg Leu Ala Thr Gln Phe His Ala Val Ser Gly210 215 220Gly Asn Pro Val Leu Val Arg Gly Leu Leu Ala Asp His Arg Ala Gly225 230 235 240Gln Arg Leu Glu Glu Gln Gly Ile Gly Ala Gln Tyr Asn Gly Tyr Pro245 250 255Ala Phe Thr Gln Ala Ala Leu Val Ser Ala Tyr Arg Asp Asp Pro Val260 265 270Leu Phe Glu Val Val Cys Gly Ile Ala Val Leu Gly Glu Asn Ala Ser275 280 285Pro Ala Leu Val Ala Cys Leu Val Asp Arg Gly Ala Asp Val Val Ala290 295 300Arg Val Met Thr Ala Leu Asn Thr Ala Ser Leu Leu Asn Gly Pro Ala
305 310 315 320Phe Arg Ser Pro Leu Val Ala Lys Ala Leu Leu Glu Leu Leu Asp Val325 330 335Glu Thr Arg Gly Glu Leu His Arg Arg Ala Ala Glu Leu Leu His Ala340 345 350Asp Ala Ala Leu Pro Ala Asp Val Ala His His Leu Leu Ala Thr Pro355 360 365Ile Ala Glu Ser Trp Val Leu Pro Thr Leu Leu Ala Ala Ala Glu Gln370 375 380Ala Val Gln Gly Gly Gly Gln Asp Phe Arg Leu Asp Cys Leu Arg Leu385 390 395 400Ala Gly Arg Gln Ala Ala Thr Glu Glu Glu Arg Ala Ala Val Val Ala405 410 415Ala Arg Val Arg Ile Gly Trp Glu Ile Asp Pro Arg Leu Ile Thr Pro420 425 430Trp Leu Gly Glu Leu Gly Ala Ala Leu Arg Arg Gly His Val Gly Ser435 440 445Glu Glu Ala Ala Asp Ile Leu Ser Ala Leu Met Glu Arg Thr Glu Glu465 470 475 480Asn Ser Asp Ala His Ala Glu Leu Glu Ile Val Arg His Trp Val Arg485 490 495Tyr Thr Cys Pro Thr Leu Leu Glu Gly Ser Val Asp Ala Asp Ala Pro500 505 510Ser Leu Ser Gly Pro Phe Pro Gln Arg Phe Gln Leu Arg Pro Ala Ser515 520 525Tyr Ala Val Glu Met Leu Gly Arg Leu Phe Thr Glu Gly Pro Cys Asp530 535 540Gln Ala Ala Ala Met Ala Glu Glu Ile Leu Arg Gly Cys Arg Phe Gly545 550 555 560Glu Thr Thr Val Glu Ala Val Glu Gly Ala Leu Leu Val Leu Val Tyr565 570 575Ala Glu Arg Pro Gly Arg Ala Leu His Trp Cys Glu Ala Leu Leu Glu580 585 590Gln Ala Gly Asp His Pro Thr Gly Thr Ala Ala Ala Ile Leu Ser Ser595 600 605Ile Arg Ala Glu Ile Ala Leu Arg Gln Gly Ala Leu Glu Glu Ala Glu610 615 620Thr Tyr Ala Asp Arg Ala Leu Asn Ala Ile Ser Arg Leu Gly Trp Gly625 630 635 640Val Ala Ile Gly Ser Pro Leu Ala Val Arg Val Arg Ala Ala Met Ala645 650 655Ala Gly Arg Thr Gly Leu Ala Gly Ala Trp Leu Asn Gln Asp Val Pro660 665 670Gln Gly Met Phe Arg Thr Arg His Gly Leu Leu Tyr Met His Ala Arg675 680 685
Gly His Tyr His Leu Ala Thr Asp Arg Pro Thr Val Ala Leu Glu Asp690 695 700Phe Leu Thr Cys Gly Arg Leu Ala Lys Glu Trp Gly Met Asp Val Pro705 710 715 720Thr Phe Leu Pro Trp Arg Thr Ser Ala Ala Leu Ala His Leu Ala Leu725 730 735Gly Asn Gly Ser Arg Ala Ser Ala Leu Ala Arg Glu Gln Leu Thr Arg740 745 750Pro Gly Gly Gly Trp Pro Arg Cys Arg Ala Val Ser Leu Arg Val Leu755 760 765Ala Ala Thr Ser Glu Leu Asp Arg Arg Pro Ala Leu Leu Arg Glu Ser770 775 780Val Asn Leu Leu Glu Ser Cys Gly Asp His Val Glu Leu Leu His Ser785 790 795 800Leu Ala Asp Gln Phe Gln Ala Leu Ser Glu Ala Gly Ala Pro Ala Lys805 810 815Ala Arg Ile Ala Ala Arg His Ala Arg Thr Val Ala Asp Asn Cys Gly820 825 830Thr Glu Thr Leu Phe Arg Arg Leu Phe Lys Glu Glu Val Pro Glu Asp835 840 845Thr Asp Glu Ser Ala Asp Phe Gly Gln Asp His Gln Gly Phe Ala Ser850 855 860Leu Thr Asp Ala Glu Arg Arg Val Thr Ala Leu Ala Ala Leu Gly Tyr865 870 875 880Ser Asn Arg Glu Ile Gly Arg Lys Leu Phe Ile Thr Lys Ser Thr Val885 890 895Glu Gln His Leu Thr Arg Val Tyr Arg Lys Leu Gly Val Arg Asn Arg900 905 910Ala Asp Leu Gly Asp Leu Leu Ala Gly Ile Asn Leu Ala Ala Gln Pro915 920 925Gln Val Met Gly Arg Thr Ser Ser Ala Ala Val Gly930 935 940<210>14<211>447<212>PRT<213>南昌链霉菌NS3226(Streptomyces nanchangensis n.sp.NS3226)<400>1Met Thr Leu Leu Ser Glu Ala Val Arg Ala Gly Ala Ser Pro Gln Glu1 5 10 15Leu Glu Arg Ala Glu Pro Pro Arg Glu Tyr Thr Ala Ala Tyr Ile His20 25 30Ser Glu Asp Thr Arg Met Phe Glu Gly Val Ala Asp Lys Asp Val Arg35 40 45Lys Ser Leu Arg Val Gly Arg Val Pro Met Pro Glu Leu Ala Pro Asp50 55 60Glu Val Leu Val Ala Val Met Ala Ser Ala Val Asn Tyr Asn Thr Val65 70 75 80
Trp Ser Ala Ile Phe Glu Pro Leu Pro Thr Phe Arg Phe Leu Arg Gln85 90 95Phe Ala Ala Gln Gly Gly Trp Ala Ser Arg His Asp Leu Pro Tyr His100 105 110Val Leu Gly Ser Asp Gly Ala Gly Val Val Val Arg Thr Gly Pro Gly115 120 125Val Arg His Trp Lys Thr Gly Asp His Val Val Val Ser Cys Val Gln130 135 140Ala Asp Asp Gln Glu Ala Ala Thr Gln Ala Asp Gly Met Leu Gly Ala145 150 155 160Glu Gln Arg Ile Trp Gly Phe Glu Thr Asn Phe Gly Gly Leu Ala His165 170 175Tyr Ala Val Val Arg Ala Ser Gln Leu Ile Pro Lys Pro Gly His Leu180 185 190Ser Trp Glu Glu Ala Ala Cys Asn Pro Leu Cys Gly Gly Thr Ala Tyr195 200 205Arg Met Leu Val Gly Asp Arg Gly Ala Arg Leu Lys Gln Gly Glu Ile210 215 220Val Leu Ile Trp Gly Ala Ala Gly Gly Leu Gly Ala Tyr Ala Val Gln225 230 235 240Leu Val Lys Asn Gly Gly Gly Ile Pro Val Gly Val Val Ser Ser Pro245 250 255Ala Lys Ala Glu Ala Ala Arg Arg Leu Gly Cys Asp Val Val Ile Asp260 265 270Arg Gln Glu Ile Gly Leu Asp Asp Arg Thr Ala Tyr Asp Pro Ala Ala275 280 285Val Ile Glu Thr Gly Lys Gln Leu Gly Arg Ile Ile Arg Arg Glu Val290 295 300Gly Glu Asp Pro His Ile Val Phe Glu His Val Gly Arg Ser Thr Phe305 310 315 320Pro Val Ser Val Phe Ala Val Arg Arg Gly Gly Thr Val Val Thr Cys325 330 335Gly Ser Ser Thr Gly Tyr Gln His Thr Tyr Asp Asn Arg Tyr Leu Trp340 345 350Met Lys Leu Lys Arg Ile Ile Gly Ser His Ala Ala Asn Leu Gln Glu355 360 365Gln Trp Glu Leu Asn Arg Leu Val Ser Arg Gly Gln Ile Val Pro Thr370 375 380Leu Ser Ala Val Tyr Pro Leu Ala Glu Val Ala Ala Ala Thr Arg Ser385 390 395 400Val Gln Thr Asn Arg His Ile Gly Lys Val Gly Val Leu Cys Leu Ala405 410 415Glu Ala Pro Gly Gln Gly Val Thr Asp Pro Ala Leu Arg Ala Arg Val420 425 430Gly Glu Glu Arg Leu Ser Leu Leu Arg Asp Leu Ser Pro Thr Ala435 440 445
Claims
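1. A nanligomycin biosynthetic gene cluster, characterized in that the entire nanligomycin biosynthetic gene cluster comprises 13 genes in total, specifically: (1) polyketide synthase genes, namely nlmA1, nlmA2, nlmA3, nlmA4, nlmA5, nlmA6 and nlmA7, 7 genes in total; (2) nanligomycin modification genes, namely nlmB and nlmOI, 2 genes in total; (3) nanligomycin transposase genes, namely nlmTI and nlmTII, 2 genes in total; (4) a nanligomycin regulatory gene, namely nlmRI; (5) a nanligomycin precursor synthesis gene, namely ccrA.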
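2. The nanligomycin biosynthetic gene cluster according to claim 1, characterized in that the polyketide synthase genes comprise the nucleotide sequences or complementary sequences, and the corresponding amino acid sequences, of the 7 type I polyketide synthase open reading frames, namely nlmA1, nlmA2, nlmA3, nlmA4, nlmA5, nlmA6 and nlmA7, which encode the enzymes required to catalyze biosynthesis of the polyketide aglycone of nanligomycin, a 26-membered macrolide antibiotic, in Streptomyces nanchangensis NS3226.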
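3. The nanligomycin biosynthetic gene cluster according to claim 2, characterized in that the 7 type I polyketide synthase open reading frames include the nucleotide sequences or complementary sequences, and the corresponding amino acid sequences, of their modules or domains, namely the ketosynthase domain, acyltransferase domain, ketoreductase domain, dehydratase domain, enoylreductase domain, acyl carrier protein domain and thioesterase domain.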
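4. The nanligomycin biosynthetic gene cluster according to claim 1, characterized in that the nanligomycin modification genes comprise the nucleotide sequences or complementary sequences, and the corresponding amino acid sequences, of the 2 open reading frames, namely nlmB and nlmOI, that encode proteins involved in oxidative modification of the nanligomycin polyketide chain.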
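5. The nanligomycin biosynthetic gene cluster according to claim 1, characterized in that the nanligomycin transposase genes comprise the nucleotide sequences or complementary sequences, and the corresponding amino acid sequences, of the 2 open reading frames, namely nlmTI and nlmTII, that encode proteins involved in transposition of the nanligomycin biosynthetic gene cluster.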
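6. The nanligomycin biosynthetic gene cluster according to claim 1, characterized in that the nanligomycin regulatory gene comprises the nucleotide sequence or complementary sequence, and the corresponding amino acid sequence, of the 1 open reading frame, namely nlmRI, that encodes a protein involved in regulating nanligomycin biosynthesis.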
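7. The nanligomycin biosynthetic gene cluster according to claim 1, characterized in that the nanligomycin precursor synthesis gene comprises the nucleotide sequence or complementary sequence, and the corresponding amino acid sequence, of the open reading frame, namely ccrA, that encodes a butyryl-CoA reductase.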
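Abstract
A nanligomycin biosynthetic gene cluster, belonging to the field of gene technology. The entire nanligomycin biosynthetic gene cluster comprises 13 genes in total, specifically: (1) polyketide synthase genes, namely nlmA1, nlmA2, nlmA3, nlmA4, nlmA5, nlmA6 and nlmA7, 7 genes in total; (2) nanligomycin modification genes, namely nlmB and nlmOI, 2 genes in total; (3) nanligomycin transposase genes, namely nlmTI and nlmTII, 2 genes in total; (4) a nanligomycin regulatory gene, namely nlmRI; (5) a nanligomycin precursor synthesis gene, namely ccrA. The genes provided by the invention, together with their proteins and antibodies, can also be used to discover and develop compounds or proteins for use in medicine, industry and agriculture.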
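Document number: C12N15/52 GK1523034 SQ0315092
Publication date: August 25, 2004. Filing date: September 11, 2003. Priority date: September 11, 2003.
Inventors: Deng Zixin (邓子新), Sun Yuhui (孙宇晖), Zhou Xiufen (周秀芬), Tu Guoquan (涂国全). Applicant: Shanghai Jiao Tong University (上海交通大学)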