专利名称:流感A/Udorn/72(H3N2)基因组的核苷酸序列的制作方法
技术领域:
本发明涉及流感病毒菌株A/Udorn/72(H3N2)各节段的核苷酸序列。
背景技术:
流感病毒由命名为A、B和C的亚型构成。流感A病毒具有8个节段的单负链RNA基因组,其编码病毒生命周期所需的10种多肽(蛋白质)。这些蛋白质的顺序如下节段 12345678蛋白质PB2 PB1 PA HA NP NA M1 NS1剪接产物 M2 NS2流感A亚型完整基因组的8个RNA节段各与核壳蛋白(NP)的多种亚基衣壳化,并与三聚聚合酶的一些分子(PB1、PB2和PA亚基)结合,从而形成核糖核蛋白复合物(RNP)(Lamb,R.A.,《流感病毒》(The Influenza Viruses),第1-87页,R.M.Krug编辑(Plenum Press,1989))。NP蛋白是结构和转录/复制调节蛋白。这些结构周围围绕着一层基质蛋白(M1),这种蛋白是病毒颗粒的主要结构成分而且似乎在病毒核心和病毒包膜间起融合膜作用。这种宿主细胞衍生的包膜布满两种主要的病毒编码的表面糖蛋白血凝素(HA)和神经氨酸酶(NA)(这两种表面糖蛋白是抗原性的决定簇)以及数量少很多的未糖基化的小蛋白M2(Lamb 1989;Lamb,R.A.等,Cell,40,627-633(1985))。M2蛋白是7#节段(还编码M1蛋白)的剪接基因产物,其具有离子通道活性;剪接基因仅在感染的细胞中存在。类似地,非结构性NS2蛋白是8#节段(还编码非结构性NS1蛋白)的剪接基因产物;剪接基因仅在感染的细胞中存在。蛋白酶切割HA糖蛋白形成HA1和HA2。
流感病毒感染是由表面血凝素附着于含唾液酸的细胞受体引发的。这种病毒-细胞的初次相互反应,通过受体介导的胞吞作用将病毒颗粒吸收到细胞内。在核内体的低pH条件下,HA经历了促进HA2的疏水NH2末端区域和核内体膜间相互作用的构象变化,导致膜融合且随后将核心RNP以及基质蛋白(M1)释放到细胞质中。在RNP被转运到发生完整基因组的转录和复制的核心之前,在细胞质中出现RNP和基质蛋白的解离(Martin,K.,和Helenius,A.,Cell,67,117-130(1991);Shapiro,G.I.,等,J.Virology,61,764-773(1987))。
在初次转录后,新合成的蛋白质引发病毒基因组的复制,从而增加转录和蛋白合成。在病毒生命周期的这一阶段,表面糖蛋白HA和NA开始在质膜的分散区域聚集,而新装配的病毒则将从质膜释放。假设病毒的装配是通过四种病毒编码的结构蛋白的细胞质和/或跨膜区域间的某些相互作用起始的,这四种蛋白是膜锚着点蛋白(HA、NA和M2)和下面的基质蛋白(M1),基质蛋白维持与RNP的紧密连接(Garoff,H.,等,“微生物学和分子生物学综述”,62,1171-1190(1998);Nayak,D.P.,ASM News,62,411-414(1996))。至今还未很好地定义基质蛋白M1和RNP复合物间的接触,以及8种RNP整套的掺入成熟病毒颗粒的机制。假定结构成分间特定的分子接触支配形态发生过程如何启动以及成熟病毒颗粒装配的过程和从宿主细胞的表面出芽。
在病毒基因组复制的过程中,有大量流感A病毒亚型和抗原性变体通过RNA聚合酶的引入突变连续地析出。当它们感染相同的细胞宿主时,两种不同亚型的节段间,通过基因重配也发生突变。流感A病毒的这种生物学特性产生了新的病毒菌株,从而人们就必须周期性地更新抵抗流感的免疫原性组合物。因此,对新颖和革新的抵抗流感的免疫原性组合物的开发而言,必须有流感A病毒某一特定菌株的完整核苷酸序列和全长克隆的有效性。所以,需要确定流感A病毒的完整核苷酸序列并构建含有该序列的全长克隆。
具体说,1972年首次分离了被命名为流感A/Udorn/72(H3N2)的流感A菌株。至今尚未测出聚合酶基因PB2、PB1和PA的核苷酸序列。已测序了编码HA、NP、M1、M2、NS1和NS2的节段的核苷酸序列。然而,多年前采用仪器得到的那些不能提供精确的序列。因此,需要对这些节段重复测序以提供最新的精确序列。
发明概述本发明的目的是确定流感A/Udorn/72(H3N2)病毒菌株的完整核苷酸序列和7#和8# mRNA节段的剪接产物。本发明的另一目的是制备8种基因的各个克隆和剪接信息。
本发明的另一目的是确定这些基因和剪接信息的所有10种产物的演绎的氨基酸序列。
通过阐明分离的构成流感A/Udorn/72(H3N2)菌株所有节段的核苷酸序列的核酸分子完成本发明的这些以及其它目的,这些核酸分子一起包含该菌株的完整核苷酸序列以及它们的剪接信息。所有这些核苷酸序列都是以正链、反基因组(antigenomic)信息链显示的。
具体说,由SEQ ID NO1、SEQ ID NO3、SEQ ID NO5、SEQ ID NO7、SEQID NO9、SEQ ID NO11、SEQ ID NO13和SEQ ID NO17及其生物学等效物构成完整的核苷酸序列。
或者,完整的核苷酸序列包含HA序列的变体,命名为HA(P1)(SEQ IDNO21),且所述的完整的核苷酸序列由SEQ ID NO1、SEQ ID NO3、SEQ ID NO5、SEQ ID NO9、SEQ ID NO11、SEQ ID NO13、SEQ ID NO17和SEQ ID NO21及其生物学等效物构成。
在本发明的另一实施例中,这些分离的核苷酸分子定向于编码各个蛋白质的单独分离的流感A病毒的核酸分子,且选自SEQ ID NO1、SEQ ID NO3、SEQ ID NO5、SEQ ID NO7、SEQ ID NO11和SEQ ID NO21(SEQ ID NO7的变体)及其生物学等效物。
在本发明的另一实施例中,这些单独分离的核酸分子编码具有选自SEQ IDNO2、SEQ ID NO4和SEQ ID NO6及其生物学等效物的氨基酸序列的蛋白质。
本发明的另一实施例提供了单独分离的流感A病毒的氨基酸序列,所述的序列选自SEQ ID NO2、SEQ ID NO4和SEQ ID NO6及其生物学等效物。
本发明的另一实施例提供了单独的流感A病毒氨基酸序列,所述的序列具有SEQ ID NO8或SEQ ID NO12的序列。
在本发明的另一实施例中,使用具有流感A/Udorn/72(H3N2)菌株节段的核苷酸序列的分离的核酸分子(1)设计聚合酶链式反应(PCR)的引物,用于PCR分析,以探测样品中相应的病毒节段的存在;或(2)设计和选择用于ELISA的肽,以检测样品中由该节段产生的相应的蛋白质的存在。
发明详述已出版了流感A/Udorn/72(H3N2)菌株一些节段的核苷酸序列及演绎的氨基酸序列。Yuferov等报道了第4节段的HA基因的序列(Yuferov,V.P.,等,Proceedings of the Academy of Sciences of the USSR,278,738-742(1984))。Buckler-White等报道了第5节段的NP基因的序列(Buckler-White,A.J.,和Murphy,B.R.,Virology,155,345-355(1986))。Markoff等报道了第6节段的NA基因的序列(Markoff,L.,和Lai,C.J.,Virology,119,288-297(1982))。Lamb等在1981年报道了第7节段的M1基因和M2基因剪接产物的序列(M1在Lamb,R.A.,和Lai,C.J.Virology,112,746-751(1981)(″Lamb Virology1981″);M2在Lamb,R.A.等Proc.Natl.Acad.Sci.USA,78,4170-4174(1981)(″Lamb PNAS 1981″))。Lamb等在1980年报道了第8节段的NS1基因和NS2基因剪接产物的序列(Lamb,R.A.,和Lai,C.J.,Cell,21,475-485(1980))。
为了对这些节段进行重测序并对尚未阐明过的那些节段进行测序,在Madin-Darby犬肾(MDCK)细胞中培育流感A/Udorn/72(H3N2)菌株并从中纯化。从纯化的病毒颗粒提取病毒RNA,并用末端特异性引物进行RT-PCR扩增。将凝胶纯化的基因克隆入pGemT载体(Promega),并用多套引物测序。通过3’和5’连接和交换接头的RT-PCR片段的测序,确定末端的序列(Galarza,J.M.等,J.Virol.,70,2360-2368(1996))。M2和NS2基因分别是第7和第8节段的mRNA的剪接产物。从用寡-dT和基因特异性引物进行的RT-PCR得到的MDCK-流感A/Udorn感染的细胞纯化的mRNA回收这些基因。RT-PCR产物是用凝胶纯化的,并克隆入pGemT载体。用荧光染料终止子和AmpliTaq DNA聚合酶(Perkin-Elmer)并用Applied Biosystems ABI 377 DNA测序仪确定所有基因序列。重复这种步骤用于各片段的多个克隆;得到的序列是一致的。
流感A/Udorn/72(H3N2)菌株的完整基因组总共13628个核苷酸。这是首次描述流感A/Udorn/72(H3N2)菌株的完整核苷酸序列。
在编码区域、5’和3’非编码区域(包括调节序列如启动子、增强子和聚腺苷酸化信号)测定8个节段的序列。以下为这些节段、它们的核苷酸数量、它们分离的核酸分子序列(以正链、反基因组信息链显示,即从5’到3’方向)、它们的编码区域和氨基酸翻译节段1#2341个核苷酸,SEQ ID NO1,编码区域核苷酸28-2304,编码PB2,759个氨基酸(SEQ ID NO2)。
节段2#2341个核苷酸,SEQ ID NO3,编码区域核苷酸25-2295,编码PB1,757个氨基酸(SEQ ID NO4)。
节段3#2233个核苷酸,SEQ ID NO5,编码区域核苷酸25-2172,编码PA,716个氨基酸(SEQ ID NO6)。
节段4#1765个核苷酸,SEQID NO7,编码区域核苷酸30-1727,编码HA,566个氨基酸(SEQ ID NO8)。
在一些方面,SEQ ID NO7和SEQ ID NO8与Yuferov等出版的HA的核苷酸和氨基酸序列不同。
Yuferov SEQ ID NO7或8密码子#氨基酸#密码子# 氨基酸#GACAspAAC Asn81-83 1881-83 18GGTGlyGGG Gly1101-1103358 1101-1103358TACTyrTTC Phe1485-1487486 1485-1487486GACAspAAC Asn1614-1616529 1614-1616529节段5#1565个核苷酸,SEQ ID NO9,编码区域核苷酸46-1539,编码NP,498个氨基酸(SEQ ID NO10)。
SEQ ID NO9和SEQ ID NO10与Buckler-White等出版的NP的核苷酸和氨基酸序列相同。
节段6#1466个核苷酸,SEQ ID NO11,编码区域核苷酸20-1426,编码NA,469个氨基酸(SEQ ID NO12)。
在一些方面,SEQ ID NO11和SEQ ID NO12与Markoff等出版的NA的核苷酸和氨基酸序列不同。
Markoff SEQ ID NO11或12密码子# 氨基酸# 密码子# 氨基酸#CAG Gln CTG Leu104-106 29104-106 29GGA Gly GCA Ala546-548177546-548177CAA Gln CAG Gln695-697226695-697226ACA Thr GCA Ala992-994325992-994325在氨基酸1中,甲硫氨酸的前导序列上游中也有一些不同。
节段7#1027个核苷酸,SEQ ID NO13,编码区域核苷酸26-781,编码M1,252个氨基酸(SEQ ID NO14)。
SEQ ID NO13和SEQ ID NO14与已出版的Lamb Virology 1981中的M1的核苷酸和氨基酸序列相同。
节段7#,剪接产物322个核苷酸,SEQ ID NO15,编码区域核苷酸26-316,编码M2,97个氨基酸(SEQ ID NO16)[第一剪接的氨基酸在残基10,核苷酸53-55]。
SEQ ID NO15和SEQ ID NO16与已出版的Lamb PNAS 1981中的M2的核苷酸和氨基酸序列相同。
节段8#890个核苷酸,SEQ ID NO17,编码区域核苷酸27-737,编码NS1,237个氨基酸(SEQ ID NO18)。
SEQ ID NO17和SEQ ID NO18与Lamb等1980出版的NS1的核苷酸和氨基酸序列相同。
节段8#,剪接产物402个核苷酸,SEQ ID NO19,编码区域核苷酸27-389,编码NS2,121个氨基酸(SEQ ID NO20)[第一剪接的氨基酸在残基11,核苷酸57-59]。
SEQ ID NO19和SEQ ID NO20与Lamb等1980出版的NS2的核苷酸和氨基酸序列相同。
如上所述,有另一种HA序列,命名为HA(P1)节段4#1764个核苷酸,SEQ ID NO21,编码区域核苷酸30-1727,编码HA,566个氨基酸(SEQ ID NO22)。
不希望被理论束缚,随着时间的推移,HA(P1)序列中可能结合来自流感A/Udorn/72(H3N2)菌株的少量突变。HA(P1)序列(SEQ ID NO21)具有1764个核苷酸,比HA序列(SEQ ID NO7)少一个核苷酸,因为在节段4#的非编码区域中有一个核苷酸缺失(1756位)。
本发明的两种HA序列的核苷酸和氨基酸序列有如下差异核苷酸位点 HAHA(P1) 氨基酸变化35 G A 沉默的81 A G Asn变为Asp1103 G T 沉默的1486 T A Phe变为Tyr1614 A G Asn变为Asp1756 T 缺失 在编码区域外部HA(P1)序列也与Yuferov等人的HA核苷酸序列有如下稍微的不同核苷酸位点 HA(P1) Yuferov 氨基酸变化35 A G 沉默的1756缺失 T 编码区域外部基因组中这些节段的mRNA的核酸序列、负链(即以3’到5’方向)是上述正链、反基因组信息链序列的互补。
除上述编码流感A/Udorn/72(H3N2)病毒蛋白的核苷酸序列外,本发明还包括含各个流感A病毒核苷酸序列的分离的核酸分子,由于遗传密码的丰余在生物学上相当于那些编码病毒蛋白的序列,即其它核苷酸序列的特征与本文所述的核苷酸序列不同,但其编码的蛋白质的氨基酸序列与在SEQ ID NO1、SEQ IDNO3、SEQ ID NO5、SEQ ID NO7、SEQ ID NO11和SEQ ID NO21中任一编码的氨基酸序列相同。
具体说,本发明预计那些足以核苷酸重复任一SEQ ID NO1、SEQ ID NO3、SEQ ID NO5、SEQ ID NO7、SEQ ID NO11和SEQ ID NO21的序列,从而可以在标准高度严格Southern杂交条件下杂交,如由Sambrook等所述的(Sambrook,J.,等,《分子克隆实验室手册》(Molecular CloningA LabortoryManul),第2版,Cold Spring Harbor Laboratory Press,Cold Spring Harbor,N.Y.(1989))。
本发明还包括编码与流感A/Udorn/72(H3N2)病毒蛋白的氨基酸序列不同的氨基酸序列的核苷酸序列,但其所编码的氨基酸序列在生物学上与这些病毒蛋白(SEQ ID NO2、SEQ ID NO4和SEQ ID NO6)中的一种等效。如果这些氨基酸序列与病毒蛋白序列的差异仅在于少量的缺失、蛋白质序列中的插入或替代,从而与病毒蛋白的序列相比这些序列的三级构型基本未改变,就将这种氨基酸序列称为与任一病毒蛋白是生物学上等效的。
例如,氨基酸丙氨酸(一种疏水氨基酸)的密码子可以被编码另一种疏水性较弱的残基(如甘氨酸)或疏水性更强的残基(如缬氨酸、亮氨酸或异亮氨酸)替代。类似地,将导致一种带负电荷的(酸性)残基替代另一种(如天冬氨酸替代谷氨酸),或一种带正电荷的(碱性)残基替代另一种(如赖氨酸替代精氨酸或组氨酸),以及基于亲水性指数中残基的相似性的替代,预计可产生生物学上等效的产物。预计将导致蛋白质分子的N-端或C-端部分变化的核苷酸替代也不会改变蛋白质的活性。
所建议的各种变化是本领域技术人员熟知的,且决定了所编码的产物的结构及生物活性的保留。
本发明还涉及分离的氨基酸序列,其包含选自SEQ ID NO2、SEQ ID NO4和SEQ ID NO6的各个流感A病毒氨基酸序列。
本发明还涉及单独分离的SEQ ID NO8和SEQ ID NO12的氨基酸序列。
流感A/Udorn/72(H3N2)病毒的基本生物学特性取决于其由8个节段构成的基因组。流感病毒的节段由负极性RNA的单链构成,各病毒链具有其自身的特定核苷酸序列。
因此,病毒菌株间完整的差异鉴定就必须确定病毒的核苷酸序列。由于已清楚确定了完整的特异性核苷酸序列是由13,628个碱基构成的(如本文所述),就可在基因水平上鉴定病毒,因此这种鉴定方法提供了绝对的确定。
本文所述的序列可用于提供鉴定方法,该方法包括采用聚合酶链式反应(PCR)方法检测流感病毒反基因组DNA的核苷酸序列的一部分,以及制备肽用于ELISA检测DNA产生的抗原。
具体一种流感病毒的完全测序以及克隆的基因组,对以该基因蓝图为基础的新颖和革新的免疫原性组合物的研究和开发是有利的。
将确定的突变引入上述序列的所需的一个或多个节段中。通过定向诱变引入一个或多个突变。然而,不能从病毒RNA直接产生突变体流感病毒,因为基因组病毒RNA或反基因组cDNA都不能作为合成的直接模板。相反,在用NP衣壳化病毒后,必须通过病毒RNA聚合酶复合物将病毒RNA转录到正义mRNA中。
Pales等人(美国专利No.5,166,057,本文将其纳入作为参考)描述了用于“拯救”流感A病毒节段的反向遗传学辅助病毒依赖性系统。简单地说,在三种聚合酶蛋白和NP存在下,通过体外合成制备核糖核蛋白(RNP)复合物。然后将该RNP复合物用于转染真核细胞。随后用流感A辅助病毒感染,产生具有衍生自克隆的cDNA节段的基因的病毒。然后用选择方法从大量的辅助病毒中分离出所需的转染子。
如果要拯救完整的突变体流感A菌株,则使用一种不同的系统。使用Neumann等(Neumann,G.,等,Proc.Natl.Acad.Sci.USA,96,9345-9350(1999),本文将其纳入作为参考)所述的系统,可以从克隆的cDNA制备修饰过的流感病毒菌株A/Udorn/72(H3N2)。这种以质粒为基础的系统不需要使用辅助病毒感染。简单地说,用八种质粒(各编码菌株的病毒RNA节段且侧翼于合适的RNA聚合酶启动子和终止子)以及另四种编码病毒NP、PB2、PB1和PA蛋白的质粒(用于在体内细胞内合成RNP)转染细胞系,如人胚肾细胞系293T。添加另五种表达病毒结构蛋白HA、NA、M1、M2和NS2的质粒产量显著增加。
在该系统的变更中,仅需要八种质粒(Hoffmann,E.,等,Proc.Natl.Acad.Sci.USA,97,6108-6113(2000),本文将其纳入作为参考)。各质粒包含两种启动子、人RNA聚合酶I(pol I)启动子和人RNA聚合酶II(pol II)启动子。在用八种表达质粒转染真核细胞后,人pol I和II启动子分别转录质粒模板。这导致了病毒mRNA和vRNA的合成,从而最终产生传染性的流感A病毒。
由于8、12或17质粒转染系统的任何一种都不需要辅助病毒,因此无需噬斑纯化就可回收转染子病毒。这种系统促进活的减毒流感A病毒的产生,用于抵抗包括新HA或NA亚型的流行病,其中使流感A/Udorn/72(H3N2)菌株的相应节段突变以匹配新亚型的序列。
采用定向诱变将预定的突变引入流感A/Udorn/72(H3N2)菌株中编码HA或NA的节段(其相应于新循环亚型的序列)。通过标准重组DNA方法将这种突变引入病毒基因组的DNA拷贝中。
如果需要,用不同流感病毒菌株的相应节段替代编码流感A/Udorn/72(H3N2)菌株(或其突变体)的HA或NA的节段,产生重排列病毒。
在本发明的另一实施例中,用分离的具有流感A/Udorn/72(H3N2)菌株节段的核苷酸序列的核酸分子,制备寡核苷酸探针(从正链反基因组信息或从负链互补基因组信息)且表达这些肽(仅从正链反基因组信息),可用于检测体液或组织样品中流感A菌株(或其突变体)的节段的存在。用核苷酸序列设计高度特异性和敏感性的诊断测试,以检测样品中病毒节段的存在。
以本文所述的病毒序列为基础,用这些序列合成PCR引物。将测试样品进行RNA逆转录,随后PCR扩增选定的相应于本文所述核苷酸序列(具有与该病毒菌株确定节段相同的核苷酸)的cDNA区域。在凝胶上鉴定扩增的PCR产物,并用特异性核苷酸探针通过杂交验证它们的特异性。
用ELISA测试检测病毒节段产生的蛋白质的存在。设计并选择出包含一个或多个不同的残基(以本文所述的病毒序列为基础)的肽。然后将这些肽偶联于半抗原(如匙孔 血蓝蛋白(KLH))并用于疫苗动物(如兔),以产生单特异性多克隆抗体。然后这些多克隆抗体的选择或多克隆和单克隆抗体的组合可用于“捕获ELISA”以检测该病毒节段产生的蛋白质。
本文将所引用的参考文献全部纳入作为参考。
为了让本发明更易理解,列出以下实施例。这些实施例仅用于说明,对本发明的范围无任何限制。
实施例实施例1流感A/Udorn/72(H3N2)菌株的测序如下进行8个节段的分别测序RT-PCR方法用Madin-Darby犬肾(MDCK)细胞培育流感A/Udorn/72(H3N2),并从澄清的培养物上清液中浓缩。用RNAeasy提取试剂盒(Qiagen)从100μl病毒原液制备基因组病毒RNA(vRNA)。用Oligotex mRNA试剂盒(Qiagen),从流感A/Udorn感染的MDCK细胞的总mRNA中分离出节段7和8的mRNA的剪接产物(分别为M2和NS2)。将约1μg vRNA或者mRNA添加到装有2pmol基因特异性引物和二乙基焦炭酸盐处理的水(以抑制RNA酶)的Eppendorf管中,至12μl终体积。用寡-dT引物逆转录M2和NS2剪接信息。在70℃加热RNA-引物混合物10分钟,然后在冰上迅速冷冻。在该管中添加如下试剂4μl 5X第一条链的缓冲液(250mMTris-HCl pH 8.3,375mM KCl,15mM MgCl2),2μl 0.1M DTT和1μl dNTP混合物(各为10mM dATP、dTTP、dGTP和dCTP),并在42℃培育2分钟。随后,加入1μl SuperScript II(Gibco)逆转录酶,并在42℃再培育50分钟。在70℃培育15分钟终止RT反应。将2μl上述RT添加到如下试剂的混合物中开始PCR反应50mM Tris-HCl(pH 8.0)、1.5mM MgCl2、50mM KCl、1μM各引物、0.2mM各dNTP和2单位Taq聚合酶。如下用Perkin-Elmer 9600 PCR仪扩增DNA1轮,94℃变性2分钟;30轮,94℃变性30秒;42℃退火30秒并在72℃延伸1分钟。然后在72℃进行一轮延伸若干分钟。在1%琼脂糖凝胶上分析PCR扩增的DNA,并用Qiagen纯化试剂盒纯化。
测序方法将约500ng质粒pGEM-T(含有相应的由RT-PCR产生的DNA节段)添加到O.2ml装有3.2pmol测序引物和8μl终止预备反应混合液(Perkin-Elmer)的试管中,并将该试管置于Perkin-Elmer 9600 PCR仪中。通过25轮96℃10秒,50℃5秒和60℃ 4分钟,扩增反应。然后在G50 Sepadex自旋柱上纯化产物,并将其冷冻干燥,然后重悬浮于3μl 25mM EDTA(pH8.0)和50mg/ml葡聚糖蓝色负载染料中。在ABI 377自动测序仪上测试样品,并用Sequencher(Gene CodesCorporation的序列分析程序)分析序列。
3’和5’端的核苷酸序列的确定在37℃,用10U烟草酸焦磷酸酶(Epicenter Technologies,Madison,Wis.)处理纯化的基因组vRNA(2.5μg)(如上)30分钟,以除去5’端的磷酸基团。将处理过的RNA进行苯酚-氯仿提取,并用乙醇沉淀。随后,在37℃用50U T4 RNA-连接酶(Pharmaci Biotech)连接RNA的5’和3’端1小时,用苯酚-氯仿提取,用乙醇沉淀。逆转录连接的RNA,并用引物的正反对(如下),用GeneAmp GoldRNA PCR试剂盒(PE Biosystems)在单次反应中进行PCR。如上所述,用各RNA节段的特异性引物进行测序反应(Galarza,J.M.,等,1996,J of Virol.702360-2368)。
用于流感A/Udron/72(H3N2)测序的引物以“F”标记正向引物,且为反基因组加有义。反向引物用“R”标记,且为基因组负有义。
PB2(节段1)引物引物F/R 引物序列 跨越的bp·RT引物1107 F5′-TATGGAAAGAATAAAAGAACTACGGAA-3′ 27-53(SEQ ID NO23)·PCR引物1108 R5′-TCGTTTTTAAACTATTCAACAT-3′ 2307-2328(SEQ ID NO24)1107 F5′-TATGGAAAGAATAAAAGAACTACGGAA-3′ 27-53(SEQ ID NO23)·测序引物1101 F5′-AGCAGGTCAATTATATTCAATATG-3′ 1-24(SEQ ID NO25)该引物序列与PB2基因序列有如下差异位点 PB2基因1101 F5A G6A G7A T8G C9C A11 G T12 G T13 T A14 C T16 A T18 T C20 T A21 A T22 T A24 C G1102 R 5′-AACAAGGTCGTTTTTAAACTATTC-3′ 2312-2335(SEQ ID NO26)1103 F 5′-AGAACTCTATTCCAACAAATG-3′ 1816-1836(SEQ ID NO27)1104 R 5′-AATCGGATATTTCATTGCCAT-3′ 178-198(SEQ ID NO28;该引物序列在核苷酸195为C,而PB2基因序列为T)1105 F 5′-AGAACTCTATTCCAACAAATGAGGGATGTAGTT-3′ 1816-1848(SEQ ID NO29;该引物序列在核苷酸1846为G,而PB2基因序列为C)1106 R 5′-AATCGGATATTTCATTGCCATCATCCATTTCAT-3′ 166-198(SEQ ID NO30;该引物序列在核苷酸195为C,而PB2基因序列为T)1107 F 5′-TATGGAAAGAATAAAAGAACTACGGAA-3′27-53(SEQ ID NO23)1108 R 5′-TCGTTTTTAAACTATTCAACAT-3′ 2307-2328(SEQ ID NO24)1109 F 5′-AGACGTGGTGTTGGTAATGAA-3′ 2214-2234(SEQ ID NO31)1110 F 5′-CGAAGAGTTGACATAAACCCT-3′ 454-474(SEQ ID NO32)1111 R5′-TCATCCCTCAICCCCTCACAT-3′ 1943-1963(SEQ ID NO33;该引物序列在核苷酸1955为C;而PB2基因序列为G)1112 R5′-ATTTTCTGTTATCCTCTTGTCA-3′204-224(SEQ ID NO34;与实际PB2基因序列相比,该引物在核苷酸220后还具有一附加的核苷酸(T)1113 F5′-GTGGAGTCCGCTGTTTTGAG-3′ 2083-2102(SEQ ID NO35)1114 F5′-AGGATGGTGGACATTCTTAGG-3′ 904-924(SEQ ID NO36)1115 R5′-CCGATCAATGCTAACCACTAC-3′ 1507-1527(SEQ ID NO37)1116 F5′-AGCAAAAGCAGGTCAATTATATTCA-3′ 1-25(SEQ ID NO38)1117 R5′-AGTAGAAACAAGGTCGTTTTTAAAC-3′ 2317-2341(SEQ ID NO39)·用于确定末端序列的引物1104 R5′-AATCGGATATTTCATTGCCAT-3′ 178-198(SEQ ID NO28)1109 F5′-AGACGTGGTGTTGGTAATGAA-3′ 2214-2234(SEQ ID NO31)1112 R5′-ATTTTCTGTTATCCTCTTGTCA-3′204-224(SEQ ID NO34)1113 F5′-GTGGAGTCCGCTGTTTTGAG-3′ 2083-2102(SEQ ID NO35)PB1(节段2)引物引物F/R引物序列 跨越的bp·RT引物1215 F 5′-AGCAAAAGCAGGCAAACCAT-3′ 1-20(SEQ ID NO40;该引物序列在核苷酸4为A,而PB1基因序列为G)·PCR引物1216 R 5′-AGTAGAAACAAGGCATTTTTT-3′ 2321-2341(SEQ ID NO41)1215 F 5′-AGCAAAAGCAGGCAAACCAT-3′ 1-20(SEQ ID NO40)·测序引物1201 F 5′-TCGAGCTGAAGAAGCTATGG-3′ 1745-1764(SEQ ID NO42;该引物序列在核苷酸1752为G,而PB1基因序列为A,且该引物序列在核苷酸1761为A,而PB1基因序列为G)1202 R 5′-GTTCTGTTGACTGTGTCCAT-3′ 142-161(SEQ ID NO43)1203 F5′-TCGAGCTGAAGAAGCTATGGGAGCAGACCCGT-3′ 1745-1776(SEQ ID NO44)该引物序列与PB1基因序列有如下差异位点PB2基因1203 F1752 A G1761 G A1770 A G1776 C T1204 R 5′-GTTCTGTTGACTGTGTCCATGGTGTATCC-3′ 133-161(SEQ ID NO45)1205 F 5′-AGCGAAAGCAGGCAAACCATTTGAATGGATGTCAA-3′ 1-35(SEQ ID NO46)1206 R 5′-ATTAAAAACAAGGCATTTTTTCATGAAGGAC-3′2341-2311(SEQ ID NO47;该引物序列在核苷酸2337为A,而PB1基因序列为G,且该引物序列在核苷酸2340为T,而PB1基因序列为G)1207 F 5′-AATTTCCAGCATGGTGGAGGCCATGGTG-3′ 2154-2181(SEQ ID NO48)1208 F 5′-GGGATCTTTGAAAACTCGTG-3′ 325-344
(SEQ ID NO49)1209 R 5′-CAGCATTGTTTACAGACTC-3′ 1936-1954(SEQ ID NO50)1210 F 5′-ACCAAAGATGCAGAAAGAGG-3′706-725(SEQ ID NO51)1211 R 5′-CTCATATTGATTCCGACTAA-3′1438-1457(SEQ ID NO52)1212 R 5′-TGTCCACTTC CCTTTTTCTG A-3′ 172-192(SEQ ID NO53)1213 F 5′-ATTTTTCCCC AGTAGTTCAT AC-3′2118-2139(SEQ ID NO54)1214 F 5′-ACGCTGTTGC AACTACACAC TCCTG-3′ 1997-2021(SEQ ID NO55)1215 F 5′-AGCAAAAGCAGGCAAACCAT-3′1-20(SEQ ID NO40)1216 R 5′-AGTAGAAACAAGGCATTTTTT-3′ 2321-2341(SEQ ID NO41)·用于确定末端序列的引物1204 R 5′-GTTCTGTTGACTGTGTCCATGGTGTATCC-3′ 133-161(SEQ ID NO45)1212 R 5′-TGTCCACTTC CCTTTTTCTG A-3′ 172-192(SEQ ID NO53)1213 F 5′-ATTTTTCCCC AGTAGTTCAT AC-3′2118-2139(SEQ ID NO54)1214 F 5′-ACGCTGTTGC AACTACACAC TCCTG-3′ 1997-2021(SEQ ID NO55)PA(节段3)引物引物F/R 引物序列 跨越的bp·RT引物1315 F 5′-AGCAAAAGCA GGTACTGATT CGAGA-3′ 1-25(SEQ ID NO56)·PCR引物1315 F5′-AGCAAAAGCA GGTACTGATT CGAGA-3′1-25(SEQ ID NO56)1316 R5′-AGTAGAAACA AGGTACTTTT TTGGA-3′ 2209-2233(SEQ ID NO57)·测序引物1301 F5′-GAACCTGGAA CCTTTGATCT t-3′ 2053-2073(SEQ ID NO58)1302 R5′-ATTCACCACT GTCCAGGCCA T-3′ 280-300(SEQ ID NO59)该引物序列与PA基因序列有如下差异位点 PA基因 1302 F294TC297TC300GA1303 F5′-GAACCTGGAA CCTTTGATCT TGAGGGGCTA-3′ 2053-2082(SEQ ID NO60;该引物序列在核苷酸2075为A,而PA基因序列为G)1304 R5′-ATTCACCACT GTCCAGGCCA TTGACGCGTC-3′ 271-300(SEQ ID NO61)该引物序列与PA基因序列有如下差异位点 PA基因 1304 F274 G C275 C G276 G C277 T A294 T C297 T C300 G A1305 F5′-AGCAAAAGCA GGTAGTGATA G-3′ 1-21(SEQ ID NO62)
该引物序列与PA基因序列有如下差异位点 PA基因 1302 F15 CG20 TA21 CG1306 R 5′-AGTAGAAACA AGGTA-3′ 2219-2233(SEQ ID NO63)1307 F 5′-AGCGAAAGCA GGTAGTGATT CGAGATGGA-3′ 1-29(SEQ ID NO64;该引物序列在核苷酸4为G,而PA基因序列为A)1308 R 5′-AGTAGAAACA AGGTACTTTT TTGGACAG-3′2206-2233(SEQ ID NO65)1309 R 5′-TCGATTTGTT GGAGTGACTG A-3′ 1782-1802(SEQ ID NO66)1310 F 5′-GCCGAACTTC TCCTGCCTTG A-3′684-704(SEQ ID NO67)1311 F5′-GTATTCAATA GCCTGTATG-3′ 1957-1975(SEQ ID NO68)1312 R5′-GACTTCTCTC CTTGTCACTC-3′ 386-405(SEQ ID NO69)1313 F 5′-ACGAGTCAGC TAAAGTGGGC A-3′ 1111-1131(SEQ ID NO70)1314 R 5′-GACACCTCTG CTGTGAAGTA A-3′ 1356-1376(SEQ ID NO71)1315 F 5′-AGCAAAAGCA GGTACTGATT CGAGA-3′1-25(SEQ ID NO56)1316 R 5′-AGTAGAAACA AGGTACTTTT TTGGA-3′ 2209-2233(SEQ ID NO57)1317 F 5′-GGAGCTGAGA AACCGAAGTT T-3′319-339(SEQ ID NO72)1318 R 5′-GACTTGGCCA ATAAAGTCCT A-3′1935-1955
(SEQ ID NO73)1319 R 5′-GCCTGCGCAT AGTTCCTGTG A-3′ 644-664(SEQ ID NO74)1320 R 5′-TCTACCACTA TTGACTCGCC-3′ 196-215(SEQ ID NO75)·用于确定末端序列的引物1301 F 5′-GAACCTGGAA CCTTTGATCT t-3′ 2053-2073(SEQ ID NO58)1303 F 5′-GAACCTGGAA CCTTTGATCT TGAGGGGCTA-3′ 2053-2082(SEQ ID NO60)1311 F5′-GTATTCAATA GCCTGTATG-3′ 1957-1975(SEQ ID NO68)1312 R 5′-GACTTCTCTC CTTGTCACTC-3′ 386-405(SEQ ID NO69)1320 R 5′-TCTACCACTA TTGACTCGCC-3′ 196-215(SEQ ID NO75)HA(节段4)引物引物F/R引物序列 跨越的bp·RT引物1401 F 5′-AGCAAAAGCA GGGGATAATT CTA-3′1-23(SEQ ID NO76)·PCR引物1401 F 5′-AGCAAAAGCA GGGGATAATT CTA-3′1-23(SEQ ID NO76)1402 R 5′-AGTAGAAACA AGGGTGTTTT TAA-3′ 1743-1765(SEQ ID NO77)·测序引物1401 F 5′-AGCAAAAGCA GGGGATAATT CTA-3′1-23(SEQ ID NO76)1402 R 5′-AGTAGAAACA AGGGTGTTTT TAA-3′ 1743-1765(SEQ ID NO77)1403 R 5′-ACAGTTTGTT CATTTCCGAG-3′ 1400-1419
(SEQ ID NO78)1404 R 5′-ACTTCAGGGT GTTTTGCTTA-3′ 1004-1023(SEQ ID NO79)1405 R 5′-GAACCCCCCA AATGTATAGT-3′ 605-624(SEQ ID NO80)1406 R 5′-TGAGGAACTC TGAACCAGCT-3′ 199-218(SEQ ID NO81)1407 F 5′-GTCACTAGTT GCCTCGTCAG-3′ 404-423(SEQ ID NO82)1408 F 5′-CCGGGAGACA TACTGGTAAT-3′ 792-811(SEQ ID NO83)1409 F 5′-CAAATCAATG GGAAACTGAA-3′ 1203-1222(SEQ ID NO84)1410 F 5′-TGAACTGAAG TCAGGATACA-3′ 1592-1611(SEQ ID NO85)·用于确定末端序列的引物1405 R 5′-GAACCCCCCA AATGTATAGT-3′ 605-624(SEQ ID NO80)1406 R 5′-TGAGGAACTC TGAACCAGCT-3′ 199-218(SEQ ID NO81)1409 F 5′-CAAATCAATG GGAAACTGAA-3′ 1203-1222(SEQ ID NO84)1410 F 5′-TGAACTGAAG TCAGGATACA-3′ 1592-1611(SEQ ID NO85)NP(节段5)引物引物F/R 引物序列 跨越的bp·RT引物1509 F 5′-AGCAAAAGCA GGGTTAATAA TCAC-3′ 1-24(SEQ ID NO86)·PCR引物1509 F 5′-AGCAAAAGCA GGGTTAATAA TCAC-3′ 1-24(SEQ ID NO86)1510 R 5′-AGTAGAAACA AGGGTATTTT TCCT-3′ 1542-1565(SEQ ID NO87)·测序引物1501 R 5′-TTGCACCTTC CATCATCCTT-3′ 1380-1399(SEQ ID NO88)1502 R 5′-GTCTATTCCC ACTAAAGAGT-3′ 932-951(SEQ ID NO89)1503 R 5′-TGCATCAGAG AGCACATCCT-3′ 529-548(SEQ ID NO90)1504 F 5′-GAACTCGTCC TTTATGACAA-3′ 364-383(SEQ ID NO91)1505 F 5′-AGAGCAATGG ATCAAGT-3′ 751-770(SEQ ID NO92;在核苷酸761-763上有三个碱基ATG的缺失)1506 F 5′-TACTATGGAA TCAAGTACTC-3′ 1161-1180(SEQ ID NO93)1507 R 5′-ATCAATCATC TTCCCGAC-3′ 130-147(SEQ ID NO94)1508 F 5′-AGTGTCCTTC CGTGGGCG-3′ 1410-1427(SEQ ID NO95)1509 F 5′-AGCAAAAGCA GGGTTAATAA TCAC-3′1-24(SEQ ID NO86)1510 R 5′-AGTAGAAACA AGGGTATTTT TCCT-3′ 1542-1565(SEQ ID NO87)·用于确定末端序列的引物1503 R 5′-TGCATCAGAG AGCACATCCT-3′ 529-548(SEQ ID NO90)1506 F 5′-TACTATGGAA TCAAGTACTC-3′1161-1180(SEQ ID NO93)NA(节段6)引物引物F/R 引物序列 跨越的bp·RT引物1601 F 5′-AGCAAAAGCA GGAGTGAAGA TGA-3′ 1-23(SEQ ID NO96)·PCR引物1601 F 5′-AGCAAAAGCA GGAGTGAAGA TGA-3′ 1-23(SEQ ID NO96)1602 R 5′-AGTAGAAACA AGGAGTTTTT TCTA-3′ 1443-1466(SEQ ID NO97)·测序引物1601 F 5′-AGCAAAAGCA GGAGTGAAGA TGA-3′ 1-23(SEQ ID NO96)1602 R 5′-AGTAGAAACA AGGAGTTTTT TCTA-3′ 1443-1466(SEQ ID NO97)1603 R 5’-TCGTTGTTTC TGGGTGTGTC-3′ 989-1008(SEQ ID NO98;该引物序列在核苷酸992为T,而NA基因序列为C)1604 R 5’-TTATCATACC CAGTGACACA-3′ 596-615(SEQ ID NO99)1605 R 5’-ATTATTATTG GTTCACACGG-3′ 173-192(SEQ ID NO100)1606 F 5’-TTATGTGTCA TGCGATCCTG-3′ 379-398(SEQ ID NO101)1607 F 5’-ATTGTTCATA TTAGCCCATT-3′ 803-822(SEQ ID NO102)1608 F 5’-ATAGGTCAGG TTATTCTGGT-3′1224-1243(SEQ ID NO103)·用于确定末端序列的引物1608 F 5’-ATAGGTCAGG TTATTCTGGT-3′1224-1243(SEQ ID NO103)1605 R 5’-ATTATTATTG GTTCACACG-3′ 173-192(SEQ ID NO100)M1(节段7)引物引物F/R 引物序列 跨越的bp·RT引物*1706 F 5’-AGCAAAAGCA GGTAG-3′1-15(SEQ ID NO104)·PCR引物1707 R 5’-AGTAGAAACA AGGTA-3′ 1013-1027(SEQ ID NO105)1706 F 5’-AGCAAAAGCA GGTAG-3′1-15(SEQ ID NO104)*1701R 5’-TTTACTCCAG CTCTATGCTG ACAA-3′ 985-1008(SEQ ID NO106)*这些引物将扩增M1和M2基因·测序引物1701 R 5’-TTTACTCCAG CTCTATGCTG ACAA-3′ 985-1008(SEQ ID NO106)1702 R 5’-GATCCAGCCA TTTGCTCCAT-3′ 590-609(SEQ ID NO107)1703 R 5’-GAGGTGACAG GATTGGTCTT-3′ 169-188(SEQ ID NO108)1704 F 5’-CATGGACAGA GCAGTTAAAC-3′ 301-320(SEQ ID NO109)°1705F 5’-GCGAGTATCA TTGGGATCTT-3′ 801-820(SEQ ID NO110)°为M1和M2序列共有·用于确定末端序列的引物1705 F 5’-GCGAGTATCA TTGGGATCTT-3′ 801-820(SEQ ID NO110)1702 R 5’-GATCCAGCCA TTTGCTCCAT-3′ 590-609(SEQ ID NO107)1703 R 5’-GAGGTGACAG GATTGGTCTT-3′ 169-188(SEQ ID NO108)对序列M2而言,引物1701、1705和1706是足够的。NS1(节段8)引物引物F/R引物序列跨越的bp·RT引物*1801F 5’-AGCAAAAGCA GGGTGACAAA GACA-3′ 1-24(SEQ ID NO111)·PCR引物1801 F 5’-AGCAAAAGCA GGGTGACAAA GACA-3′ 1-24(SEQ ID NO111)1802 R 5’-AGTAGAAACA AGGGTGTTTT TTAT-3′ 867-890(SEQ ID NO112)*1803R 5’-TTTTTTATCA TTAAATAAGC TGAA-3′ 851-874(SEQ ID NO113)*这些引物扩增NS1和NS2基因。·测序引物°1801F 5’-AGCAAAAGCA GGGTGACAAA GACA-3′ 1-24(SEQ ID NO111)1802 R 5’-AGTAGAAACA AGGGTGTTTT TTAT-3′ 867-890(SEQ ID NO112)°1803R 5’-TTTTTTATCA TTAAATAAGC TGAA-3′ 851-874(SEQ ID NO113)1804 R 5’-AATTGCATTT TTGACATCCT-3′ 541-560(SEQ ID NO114)1805 R 5’-ATCTTCTCTA CTATCTGCTT-3′ 210-229(SEQ ID NO115)1806 F 5’-CAAGCAATCA TGGATAAGAA-3′ 387-406(SEQ ID NO116)°1807F 5’-GGCGAGAACA GCTAGGTCAA-3′ 692-711(SEQ ID NO117)°为NS1和NS2序列共有·用于确定末端序列的引物1807 F 5’-GGCGAGAACA GCTAGGTCAA-3′ 692-711(SEQ ID NO117)1804 R 5’-AATTGCATTT TTGACATCCT-3′ 541-560
(SEQ ID NO114)1806 F 5’-CAAGCAATCA TGGATAAGAA-3′387-406(SEQ ID NO116)1805 R 5’-ATCTTCTCTA CTATCTGCTT-3′210-229(SEQ ID NO115)对序列NS2而言,引物1801、1803和1807是足够的。
实施例2PCR试验检测流感A/Udorn/72(H3N2)菌株的节段用PCR试验检测流感A/Udorn/72(H3N2)菌株的节段的存在。根据本文所述的病毒序列的同源性设计和选择PCR引物。如下进行该试验将样品进行RNA逆转录,PCR扩增所选的相应于本文所述的特异性核苷酸序列的cDNA区域。在凝胶上鉴定扩增的PCR产物,并与特异性核苷酸探针杂交验证它们的特异性。
实施例3ELISA检测由流感A/Udorn/72(H3N2)菌株的节段产生的抗原用ELISA检测由流感A/Udorn/72(H3N2)菌株的节段产生的抗原的存在。根据本文所述的病毒序列的同源性设计和选择肽。然后将这些肽偶联于KLH,并用于免疫接种兔,以产生单特异性多克隆抗体。然后在“捕获ELISA”中使用这些多克隆抗体的选择或多克隆和单克隆抗体的组合,以检测由该病毒节段产生的蛋白的存在。
序列表序列表<110>美国氰胺公司<120>流感A/Udorn/72(H3N2)基因组的核苷酸序列<130>AM100289PCT<140><141><150>60/213,650<151>2000-06-23<160>117<170>PatentIn Ver.2.1版本<210>1<211>2341<212>DNA<213>流感A病毒(Influenza A virus)<220><221>CDS<222>(28)..(2304)<400>1agcaaaagca ggtcaattat attcaat atg gaa aga ata aaa gaa cta cgg aat 54Met Glu Arg Ile Lys Glu Leu Arg Asn1 5ctg atg tcg cag tct cgc act cgc gag ata cta aca aaa acc aca gtg 102Leu Met Ser Gln Ser Arg Thr Arg Glu Ile Leu Thr Lys Thr Thr Val10 15 20 25gac cat atg gcc ata att aag aag tac aca tca ggg aga cag gaa aag 150Asp His Met Ala Ile Ile Lys Lys Tyr Thr Set Gly Arg Gln Glu Lys30 35 40aac ccg tca ctt agg atg aaa tgg atg atg gca atg aaa tat cca att 198Asn Pro Ser Leu Arg Met Lys Trp Met Met Ala Met Lys Tyr Pro Ile45 50 55aca gct gac aag agg ata aca gaa atg gtt cct gag aga aat gag caa 246Thr Ala Asp Lys Arg Ile Thr Glu Met Val Pro Glu Arg Asn Glu Gln60 65 70gga caa acc cta tgg agt aaa atg agt gat gcc ggg tca gat cga gtg 294Gly Gln Thr Leu Trp Ser Lys Met Ser Asp Ala Gly Ser Asp Arg Val75 80 85atg gta tca cct ttg gcg gtg aca tgg tgg aat aga aat gga cca gtg 342Met Val Ser Pro Leu Ala Val Thr Trp Trp Asn Arg Asn Gly Pro Val90 95 100 105aca agt acg gtt cat tat cca aaa gtc tac aag act tat ttt gat aaa 390Thr Ser Thr Val His Tyr Pro Lys Val Tyr Lys Thr Tyr Phe Asp Lys110 115 120gtc gaa agg tta aaa cat gga acc ttt ggc cct gtc cat ttt aga aac 438Val Glu Arg Leu Lys His Gly Thr Phe Gly Pro Val His Phe Arg Asn125 130 135caa gtc aaa ata cgc cga aga gtt gac ata aac cct ggt cat gca gac 486Gln Val Lys Ile Arg Arg Arg Val Asp Ile Asn Pro Gly His Ala Asp140 145 150ctc agt gcc aag gag gca caa gat gta atc atg gaa gtt gtt ttc ccc 534Leu Ser Ala Lys Glu Ala Gln Asp Val Ile Met Glu Val Val Phe Pro155 160 165aat gaa gtg ggg gcc agg ata cta acg tcg gaa tca caa tta aca ata 582Asn Glu Val Gly Ala Arg Ile Leu Thr Ser Glu Ser Gln Leu Thr Ile170 175 180 185acc aaa gag aaa aaa gaa gaa ctc caa gat tgc aaa att tct cct ttg 630Thr Lys Glu Lys Lys Glu Glu Leu Gln Asp Cys Lys Ile Ser Pro Leu190 195 200atg gtt gca tac atg tta gag aga gaa ctt gtc cga aaa acg aga ttt 678Met Val Ala Tyr Met Leu Glu Arg Glu Leu Val Arg Lys Thr Arg Phe205 210 215ctc cca gtt gct ggt gga aca agc agt gtg tac att gaa gtg tta cac 726Leu Pro Val Ala Gly Gly Thr Ser Ser Val Tyr Ile Glu Val Leu His220 225 230ttg act caa gga acg tgt tgg gaa cag atg tac act cca ggt gga gaa 774Leu Thr Gln Gly Thr Cys Trp Glu Gln Met Tyr Thr Pro Gly Gly Glu235 240 245gtg agg aat gac gat gtt gac caa agc cta att att gca gcc agg aac 822Val Arg Asn Asp Asp Val Asp Gln Ser Leu Ile Ile Ala Ala Arg Asn250 255 260 265ata gtg aga aga gca gca gta tca gca gat cca cta gca tct tta ttg 870Ile Val Arg Arg Ala Ala Val Ser Ala Asp Pro Leu Ala Ser Leu Leu270 275 280gag atg tgc cac agc aca ctg att ggc ggg aca agg atg gtg gac att 918Glu Met Cys His Ser Thr Leu Ile Gly Gly Thr Arg Met Val Asp Ile285 290 295ctt agg cag aac ccg acg gaa gaa caa gct gtg gat ata tgc aag gct 966Leu Arg Gln Asn Pro Thr Glu Glu Gln Ala Val Asp Ile Cys Lys Ala300 305 310gca atg gga ctg agg atc agc tca tcc ttc agt ttt ggt ggg ttc aca 1014Ala Met Gly Leu Arg Ile Ser Ser Ser Phe Ser Phe Gly Gly Phe Thr315 320 325ttt aag aga aca agc ggg tca tca atc aaa aga gag gaa gaa gtg ctt 1062Phe Lys Arg Thr Ser Gly Ser Ser Ile Lys Arg Glu Glu Glu Val Leu330 335 340 345acg ggc aat ctc caa aca ttg aaa ata agg gtg cat gag ggg tac gag 1110Thr Gly Asn Leu Gln Thr Leu Lys Ile Arg Val His Glu Gly Tyr Glu
350 355 360gag ttc aca atg gtg ggg aaa agg gca aca gct ata ctc aga aaa gca 1158Glu Phe Thr Met Val Gly Lys Arg Ala Thr Ala Ile Leu Arg Lys Ala365 370 375acc agg aga ttg gtt cag ctt ata gtg agt gga agg gac gag cag tca 1206Thr Arg Arg Leu Val Gln Leu Ile Val Ser Gly Arg Asp Glu Gln Ser380 385 390ata gcc gaa gcg ata att gta gcc atg gtg ttt tca caa gag gat tgc 1254Ile Ala Glu Ala Ile Ile Val Ala Met Val Phe Ser Gln Glu Asp Cys395 400 405atg ata aaa gca gtt aga ggt gac ctg aat ttc gtt aac agg gca aat 1302Met Ile Lys Ala Val Arg Gly Asp Leu Asn Phe Val Asn Arg Ala Asn410 415 420 425cag cgg ttg aat ccc atg cat caa ctt tta agg cat ttt cag aaa gat 1350Gln Arg Leu Asn Pro Met His Gln Leu Leu Arg His Phe Gln Lys Asp430 435 440gcg aaa gtg ctt ttt cag aat tgg gga att gaa cat atc gac aat gtg 1398Ala Lys Val Leu Phe Gln Asn Trp Gly Ile Glu His Ile Asp Asn Val445 450 455atg gga atg gtt gga gta tta cca gac atg act cca agc aca gag atg 1446Met Gly Met Val Gly Val Leu Pro Asp Met Thr Pro Ser Thr Glu Met460 465 470tca atg aga gga ata aga gtc agc aaa atg ggc gtg gat gaa tac tcc 1494Ser Met Arg Gly Ile Arg Val Ser Lys Met Gly Val Asp Glu Tyr Ser475 480 485agc aca gag agg gta gtg gtt agc att gat cgg ttt ttg aga gtt cga 1542Ser Thr Glu Arg Val Val Val Ser Ile Asp Arg Phe Leu Arg Val Arg490 495 500 505gac caa cgt ggg aat gta tta cta tct cct gag gag gtc agt gaa aca 1590Asp Gln Arg Gly Asn Val Leu Leu Ser Pro Glu Glu Val Ser Glu Thr510 515 520cag ggg aca gag aga ctg aca ata act tac tca tcg tca atg atg tgg 1638Gln Gly Thr Glu Arg Leu Thr Ile Thr Tyr Ser Ser Set Met Met Trp525 530 535gag att aat ggc cct gag tca gtg ttg gtc aat acc tat caa tgg atc 1686Glu Ile Asn Gly Pro Glu Ser Val Leu Val Asn Thr Tyr Gln Trp Ile540 545 550atc aga aac tgg gaa act gtt aaa att caa tgg tct cag aat cct aca 1734Ile Arg Asn Trp Glu Thr Val Lys Ile Gln Trp Ser Gln Asn Pro Thr555 560 565atg ttg tac aac aaa atg gaa ttt gag cca ttt cag tct tta gtt cct 1782Met Leu Tyr Asn Lys Met Glu Phe Glu Pro Phe Gln Ser Leu Val Pro570 575 580 585aag gcc att aga ggc caa tac agt gga ttt gtc aga act cta ttc caa 1830Lys Ala Ile Arg Gly Gln Tyr Ser Gly Phe Val Arg Thr Leu Phe Gln590 595 600caa atg agg gat gta ctt ggg aca ttt gat acc acc cag ata ata aaa 1878Gln Met Arg Asp Val Leu Gly Thr Phe Asp Thr Thr Gln Ile Ile Lys605 610 615ctt ctc ccc ttt gca gcc gcc cca cca aag caa agt aga atg cag ttc 1926Leu Leu Pro Phe Ala Ala Ala Pro Pro Lys Gln Ser Arg Met Gln Phe620 625 630tct tca ttg act gtg aat gtg agg gga tca ggg atg aga ata ctt gta 1974Ser Ser Leu Thr Val Asn Val Arg Gly Ser Gly Met Arg Ile Leu Val635 640 645agg ggc aat tct cct gta ttc aac tac aac aag acc act aaa aga cta 2022Arg Gly Asn Ser Pro Val Phe Asn Tyr Asn Lys Thr Thr Lys Arg Leu650 655 660 665aca att ctc gga aaa gat gct ggc act tta att gaa gac cca gat gaa 2070Thr Ile Leu Gly Lys Asp Ala Gly Thr Leu Ile Glu Asp Pro Asp Glu670 675 680agc aca tcc gga gtg gag tcc gct gtt ttg aga gga ttt ctc att cta 2118Ser Thr Ser Gly Val Glu Ser Ala Val Leu Arg Gly Phe Leu Ile Leu685 690 695ggt aag gaa gat aga aga tac gga cca gca tta agc atc aat gaa ctg 2166Gly Lys Glu Asp Arg Arg Tyr Gly Pro Ala Leu Ser Ile Asn Glu Leu700 705 710agt aac ctt gca aaa gga gaa aag gct aat gtg cta att ggg caa gga 2214Ser Asn Leu Ala Lys Gly Glu Lys Ala Asn Val Leu Ile Gly Gln Gly715 720 725gac gtg gtg ttg gta atg aaa cga aaa cgg gac tct agc ata ctt act 2262Asp Val Val Leu Val Met Lys Arg Lys Arg Asp Ser Ser Ile Leu Thr730 735 740 745gac agc cag aca gcg acc aaa aga att cgg atg gcc atc aat 2304Asp Ser Gln Thr Ala Thr Lys Arg Ile Arg Met Ala Ile Asn750 755taatgttgaa tagtttaaaa acgaccttgt ttctact 2341<210>2<211>759<212>PRT<213>流感A病毒<400>2Met Glu Arg Ile Lys Glu Leu Arg Asn Leu Met Ser Gln Sar Arg Thr1 5 10 15Arg Glu Ile Leu Thr Lys Thr Thr Val Asp His Met Ala Ile Ile Lys20 25 30Lys Tyr Thr Ser Gly Arg Gln Glu Lys Asn Pro Ser Leu Arg Met Lys
35 40 45Trp Met Met Ala Met Lys Tyr Pro Ile Thr Ala Asp Lys Arg Ile Thr50 55 60Glu Met Val Pro Glu Arg Asn Glu Gln Gly Gln Thr Leu Trp Ser Lys65 70 75 80Met Ser Asp Ala Gly Ser Asp Arg Val Met Val Ser Pro Leu Ala Val85 90 95Thr Trp Trp Asn Arg Asn Gly Pro Val Thr Ser Thr Val His Tyr Pro100 105 110Lys Val Tyr Lys Thr Tyr Phe Asp Lys Val Glu Arg Leu Lys His Gly115 120 125Thr Phe Gly Pro Val His Phe Arg Asn Gln Val Lys Ile Arg Arg Arg130 135 140Val Asp Ile Asn Pro Gly His Ala Asp Leu Ser Ala Lys Glu Ala Gln145 150 155 160Asp Val Ile Met Glu Val Val Phe Pro Asn Glu Val Gly Ala Arg Ile165 170 175Leu Thr Ser Glu Ser Gln Leu Thr Ile Thr Lys Glu Lys Lys Glu Glu180 185 190Leu Gln Asp Cys Lys Ile Ser Pro Leu Met Val Ala Tyr Met Leu Glu195 200 205Arg Glu Leu Val Arg Lys Thr Arg Phe Leu Pro Val Ala Gly Gly Thr210 215 220Ser Ser Val Tyr Ile Glu Val Leu His Leu Thr Gln Gly Thr Cys Trp225 230 235 240Glu Gln Met Tyr Thr Pro Gly Gly Glu Val Arg Asn Asp Asp Val Asp245 250 255Gln Ser Leu Ile Ile Ala Ala Arg Asn Ile Val Arg Arg Ala Ala Val260 265 270Ser Ala Asp Pro Leu Ala Ser Leu Leu Glu Met Cys His Ser Thr Leu275 280 285Ile Gly Gly Thr Arg Met Val Asp Ile Leu Arg Gln Asn Pro Thr Glu290 295 300Glu Gln Ala Val Asp Ile Cys Lys Ala Ala Met Gly Leu Arg Ile Ser305 310 315 320Ser Ser Phe Ser Phe Gly Gly Phe Thr Phe Lys Arg Thr Ser Gly Ser325 330 335Ser Ile Lys Arg Glu Glu Glu Val Leu Thr Gly Asn Leu Gln Thr Leu340 345 350Lys Ile Arg Val His Glu Gly Tyr Glu Glu Phe Thr Met Val Gly Lys355 360 365Arg Ala Thr Ala Ile Leu Arg Lys Ala Thr Arg Arg Leu Val Gln Leu370 375 380Ile Val Ser Gly Arg Asp Glu Gln Ser Ile Ala Glu Ala Ile Ile Val385 390 395 400Ala Met Val Phe Ser Gln Glu Asp Cys Met Ile Lys Ala Val Arg Gly405 410 415Asp Leu Asn Phe Val Asn Arg Ala Asn Gln Arg Leu Asn Pro Met His420 425 430Gln Leu Leu Arg His Phe Gln Lys Asp Ala Lys Val Leu Phe Gln Asn435 440 445Trp Gly Ile Glu His Ile Asp Asn Val Met Gly Met Val Gly Val Leu450 455 460Pro Asp Met Thr Pro Ser Thr Glu Met Ser Met Arg Gly Ile Arg Val465 470 475 480Ser Lys Met Gly Val Asp Glu Tyr Ser Ser Thr Glu Arg Val Val Val485 490 495Ser Ile Asp Arg Phe Leu Arg Val Arg Asp Gln Arg Gly Asn Val Leu500 505 510Leu Ser Pro Glu Glu Val Ser Glu Thr Gln Gly Thr Glu Arg Leu Thr515 520 525Ile Thr Tyr Ser Ser Ser Met Met Trp Glu Ile Asn Gly Pro Glu Ser530 535 540Val Leu Val Asn Thr Tyr Gln Trp Ile Ile Arg Asn Trp Glu Thr Val545 550 555 560Lys Ile Gln Trp Ser Gln Asn Pro Thr Met Leu Tyr Asn Lys Met Glu565 570 575Phe Glu Pro Phe Gln Ser Leu Val Pro Lys Ala Ile Arg Gly Gln Tyr580 585 590Ser Gly Phe Val Arg Thr Leu Phe Gln Gln Met Arg Asp Val Leu Gly595 600 605Thr Phe Asp Thr Thr Gln Ile Ile Lys Leu Leu Pro Phe Ala Ala Ala610 615 620Pro Pro Lys Gln Ser Arg Met Gln Phe Ser Ser Leu Thr Val Asn Val625 630 635 640Arg Gly Ser Gly Met Arg Ile Leu Val Arg Gly Asn Ser Pro Val Phe645 650 655Asn Tyr Asn Lys Thr Thr Lys Arg Leu Thr Ile Leu Gly Lys Asp Ala660 665 670Gly Thr Leu Ile Glu Asp Pro Asp Glu Ser Thr Ser Gly Val Glu Ser675 680 685Ala Val Leu Arg Gly Phe Leu Ile Leu Gly Lys Glu Asp Arg Arg Tyr690 695 700Gly Pro Ala Leu Ser Ile Asn Glu Leu Ser Asn Leu Ala Lys Gly Glu705 710 715 720Lys Ala Asn Val Leu Ile Gly Gln Gly Asp Val Val Leu Val Met Lys725 730 735Arg Lys Arg Asp Ser Ser Ile Leu Thr Asp Ser Gln Thr Ala Thr Lys740 745 750Arg Ile Arg Met Ala Ile Asn755<210>3<211>2341<212>DNA<213>流感A病毒<220><221>CDS<222>(25)..(2295)<400>3agcgaaagca ggcaaaccat ttga atg gat gtc aac ccg act tta ctt ttc51Met Asp Val Asn Pro Thr Leu Leu Phe1 5ttg aaa gtt cca gcg caa aat gcc ata agc acc aca ttc cct tat act 99Leu Lys Val Pro Ala Gln Asn Ala Ile Ser Thr Thr Phe Pro Tyr Thr10 15 20 25gga gat cct cca tac agc cat gga aca gga aca gga tac acc atg gac 147Gly Asp Pro Pro Tyr Ser His Gly Thr Gly Thr Gly Tyr Thr Met Asp30 35 40aca gtc aac aga aca cat caa tat tca gaa aaa ggg aag tgg aca aca 195Thr Val Asn Arg Thr His Gln Tyr Ser Glu Lys Gly Lys Trp Thr Thr45 50 55aac aca gaa act ggg gcg ccc caa ctt aac cca att gat gga cca cta 243Asn Thr Glu Thr Gly Ala Pro Gln Leu Asn Pro Ile Asp Gly Pro Leu60 65 70cct gag gat aat gag cca agt gga tat gca caa aca gac tgt gtc ctg 291Pro Glu Asp Asn Glu Pro Ser Gly Tyr Ala Gln Thr Asp Cys Val Leu75 80 85gaa gca atg gct ttc ctt gaa gaa tcc cac cca ggg atc ttt gaa aac 339Glu Ala Met Ala Phe Leu Glu Glu Ser His Pro Gly Ile Phe Glu Asn90 95 100 105tcg tgc ctt gaa acg atg gaa gtc gtt caa caa aca agg gtg gac aga 387Ser Cys Leu Glu Thr Met Glu Val Val Gln Gln Thr Arg Val Asp Arg110 115 120ctg acc caa ggt cgt cag acc tat gat tgg aca tta aac aga aat caa 435Leu Thr Gln Gly Arg Gln Thr Tyr Asp Trp Thr Leu Asn Arg Asn Gln125 130 135ccg gcc gca act gca tta gcc aac act ata gaa gtc ttc aga tcg aat 483Pro Ala Ala Thr Ala Leu Ala Asn Thr Ile Glu Val Phe Arg Ser Asn140 145 150ggt cta aca gct aat gag tcg gga agg cta ata gat ttc ctc aag gat 531Gly Leu Thr Ala Asn Glu Ser Gly Arg Leu Ile Asp Phe Leu Lys Asp155 160 165gtg atg gaa tca atg gat aaa gag gaa atg gag ata aca aca cac ttc 579Val Met Glu Ser Met Asp Lys Glu Glu Met Glu Ile Thr Thr His Phe170 175 180 185caa aga aaa aga aga gta aga gac aac atg acc aag aaa atg gtc aca 627Gln Arg Lys Arg Arg Val Arg Asp Asn Met Thr Lys Lys Met Val Thr190 195 200caa aga aca ata gga aag aag aag cag aga gtg aac aag aga agc tat 675Gln Arg Thr Ile Gly Lys Lys Lys Gln Arg Val Asn Lys Arg Ser Tyr205 210 215cta ata aga gca tta aca ttg aac aca atg acc aaa gat gca gaa aga 723Leu Ile Arg Ala Leu Thr Leu Asn Thr Met Thr Lys Asp Ala Glu Arg220 225 230ggt aaa tta aag aga aga gct att gca aca ccc ggg atg caa atc aga 771Gly Lys Leu Lys Arg Arg Ala Ile Ala Thr Pro Gly Met Gln Ile Arg235 240 245ggg ttc gtg tac ttt gtt gaa act cta gct agg agc att tgt gag aag 819Gly Phe Val Tyr Phe Val Glu Thr Leu Ala Arg Ser Ile Cys Glu Lys250 255 260 265ctt gaa cag tct gga ctt cca gtt gga ggt aat gaa aag aag gcc aaa 867Leu Glu Gln Ser Gly Leu Pro Val Gly Gly Asn Glu Lys Lys Ala Lys270 275 280ctg gca aat gtt gtg aga aag atg atg act aat tca caa gac aca gag 915Leu Ala Asn Val Val Arg Lys Met Met Thr Asn Ser Gln Asp Thr Glu285 290 295ctt tct ttc aca att act gga gac aat act aag tgg aat gaa aat caa 963Leu Ser Phe Thr Ile Thr Gly Asp Asn Thr Lys Trp Asn Glu Asn Gln300 305 310aat cct cga atg ttc ctg gcg atg att aca tat atc aca aaa aat caa 1011Asn Pro Arg Met Phe Leu Ala Met Ile Thr Tyr Ile Thr Lys Asn Gln315 320 325cct gaa tgg ttc aga aac att ctg agc atc gca ccc ata atg ttc tca 1059Pro Glu Trp Phe Arg Asn Ile Leu Ser Ile Ala Pro Ile Met Phe Ser330 335 340 345aac aaa atg gcg aga cta ggg aaa gga tac atg ttc gaa agt aag aga 1107Asn Lys Met Ala Arg Leu Gly Lys Gly Tyr Met Phe Glu Ser Lys Arg350 355 360atg aag ctc cga aca caa ata cca gca gaa atg cta gca agc att gac 1155Met Lys Leu Arg Thr Gln Ile Pro Ala Glu Met Leu Ala Ser Ile Asp365 370 375cta aag tat ttc aat gaa tca aca aga aag aaa att gag aaa ata aag 1203Leu Lys Tyr Phe Asn Glu Ser Thr Arg Lys Lys Ile Glu Lys Ile Lys380 385 390cct ctt cta ata gat ggc aca gcg tca ttg agt cct gga atg atg atg 1251Pro Leu Leu Ile Asp Gly Thr Ala Ser Leu Ser Pro Gly Met Met Met395 400 405ggc atg ttc aac atg cta agt acg gtt tta gga gtc tca atc ctg aat 1299Gly Met Phe Asn Met Leu Ser Thr Val Leu Gly Val Ser Ile Leu Asn410 415 420 425ctt ggg caa aag aaa tac acc aaa aca aca tac tgg tgg gat gga ctc 1347Leu Gly Gln Lys Lys Tyr Thr Lys Thr Thr Tyr Trp Trp Asp Gly Leu430 435 440caa tcc tct gat gat ttt gct ctc ata gtg aat gca cca aat cat gag 1395Gln Ser Ser Asp Asp Phe Ala Leu Ile Val Asn Ala Pro Asn His Glu445 450 455gga ata caa gca gga gtg gat aga ttc tac agg acc tgc aag tta gtc 1443Gly Ile Gln Ala Gly Val Asp Arg Phe Tyr Arg Thr Cys Lys Leu Val460 465 470gga atc aat atg agc aag aag aag tcc tat ata aat agg aca gga aca 1491Gly Ile Asn Met Ser Lys Lys Lys Ser Tyr Ile Asn Arg Thr Gly Thr475 480 485ttt gaa ttc aca agc ttt ttt tat cgc tat gga ttt gta gcc aat ttt 1539Phe Glu Phe Thr Ser Phe Phe Tyr Arg Tyr Gly Phe Val Ala Asn Phe490 495 500 505agc atg gag ctg ccc agt ttt gga gtg tct ggg att aat gag tca gct 1587Ser Met Glu Leu Pro Ser Phe Gly Val Ser Gly Ile Asn Glu Ser Ala510 515 520gat atg agc att gga gta aca gtg ata aag aac aac atg ata aac aat 1635Asp Met Ser Ile Gly Val Thr Val Ile Lys Asn Asn Met Ile Asn Asn525 530 535gac ctt gga cca gca aca gcc cag atg gct ctt caa ctg ttc atc aag 1683Asp Leu Gly Pro Ala Thr Ala Gln Met Ala Leu Gln Leu Phe Ile Lys540 545 550gac tac aga tat aca tat cgg tgc cac aga gga gac aca caa att cag 1731Asp Tyr Arg Tyr Thr Tyr Arg Cys His Arg Gly Asp Thr Gln Ile Gln555 560 565acg agg aga tca ttc gag cta aag aag ctg tgg gag caa acc cgc tca 1779Thr Arg Arg Ser Phe Glu Leu Lys Lys Leu Trp Glu Gln Thr Arg Ser570 575 580 585aag gca gga cta ttg gtt tca gat gga gga cca aac tta tac aat atc 1827Lys Ala Gly Leu Leu Val Ser Asp Gly Gly Pro Asn Leu Tyr Asn Ile590 595 600cgg aat ctt cac atc cct gaa gtc tgc tta aag tgg gag cta atg gat 1875Arg Asn Leu His Ile Pro Glu Val Cys Leu Lys Trp Glu Leu Met Asp605 610 615gag gac tat cag gga aga ctt tgt aat ccc ctg aat cca ttt gtc agc 1923Glu Asp Tyr Gln Gly Arg Leu Cys Asn Pro Leu Asn Pro Phe Val Ser620 625 630cat aag gag att gag tct gta aac aat gct gtg gta atg cca gct cat 1971His Lys Glu Ile Glu Ser Val Asn Asn Ala Val Val Met Pro Ala His635 640 645ggt cca gcc aag agc atg gaa tat gac gct gtt gca act aca cac tcc 2019Gly Pro Ala Lys Ser Met Glu Tyr Asp Ala Val Ala Thr Thr His Ser650 655 660 665tgg att ccc aag agg aac cgc tct att ctc aac aca agc caa agg gga 2067Trp Ile Pro Lys Arg Asn Arg Ser Ile Leu Asn Thr Ser Gln Arg Gly670 675 680att ctt gag gat gaa cag atg tat cag aag tgc tgc aac ctg ttc gag 2115Ile Leu Glu Asp Glu Gln Met Tyr Gln Lys Cys Cys Asn Leu Phe Glu685 690 695aaa ttt ttc ccc agt agt tca tac agg aga ccg gtt gga att tcc agc 2163Lys Phe Phe Pro Ser Ser Ser Tyr Arg Arg Pro Val Gly Ile Ser Ser700 705 710atg gtg gag gcc atg gtg tct agg gcc cgg att gat gcc aga att gac 2211Met Val Glu Ala Met Val Ser Arg Ala Arg Ile Asp Ala Arg Ile Asp715 720 725ttc gaa tct gga cgg att aag aaa gaa gag ttc gcc gag atc atg aag 2259Phe Glu Ser Gly Arg Ile Lys Lys Glu Glu Phe Ala Glu Ile Met Lys730 735 740 745atc tgt tcc acc att gaa gag ctc aga cgg caa aaa taatgaattt2305Ile Cys Ser Thr Ile Glu Glu Leu Arg Arg Gln Lys750 755agcttgtcct tcatgaaaaa atgccttgtt tctact 2341<210>4<211>757<212>PRT<213>流感A病毒<400>4Met Asp Val Asn Pro Thr Leu Leu Phe Leu Lys Val Pro Ala Gln Asn1 5 10 15Ala Ile Ser Thr Thr Phe Pro Tyr Thr Gly Asp Pro Pro Tyr Ser His
20 25 30Gly Thr Gly Thr Gly Tyr Thr Met Asp Thr Val Asn Arg Thr His Gln35 40 45Tyr Ser Glu Lys Gly Lys Trp Thr Thr Asn Thr Glu Thr Gly Ala Pro50 55 60Gln Leu Asn Pro Ile Asp Gly Pro Leu Pro Glu Asp Asn Glu Pro Ser65 70 75 80Gly Tyr Ala Gln Thr Asp Cys Val Leu Glu Ala Met Ala Phe Leu Glu85 90 95Glu Ser His Pro Gly Ile Phe Glu Asn Ser Cys Leu Glu Thr Met Glu100 105 110Val Val Gln Gln Thr Arg Val Asp Arg Leu Thr Gln Gly Arg Gln Thr115 120 125Tyr Asp Trp Thr Leu Asn Arg Asn Gln Pro Ala Ala Thr Ala Leu Ala130 135 140Asn Thr Ile Glu Val Phe Arg Ser Asn Gly Leu Thr Ala Asn Glu Ser145 150 155 160Gly Arg Leu Ile Asp Phe Leu Lys Asp Val Met Glu Ser Met Asp Lys165 170 175Glu Glu Met Glu Ile Thr Thr His Phe Gln Arg Lys Arg Arg Val Arg180 185 190Asp Asn Met Thr Lys Lys Met Val Thr Gln Arg Thr Ile Gly Lys Lys195 200 205Lys Gln Arg Val Asn Lys Arg Ser Tyr Leu Ile Arg Ala Leu Thr Leu210 215 220Asn Thr Met Thr Lys Asp Ala Glu Arg Gly Lys Leu Lys Arg Arg Ala225 230 235 240Ile Ala Thr Pro Gly Met Gln Ile Arg Gly Phe Val Tyr Phe Val Glu245 250 255Thr Leu Ala Arg Ser Ile Cys Glu Lys Leu Glu Gln Ser Gly Leu Pro260 265 270Val Gly Gly Asn Glu Lys Lys Ala Lys Leu Ala Asn Val Val Arg Lys275 280 285Met Met Thr Asn Ser Gln Asp Thr Glu Leu Ser Phe Thr Ile Thr Gly290 295 300Asp Asn Thr Lys Trp Asn Glu Asn Gln Asn Pro Arg Met Phe Leu Ala305 310 315 320Met Ile Thr Tyr Ile Thr Lys Asn Gln Pro Glu Trp Phe Arg Asn Ile325 330 335Leu Ser Ile Ala Pro Ile Met Phe Ser Asn Lys Met Ala Arg Leu Gly340 345 350Lys Gly Tyr Met Phe Glu Ser Lys Arg Met Lys Leu Arg Thr Gln Ile355 360 365Pro Ala Glu Met Leu Ala Ser Ile Asp Leu Lys Tyr Phe Asn Glu Ser370 375 380Thr Arg Lys Lys Ile Glu Lys Ile Lys Pro Leu Leu Ile Asp Gly Thr385 390 395 400Ala Ser Leu Ser Pro Gly Met Met Met Gly Met Phe Asn Met Leu Ser405 410 415Thr Val Leu Gly Val Ser Ile Leu Asn Leu Gly Gln Lys Lys Tyr Thr420 425 430Lys Thr Thr Tyr Trp Trp Asp Gly Leu Gln Ser Ser Asp Asp Phe Ala435 440 445Leu Ile Val Asn Ala Pro Asn His Glu Gly Ile Gln Ala Gly Val Asp450 455 460Arg Phe Tyr Arg Thr Cys Lys Leu Val Gly Ile Asn Met Ser Lys Lys465 470 475 480Lys Ser Tyr Ile Asn Arg Thr Gly Thr Phe Glu Phe Thr Ser Phe Phe485 490 495Tyr Arg Tyr Gly Phe Val Ala Asn Phe Ser Met Glu Leu Pro Ser Phe500 505 510Gly Val Ser Gly Ile Asn Glu Ser Ala Asp Met Ser Ile Gly Val Thr515 520 525Val Ile Lys Asn Asn Met Ile Asn Asn Asp Leu Gly Pro Ala Thr Ala530 535 540Gln Met Ala Leu Gln Leu Phe Ile Lys Asp Tyr Arg Tyr Thr Tyr Arg545 550 555 560Cys His Arg Gly Asp Thr Gln Ile Gln Thr Arg Arg Ser Phe Glu Leu565 570 575Lys Lys Leu Trp Glu Gln Thr Arg Ser Lys Ala Gly Leu Leu Val Ser580 585 590Asp Gly Gly Pro Asn Leu Tyr Asn Ile Arg Asn Leu His Ile Pro Glu595 600 605Val Cys Leu Lys Trp Glu Leu Met Asp Glu Asp Tyr Gln Gly Arg Leu610 615 620Cys Asn Pro Leu Asn Pro Phe Val Ser His Lys Glu Ile Glu Ser Val625 630 635 640Asn Asn Ala Val Val Met Pro Ala His Gly Pro Ala Lys Ser Met Glu645 650 655Tyr Asp Ala Val Ala Thr Thr His Ser Trp Ile Pro Lys Arg Asn Arg660 665 670Ser Ile Leu Asn Thr Ser Gln Arg Gly Ile Leu Glu Asp Glu Gln Met675 680 685Tyr Gln Lys Cys Cys Asn Leu Phe Glu Lys Phe Phe Pro Ser Ser Ser690 695 700Tyr Arg Arg Pro Val Gly Ile Ser Ser Met Val Glu Ala Met Val Ser705 710 715 720Arg Ala Arg Ile Asp Ala Arg Ile Asp Phe Glu Ser Gly Arg Ile Lys725 730 735Lys Glu Glu Phe Ala Glu Ile Met Lys Ile Cys Ser Thr Ile Glu Glu740 745 750Leu Arg Arg Gln Lys755<210>5<211>2233<212>DNA<213>流感A病毒<220><221>CDS<222>(25)..(2172)<400>5agcaaaagca ggtactgatt cgag atg gaa gat ttt gtg cga caa tgc ttc51Met Glu Asp Phe Val Arg Gln Cys Phe1 5aat ccg atg att gtc gag ctt gca gaa aag gca atg aaa gag tat gga 99Asn Pro Met Ile Val Glu Leu Ala Glu Lys Ala Met Lys Glu Tyr Gly10 15 20 25gag gat ctg aaa atc gaa aca aac aaa ttt gca gca ata tgc act cac 147Glu Asp Leu Lys Ile Glu Thr Asn Lys Phe Ala Ala Ile Cys Thr His30 35 40ttg gag ata tgt ttc atg tat tca gat ttt cat ttc atc aat gaa caa 195Leu Glu Ile Cys Phe Met Tyr Ser Asp Phe His Phe Ile Asn Glu Gln45 50 55ggc gag tca ata gtg gta gag ctt gat gat cca aat gca ctg tta aag 243Gly Glu Ser Ile Val Val Glu Leu Asp Asp Pro Asn Ala Leu Leu Lys60 65 70cac aga ttt gaa ata ata gag gga aga gac cgc aca atg gcc tgg aca 291His Arg Phe Glu Ile Ile Glu Gly Arg Asp Arg Thr Met Ala Trp Thr75 80 85gta gta aac agt att tgc aac act act gga gct gag aaa ccg aag ttt 339Val Val Asn Ser Ile Cys Asn Thr Thr Gly Ala Glu Lys Pro Lys Phe90 95 100 105ctg cca gat ttg tat gat tac aag gag aat aga ttc atc gag att gga 387Leu Pro Asp Leu Tyr Asp Tyr Lys Glu Asn Arg Phe Ile Glu Ile Gly110 115 120gtg aca agg aga gaa gtc cac ata tac tac ctt gaa aag gcc aat aaa 435Val Thr Arg Arg Glu Val His Ile Tyr Tyr Leu Glu Lys Ala Asn Lys125 130 135att aaa tct gag aat aca cac atc cac att ttc tca ttc act ggg gag 483Ile Lys Ser Glu Asn Thr His Ile His Ile Phe Ser Phe Thr Gly Glu140 145 150gaa atg gcc aca aag gcc gac tac act ctt gat gat gaa agc agg gct 531Glu Met Ala Thr Lys Ala Asp Tyr Thr Leu Asp Asp Glu Ser Arg Ala155 160 165agg atc aaa acc agg cta ttt acc ata aga caa gaa atg gcc aac aga 579Arg Ile Lys Thr Arg Leu Phe Thr Ile Arg Gln Glu Met Ala Asn Arg170 175 180 185ggc ctc tgg gat tcc ttt cgt cag tcc gaa aga ggc gaa gaa aca att 627Gly Leu Trp Asp Ser Phe Arg Gln Ser Glu Arg Gly Glu Glu Thr Ile190 195 200gaa gaa aga ttt gaa atc aca gga act atg cgc agg ctt gcc gac caa 675Glu Glu Arg Phe Glu Ile Thr Gly Thr Met Arg Arg Leu Ala Asp Gln205 210 215agt ctc ccg ccg aac ttc tcc tgc ctt gag aat ttt aga gcc tat gtg 723Ser Leu Pro Pro Asn Phe Ser Cys Leu Glu Asn Phe Arg Ala Tyr Val220 225 230gat gga ttc gaa ccg aac ggc tgc att gag ggc aag ctt tct caa atg 771Asp Gly Phe Glu Pro Asn Gly Cys Ile Glu Gly Lys Leu Ser Gln Met235 240 245tcc aaa gaa gtg aat gca aga att gaa cct ttt ctg aag aca aca cca 819Ser Lys Glu Val Asn Ala Arg Ile Glu Pro Phe Leu Lys Thr Thr Pro250 255 260 265aga cca atc aaa ctt ccg gat ggg cct cct tgt ttt cag cgg tcc aaa 867Arg Pro Ile Lys Leu Pro Asp Gly Pro Pro Cys Phe Gln Arg Ser Lys270 275 280ttc ctt ctg atg gat gct tta aaa tta agc att gaa gac cca agt cac 915Phe Leu Leu Met Asp Ala Leu Lys Leu Ser Ile Glu Asp Pro Ser His285 290 295gaa gga gag gga ata cca cta tat gat gcg atc aag tgc atg aga aca 963Glu Gly Glu Gly Ile Pro Leu Tyr Asp Ala Ile Lys Cys Met Arg Thr300 305 310ttc ttt gga tgg aaa gaa ccc tat atc gtc aaa cca cac gaa aag gga 1011Phe Phe Gly Trp Lys Glu Pro Tyr Ile Val Lys Pro His Glu Lys Gly315 320 325ata aat cca aat tat ctg ctg tca tgg aag caa gta ctg gca gaa cta 1059Ile Asn Pro Asn Tyr Leu Leu Set Trp Lys Gln Val Leu Ala Glu Leu330 335 340 345cag gac att gaa aat gag gag aag att cca aga act aaa aac atg aag 1107Gln Asp Ile Glu Asn Glu Glu Lys Ile Pro Arg Thr Lys Asn Met Lys350 355 360aaa acg agt cag cta aag tgg gca ctt ggt gag aac atg gca cct gag 1155Lys Thr Ser Gln Leu Lys Trp Ala Leu Gly Glu Asn Met Ala Pro Glu365 370 375aaa gta gac ttt gac aac tgt aga gac ata agc gat ttg aag caa tat 1203Lys Val Asp Phe Asp Asn Cys Arg Asp Ile Ser Asp Leu Lys Gln Tyr380 385 390gat agt gac gaa cct gaa tta agg tca ctt tca agc tgg atc cag aat 1251Asp Ser Asp Glu Pro Glu Leu Arg Ser Leu Ser Ser Trp Ile Gln Asn395 400 405gag ttc aac aag gca tgc gag ctg acc gat tca atc tgg ata gag ctc 1299Glu Phe Asn Lys Ala Cys Glu Leu Thr Asp Ser Ile Trp Ile Glu Leu410 415 420 425gat gag att gga gaa gac gtg gct cca att gaa tac att gca agc atg 1347Asp Glu Ile Gly Glu Asp Val Ala Pro Ile Glu Tyr Ile Ala Ser Met430 435 440agg agg aat tac ttc aca gca gag gtg tcc cat tgc aga gcc aca gaa 1395Arg Arg Asn Tyr Phe Thr Ala Glu Val Ser His Cys Arg Ala Thr Glu445 450 455tac ata atg aag ggg gta tac att aat act gcc ttg ctt aat gca tcc 1443Tyr Ile Met Lys Gly Val Tyr Ile Asn Thr Ala Leu Leu Asn Ala Ser460 465 470tgt gca gca atg gac gat ttc caa cta att ccc atg ata agc aag tgc 1491Cys Ala Ala Met Asp Asp Phe Gln Leu Ile Pro Met Ile Ser Lys Cys475 480 485aga act aaa gag gga agg cga aaa acc aat tta tat gga ttc atc ata 1539Arg Thr Lys Glu Gly Arg Arg Lys Thr Asn Leu Tyr Gly Phe Ile Ile490 495 500 505aaa gga aga tct cac tta agg aat gac acc gac gtg gta aac ttt gtg 1587Lys Gly Arg Ser His Leu Arg Asn Asp Thr Asp Val Val Asn Phe Val510 515 520agc atg gag ttt tct ctc act gac ccg aga ctt gag cca cat aaa tgg 1635Ser Met Glu Phe Ser Leu Thr Asp Pro Arg Leu Glu Pro His Lys Trp525 530 535gag aaa tac tgt gtc ctt gag ata gga gat atg cta cta aga agt gcc 1683Glu Lys Tyr Cys Val Leu Glu Ile Gly Asp Met Leu Leu Arg Ser Ala540 545 550ata ggc cag atg tca agg cct atg ttc ttg tat gtg agg aca aat gga 1731Ile Gly Gln Met Ser Arg Pro Met Phe Leu Tyr Val Arg Thr Asn Gly555 560 565aca tca aag att aaa atg aaa tgg gga atg gag atg aga cgt tgc ctc 1779Thr Ser Lys Ile Lys Met Lys Trp Gly Met Glu Met Arg Arg Cys Leu570 575 580 585ctt cag tca ctc caa caa atc gag agc atg att gaa gcc gag tct tct 1827Leu Gln Ser Leu Gln Gln Ile Glu Ser Met Ile Glu Ala Glu Ser Ser590 595 600gtc aaa gag aaa gac atg acc aaa gag ttt ttt gag aat aaa tca gaa 1875Val Lys Glu Lys Asp Met Thr Lys Glu Phe Phe Glu Asn Lys Ser Glu605 610 615aca tgg ccc att ggg gag tcc ccc aag gga gtg gaa gaa ggt tcc att 1923Thr Trp Pro Ile Gly Glu Ser Pro Lys Gly Val Glu Glu Gly Ser Ile620 625 630gga aag gtc tgt agg act tta ttg gcc aag tcg gta ttc aat agc ctg 1971Gly Lys Val Cys Arg Thr Leu Leu Ala Lys Ser Val Phe Asn Ser Leu635 640 645tat gca tcc cca caa ttg gaa gga ttt tea gcg gag tca aga aaa ctg 2019Tyr Ala Ser Pro Gln Leu Glu Gly Phe Ser Ala Glu Ser Arg Lys Leu650 655 660 665ctt ctt gtc gtt cag gct ctt agg gac aac ctt gaa cct gga acc ttt 2067Leu Leu Val Val Gln Ala Leu Arg Asp Asn Leu Glu Pro Gly Thr Phe670 675 680gat ctt ggg ggg cta tat gaa gca att gag gag tgc ctg att aat gat 2115Asp Leu Gly Gly Leu Tyr Glu Ala Ile Glu Glu Cys Leu Ile Asn Asp685 690 695ccc tgg gtt ttg ctt aat gcg tct tgg ttc aac tcc ttc cta aca cat 2163Pro Trp Val Leu Leu Asn Ala Ser Trp Phe Asn Ser Phe Leu Thr His700 705 710gca tta aga tagttgtggc aatgctacta tttgctatcc atactgtcca 2212Ala Leu Arg715aaaaagtacc ttgtttctac t 2233<210>6<211>716<212>PRT<213>流感A病毒<400>6Met Glu Asp Phe Val Arg Gln Cys Phe Asn Pro Met Ile Val Glu Leu1 5 10 15Ala Glu Lys Ala Met Lys Glu Tyr Gly Glu Asp Leu Lys Ile Glu Thr20 25 30Asn Lys Phe Ala Ala Ile Cys Thr His Leu Glu Ile Cys Phe Met Tyr35 40 45Ser Asp Phe His Phe Ile Asn Glu Gln Gly Glu Ser Ile Val Val Glu50 55 60Leu Asp Asp Pro Asn Ala Leu Leu Lys His Arg Phe Glu Ile Ile Glu65 70 75 80Gly Arg Asp Arg Thr Met Ala Trp Thr Val Val Asn Ser Ile Cys Asn85 90 95Thr Thr Gly Ala Glu Lys Pro Lys Phe Leu Pro Asp Leu Tyr Asp Tyr100 105 110Lys Glu Asn Arg Phe Ile Glu Ile Gly Val Thr Arg Arg Glu Val His115 120 125Ile Tyr Tyr Leu Glu Lys Ala Asn Lys Ile Lys Ser Glu Asn Thr His130 135 140Ile His Ile Phe Ser Phe Thr Gly Glu Glu Met Ala Thr Lys Ala Asp145 150 155 160Tyr Thr Leu Asp Asp Glu Ser Arg Ala Arg Ile Lys Thr Arg Leu Phe165 170 175Thr Ile Arg Gln Glu Met Ala Asn Arg Gly Leu Trp Asp Ser Phe Arg180 185 190Gln Ser Glu Arg Gly Glu Glu Thr Ile Glu Glu Arg Phe Glu Ile Thr195 200 205Gly Thr Met Arg Arg Leu Ala Asp Gln Ser Leu Pro Pro Asn Phe Ser210 215 220Cys Leu Glu Asn Phe Arg Ala Tyr Val Asp Gly Phe Glu Pro Asn Gly225 230 235 240Cys Ile Glu Gly Lys Leu Ser Gln Met Ser Lys Glu Val Asn Ala Arg245 250 255Ile Glu Pro Phe Leu Lys Thr Thr Pro Arg Pro Ile Lys Leu Pro Asp260 265 270Gly Pro Pro Cys Phe Gln Arg Ser Lys Phe Leu Leu Met Asp Ala Leu275280 285Lys Leu Ser Ile Glu Asp Pro Ser His Glu Gly Glu Gly Ile Pro Leu290 295 300Tyr Asp Ala Ile Lys Cys Met Arg Thr Phe Phe Gly Trp Lys Glu Pro305 310 315 320Tyr Ile Val Lys Pro His Glu Lys Gly Ile Asn Pro Asn Tyr Leu Leu325 330 335Ser Trp Lys Gln Val Leu Ala Glu Leu Gln Asp Ile Glu Asn Glu Glu340 345 350Lys Ile Pro Arg Thr Lys Asn Met Lys Lys Thr Ser Gln Leu Lys Trp355 360 365Ala Leu Gly Glu Asn Met Ala Pro Glu Lys Val Asp Phe Asp Asn Cys370 375 380Arg Asp Ile Ser Asp Leu Lys Gln Tyr Asp Ser Asp Glu Pro Glu Leu385 390 395 400Arg Ser Leu Ser Ser Trp Ile Gln Asn Glu Phe Asn Lys Ala Cys Glu405 410 415Leu Thr Asp Ser Ile Trp Ile Glu Leu Asp Glu Ile Gly Glu Asp Val420 425 430Ala Pro Ile Glu Tyr Ile Ala Ser Met Arg Arg Asn Tyr Phe Thr Ala435 440 445Glu Val Ser His Cys Arg Ala Thr Glu Tyr Ile Met Lys Gly Val Tyr450 455 460Ile Asn Thr Ala Leu Leu Asn Ala Ser Cys Ala Ala Met Asp Asp Phe465 470 475 480Gln Leu Ile Pro Met Ile Ser Lys Cys Arg Thr Lys Glu Gly Arg Arg485 490 495Lys Thr Asn Leu Tyr Gly Phe Ile Ile Lys Gly Arg Ser His Leu Arg500 505 510Asn Asp Thr Asp Val Val Asn Phe Val Ser Met Glu Phe Ser Leu Thr515 520 525Asp Pro Arg Leu Glu Pro His Lys Trp Glu Lys Tyr Cys Val Leu Glu530 535 540Ile Gly Asp Met Leu Leu Arg Ser Ala Ile Gly Gln Met Ser Arg Pro545 550 555 560Met Phe Leu Tyr Val Arg Thr Asn Gly Thr Ser Lys Ile Lys Met Lys565 570 575Trp Gly Met Glu Met Arg Arg Cys Leu Leu Gln Ser Leu Gln Gln Ile580 585 590Glu Ser Met Ile Glu Ala Glu Ser Ser Val Lys Glu Lys Asp Met Thr595 600 605Lys Glu Phe Phe Glu Asn Lys Ser Glu Thr Trp Pro Ile Gly Glu Ser610 615 620Pro Lys Gly Val Glu Glu Gly Ser Ile Gly Lys Val Cys Arg Thr Leu625 630 635 640Leu Ala Lys Ser Val Phe Asn Ser Leu Tyr Ala Ser Pro Gln Leu Glu645 650 655Gly Phe Ser Ala Glu Ser Arg Lys Leu Leu Leu Val Val Gln Ala Leu660 665 670Arg Asp Asn Leu Glu Pro Gly Thr Phe Asp Leu Gly Gly Leu Tyr Glu
675 680 685Ala Ile Glu Glu Cys Leu Ile Asn Asp Pro Trp Val Leu Leu Asn Ala690 695 700Ser Trp Phe Asn Ser Phe Leu Thr His Ala Leu Arg705 710 715<210>7<211>1765<212>DNA<213>流感A病毒<220><221>CDS<222>(30)..(1727)<400>7agcaaaagca ggggataatt ctattaacc atg aag act atc att gct ttg agc 53Met Lys Thr Ile Ile Ala Leu Ser1 5tac att ttc tgt ctg gtt ctc ggc caa aac ttt cca gga aat gac aac 101Tyr Ile Phe Cys Leu Val Leu Gly Gln Asn Phe Pro Gly Asn Asp Asn10 15 20agc aca gca acg ctg tgc ctg gga cat cat gcg gtg cca aac gga aca 149Ser Thr Ala Thr Leu Cys Leu Gly His His Ala Val Pro Asn Gly Thr25 30 35 40cta gtg aaa aca atc aca aat gat cag att gaa gtg act aat gct act 197Leu Val Lys Thr Ile Thr Asn Asp Gln Ile Glu Val Thr Asn Ala Thr45 50 55gag ctg gtt cag agt tcc tca acg ggg aaa ata tgc aac aat cct cat 245Glu Leu Val Gln Ser Ser Ser Thr Gly Lys Ile Cys Asn Asn Pro His60 65 70cga atc ctt gat gga ata gac tgc aca ctg ata gat gct cta ttg ggg 293Arg Ile Leu Asp Gly Ile Asp Cys Thr Leu Ile Asp Ala Leu Leu Gly75 80 85gac cct cat tgt gat ggc ttt caa aat gag aca tgg gac ctt ttc gtt 341Asp Pro His Cys Asp Gly Phe Gln Asn Glu Thr Trp Asp Leu Phe Val90 95 100gaa cgc agc aaa gct ttc agc aac tgt tac cct tat gat gtg cca gat 389Glu Arg Ser Lys Ala Phe Ser Asn Cys Tyr Pro Tyr Asp Val Pro Asp105 110 115 120tat gcc tcc ctt agg tca cta gtt gcc tcg tca ggc act ctg gag ttt 437Tyr Ala Ser Leu Arg Ser Leu Val Ala Ser Ser Gly Thr Leu Glu Phe125 130 135atc agt gaa ggc ttc act tgg act ggg gtc act cag aat ggg gga agc 485Ile Ser Glu Gly Phe Thr Trp Thr Gly Val Thr Gln Asn Gly Gly Ser140 145 150aat gct tgc aaa agg gga cct gat agc ggt ttt ttc agt aga ctg aac 533Asn Ala Cys Lys Arg Gly Pro Asp Ser Gly Phe Phe Ser Arg Leu Asn155 160 165tgg ttg tac aaa tca gga agc aca tat cca gtg ctg aac gtg act atg 581Trp Leu Tyr Lys Ser Gly Ser Thr Tyr Pro Val Leu Asn Val Thr Met170 175 180cca aac aat gac aat ttt gac aaa cta tac att tgg ggg gtt cac cac 629Pro Asn Asn Asp Asn Phe Asp Lys Leu Tyr Ile Trp Gly Val His His185 190 195 200ccg agc acg gac caa gaa caa acc agc cta tat gtt caa gca tca ggg 677Pro Ser Thr Asp Gln Glu Gln Thr Ser Leu Tyr Val Gln Ala Ser Gly205 210 215aga gtc aca gtc tct acc aag aga agc cag caa act ata atc ccg aat 725Arg Val Thr Val Ser Thr Lys Arg Ser Gln Gln Thr Ile Ile Pro Asn220 225 230atc ggg tct aga ccc tgg gta agg ggt ctg tct agt aga ata agc atc 773Ile Gly Ser Arg Pro Trp Val Arg Gly Leu Ser Ser Arg Ile Ser Ile235 240 245tat tgg aca ata gtt aaa ccg gga gac ata ctg gta att aat agt aat 821Tyr Trp Thr Ile Val Lys Pro Gly Asp Ile Leu Val Ile Asn Ser Asn250 255 260ggg aac cta att gct cct cgg ggt tat ttt aaa atg cgc act ggg aaa 869Gly Asn Leu Ile Ala Pro Arg Gly Tyr Phe Lys Met Arg Thr Gly Lys265 270 275 280agc tca ata atg agg tca gat gca cct att ggc acc tgc att tct gaa 917Ser Ser Ile Met Arg Ser Asp Ala Pro Ile Gly Thr Cys Ile Ser Glu285 290 295tgc atc act cca aat gga agc att ccc aat gac aag ccc ttt caa aac 965Cys Ile Thr Pro Asn Gly Ser Ile Pro Asn Asp Lys Pro Phe Gln Asn300 305 310gta aac aag atc aca tat ggg gca tgt ccc aag tat gtt aag caa aac 1013Val Asn Lys Ile Thr Tyr Gly Ala Cys Pro Lys Tyr Val Lys Gln Asn315 320 325acc ctg aag ttg gca aca ggg atg cgg aat gta cca gag aaa caa act 1061Thr Leu Lys Leu Ala Thr Gly Met Arg Asn Val Pro Glu Lys Gln Thr330 335 340aga ggc cta ttc agc gca ata gca ggt ttc ata gaa aat ggg tgg gag 1109Arg Gly Leu Phe Ser Ala Ile Ala Gly Phe Ile Glu Asn Gly Trp Glu345 350 355 360gga atg ata gac ggt tgg tac ggt ttc agg cat caa aat tct gag ggc 1157Gly Met Ile Asp Gly Trp Tyr Gly Phe Arg His Gln Asn Ser Glu Gly365 370 375aca gga caa gca gca gat ctt aaa agc act caa gca gcc atc gac caa 1205Thr Gly Gln Ala Ala Asp Leu Lys Ser Thr Gln Ala Ala Ile Asp Gln
380 385 390atc aat ggg aaa ctg aat agg gta atc gag aag acg aac gag aaa ttc 1253Ile Asn Gly Lys Leu Asn Arg Val Ile Glu Lys Thr Asn Glu Lys Phe395 400 405cat caa atc gaa aag gaa ttc tca gaa gta gaa ggg aga att cag gac 1301His Gln Ile Glu Lys Glu Phe Ser Glu Val Glu Gly Arg Ile Gln Asp410 415 420ctc gag aaa tac gtt gaa gac act aaa ata gat ctc tgg tct tac aat 1349Leu Glu Lys Tyr Val Glu Asp Thr Lys Ile Asp Leu Trp Ser Tyr Asn425 430 435 440gcg gag ctt ctt gtc gct ctg gag aac caa cat aca att gat ctg act 1397Ala G1u Leu Leu Val Ala Leu Glu Asn Gln His Thr Ile Asp Leu Thr445 450 455gac tcg gaa atg aac aaa ctg ttt gaa aaa aca agg agg caa ctg agg 1445Asp Ser Glu Met Asn Lys Leu Phe Glu Lys Thr Arg Arg Gln Leu Arg460 465 470gaa aat gct gag gac atg ggc aat ggt tgc ttc aaa ata ttc cac aaa 1493Glu Asn Ala Glu Asp Met Gly Asn Gly Cys Phe Lys Ile Phe His Lys475 480 485tgt gac aat gct tgc ata ggg tca atc aga aat ggg act tat gac cat 1541Cys Asp Asn Ala Cys Ile Gly Ser Ile Arg Asn Gly Thr Tyr Asp His490 495 500gat gta tac aga gac gaa gca tta aac aac cgg ttt cag atc aaa ggt 1589Asp Val Tyr Arg Asp Glu Ala Leu Asn Asn Arg Phe Gln Ile Lys Gly505 510 515 520gtt gaa ctg aag tca gga tac aaa aac tgg atc ctg tgg att tcc ttt 1637Val Glu Leu Lys Ser Gly Tyr Lys Asn Trp Ile Leu Trp Ile Ser Phe525 530 535gcc ata tca tgc ttt ttg ctt tgt gtt gtt ttg ctg ggg ttc atc atg 1685Ala Ile Ser Cys Phe Leu Leu Cys Val Val Leu Leu Gly Phe Ile Met540 545 550tgg gcc tgc cag aaa ggc aac att agg tgc aac att tgc att 1727Trp Ala Cys Gln Lys Gly Asn Ile Arg Cys Asn Ile Cys Ile555 560 565tgagtgtatt agtaattaaa aacacccttg tttctact 1765<210>8<211>566<212>PRT<213>流感A病毒<400>8Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Phe Cys Leu Val Leu Gly1 5 10 15Gln Asn Phe Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly
20 25 30His His Ala Val Pro Asn Gly Thr Leu Val Lys Thr Ile Thr Asn Asp35 40 45Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr50 55 60Gly Lys Ile Cys Asn Asn Pro His Arg Ile Leu Asp Gly Ile Asp Cys65 70 75 80Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro His Cys Asp Gly Phe Gln85 90 95Asn Glu Thr Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Phe Ser Asn100 105 110Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val115 120 125Ala Ser Ser Gly Thr Leu Glu Phe Ile Ser Glu Gly Phe Thr Trp Thr130 135 140Gly Val Thr Gln Asn Gly Gly Ser Asn Ala Cys Lys Arg Gly Pro Asp145 150 155 160Ser Gly Phe Phe Ser Arg Leu Asn Trp Leu Tyr Lys Ser Gly Ser Thr165 170 175Tyr Pro Val Leu Asn Val Thr Met Pro Asn Asn Asp Asn Phe Asp Lys180 185 190Leu Tyr Ile Trp Gly Val His His Pro Ser Thr Asp Gln Glu Gln Thr195 200 205Ser Leu Tyr Val Gln Ala Ser Gly Arg Val Thr Val Ser Thr Lys Arg210 215 220Ser Gln Gln Thr Ile Ile Pro Asn Ile Gly Ser Arg Pro Trp Val Arg225 230 235 240Gly Leu Ser Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly245 250 255Asp Ile Leu Val Ile Asn Ser Asn Gly Asn Leu Ile Ala Pro Arg Gly260 265 270Tyr Phe Lys Met Arg Thr Gly Lys Ser Ser Ile Met Arg Ser Asp Ala275 280 285Pro Ile Gly Thr Cys Ile Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile290 295 300Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Lys Ile Thr Tyr Gly Ala305 310 315 320Cys Pro Lys Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met325 330 335Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Leu Phe Ser Ala Ile Ala340 345 350Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Ile Asp Gly Trp Tyr Gly355 360 365Phe Arg His Gln Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys370 375 380Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Lys Leu Asn Arg Val385 390 395 400Ile Glu Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser405 410 415Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr420 425 430Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu435 440 445Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe450 455 460Glu Lys Thr Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn465 470 475 480Gly Cys Phe Lys Ile Phe His Lys Cys Asp Asn Ala Cys Ile Gly Ser485 490 495Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu500 505 510Asn Asn Arg Phe Gln Ile Lys Gly Val Glu Leu Lys Ser Gly Tyr Lys515 520 525Asn Trp Ile Leu Trp Ile Ser Phe Ala Ile Ser Cys Phe Leu Leu Cys530 535 540Val Val Leu Leu Gly Phe Ile Met Trp Ala Cys Gln Lys Gly Asn Ile545 550 555 560Arg Cys Asn Ile Cys Ile565<210>9<211>1565<212>DNA<213>流感A病毒<220><221>CDS<222>(46)..(1539)<400>9agcaaaagca gggttaataa tcactcactg agtgacatca aaatc atg gcg tcc caa 57Met Ala Ser Glnggc acc aaa cgg tct tat gaa cag atg gaa act gat ggg gaa cgc cag 105Gly Thr Lys Arg Ser Tyr Glu Gln Met Glu Thr Asp Gly Glu Arg Gln5 10 15 20aat gca act gag atc agg gca tcc gtc ggg aag atg att gat gga att 153Asn Ala Thr Glu Ile Arg Ala Ser Val Gly Lys Met Ile Asp Gly Ile25 30 35gga cga ttc tac atc caa atg tgc act gaa ctt aaa ctc agt gat tat 201Gly Arg Phe Tyr Ile Gln Met Cys Thr Glu Leu Lys Leu Ser Asp Tyr40 45 50gag ggg cga ttg atc cag aac agc tta aca ata gag aga atg gtg ctc 249Glu Gly Arg Leu Ile Gln Asn Ser Leu Thr Ile Glu Arg Met Val Leu55 60 65tct gct ttt gat gag aga agg aat aga tat ctg gaa gaa cat ccc agc 297Ser Ala Phe Asp Glu Arg Arg Asn Arg Tyr Leu Glu Glu His Pro Ser70 75 80gcg ggg aaa gat cct aag aaa act gga ggg ccc ata tac aag aga gta 345Ala Gly Lys Asp Pro Lys Lys Thr Gly Gly Pro Ile Tyr Lys Arg Val85 90 95 100gat aga aag tgg atg agg gaa ctc gtc ctt tat gac aaa gaa gaa ata 393Asp Arg Lys Trp Met Arg Glu Leu Val Leu Tyr Asp Lys Glu Glu Ile105 110 115agg cga atc tgg cgc caa gcc aat aat ggt gat gat gcg aca gct ggt 441Arg Arg Ile Trp Arg Gln Ala Asn Asn Gly Asp Asp Ala Thr Ala Gly120 125 130cta acc cac atg atg atc tgg cat tcc aat ttg aat gat aca aca tac 489Leu Thr His Met Met Ile Trp His Ser Asn Leu Asn Asp Thr Thr Tyr135 140 145cag agg aca aga gct ctt gtt cgc acc gga atg gat ccc agg atg tgc 537Gln Arg Thr Arg Ala Leu Val Arg Thr Gly Met Asp Pro Arg Met Cys150 155 160tct ctg atg cag ggt tcg act ctc cct agg agg tct gga gct gca ggc 585Ser Leu Met Gln Gly Ser Thr Leu Pro Arg Arg Ser Gly Ala Ala Gly165 170 175 180gct gca gtc aaa gga gtc ggg aca atg gtg atg gag ctg atc aga atg 633Ala Ala Val Lys Gly Val Gly Thr Met Val Met Glu Leu Ile Arg Met185 190 195atc aaa cgt ggg atc aac gat cgg aac ttc tgg aga ggt gag aat gga 681Ile Lys Arg Gly Ile Asn Asp Arg Asn Phe Trp Arg Gly Glu Asn Gly200 205 210cga aaa aca agg ggt gct tat gag aga atg tgc aac att ctc aaa gga 729Arg Lys Thr Arg Gly Ala Tyr Glu Arg Met Cys Asn Ile Leu Lys Gly215 220 225aaa ttt caa aca gct gca caa aga gca atg atg gat caa gtg aga gaa 777Lys Phe Gln Thr Ala Ala Gln Arg Ala Met Met Asp Gln Val Arg Glu230 235 240agc cgg aac cca gga aat gct gag atc gaa gat ctc ata ttt ttg gca 825Ser Arg Asn Pro Gly Asn Ala Glu Ile Glu Asp Leu Ile Phe Leu Ala245 250 255 260cgg tct gca cta ata ttg aga ggg tca gtt gct cac aaa tct tgt ctg 873Arg Ser Ala Leu Ile Leu Arg Gly Ser Val Ala His Lys Ser Cys Leu265 270 275cct gcc tgt gtg tat gga cct gcc gta gcc agt ggg tac gac ttc gaa 921Pro Ala Cys Val Tyr Gly Pro Ala Val Ala Ser Gly Tyr Asp Phe Glu280 285 290aaa gag gga tac tct tta gtg gga ata gac cct ttc aaa cta ctt caa 969Lys Glu Gly Tyr Ser Leu Val Gly Ile Asp Pro Phe Lys Leu Leu Gln295 300 305aac agc caa gta tac agc cta atc aga ccg aac gag aat cca gca cac 1017Asn Ser Gln Val Tyr Ser Leu Ile Arg Pro Asn Glu Asn Pro Ala His310 315 320aag agt cag ctg gtg tgg atg gca tgc aat tct gct gca ttt gaa gat 1065Lys Ser Gln Leu Val Trp Met Ala Cys Asn Ser Ala Ala Phe Glu Asp325 330 335 340cta aga tta tta agc ttc atc aga ggg acc aaa gta tct cca agg ggg 1113Leu Arg Leu Leu Ser Phe Ile Arg Gly Thr Lys Val Ser Pro Arg Gly345 350 355aaa ctt tca act aga gga gta caa att gct tca aat gaa aac atg gat 1161Lys Leu Ser Thr Arg Gly Val Gln Ile Ala Ser Asn Glu Asn Met Asp360 365 370act atg gaa tca agt act ctt gaa ctg aga agc agg tac tgg gcc ata 1209Thr Met Glu Ser Ser Thr Leu Glu Leu Arg Ser Arg Tyr Trp Ala Ile375 380 385agg acc aga agt gga gga aac act aat caa cag agg gcc tcc gca ggc 1257Arg Thr Arg Ser Gly Gly Asn Thr Asn Gln Gln Arg Ala Ser Ala Gly390 395 400caa atc agt gtg caa cct gca ttt tct gtg caa aga aac ctc cca ttt 1305Gln Ile Ser Val Gln Pro Ala Phe Ser Val Gln Arg Asn Leu Pro Phe405 410 415 420gac aaa tca acc atc atg gca gca ttc act ggg aat acg gag gga aga 1353Asp Lys Ser Thr Ile Met Ala Ala Phe Thr Gly Asn Thr Glu Gly Arg425 430 435acc tca gac atg agg gca gaa atc ata agg atg atg gaa ggt gca aaa 1401Thr Ser Asp Met Arg Ala Glu Ile Ile Arg Met Met Glu Gly Ala Lys440 445 450cca gaa gaa gtg tcc ttc cgt ggg cgg gga gtc ttc gag ctc tcg gac 1449Pro Glu Glu Val Ser Phe Arg Gly Arg Gly Val Phe Glu Leu Ser Asp455 460 465gaa aag gca acg aac ccg atc gtg ccc tct ttt gac atg agt aat gaa 1497Glu Lys Ala Thr Asn Pro Ile Val Pro Ser Phe Asp Met Ser Asn Glu470 475 480gga tct tat ttc ttc gga gac aat gca gag gag tac gac aat 1539Gly Ser Tyr Phe Phe Gly Asp Asn Ala Glu Glu Tyr Asp Asn485 490 495taaggaaaaa tacccttgtt tctact 1565<210>10<211>498<212>PRT<213>流感A病毒<400>10Met Ala Ser Gln Gly Thr Lys Arg Ser Tyr Glu Gln Met Glu Thr Asp1 5 10 15Gly Glu Arg Gln Asn Ala Thr Glu Ile Arg Ala Ser Val Gly Lys Met20 25 30Ile Asp Gly Ile Gly Arg Phe Tyr Ile Gln Met Cys Thr Glu Leu Lys35 40 45Leu Ser Asp Tyr Glu Gly Arg Leu Ile Gln Asn Ser Leu Thr Ile Glu50 55 60Arg Met Val Leu Ser Ala Phe Asp Glu Arg Arg Asn Arg Tyr Leu Glu65 70 75 80Glu His Pro Ser Ala Gly Lys Asp Pro Lys Lys Thr Gly Gly Pro Ile85 90 95Tyr Lys Arg Val Asp Arg Lys Trp Met Arg Glu Leu Val Leu Tyr Asp100 105 110Lys Glu Glu Ile Arg Arg Ile Trp Arg Gln Ala Asn Asn Gly Asp Asp115 120 125Ala Thr Ala Gly Leu Thr His Met Met Ile Trp His Ser Asn Leu Asn130 135 140Asp Thr Thr Tyr Gln Arg Thr Arg Ala Leu Val Arg Thr Gly Met Asp145 150 155 160Pro Arg Met Cys Ser Leu Met Gln Gly Ser Thr Leu Pro Arg Arg Ser165 170 175Gly Ala Ala Gly Ala Ala Val Lys Gly Val Gly Thr Met Val Met Glu180 185 190Leu Ile Arg Met Ile Lys Arg Gly Ile Asn Asp Arg Asn Phe Trp Arg195 200 205Gly Glu Asn Gly Arg Lys Thr Arg Gly Ala Tyr Glu Arg Met Cys Asn210 215 220Ile Leu Lys Gly Lys Phe Gln Thr Ala Ala Gln Arg Ala Met Met Asp225 230 235 240Gln Val Arg Glu Ser Arg Asn Pro Gly Asn Ala Glu Ile Glu Asp Leu245 250 255Ile Phe Leu Ala Arg Ser Ala Leu Ile Leu Arg Gly Ser Val Ala His260 265 270Lys Ser Cys Leu Pro Ala Cys Val Tyr Gly Pro Ala Val Ala Ser Gly275 280 285Tyr Asp Phe Glu Lys Glu Gly Tyr Ser Leu Val Gly Ile Asp Pro Phe290 295 300Lys Leu Leu Gln Asn Ser Gln Val Tyr Ser Leu Ile Arg Pro Asn Glu305 310 315 320Asn Pro Ala His Lys Ser Gln Leu Val Trp Met Ala Cys Asn Ser Ala325 330 335Ala Phe Glu Asp Leu Arg Leu Leu Ser Phe Ile Arg Gly Thr Lys Val340 345 350Ser Pro Arg Gly Lys Leu Ser Thr Arg Gly Val Gln Ile Ala Ser Asn355 360 365Glu Asn Met Asp Thr Met Glu Ser Ser Thr Leu Glu Leu Arg Ser Arg370 375 380Tyr Trp Ala Ile Arg Thr Arg Ser Gly Gly Asn Thr Asn Gln Gln Arg385 390 395 400Ala Ser Ala Gly Gln Ile Ser Val Gln Pro Ala Phe Ser Val Gln Arg405 410 415Asn Leu Pro Phe Asp Lys Ser Thr Ile Met Ala Ala Phe Thr Gly Asn420 425 430Thr Glu Gly Arg Thr Ser Asp Met Arg Ala Glu Ile Ile Arg Met Met435 440 445Glu Gly Ala Lys Pro Glu Glu Val Ser Phe Arg Gly Arg Gly Val Phe450 455 460Glu Leu Ser Asp Glu Lys Ala Thr Asn Pro Ile Val Pro Ser Phe Asp465 470 475 480Met Ser Asn Glu Gly Ser Tyr Phe Phe Gly Asp Asn Ala Glu Glu Tyr485 490 495Asp Asn<210>11<211>1466<212>DNA<213>流感A病毒<220><221>CDS<222>(20)..(1426)<400>11agcaaaagca ggagtgaag atg aat cca aat caa aag ata ata aca att ggc 52Met Asn Pro Asn Gln Lys Ile Ile Thr Ile Gly1 5 10tct gtc tct ctc acc att gca aca ata tgc ttc ctc atg cag att gcc 100Ser Val Ser Leu Thr Ile Ala Thr Ile Cys Phe Leu Met Gln Ile Ala15 20 25atc ctg gta act act gta aca ttg cat ttc aag caa tat gag tgc gac 148Ile Leu Val Thr Thr Val Thr Leu His Phe Lys Gln Tyr Glu Cys Asp30 35 40tcc ccc gcg aac aac caa gta atg ccg tgt gaa cca ata ata ata gaa 196Ser Pro Ala Asn Asn Gln Val Met Pro Cys Glu Pro Ile Ile Ile Glu45 50 55agg aac ata aca gag ata gtg tat ttg act aac acc acc ata gag aaa 244Arg Asn Ile Thr Glu Ile Val Tyr Leu Thr Asn Thr Thr Ile Glu Lys60 65 70 75gag ata tgc ccc aaa tta gtg gaa tac agg aat tgg tca aag ccg caa 292Glu Ile Cys Pro Lys Leu Val Glu Tyr Arg Asn Trp Ser Lys Pro Gln80 85 90tgt aaa att aca gga ttt gca cct ttt tct aag gac aat tca att cgg 340Cys Lys Ile Thr Gly Phe Ala Pro Phe Ser Lys Asp Asn Ser Ile Arg95 100 105ctt tct gct ggt ggg gac att tgg gtg acg aga gaa cct tat gtg tca 388Leu Ser Ala Gly Gly Asp Ile Trp Val Thr Arg Glu Pro Tyr Val Ser110 115 120tgc gat cct ggc aag tgt tat caa ttt gca ctc ggg cag ggg acc aca 436Cys Asp Pro Gly Lys Cys Tyr Gln Phe Ala Leu Gly Gln Gly Thr Thr125 130 135cta gac aac aaa cat tca aat gac aca ata cat gat aga acc cct cat 484Leu Asp Asn Lys His Ser Asn Asp Thr Ile His Asp Arg Thr Pro His140 145 150 155cga acc cta ttg atg aat gag ttg ggt gtt cca ttt cat ttg gga acc 532Arg Thr Leu Leu Met Asn Glu Leu Gly Val Pro Phe His Leu Gly Thr160 165 170agg caa gtg tgt ata gca tgg tcc agc tca agt tgt cac gat gga aaa 580Arg Gln Val Cys Ile Ala Trp Ser Ser Ser Ser Cys His Asp Gly Lys175 180 185gca tgg ctg cat gtt tgt gtc act ggg tat gat aaa aat gca act gct 628Ala Trp Leu His Val Cys Val Thr Gly Tyr Asp Lys Asn Ala Thr Ala190 195 200agc ttc att tac gat ggg agg ctt gta gac agt att ggt tca tgg tct 676Ser Phe Ile Tyr Asp Gly Arg Leu Val Asp Ser Ile Gly Ser Trp Ser205 210 215caa aat atc ctc agg acc cag gag tcg gaa tgc gtt tgt atc aat ggg 724Gln Asn Ile Leu Arg Thr Gln Glu Ser Glu Cys Val Cys Ile Asn Gly220 225 230 235act tgt aca gta gta atg act gat gga agt gct tca gga aga gct gat 772Thr Cys Thr Val Val Met Thr Asp Gly Ser Ala Ser Gly Arg Ala Asp240 245 250act aaa ata cta ttc att gaa gag ggg aaa att gtt cat att agc cca 820Thr Lys Ile Leu Phe Ile Glu Glu Gly Lys Ile Val His Ile Ser Pro255 260 265ttg tca gga agt gct cag cat gta gag gag tgt tcc tgt tat cct cga 868Leu Ser Gly Ser Ala Gln His Val Glu Glu Cys Ser Cys Tyr Pro Arg270 275 280tat cct ggt gtc aga tgt atc tgc aga gac aac tgg aaa ggc tct aat 916Tyr Pro Gly Val Arg Cys Ile Cys Arg Asp Asn Trp Lys Gly Ser Asn285 290 295agg ccc gtc gta gat ata aat gtg aaa gat tat agc att gat tcc agt 964Arg Pro Val Val Asp Ile Asn Val Lys Asp Tyr Ser Ile Asp Ser Ser300 305 310 315tat gtg tgc tca ggg ctt gtt ggc gac gca ccc aga aac aac gac aga 1012Tyr Val Cys Ser Gly Leu Val Gly Asp Ala Pro Arg Asn Asn Asp Arg320 325 330tct agc aat agc tat tgc cgg aat cct aac aat gag aaa ggg aat cac 1060Ser Ser Asn Ser Tyr Cys Arg Asn Pro Asn Asn Glu Lys Gly Asn His335 340 345gga gtg aaa ggc tgg gcc ttt gac gat gga aat gac gtg tgg atg gga 1108Gly Val Lys Gly Trp Ala Phe Asp Asp Gly Asn Asp Val Trp Met Gly350 355 360aga acg atc agc gag gat tca cgc tca ggt tat gaa acc ttc aaa gtc 1156Arg Thr Ile Ser Glu Asp Ser Arg Ser Gly Tyr Glu Thr Phe Lys Val365 370 375att ggt ggt tgg tcc aca cct aat tcc aaa ttg cag ata aat agg caa 1204Ile Gly Gly Trp Ser Thr Pro Asn Ser Lys Leu Gln Ile Asn Arg Gln380 385 390 395gtc ata gtt gac agc gat aat agg tca ggt tat tct ggt att ttc tct 1252Val Ile Val Asp Ser Asp Asn Arg Ser Gly Tyr Ser Gly Ile Phe Ser400 405 410gtt gag ggc aaa agc tgc atc aat agg tgc ttt tat gtg gag ttg ata 1300Val Glu Gly Lys Ser Cys Ile Asn Arg Cys Phe Tyr Val Glu Leu Ile415 420 425agg gga agg gaa cag gaa act aga gta tgg tgg acc tca aac agt att 1348Arg Gly Arg Glu Gln Glu Thr Arg Val Trp Trp Thr Ser Asn Ser Ile430 435 440gtt gtg ttt tgt ggc act tca ggt acc tat ggg aca ggc tca tgg cct 1396Val Val Phe Cys Gly Thr Ser Gly Thr Tyr Gly Thr Gly Ser Trp Pro445 450 455gat ggg gcg gac atc aat ctc atg cct ata taagctttcg caattttaga 1446Asp Gly Ala Asp Ile Asn Leu Met Pro Ile460 465aaaaactcct tgtttctact 1466<210>12<211>469<212>PRT<213>流感A病毒<400>12Met Asn Pro Asn Gln Lys Ile Ile Thr Ile Gly Ser Val Ser Leu Thr1 5 10 15Ile Ala Thr Ile Cys Phe Leu Met Gln Ile Ala Ile Leu Val Thr Thr20 25 30Val Thr Leu His Phe Lys Gln Tyr Glu Cys Asp Ser Pro Ala Asn Asn35 40 45Gln Val Met Pro Cys Glu Pro Ile Ile Ile Glu Arg Asn Ile Thr Glu50 55 60Ile Val Tyr Leu Thr Asn Thr Thr Ile Glu Lys Glu Ile Cys Pro Lys65 70 75 80Leu Val Glu Tyr Arg Asn Trp Ser Lys Pro Gln Cys Lys Ile Thr Gly85 90 95Phe Ala Pro Phe Ser Lys Asp Asn Ser Ile Arg Leu Ser Ala Gly Gly100 105 110Asp Ile Trp Val Thr Arg Glu Pro Tyr Val Ser Cys Asp Pro Gly Lys115 120 125Cys Tyr Gln Phe Ala Leu Gly Gln Gly Thr Thr Leu Asp Asn Lys His130 135 140Ser Asn Asp Thr Ile His Asp Arg Thr Pro His Arg Thr Leu Leu Met145 150 155 160Asn Glu Leu Gly Val Pro Phe His Leu Gly Thr Arg Gln Val Cys Ile165 170 175Ala Trp Ser Ser Ser Ser Cys His Asp Gly Lys Ala Trp Leu His Val180 185 190Cys Val Thr Gly Tyr Asp Lys Asn Ala Thr Ala Ser Phe Ile Tyr Asp195 200 205Gly Arg Leu Val Asp Ser Ile Gly Ser Trp Ser Gln Asn Ile Leu Arg210 215 220Thr Gln Glu Ser Glu Cys Val Cys Ile Asn Gly Thr Cys Thr Val Val225 230 235 240Met Thr Asp Gly Ser Ala Ser Gly Arg Ala Asp Thr Lys Ile Leu Phe245 250 255Ile Glu Glu Gly Lys Ile Val His Ile Ser Pro Leu Ser Gly Ser Ala260 265 270Gln His Val Glu Glu Cys Ser Cys Tyr Pro Arg Tyr Pro Gly Val Arg275 280 285Cys Ile Cys Arg Asp Asn Trp Lys Gly Ser Asn Arg Pro Val Val Asp290 295 300Ile Asn Val Lys Asp Tyr Ser Ile Asp Ser Ser Tyr Val Cys Ser Gly305 310 315 320Leu Val Gly Asp Ala Pro Arg Asn Asn Asp Arg Ser Ser Asn Set Tyr325 330 335Cys Arg Asn Pro Asn Asn Glu Lys Gly Asn His Gly Val Lys Gly Trp340 345 350Ala Phe Asp Asp Gly Asn Asp Val Trp Met Gly Arg Thr Ile Ser Glu355 360 365Asp Ser Arg Ser Gly Tyr Glu Thr Phe Lys Val Ile Gly Gly Trp Ser370 375 380Thr Pro Asn Ser Lys Leu Gln Ile Asn Arg Gln Val Ile Val Asp Ser385 390 395 400Asp Asn Arg Ser Gly Tyr Ser Gly Ile Phe Ser Val Glu Gly Lys Ser405 410 415Cys Ile Asn Arg Cys Phe Tyr Val Glu Leu Ile Arg Gly Arg Glu Gln420 425 430Glu Thr Arg Val Trp Trp Thr Ser Asn Ser Ile Val Val Phe Cys Gly435 440 445Thr Ser Gly Thr Tyr Gly Thr Gly Ser Trp Pro Asp Gly Ala Asp Ile450 455 460Asn Leu Met Pro Ile465<210>13<211>1027<212>DNA<213>流感A病毒<220><221>CDS<222>(26)..(781)<400>13agcaaaagca ggtagatatt gaaag atg agc ctt ctg acc gag gtc gaa acg 52Met Ser Leu Leu Thr Glu Val Glu Thr1 5tat gtt ctc tct atc gtt ccg tea ggc ccc ctc aaa gcc gag atc gcg 100Tyr Val Leu Ser Ile Val Pro Ser Gly Pro Leu Lys Ala Glu Ile Ala10 15 20 25cag aga ctt gaa gat gtc ttt gct gga aag aac aca gat ctt gag gct 148Gln Arg Leu Glu Asp Val Phe Ala Gly Lys Asn Thr Asp Leu Glu Ala30 35 40ctc atg gaa tgg cta aag aca aga cca atc ctg tca cct ctg act aaa 196Leu Met Glu Trp Leu Lys Thr Arg Pro Ile Leu Ser Pro Leu Thr Lys45 50 55ggg att ttg gga ttt gta ttc acg ctc acc gtg ccc agt gag cga gga 244Gly Ile Leu Gly Phe Val Phe Thr Leu Thr Val Pro Ser Glu Arg Gly60 65 70ctg cag cgt aga cgc ttt gtc caa aat gcc ctc aat ggg aat ggg gat 292Leu Gln Arg Arg Arg Phe Val Gln Asn Ala Leu Asn Gly Asn Gly Asp75 80 85cca aat aac atg gac aga gca gtt aaa ctg tat aga aaa ctt aag agg 340Pro Asn Asn Met Asp Arg Ala Val Lys Leu Tyr Arg Lys Leu Lys Arg90 95 100 105gag ata aca ttc cat ggg gcc aaa gaa ata gca ctc agt tat tct gct 388Glu Ile Thr Phe His Gly Ala Lys Glu Ile Ala Leu Ser Tyr Ser Ala110 115 120ggt gca ctt gcc agt tgc atg ggc ctc ata tac aac agg atg ggg gct 436Gly Ala Leu Ala Ser Cys Met Gly Leu Ile Tyr Asn Arg Met Gly Ala125 130 135gtg acc act gaa gtg gcc ttt ggc ctg gtt tgt gca acc tgt gaa cag 484Val Thr Thr Glu Val Ala Phe Gly Leu Val Cys Ala Thr Cys Glu Gln140 145 150att gct gac tcc cag cac agg tct cat agg caa atg gtg gca aca acc 532Ile Ala Asp Ser Gln His Arg Ser His Arg Gln Met Val Ala Thr Thr155 160 165aat cca cta ata aga cat gag aac aga atg gtt ctg gcc agc act aca 580Asn Pro Leu Ile Arg His Glu Asn Arg Met Val Leu Ala Ser Thr Thr170 175 180 185gct aag gct atg gag caa atg gct gga tca agt gag cag gca gca gag 628Ala Lys Ala Met Glu Gln Met Ala Gly Ser Ser Glu Gln Ala Ala Glu190 195 200gcc atg gag gtt gct agt cag gcc agg caa atg gtg cag gca atg aga 676Ala Met Glu Val Ala Ser Gln Ala Arg Gln Met Val Gln Ala Met Arg205 210 215gcc att ggg act cat cct agc tcc agt gct ggt cta aaa gat gat ctt 724Ala Ile Gly Thr His Pro Ser Ser Ser Ala Gly Leu Lys Asp Asp Leu220 225 230ctt gaa aat ttg cag gcc tat cag aaa cga atg ggg gtg cag atg caa 772Leu Glu Asn Leu Gln Ala Tyr Gln Lys Arg Met Gly Val Gln Met Gln235 240 245cga ttc aag tgaccctctt gttgttgctg cgagtatcat tgggatcttg 821Arg Phe Lys250cacttgatat tgtggattct tgatcgtctt tttttcaaat gcatctatcg attctttgaa 881cacggtctga aaagagggcc ttctacggaa ggagtacctg agtctatgag ggaagaatat 941cgaaaggaac agcagagtgc tgtggatgct gacgacagtc attttgtcag catagagctg 1001gagtaaaaaa ctaccttgtt tctact 1027<210>14<211>252<212>PRT<213>流感A病毒<400>14Met Ser Leu Leu Thr Glu Val Glu Thr Tyr Val Leu Ser Ile Val Pro1 5 l0 15Ser Gly Pro Leu Lys Ala Glu Ile Ala Gln Arg Leu Glu Asp Val Phe20 25 30Ala Gly Lys Asn Thr Asp Leu Glu Ala Leu Met Glu Trp Leu Lys Thr35 40 45Arg Pro Ile Leu Ser Pro Leu Thr Lys Gly Ile Leu Gly Phe Val Phe50 55 60Thr Leu Thr Val Pro Ser Glu Arg Gly Leu Gln Arg Arg Arg Phe Val65 70 75 80Gln Asn Ala Leu Asn Gly Asn Gly Asp Pro Asn Asn Met Asp Arg Ala85 90 95Val Lys Leu Tyr Arg Lys Leu Lys Arg Glu Ile Thr Phe His Gly Ala100 105 110Lys Glu Ile Ala Leu Ser Tyr Ser Ala Gly Ala Leu Ala Ser Cys Met115 120 125Gly Leu Ile Tyr Asn Arg Met Gly Ala Val Thr Thr Glu Val Ala Phe130 135 140Gly Leu Val Cys Ala Thr Cys Glu Gln Ile Ala Asp Ser Gln His Arg145 150 155 160Ser His Arg Gln Met Val Ala Thr Thr Asn Pro Leu Ile Arg His Glu165 170 175Asn Arg Met Val Leu Ala Ser Thr Thr Ala Lys Ala Met Glu Gln Met180 185 190Ala Gly Ser Ser Glu Gln Ala Ala Glu Ala Met Glu Val Ala Ser Gln195 200 205Ala Arg Gln Met Val Gln Ala Met Arg Ala Ile Gly Thr His Pro Ser2l0 215 220Ser Ser Ala Gly Leu Lys Asp Asp Leu Leu Glu Asn Leu Gln Ala Tyr225 230 235 240Gln Lys Arg Met Gly Val Gln Met Gln Arg Phe Lys245 250<210>15<211>322<212>DNA<213>流感A病毒<220><221>CDS<222>(26)..(316)<400>15agcaaaagca ggtagatatt gaaag atg agc ctt ctg acc gag gtc gaa acg 52Met Ser Leu Leu Thr Glu Val Glu Thr1 5cct atc aga aac gaa tgg ggg tgc aga tgc aac gat tca agt gac cct 100Pro Ile Arg Asn Glu Trp Gly Cys Arg Cys Asn Asp Ser Ser Asp Pro10 15 20 25ctt gtt gtt gct gcg agt atc att ggg atc ttg cac ttg ata ttg tgg 148Leu Val Val Ala Ala Ser Ile Ile Gly Ile Leu His Leu Ile Leu Trp30 35 40att ctt gat cgt ctt ttt ttc aaa tgc atc tat cga ttc ttt gaa cac 196Ile Leu Asp Arg Leu Phe Phe Lys Cys Ile Tyr Arg Phe Phe Glu His45 50 55ggt ctg aaa aga ggg cct tct acg gaa gga gta cct gag tct atg agg 244Gly Leu Lys Arg Gly Pro Ser Thr Glu Gly Val Pro Glu Ser Met Arg60 65 70gaa gaa tat cga aag gaa cag cag agt gct gtg gat gct gac gac agt 292Glu Glu Tyr Arg Lys Glu Gln Gln Ser Ala Val Asp Ala Asp Asp Ser75 80 85cat ttt gtc agc ata gag ctg gag taaaaa322His Phe Val Ser Ile Glu Leu Glu90 95<210>16<211>97<212>PRT<213>流感A病毒<400>16Met Ser Leu Leu Thr Glu Val Glu Thr Pro Ile Arg Asn Glu Trp Gly1 5 10 15Cys Arg Cys Asn Asp Ser Ser Asp Pro Leu Val Val Ala Ala Ser Ile20 25 30Ile Gly Ile Leu His Leu Ile Leu Trp Ile Leu Asp Arg Leu Phe Phe35 40 45Lys Cys Ile Tyr Arg Phe Phe Glu His Gly Leu Lys Arg Gly Pro Ser50 55 60Thr Glu Gly Val Pro Glu Ser Met Arg Glu Glu Tyr Arg Lys Glu Gln65 70 75 80Gln Ser Ala Val Asp Ala Asp Asp Ser His Phe Val Ser Ile Glu Leu85 90 95Glu<210>17<211>890<212>DNA<213>流感A病毒<220><221>CDS<222>(27)..(737)<400>17agcaaaagca gggtgacaaa gacata atg gat tcc aac act gtg tca agt ttt 53Met Asp Ser Asn Thr Val Ser Ser Phe1 5cag gta gac tgc ttc ctt tgg cat gtc cga aaa caa gtt gta gac caa 101Gln Val Asp Cys Phe Leu Trp His Val Arg Lys Gln Val Val Asp Gln10 15 20 25gaa cta ggt gat gcc cca ttc ctt gat cgg ctt cgc cga gat cag aag 149Glu Leu Gly Asp Ala Pro Phe Leu Asp Arg Leu Arg Arg Asp Gln Lys30 35 40tcc cta agg gga aga ggc agc act ctc ggt cta aac atc gaa gca gcc 197Ser Leu Arg Gly Arg Gly Ser Thr Leu Gly Leu Asn Ile Glu Ala Ala45 50 55acc cat gtt gga aag cag ata gta gag aag att ctg aag gaa gaa tct 245Thr His Val Gly Lys Gln Ile Val Glu Lys Ile Leu Lys Glu Glu Ser60 65 70gat gag gca ctt aaa atg acc atg gcc tcc aca cct gct tcg cga tac 293Asp Glu Ala Leu Lys Met Thr Met Ala Ser Thr Pro Ala Ser Arg Tyr75 80 85ata act gac atg act att gag gaa ttg tca agg gac tgg ttc atg cta 341Ile Thr Asp Met Thr Ile Glu Glu Leu Ser Arg Asp Trp Phe Met Leu90 95 100 105atg ccc aag cag aaa gtg gaa gga cct ctt tgc atc aga ata gac caa 389Met Pro Lys Gln Lys Val Glu Gly Pro Leu Cys Ile Arg Ile Asp Gln110 115 120gca atc atg gat aag aac atc atg ttg aaa gcg aat ttc agt gtg att 437Ala Ile Met Asp Lys Asn Ile Met Leu Lys Ala Asn Phe Ser Val Ile125 130 135ttt gac cgg cta gag acc cta ata tta cta agg gct ttc acc gaa gag 485Phe Asp Arg Leu Glu Thr Leu Ile Leu Leu Arg Ala Phe Thr Glu Glu140 145 150gga gca att gtt ggc gaa atc tca cca ttg cct tct ttt cca gga cat 533Gly Ala Ile Val Gly Glu Ile Ser Pro Leu Pro Ser Phe Pro Gly His155 160 165act att gag gat gtc aaa aat gca att ggg gtc ctc atc gga gga ctt 581Thr Ile Glu Asp Val Lys Asn Ala Ile Gly Val Leu Ile Gly Gly Leu170 175 180 185gaa tgg aat gat aac aca gtt cga gtc tct aaa act cta cag aga ttc 629Glu Trp Asn Asp Asn Thr Val Arg Val Ser Lys Thr Leu Gln Arg Phe190 195 200gct tgg gga agc agt aat gag aat ggg aga cct cca ctt act cca aaa 677Ala Trp Gly Ser Ser Asn Glu Asn Gly Arg Pro Pro Leu Thr Pro Lys205 210 215cag aaa cgg aaa atg gcg aga aca gct agg tca aaa gtt cga aga gat 725Gln Lys Arg Lys Met Ala Arg Thr Ala Arg Ser Lys Val Arg Arg Asp220 225 230aag atg gct gat tgaagaagtg agacacagac tgaagacaac agagaatagt 777Lys Met Ala Asp235tttgagcaaa taacattcat gcaagcctta cagctactat ttgaagtgga acaggagata 837agaactttct cgtttcagct tatttaatga taaaaaacac ccttgtttct act890<210>18<211>237<212>PRT<213>流感A病毒<400>18Met Asp Ser Asn Thr Val Ser Ser Phe Gln Val Asp Cys Phe Leu Trp1 5 10 15His Val Arg Lys Gln Val Val Asp Gln Glu Leu Gly Asp Ala Pro Phe20 25 30Leu Asp Arg Leu Arg Arg Asp Gln Lys Ser Leu Arg Gly Arg Gly Ser
35 40 45Thr Leu Gly Leu Asn Ile Glu Ala Ala Thr His Val Gly Lys Gln Ile50 55 60Val Glu Lys Ile Leu Lys Glu Glu Ser Asp Glu Ala Leu Lys Met Thr65 70 75 80Met Ala Ser Thr Pro Ala Ser Arg Tyr Ile Thr Asp Met Thr Ile Glu85 90 95Glu Leu Ser Arg Asp Trp Phe Met Leu Met Pro Lys Gln Lys Val Glu100 105 110Gly Pro Leu Cys Ile Arg Ile Asp Gln Ala Ile Met Asp Lys Asn Ile115 120 125Met Leu Lys Ala Asn Phe Ser Val Ile Phe Asp Arg Leu Glu Thr Leu130 135 140Ile Leu Leu Arg Ala Phe Thr Glu Glu Gly Ala Ile Val Gly Glu Ile145 150 155 160Ser Pro Leu Pro Ser Phe Pro Gly His Thr Ile Glu Asp Val Lys Asn165 170 175Ala Ile Gly Val Leu Ile Gly Gly Leu Glu Trp Asn Asp Asn Thr Val180 185 190Arg Val Ser Lys Thr Leu Gln Arg Phe Ala Trp Gly Ser Ser Asn Glu195 200 205Asn Gly Arg Pro Pro Leu Thr Pro Lys Gln Lys Arg Lys Met Ala Arg210 215 220Thr Ala Arg Ser Lys Val Arg Arg Asp Lys Met Ala Asp225 230 235<210>19<211>402<212>DNA<213>流感A病毒<220><221>CDS<222>(27)..(389)<400>19agcaaaagca gggtgacaaa gacata atg gat tcc aac act gtg tca agt ttt 53Met Asp Ser Asn Thr Val Ser Ser Phe1 5cag gac ata cta ttg agg atg tca aaa atg caa ttg ggg tcc tca tcg 101Gln Asp Ile Leu Leu Arg Met Ser Lys Met Gln Leu Gly Ser Ser Ser10 15 20 25gag gac ttg aat gga atg ata aca cag ttc gag tct cta aaa ctc tac 149Glu Asp Leu Asn Gly Met Ile Thr Gln Phe Glu Ser Leu Lys Leu Tyr30 35 40aga gat tcg ctt ggg gaa gca gta atg aga atg gga gac ctc cac tta 197Arg Asp Ser Leu Gly Glu Ala Val Met Arg Met Gly Asp Leu His Leu45 50 55ctc caa aac aga aac gga aaa tgg cga gaa cag cta ggt caa aag ttc 245Leu Gln Asn Arg Asn Gly Lys Trp Arg Glu Gln Leu Gly Gln Lys Phe60 65 70gaa gag ata aga tgg ctg att gaa gaa gtg aga cac aga ctg aag aca 293Glu Glu Ile Arg Trp Leu Ile Glu Glu Val Arg His Arg Leu Lys Thr75 80 85aca gag aat agt ttt gag caa ata aca ttc atg caa gcc tta cag cta 341Thr Glu Asn Ser Phe Glu Gln Ile Thr Phe Met Gln Ala Leu Gln Leu90 95 100 105cta ttt gaa gtg gaa cag gag ata aga act ttc tcg ttt cag ctt att 389Leu Phe Glu Val Glu Gln Glu Ile Arg Thr Phe Ser Phe Gln Leu Ile1l0 115 120taatgataaa aaa402<210>20<211>121<212>PRT<213>流感A病毒<400>20Met Asp Ser Asn Thr Val Ser Ser Phe Gln Asp Ile Leu Leu Arg Met1 5 10 15Ser Lys Met Gln Leu Gly Ser Ser Ser Glu Asp Leu Asn Gly Met Ile20 25 30Thr Gln Phe Glu Ser Leu Lys Leu Tyr Arg Asp Ser Leu Gly Glu Ala35 40 45Val Met Arg Met Gly Asp Leu His Leu Leu Gln Asn Arg Asn Gly Lys50 55 60Trp Arg Glu Gln Leu Gly Gln Lys Phe Glu Glu Ile Arg Trp Leu Ile65 70 75 80Glu Glu Val Arg His Arg Leu Lys Thr Thr Glu Asn Ser Phe Glu Gln85 90 95Ile Thr Phe Met Gln Ala Leu Gln Leu Leu Phe Glu Val Glu Gln Glu100 105 110Ile Arg Thr Phe Ser Phe Gln Leu Ile115 120<210>21<211>1764<212>DNA<213>流感A病毒<220><221>CDS<222>(30)..(1727)<400>21agcaaaagca ggggataatt ctattaacc atg aaa act atc att gct ttg agc 53Met Lys Thr Ile Ile Ala Leu Ser1 5tac att ttc tgt ctg gtt ctc ggc caa gac ttt cca gga aat gac aac 101Tyr Ile Phe Cys Leu Val Leu Gly Gln Asp Phe Pro Gly Asn Asp Asn10 15 20agc aca gca acg ctg tgc ctg gga cat cat gcg gtg cca aac gga aca 149Ser Thr Ala Thr Leu Cys Leu Gly His His Ala Val Pro Asn Gly Thr25 30 35 40cta gtg aaa aca atc aca aat gat cag att gaa gtg act aat gct act 197Leu Val Lys Thr Ile Thr Asn Asp Gln Ile Glu Val Thr Asn Ala Thr45 50 55gag ctg gtt cag agt tcc tca acg ggg aaa ata tgc aac aat cct cat 245Glu Leu Val Gln Ser Ser Ser Thr Gly Lys Ile Cys Asn Asn Pro His60 65 70cga atc ctt gat gga ata gac tgc aca ctg ata gat gct cta ttg ggg 293Arg Ile Leu Asp Gly Ile Asp Cys Thr Leu Ile Asp Ala Leu Leu Gly75 80 85gac cct cat tgt gat ggc ttt caa aat gag aca tgg gac ctt ttc gtt 341Asp Pro His Cys Asp Gly Phe Gln Asn Glu Thr Trp Asp Leu Phe Val90 95 100gaa cgc agc aaa gct ttc agc aac tgt tac cct tat gat gtg cca gat 389Glu Arg Ser Lys Ala Phe Ser Asn Cys Tyr Pro Tyr Asp Val Pro Asp105 110 115 120tat gcc tcc ctt agg tca cta gtt gcc tcg tca ggc act ctg gag ttt 437Tyr Ala Ser Leu Arg Ser Leu Val Ala Ser Ser Gly Thr Leu Glu Phe125 130 135atc agt gaa ggc ttc act tgg act ggg gtc act cag aat ggg gga agc 485Ile Ser Glu Gly Phe Thr Trp Thr Gly Val Thr Gln Asn Gly Gly Ser140 145 150aat gct tgc aaa agg gga cct gat agc ggt ttt ttc agt aga ctg aac 533Asn Ala Cys Lys Arg Gly Pro Asp Ser Gly Phe Phe Ser Arg Leu Asn155 160 165tgg ttg tac aaa tca gga agc aca tat cca gtg ctg aac gtg act atg 581Trp Leu Tyr Lys Ser Gly Ser Thr Tyr Pro Val Leu Asn Val Thr Met170 175 180cca aac aat gac aat ttt gac aaa cta tac att tgg ggg gtt cac cac 629Pro Asn Asn Asp Asn Phe Asp Lys Leu Tyr Ile Trp Gly Val His His185 190 195 200ccg agc acg gac caa gaa caa acc agc cta tat gtt caa gca tca ggg 677Pro Ser Thr Asp Gln Glu Gln Thr Ser Leu Tyr Val Gln Ala Ser Gly205 210 215aga gtc aca gtc tct acc aag aga agc cag caa act ata atc ccg aat 725Arg Val Thr Val Ser Thr Lys Arg Ser Gln Gln Thr Ile Ile Pro Asn220 225 230atc ggg tct aga ccc tgg gta agg ggt ctg tct agt aga ata agc atc 773Ile Gly Ser Arg Pro Trp Val Arg Gly Leu Ser Ser Arg Ile Ser Ile235 240 245tat tgg aca ata gtt aaa ccg gga gac ata ctg gta att aat agt aat 821Tyr Trp Thr Ile Val Lys Pro Gly Asp Ile Leu Val Ile Asn Ser Asn250 255 260ggg aac cta att gct cct cgg ggt tat ttt aaa atg cgc act ggg aaa 869Gly Asn Leu Ile Ala Pro Ara Gly Tyr Phe Lys Met Arg Thr Gly Lys265 270 275 280agc tca ata atg agg tca gat gca cct att ggc acc tgc att tct gaa 917Ser Ser Ile Met Arg Ser Asp Ala Pro Ile Gly Thr Cys Ile Ser Glu285 290 295tgc atc act cca aat gga agc att ccc aat gac aag ccc ttt caa aac 965Cys Ile Thr Pro Asn Gly Ser Ile Pro Asn Asp Lys Pro Phe Gln Asn300 305 310gta aac aag atc aca tat ggg gca tgt ccc aag tat gtt aag caa aac 1013Val Asn Lys Ile Thr Tyr Gly Ala Cys Pro Lys Tyr Val Lys Gln Asn315 320 325acc ctg aag ttg gca aca ggg atg cgg aat gta cca gag aaa caa act 1061Thr Leu Lys Leu Ala Thr Gly Met Arg Asn Val Pro Glu Lys Gln Thr330 335 340aga ggc cta ttc agc gca ata gca ggt ttc ata gaa aat ggt tgg gag 1109Arg Gly Leu Phe Ser Ala Ile Ala Gly Phe Ile Glu Asn Gly Trp Glu345 350 355 360gga atg ata gac ggt tgg tac ggt ttc agg cat caa aat tct gag ggc 1157Gly Met Ile Asp Gly Trp Tyr Gly Phe Arg His Gln Asn Ser Glu Gly365 370 375aca gga caa gca gca gat ctt aaa age act caa gca gcc atc gac caa 1205Thr Gly Gln Ala Ala Asp Leu Lys Ser Thr Gln Ala Ala Ile Asp Gln380 385 390atc aat ggg aaa ctg aat agg gta atc gag aag acg aac gag aaa ttc 1253Ile Asn Gly Lys Leu Asn Arg Val Ile Glu Lys Thr Asn Glu Lys Phe395 400 405cat caa atc gaa aag gaa ttc tca gaa gta gaa ggg aga att cag gac 1301His Gln Ile Glu Lys Glu Phe Ser Glu Val Glu Gly Arg Ile Gln Asp410 415 420ctc gag aaa tac gtt gaa gac act aaa ata gat ctc tgg tct tac aat 1349Leu Glu Lys Tyr Val Glu Asp Thr Lys Ile Asp Leu Trp Ser Tyr Asn425 430 435 440gcg gag ctt ctt gtc gct ctg gag aac caa cat aca att gat ctg act 1397Ala Glu Leu Leu Val Ala Leu Glu Asn Gln His Thr Ile Asp Leu Thr445 450 455gac tcg gaa atg aac aaa ctg ttt gaa aaa aca agg agg caa ctg agg 1445Asp Ser Glu Met Asn Lys Leu Phe Glu Lys Thr Arg Arg Gln Leu Arg460 465 470gaa aat gct gag gac atg ggc aat ggt tgc ttc aaa ata tac cac aaa 1493Glu Asn Ala Glu Asp Met Gly Asn Gly Cys Phe Lys Ile Tyr His Lys475 480 485tgt gac aat gct tgc ata ggg tca atc aga aat ggg act tat gac cat 1541Cys Asp Asn Ala Cys Ile Gly Ser Ile Arg Asn Gly Thr Tyr Asp His490 495 500gat gta tac aga gac gaa gca tta aac aac cgg ttt cag atc aaa ggt 1589Asp Val Tyr Arg Asp Glu Ala Leu Asn Asn Arg Phe Gln Ile Lys Gly505 510 515 520gtt gaa ctg aag tca gga tac aaa gac tgg atc ctg tgg att tcc ttt 1637Val Glu Leu Lys Ser Gly Tyr Lys Asp Trp Ile Leu Trp Ile Ser Phe525 530 535gcc ata tca tgc ttt ttg ctt tgt gtt gtt ttg ctg ggg ttc atc atg 1685Ala Ile Ser Cys Phe Leu Leu Cys Val Val Leu Leu Gly Phe Ile Met540 545 550tgg gcc tgc cag aaa ggc aac att agg tgc aac att tgc att 1727Trp Ala Cys Gln Lys Gly Asn Ile Arg Cys Asn Ile Cys Ile555 560 565tgagtgtatt agtaattaaa aacaccctgt ttctact 1764<210>22<211>566<212>PRT<213>流感A病毒<400>22Met Lys Thr Ile Ile Ala Leu Ser Tyr Ile Phe Cys Leu Val Leu Gly1 5 10 15Gln Asp Phe Pro Gly Asn Asp Asn Ser Thr Ala Thr Leu Cys Leu Gly20 25 30His His Ala Val Pro Asn Gly Thr Leu Val Lys Thr Ile Thr Asn Asp35 40 45Gln Ile Glu Val Thr Asn Ala Thr Glu Leu Val Gln Ser Ser Ser Thr50 55 60Gly Lys Ile Cys Asn Asn Pro His Arg Ile Leu Asp Gly Ile Asp Cys65 70 75 80Thr Leu Ile Asp Ala Leu Leu Gly Asp Pro His Cys Asp Gly Phe Gln85 90 95Asn Glu Thr Trp Asp Leu Phe Val Glu Arg Ser Lys Ala Phe Ser Asn100 105 110Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser Leu Arg Ser Leu Val115 120 125Ala Ser Ser Gly Thr Leu Glu Phe Ile Ser Glu Gly Phe Thr Trp Thr130 135 140Gly Val Thr Gln Asn Gly Gly Ser Asn Ala Cys Lys Arg Gly Pro Asp145 150 155 160Ser Gly Phe Phe Ser Arg Leu Asn Trp Leu Tyr Lys Ser Gly Ser Thr165 170 175Tyr Pro Val Leu Asn Val Thr Met Pro Asn Asn Asp Asn Phe Asp Lys180 185 190Leu Tyr Ile Trp Gly Val His His Pro Ser Thr Asp Gln Glu Gln Thr195 200 205Ser Leu Tyr Val Gln Ala Ser Gly Arg Val Thr Val Ser Thr Lys Arg210 215 220Ser Gln Gln Thr Ile Ile Pro Asn Ile Gly Ser Arg Pro Trp Val Arg225 230 235 240Gly Leu Ser Ser Arg Ile Ser Ile Tyr Trp Thr Ile Val Lys Pro Gly245 250 255Asp Ile Leu Val Ile Asn Ser Asn Gly Asn Leu Ile Ala Pro Arg Gly260 265 270Tyr Phe Lys Met Arg Thr Gly Lys Ser Ser Ile Met Arg Ser Asp Ala275 280 285Pro Ile Gly Thr Cys Ile Ser Glu Cys Ile Thr Pro Asn Gly Ser Ile290 295 300Pro Asn Asp Lys Pro Phe Gln Asn Val Asn Lys Ile Thr Tyr Gly Ala305 310 315 320Cys Pro Lys Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr Gly Met325 330 335Arg Asn Val Pro Glu Lys Gln Thr Arg Gly Leu Phe Ser Ala Ile Ala340 345 350Gly Phe Ile Glu Asn Gly Trp Glu Gly Met Ile Asp Gly Trp Tyr Gly355 360 365Phe Arg His Gln Asn Ser Glu Gly Thr Gly Gln Ala Ala Asp Leu Lys370 375 380Ser Thr Gln Ala Ala Ile Asp Gln Ile Asn Gly Lys Leu Asn Arg Val385 390 395 400Ile Glu Lys Thr Asn Glu Lys Phe His Gln Ile Glu Lys Glu Phe Ser405 410 415Glu Val Glu Gly Arg Ile Gln Asp Leu Glu Lys Tyr Val Glu Asp Thr420 425 430Lys Ile Asp Leu Trp Ser Tyr Asn Ala Glu Leu Leu Val Ala Leu Glu435 440 445Asn Gln His Thr Ile Asp Leu Thr Asp Ser Glu Met Asn Lys Leu Phe450 455 460Glu Lys Thr Arg Arg Gln Leu Arg Glu Asn Ala Glu Asp Met Gly Asn465 470 475 480Gly Cys Phe Lys Ile Tyr His Lys Cys Asp Asn Ala Cys Ile Gly Ser485 490 495Ile Arg Asn Gly Thr Tyr Asp His Asp Val Tyr Arg Asp Glu Ala Leu500 505 510Asn Asn Arg Phe Gln Ile Lys Gly Val Glu Leu Lys Ser Gly Tyr Lys515 520 525Asp Trp Ile Leu Trp Ile Ser Phe Ala Ile Ser Cys Phe Leu Leu Cys530 535 540Val Val Leu Leu Gly Phe Ile Met Trp Ala Cys Gln Lys Gly Asn Ile545 550 555 560Arg Cys Asn Ile Cys Ile565<210>23<211>27<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>23tatggaaaga ataaaagaac tacggaa 27<210>24<211>22<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>24tcgtttttaa actattcaac at 22<210>25<211>24<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>25agcaggtcaa ttatattcaa tatg24<210>26<211>24<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>26aacaaggtcg tttttaaact attc24<210>27<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>27agaactctat tccaacaaat g 21<210>28<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>28aatcggatat ttcattgcca t 21<210>29<211>33<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>29agaactctat tccaacaaat gagggatgta gtt 33<210>30<211>33<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>30aatcggatat ttcattgcca tcatccattt cat 33<210>31<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>31agacgtggtg ttggtaatga a 21<210>32<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>32cgaagagttg acataaaccc t 21<210>33<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>33tcatccctca tcccctcaca t21<210>34<211>22<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>34attttctgtt atcctcttgt ca 22<210>35<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>35gtggagtccg ctgttttgag 20<210>36<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>36aggatggtgg acattcttag g 21<210>37<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>37ccgatcaatg ctaaccacta c 21<210>38<211>25<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>38agcaaaagca ggtcaattat attca 25<210>39<211>25<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>39agtagaaaca aggtcgtttt taaac25<210>40<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>40agcaaaagca ggcaaaccat 20<210>41<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>41agtagaaaca aggcattttt t21<210>42<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>42tcgagctgaa gaagctatgg 20<210>43<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>43gttctgttga ctgtgtccat 20<210>44<211>32<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>44tcgagctgaa gaagctatgg gagcagaccc gt 32<210>45<211>29<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>45gttctgttga ctgtgtccat ggtgtatcc 29<210>46<211>35<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>46agcgaaagca ggcaaaccat ttgaatggat gtcaa35<210>47<211>31<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>47attaaaaaca aggcattttt tcatgaagga c31<210>48<211>28<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>48aatttccagc atggtggagg ccatggtg28<210>49<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>49gggatctttg aaaactcgtg 20<210>50<211>19<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>50cagcattgtt tacagactc19<210>51<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>51accaaagatg cagaaagagg 20<210>52<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>52ctcatattga ttccgactaa 20<210>53<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>53tgtccacttc cctttttctg a 21<210>54<211>22<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>54atttttcccc agtagttcat ac 22<210>55<211>25<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>55acgctgttgc aactacacac tcctg 25<210>56<211>25<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>56agcaaaagca ggtactgatt cgaga 25<210>57<211>25<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>57agtagaaaca aggtactttt ttgga 25<210>58<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>58gaacctggaa cctttgatct t 21<210>59<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>59attcaccact gtccaggcca t 21<210>60<211>30<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>60gaacctggaa cctttgatct tgaggggcta 30<210>61<211>30<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>61attcaccact gtccaggcca ttgacgcgtc 30<210>62<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>62agcaaaagca ggtagtgata g 21<210>63<211>15<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>63agtagaaaca aggta 15<210>64<211>29<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>64agcgaaagca ggtagtgatt cgagatgga 29<210>65<211>28<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>65agtagaaaca aggtactttt ttggacag28<210>66<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>66tcgatttgtt ggagtgactg a 21<210>67<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>67gccgaacttc tcctgccttg a 21<210>68<211>19<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>68gtattcaata gcctgtatg 19<210>69<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>69gacttctctc cttgtcactc 20<210>70<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>70acgagtcagc taaagtgggc a 21<210>71<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>71gacacctctg ctgtgaagta a 21<210>72<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>72ggagctgaga aaccgaagtt t 21<210>73<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>73gacttggcca ataaagtcct a21<210>74<211>21<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>74gcctgcgcat agttcctgtg a 21<210>75<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>75tctaccacta ttgactcgcc20<210>76<211>23<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>76agcaaaagca ggggataatt cta 23<210>77<211>23<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>77agtagaaaca agggtgtttt taa 23<210>78<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>78acagtttgtt catttccgag 20<210>79<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>79acttcagggt gttttgctta 20<210>80<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>80gaacccccca aatgtatagt 20<210>81<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>81tgaggaactc tgaaccagct20<210>82<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>82gtcactagtt gcctcgtcag20<210>83<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>83ccgggagaca tactggtaat20<210>84<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>84caaatcaatg ggaaactgaa20<210>85<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>85tgaactgaag tcaggataca20<210>86<211>24<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>86agcaaaagca gggttaataa tcac 24<210>87<211>24<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>87agtagaaaca agggtatttt tcct 24<210>88<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>88ttgcaccttc catcatcctt20<210>89<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>89gtctattccc actaaagagt 20<210>90<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>90tgcatcagag agcacatcct 20<210>91<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>91gaactcgtcc tttatgacaa 20<210>92<211>17<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>92agagcaatgg atcaagt17<210>93<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>93tactatggaa tcaagtactc 20<210>94<211>18<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>94atcaatcatc ttcccgac18<210>95<211>18<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>95agtgtccttc cgtgggcg18<210>96<211>23<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>96agcaaaagca ggagtgaaga tga 23<210>97<211>24<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>97agtagaaaca aggagttttt tcta 24<210>98<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>98tcgttgtttc tgggtgtgtc20<210>99<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>99ttatcatacc cagtgacaca20<210>100<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>100attattattg gttcacacgg20<210>101<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>101ttatgtgtca tgcgatcctg20<210>102<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>102attgttcata ttagcccatt20<210>103<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>103ataggtcagg ttattctggt20<210>104<211>15<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>104agcaaaagca ggtag 15<210>105<211>15<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>105agtagaaaca aggta 15<210>106<211>24<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>106tttactccag ctctatgctg acaa 24<210>107<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>107gatccagcca tttgctccat20<210>108<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>108gaggtgacag gattggtctt20<210>109<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>109catggacaga gcagttaaac20<210>110<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>110gcgagtatca ttgggatctt20<210>111<211>24<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>111agcaaaagca gggtgacaaa gaca 24<210>112<211>24<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>112agtagaaaca agggtgtttt ttat 24<210>113<211>24<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>113ttttttatca ttaaataagc tgaa 24<210>114<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>114aattgcattt ttgacatcct20<210>115<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>115atcttctcta ctatctgctt20<210>116<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>116caagcaatca tggataagaa20<210>117<211>20<212>DNA<213>人工序列<220><223>人工序列的描述引物<400>117ggcgagaaca gctaggtcaa20
权利要求
1.一种分离的核酸分子,其特征在于,包含流感A/Udorn/72(H3N2)菌株的正链即反基因组有义链的完整核苷酸序列,,所述的完整核苷酸序列由以下序列构成(a)SEQ ID NO1、SEQ ID NO3、SEQ ID NO5、SEQ ID NO7、SEQ9、SEQ ID NO11、SEQ ID NO13和SEQ ID NO17,或(b)SEQ ID NO1、SEQ ID NO3、SEQ ID NO5、SEQ ID NO9、SEQ ID NO11、SEQ ID NO13、SEQ ID NO17和SEQ ID NO21。
2.如权利要求1所述的分离的核酸分子,其特征在于,包含流感A/Udorn/72(H3N2)菌株的完整核苷酸序列,所述的完整序列由以下序列构成SEQID NO1、SEQ ID NO3、SEQ ID NO5、SEQ ID NO7、SEQ ID NO9、SEQ ID NO11、SEQ ID NO13和SEQ ID NO17。
3.如权利要求1所述的分离的核酸分子,其特征在于,包含流感A/Udorn/72(H3N2)菌株的完整核苷酸序列,所述的完整序列由以下序列构成SEQID NO1、SEQ ID NO3、SEQ ID NO5、SEQ ID NO9、SEQ ID NO11、SEQ IDNO13、SEQ ID NO17和SEQ ID NO21。。
4.一种分离的核酸分子,其特征在于,包含单个流感A病毒的正链即反基因组有义链的核苷酸序列,所述的序列选自SEQ ID NO1、SEQ ID NO3、SEQ IDNO5、SEQ ID NO7、SEQ ID NO11和SEQ ID NO21。
5.如权利要求4所述的分离的核酸分子,其特征在于,所述的流感A病毒核苷酸序列是SEQ ID NO1。
6.如权利要求4所述的分离的核酸分子,其特征在于,所述的流感A病毒核苷酸序列是SEQ ID NO3。
7.如权利要求4所述的分离的核酸分子,其特征在于,所述的流感A病毒核苷酸序列是SEQ ID NO5。
8.如权利要求4所述的分离的核酸分子,其特征在于,所述的流感A病毒核苷酸序列是SEQ ID NO7。
9.如权利要求4所述的分离的核酸分子,其特征在于,所述的流感A病毒核苷酸序列是SEQ ID NO11。
10.如权利要求4所述的分离的核酸分子,其特征在于,所述的流感A病毒核苷酸序列是SEQ ID NO21。
11.一种分离的核酸分子,其特征在于,包含单个流感A病毒的正链即反基因组有义链的核苷酸序列,所述的序列编码蛋白质,该蛋白质具有选自以下的序列SEQ ID NO2、SEQ ID NO4和SEQ ID NO6。
12.如权利要求11所述的分离的核苷酸分子,其特征在于,所述的流感A病毒核苷酸序列编码序列为SEQ ID NO2的蛋白质。
13.如权利要求11所述的分离的核苷酸分子,其特征在于,所述的流感A病毒核苷酸序列编码序列为SEQ ID NO4的蛋白质。
14.如权利要求11所述的分离的核苷酸分子,其特征在于,所述的流感A病毒核苷酸序列编码序列为SEQ ID NO6的蛋白质。
15.一种分离的氨基酸序列,其特征在于,包含单个流感A病毒氨基酸序列,所述的序列选自SEQ ID NO2、SEQ ID NO4和SEQ ID NO6。
16.如权利要求15所述的分离的氨基酸序列,其特征在于,所述的流感A病毒的氨基酸序列是SEQ ID NO2。
17.如权利要求15所述的分离的氨基酸序列,其特征在于,所述的流感A病毒的氨基酸序列是SEQ ID NO4。
18.如权利要求15所述的分离的氨基酸序列,其特征在于,所述的流感A病毒的氨基酸序列是SEQ ID NO6。
19.一种分离的氨基酸序列,其包含单个流感A病毒的序列,其特征在于,所述的序列是SEQ ID NO8。
20.一种分离的氨基酸序列,其包含单个流感A病毒的序列,其特征在于,所述的序列是SEQ ID NO12。
全文摘要
本文公开了负链流感A病毒菌株A/Udron/72(H3N2)的8个节段的完整核苷酸序列,以反基因组信息提供了完整的基因组序列。这些序列可用于诊断,且通过突变修饰可产生新的流感A变体菌株。
文档编号C07K14/11GK1437612SQ01811649
公开日2003年8月20日 申请日期2001年6月21日 优先权日2000年6月23日
发明者J·M·加拉尔萨, T·E·莱瑟姆 申请人:美国氰胺公司