科学网—标签 - 基因组测序

相关帖子	版块	作者	回复/查看	最后发表

fqng1008 2019-9-16 17:21

（神秘的地球uux.cn报道）据Eurek Alert：两项对远古人类基因组的新研究为我们祖先及其表亲尼安德特人的生活提供了宝贵的线索。科学家们首先对采自一个女性尼安德特人的新基因组进行了测序，后者仅是对尼安德特人所做高品质完全测序的第二个基因组。这一进展证实了有关尼安德特人的若干推测，但它也揭示了新发现的该人种对现代人基因的贡献。尼安德特人在进化上与所有的当代人最为接近，因此它们对人类生物学和历史的研究提供了独特的视角。迄今已经对 5个尼安德特人的基因组进行了测序，然而仅得到一组高品质的数据，该数据来自在西伯利亚发现的一个“阿尔泰尼安德特人”。在克罗地亚凡迪亚（Vindija）某洞穴中发现了3个基因组定义不甚明确的人，在俄国的玛兹梅斯卡亚洞穴（Mezmaiskaya Cave）也发现了一个基因组定义不甚明确的人。 Kay Prüfer和同事在此成功地分析了数十亿个DNA片段，它们采自该克罗地亚洞穴新发现的个体，后者被称作Vindija 33.19，这是一个生活在大约5万2000年前的女性。与先前的研究发现类似，该遗传学数据表明，尼安德特人生活在小型且隔绝的大约有3000人的人群中。从前经过测序的阿尔泰尼安德特人的基因组表明，该人的父母为同父异母或同母异父的兄弟姐妹，这令科学家们怀疑，尼安德特人与家庭成员间的杂交是否常见；然而，新测序的Vindija基因组并未显现类似的乱伦模式，这表明阿尔泰尼安德特人父母间近亲繁殖的极端情况在尼安德特人中可能并非普遍存在。然而，在克罗地亚洞穴中发现的Vindija 33.19看来确实与在该洞穴中发现的经过基因测序的另外3个人中的2人有着共同的母系祖先。作者用 Vindija 33.19基因组来分析尼安德特人、丹尼索瓦人和现代人中的演化分歧和基因流动。他们报告的许多结果包括，在12万年前至14.5万年前，即在克罗地亚尼安德特人与西伯利亚尼安德特人歧化之前，早期现代人基因流入尼安德特人。根据这一新的高品质基因组，作者估计，现代的非非洲人群携带的尼安德特人比例在1.8-2.6%间，这比先前估计的1.5-2.1%要高。最后，他们在该尼安德特人基因组中发现了丰富的新型基因变异，这些变异对现代人都有影响，它们包括与血浆LDL胆固醇浓度、维生素D浓度、饮食失调、内脏脂肪累积、类风湿性关节炎、精神分裂症及抗精神病药反应相关的基因变异。由 Martin Sikora等人所做的第二则研究聚焦于4个古代的但解剖学上属于现代人的基因组，他们是在俄国松希尔的埋葬地被发现的。这些基因组为人们提供了难得一见的旧石器时代晚期的人类社会结构。这些人生活的时期大约在3万4600年前至3万3600年前之间，他们全为男性，而且相互之间没有血缘关系。此外，研究人员在他们中没有发现见于阿尔泰尼安德特人中的近亲繁殖特征。尽管这些人只是一个小型基因群体的一部分，但鉴于这些人中的基因多样性，作者提出，他们的繁殖对象可能来自其狩猎-采集氏族之外。作者还重点介绍了这些基因组与尼安德特人基因组进行比较及混合的方式。

个人分类: 读书笔记|1718 次阅读|0 个评论

老大哥在注视着我们

jiangming800403 2016-8-28 17:30

天网恢恢，疏而不漏。如果基因组测序真得如同验血一样方便，那么老大哥一定会注视着我们，当然任何犯罪也都会无处可藏，除非不留下一丁点痕迹。身份证芯片中都已加载数字化照片和指纹信息，无处不在的监控录像和数字人像识别技术，已经使这个社会正在透明。转载本文请联系原作者获取授权，同时请注明本文来自赵建民科学网博客。链接地址： http://blog.sciencenet.cn/blog-595119-768389.html

个人分类: 读史方舆|2135 次阅读|0 个评论

[转载]千年基因购入最新三代测序系统Sequel

macrogencn 2016-3-4 17:54

作为国内唯一一家通过 Pacific Biosciences CSP （ Certified Service Provider ）认证的公司，千年基因再次与 Pacific Biosciences 达成合作，购入其最新测序系统 Sequel ，并将致力于为研究人员提供更卓越的三代测序服务！千年基因早在 2013 年即与 Pacific Biosciences 达成合作，购入两台三代测序系统 PacBio RS II ，目前已积累丰富的项目经验，各项数据指标已达国内最高水平。其中，读长最长可达 50Kb ， reads N50 高达 25Kb ，数据产出高达 1.4Gb/SMRT Cell 。目前，千年基因已在动植物基因组 de novo 测序、细菌基因组完成图、真菌基因组精细图、宏基因组测序、全长转录本测序、扩增产物测序等领域与研究人员建立广泛合作，相关研究成果也已发表。了解三代测序的应用领域和研究案例： http://www.macrogencn.com/_d277140973.htm 三代测序研究手册独家下载： http://www.macrogencn.com/mod/briefcase/BriefCaseDownLoad.aspx?fid=277131059ip=

1408 次阅读|0 个评论

[转载]千年基因购入最新三代测序系统Sequel

macrogencn 2016-3-4 17:50

作为国内唯一一家通过 Pacific Biosciences CSP （ Certified Service Provider ）认证的公司，千年基因再次与 Pacific Biosciences 达成合作，购入其最新测序系统 Sequel ，并将致力于为研究人员提供更卓越的三代测序服务！千年基因早在 2013 年即与 Pacific Biosciences 达成合作，购入两台三代测序系统 PacBio RS II ，目前已积累丰富的项目经验，各项数据指标已达国内最高水平。其中，读长最长可达 50Kb ， reads N50 高达 25Kb ，数据产出高达 1.4Gb/SMRT Cell 。目前，千年基因已在动植物基因组 de novo 测序、细菌基因组完成图、真菌基因组精细图、宏基因组测序、全长转录本测序、扩增产物测序等领域与研究人员建立广泛合作，相关研究成果也已发表。了解三代测序的应用领域和研究案例： http://www.macrogencn.com/_d277140973.htm 三代测序研究手册独家下载： http://www.macrogencn.com/mod/briefcase/BriefCaseDownLoad.aspx?fid=277131059ip=

819 次阅读|0 个评论

草鱼基因组测序—拉开草鱼分子育种的序幕

热度 1 sciencepress 2015-5-28 16:37

最近，中国科学院水生生物研究所、中国科学院国家基因研究中心以及中山大学等机构的研究人员合作完成了草鱼全基因组序列图谱绘制，揭示了草鱼草食性特征的遗传基础，发现了草鱼性染色体分化与性别决定的新线索，研究结果发表于 Nature Genetics 。《中国科学：生命科学》 2015年第5期“亮点”栏目介绍了这项研究：草鱼因为其典型的草食性特征而得名，其摄食的水生维管束植物富含纤维素，难于消化，易被利用的营养物含量较低。草鱼是如何从水草中获得营养以支持其快速生长的机制一直是研究者们关注的问题。该研究发现草鱼基因组中并不存在纤维素降解酶基因，说明其消化道内的纤维素酶完全来源于肠道微生物，澄清了有关草食性鱼类自身能否产生纤维素酶的争论；研究同时表明草鱼可能通过持续高强度的食物摄入，获得足够的、可利用的营养以维持其快速生长，从而在遗传水平上解开了草鱼如何从水草中获得营养的困惑，这为研发可高效利用植物纤维素的草食性鱼类、甚至肉食性名贵鱼类的人工饵料提供了新思路。性别问题被誉为进化生物学问题中的皇后，作为主要水产养殖对象的鲤科鱼类性别决定与分化机制研究进展缓慢。该研究通过比较草鱼雌性和雄性基因组，发现了 2.38 Mb 雄性特有序列，主要分布在草鱼的第 24 号连锁群上。这些雄性特异的 DNA 片段在其他脊椎动物中找不到同源片段，表明草鱼性别决定基因的形成可能具有种属特异性，为鲤科鱼类性别决定与分化机制研究提供了新的线索。鱼类是人类最后的野生食物资源，水产养殖被认为是满足人类不断增长的对水产动物蛋白需求的唯一途径。草鱼是世界上最重要的淡水养殖对象之一，其 2011 年的养殖产量占全球淡水养殖总量的 15.6% 。中国的淡水养殖对象绝大部分为鲤科鱼类，年养殖产量超过 250 万吨的鱼类全部来自于鲤科。作为鲤科鱼类的代表性物种，草鱼基因组序列的解析不仅将为鱼类基因组演化、性别决定与分化机制等理论问题的探索，而且还将为发掘鱼类重要经济性状相关基因，培育高产、抗病、高效饵料利用等优良养殖鱼类新品种奠定基础。（“ 阅读原文 ” ↙ ）中文： http://life.scichina.com:8082/sciC/CN/abstract/abstract517569.shtml # 英文： http://life.scichina.com:8082/sciCe/EN/abstract/abstract517572.shtml 《中国科学》杂志社微信公众号关注请加： scichina1950 中国科学杂志社或长按识别二维码：

个人分类: 《中国科学》论文|6706 次阅读|2 个评论

遗传学家正在开发利用人类自身的基因敲除资源

热度 2 BJQXKY 2014-11-16 16:32

通过基因组测序获取人类自身具有基因敲除效应的突变，并用于新药研发。最近几十年来，生物学家多通过在小鼠或其它实验动物身上失活某个基因，来观察这个基因的功能。现在，这种基因敲除研究功能的模式又有了新的更理想的物种材料：人类自身。这种方法可不是在实验室中通过基因工程制备基因突变的人，研究人员通过扫描成千上万的人基因组序列，来找寻自然发生的突变所致特定基因的失活。通过观察这些突变如何影响健康，研究者希望获得疾病的生物学根源并找到新的治疗方式。上周，在加州圣地亚哥美国人类遗传学会（American Society of Human Genetics）会议的分会上，遗传学家们讨论了几个这种大规模（测序数据）的效应。来自波士顿马省总医院（Massachusetts General Hospital）的基因组学家Daniel MacArthur说，我们现在所有认知都是基于小鼠和大鼠的，而不是基于人。他的研究组通过扫描超过9万人的基因组蛋白编码区，已经确定了大约15万自然发生的敲除基因。“现在我们发现了拥有某特定失活的基因或者特定修饰的基因，我们可以直接利用他们来检验我们的假设。” 平均每个人都携带不少的基因突变，这些突变导致至少1/200基因出现单拷贝失活，导致大约1/20基因出现双拷贝失活。然而，要得到任何一个基因的敲除突变却很难，因此需要大人口规模来研究其效应。这类功能缺失突变在特定疾病中，例如囊肿性纤维化，被广泛采用。多数突变对健康并无损害，一些突变甚至对携带者有益。“（很多突变不导致疾病，因而）我们不会在临床上看到，但是他们在生物学上依旧很有价值”，MacArthur说。MacArthur及其研究组将关注的焦点放在基因组数据上，他们现在也开始通过患者—健康记录来确定哪些相对微弱突变导致的效应。在一项有超过36000芬兰人参与的研究中(发表于PLoS Genet. 10, e1004494; 2014)，MacArthur的团队发现缺失LPA基因可能有心脏保护作用，另一种敲除突变，在芬兰人中有2.4%的人为该突变的单拷贝携带者，这种突变如果以双拷贝形式出现则容易引起流产。来自休斯敦德克萨斯健康科学中心大学（University of Texas Health Science Center）的Bing Yu在会上说，他的团队将1300多人的基因敲除突变数据与血液中300多分子数据进行了比较，发现基因SLCO1B1上的突变与高脂肪酸有关，高脂肪酸是心衰的风险因子。英国Hinxton之Wellcome Trust Sanger研究所的一个研究组报道：在小鼠中失活具有致死效应的43个基因，在人体这些基因失活时人却看上去很健康，并不呈现致死效应。宣传海报循此方式可以揭开人类成千未知功能基因的作用。并可能通过确认具有保护性（让人免受疾病困扰）的基因或生物通路来辅助药物研发。人类基因敲除效应的宣传海报（poster child）是一类新型、可阻止PCSK9基因的药物(Nature 496, 152–155; 2013)。PCSK9基因于本世纪早期在高胆固醇的法国人家族中被发现。研究人员很快发现：在PCSK9单拷贝失活这样一种罕见突变的人群中，胆固醇含量低，少有心脏病发生。首个阻止PCSK9的药物将在明年问世，并预计将在未来五年中产生250亿美元的销量收入。加利福尼亚La Jolla 的Scripps转化科学研究所（Scripps Translational Science Institute）领导人Eric Topol说：“将来会有成百，甚至成千个像PCSK9基因样的故事出现”。药物可以模拟功能缺失型突变。耶鲁大学的生物信息学家Mark Gerstein则认为人类基因突变在研发对抗衰老性疾病上尤其重要。“你可以想象状况，某个基因在你25岁的时候对你有益，而当你75岁的时候，它或许对你有害了。” “通过这些基因敲除数据，你可以很容易的理解为什么现在越来越多的人通过基因组测序来寻求自己所患疾病的解决方案了吧。” 伦敦肿瘤研究所的医学遗传学家Nazneen Rahman说。Rahman在会上展示了她的团队对1000个英国人基因敲除突变的分析结果，这些信息确切无疑的表明发生在人类的基因突变比我们先前想象的多许多。当利用人类基因组测序确认某神秘疾病的原因时，任何敲除型突变都是可疑原因。拥有这些突变的完整清单以及它们的健康效应（或缺失效应）将有助于确定疾病的真是原因，Rahman说。 MacArthur团队上周刚释放了约63000人的敲除及其它信息，已经有人在利用这些数据了。“当我看到这些数据时，我做的第一件事就是查看我所关注的严重疾病有关基因，看是否有什么人在这数据库中，我还真发现了他们”，休斯敦Baylor医学院的医学遗传学家John Belmont说。（在这批人中）他发现了11个人携带有马凡氏综合征相关基因突变，这种疾病累及结缔组织，如果不予处理，可能引起突发性心力衰竭。这些人可能属于潜伏的病例，他们可能具有保护性、防止发病的机制或是存在测序错误。Belmont说：“携带致病的突变基因而不发病，这太迷人了，我们将好好关注它，这或许预示着马凡氏综合征的新治疗方式。链接： http://www.nature.com/news/geneticists-tap-human-knockouts-1.16239 文题：Geneticists tap human knockouts Sequenced genomes reveal mutations that disable single genes and can point to new drugs. Ewen Callaway For decades, biologists have studied gene function by inactivating the gene in question in mice and other lab animals, and then observing how it affects the organism. Now researchers studying such gene ‘knockouts’ have another, ideal model at their disposal: humans. The approach does not involve genetically engineering mutant people in the lab, as is done in mice. Instead, researchers scan the genomes of thousands or millions of people, looking for naturally occurring mutations that inactivate a particular gene. By observing how these mutations affect health, researchers hope to gain insight into basic biology and to unearth new disease treatments. Geneticists discussed several such large-scale efforts during a packed session at the American Society of Human Genetics meeting in San Diego, California, last week. “So much of what we know is based on mice and rats, and not humans,” says Daniel MacArthur, a genomicist at Massachusetts General Hospital in Boston, whose team identified around 150,000 naturally knocked-out genes by trawling the protein-coding portion of the genome, or exome, in more than 90,000 people. “Now we can find people who actually have a particular gene inactivated or somehow modified, and that allows us to test hypotheses directly.” On average, every person carries mutations that inactivate at least one copy of 200 or so genes and both copies of around 20 genes. However, knockout mutations in any particular gene are rare, so very large populations are needed to study their effects. These ‘loss of function’ mutations have long been implicated in certain debilitating diseases, such as cystic fibrosis. Most, however, seem to be harmless — and some are even beneficial to the persons carrying them. “These are people we’re not going to find in a clinic, but they’re still really informative in biology,” says MacArthur. His group and others had been focusing on genome data, but they are now also starting to mine patient-health records to determine the — sometimes subtle — effects of the mutations. In a study of more than 36,000 Finnish people, published in July (E. T. Lim et al. PLoS Genet. 10, e1004494; 2014), MacArthur and his team discovered that people lacking a gene called LPA might be protected from heart disease, and that another knockout mutation, carried in one copy of a gene by up to 2.4% of Finns, may cause fetuses to miscarry if it is present in both copies. Bing Yu of the University of Texas Health Science Center in Houston told the meeting how he and his collaborators had compared knockout mutations found in more than 1,300 people with measurements of around 300 molecules in their blood. The team found that mutations in one gene, called SLCO1B1, were linked to high levels of fatty acids, a known risk factor for heart failure. And a team from the Wellcome Trust Sanger Institute in Hinxton, UK, reported that 43 genes whose inactivation is lethal to mice were found to be inactivated in humans who are alive and apparently well. Poster child Following up on such insights will help researchers to unpick the functions of the thousands of human genes about which little or nothing is known, say MacArthur and others. It might even aid drug discovery by identifying genes or biological pathways that could protect against disease. The poster child for human-knockout efforts is a new class of drugs that block a gene known as PCSK9 (see Nature 496, 152–155; 2013). The gene was discovered in French families with extremely high cholesterol levels in the early 2000s. But researchers soon found that people with rare mutations that inactivate one copy of PCSK9 have low cholesterol and rarely develop heart disease. The first PCSK9-blocking drugs should hit pharmacies next year, with manufacturers jostling for a share of a market that could reach US$25 billion in five years. “I think there are hundreds more stories like PCSK9 out there, maybe even thousands,” in which a drug can mimic an advantageous loss-of-function mutation, says Eric Topol, director of the Scripps Translational Science Institute in La Jolla, California. Mark Gerstein, a bio-informatician at Yale University in New Haven, Connecticut, predicts that human knockouts will be especially useful for identifying drugs that treat diseases of ageing. “You could imagine there’s a gene that is beneficial to you as a 25-year-old, but the thing is not doing a good job for you when you’re 75.” The human-knockout data presented last week will also make it easier to interpret the growing number of genomes being sequenced from people seeking treatment, says Nazneen Rahman, a medical geneticist at the Institute for Cancer Research in London. Rahman told the meeting about her team’s analysis of knockout mutations in 1,000 British people. “The take-home message is that these types of mutations occur much more commonly than people thought,” she says. When a person’s genome is sequenced to identify the cause of a mysterious illness, any knockout mutations that turn up are obvious suspects. Having a complete list of such mutations and their health effects (or lack thereof) should help to identify the true causes of a disease, Rahman says. To that end, MacArthur’s team last week released knockout and other data from some 63,000 people — and others are already making use of this trove. “One of the first things I did when they released the data was to look at all my favourite genes for very severe diseases, to see if there are people in those databases — and there are,” says John Belmont, a medical geneticist at Baylor College of Medicine in Houston. He found 11 people with mutations associated with Marfan syndrome, a disorder that affects connective tissue and that can cause sudden heart failure if left untreated. However, these people may be silent cases, they might somehow be protected against the disease or their genomes might have been sequenced incorrectly. People who carry disease-causing mutations but don’t get sick are especially intriguing, he says. “We should pay attention to them,” Belmont says. “They may hold a key for new treatments.” 上文由群晓科苑翻译整理，科学推广，服务民众。他人或机构如需使用，请提供该原始链接地址。北京群晓科苑生物技术有限公司主要经营生物医药领域的试剂、耗材和仪器。群晓生物致力于为用户提供优质的材料、技术和实验整体性解决方案，完美配合用户的科研创意和灵感。群晓生物立志把一流的产品、专业的技术和完善的售后服务献给广大用户。北京群晓科苑生物技术有限公司 www.qbioscience.com www.qbiotec.com Tel: 010-84504282/64880108 qbioscience@126.com

个人分类: 科苑杂话|2445 次阅读|2 个评论

[转载]基因组揭示蜜蜂（Apis mellifera）约30万年前起源于中亚

zhuchaodong 2014-9-21 20:19

Study Finds Honey Bees Originated from Asia, not Africa 体会： 1、重新审视蜜蜂（Apis mellifera）在亚洲的进化历史和迁移路径； 2、审视少数基因片段信息在系统学上的局限性； 3、重视基因组数据在种群和物种水平上的应用。 Title: A worldwide survey of genome sequence variation provides insight into the evolutionary history of the honeybee Apis mellifera Authors: Andreas Wallberg , Fan Han , Gustaf Wellhagen , Bjørn Dahle , Masakado Kawata , Nizar Haddad , Zilá Luz Paulino Simões , Mike H Allsopp , Irfan Kandemir , Pilar De la Rúa , Christian W Pirk Matthew T Webster Abstract: The honeybee Apis mellifera has major ecological and economic importance. We analyze patterns of genetic variation at 8.3 million SNPs, identified by sequencing 140 honeybee genomes from a worldwide sample of 14 populations at a combined total depth of 634×. These data provide insight into the evolutionary history and genetic basis of local adaptation in this species. We find evidence that population sizes have fluctuated greatly, mirroring historical fluctuations in climate, although contemporary populations have high genetic diversity, indicating the absence of domestication bottlenecks. Levels of genetic variation are strongly shaped by natural selection and are highly correlated with patterns of gene expression and DNA methylation. We identify genomic signatures of local adaptation, which are enriched in genes expressed in workers and in immune system– and sperm motility–related genes that might underlie geographic variation in reproduction, dispersal and disease resistance. This study provides a framework for future investigations into responses to pathogens and climate change in honeybees. August 25, 2014 by Entomology Today Leave a Comment Photo by Alexander Wild. http://www.alexanderwild.com In a study published in Nature Genetics, researchers from Uppsala University present the first global analysis of genome variation in honey bees. The findings show a surprisingly high level of genetic diversity in honey bees, and indicate that the species most probably originates from Asia, and not from Africa as previously thought. “The evolutionary tree we constructed from genome sequences does not support an origin in Africa,” said Matthew Webster, one of the authors. “This gives us new insight into how honey bees spread and became adapted to habitats across the world.” Another unexpected result was that honey bees seem to be derived from an ancient lineage of cavity-nesting bees that arrived from Asia around 300,000 years ago and rapidly spread across Europe and Africa. Extensive losses of honey bee colonies in recent years are a major cause for concern. Honey bees face threats from disease, climate change, and management practices. To combat these threats, it is important to understand the evolutionary history of honey bees and how they have adapted to different environments across the world. “We have used state-of-the-art, high-throughput genomics to address these questions, and have identified high levels of genetic diversity in honey bees,” Webster said. “In contrast to other domestic species, management of honey bees seems to have increased levels of genetic variation by mixing bees from different parts of the world. The findings may also indicate that high levels of inbreeding are not a major cause of global colony losses.” Also hidden in the patterns of genome variation are signals that indicate that climate change has strongly impacted honey bee populations historically. “Populations in Europe appear to have contracted during ice ages, whereas African populations have expanded at those times, suggesting that environmental conditions there were more favorable,” said Webster. The researchers also identified specific mutations in genes that are important in adaptation to factors such as climate and pathogens, including those involved in morphology, behavior, and innate immunity. “The study provides new insights into evolution and genetic adaptation, and establishes a framework for investigating the biological mechanisms behind disease resistance and adaptation to climate — knowledge that could be vital for protecting honey bees in a rapidly changing world,” Webster said. Read more at: - A worldwide survey of genome sequence variation provides insight into the evolutionary history of the honeybee Apis mellifera

2515 次阅读|0 个评论

基因组测序一个科普-完整基因组的价值

热度 3 gaoshannankai 2014-4-7 18:44

最近，基于illumina等二代测序的项目多如牛毛。全民测序的时代，测序价格急转直下。以前600万的一个植物基因组，到现在变成了60万。几个人来咨询项目，我建议他们测完整基因组（complete genome）。这对他们学科后续科研价值很大，如果仅仅是基因组草图（draft genome），就为了发发文章，我就建议他们找公司。后来，我报价100万，算合作，基本上是白干，就为了搞搞学术。人家听了，差点臭骂我一次，告诉我某知名大公司才报价 80万，一般60万，都是商业价格，说我以为国内都是傻子，回国来蒙人。我怎么说呢， complete genome你懂么，只能有1%的错误，包括gap，难度很大的。二代测序是带来了基因组的爆发，但是99%出来的还只能是草图，草图和完整图的区别，还是巨大的。只有完整图才有价值，很多分析，包括比较基因组，都必须有某个属或者科的一个完整图作为标杆，草图很多情况下就是一个toy。这个区别请看看cns文章里的评述。当前，植物领域能被承认，或者有人使用的，完整基因组没有几个，而且大部分还是一代测序的结果。希望这些不是“傻子”的朋友，好好了解一下。另外，微生物现在拿到完整图很便宜，没有必要为了省几千，拿到一个草图，好几千个片段，有意义么？

3151 次阅读|4 个评论

产黄青霉高产工业菌株的全基因组测序和分析

热度 1 junjim 2014-4-2 20:16

产黄青霉是生产青霉素的主要菌株，为丝状真菌。我们利用ABI3730等第一代和第二代测序仪，对华北制药集团的产黄青霉高产工业菌株（NCPC10086）进行全基因组从头测序，组装出32.3Mb高质量的基因组图谱，并进行基因预测和注释。接着将高产菌株的基因组与低产菌株Wisconsin54进行全基因组比对，发现了一些有趣的突变，详细情况请看： Genome sequencing of high-penicillin producing industrial strain of Penicillium chrysogenum 这是第一个公开发布全基因组序列的产黄青霉工业菌株，为广大的生物医学研究者提供大量宝贵的资料。希望大家在此基础之上挖掘出更好的故事，并且欢迎交流与合作。文章原文： 1471-2164-15-S1-S11.pdf

个人分类: 我的论文|3634 次阅读|2 个评论

四川农大博士生覃成在PNAS上发表辣椒基因组测序论文

热度 1 lipng6575 2014-3-4 19:54

　　3月3日，以四川农业大学为第一作者单位、玉米所作物遗传育种学博士生覃成为第一作者，玉米所沈亚欧研究员为共同第一作者，玉米所张志明研究员为最后责任作者，联合遵义市农业科学研究院、华南农业大学、深圳华大基因研究院、墨西哥生物多样性基因组学国家重点实验室等13家科研院所合作完成的辣椒基因组研究成果发表在国际著名期刊Proceedings of the National Academy of Sciences USA(PNAS)》(《美国国家科学院院刊》)(该期刊2013年影响因子为9.737)上，文章题目为“栽培辣椒和野生种辣椒的全基因组测序揭示了辣椒属的驯化和特异性”。该研究公布了我国辣椒(Capsicum annuum L.)种质资源遵辣1号的基因组测序结果(http://peppersequence.genomics.cn)。辣椒是全球最重要的经济作物之一，开展辣椒基因组学研究，深入挖掘与利用辣椒重要功能基因，将加快分子育种和全基因组育种的进程。辣椒基因组的成功解析及其群体遗传学研究，将为培育高产、优质和抗病的新品种辣椒奠定坚实的遗传学基础，同时也为该物种的基础生物学研究提供宝贵的资源。

个人分类: 论文简介|5959 次阅读|2 个评论

你想测自己基因组序列吗

热度 6 xudabin98 2013-6-11 01:08

1984 年12月美国犹他大学的雷蒙.怀特受能源部委托组织召开了一个小型学术会议，该会议围绕检测病人突变基因的方法进行了讨论。会议结束后得出的结论是：要想有效的检测出病人突变的基因，最好的方法是绘制出人类基因组序列作为参考。1985年5月美国能源部提出了测定人类基因组全部序列的草案。1986年3月，诺贝尔奖获得者达尔贝克在Science上撰文《癌症研究的转折点—人类基因组全序列分析》，在该篇文章中首次提出了“人类基因组计划”这一概念。此后，经过美国能源部、科学院、国会和国立卫生研究院组成的专家组反复论证五年之久，1990年10月1日，美国国会正式批准启动人类基因组计划，投资达30亿美元，利用15年时间，于2005年构建完成详细的人类基因组遗传图和物理图。由于测序技术的进步和美国赛莱拉公司的强势竞争，2000年6月26日发布人类基因组序列框架图，2001年2月更详细的序列图发布。该计划在提出时也讨论了其长远意义，比如在基础科学上，通过对人类所携带全部遗传信息解读，从分子层面上揭示人体生长发育和各种生命活动的机制，以及衰老死亡和疾病发生的分子机理。在医学应用上，则通过检测个人的遗传信息，确定自己可能会发生的疾病，从而在尽可能的情况下预防疾病的发生。例如，当一个妇女通过基因测序发现自己携带BRCAI突变基因，那么意味着该妇女在患乳腺癌或者卵巢癌的可能性上增大，当然检测到携带突变基因并不意味着就一定会发病，同时还受环境因素等的影响。这时，该妇女可选择是否通过切除双乳腺来预防癌症的发生。最近美国影星茱莉乳腺切除就是对这一意义最好的诠释。该计划的实施也推动了测序技术的飞速发展，目前个人基因组测序服务周转时间已低至2周。生物学家曾经描绘的美妙蓝图：“每个人都将拥有自己的基因组序列，并且可以将这些信息装在一个U盘中”，如今想要成为事实已不是什么难事。全球知名测序仪器制造公司Illumina，其总裁兼CEO Jay Flatley曾表示：“Illumina一直相信测序将成为临床上的主流做法。全基因组测序在诊断和治疗疾病上的潜力正迅速获得认可，特别是对于那些光凭症状无法判定的疾病。此时，快速的全基因组测序能提供遗传组成的整体信息，让医生能做出更明智的决定，病人也能更快得到答案。”同时国内很多高通量测序公司也在极力鼓吹个人基因组测序在个性化医疗上的作用，而我们一旦生病进入医院，又常常听从医生的指挥，各种检测与治疗都由他们制定，但其中很多问题却不能忽略，也不应该忽略。随着技术的飞速发展，传统的道德伦理在当今的社会生活中出现了前所未有的困惑局面，比如人的克隆、胚胎干细胞。高通量测序技术的日益成熟，使得测序价格一降再降，是否就应该直接应用在医疗产业上来辅助治疗，其意义有多大？是否违背伦理？比如亨廷顿舞蹈病，通过基因组测序获知其突变信息，结果是不容质疑的，然而对于疾病的解决目前来看是毫无意义的。而对于尚未发病的人，检测结果被病人获知后是否会患上抑郁乃至自杀？这样的检测结果意义何在？又比如产前无创检测，当我们得知胎儿出生后可能会出现某种缺陷，选择流产对于一个家庭以后的负担与幸福生活具有积极的一面，然而我们是否又有权利将将还未见到一丝光明的婴儿杀死于腹中？当个人遗传信息泄露后，是否会遭受社会歧视，影响个人的婚恋、学习、交友？在更多的伦理问题尚未形成一个标准前，以技术发展乃至金钱驱动为目的的个人信息测序应当让当事人有足够的知情权。对待生命伦理问题上，国际社会出台了一系列相关法律文件，比如纽伦堡法典、贝尔蒙报告、世界医学协会赫尔辛基宣言等等。在赫尔辛基宣言中就明确写道：医学研究中，医生的职责就是保护人类受试者的生命、健康、隐私和尊严。涉及人的生物医学研究必须遵循普遍接受的原则，必须以对科学文献及其他有关信息的全面了解为基础，以充分的实验室和动物实验为基础。纽伦堡法典也明确提出：受试者的自愿同意绝对必要。在对待个人基因组序列测定上，不能也不应该只打着临床监测需要或者其他目的而滥加利用与宣传。因此，当个人想要获得自己遗传信息时，一定要有足够的知情权并在目前的情况下承担相应的风险，而不仅仅是听从各方面的鼓吹。

11602 次阅读|24 个评论

[转载]共享信息数据给力科学发现，揭示科研新模式

shawn360 2011-9-7 17:33

2011年7月28日，由深圳华大基因研究院、德国汉堡—埃普多夫大学医学中心、中国疾病预防与控制中心和军事医学科学院微生物流行病研究所等单位主导完成的德国致病性大肠杆菌国际合作研究成果在国际著名杂志《新英格兰医学》（ The New England Journal of Medicine ，NEJM）上在线发表。德国致病性大肠杆菌研究项目首次展示了快速的基因组测序技术和及时的数据共享给全球各科研领域所带来的巨大贡献，证实了信息数据的快速共享在公共卫生事件中可发挥至关重要的作用，同时也为应对全球重大突发性紧急公共卫生事件提供了一个全新的解决思路。在此次疫情中，华大基因和德国汉堡—埃普多夫大学医学中心的研究人员以最快的速度完成对致病菌的基因组测序及分析并向全球免费公开所有数据，并声明在公共领域许可证范围内，所公开的数据没有使用限制。该数据库的公开使得整个科学界在第一时间共享了相关数据信息 ( http://www.sciencemag.org/content/332/6035/1249.full )，省去了从数据产生到文章发表的时间，为了尽快控制疫情奠定了重要的科学基础。据了解，华大基因的相关研究人员在收到样品后，3天内完成了大肠杆菌基因组的测序和初步组装，确定该菌株属于血清型O104，并及时将相关基因组序列数据公开分享，促进了全球生物信息学家的通力合作；在数据发布的24小时之内，完成了对该大肠杆菌的基因组组装；在数据发布的2天后，研究人员发现引起此次爆发的菌株与2001年在欧洲分离的菌株01-09591及2002年的中非分离株55989有高度相似性并展开进一步分析比较；5天后，研制出该大肠杆菌的诊断试剂盒并向全球范围内无偿提供检测实验方案及所需引物序列信息；2周内全球的研究人员在维基百科中公开了24篇关于该菌株的毒力、抗性基因、进化等方面最新的研究报告。华大基因微生物项目负责人覃俊杰博士说：“为尽快揭示德国大肠杆菌的致病机理及控制疫情的恶化，我们与合作方共同决定采用这种快速的数据共享方式，及时公开我们获得的相关数据，希望能够为全球生物信息学家及流行病研究学者的广泛合作提供基础支持，加速对该致病性大肠杆菌的研究进程。” 军事医学科学院微生物流行病研究所全军微生物检测研究中心主任、国家“973”首席科学家、国家科技重大专项课题负责人杨瑞馥教授补充说：“该研究成果不仅对德国致病性大肠杆菌的进化起源和致病性等方面的研究打下了坚实的基础，更为重要的是为全球其他传染性疾病的应对机制提供了一个新的国际协作模式。” 华大基因研究院副院长王俊教授表示：“基因组测序为鉴定和了解新型病原奠定了重要的基础，信息数据的免费、快速发布及共享能够极大地加速疾病诊断和治疗的过程，对大规模突发性公共卫生事件的快速应对具有十分重要的意义。” 德国负责传染病监督及预防的罗伯特-考赫学院7月26日宣布，肠出血性大肠杆菌疫情在德国的爆发已经结束。这场疫情最终导致德国范围内50人死亡，仅汉堡医院就有3496名病人被诊断为感染肠出血性大肠杆菌，在德国以外的欧洲地区也发现了76名患者。另据悉，华大基因（BGI）和BioMed Central（简称 BMC ，生物医学中心）共同主办的《GigaScience》杂志已开始征集投稿并计划于今年11月份正式上线。该杂志是一个面向所有使用或产出海量数据的生物学研究者的开放阅读期刊，主要采用全文文献与大型数据库相结合的崭新模式，为广大科学工作者提供前沿、有效的数据以及生物学发现等资源。具有DOI的德国大肠杆菌基因组数据库使得大肠杆菌成为第一个以这种方式公开数据的物种。此外，研究人员也可以按照同样的方式向数据库中添加已经经过证实的有价值的信息，以不断更新研究发现，并促进全球科学的快速发展。【注！转自华大基因研究院，原文链接已失效】

1908 次阅读|0 个评论

微生物基因组测序的应用

bioyong 2011-8-22 13:18

July/August 2011 By Tracy Vence 一、传染病暴发的时代致死性大肠杆菌传染病在欧洲暴发，至本报道发表前，已导致 27 人死亡， 2800 人染病。在拿到明斯特大学医院卫生研究所的研究人员分离的致病株后， Guenther 所在的 Life Technologie 公司德国实验室利用 Ion PGM 仅用了 3 天时间就完成了该致病型大肠杆菌菌株的全基因组测序和序列的组装，为快速研究该致病菌株的致病机理创造了条件。与此同时华大基因与德国汉堡 -Eppendorf 医疗中心合作，也宣布完成了对致病菌株的测序工作。 Guenther 说：“在有限的时间里完成了对微生物的全基因组测序，极大的方便了研究者从一个整体的水平上去研究微生物，进而揭示在这些目标微生物的基因组究竟发生了哪些改变。”事实上也的确如此，科学家根据从基因组测序的数据所获得的证据，将本次的致病型大肠杆菌鉴定为致病型大肠杆菌的一个新杂交品种，并且携带了一些抗性基因。“从宏观的基因组水平上来研究这类细菌，将在很大程度上革新我们对传染病暴发的认识， 3-4 天内完成对某种微生物的全基因组测序及基因标注，将会开启一个新的研究领域。” 在新奥尔良召开的美国微生物学会年度会议上，一些研究者指出，分子鉴定的方法正被用来打造基因组传染病学这一领域，基因组传染病学致力于重构传染病暴发的过程，以求在将来能够对传染病能进行实时有效的监控和快速反应。二、一个新的领域伯明翰大学的研究者 Mark Pallens 说：“在过去的 18 个月，也许就是基因组遗传病学学科诞生的时期。这段时间里在该领域发表的文献至少有十几篇。”最近伴随着基因组测序技术的进步，研究传染病的起源和病原微生物的动态演变将变得极为方便，传统的方法要求将目标序列导入到大肠杆菌中进行分子克隆。现在只要你有序列模板将会有很多种方法获得序列数据”在将来，单分子测序技术将会有更大的突破，基因组遗传病学也可能会跟着兴盛起来。 Pallen 指出在 2001 年美国司法部开始调查“炭疽杆菌信件”事件 —— 一些政客和媒体收到到含有炭疽杆菌的信 —— 这开启了基因组传染病学研究的先河。在 2002 年美国联邦调查局与美国基因组研究所合作，将信件中的炭疽杆菌与野生的进行了比对。三、高分辨率的病原菌鉴定与追踪在新一代测序测序技术开始前，许多研究者使用传统的分子分型的方法来研究致病菌种种群的结构及种群的结构的进化。 1998 年，伦敦帝国理工学院的 Brian Spratt 与其同事发明了一种叫 MLST （多位点序列分型）的方法， Spratt 说“这是一项重要的技术发明，我们可以利用该方法来鉴定与人类健康相关的一些微生物，特别是一些传染病源，并探究细菌种群在特定生物地理环境下的演化。” 尽管如此， MLST 往往不能区分在同一菌种内关系密切的两个菌株之间的差异。 Spratt 说“其实很多时候我们不能依靠 MLST 来解决问题，对许多菌种 MLST 难以奏效。在这个时候我们需要使用全基因组测序来彻底解决问题，即对分离菌株进行全基因组测序。” Sanger 研究所的 Julian Parkhill 发现对于一种金黄色葡萄球菌分离株，用 MLST 和一些标准的分子分型方法没有检出显著差异。 ASE 的 Parkhill 说：“金黄色葡萄球菌，具有复杂的致病机理，它们拥有一个庞大的族群和丰富的种群多样性，致病性同 MLST 型很难对应上，二者之间的联系是微妙的。” 最近 Parkhill 对金黄色葡萄球菌 ST239 的 63 个分离株进行了全基因组测序，其中 20 个菌分离自近 7 个月（来自同一家医院），其它 43 株菌来自全世界各地（ 1982—2003 的保存菌），在这些菌中一共鉴定出 6700 个 SNPs 其中有 4500 个位于金黄色葡萄球菌的核心序列区，值得注意的是这些菌用传统的分型方法不能区分。由此，看出许多表面上相似的菌株，仍然存在着较大的遗传上的差异。通过对这些菌在基因组上的差异进行比较，构建系统发育树， Parkhill 小组即可追踪 ST239 （ 2002 年伦敦大范围传染病的病原菌）在人与人之间，大陆与大陆之间的传播。这一研究结果于 2010 年 1 月份发表在《科学》期刊上，文中清楚的阐述了伦敦传染病的病原菌来自于东南亚。 Parkhill 说：“通过全基因组测序我们可以鉴定出许多潜在的差异位点，进而我们可以追踪病原菌的传播，并将它与人类种群相联系。” 四、检测与应对美国食品与药物管理局对沙门氏杆菌的发病率进行评估，发现在美国每年有 200-400 万人因该菌致病，沙门氏杆菌的发病率正逐步攀升。据 FDA 相关人员 Cao 透露，在美国有 11% 的食物性疾病由该菌引起。为了检测该类传染病， Cao 及其同事对各类沙门氏杆菌致病菌株进行了全基因组测序，以求从中找到新的 SNPs 类靶向位点。最近 Cao 及其研究组在《新英格兰》医学杂志上报道，他们测序的 35 株肠道型沙门杆氏菌属 . 蒙得维的亚血清型，该菌在 2009 年引发传染病，最先是发现存在于肉类食品生产流程里作为食品添加剂使用的红椒与黑椒中。对研究使用的 35 株菌：一部分分离自个体生产户生产的食品添加剂，一些分离自食用了该添加剂的感染者，还有一些是取自一定历史时期地理环境中（以此作为背景）。 Cao 说：“测序技术帮助研究者很快的找到了与病原菌暴发相关的带菌源 —— 一个国内的温室。而应用该技术可以揭示基因型上的微小差异，这对在追踪致病菌在食品上的出现至关重要”。 Cao 说在将来， FDA 打算用全基因组测序的方法做更多的关于传染病的研究，而在 2011 年他们将对 400 株沙门杆氏菌进行测序。五、方法上的选择据伯明翰大学的 Pallen 说：“许多研究者都习惯于采用 19 世纪的技术来应对来自于 21 世纪微生物诊断方面问题的挑战。我们的许多工作都有赖于巴斯德时期就发明的显微镜技术和纯培养技术，现在是时候采用新一代的高通量技术来诊断病原微生物了！” 因为基因组遗传病学研究将基因组测序数据做为首要的数据源，研究者可以绕开复杂的表型研究与检测，仅仅关注于基因组即可。这是一方法让人难以置信的，但是如果我们回过头来审视过去的三年，也许我们就不会感觉那么诧异了。伦敦的 Spratt 对基因组流行病学的发展前景持乐观态度，他说：“对微生物菌株进行全基因组测序，获取基因组序列数据，将变成一种习以为常的事情，测序的价格也将趋于合理。它将成为规则的改变者！” 基因组网的一个调查：在微生物鉴定方面，全基因组测序能否取代 MLST ？ 16% 认可全基因组测序在五到十年内取代 MLST 54% 认可全基因组测序 (WGS) 在五年内取代 MLST 21% 认为可能会， WGS 将与 MLST 互补使用 4% 不认为会取代，条件上的限制使 WGS 不能取代 MLST 5% 不认为会取代，对于某些菌种来说 MLST 要比 WGS 更为有效新闻源自 www.genomeweb.com , 毅新兴业供稿，更多信息请点击 www.bioyong.com

个人分类: 生物行业通讯|4379 次阅读|0 个评论

Knome瞄准科研市场提供5000$的全基因组测序及分析服务

bioyong 2011-8-15 13:33

将目光投向研究服务， Knome 提供 $5K 的全基因组测序和分析服务 2011 年 8 月 9 日， Julia Karow 报道从 2007 年开始为用户提供个人全基因组测序和分析服务至今，测序公司 Knome 开始将目光投向更诱人的研究服务市场。上周，该私人测序公司与美国证券交易委员会 (SEC) 在文件中透露，公司一次性融资 500 万美元，目标是 2000 万美元。就在这个初夏， Knome 开始使用新版的生物信息分析平台，即 kGAP 2.0 。提供全基因组测序、注释和分析服务，促销价格为 $4,998 / 基因组，一次性订单至少需要有十个基因组样本。英国哥伦比亚大学的研究者近期发表文章，利用 Knome 的基因注释技术发现了与遗传性的帕金森病相关的突变。该研究中， Knome 对两个人的外显子组进行了测序和分析。根据 Knome 的 CEO Jorge Conde 的说法，公司在今天第一季度的销售收入要比去年所有收入更高。大多数订单不是来自个人基因组测序，而是来自行业和学术界的研究人员。 Conde 还说：“目前我们考虑可以服务的而且可有效服务的市场是来自研究领域”。 Knome 并没有透露有关部门订单量和销售额的细节，但是 Illumina 公司说 Knome 已经从 Illumina Genome Network （ IGN ）全基因组测序服务购买了 250 个人类基因组服务。 IGN 目前对 50 个基因组以上的订单收费为 $4,000/ 基因组。 Conde 说 Knome 公司会根据客户的需求从 Illumina, BGI 和 Complete Genomics 外购测序服务。根据 Knome 的网站信息，研究人员使用 KnomeDiscovery 服务项目可以得到 30x 深度的全基因组测序；使用 "project-driven curation” 进行基因注释；基因组比较；可疑变异、基因和基因网络的列表；桌面软件用于进一步分析，包括发现潜在变异位点的 KnomeFinder ，发现基因 - 基因相互作用和基因网络的 KnomePathways 。用户也可以根据项目的大小以不到总价一半的价格只购买基因诠释服务， Conde 说。 Conde 还表示， $4,998 几乎只有公司最近报价的一半。用作分析测序数据的生物信息分析平台 kGAP 2.0 版不同于旧版本，数据目前采用云计算，它可以同时运行好几个项目。另外 kGAP 的产出是数据库的形式而不是平台文件的形式，方便用户查询和比较基因组。 Conde 承认，人类全基因组测序服务的竞争日益激烈，但是 Knome 公司已经确立了其在测序领域的地位，一是因为经验丰富，二是开发研究工具之前花了很长时间思考研究者的需求，特别是基因组的比较分析。竞争对手 Omicia 近期宣布他们计划在今年年底发布一套临床基因组分析平台。 Conde 说，结果的数据库格式和直接数据查询能力以及不同平台产生的序列数据分析能力是 Knome 公司的最核心的竞争力。同时，公司会吸引缺乏基因组分析经验的研究人员作为目标客户。“我们重点关注非生物信息学专家”， “我们开发软件很注重简单易用性，这是我们的独特优势”。 Knome 曾在去年收到来自 FDA 的警告信 , 信中称 Knome 提供的 DTC （ direct to consumer ）服务需要机构的审核，因为有些用户有可能会将结果数据直接作为临床的决策依据。但是 Conde 说 FDA 的警告信不会影响公司对相关服务的积极性。 Knome 公司已经与 FDA 沟通，提供有关服务的更多信息，但目前还没有回音。 Knome 目前正通过财政、私人投资客、策略投资商筹措资金， BioMérieux 去年投入 500 万美金与之合作。 Conde 说“ Knome 与 BioMérieux 一直都在交换一些专有技术和信息，这几乎成为了一种惯例”，但是详细的信息并未见报道。 Knome 目前尚未透露最近一批投资客的身份。

个人分类: 生物行业通讯|26 次阅读|0 个评论

人类基因组测序（Human Genome Sequencing ）研究动态（30天内）

热度 1 xupeiyang 2011-1-13 10:31

Provider: Quertle (www.quertle.info) Content: text/plain; charset=UTF-8 TY- JOUR TI- Genome of a songbird unveiled AU- Pinaud, Raphael PY- 2011 T2- Journal of Biology J2- J Biol UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2871510/ VL- 9 IS- 3 SP- 19-19 DO- 10.1186/jbiol222 C2- 2871510 N2- An international collaborative effort has recently uncovered the genome of the zebra finch, a songbird model that has provided unique insights into an array of biological phenomena. See research articles , ,and N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Reprogramming of the non-coding transcriptome during brain development AU- Valadkhan, Saba AU- Nilsen, Timothy W PY- 2011 T2- Journal of Biology J2- J Biol UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2871522/ VL- 9 IS- 1 SP- 5-5 DO- 10.1186/jbiol197 C2- 2871522 N2- A recent global analysis of gene expression during the differentiation of neuronal stem cells to neurons and oligodendrocytes indicates a complex pattern of changes in the expression of both protein-coding transcripts and long non-protein-coding RNAs. See research article . N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- LINE-1 Retrotransposition Activity in Human Genomes AU- Beck, Christine R AU- Collier, Pamela AU- Macfarlane, Catriona AU- Malig, Maika AU- Kidd, Jeffrey M AU- Eichler, Evan E AU- Badge, Richard M AU- Moran, John V PY- 2011 T2- Cell J2- Cell UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3013285/ VL- 141 IS- 7 SP- 1159-1170 DO- 10.1016/j.cell.2010.05.021 C2- 3013285 N2- Summary: Long Interspersed Element-1 (LINE-1 or L1) sequences comprise the bulk of retrotransposition activity in the human genome; however, the abundance of highly active or hot L1s in the human population remains largely unexplored. Here, we used a fosmid-based, paired-end DNA sequencing strategy to identify 68 full-length L1s which are differentially present among individuals but are absent from the human genome reference sequence. The majority of these L1s were highly active in a cultured cell retrotransposition assay. Genotyping 26 elements revealed that two L1s are only found in Africa and that two more are absent from the H952 subset of the Human Genome Diversity Panel. Therefore, these results suggest that hot L1s are more abundant in the human population than previously appreciated, and that ongoing L1 retrotransposition continues to be a major source of inter-individual genetic variation. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Lrp12/Mig13a Reveals Changing Patterns of Preplate Neuronal Polarity during Corticogenesis that Are Absent in Reeler Mutant Mice AU- Schneider, Stephanie AU- Gulacsi, Alexandra AU- Hatten, Mary E PY- 2011 T2- Cerebral Cortex (New York, NY) J2- Cereb Cortex UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3000567/ VL- 21 IS- 1 SP- 134-144 DO- 10.1093/cercor/bhq070 C2- 3000567 N2- During corticogenesis, the earliest generated neurons form the preplate, which evolves into the marginal zone and subplate. Lrp12/Mig13a, a mammalian gene related to the Caenorhabditis elegans neuroblast migration gene mig-13, is expressed in a subpopulation of preplate neurons that undergo ventrally directed tangential migrations in the preplate layer and pioneer axon projections to the anterior commissure. As the preplate separates, Lrp12/Mig13a-positive neurons polarize in the radial plane and form a pseudocolumnar pattern, prior to moving to a deeper position within the emerging subplate layer. These changes in neuronal polarity do not occur in reeler mutant mice, revealing the earliest known defect in reeler cortical patterning and suggesting that the alignment of preplate neurons into a pseudolayer facilitates the movement of later-born radially migrating neurons into the emerging cortical plate. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- ENCODE whole-genome data in the UCSC genome browser (2011 update) AU- Raney, Brian J AU- Cline, Melissa S AU- Rosenbloom, Kate R AU- Dreszer, Timothy R AU- Learned, Katrina AU- Barber, Galt P AU- Meyer, Laurence R AU- Sloan, Cricket A AU- Malladi, Venkat S AU- Roskin, Krishna M AU- Suh, Bernard B AU- Hinrichs, Angie S AU- Clawson, Hiram AU- Zweig, Ann S AU- Kirkup, Vanessa AU- Fujita, Pauline A AU- Rhead, Brooke AU- Smith, Kayla E AU- Pohl, Andy AU- Kuhn, Robert M AU- Karolchik, Donna AU- Haussler, David AU- Kent, W James PY- 2011 T2- Nucleic Acids Research J2- Nucleic Acids Res UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3013645/ VL- 39 IS- Database issue SP- D871-D875 DO- 10.1093/nar/gkq1017 C2- 3013645 N2- The ENCODE project is an international consortium with a goal of cataloguing all the functional elements in the human genome. The ENCODE Data Coordination Center (DCC) at the University of California, Santa Cruz serves as the central repository for ENCODE data. In this role, the DCC offers a collection of high-throughput, genome-wide data generated with technologies such as ChIP-Seq, RNA-Seq, DNA digestion and others. This data helps illuminate transcription factor-binding sites, histone marks, chromatin accessibility, DNA methylation, RNA expression, RNA binding and other cell-state indicators. It includes sequences with quality scores, alignments, signals calculated from the alignments, and in most cases, element or peak calls calculated from the signal data. Each data set is available for visualization and download via the UCSC Genome Browser (). ENCODE data can also be retrieved using a metadata system that captures the experimental parameters of each assay. The ENCODE web portal at UCSC () provides information about the ENCODE data and links for access. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- BACs as Tools for the Study of Genomic Imprinting AU- Tunster, S J AU- Van De Pette, M AU- John, R M PY- 2011 T2- Journal of Biomedicine and Biotechnology J2- J Biomed Biotechnol UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3010669/ VL- 2011 DO- 10.1155/2011/283013 C2- 3010669 N2- Genomic imprinting in mammals results in the expression of genes from only one parental allele. Imprinting occurs as a consequence of epigenetic marks set down either in the father's or the mother's germ line and affects a very specific category of mammalian gene. A greater understanding of this distinctive phenomenon can be gained from studies using large genomic clones, called bacterial artificial chromosomes (BACs). Here, we review the important applications of BACs to imprinting research, covering physical mapping studies and the use of BACs as transgenes in mice to study gene expression patterns, to identify imprinting centres, and to isolate the consequences of altered gene dosage. We also highlight the significant and unique advantages that rapid BAC engineering brings to genomic imprinting research. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Hymenoptera Genome Database: integrated community resources for insect species of the order Hymenoptera AU- Munoz-Torres, Monica C AU- Reese, Justin T AU- Childers, Christopher P AU- Bennett, Anna K AU- Sundaram, Jaideep P AU- Childs, Kevin L AU- Anzola, Juan M AU- Milshina, Natalia AU- Elsik, Christine G PY- 2011 T2- Nucleic Acids Research J2- Nucleic Acids Res UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3013718/ VL- 39 IS- Database issue SP- D658-D662 DO- 10.1093/nar/gkq1145 C2- 3013718 N2- The Hymenoptera Genome Database (HGD) is a comprehensive model organism database that caters to the needs of scientists working on insect species of the order Hymenoptera. This system implements open-source software and relational databases providing access to curated data contributed by an extensive, active research community. HGD contains data from 9 different species across 200 million years in the phylogeny of Hymenoptera, allowing researchers to leverage genetic, genome sequence and gene expression data, as well as the biological knowledge of related model organisms. The availability of resources across an order greatly facilitates comparative genomics and enhances our understanding of the biology of agriculturally important Hymenoptera species through genomics. Curated data at HGD includes predicted and annotated gene sets supported with evidence tracks such as ESTs/cDNAs, small RNA sequences and GC composition domains. Data at HGD can be queried using genome browsers and/or BLAST/PSI-BLAST servers, and it may also be downloaded to perform local searches. We encourage the public to access and contribute data to HGD at: . N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- ASPicDB: a database of annotated transcript and protein variants generated by alternative splicing AU- Martelli, Pier L AU- D'Antonio, Mattia AU- Bonizzoni, Paola AU- Castrignan, Tiziana AU- D'Erchia, Anna M AU- D'Onorio De Meo, Paolo AU- Fariselli, Piero AU- Finelli, Michele AU- Licciulli, Flavio AU- Mangiulli, Marina AU- Mignone, Flavio AU- Pavesi, Giulio AU- Picardi, Ernesto AU- Rizzi, Raffaella AU- Rossi, Ivan AU- Valletti, Alessio AU- Zauli, Andrea AU- Zambelli, Federico AU- Casadio, Rita AU- Pesole, Graziano PY- 2011 T2- Nucleic Acids Research J2- Nucleic Acids Res UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3013677/ VL- 39 IS- Database issue SP- D80-D85 DO- 10.1093/nar/gkq1073 C2- 3013677 N2- Alternative splicing is emerging as a major mechanism for the expansion of the transcriptome and proteome diversity, particularly in human and other vertebrates. However, the proportion of alternative transcripts and proteins actually endowed with functional activity is currently highly debated. We present here a new release of ASPicDB which now provides a unique annotation resource of human protein variants generated by alternative splicing. A total of 256939 protein variants from 17191 multi-exon genes have been extensively annotated through state of the art machine learning tools providing information of the protein type (globular and transmembrane), localization, presence of PFAM domains, signal peptides, GPI-anchor propeptides, transmembrane and coiled-coil segments. Furthermore, full-length variants can be now specifically selected based on the annotation of CAGE-tags and polyA signal and/or polyA sites, marking transcription initiation and termination sites, respectively. The retrieval can be carried out at gene, transcript, exon, protein or splice site level allowing the selection of data sets fulfilling one or more features settled by the user. The retrieval interface also enables the selection of protein variants showing specific differences in the annotated features. ASPicDB is available at . N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- TIARA: a database for accurate analysis of multiple personal genomes based on cross-technology AU- Hong, Dongwan AU- Park, Sung-Soo AU- Ju, Young Seok AU- Kim, Sheehyun AU- Shin, Jong-Yeon AU- Kim, Sujung AU- Yu, Saet-Byeol AU- Lee, Won-Chul AU- Lee, Seungbok AU- Park, Hansoo AU- Kim, Jong-Il AU- Seo, Jeong-Sun PY- 2011 T2- Nucleic Acids Research J2- Nucleic Acids Res UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3013693/ VL- 39 IS- Database issue SP- D883-D888 DO- 10.1093/nar/gkq1101 C2- 3013693 N2- High-throughput genomic technologies have been used to explore personal human genomes for the past few years. Although the integration of technologies is important for high-accuracy detection of personal genomic variations, no databases have been prepared to systematically archive genomes and to facilitate the comparison of personal genomic data sets prepared using a variety of experimental platforms. We describe here the Total Integrated Archive of Short-Read and Array (TIARA; ) database, which contains personal genomic information obtained from next generation sequencing (NGS) techniques and ultra-high-resolution comparative genomic hybridization (CGH) arrays. This database improves the accuracy of detecting personal genomic variations, such as SNPs, short indels and structural variants (SVs). At present, 36 individual genomes have been archived and may be displayed in the database. TIARA supports a user-friendly genome browser, which retrieves read-depths (RDs) and log2 ratios from NGS and CGH arrays, respectively. In addition, this database provides information on all genomic variants and the raw data, including short reads and feature-level CGH data, through anonymous file transfer protocol. More personal genomes will be archived as more individuals are analyzed by NGS or CGH array. TIARA provides a new approach to the accurate interpretation of personal genomes for genome research. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Primate and Rodent Specific Intron Gains and the Origin of Retrogenes with Splice Variants AU- Szcze?niak, Micha? W AU- Ciomborowska, Joanna AU- Nowak, Witold AU- Rogozin, Igor B AU- Maka?owska, Izabela PY- 2011 T2- Molecular biology and evolution J2- Mol Biol Evol UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3002245/ VL- 28 IS- 1 SP- 33-37 DO- 10.1093/molbev/msq260 C2- 3002245 N2- Retroposition, a leading mechanism for gene duplication, is an important process shaping the evolution of genomes. Retrogenes are also involved in the gene structure evolution as a major player in the process of intron deletion. Here, we demonstrate the role of retrogenes in intron gain in mammals. We identified one case of intronization, the transformation of exonic sequences into an intron, in the primate specific retrogene RNF113B and two independent intronization events in the retrogene DCAF12L2, one in the common ancestor of primates and rodents and another one in the rodent lineage. Intron gain resulted from the origin of new splice variants, and both genes have two transcript forms, one with retained intron and one with the intron spliced out. Evolution of these genes, especially RNF113B, has been very dynamic and has been accompanied by several additional events including parental gene loss, secondary retroposition, and exaptation of transposable elements. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Structural insights and ab initio sequencing within the DING proteins family AU- Elias, Mikael AU- Liebschner, Dorothee AU- Gotthard, Guillaume AU- Chabriere, Eric PY- 2011 T2- Journal of Synchrotron Radiation J2- J Synchrotron Radiat UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3004253/ VL- 18 IS- Pt 1 SP- 45-49 DO- 10.1107/S0909049510036009 C2- 3004253 N2- DING proteins constitute an intriguing family of phosphate-binding proteins that was identified in a wide range of organisms, from prokaryotes and archae to eukaryotes. Despite their seemingly ubiquitous occurrence in eukaryotes, their encoding genes are missing from sequenced genomes. Such a lack has considerably hampered functional studies. In humans, these proteins have been related to several diseases, like atherosclerosis, kidney stones, inflammation processes and HIV inhibition. The human phosphate binding protein is a human representative of the DING family that was serendipitously discovered from human plasma. An original approach was developed to determine ab initio the complete and exact sequence of this 38?kDa protein by utilizing mass spectrometry and X-ray data in tandem. Taking advantage of this first complete eukaryotic DING sequence, a immunohistochemistry study was undertaken to check the presence of DING proteins in various mice tissues, revealing that these proteins are widely expressed. Finally, the structure of a bacterial representative from Pseudomonas fluorescens was solved at sub-angstrom resolution, allowing the molecular mechanism of the phosphate binding in these high-affinity proteins to be elucidated. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- dbDNV: a resource of duplicated gene nucleotide variants in human genome AU- Ho, Meng-Ru AU- Tsai, Kuo-Wang AU- Chen, Chun-Houh AU- Lin, Wen-Chang PY- 2011 T2- Nucleic Acids Research J2- Nucleic Acids Res UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3013738/ VL- 39 IS- Database issue SP- D920-D925 DO- 10.1093/nar/gkq1197 C2- 3013738 N2- Gene duplications are scattered widely throughout the human genome. A single-base difference located in nearly identical duplicated segments may be misjudged as a single nucleotide polymorphism (SNP) from individuals. This imperfection is undistinguishable in current genotyping methods. As the next-generation sequencing technologies become more popular for sequence-based association studies, numerous ambiguous SNPs are rapidly accumulated. Thus, analyzing duplication variations in the reference genome to assist in preventing false positive SNPs is imperative. We have identified 10% of human genes associated with duplicated gene loci (DGL). Through meticulous sequence alignments of DGL, we systematically designated 1236956 variations as duplicated gene nucleotide variants (DNVs). The DNV database (dbDNV) () has been established to promote more accurate variation annotation. Aside from the flat file download, users can explore the gene-related duplications and the associated DNVs by DGL and DNV searches, respectively. In addition, the dbDNV contains 304110 DNV-coupled SNPs. From DNV-coupled SNP search, users observe which SNP records are also variants among duplicates. This is useful while 58% of exonic SNPs in DGL are DNV-coupled. Because of high accumulation of ambiguous SNPs, we suggest that annotating SNPs with DNVs possibilities should improve association studies of these variants with human diseases. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Database resources of the National Center for Biotechnology Information AU- Sayers, Eric W AU- Barrett, Tanya AU- Benson, Dennis A AU- Bolton, Evan AU- Bryant, Stephen H AU- Canese, Kathi AU- Chetvernin, Vyacheslav AU- Church, Deanna M AU- Dicuccio, Michael AU- Federhen, Scott AU- Feolo, Michael AU- Fingerman, Ian M AU- Geer, Lewis Y AU- Helmberg, Wolfgang AU- Kapustin, Yuri AU- Landsman, David AU- Lipman, David J AU- Lu, Zhiyong AU- Madden, Thomas L AU- Madej, Tom AU- Maglott, Donna R AU- Marchler-Bauer, Aron AU- Miller, Vadim AU- Mizrachi, Ilene AU- Ostell, James AU- Panchenko, Anna AU- Phan, Lon AU- Pruitt, Kim D AU- Schuler, Gregory D AU- Sequeira, Edwin AU- Sherry, Stephen T AU- Shumway, Martin AU- Sirotkin, Karl AU- Slotta, Douglas AU- Souvorov, Alexandre AU- Starchenko, Grigory AU- Tatusova, Tatiana A AU- Wagner, Lukas AU- Wang, Yanli AU- Wilbur, W John AU- Yaschenko, Eugene AU- Ye, Jian PY- 2011 T2- Nucleic Acids Research J2- Nucleic Acids Res UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3013733/ VL- 39 IS- Database issue SP- D38-D51 DO- 10.1093/nar/gkq1172 C2- 3013733 N2- In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Electronic PCR, OrfFinder, Splign, ProSplign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), IBIS, Biosystems, Peptidome, OMSSA, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at . N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Isolation of Specific Clones from Nonarrayed BAC Libraries through Homologous Recombination AU- Nefedov, Mikhail AU- Carbone, Lucia AU- Field, Matthew AU- Schein, Jacquie AU- de Jong, Pieter J PY- 2011 T2- Journal of Biomedicine and Biotechnology J2- J Biomed Biotechnol UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2957146/ VL- 2011 DO- 10.1155/2011/560124 C2- 2957146 N2- We have developed a new approach to screen bacterial artificial chromosome (BAC) libraries by recombination selection. To test this method, we constructed an orangutan BAC library using an E. coli strain (DY380) with temperature inducible homologous recombination (HR) capability. We amplified one library segment, induced HR at 42C to make it recombination proficient, and prepared electrocompetent cells for transformation with a kanamycin cassette to target sequences in the orangutan genome through terminal recombineering homologies. Kanamycin-resistant colonies were tested for the presence of BACs containing the targeted genes by the use of a PCR-assay to confirm the presence of the kanamycin insertion. The results indicate that this is an effective approach for screening clones. The advantage of recombination screening is that it avoids the high costs associated with the preparation, screening, and archival storage of arrayed BAC libraries. In addition, the screening can be conceivably combined with genetic engineering to create knockout and reporter constructs for functional studies. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- DBASS3 and DBASS5: databases of aberrant 3- and 5-splice sites AU- Buratti, Emanuele AU- Chivers, Martin AU- Hwang, Gyulin AU- Vorechovsky, Igor PY- 2011 T2- Nucleic Acids Research J2- Nucleic Acids Res UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3013770/ VL- 39 IS- Database issue SP- D86-D91 DO- 10.1093/nar/gkq887 C2- 3013770 N2- DBASS3 and DBASS5 provide comprehensive repositories of new exon boundaries that were induced by pathogenic mutations in human disease genes. Aberrant 5- and 3-splice sites were activated either by mutations in the consensus sequences of natural exonintron junctions (cryptic sites) or elsewhere (de novo sites). DBASS3 and DBASS5 currently contain approximately 900 records of cryptic and de novo 3- and 5-splice sites that were produced by over a thousand different mutations in approximately 360 genes. DBASS3 and DBASS5 data can be searched by disease phenotype, gene, mutation, location of aberrant splice sites in introns and exons and their distance from authentic counterparts, by bibliographic references and by the splice-site strength estimated with several prediction algorithms. The user can also retrieve reference sequences of both aberrant and authentic splice sites with the underlying mutation. These data will facilitate identification of introns or exons frequently involved in aberrant splicing, mutation analysis of human disease genes and study of germline or somatic mutations that impair RNA processing. Finally, this resource will be useful for fine-tuning splice-site prediction algorithms, better definition of auxiliary splicing signals and design of new reporter assays. DBASS3 and DBASS5 are freely available at . N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Identification of causal sequence variants of disease in the next generation sequencing era. AU- Kingsley, Christopher B PY- 2011 T2- Methods in molecular biology (Clifton, N.J.) J2- Methods Mol Biol UR- http://www.ncbi.nlm.nih.gov/pubmed/21204025 VL- 700 SP- 37-46 N2- Over the last decade, genetic studies have identified numerous associations between single nucleotide polymorphism (SNP) alleles in the human genome and important human diseases. Unfortunately, extending these initial associative findings to identification of the true causal variants that underlie disease susceptibility is usually not a straightforward task. Causal variant identification typically involves searching through sizable regions of genomic DNA in the vicinity of disease-associated SNPs for sequence variants in functional elements including protein coding, regulatory, and structural sequences. Prioritization of these searches is greatly aided by knowledge of the location of functional sequences in the human genome. This chapter briefly reviews several of the common approaches used to functionally annotate the human genome and discusses how this information can be used in concert with the emerging technology of next generation high-throughput sequencing to identify causal variants of human disease. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Cell-Based Co-transfection Microarrays for Use with HEK293T Cells on a Poly D: -Lysine-Coated Polystyrene Microplate. AU- Soni, Meenal AU- Lai, Fang PY- 2011 T2- Methods in molecular biology (Clifton, N.J.) J2- Methods Mol Biol UR- http://www.ncbi.nlm.nih.gov/pubmed/21104051 VL- 706 SP- 13-25 N2- Analysis of the human genome sequence has identified thousands of putative genes with unknown function; therefore, a new tool allowing for rapid identification of gene functions is needed. Reverse transfection microarray technology, which turns a DNA microarray into a cell-based microarray, has emerged for simultaneously studying the function of many genes. Since the initial demonstration in 2001, many variations have surfaced, making the technology more versatile for a broad range of applications. We have developed a protocol to make ready-to-transfect DNA microarrays in a 96-well microplate for co-transfection of two plasmids into HEK293T cells. This cell-based microarray in a microplate may be used for screening hundreds of analytes against multiple protein targets in parallel, providing a powerful tool for functional genomics and drug discovery. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Bovine Genome Database: integrated tools for genome annotation and discovery AU- Childers, Christopher P AU- Reese, Justin T AU- Sundaram, Jaideep P AU- Vile, Donald C AU- Dickens, C Michael AU- Childs, Kevin L AU- Salih, Hanni AU- Bennett, Anna K AU- Hagen, Darren E AU- Adelson, David L AU- Elsik, Christine G PY- 2011 T2- Nucleic Acids Research J2- Nucleic Acids Res UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3013744/ VL- 39 IS- Database issue SP- D830-D834 DO- 10.1093/nar/gkq1235 C2- 3013744 N2- The Bovine Genome Database (BGD; ) strives to improve annotation of the bovine genome and to integrate the genome sequence with other genomics data. BGD includes GBrowse genome browsers, the Apollo Annotation Editor, a quantitative trait loci (QTL) viewer, BLAST databases and gene pages. Genome browsers, available for both scaffold and chromosome coordinate systems, display the bovine Official Gene Set (OGS), RefSeq and Ensembl gene models, non-coding RNA, repeats, pseudogenes, single-nucleotide polymorphism, markers, QTL and alignments to complementary DNAs, ESTs and protein homologs. The Bovine QTL viewer is connected to the BGD Chromosome GBrowse, allowing for the identification of candidate genes underlying QTL. The Apollo Annotation Editor connects directly to the BGD Chado database to provide researchers with remote access to gene evidence in a graphical interface that allows editing and creating new gene models. Researchers may upload their annotations to the BGD server for review and integration into the subsequent release of the OGS. Gene pages display information for individual OGS gene models, including gene structure, transcript variants, functional descriptions, gene symbols, Gene Ontology terms, annotator comments and links to National Center for Biotechnology Information and Ensembl. Each gene page is linked to a wiki page to allow input from the research community. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Characterization of New Otic Enhancers of the Pou3f4 Gene Reveal Distinct Signaling Pathway Regulation and Spatio-Temporal Patterns AU- Robert-Moreno, lex AU- Naranjo, Silvia AU- de la Calle-Mustienes, Elisa AU- Gmez-Skarmeta, Jos Luis AU- Alsina, Berta PY- 2010 T2- PLoS ONE J2- PLoS One UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3013142/ VL- 5 IS- 12 DO- 10.1371/journal.pone.0015907 C2- 3013142 N2- POU3F4 is a member of the POU-homedomain transcription factor family with a prominent role in inner ear development. Mutations in the human POU3F4 coding unit leads to X-linked deafness type 3 (DFN3), characterized by conductive hearing loss and progressive sensorineural deafness. Microdeletions found 1 Mb 5 upstream of the coding region also displayed the same phenotype, suggesting that cis-regulatory elements might be present in that region. Indeed, we and others have recently identified several enhancers at the 1 Mb 5 upstream interval of the pou3f4 locus. Here we characterize the spatio-temporal patterns of these regulatory elements in zebrafish transgenic lines. We show that the most distal enhancer (HCNR 81675) is activated earlier and drives GFP reporter expression initially to a broad ear domain to progressively restrict to the sensory patches. The proximal enhancer (HCNR 82478) is switched later during development and promotes expression, among in other tissues, in sensory patches from its onset. The third enhancer (HCNR 81728) is also active at later stages in the otic mesenchyme and in the otic epithelium. We also characterize the signaling pathways regulating these enhancers. While HCNR 81675 is regulated by very early signals of retinoic acid, HCNR 82478 is regulated by Fgf activity at a later stage and the HCNR 81728 enhancer is under the control of Hh signaling. Finally, we show that Sox2 and Pax2 transcription factors are bound to HCNR 81675 genomic region during otic development and specific mutations to these transcription factor binding sites abrogates HCNR 81675 enhancer activity. Altogether, our results suggest that pou3f4 expression in inner ear might be under the control of distinct regulatory elements that fine-tune the spatio-temporal activity of this gene and provides novel data on the signaling mechanisms controlling pou3f4 function. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Activation of Pluripotency Genes in Human Fibroblast Cells by a Novel mRNA Based Approach AU- Plews, Jordan R. AU- Li, JianLiang AU- Jones, Mark AU- Moore, Harry D. AU- Mason, Chris AU- Andrews, Peter W. AU- Na, Jie PY- 2010 T2 - PLoS ONE J2- PLoS One UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3012685/ VL- 5 IS- 12 DO- 10.1371/journal.pone.0014397 C2- 3012685 N2- Background: Several methods have been used to induce somatic cells to re-enter the pluripotent state. Viral transduction of reprogramming genes yields higher efficiency but involves random insertions of viral sequences into the human genome. Although induced pluripotent stem (iPS) cells can be obtained with the removable PiggyBac transposon system or an episomal system, both approaches still use DNA constructs so that resulting cell lines need to be thoroughly analyzed to confirm they are free of harmful genetic modification. Thus a method to change cell fate without using DNA will be very useful in regenerative medicine. Methodology/Principal Findings: In this study, we synthesized mRNAs encoding OCT4, SOX2, cMYC, KLF4 and SV40 large T (LT) and electroporated them into human fibroblast cells. Upon transfection, fibroblasts expressed these factors at levels comparable to, or higher than those in human embryonic stem (ES) cells. Ectopically expressed OCT4 localized to the cell nucleus within 4 hours after mRNA introduction. Transfecting fibroblasts with a mixture of mRNAs encoding all five factors significantly increased the expression of endogenous OCT4, NANOG, DNMT3, REX1 and SALL4. When such transfected fibroblasts were also exposed to several small molecules (valproic acid, BIX01294 and 5-aza-2-deoxycytidine) and cultured in human embryonic stem cell (ES) medium they formed small aggregates positive for alkaline phosphatase activity and OCT4 protein within 30 days. Conclusion/Significance: Our results demonstrate that mRNA transfection can be a useful approach to precisely control the protein expression level and short-term expression of reprogramming factors is sufficient to activate pluripotency genes in differentiated cells. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- The complete genome sequence of Corynebacterium pseudotuberculosis FRC41 isolated from a 12-year-old girl with necrotizing lymphadenitis reveals insights into gene-regulatory networks contributing to virulence. AU- Trost, Eva AU- Ott, Lisa AU- Schneider, Jessica AU- Schroder, Jasmin AU- Jaenicke, Sebastian AU- Goesmann, Alexander AU- Husemann, Peter AU- Stoye, Jens AU- Alves Dorella, Fernanda AU- Souza Rocha, Flavia AU- de Castro Soares, Siomar AU- D'Afonseca, Vivian AU- Miyoshi, Anderson AU- Ruiz, Jeronimo AU- Silva, Artur AU- Azevedo, Vasco AU- Burkovski, Andreas AU- Guiso, Nicole AU- Join-Lambert, Olivier F AU- Kayal, Samer AU- Tauch, Andreas PY- 2010 T2- BMC genomics J2- BMC Genomics UR- http://www.ncbi.nlm.nih.gov/pubmed/21192786 VL- 11 IS- 1 SP- 728 N2- ABSTRACT: BACKGROUND: Corynebacterium pseudotuberculosis is generally regarded as an important animal pathogen that rarely infects humans. Clinical strains are occasionally recovered from human cases of lymphadenitis, such as C. pseudotuberculosis FRC41 that was isolated from the inguinal lymph node of a 12-year-old girl with necrotizing lymphadenitis. To detect potential virulence factors and corresponding gene-regulatory networks in this human isolate, the genome sequence of C. pseudotuberculosis FCR41 was determined by pyrosequencing and functionally annotated. RESULTS: Sequencing and assembly of the C. pseudotuberculosis FRC41 genome yielded a circular chromosome with a size of 2,337,913 bp and a mean G+C content of 52.2%. Specific gene sets associated with iron and zinc homeostasis were detected among the 2,110 predicted protein-coding regions and integrated into a gene-regulatory network that is linked with both the central metabolism and the oxidative stress response of FRC41. Two gene clusters encode proteins involved in the sortase-mediated polymerization of adhesive pili that can probably mediate the adherence to host tissue to facilitate additional ligand-receptor interactions and the delivery of virulence factors. The prominent virulence factors phospholipase D (Pld) and corynebacterial protease CP40 are encoded in the genome of this human isolate. The genome annotation revealed additional serine proteases, neuraminidase H, nitric oxide reductase, an invasion-associated protein, and acyl-CoA carboxylase subunits involved in mycolic acid biosynthesis as potential virulence factors. The cAMP-sensing transcription regulator GlxR plays a key role in controlling the expression of several genes contributing to virulence. CONCLUSION: The functional data deduced from the genome sequencing and the extended knowledge of virulence factors indicate that the human isolate C. pseudotuberculosis FRC41 is equipped with a distinct gene set promoting its survival under unfavorable environmental conditions encountered in the mammalian host. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Analysis of MicroRNA Expression in the Prepubertal Testis AU- Buchold, Gregory M AU- Coarfa, Cristian AU- Kim, Jong AU- Milosavljevic, Aleksandar AU- Gunaratne, Preethi H AU- Matzuk, Martin M PY- 2010 T2- PLoS ONE J2- PLoS One UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3012074/ VL- 5 IS- 12 DO- 10.1371/journal.pone.0015317 C2- 3012074 N2- Only thirteen microRNAs are conserved between D. melanogaster and the mouse; however, conditional loss of miRNA function through mutation of Dicer causes defects in proliferation of premeiotic germ cells in both species. This highlights the potentially important, but uncharacterized, role of miRNAs during early spermatogenesis. The goal of this study was to characterize on postnatal day 7, 10, and 14 the content and editing of murine testicular miRNAs, which predominantly arise from spermatogonia and spermatocytes, in contrast to prior descriptions of miRNAs in the adult mouse testis which largely reflects the content of spermatids. Previous studies have shown miRNAs to be abundant in the mouse testis by postnatal day 14; however, through Next Generation Sequencing of testes from a B6;129 background we found abundant earlier expression of miRNAs and describe shifts in the miRNA signature during this period. We detected robust expression of miRNAs encoded on the X chromosome in postnatal day 14 testes, consistent with prior studies showing their resistance to meiotic sex chromosome inactivation. Unexpectedly, we also found a similar positional enrichment for most miRNAs on chromosome 2 at postnatal day 14 and for those on chromosome 12 at postnatal day 7. We quantified in vivo developmental changes in three types of miRNA variation including 5 heterogeneity, editing, and 3 nucleotide addition. We identified eleven putative novel pubertal testis miRNAs whose developmental expression suggests a possible role in early male germ cell development. These studies provide a foundation for interpretation of miRNA changes associated with testicular pathology and identification of novel components of the miRNA editing machinery in the testis. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Primate-specific evolution of noncoding element insertion into PLA2G4C and human preterm birth AU- Plunkett, Jevon AU- Doniger, Scott AU- Morgan, Thomas AU- Haataja, Ritva AU- Hallman, Mikko AU- Puttonen, Hilkka AU- Menon, Ramkumar AU- Kuczynski, Edward AU- Norwitz, Errol AU- Snegovskikh, Victoria AU- Palotie, Aarno AU- Peltonen, Leena AU- Fellman, Vineta AU- Defranco, Emily A AU- Chaudhari, Bimal P AU- Oates, John AU- Boutaud, Olivier AU- McGregor, Tracy L AU- McElroy, Jude J AU- Teramo, Kari AU- Borecki, Ingrid AU- Fay, Justin C AU- Muglia, Louis J PY- 2010 T2- BMC Medical Genomics J2- BMC Medical Genomics UR- http://www.biomedcentral.com/1755-8794/3/62 VL- 3 IS- 1 SP- 62 DO- 10.1186/1755-8794-3-62 N2- Abstract Background: The onset of birth in humans, like other apes, differs from non-primate mammals in its endocrine physiology. We hypothesize that higher primate-specific gene evolution may lead to these differences and target genes involved in human preterm birth, an area of global health significance. Methods: We performed a comparative genomics screen of highly conserved noncoding elements and identified PLA2G4C, a phospholipase A isoform involved in prostaglandin biosynthesis as human accelerated. To examine whether this gene demonstrating primate-specific evolution was associated with birth timing, we genotyped and analyzed 8 common single nucleotide polymorphisms (SNPs) in PLA2G4C in US Hispanic (n = 73 preterm, 292 control), US White (n = 147 preterm, 157 control) and US Black (n = 79 preterm, 166 control) mothers. Results: Detailed structural and phylogenic analysis of PLA2G4C suggested a short genomic element within the gene duplicated from a paralogous highly conserved element on chromosome 1 specifically in primates. SNPs rs8110925 and rs2307276 in US Hispanics and rs11564620 in US Whites were significant after correcting for multiple tests (p 0.006). Additionally, rs11564620 (Thr360Pro) was associated with increased metabolite levels of the prostaglandin thromboxane in healthy individuals (p = 0.02), suggesting this variant may affect PLA2G4C activity. Conclusions: Our findings suggest that variation in PLA2G4C may influence preterm birth risk by increasing levels of prostaglandins, which are known to regulate labor. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Complete genome sequence of the zoonotic pathogen Chlamydophila psittaci. AU- Seth-Smith, Helena M B AU- Harris, Simon R AU- Rance, Richard AU- West, Anthony P AU- Severin, Juliette A AU- Ossewaarde, Jacobus M AU- Cutcliffe, Lesley T AU- Skilton, Rachel J AU- Marsh, Pete AU- Parkhill, Julian AU- Clarke, Ian N AU- Thomson, Nicholas R PY- 2010 T2- Journal of bacteriology J2- J Bacteriol UR- http://www.ncbi.nlm.nih.gov/pubmed/21183672 N2- We present the first genome sequence of Chlamydophila psittaci, an intracellular pathogen of birds, and a human zoonotic pathogen. A comparison with previously sequenced Chlamydophila genomes shows that, like other chlamydiae, most of the genome diversity is restricted to the plasticity zone. The Cp. psittaci plasmid was also sequenced. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Cancer and Neurodegeneration: Between the Devil and the Deep Blue Sea AU- Plun-Favreau, Hlne AU- Lewis, Patrick A AU- Hardy, John AU- Martins, L Miguel AU- Wood, Nicholas W PY- 2010 T2- PLoS Genetics J2- PLoS Genet UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3009676/ VL- 6 IS- 12 DO- 10.1371/journal.pgen.1001257 C2- 3009676 N2- Cancer and neurodegeneration are often thought of as disease mechanisms at opposite ends of a spectrum; one due to enhanced resistance to cell death and the other due to premature cell death. There is now accumulating evidence to link these two disparate processes. An increasing number of genetic studies add weight to epidemiological evidence suggesting that sufferers of a neurodegenerative disorder have a reduced incidence for most cancers, but an increased risk for other cancers. Many of the genes associated with either cancer and/or neurodegeneration play a central role in cell cycle control, DNA repair, and kinase signalling. However, the links between these two families of diseases remain to be proven. In this review, we discuss recent and sometimes as yet incomplete genetic discoveries that highlight the overlap of molecular pathways implicated in cancer and neurodegeneration. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- A Novel HMM-Based Method for Detecting Enriched Transcription Factor Binding Sites Reveals RUNX3 as a Potential Target in Pancreatic Cancer Biology AU- Levkovitz, Liron AU- Yosef, Nir AU- Gershengorn, Marvin C AU- Ruppin, Eytan AU- Sharan, Roded AU- Oron, Yoram PY- 2010 T2- PLoS ONE J2- PLoS One UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3008686/ VL- 5 IS- 12 DO- 10.1371/journal.pone.0014423 C2- 3008686 N2- Background: Pancreatic adenocarcinoma (PAC) is one of the most intractable malignancies. In order to search for potential new therapeutic targets, we relied on computational methods aimed at identifying transcription factor binding sites (TFBSs) over-represented in the promoter regions of genes differentially expressed in PAC. Though many computational methods have been implemented to accomplish this, none has gained overall acceptance or produced proven novel targets in PAC. To this end we have developed DEMON, a novel method for motif detection. Methodology: DEMON relies on a hidden Markov model to score the appearance of sequence motifs, taking into account all potential sites in a promoter of potentially varying binding affinities. We demonstrate DEMON's accuracy on simulated and real data sets. Applying DEMON to PAC-related data sets identifies the RUNX family as highly enriched in PAC-related genes. Using a novel experimental paradigm to distinguish between normal and PAC cells, we find that RUNX3 mRNA (but not RUNX1 or RUNX2 mRNAs) exhibits time-dependent increases in normal but not in PAC cells. These increases are accompanied by changes in mRNA levels of putative RUNX gene targets. Conclusions: The integrated application of DEMON and a novel differentiation system led to the identification of a single family member, RUNX3, which together with four of its putative targets showed a robust response to a differentiation stimulus in healthy cells, whereas this regulatory mechanism was absent in PAC cells, emphasizing RUNX3 as a promising target for further studies. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Nicotinic Receptor Gene CHRNA4 Interacts with Processing Load in Attention AU- Espeseth, Thomas AU- Sneve, Markus Handal AU- Rootwelt, Helge AU- Laeng, Bruno PY- 2010 T2- PLoS ONE J2- PLoS One UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3008676/ VL- 5 IS- 12 DO- 10.1371/journal.pone.0014407 C2- 3008676 N2- Background: Pharmacological studies suggest that cholinergic neurotransmission mediates increases in attentional effort in response to high processing load during attention demanding tasks . Methodology/Principal Findings: In the present study we tested whether individual variation in CHRNA4, a gene coding for a subcomponent in 42 nicotinic receptors in the human brain, interacted with processing load in multiple-object tracking (MOT) and visual search (VS). We hypothesized that the impact of genotype would increase with greater processing load in the MOT task. Similarly, we predicted that genotype would influence performance under high but not low load in the VS task. Two hundred and two healthy persons (age range?=?39 77, Mean ?=?57.5, SD?=?9.4) performed the MOT task in which twelve identical circular objects moved about the display in an independent and unpredictable manner. Two to six objects were designated as targets and the remaining objects were distracters. The same observers also performed a visual search for a target letter (i.e. X or Z) presented together with five non-targets while ignoring centrally presented distracters (i.e. X, Z, or L). Targets differed from non-targets by a unique feature in the low load condition, whereas they shared features in the high load condition. CHRNA4 genotype interacted with processing load in both tasks. Homozygotes for the T allele (N?=?62) had better tracking capacity in the MOT task and identified targets faster in the high load trials of the VS task. Conclusion: The results support the hypothesis that the cholinergic system modulates attentional effort, and that common genetic variation can be used to study the molecular biology of cognition. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Somatic Mutation Profiles of MSI and MSS Colorectal Cancer Identified by Whole Exome Next Generation Sequencing and Bioinformatics Analysis AU- Timmermann, Bernd AU- Kerick, Martin AU- Roehr, Christina AU- Fischer, Axel AU- Isau, Melanie AU- Boerno, Stefan T AU- Wunderlich, Andrea AU- Barmeyer, Christian AU- Seemann, Petra AU- Koenig, Jana AU- Lappe, Michael AU- Kuss, Andreas W AU- Garshasbi, Masoud AU- Bertram, Lars AU- Trappe, Kathrin AU- Werber, Martin AU- Herrmann, Bernhard G AU- Zatloukal, Kurt AU- Lehrach, Hans AU- Schweiger, Michal R PY- 2010 T2- PLoS ONE J2- PLoS One UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3008745/ VL- 5 IS- 12 DO- 10.1371/journal.pone.0015661 C2- 3008745 N2- Background: Colorectal cancer (CRC) is with approximately 1 million cases the third most common cancer worldwide. Extensive research is ongoing to decipher the underlying genetic patterns with the hope to improve early cancer diagnosis and treatment. In this direction, the recent progress in next generation sequencing technologies has revolutionized the field of cancer genomics. However, one caveat of these studies remains the large amount of genetic variations identified and their interpretation. Methodology/Principal Findings: Here we present the first work on whole exome NGS of primary colon cancers. We performed 454 whole exome pyrosequencing of tumor as well as adjacent not affected normal colonic tissue from microsatellite stable (MSS) and microsatellite instable (MSI) colon cancer patients and identified more than 50,000 small nucleotide variations for each tissue. According to predictions based on MSS and MSI pathomechanisms we identified eight times more somatic non-synonymous variations in MSI cancers than in MSS and we were able to reproduce the result in four additional CRCs. Our bioinformatics filtering approach narrowed down the rate of most significant mutations to 359 for MSI and 45 for MSS CRCs with predicted altered protein functions. In both CRCs, MSI and MSS, we found somatic mutations in the intracellular kinase domain of bone morphogenetic protein receptor 1A, BMPR1A, a gene where so far germline mutations are associated with juvenile polyposis syndrome, and show that the mutations functionally impair the protein function. Conclusions/Significance: We conclude that with deep sequencing of tumor exomes one may be able to predict the microsatellite status of CRC and in addition identify potentially clinically relevant mutations. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- High-Definition Mapping of Retroviral Integration Sites Defines the Fate of Allogeneic T Cells After Donor Lymphocyte Infusion AU- Cattoglio, Claudia AU- Maruggi, Giulietta AU- Bartholomae, Cynthia AU- Malani, Nirav AU- Pellin, Danilo AU- Cocchiarella, Fabienne AU- Magnani, Zulma AU- Ciceri, Fabio AU- Ambrosi, Alessandro AU- von Kalle, Christof AU- Bushman, Frederic D AU- Bonini, Chiara AU- Schmidt, Manfred AU- Mavilio, Fulvio AU- Recchia, Alessandra PY- 2010 T2- PLoS ONE J2- PLoS One UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3008730/ VL- 5 IS- 12 DO- 10.1371/journal.pone.0015688 C2- 3008730 N2- The infusion of donor lymphocytes transduced with a retroviral vector expressing the HSV-TK suicide gene in patients undergoing hematopoietic stem cell transplantation for leukemia/lymphoma promotes immune reconstitution and prevents infections and graft-versus-host disease. Analysis of the clonal dynamics of genetically modified lymphocytes in vivo is of crucial importance to understand the potential genotoxic risk of this therapeutic approach. We used linear amplification-mediated PCR and pyrosequencing to build a genome-wide, high-definition map of retroviral integration sites in the genome of peripheral blood T cells from two different donors and used gene expression profiling and bioinformatics to associate integration clusters to transcriptional activity and to genetic and epigenetic features of the T cell genome. Comparison with matched random controls and with integrations obtained from CD34+ hematopoietic stem/progenitor cells showed that integration clusters occur within chromatin regions bearing epigenetic marks associated with active promoters and regulatory elements in a cell-specific fashion. Analysis of integration sites in T cells obtained ex vivo two months after infusion showed no evidence of integration-related clonal expansion or dominance, but rather loss of cells harboring integration events interfering with RNA post-transcriptional processing. The study shows that high-definition maps of retroviral integration sites are a powerful tool to analyze the fate of genetically modified T cells in patients and the biological consequences of retroviral transduction. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- In silico prediction of drug targets in Vibrio cholerae. AU- Katara, Pramod AU- Grover, Atul AU- Kuntal, Himani AU- Sharma, Vinay PY- 2010 T2- Protoplasma J2- Protoplasma UR- http://www.ncbi.nlm.nih.gov/pubmed/21174131 N2- Identification of potential drug targets is the first step in the process of modern drug discovery, subjected to their validation and drug development. Whole genome sequences of a number of organisms allow prediction of potential drug targets using sequence comparison approaches. Here, we present a subtractive approach exploiting the knowledge of global gene expression along with sequence comparisons to predict the potential drug targets more efficiently. Based on the knowledge of 155 known virulence and their coexpressed genes mined from microarray database in the public domain, 357 coexpressed probable virulence genes for Vibrio cholerae were predicted. Based on screening of Database of Essential Genes using blastn, a total of 102 genes out of these 357 were enlisted as vitally essential genes, and hence good putative drug targets. As the effective drug target is a protein which is only present in the pathogen, similarity search of these 102 essential genes against human genome sequence led to subtraction of 66 genes, thus leaving behind a subset of 36 genes whose products have been called as potential drug targets. The gene ontology analysis using Blast2GO of these 36 genes revealed their roles in important metabolic pathways of V. cholerae or on the surface of the pathogen. Thus, we propose that the products of these genes be evaluated as target sites of drugs against V. cholerae in future investigations. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Gene processing control loops suggested by sequencing, splicing, and RNA folding AU- Jeffries, Clark D AU- Perkins, Diana O AU- Guan, Xiaojun PY- 2010 T2- BMC bioinformatics J2- BMC Bioinformatics UR- http://www.biomedcentral.com/1471-2105/11/602 VL- 11 IS- 1 SP- 602 DO- 10.1186/1471-2105-11-602 N2- Abstract Background: Small RNAs are known to regulate diverse gene expression processes including translation, transcription, and splicing. Among small RNAs, the microRNAs (miRNAs) of 17 to 27 nucleotides (nts) undergo biogeneses including primary transcription, RNA excision and folding, nuclear export, cytoplasmic processing, and then bioactivity as regulatory agents. We propose that analogous hairpins from RNA molecules that function as part of the spliceosome might also be the source of small, regulatory RNAs (somewhat smaller than miRNAs). Results: Deep sequencing technology has enabled discovery of a novel 16-nt RNA sequence in total RNA from human brain that we propose is derived from RNU1, an RNA component of spliceosome assembly. Bioinformatic alignments compel inquiring whether the novel 16-nt sequence or its precursor have a regulatory function as well as determining aspects of how processing intersects with the miRNA biogenesis pathway. Specifically, our preliminary in silico investigations reveal the sequence could regulate splicing factor Arg/Ser rich 1 (SFRS1), a gene coding an essential protein component of the spliceosome. All 16-base source sequences in the UCSC Human Genome Browser are within the 14 instances of RNU1 genes listed in wgEncodeGencodeAutoV3. Furthermore, 10 of the 14 instances of the sequence are also within a common 28-nt hairpin-forming subsequence of RNU1. Conclusions: An abundant 16-nt RNA sequence is sourced from a spliceosomal RNA, lies in a stem of a predicted RNA hairpin, and includes reverse complements of subsequences of the 3'UTR of a gene coding for a spliceosome protein. Thus RNU1 could function both as a component of spliceosome assembly and as inhibitor of production of the essential, spliceosome protein coded by SFRS1. Beyond this example, a general procedure is needed for systematic discovery of multiple alignments of sequencing, splicing, and RNA folding data. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Aromatase Is a Direct Target of FOXL2: C134W in Granulosa Cell Tumors via a Single Highly Conserved Binding Site in the Ovarian Specific Promoter AU- Fleming, Nicholas I AU- Knower, Kevin C AU- Lazarus, Kyren A AU- Fuller, Peter J AU- Simpson, Evan R AU- Clyne, Colin D PY- 2010 T2- PLoS ONE J2- PLoS One UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3004790/ VL- 5 IS- 12 DO- 10.1371/journal.pone.0014389 C2- 3004790 N2- Background: Granulosa cell tumors (GCT) of the ovary often express aromatase and synthesize estrogen, which in turn may influence their progression. Recently a specific point mutation (C134W) in the FOXL2 protein was identified in 94% of adult-type GCT and it is likely to contribute to their development. A number of genes are known to be regulated by FOXL2, including aromatase/CYP19A1, but it is unclear which are direct targets and whether the C134W mutation alters their regulation. Recently, it has been reported that FOXL2 forms a complex with steroidogenic factor 1 (SF-1) which is a known regulator of aromatase in granulosa cells. Methodology/Principal Findings: In this work, the human GCT-derived cell lines KGN and COV434 were heterozygous and wildtype for the FOXL2:C134W mutation, respectively. KGN had abundant FOXL2 mRNA expression but it was not expressed in COV434. Expression of exogenous FOXL2:C134W in COV434 cells induced higher expression of a luciferase reporter for the ovarian specific aromatase promoter, promoter II (PII) (516bp) than expression of wildtype FOXL2, but did not alter induction of a similar reporter for the steroidogenic acute regulatory protein (StAR) promoter (1300bp). Co-immunoprecipitation confirmed that FOXL2 bound SF-1 and that it also bound its homologue, liver receptor homologue 1 (LRH-1), however, the C134W mutation did not alter these interactions or induce a selective binding of the proteins. A highly conserved putative binding site for FOXL2 was identified in PII. FOXL2 was demonstrated to bind the site by electrophoretic mobility shift assays (EMSA) and site-directed mutagenesis of this element blocked its differential induction by wildtype FOXL2 and FOXL2:C134W. Conclusions/Significance: These findings suggest that aromatase is a direct target of FOXL2:C134W in adult-type GCT via a single distinctive and highly conserved binding site in PII and therefore provide insight into the pathogenic mechanism of this mutation. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Evidences showing wide presence of small genomic aberrations in chronic lymphocytic leukemia AU- Kim, Yeong C AU- Jung, Yong-Chul AU- Chen, Jun AU- Alhasan, Ali H AU- Kaewsaard, Parawee AU- Zhang, Yanming AU- Ma, Shuo AU- Rosen, Steve AU- Wang, San Ming PY- 2010 T2- BMC research notes J2- BMC Res Notes UR- http://www.biomedcentral.com/1756-0500/3/341 VL- 3 IS- 1 SP- 341 DO- 10.1186/1756-0500-3-341 C2- 3016268 N2- Abstract Background: Chronic lymphocytic leukemia (CLL) is the most common adult leukemia in the western population. Although genetic factors are considered to contribute to CLL etiology, at present genomic aberrations identified in CLL are limited compared with those identified in other types of leukemia, which raises the question of the degree of genetic influence on CLL. We performed a high-resolution genome scanning study to address this issue. Findings: Using the restriction paired-end-based Ditag Genome Scanning technique, we analyzed three primary CLL samples at a kilobase resolution, and further validated the results in eight primary CLL samples including the two used for ditag collection. From 51,632 paired-end tags commonly detected in the three CLL samples representing 5% of the HindIII restriction fragments in the genomes, we identified 230 paired-end tags that were present in all three CLL genomes but not in multiple normal human genome reference sequences. Mapping the full-length sequences of the fragments detected by these unmapped tags in seven additional CLL samples confirmed that these are the genomic aberrations caused by small insertions and deletions, and base changes spreading across coding and non-coding regions. Conclusions: Our study identified hundreds of loci with insertion, deletion, base change, and restriction site polymorphism present in both coding and non-coding regions in CLL genomes, indicating the wide presence of small genomic aberrations in chronic lymphocytic leukemia. Our study supports the use of a whole genome sequencing approach for comprehensively decoding the CLL genome for better understanding of the genetic defects in CLL. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- A Rare Myelin Protein Zero (MPZ) Variant Alters Enhancer Activity In Vitro and In Vivo AU- Antonellis, Anthony AU- Dennis, Megan Y AU- Burzynski, Grzegorz AU- Huynh, Jimmy AU- Maduro, Valerie AU- Hodonsky, Chani J AU- Khajavi, Mehrdad AU- Szigeti, Kinga AU- Mukkamala, Sandeep AU- Bessling, Seneca L AU- NISC Comparative Sequencing Program AU- Pavan, William J AU- McCallion, Andrew S AU- Lupski, James R AU- Green, Eric D PY- 2010 T2- PLoS ONE J2- PLoS One UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3002941/ VL- 5 IS- 12 DO- 10.1371/journal.pone.0014346 C2- 3002941 N2- Background: Myelin protein zero (MPZ) is a critical structural component of myelin in the peripheral nervous system. The MPZ gene is regulated, in part, by the transcription factors SOX10 and EGR2. Mutations in MPZ, SOX10, and EGR2 have been implicated in demyelinating peripheral neuropathies, suggesting that components of this transcriptional network are candidates for harboring disease-causing mutations (or otherwise functional variants) that affect MPZ expression. Methodology: We utilized a combination of multi-species sequence comparisons, transcription factor-binding site predictions, targeted human DNA re-sequencing, and in vitro and in vivo enhancer assays to study human non-coding MPZ variants. Principal Findings: Our efforts revealed a variant within the first intron of MPZ that resides within a previously described SOX10 binding site is associated with decreased enhancer activity, and alters binding of nuclear proteins. Additionally, the genomic segment harboring this variant directs tissue-relevant reporter gene expression in zebrafish. Conclusions: This is the first reported MPZ variant within a cis-acting transcriptional regulatory element. While we were unable to implicate this variant in disease onset, our data suggests that similar non-coding sequences should be screened for mutations in patients with neurological disease. Furthermore, our multi-faceted approach for examining the functional significance of non-coding variants can be readily generalized to study other loci important for myelin structure and function. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Recombinant expression, purification and copper-binding characteristics of the amino terminus of a Plasmodium falciparum copper transport protein AU- Choveaux, David AU- Goldring, JP Dean PY- 2010 T2- Malaria journal J2- Malar J UR- http://www.malariajournal.com/content/9/S2/-P62 VL- 9 IS- Suppl 2 SP- P62 DO- 10.1186/1475-2875-9-S2-P62 N2- N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Concurrent genetic alterations in DNA polymerase proofreading and mismatch repair in human colorectal cancer. AU- Yoshida, Rintaro AU- Miyashita, Kaname AU- Inoue, Mayuko AU- Shimamoto, Akiyoshi AU- Yan, Zhao AU- Egashira, Akinori AU- Oki, Eiji AU- Kakeji, Yoshishiro AU- Oda, Shinya AU- Maehara, Yoshihiko PY- 2010 T2- European journal of human genetics : EJHG J2- Eur J Hum Genet UR- http://www.ncbi.nlm.nih.gov/pubmed/21157497 N2- Genomic sequences encoding the 3' exonuclease (proofreading) domains of both replicative DNA polymerases, pol delta and pol epsilon, were explored simultaneously in human colorectal carcinomas including six established cell lines. Three unequivocal sequence alterations, including one previously reported, were found, and all these were considered as dysfunctional mutations in light of the local amino-acid sequences. In particular, the F367S mutation found in the POLE gene encoding the pol epsilon catalytic subunit, which includes the proofreading domain, is the first found in human diseases. Surprisingly, the tumours carrying these proofreading domain mutations were all defective in DNA mismatch repair (MMR). In addition to the two cell lines with acknowledged MMR gene mutations, the third tumour was also demonstrated to harbour a distinct mutation in MLH1, and indeed exhibited a microsatellite-unstable phenotype. These findings suggest that, in concert with MMR deficiency, defective polymerase proofreading may also contribute to the mutator phenotype observed in human colorectal cancer. Our observations may suggest previously unrecognised complexities in the molecular abnormalities underlying the mutator phenotype in human neoplasms.European Journal of Human Genetics advance online publication, 15 December 2010; doi:10.1038/ejhg.2010.216. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Evolution of MicroRNAs and the Diversification of Species. AU- Loh, Yong-Hwee E AU- Yi, Soojin V AU- Streelman, J Todd PY- 2010 T2- Genome Biology and Evolution J2- Genome Biol Evol UR- http://www.ncbi.nlm.nih.gov/pubmed/21169229 N2- MicroRNAs (miRNAs) are ancient, short, non-coding RNA molecules that regulate the transcriptome through post-transcriptional mechanisms. miRNA riboregulation is involved in a diverse range of biological processes and mis-regulation is implicated in disease. It is generally thought that miRNAs function to canalize cellular outputs, for instance as 'fail-safe' repressors of gene mis-expression. Genomic surveys in humans have revealed reduced genetic polymorphism and the signature of negative selection for both miRNAs themselves and the target sequences to which they are predicted to bind. We investigated the evolution of miRNAs and their binding sites across cichlid fishes from Lake Malawi (East Africa), where hundreds of diverse species have evolved in the last million years. Using low-coverage genome sequence data, we identified 100 cichlid miRNA genes with mature regions that are highly conserved in other animal species. We computationally predicted target sites on the 3'-UTRs of cichlid genes to which miRNAs may bind, and found that these sites possessed elevated single nucleotide polymorphism (SNP) densities. Furthermore, polymorphic sites in predicted miRNA targets showed higher minor allele frequencies on average and greater genetic differentiation between Malawi lineages when compared to a neutral expectation and non-target 3' UTR SNPs. Our data suggest that divergent selection on miRNA riboregulation may have contributed to the diversification of cichlid species, and may similarly play a role in rapid phenotypic evolution of other natural systems. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- A unique chromatin signature uncovers early developmental enhancers in humans. AU- Rada-Iglesias, Alvaro AU- Bajpai, Ruchi AU- Swigut, Tomek AU- Brugmann, Samantha A AU- Flynn, Ryan A AU- Wysocka, Joanna PY- 2010 T2- Nature J2- Nature UR- http://www.ncbi.nlm.nih.gov/pubmed/21160473 N2- Cell-fate transitions involve the integration of genomic information encoded by regulatory elements, such as enhancers, with the cellular environment. However, identification of genomic sequences that control human embryonic development represents a formidable challenge. Here we show that in human embryonic stem cells (hESCs), unique chromatin signatures identify two distinct classes of genomic elements, both of which are marked by the presence of chromatin regulators p300 and BRG1, monomethylation of histone H3 at lysine 4 (H3K4me1), and low nucleosomal density. In addition, elements of the first class are distinguished by the acetylation of histone H3 at lysine 27 (H3K27ac), overlap with previously characterized hESC enhancers, and are located proximally to genes expressed in hESCs and the epiblast. In contrast, elements of the second class, which we term 'poised enhancers', are distinguished by the absence of H3K27ac, enrichment of histone H3 lysine 27 trimethylation (H3K27me3), and are linked to genes inactive in hESCs and instead are involved in orchestrating early steps in embryogenesis, such as gastrulation, mesoderm formation and neurulation. Consistent with the poised identity, during differentiation of hESCs to neuroepithelium, a neuroectoderm-specific subset of poised enhancers acquires a chromatin signature associated with active enhancers. When assayed in zebrafish embryos, poised enhancers are able to direct cell-type and stage-specific expression characteristic of their proximal developmental gene, even in the absence of sequence conservation in the fish genome. Our data demonstrate that early developmental enhancers are epigenetically pre-marked in hESCs and indicate an unappreciated role of H3K27me3 at distal regulatory elements. Moreover, the wealth of new regulatory sequences identified here provides an invaluable resource for studies and isolation of transient, rare cell populations representing early stages of human embryogenesis. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Next generation sequencing and analysis of a conserved transcriptome of New Zealand's kiwi AU- Subramanian, Sankar AU- Huynen, Leon AU- Millar, Craig D AU- Lambert, David M PY- 2010 T2- BMC evolutionary biology J2- BMC Evol Biol UR- http://www.biomedcentral.com/1471-2148/10/387 VL- 10 IS- 1 SP- 387 DO- 10.1186/1471-2148-10-387 C2- 3009673 N2- Abstract Background: Kiwi is a highly distinctive, flightless and endangered ratite bird endemic to New Zealand. To understand the patterns of molecular evolution of the nuclear protein-coding genes in brown kiwi (Apteryx australis mantelli) and to determine the timescale of avian history we sequenced a transcriptome obtained from a kiwi embryo using next generation sequencing methods. We then assembled the conserved protein-coding regions using the chicken proteome as a scaffold. Results: Using 1,543 conserved protein coding genes we estimated the neutral evolutionary divergence between the kiwi and chicken to be ~45%, which is approximately equal to the divergence computed for the human-mouse pair using the same set of genes. A large fraction of genes was found to be under high selective constraint, as most of the expressed genes appeared to be involved in developmental gene regulation. Our study suggests a significant relationship between gene expression levels and protein evolution. Using sequences from over 700 nuclear genes we estimated the divergence between the two basal avian groups, Palaeognathae and Neognathae to be 132 million years, which is consistent with previous studies using mitochondrial genes. Conclusions: The results of this investigation revealed patterns of mutation and purifying selection in conserved protein coding regions in birds. Furthermore this study suggests a relatively cost-effective way of obtaining a glimpse into the fundamental molecular evolutionary attributes of a genome, particularly when no closely related genomic sequence is available. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Transcriptional Activation of REST by Sp1 in Huntington's Disease Models AU- Ravache, Myriam AU- Weber, Chantal AU- Mrienne, Karine AU- Trottier, Yvon PY- 2010 T2- PLoS ONE J2- PLoS One UR- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3001865/ VL- 5 IS- 12 DO- 10.1371/journal.pone.0014311 C2- 3001865 N2- In Huntington's disease (HD), mutant huntingtin (mHtt) disrupts the normal transcriptional program of disease neurons by altering the function of several gene expression regulators such as Sp1. REST (Repressor Element-1 Silencing Transcription Factor), a key regulator of neuronal differentiation, is also aberrantly activated in HD by a mechanism that remains unclear. Here, we show that the level of REST mRNA is increased in HD mice and in NG108 cells differentiated into neuronal-like cells and expressing a toxic mHtt fragment. Using luciferase reporter gene assay, we delimited the REST promoter regions essential for mHtt-mediated REST upregulation and found that they contain Sp factor binding sites. We provide evidence that Sp1 and Sp3 bind REST promoter and interplay to fine-tune REST transcription. In undifferentiated NG108 cells, Sp1 and Sp3 have antagonistic effect, Sp1 acting as an activator and Sp3 as a repressor. Upon neuronal differentiation, we show that the amount and ratio of Sp1/Sp3 proteins decline, as does REST expression, and that the transcriptional role of Sp3 shifts toward a weak activator. Therefore, our results provide new molecular information to the transcriptional regulation of REST during neuronal differentiation. Importantly, specific knockdown of Sp1 abolishes REST upregulation in NG108 neuronal-like cells expressing mHtt. Our data together with earlier reports suggest that mHtt triggers a pathogenic cascade involving Sp1 activation, which leads to REST upregulation and repression of neuronal genes. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Personalized medicine - The promised land are we there yet? AU- Li, Chumei PY- 2010 T2- Clinical genetics J2- Clin Genet UR- http://www.ncbi.nlm.nih.gov/pubmed/21204795 N2- The delivery of personalized genomic medicine (see box1 for a comparison of genomic vs. genetic medicine and box2 for glossary) hinges on obtaining personal genomic data through genome-wide association studies (GWAS) or whole genome sequencing. After the completion of the human genome project (see box 3 for human genome projects and its derivative projects) in 2003, there appeared to be a period of euphoric optimism that as soon as the cost of sequencing the whole human genome could be brought down to an affordable range, the promise of personalized medicine would become a reality. However, inasmuch as the miraculous technological advancements are making whole genome data acquisition an inexpensive reality, we are also starting to appreciate that making sense of the enormous amount of genomic data is a far bigger hurdle. Issues, both scientific and ethico-legal, will have to be addressed as genomic data are been pushed for clinical and direct-to-consumer utilization. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER- TY- JOUR TI- Mining human genome for novel purinergic P2Y receptors: a sequence analysis and molecular modeling approach. AU- Bhatnagar, Sonika AU- Mishra, Shubhi AU- Pathak, Ravi PY- 2010 T2- Journal of receptor and signal transduction research J2- J Recept Signal Transduct Res UR- http://www.ncbi.nlm.nih.gov/pubmed/21142848 N2- The purinergic P2Y receptors are G-protein coupled receptors (GPCRs) that control many physiological processes by mediating cellular responses to purines, pyrimidines and their analogues. They can be used as potential therapeutic targets in a variety of disease conditions. Therefore, it is critical to identify new members of this family of receptors from the human genome and characterize them for their role in health and disease. In the present work, molecular modeling was carried out for the 21 known P2Y receptors. Binding site analysis was done on the basis of docking and site-directed mutagenesis data. Thus, conserved features of P2Y receptors could be formulated. These features can be used to determine the purinergic nature of potential P2Y receptors in the human genome. We applied this knowledge to human genome GPCR sequences found by sensitive sequence search techniques and identified two orphan receptors, namely GPR34 and GP171 that have all the necessary conserved features of P2Y receptors. N1- Exported from www.Quertle.info. Search query: Genome-sequencing . ER-

个人分类: 热点前沿|8288 次阅读|1 个评论

2011年科研热点信息分析：基因组测序

xupeiyang 2011-1-4 19:23

《自然》杂志预测2011年科研热点 http://news.sciencenet.cn//htmlnews/2011/1/242378.shtm 基因组测序大爆发在2011年里，人类基因组测序所需花费无疑将有所下降。下一代基因测序仪器已开始投放市场，这将使得进行全测序的人类基因组数量节节攀升。 New year, new science http://www.nature.com/news/2010/101231/full/469012a.html Genome-sequencing explosion This year should surely see the price of human-genome sequencing dropping to US$1,000 per genome. As next-generation sequencing machines reach the market, the number of fully sequenced human genomes will skyrocket. http://www.gopubmed.org/web/gopubmed/1?WEB01lys5moqoa0ueIvI1I00h001000j100200010 5,874 documents semantically analyzed 1 2 Top Years Publications 2010 801 2009 563 2008 434 2007 412 2005 381 2006 373 2002 328 2004 323 2003 315 2001 315 2000 259 1998 206 1999 176 1997 173 1996 139 1995 120 1994 92 1991 87 1992 84 1993 74 1 2 1 2 3 4 5 Top Countries Publications USA 2,460 United Kingdom 456 Japan 373 Germany 361 China 282 France 234 Canada 172 Italy 106 Australia 103 Sweden 101 Netherlands 76 Spain 71 India 62 South Korea 56 Brazil 56 Denmark 54 Switzerland 46 Singapore 40 Taiwan 37 Israel 33 1 2 3 4 5 1 2 3 ... 36 Top Cities Publications Bethesda 220 Houston 143 Tokyo 112 Cambridge 111 London 100 St. Louis 100 Seattle 94 Baltimore 91 Boston 87 Paris 78 Beijing, China 77 Rockville 77 New York City 75 Los Angeles 73 San Francisco 65 San Diego 61 Shanghai, China 59 Cambridge, USA 57 Berlin, Germany 57 Heidelberg 55 1 2 3 ... 36 1 2 3 ... 67 Top Journals Publications Genome Res 241 Nucleic Acids Res 144 Genomics 136 J Virol 127 Nature 127 P Natl Acad Sci Usa 113 Science 102 Bmc Genomics 92 Gene 81 Plos One 80 Virology 78 Hum Mol Genet 74 J Gen Virol 62 Genome Biol 60 Am J Hum Genet 59 Bioinformatics 57 J Biol Chem 57 Nat Genet 54 Hum Mutat 53 J Clin Microbiol 51 1 2 3 ... 67 1 2 3 ... 893 Top Terms Publications Genome 5,081 Genomics 5,060 Humans 4,640 Genes 3,668 Genome, Human 2,634 DNA 2,573 Base Sequence 2,456 Animals 2,011 Proteins 1,636 Sequence Analysis, DNA 1,500 Polymerase Chain Reaction 1,292 Mutation 1,265 Chromosomes 1,209 Nucleotides 1,032 chromosome 886 Patients 863 Chromosome Mapping 843 Technology 836 Amino Acid Sequence 811 Viruses 785 1 2 3 ... 893 1 2 3 ... 1527 Top Authors Publications Gibbs R 43 Green E 38 Weinstock G 25 Hillier L 24 Eichler E 22 Venter J 22 Waterston R 21 Scherer S 21 Marra M 20 McPherson J 18 Lander E 18 Wilson R 16 Mardis E 16 Adams M 16 Lehrach H 16 NISC Comparative Sequencing Program 15 Metzker M 14 Ohara O 14 Rosenthal A 14 Hayashizaki Y 13 1 2 3 ... 1527

个人分类: 热点前沿|3117 次阅读|0 个评论

小虫虫基因组测序揭示大问题

DNAgene 2010-11-20 16:59

前天，Science在线发表了海洋浮游尾索动物Oikopleura dioica的基因组测序分析论文（重点是分析）。网络科技新闻杂志The Scientist立即就此文章的结果采访了相关领域的几位专家，并做了跟踪报道。我是看到此报道之后，怀着很好奇的心情看完了Science这篇论文。确实值得研究，值得发在Science上。我们国家总喜欢研究有重大理论和应用价值的物种，这个仅生活4天就死掉的小虫虫有什么意义呢？它在海洋水表挺多，系统发育上是脊椎动物的近亲。这可能是此基因组测序的理由。显然，生物学研究不能太功利，表面上看似有重大研究价值，那个重大价值不一定那么容易被你揭示。比如说大熊猫为什么濒危，测完序离回答问题差得很远，只能说是万里长征走了第一步。法国人带领的这个研究组，测序之前无论如何也不会想到他们的重要发现。甚至我估计，其中一些关键作者，如Scott W. Roy，很可能是在分析数据时临时征召的。这一点也提醒国内的基因组测序带头人，一开始团队不一定很整齐，有什么问题找什么人来做就行了。因为问题是无法预期的，所以预先组织好的团队就不可能是最适于研究你的问题的团队。这个小虫虫测序到底说明了什么，看他们摘要中的一句话multiple genomic features including transposon diversity, developmental gene repertoire, physical gene order, and intron-exon organization are shattered in the tunicate Oikopleura , belonging to the sister group of vertebrates and retaining chordate morphology。什么意思呢？尽管这个小虫虫和脊椎动物一样保留了脊索动物的形态，但是众多基因组特征在在此物种中shattered。shattered的意思有粉碎；砸碎，破坏。说的直观一些就是，这个基因组测序的结果offers unique perspectives on the level of genome plasticity。要看通俗介绍的网友可以参考The Scientist的报道，简单明了：Who needs structure, anyway? http://www.the-scientist.com/news/display/57814/ ，研究相关问题的专业网友请参考Science原文 http://www.sciencemag.org/content/early/2010/11/17/science.1194167 ，或 http://www.doi.org/10.1126/science.1194167 （此链接暂时不能用）。

个人分类: 生活点滴|5476 次阅读|1 个评论

[转载]JGI 撑不住了，把产生的基因组数据转移到NERCS计算中心

lry198010 2010-4-23 14:05

JGI-联合基因组研究中心，美国能源部的一个基因组研究中心，也许是世界上最大的非动物基因组测序研究中心了，预计今年其基因组数据量将到达4-5Tb，这样庞大的数据，他们感到已经难以承受数据的存储、分析所需计算设备的压力了，这促使JGI寻求更专业的计算设备维护和管理中心。从这点来看，以后，测序中心将不会关注数据存储、分析所需要的计算能力，这样的计算能力可以使用别家的计算中心，比如云计算中心。那么测序中心很重要的一点就是如何快速的把测序获得的数据传输到云计算中心上以进行处理。可以预计，随着测序费用的下降，和基因组测序的广泛应用，测序服务中心和云计算中心的合作将会成为一种趋势！ JGI Consolidates High-Performance Computing Operations into NERCS April 20, 2010 By Alex Philippidis NEW YORK (GenomeWeb News) – The Joint Genome Institute says the torrent of sequencing data it has generated, and plans to generate this year, explains its decision to consolidate its high-performance scientific computing operations into the US Department of Energy's National Energy Research Scientific Computer Center (NERSC). JGI has agreed to transfer to NERSC six Lawrence Berkeley National Laboratory employees specializing in scientific computing, including computer and network security and instrumentation computer systems. JGI's desktop support services will remain under the control of the institute, which is located in Walnut Creek, Calif. The consolidation, announced April 12, follows JGI's expectation this year that it will multiply the quantity of data it expects to generate through its sequencing of plant, microbe, fungal, and metagenomes. That quantity surpassed 1 terabase, or 1 trillion bases, in 2009, an eight-fold increase over 2008 — with "maybe 4 to 5 trillion this year" expected to be sequenced, JGI spokesman David Gilbert told GenomeWeb Daily News. "In that alone, you can tell why we need that computational horsepower that we could handle on our own, but now it's getting to the point where it's just crazy. Why build something in house when we've got a partnership where all the folks who are, in effect, being transferred over to NERSC? They've been Lawrence Berkeley people anyhow, so it's not a major change from their perspective," Gilbert said. The institute's current data center lacks the capacity to store the exponentially higher amount of data projected, and JGI staff did not have the same breadth of experience with running very large-scale systems that staffers at the computer center have, Jeff Broughton, systems department head at NERSC, told GWDN. Under the consolidation, NERSC will be responsible for existing JGI scientific computing equipment and new equipment to be procured, which will be housed about 16 miles southwest of Walnut Creek, at the computer center's Oakland facility. Broughton said the new equipment will include 500 dual-socket, quad-core Nehalem processor nodes from SGI — of which 160 nodes are in place, with the remaining 340 nodes "expected to arrive within the next six weeks, by the end of May" — as well as a 120 nodes from the IBM iDataPlex system already in use at NERSC's "Magellan" cloud computing cluster, part of a joint research effort between NERSC and the Argonne Leadership Computing Facility, funded with $32 million from the $862 billion American Recovery and Reinvestment Act. "In general, genomics is a pretty good fit for cloud computing, and they were able to take advantage of that," Broughton said. "The new sequencers are producing ever-increasing flows of data, and it's important to make sure that the computational infrastructure scales appropriately to match it," he added. He said NERSC runs "in excess of" 50,000 cores for high-performance computing now, a figure expected to quadruple by the end of the year. JGI would account for about 10 percent of NERSC's total computing power, based on core count. By teaming with NERSC, JGI can enjoy access to a dedicated 10 Gbps-per-second link between both institutions on the Science Data Network of the Energy Sciences Network, as well as other benefits, such as redundant cooling systems, an uninterruptible source of power, environmental and energy-use monitoring, and a central help desk.

个人分类: next-generation-sequence|7719 次阅读|0 个评论

又一新禾本科模式植物二穗短柄草已完成基因组测序

mengxb 2010-3-4 14:28

Nature 463 , 763-768 (11 February 2010) | :10.1038/nature08747 :10.1038/nature08747 ; Received 29 August 2009; Accepted 9 December 2009 Genome sequencing and analysis of the model grass Brachypodium distachyon Science ：科学家绘制出首幅小麦基因组物理图谱 Science ：玉米B73系基因组序列 Nature Reviews Genetics ：解密41000个水稻基因的功能 Nature ：高粱基因组完成测序

个人分类: 生活点滴|5791 次阅读|0 个评论

帐号		自动登录	找回密码
密码			注册

关闭安全验证

标签: 基因组测序

相关帖子

相关日志

关闭 安全验证

标签: 基因组测序

相关帖子

相关日志

关闭安全验证