Andor Kiss

Dec 08, 2019

Assembly Complete; onto polishing and BUSCO QC'ing

Using two SMRT cells' worth of PacBio Sequel II long-read data (N50 > 32,000 bp) and the wtdbg2 (redbean) assembler (https://github.com/ruanjue/wtdbg2), we have assembled the wood frog genome - all 6+ Gbp in only 33 hours! We did it on a standalone computer, not a supercluster. The machine was initially built by VelocityMicro and further upgraded and modified by us. Briefly, it's a Phanteks Enthoo Primo case housing dual 1300W PSUs driving a SuperMicro H11DSi-NT motherboard with dual-socket AMD EPYC 7601 CPUs (2x32 cores, 128 threads; 124 threads used for the assembly), 2 TB of ECC DDR4 2666 MHz RAM, 25 TB of HDD storage in RAID10, and dual EKWB liquid-cooled RTX TITANs. We're running Ubuntu 18.04 LTS as the O/S. We did have to make some modifications for cooling because of the heat generated by the RAM modules, so we added four G.Skill Turbulence III RAM coolers and a few extra fans to increase push/pull air flow throughout the case. We chose the new AMD EPYC processors because they are true dual-threaded processors and, pound for pound, outclass the Intel CPUs by every metric.
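
For readers who have not met the N50 statistic quoted above, here is a minimal Python sketch of how it is usually computed: sort the read lengths from longest to shortest and report the length at which the running total first reaches half of all sequenced bases. The function name `n50` and the toy read lengths are illustrative assumptions, not our actual data.

```python
def n50(lengths):
    """Return the N50 of a collection of read (or contig) lengths.

    N50 is the length L such that reads of length >= L together
    contain at least half of all sequenced bases.
    """
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy example (made-up lengths in bp, NOT the wood frog reads).
toy_reads = [40_000, 35_000, 32_000, 10_000, 5_000, 3_000]
print(n50(toy_reads))  # → 35000
```

In this toy set the two longest reads already hold more than half of the 125,000 total bases, so the N50 is the length of the second read.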

Next we will "polish" (correct potential mistakes in) the assembly with a single lane's worth of Illumina HiSeq3000 2x150 bp paired-end, high-quality short-read data. This is necessary to compensate for the lower quality, but MUCH longer, reads generated with the PacBio Sequel II instrument. Why use both? The Sequel II gave us VERY long single-molecule reads: more than half of the sequenced bases are in reads longer than 32,000 base pairs. This helps us put the genome together in the correct orientation, especially for genomes where we suspect large amounts of repeats. We will be using a newly released polisher called ntEdit (https://academic.oup.com/bioinformatics/article/35/21/4430/5490204), an ultrafast polisher capable of polishing the white spruce genome (20 Gbp) in 25 minutes. We will experiment, and will likely perform the polishing step iteratively, evaluating the results with BUSCO (https://busco.ezlab.org/) after each round.
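
As a rough illustration of how iterative polishing could be tracked, the Python sketch below parses the one-line score that BUSCO writes in its short summary (of the form `C:..%[S:..%,D:..%],F:..%,M:..%,n:..`) and picks the polishing round with the highest "complete" percentage. The helper names `parse_busco` and `best_round` are hypothetical, the summary strings are placeholder values rather than results from our assembly, and the exact summary layout is an assumption that may differ between BUSCO versions.

```python
import re

# Assumed layout of BUSCO's one-line summary, e.g.
# "C:90.0%[S:88.0%,D:2.0%],F:5.0%,M:5.0%,n:100"
BUSCO_LINE = re.compile(
    r"C:(?P<C>[\d.]+)%\[S:(?P<S>[\d.]+)%,D:(?P<D>[\d.]+)%\],"
    r"F:(?P<F>[\d.]+)%,M:(?P<M>[\d.]+)%,n:(?P<n>\d+)"
)


def parse_busco(line):
    """Parse a BUSCO one-line summary into a dict of scores."""
    match = BUSCO_LINE.search(line)
    if match is None:
        raise ValueError(f"not a BUSCO summary line: {line!r}")
    scores = {k: float(v) for k, v in match.groupdict().items() if k != "n"}
    scores["n"] = int(match.group("n"))
    return scores


def best_round(summaries):
    """Return the index of the round with the highest complete (C) score.

    Ties go to the earliest round, i.e. the fewest polishing passes.
    """
    completeness = [parse_busco(s)["C"] for s in summaries]
    return max(range(len(completeness)), key=completeness.__getitem__)


# Placeholder summaries for an unpolished draft and two polishing rounds.
rounds = [
    "C:80.0%[S:78.0%,D:2.0%],F:10.0%,M:10.0%,n:100",
    "C:90.0%[S:88.0%,D:2.0%],F:5.0%,M:5.0%,n:100",
    "C:90.5%[S:88.5%,D:2.0%],F:5.0%,M:4.5%,n:100",
]
print(best_round(rounds))  # → 2
```

Completeness alone is a simple stopping criterion; in practice one might also watch the duplicated (D) and fragmented (F) scores, or stop once the gain between rounds becomes negligible.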

1 comment

  • Cindy Wu (Backer)
    Amazing. Thanks for sharing the exciting news!
    Dec 09, 2019

About This Project

The North American wood frog Rana sylvatica can survive being frozen solid - no heartbeat, no brain activity. This animal survives the winter in a frozen state. In the Spring, the wood frog thaws out and spontaneously reanimates itself. The wood frog holds the key to organ cryopreservation, understanding how we can rapidly chill people in trauma cases while transporting them to hospital, as well as the key to long-term suspended animation.

A biology project funded by 41 people