Академический Документы
Профессиональный Документы
Культура Документы
by
Ramraj S M.E.,
Assistant Professor
Department of Software Engineerin
Guide
Dr. S. S. Sridhar,Professor
Department of Computer Science and Engineering
Presentation
12-Nov-2016
by Ramraj S
Agenda
Genetics Vs Genomics
Terms in Genomics
Denovo Assembly
Challenges Involved
Distributed Computing
Hadoop
by Ramraj S
Genetics Vs Genomics
by Ramraj S
Terms in Genomics
by Ramraj S
Assembly Process
by Ramraj S
Denovo Assembly
Assemblers such as Velvet, Euler-USR, and SOAPdenovo successfully assembled small genomes from short reads.
Assemblers for assembling the larger mammalian-sized genomesrequire high memory and compute resources
by Ramraj S
For a 100GB NGS file of read length 36 and k-mer size 25,the total
size of intermediate data is 1.2 tera-bytes, i.e., each read is
replicated 12 times. Thus, new strategies to store and process large
quantities of data efficiently are required.
by Ramraj S
Distributed Computing
by Ramraj S
Hadoop
by Ramraj S
by Ramraj S
by Ramraj S
References I
[1]
http://hadoop.apache.org/
[2]
[3]
[4]
Owen, Sean and Anil, Robin and Dunning, Ted and Friedman, Ellen
Mahout in Action
[4]
[4]
by Ramraj S
Thank you
by Ramraj S