Академический Документы
Профессиональный Документы
Культура Документы
ADAM defines a:
2. A Scala API
schema for O(1) metadata access union { boolean, null } primaryAlignment = false;
union { boolean, null } secondaryAlignment = false;
union { boolean, null } supplementaryAlignment = false;
union { null, string } mismatchingPositions = null;
union { null, string } origQual = null;
Genotype schema is strictly union { null, string } attributes = null;
union { null, string } recordGroupSequencingCenter = null;
http://www.parquet.io
3 layers of parallelism:
File/row group
Column chunk
Page
ACACTGCGACTCATCGACTC
Problems:
2
1. Overlapping is O(n ) and single evaluation is expensive anyways