Overview
Syllabus
Intro
Max Ogden @maxogden
dat is an open source tool for sharing and collaborating on data
we are grant funded and 100% open source
reproducible science
analogy time: let's talk about source control
life before git
i want to fix a bug in cool-project
1. somehow get a zip of cool-project
2. unpack and edit a file
3. email the file back
claim: currently data sharing is a mess
email csv files
we want to do for data what git did for source code
a data set we can all relate to: npm
calculate how big npm is using dat
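a rough sketch of what this kind of size calculation could look like, assuming the package metadata has already been exported as newline-delimited JSON. this does not use dat's own API: the `size` field (bytes per package) and the `total_size` helper are hypothetical names chosen for illustration.

```python
import io
import json


def total_size(lines, field="size"):
    """Sum a numeric field over newline-delimited JSON records.

    `field` is a hypothetical key; real registry metadata may name it differently.
    """
    total = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        record = json.loads(line)
        total += record.get(field, 0)  # records without the field count as 0
    return total


# Made-up sample records standing in for an exported data set.
sample = io.StringIO(
    '{"name": "pkg-a", "size": 1200}\n'
    '{"name": "pkg-b", "size": 800}\n'
)
print(total_size(sample))
```

reading one record at a time keeps memory use flat, which matters when the data set is too large to load at once.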
transform the npm data using bulk-markdown-to-png
bionode bioinformatics tools on npm
data pipelines, dependency management, data streaming
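to make the streaming pipeline idea concrete, here is a minimal sketch using Python generators. it is not how gasket or dat implement pipelines: the stage names (`read_records`, `transform`) and the `run_pipeline` helper are hypothetical and only illustrate chaining stages so that each record flows through one at a time.

```python
def read_records(lines):
    """Stage 1: yield non-empty, stripped records from an iterable of lines."""
    for line in lines:
        line = line.strip()
        if line:
            yield line


def transform(records):
    """Stage 2: a placeholder transform that upper-cases each record."""
    for record in records:
        yield record.upper()


def run_pipeline(source, *stages):
    """Chain stages lazily; nothing runs until the result is consumed."""
    stream = source
    for stage in stages:
        stream = stage(stream)
    return stream


result = list(run_pipeline(["a\n", "\n", "b\n"], read_records, transform))
print(result)
```

because each stage is a generator, no stage holds the full data set in memory, and new stages can be added without touching the existing ones.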
gasket is a cross-platform pipeline manager
datscript is an experimental pipeline config language
branches, dat checkout 3b2d98V3, multi-master replication, sync to databases, registry
Taught by
JSConf