NOSQL APPROACHES TO BIOMEDICAL DATA SCIENCEUSING SPLUNK FOR DATA ANALYTICSSTATISTICAL ANALYSIS OF GENOMIC DATA WITH HADOOPEXTRACTING AND TRANSFORMING GENOMIC DATAPROCESSING EQTL DATAGENERATING MASTER SNP FILES FOR CASES AND CONTROLSGENERATING GENE EXPRESSION FILES FOR CASES AND CONTROLSCLEANING RAW DATA USING MAPREDUCETRANSPOSE DATA USING PYTHONSTATISTICAL ANALYSIS USING SPARKHIVE TABLES WITH PARTITIONSCONCLUSIONNOTESAppendix: A Brief Statistics PrimerContent Contributed by Daniel Peñnaherrera,July 13, 2016FOUNDATIONSPOPULATION AND SAMPLERANDOM VARIABLESEXPECTED VALUE AND VARIANCEREGRESSION ANALYSISMULTIVARIATE LINEAR REGRESSIONLOGISTIC REGRESSION