The "rank product" is a statistical technique, used for detecting differentially regulated genes in replicated microarray experiments. The technique has achieved widespread acceptance and is now used more broadly, in such diverse fields as RNAi analysis, proteomics, and machine learning. The "rank product" technique may be used in ranking users (in social networks) and items (such as Amazon.com).Given large set of genes, users, or items, in this webcast I will present two distinct Spark solutions: (using groupByKey() and combineByKey()) for solving the "rank product".
Table of contents
- Title: Apache Spark Solution for Rank Product
- Release date: August 2015
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781491951064
You might also like
Designing Data-Intensive Applications
Data is at the center of many challenges in system design today. Difficult issues need to …
Go is rapidly becoming the preferred language for building web services. There are plenty of tutorials …
Learning SQL, 3rd Edition
As data floods into your company, you need to put it to work right away—and SQL …
Building Microservices, 2nd Edition
Distributed systems have become more fine-grained as organizations shift from code-heavy monolithic applications to smaller, self-contained …