Skip to Content
数据分析之图算法: 基于Spark和Neo4j
book

数据分析之图算法: 基于Spark和Neo4j

by Mark Needham, Amy E. Hodler
September 2020
Intermediate to advanced
213 pages
5h 25m
Chinese
Posts & Telecom Press
Content preview from 数据分析之图算法: 基于Spark和Neo4j
社团发现算法
97
6.1.2
 将数据导入
Neo4j
接下来对
Neo4j
执行相同的操作。下面的查询导入各节点:
WITH
"https://github.com/neo4j-graph-analytics/book/raw/master/data/"
AS
base
WITH
base + "sw-nodes.csv"
AS
uri
LOAD CSV
WITH
HEADERS FROM uri
AS
row
MERGE (:Library {id: row.id})
下面的代码导入关系。
WITH
"https://github.com/neo4j-graph-analytics/book/raw/master/data/"
AS
base
WITH
base + "sw-relationships.csv"
AS
uri
LOAD CSV
WITH
HEADERS FROM uri
AS
row
MATCH
(source:Library {id: row.src})
MATCH
(destination:Library {id: row.dst})
MERGE (source)-[:DEPENDS_ON]->(destination)
现在图已加载完毕,下面开始算法层面的工作。
6.2
 三角形计数和聚类系数
由于三角形计数和聚类系数经常同时使用,因此一并介绍。三角形计数算法确定了图中经
过每个节点的三角形数量。三角形是由三个节点组成的集合,且每个节点与其他各节点都
有关系。三角形计数算法也可以全局运行,用于评估整个数据集。 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

大数据项目管理:从规划到实现

大数据项目管理:从规划到实现

Ted Malaska, Jonathan Seidman
Presto实战

Presto实战

Matt Fuller, Manfred Moser, Martin Traverso
精實企業|高績效組織如何達成創新規模化

精實企業|高績效組織如何達成創新規模化

Jez Humble, Joanne Molesky, Barry O'Reilly

Publisher Resources

ISBN: 9787115546678