GraphX is a new component in Spark for graphs and graph-parallel computation. At a high level, GraphX extends the Spark RDD by introducing a new Graph abstraction: a directed multigraph with properties attached to each vertex and edge.
Algorithms
PageRank
Connected components
Label propagation
SVD++
Strongly connected components
Triangle count
Giraph, and GraphLab
Apache Giraph is an iterative graph processing framework, built on top of Apache Hadoop.
It is currently used at Facebook to analyze the social graph formed by users and their connections. Giraph originated as the open-source counterpart to Pregel, the graph processing architecture developed at Google and described in a 2010 paper.
GraphLab is high performance, distributed computation framework written in C++.
Started by Prof. Carlos Guestrin of Carnegie Mellon University in 2009. A distributed graph processing system that is written in C++ and uses MPI for communication. Similar to Giraph, it keeps the graph in memory. However, it does not depend on Hadoop
Performance Comparisons
A Small Pipeline in GraphX
GraphX, Giraph, and GraphLab end-to-end performance
Plato vs GraphX
Intel Article on October 19, 2020
https://newsroom.intel.com/articles/intel-katana-graph-team-large-scale-graph-analytics
Katana vs GraphX
文章来源:https://www.toymoban.com/news/detail-425728.html
oneDAL Graph Analytics
文章来源地址https://www.toymoban.com/news/detail-425728.html
到了这里,关于Spark GraphX的文章就介绍完了。如果您还想了解更多内容,请在右上角搜索TOY模板网以前的文章或继续浏览下面的相关文章,希望大家以后多多支持TOY模板网!